pairwise nucleotide differences: Topics by Science.gov

Sample records for pairwise nucleotide differences

Prospects for inferring pairwise relationships with single nucleotide polymorphisms

Treesearch

Jeffery C. Glaubitz; O. Eugene, Jr. Rhodes; J. Andrew DeWoody

2003-01-01

An extraordinarily large number of single nucleotide polymorphisms (SNPs) are now available in humans as well as in other model organisms. Technological advancements may soon make it feasible to assay hundreds of SNPs in virtually any organism of interest. One potential application of SNPs is the determination of pairwise genetic relationships in populations without...
Differences in the second internal transcribed spacer of four species of Nematodirus (Nematoda: Molineidae).

PubMed

Newton, L A; Chilton, N B; Beveridge, I; Gasser, R B

1998-02-01

Genetic differences among Nematodirus spathiger, Nematodirus filicollis, Nematodirus helvetianus and Nematodirus battus in the nucleotide sequence of the second internal transcribed spacer (ITS-2) of ribosomal DNA ranged from 3.9 to 24.7%. Pairwise comparisons of their ITS-2 sequences indicated that the most genetically similar species were N. spathiger and N. helvetianus. N. battus was the most genetically distinct species, with differences ranging from 22.8 to 24.7% with respect to the other three species. Some of the nucleotide differences among species provided different endonuclease restriction sites that could be used in restriction fragment length polymorphism studies. The ITS-2 sequence data may prove useful in studies of the systematics of molineid nematodes.
Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign

PubMed Central

2007-01-01

Background Joint alignment and secondary structure prediction of two RNA sequences can significantly improve the accuracy of the structural predictions. Methods addressing this problem, however, are forced to employ constraints that reduce computation by restricting the alignments and/or structures (i.e. folds) that are permissible. In this paper, a new methodology is presented for the purpose of establishing alignment constraints based on nucleotide alignment and insertion posterior probabilities. Using a hidden Markov model, posterior probabilities of alignment and insertion are computed for all possible pairings of nucleotide positions from the two sequences. These alignment and insertion posterior probabilities are additively combined to obtain probabilities of co-incidence for nucleotide position pairs. A suitable alignment constraint is obtained by thresholding the co-incidence probabilities. The constraint is integrated with Dynalign, a free energy minimization algorithm for joint alignment and secondary structure prediction. The resulting method is benchmarked against the previous version of Dynalign and against other programs for pairwise RNA structure prediction. Results The proposed technique eliminates manual parameter selection in Dynalign and provides significant computational time savings in comparison to prior constraints in Dynalign while simultaneously providing a small improvement in the structural prediction accuracy. Savings are also realized in memory. In experiments over a 5S RNA dataset with average sequence length of approximately 120 nucleotides, the method reduces computation by a factor of 2. The method performs favorably in comparison to other programs for pairwise RNA structure prediction: yielding better accuracy, on average, and requiring significantly lesser computational resources. Conclusion Probabilistic analysis can be utilized in order to automate the determination of alignment constraints for pairwise RNA structure prediction methods in a principled fashion. These constraints can reduce the computational and memory requirements of these methods while maintaining or improving their accuracy of structural prediction. This extends the practical reach of these methods to longer length sequences. The revised Dynalign code is freely available for download. PMID:17445273
Application of a time-dependent coalescence process for inferring the history of population size changes from DNA sequence data.

PubMed

Polanski, A; Kimmel, M; Chakraborty, R

1998-05-12

Distribution of pairwise differences of nucleotides from data on a sample of DNA sequences from a given segment of the genome has been used in the past to draw inferences about the past history of population size changes. However, all earlier methods assume a given model of population size changes (such as sudden expansion), parameters of which (e.g., time and amplitude of expansion) are fitted to the observed distributions of nucleotide differences among pairwise comparisons of all DNA sequences in the sample. Our theory indicates that for any time-dependent population size, N(tau) (in which time tau is counted backward from present), a time-dependent coalescence process yields the distribution, p(tau), of the time of coalescence between two DNA sequences randomly drawn from the population. Prediction of p(tau) and N(tau) requires the use of a reverse Laplace transform known to be unstable. Nevertheless, simulated data obtained from three models of monotone population change (stepwise, exponential, and logistic) indicate that the pattern of a past population size change leaves its signature on the pattern of DNA polymorphism. Application of the theory to the published mtDNA sequences indicates that the current mtDNA sequence variation is not inconsistent with a logistic growth of the human population.
MIDAS: software for analysis and visualisation of interallelic disequilibrium between multiallelic markers

PubMed Central

Gaunt, Tom R; Rodriguez, Santiago; Zapata, Carlos; Day, Ian NM

2006-01-01

Background Various software tools are available for the display of pairwise linkage disequilibrium across multiple single nucleotide polymorphisms. The HapMap project also presents these graphics within their website. However, these approaches are limited in their use of data from multiallelic markers and provide limited information in a graphical form. Results We have developed a software package (MIDAS – Multiallelic Interallelic Disequilibrium Analysis Software) for the estimation and graphical display of interallelic linkage disequilibrium. Linkage disequilibrium is analysed for each allelic combination (of one allele from each of two loci), between all pairwise combinations of any type of multiallelic loci in a contig (or any set) of many loci (including single nucleotide polymorphisms, microsatellites, minisatellites and haplotypes). Data are presented graphically in a novel and informative way, and can also be exported in tabular form for other analyses. This approach facilitates visualisation of patterns of linkage disequilibrium across genomic regions, analysis of the relationships between different alleles of multiallelic markers and inferences about patterns of evolution and selection. Conclusion MIDAS is a linkage disequilibrium analysis program with a comprehensive graphical user interface providing novel views of patterns of linkage disequilibrium between all types of multiallelic and biallelic markers. Availability Available from and PMID:16643648
Mitochondrial control-region sequence variation in aboriginal Australians.

PubMed Central

van Holst Pellekaan, S; Frommer, M; Sved, J; Boettcher, B

1998-01-01

The mitochondrial D-loop hypervariable segment 1 (mt HVS1) between nucleotides 15997 and 16377 has been examined in aboriginal Australian people from the Darling River region of New South Wales (riverine) and from Yuendumu in central Australia (desert). Forty-seven unique HVS1 types were identified, varying at 49 nucleotide positions. Pairwise analysis by calculation of BEPPI (between population proportion index) reveals statistically significant structure in the populations, although some identical HVS1 types are seen in the two contrasting regions. mt HVS1 types may reflect more-ancient distributions than do linguistic diversity and other culturally distinguishing attributes. Comparison with sequences from five published global studies reveals that these Australians demonstrate greatest divergence from some Africans, least from Papua New Guinea highlanders, and only slightly more from some Pacific groups (Indonesian, Asian, Samoan, and coastal Papua New Guinea), although the HVS1 types vary at different nucleotide sites. Construction of a median network, displaying three main groups, suggests that several hypervariable nucleotide sites within the HVS1 are likely to have undergone mutation independently, making phylogenetic comparison with global samples by conventional methods difficult. Specific nucleotide-site variants are major separators in median networks constructed from Australian HVS1 types alone and for one global selection. The distribution of these, requiring extended study, suggests that they may be signatures of different groups of prehistoric colonizers into Australia, for which the time of colonization remains elusive. PMID:9463317
Humans and Great Apes Cohabiting the Forest Ecosystem in Central African Republic Harbour the Same Hookworms

PubMed Central

Hasegawa, Hideo; Modrý, David; Kitagawa, Masahiro; Shutt, Kathryn A.; Todd, Angelique; Kalousová, Barbora; Profousová, Ilona; Petrželková, Klára J.

2014-01-01

Background Hookworms are important pathogens of humans. To date, Necator americanus is the sole, known species of the genus Necator infecting humans. In contrast, several Necator species have been described in African great apes and other primates. It has not yet been determined whether primate-originating Necator species are also parasitic in humans. Methodology/Principal Findings The infective larvae of Necator spp. were developed using modified Harada-Mori filter-paper cultures from faeces of humans and great apes inhabiting Dzanga-Sangha Protected Areas, Central African Republic. The first and second internal transcribed spacers (ITS-1 and ITS-2) of nuclear ribosomal DNA and partial cytochrome c oxidase subunit 1 (cox1) gene of mtDNA obtained from the hookworm larvae were sequenced and compared. Three sequence types (I–III) were recognized in the ITS region, and 34 cox1 haplotypes represented three phylogenetic groups (A–C). The combinations determined were I-A, II-B, II-C, III-B and III-C. Combination I-A, corresponding to N. americanus, was demonstrated in humans and western lowland gorillas; II-B and II-C were observed in humans, western lowland gorillas and chimpanzees; III-B and III-C were found only in humans. Pairwise nucleotide difference in the cox1 haplotypes between the groups was more than 8%, while the difference within each group was less than 2.1%. Conclusions/Significance The distinctness of ITS sequence variants and high number of pairwise nucleotide differences among cox1 variants indicate the possible presence of several species of Necator in both humans and great apes. We conclude that Necator hookworms are shared by humans and great apes co-habiting the same tropical forest ecosystems. PMID:24651493
Expansion of inverted repeat does not decrease substitution rates in Pelargonium plastid genomes.

PubMed

Weng, Mao-Lun; Ruhlman, Tracey A; Jansen, Robert K

2017-04-01

For species with minor inverted repeat (IR) boundary changes in the plastid genome (plastome), nucleotide substitution rates were previously shown to be lower in the IR than the single copy regions (SC). However, the impact of large-scale IR expansion/contraction on plastid nucleotide substitution rates among closely related species remains unclear. We included plastomes from 22 Pelargonium species, including eight newly sequenced genomes, and used both pairwise and model-based comparisons to investigate the impact of the IR on sequence evolution in plastids. Ten types of plastome organization with different inversions or IR boundary changes were identified in Pelargonium. Inclusion in the IR was not sufficient to explain the variation of nucleotide substitution rates. Instead, the rate heterogeneity in Pelargonium plastomes was a mixture of locus-specific, lineage-specific and IR-dependent effects. Our study of Pelargonium plastomes that vary in IR length and gene content demonstrates that the evolutionary consequences of retaining these repeats are more complicated than previously suggested. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
Genetic association with low concentrations of high density lipoprotein-cholesterol in a pediatric population of the Middle East and North Africa: the CASPIAN-III study.

PubMed

Kelishadi, Roya; Haghjooy Javanmard, Shaghayegh; Tajadini, Mohammad Hasan; Mansourian, Marjan; Motlagh, Mohammad Esmaeil; Ardalan, Gelayol; Ban, Matthew

2014-11-01

Depressed high-density lipoprotein cholesterol (HDL-C) is prevalent the Middle East and North Africa. Some studies have documented associations between HDL-C and several single nucleotide polymorphisms (SNPs) in candidate gene polymorphisms. We investigated the associations between SNP genotypes and HDL-C levels in Iranian students, aged 10-18 years. Genotyping was performed in 750 randomly selected participants among those with low HDL-C levels (below 5th percentile), intermediate HDL-C levels (5-95th) and high HDL-C levels (above the 95th percentile). Minor allele frequencies (MAFs) of the SNPs of interest were compared between the three HDL-C groups. The vast majority of pairwise comparisons of MAFs between HDL-C groups were significant. Pairwise comparisons between low and high HDL-C groups showed significant between-group differences in MAFs for all SNPs, except for APOC3 rs5128. Pairwise comparisons between low and intermediate HDL-C groups showed significant between-group differences in MAFs for all SNPs, except for APOC3 rs5128 and APOA1 rs2893157. Pairwise comparisons between intermediate and high HDL-C groups showed significant between-group differences in MAFs for all SNPs, except for ABCA1 APOC3 rs5128 and APOA1 rs2893157. After adjustment for confounding factors, including age, sex, body mass index, low physical activity, consumption of saturated fats, and socioeconomic status, ABCA1 r1587K and CETP A373P significantly increased the risk of depressed HDL-C, and CETP Taq1 had a protective role. This study replicated several associations between HDL-C levels and candidate gene SNPs from genome-wide associations with HDL-C in Iranians from the pediatric age group. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Phylogeny of the Genus Flavivirus

PubMed Central

Kuno, Goro; Chang, Gwong-Jen J.; Tsuchiya, K. Richard; Karabatsos, Nick; Cropp, C. Bruce

1998-01-01

We undertook a comprehensive phylogenetic study to establish the genetic relationship among the viruses of the genus Flavivirus and to compare the classification based on molecular phylogeny with the existing serologic method. By using a combination of quantitative definitions (bootstrap support level and the pairwise nucleotide sequence identity), the viruses could be classified into clusters, clades, and species. Our phylogenetic study revealed for the first time that from the putative ancestor two branches, non-vector and vector-borne virus clusters, evolved and from the latter cluster emerged tick-borne and mosquito-borne virus clusters. Provided that the theory of arthropod association being an acquired trait was correct, pairwise nucleotide sequence identity among these three clusters provided supporting data for a possibility that the non-vector cluster evolved first, followed by the separation of tick-borne and mosquito-borne virus clusters in that order. Clades established in our study correlated significantly with existing antigenic complexes. We also resolved many of the past taxonomic problems by establishing phylogenetic relationships of the antigenically unclassified viruses with the well-established viruses and by identifying synonymous viruses. PMID:9420202
Phylogeny of the genus Flavivirus.

PubMed

Kuno, G; Chang, G J; Tsuchiya, K R; Karabatsos, N; Cropp, C B

1998-01-01

We undertook a comprehensive phylogenetic study to establish the genetic relationship among the viruses of the genus Flavivirus and to compare the classification based on molecular phylogeny with the existing serologic method. By using a combination of quantitative definitions (bootstrap support level and the pairwise nucleotide sequence identity), the viruses could be classified into clusters, clades, and species. Our phylogenetic study revealed for the first time that from the putative ancestor two branches, non-vector and vector-borne virus clusters, evolved and from the latter cluster emerged tick-borne and mosquito-borne virus clusters. Provided that the theory of arthropod association being an acquired trait was correct, pairwise nucleotide sequence identity among these three clusters provided supporting data for a possibility that the non-vector cluster evolved first, followed by the separation of tick-borne and mosquito-borne virus clusters in that order. Clades established in our study correlated significantly with existing antigenic complexes. We also resolved many of the past taxonomic problems by establishing phylogenetic relationships of the antigenically unclassified viruses with the well-established viruses and by identifying synonymous viruses.
Kernel Machine SNP-set Testing under Multiple Candidate Kernels

PubMed Central

Wu, Michael C.; Maity, Arnab; Lee, Seunggeun; Simmons, Elizabeth M.; Harmon, Quaker E.; Lin, Xinyi; Engel, Stephanie M.; Molldrem, Jeffrey J.; Armistead, Paul M.

2013-01-01

Joint testing for the cumulative effect of multiple single nucleotide polymorphisms grouped on the basis of prior biological knowledge has become a popular and powerful strategy for the analysis of large scale genetic association studies. The kernel machine (KM) testing framework is a useful approach that has been proposed for testing associations between multiple genetic variants and many different types of complex traits by comparing pairwise similarity in phenotype between subjects to pairwise similarity in genotype, with similarity in genotype defined via a kernel function. An advantage of the KM framework is its flexibility: choosing different kernel functions allows for different assumptions concerning the underlying model and can allow for improved power. In practice, it is difficult to know which kernel to use a priori since this depends on the unknown underlying trait architecture and selecting the kernel which gives the lowest p-value can lead to inflated type I error. Therefore, we propose practical strategies for KM testing when multiple candidate kernels are present based on constructing composite kernels and based on efficient perturbation procedures. We demonstrate through simulations and real data applications that the procedures protect the type I error rate and can lead to substantially improved power over poor choices of kernels and only modest differences in power versus using the best candidate kernel. PMID:23471868
Prioritization based on neutral genetic diversity may fail to conserve important characteristics in cattle breeds.

PubMed

Hall, S J G; Lenstra, J A; Deeming, D C

2012-06-01

Conservation of the intraspecific genetic diversity of livestock species requires protocols that assess between-breed genetic variability and also take into account differences among individuals within breeds. Here, we focus on variation between breeds. Conservation of neutral genetic variation has been seen as promoting, through linkage processes, the retention of useful and potentially useful variation. Using public information on beef cattle breeds, with a total of 165 data sets each relating to a breed comparison of a performance variable, we have tested this paradigm by calculating the correlations between pairwise breed differences in performance and pairwise genetic distances deduced from biochemical and immunological polymorphisms, microsatellites and single-nucleotide polymorphisms. As already observed in floral and faunal biodiversity, significant positive correlations (n=54) were found, but many correlations were non-significant (n=100) or significantly negative (n=11). This implies that maximizing conserved neutral genetic variation with current techniques may conserve breed-level genetic variation in some traits but not in others and supports the view that genetic distance measurements based on neutral genetic variation are not sufficient as a determinant of conservation priority among breeds. © 2011 Blackwell Verlag GmbH.
Complete Genome Sequence of a Genomovirus Associated with Common Bean Plant Leaves in Brazil.

PubMed

Lamas, Natalia Silva; Fontenele, Rafaela Salgado; Melo, Fernando Lucas; Costa, Antonio Felix; Varsani, Arvind; Ribeiro, Simone Graça

2016-11-10

A new genomovirus has been identified in three common bean plants in Brazil. This virus has a circular genome of 2,220 nucleotides and 3 major open reading frames. It shares 80.7% genome-wide pairwise identity with a genomovirus recovered from Tongan fruit bat guano. Copyright © 2016 Lamas et al.
An Outbreak of Streptococcus pyogenes in a Mental Health Facility: Advantage of Well-Timed Whole-Genome Sequencing Over emm Typing.

PubMed

Bergin, Sarah M; Periaswamy, Balamurugan; Barkham, Timothy; Chua, Hong Choon; Mok, Yee Ming; Fung, Daniel Shuen Sheng; Su, Alex Hsin Chuan; Lee, Yen Ling; Chua, Ming Lai Ivan; Ng, Poh Yong; Soon, Wei Jia Wendy; Chu, Collins Wenhan; Tan, Siyun Lucinda; Meehan, Mary; Ang, Brenda Sze Peng; Leo, Yee Sin; Holden, Matthew T G; De, Partha; Hsu, Li Yang; Chen, Swaine L; de Sessions, Paola Florez; Marimuthu, Kalisvar

2018-05-09

OBJECTIVEWe report the utility of whole-genome sequencing (WGS) conducted in a clinically relevant time frame (ie, sufficient for guiding management decision), in managing a Streptococcus pyogenes outbreak, and present a comparison of its performance with emm typing.SETTINGA 2,000-bed tertiary-care psychiatric hospital.METHODSActive surveillance was conducted to identify new cases of S. pyogenes. WGS guided targeted epidemiological investigations, and infection control measures were implemented. Single-nucleotide polymorphism (SNP)-based genome phylogeny, emm typing, and multilocus sequence typing (MLST) were performed. We compared the ability of WGS and emm typing to correctly identify person-to-person transmission and to guide the management of the outbreak.RESULTSThe study included 204 patients and 152 staff. We identified 35 patients and 2 staff members with S. pyogenes. WGS revealed polyclonal S. pyogenes infections with 3 genetically distinct phylogenetic clusters (C1-C3). Cluster C1 isolates were all emm type 4, sequence type 915 and had pairwise SNP differences of 0-5, which suggested recent person-to-person transmissions. Epidemiological investigation revealed that cluster C1 was mediated by dermal colonization and transmission of S. pyogenes in a male residential ward. Clusters C2 and C3 were genomically diverse, with pairwise SNP differences of 21-45 and 26-58, and emm 11 and mostly emm120, respectively. Clusters C2 and C3, which may have been considered person-to-person transmissions by emm typing, were shown by WGS to be unlikely by integrating pairwise SNP differences with epidemiology.CONCLUSIONSWGS had higher resolution than emm typing in identifying clusters with recent and ongoing person-to-person transmissions, which allowed implementation of targeted intervention to control the outbreak.Infect Control Hosp Epidemiol 2018;1-9.
Deep Sequencing Reveals a Divergent Ugandan cassava brown streak virus Isolate from Malawi

PubMed Central

Winter, Stephan; Mukasa, Settumba; Tairo, Fred; Sseruwagi, Peter; Ndunguru, Joseph; Duffy, Siobain

2017-01-01

ABSTRACT Illumina sequencing of RNA from a cassava cutting from northern Malawi produced a genome of Ugandan cassava brown streak virus (UCBSV-MW-NB7_2013). Sequence comparisons revealed stronger similarity to an isolate from nearby Tanzania (93.4% pairwise nucleotide identity) than to those previously reported from Malawi (86.9 to 87.0%). PMID:28818908
Molecular characterization of echovirus 30-associated outbreak of aseptic meningitis in Korea in 2008.

PubMed

Choi, Young Jin; Park, Kwi Sung; Baek, Kyoung Ah; Jung, Eun Hye; Nam, Hae Seon; Kim, Yong Bae; Park, Joon Soo

2010-03-01

Evaluation of the primary etiologic agents that cause aseptic meningitis outbreaks may provide valuable information regarding the prevention and management of aseptic meningitis. In Korea, an outbreak of aseptic meningitis caused by echovirus type 30 (E30) occurred from May to October in 2008. In order to determine the etiologic agent, CSF and/or stool specimens from 140 children hospitalized for aseptic meningitis at Soonchunhyang University Cheonan Hospital between June and October of 2008 were tested for virus isolation and identification. E30 accounted for 61.7% (37 cases) and echovirus 6 accounted for 21.7% (13 cases) of all the human enteroviruses (HEVs) isolates (60 cases in total). For the molecular characterization of the isolates, the VP1 gene sequence of 18 Korean E30 isolates was compared pairwise using the MegAlign with 34 reference strains from the GenBank database. The pairwise comparison of the nucleotide sequences of the VP1 genes demonstrated that the sequences of the Korean strains differed from those of lineage groups A, B, C, D, E, F and G. Reconstruction of the phylogenetic tree based on the complete VP1 nucleotide sequences resulted in a monophyletic tree, with eight clustered lineage groups. All Korean isolates were segregated from other lineage groups, thus suggesting that the Korean strains were a distinct lineage of E30, and a probable cause of this outbreak. This manuscript is the first report, to the best of our knowledge, of the molecular characteristics of E30 strains associated with an aseptic meningitis outbreak in Korea, and their respective phylogenetic relationships.
Molecular systematics of higher primates: genealogical relations and classification.

PubMed Central

Miyamoto, M M; Koop, B F; Slightom, J L; Goodman, M; Tennant, M R

1988-01-01

We obtained 5' and 3' flanking sequences (5.4 kilobase pairs) from the psi eta-globin gene region of the rhesus macaque (Macaca mulatta) and combined them with available nucleotide data. The completed sequence, representing 10.8 kilobase pairs of contiguous noncoding DNA, was compared to the same orthologous regions available for human (Homo sapiens, as represented by five different alleles), common chimpanzee (Pan troglodytes), gorilla (Gorilla gorilla), and orangutan (Pongo pygmaeus). The nucleotide sequence for Macaca mulatta provided the outgroup perspective needed to evaluate better the relationships of humans and great apes. Pairwise comparisons and parsimony analysis of these orthologues clearly demonstrated (i) that humans and great apes share a high degree of genetic similarity and (ii) that humans, chimpanzees, and gorillas form a natural monophyletic group. These conclusions strongly favor a genealogical classification for higher primates consisting of a single family (Hominidae) with two subfamilies (Homininae for Homo, Pan, and Gorilla and Ponginae for Pongo). PMID:3174657
Alphasatellitidae: a new family with two subfamilies for the classification of geminivirus- and nanovirus-associated alphasatellites.

PubMed

Briddon, Rob W; Martin, Darren P; Roumagnac, Philippe; Navas-Castillo, Jesús; Fiallo-Olivé, Elvira; Moriones, Enrique; Lett, Jean-Michel; Zerbini, F Murilo; Varsani, Arvind

2018-05-09

Nanoviruses and geminiviruses are circular, single stranded DNA viruses that infect many plant species around the world. Nanoviruses and certain geminiviruses that belong to the Begomovirus and Mastrevirus genera are associated with additional circular, single stranded DNA molecules (~ 1-1.4 kb) that encode a replication-associated protein (Rep). These Rep-encoding satellite molecules are commonly referred to as alphasatellites and here we communicate the establishment of the family Alphasatellitidae to which these have been assigned. Within the Alphasatellitidae family two subfamilies, Geminialphasatellitinae and Nanoalphasatellitinae, have been established to respectively accommodate the geminivirus- and nanovirus-associated alphasatellites. Whereas the pairwise nucleotide sequence identity distribution of all the known geminialphasatellites (n = 628) displayed a troughs at ~ 70% and 88% pairwise identity, that of the known nanoalphasatellites (n = 54) had a troughs at ~ 67% and ~ 80% pairwise identity. We use these pairwise identity values as thresholds together with phylogenetic analyses to establish four genera and 43 species of geminialphasatellites and seven genera and 19 species of nanoalphasatellites. Furthermore, a divergent alphasatellite associated with coconut foliar decay disease is assigned to a species but not a subfamily as it likely represents a new alphasatellite subfamily that could be established once other closely related molecules are discovered.
Genetic differentiation in blue shark, Prionace glauca, from the central Pacific Ocean, as inferred by mitochondrial cytochrome b region.

PubMed

Li, Weiwen; Dai, Xiaojie; Zhu, Jiangfeng; Tian, Siquan; He, Shan; Wu, Feng

2017-07-01

Six hundred and ninety-seven base pairs of cytochrome b gene of mtDNA was sequenced and analyzed for 78 blue shark Prionace glauca individuals from three sampled locations in the central Pacific Ocean (CPO). In total, three polymorphic sites were detected which defined four haplotypes. The haplotype diversity (h) ranged from 0.517 to 0.768, and nucleotide diversity (π) was between 0.0007 and 0.0011. Analysis of molecular variance indicated a non-significant differentiation among subpopulations. Furthermore, pairwise F ST score analysis revealed a non-significant differentiation among three sampled regions. Generally, low genetic differences were found between different geographic locations in the CPO. This study suggests a single panmictic population of P. glauca in the CPO.

DNA Barcodes of Asian Houbara Bustard (Chlamydotis undulata macqueenii)

PubMed Central

Arif, Ibrahim A.; Khan, Haseeb A.; Williams, Joseph B.; Shobrak, Mohammad; Arif, Waad I.

2012-01-01

Populations of Houbara Bustards have dramatically declined in recent years. Captive breeding and reintroduction programs have had limited success in reviving population numbers and thus new technological solutions involving molecular methods are essential for the long term survival of this species. In this study, we sequenced the 694 bp segment of COI gene of the four specimens of Asian Houbara Bustard (Chlamydotis undulata macqueenii). We also compared these sequences with earlier published barcodes of 11 individuals comprising different families of the orders Gruiformes, Ciconiiformes, Podicipediformes and Crocodylia (out group). The pair-wise sequence comparison showed a total of 254 variable sites across all the 15 sequences from different taxa. Three of the four specimens of Houbara Bustard had an identical sequence of COI gene and one individual showed a single nucleotide difference (G > A transition at position 83). Within the bustard family (Otididae), comparison among the three species (Asian Houbara Bustard, Great Bustard (Otis tarda) and the Little Bustard (Tetrax tetrax)), representing three different genera, showed 116 variable sites. For another family (Rallidae), the intra-family variable sites among the individuals of four different genera were found to be 146. The COI genetic distances among the 15 individuals varied from 0.000 to 0.431. Phylogenetic analysis using 619 bp nucleotide segment of COI clearly discriminated all the species representing different genera, families and orders. All the four specimens of Houbara Bustard formed a single clade and are clearly separated from other two individuals of the same family (Otis tarda and Tetrax tetrax). The nucleotide sequence of partial segment of COI gene effectively discriminated the closely related species. This is the first study reporting the barcodes of Houbara Bustard and would be helpful in future molecular studies, particularly for the conservation of this threatened bird in Saudi Arabia. PMID:22408462
Complete nucleotide sequence of pig (Sus scrofa) mitochondrial genome and dating evolutionary divergence within Artiodactyla.

PubMed

Lin, C S; Sun, Y L; Liu, C Y; Yang, P C; Chang, L C; Cheng, I C; Mao, S J; Huang, M C

1999-08-05

The complete nucleotide sequence of the pig (Sus scrofa) mitochondrial genome, containing 16613bp, is presented in this report. The genome is not a specific length because of the presence of the variable numbers of tandem repeats, 5'-CGTGCGTACA in the displacement loop (D-loop). Genes responsible for 12S and 16S rRNAs, 22 tRNAs, and 13 protein-coding regions are found. The genome carries very few intergenic nucleotides with several instances of overlap between protein-coding or tRNA genes, except in the D-loop region. For evaluating the possible evolutionary relationships between Artiodactyla and Cetacea, the nucleotide substitutions and amino acid sequences of 13 protein-coding genes were aligned by pairwise comparisons of the pig, cow, and fin whale. By comparing these sequences, we suggest that there is a closer relationship between the pig and cow than that between either of these species and fin whale. In addition, the accumulation of transversions and gaps in pig 12S and 16S rRNA genes was compared with that in other eutherian species, including cow, fin whale, human, horse, and harbor seal. The results also reveal a close phylogenetic relationship between pig and cow, as compared to fin whale and others. Thus, according to the sequence differences of mitochondrial rRNA genes in eutherian species, the evolutionary separation of pig and cow occurred about 53-60 million years ago.
Modeling the Association of Space, Time, and Host Species with Variation of the HA, NA, and NS Genes of H5N1 Highly Pathogenic Avian Influenza Viruses Isolated from Birds in Romania in 2005–2007

PubMed Central

Alkhamis, Mohammad; Perez, Andres; Batey, Nicole; Howard, Wendy; Baillie, Greg; Watson, Simon; Franz, Stephanie; Focosi-Snyman, Raffaella; Onita, Iuliana; Cioranu, Raluca; Turcitu, Mihai; Kellam, Paul; Brown, Ian H.; Breed, Andrew C.

2014-01-01

SUMMARY Molecular characterization studies of a diverse collection of avian influenza viruses (AIVs) have demonstrated that AIVs’ greatest genetic variability lies in the HA, NA, and NS genes. The objective here was to quantify the association between geographical locations, periods of time, and host species and pairwise nucleotide variation in the HA, NA, and NS genes of 70 isolates of H5N1 highly pathogenic avian influenza virus (HPAIV) collected from October 2005 to December 2007 from birds in Romania. A mixed-binomial Bayesian regression model was used to quantify the probability of nucleotide variation between isolates and its association with space, time, and host species. As expected for the three target genes, a higher probability of nucleotide differences (odds ratios [ORs] > 1) was found between viruses sampled from places at greater geographical distances from each other, viruses sampled over greater periods of time, and viruses derived from different species. The modeling approach in the present study maybe useful in further understanding the molecular epidemiology of H5N1 HPAI virus in bird populations. The methodology presented here will be useful in predicting the most likely genetic distance for any of the three gene segments of viruses that have not yet been isolated or sequenced based on space, time, and host species during the course of an epidemic. PMID:24283126
Assessment of the Geographic Origins of Pinewood Nematode Isolates via Single Nucleotide Polymorphism in Effector Genes

PubMed Central

Figueiredo, Joana; Simões, Maria José; Gomes, Paula; Barroso, Cristina; Pinho, Diogo; Conceição, Luci; Fonseca, Luís; Abrantes, Isabel; Pinheiro, Miguel; Egas, Conceição

2013-01-01

The pinewood nematode, Bursaphelenchus xylophilus, is native to North America but it only causes damaging pine wilt disease in those regions of the world where it has been introduced. The accurate detection of the species and its dispersal routes are thus essential to define effective control measures. The main goals of this study were to analyse the genetic diversity among B. xylophilus isolates from different geographic locations and identify single nucleotide polymorphism (SNPs) markers for geographic origin, through a comparative transcriptomic approach. The transcriptomes of seven B. xylophilus isolates, from Continental Portugal (4), China (1), Japan (1) and USA (1), were sequenced in the next generation platform Roche 454. Analysis of effector gene transcripts revealed inter-isolate nucleotide diversity that was validated by Sanger sequencing in the genomic DNA of the seven isolates and eight additional isolates from different geographic locations: Madeira Island (2), China (1), USA (1), Japan (2) and South Korea (2). The analysis identified 136 polymorphic positions in 10 effector transcripts. Pairwise comparison of the 136 SNPs through Neighbor-Joining and the Maximum Likelihood methods and 5-mer frequency analysis with the alignment-independent bilinear multivariate modelling approach correlated the SNPs with the isolates geographic origin. Furthermore, the SNP analysis indicated a closer proximity of the Portuguese isolates to the Korean and Chinese isolates than to the Japanese or American isolates. Each geographic cluster carried exclusive alleles that can be used as SNP markers for B. xylophilus isolate identification. PMID:24391785
Modeling the association of space, time, and host species with variation of the HA, NA, and NS genes of H5N1 highly pathogenic avian influenza viruses isolated from birds in Romania in 2005-2007.

PubMed

Alkhamis, Mohammad; Perez, Andres; Batey, Nicole; Howard, Wendy; Baillie, Greg; Watson, Simon; Franz, Stephanie; Focosi-Snyman, Raffaella; Onita, Iuliana; Cioranu, Raluca; Turcitu, Mihai; Kellam, Paul; Brown, Ian H; Breed, Andrew C

2013-09-01

Molecular characterization studies of a diverse collection of avian influenza viruses (AIVs) have demonstrated that AIVs' greatest genetic variability lies in the HA, NA, and NS genes. The objective here was to quantify the association between geographical locations, periods of time, and host species and pairwise nucleotide variation in the HA, NA, and NS genes of 70 isolates of H5N1 highly pathogenic avian influenza virus (HPAIV) collected from October 2005 to December 2007 from birds in Romania. A mixed-binomial Bayesian regression model was used to quantify the probability of nucleotide variation between isolates and its association with space, time, and host species. As expected for the three target genes, a higher probability of nucleotide differences (odds ratios [ORs] > 1) was found between viruses sampled from places at greater geographical distances from each other, viruses sampled over greater periods of time, and viruses derived from different species. The modeling approach in the present study maybe useful in further understanding the molecular epidemiology of H5N1 HPAI virus in bird populations. The methodology presented here will be useful in predicting the most likely genetic distance for any of the three gene segments of viruses that have not yet been isolated or sequenced based on space, time, and host species during the course of an epidemic.
Complete sequence of two tick-borne flaviviruses isolated from Siberia and the UK: analysis and significance of the 5' and 3'-UTRs.

PubMed

Gritsun, T S; Venugopal, K; Zanotto, P M; Mikhailov, M V; Sall, A A; Holmes, E C; Polkinghorne, I; Frolova, T V; Pogodina, V V; Lashkevich, V A; Gould, E A

1997-05-01

The complete nucleotide sequence of two tick-transmitted flaviviruses, Vasilchenko (Vs) from Siberia and louping ill (LI) from the UK, have been determined. The genomes were respectively, 10928 and 10871 nucleotides (nt) in length. The coding strategy and functional protein sequence motifs of tick-borne flaviviruses are presented in both Vs and LI viruses. The phylogenies based on maximum likelihood, maximum parsimony and distance analysis of the polyproteins, identified Vs virus as a member of the tick-borne encephalitis virus subgroup within the tick-borne serocomplex, genus Flavivirus, family Flaviviridae. Comparative alignment of the 3'-untranslated regions revealed deletions of different lengths essentially at the same position downstream of the stop codon for all tick-borne viruses. Two direct 27 nucleotide repeats at the 3'-end were found only for Vs and LI virus. Immediately following the deletions a region of 332-334 nt with relatively conserved primary structure (67-94% identity) was observed at the 3'-non-coding end of the virus genome. Pairwise comparisons of the nucleotide sequence data revealed similar levels of variation between the coding region, and the 5' and 3'-termini of the genome, implying an equivalent strong selective control for translated and untranslated regions. Indeed the predicted folding of the 5' and 3'-untranslated regions revealed patterns of stem and loop structures conserved for all tick-borne flaviviruses suggesting a purifying selection for preservation of essential RNA secondary structures which could be involved in translational control and replication. The possible implications of these findings are discussed.
Trading genes along the silk road: mtDNA sequences and the origin of central Asian populations.

PubMed Central

Comas, D; Calafell, F; Mateu, E; Pérez-Lezaun, A; Bosch, E; Martínez-Arias, R; Clarimon, J; Facchini, F; Fiori, G; Luiselli, D; Pettener, D; Bertranpetit, J

1998-01-01

Central Asia is a vast region at the crossroads of different habitats, cultures, and trade routes. Little is known about the genetics and the history of the population of this region. We present the analysis of mtDNA control-region sequences in samples of the Kazakh, the Uighurs, the lowland Kirghiz, and the highland Kirghiz, which we have used to address both the population history of the region and the possible selective pressures that high altitude has on mtDNA genes. Central Asian mtDNA sequences present features intermediate between European and eastern Asian sequences, in several parameters-such as the frequencies of certain nucleotides, the levels of nucleotide diversity, mean pairwise differences, and genetic distances. Several hypotheses could explain the intermediate position of central Asia between Europe and eastern Asia, but the most plausible would involve extensive levels of admixture between Europeans and eastern Asians in central Asia, possibly enhanced during the Silk Road trade and clearly after the eastern and western Eurasian human groups had diverged. Lowland and highland Kirghiz mtDNA sequences are very similar, and the analysis of molecular variance has revealed that the fraction of mitochondrial genetic variance due to altitude is not significantly different from zero. Thus, it seems unlikely that altitude has exerted a major selective pressure on mitochondrial genes in central Asian populations. PMID:9837835
Epistatic SNP interaction of ERCC6 with ERCC8 and their joint protein expression contribute to gastric cancer/atrophic gastritis risk.

PubMed

Jing, Jing-Jing; Lu, You-Zhu; Sun, Li-Ping; Liu, Jing-Wei; Gong, Yue-Hua; Xu, Qian; Dong, Nan-Nan; Yuan, Yuan

2017-06-27

Excision repair cross-complementing group 6 and 8 (ERCC6 and ERCC8) are two indispensable genes for the initiation of transcription-coupled nucleotide excision repair pathway. This study aimed to evaluate the interactions between single nucleotide polymorphisms of ERCC6 (rs1917799) and ERCC8 (rs158572 and rs158916) in gastric cancer and its precancerous diseases. Besides, protein level analysis were performed to compare ERCC6 and ERCC8 expression in different stages of gastric diseases, and to correlate SNPs jointly with gene expression. Sequenom MassARRAY platform method was used to detect polymorphisms of ERCC6 and ERCC8 in 1916 subjects. In situ ERCC6 and ERCC8 protein expression were detected by immunohistochemistry in 109 chronic superficial gastritis, 109 chronic atrophic gastritis and 109 gastric cancer cases. Our results demonstrated pairwise epistatic interactions between ERCC6 and ERCC8 SNPs that ERCC6 rs1917799-ERCC8 rs158572 combination was associated with decreased risk of chronic atrophic gastritis and increased risk of gastric cancer. ERCC6 rs1917799 also showed a significant interaction with ERCC8 rs158916 to reduce gastric cancer risk. The expressions of ERCC6, ERCC8 and ERCC6-ERCC8 combination have similarities that higher positivity was observed in chronic superficial gastritis compared with chronic atrophic gastritis and gastric cancer. As for the effects of ERCC6 and ERCC8 SNPs on the protein expression, single SNP had no correlation with corresponding gene expression, whereas the ERCC6 rs1917799-ERCC8 rs158572 pair had significant influence on ERCC6 and ERCC6-ERCC8 expression. In conclusion, ERCC6 rs1917799, ERCC8 rs158572 and rs158916 demonstrated pairwise epistatic interactions to associate with chronic atrophic gastritis and gastric cancer risk. The ERCC6 rs1917799-ERCC8 rs158572 pair significantly influence ERCC6 and ERCC6-ERCC8 expression.
Epiregulin (EREG) and human V-ATPase (TCIRG1): genetic variation, ethnicity and pulmonary tuberculosis susceptibility in Guinea-Bissau and The Gambia

PubMed Central

White, Marquitta J.; Tacconelli, Alessandra; Chen, Jane S.; Wejse, Christian; Hill, Philip C.; Gomez, Victor F; Velez-Edwards, Digna R.; Østergaard, Lars J.; Hu, Ting; Moore, Jason H.; Novelli, Giuseppe; Scott, William K.; Williams, Scott M.; Sirugo, Giorgio

2017-01-01

We analyzed two West African samples (Guinea-Bissau: n = 289 cases, 322 controls; The Gambia: n = 240 cases, 248 controls) to evaluate single nucleotide polymorphisms (SNPs) in Epiregulin (EREG) and V-ATPase (T cell immune regulator 1, TCIRG1) using single and multi-locus analyses to determine whether previously described associations with pulmonary tuberculosis (PTB) in Vietnamese and Italians would replicate in African populations. We did not detect any significant single locus or haplotype associations in either sample. We also performed exploratory pairwise interaction analyses using Visualization of Statistical Epistasis Networks (ViSEN), a novel method to detect only interactions among multiple variables, to elucidate possible interaction effects between SNPs and demographic factors. Although we found no strong evidence of marginal effects, there were several significant pairwise interactions that were identified in either the Guinea-Bissau or The Gambia samples, two of which replicated across populations. Our results indicate that the effects of EREG and TCIRG1 variants on PTB susceptibility, to the extent that they exist, are dependent on gene-gene interactions in West African populations as detected with ViSEN. In addition, epistatic effects are likely to be influenced by inter- and intra-population differences in genetic or environmental context and/or the mycobacterial lineages causing disease. PMID:24898387
Analysis of DNA methylation in Arabidopsis thaliana based on methylation-sensitive AFLP markers.

PubMed

Cervera, M T; Ruiz-García, L; Martínez-Zapater, J M

2002-12-01

AFLP analysis using restriction enzyme isoschizomers that differ in their sensitivity to methylation of their recognition sites has been used to analyse the methylation state of anonymous CCGG sequences in Arabidopsis thaliana. The technique was modified to improve the quality of fingerprints and to visualise larger numbers of scorable fragments. Sequencing of amplified fragments indicated that detection was generally associated with non-methylation of the cytosine to which the isoschizomer is sensitive. Comparison of EcoRI/ HpaII and EcoRI/ MspI patterns in different ecotypes revealed that 35-43% of CCGG sites were differentially digested by the isoschizomers. Interestingly, the pattern of digestion among different plants belonging to the same ecotype is highly conserved, with the rate of intra-ecotype methylation-sensitive polymorphisms being less than 1%. However, pairwise comparisons of methylation patterns between samples belonging to different ecotypes revealed differences in up to 34% of the methylation-sensitive polymorphisms. The lack of correlation between inter-ecotype similarity matrices based on methylation-insensitive or methylation-sensitive polymorphisms suggests that whatever the mechanisms regulating methylation may be, they are not related to nucleotide sequence variation.
Sequence analysis of a few species of termites (Order: Isoptera) on the basis of partial characterization of COII gene.

PubMed

Sobti, Ranbir Chander; Kumari, Mamtesh; Sharma, Vijay Lakshmi; Sodhi, Monika; Mukesh, Manishi; Shouche, Yogesh

2009-11-01

The present study was aimed to get the nucleotide sequences of a part of COII mitochondrial gene amplified from individuals of five species of Termites (Isoptera: Termitidae: Macrotermitinae). Four of them belonged to the genus Odontotermes (O. obesus, O. horni, O. bhagwatii and Odontotermes sp.) and one to Microtermes (M. obesi). Partial COII gene fragments were amplified by using specific primers. The sequences so obtained were characterized to calculate the frequencies of each nucleotide bases and a high A + T content was observed. The interspecific pairwise sequence divergence in Odontotermes species ranged from 6.5% to 17.1% across COII fragment. M. obesi sequence diversity ranged from 2.5 with Odontotermes sp. to 19.0% with O. bhagwatii. Phylogenetic trees drawn on the basis of distance neighbour-joining method revealed three main clades clustering all the individuals according to their genera and families.
Molecular characterization of Atractolytocestus sagittatus (Cestoda: Caryophyllidea), monozoic parasite of common carp, and its differentiation from the invasive species Atractolytocestus huronensis.

PubMed

Bazsalovicsová, Eva; Králová-Hromadová, Ivica; Stefka, Jan; Scholz, Tomáš

2012-05-01

Sequence structure of complete internal transcribed spacer 1 and 2 (ITS1 and ITS2) of the ribosomal DNA region and partial mitochondrial cytochrome c oxidase subunit I (cox1) gene sequences were studied in the monozoic tapeworm Atractolytocestus sagittatus (Kulakovskaya et Akhmerov, 1965) (Cestoda: Caryophyllidea), a parasite of common carp (Cyprinus carpio carpio L.). Intraindividual sequence diversity was observed in both ribosomal spacers. In ITS1, a total number of 19 recombinant clones yielded eight different sequence types (pairwise sequence identity, 99.7-100%) which, however, did not resemble the structure typical for divergent intragenomic ITS copies (paralogues). Polymorphism was displayed by several single nucleotide mutations present exclusively in single clones, but variation in the number of short repetitive motifs was not observed. In ITS2, a total of 21 recombinant clones yielded ten different sequence types (pairwise sequence identity, 97.5-100%). They were mostly characterized by a varying number of (TCGT)(n) repeats resulting in assortment of ITS2 sequences into two sequence variants, which reflected the structure specific for ITS paralogues. The third DNA region analysed, mitochondrial cox1 gene (669 bp) was detected to be 100% identical in all studied A. sagittatus individuals. Comparison of molecular data on A. sagittatus with those on Atractolytocestus huronensis Anthony, 1958, an invasive parasite of common carp, has shown that interspecific differences significantly exceeded intraspecific variation in both ribosomal spacers (81.4-82.5% in ITS1, 74.4-75.2% in ITS2) as well as in mitochondrial cox1, which confirms validity of both congeneric tapeworms parasitic in the same fish host.
Diversity of partial RNA-dependent RNA polymerase gene sequences of soybean blotchy mosaic virus isolates from different host-, geographical- and temporal origins.

PubMed

Strydom, Elrea; Pietersen, Gerhard

2018-05-01

Infection of soybean by the plant cytorhabdovirus soybean blotchy mosaic virus (SbBMV) results in significant yield losses in the temperate, lower-lying soybean production regions of South Africa. A 277 bp portion of the RNA-dependent RNA polymerase gene of 66 SbBMV isolates from different: hosts, geographical locations in South Africa, and times of collection (spanning 16 years) were amplified by RT-PCR and sequenced to investigate the genetic diversity of isolates. Phylogenetic reconstruction revealed three main lineages, designated Groups A, B and C, with isolates grouping primarily according to geographic origin. Pairwise nucleotide identities ranged between 85.7% and 100% among all isolates, with isolates in Group A exhibiting the highest degree of sequence identity, and isolates of Groups A and B being more closely related to each other than to those in Group C. This is the first study investigating the genetic diversity of SbBMV.
Genome-wide single nucleotide polymorphisms reveal population history and adaptive divergence in wild guppies.

PubMed

Willing, Eva-Maria; Bentzen, Paul; van Oosterhout, Cock; Hoffmann, Margarete; Cable, Joanne; Breden, Felix; Weigel, Detlef; Dreyer, Christine

2010-03-01

Adaptation of guppies (Poecilia reticulata) to contrasting upland and lowland habitats has been extensively studied with respect to behaviour, morphology and life history traits. Yet population history has not been studied at the whole-genome level. Although single nucleotide polymorphisms (SNPs) are the most abundant form of variation in many genomes and consequently very informative for a genome-wide picture of standing natural variation in populations, genome-wide SNP data are rarely available for wild vertebrates. Here we use genetically mapped SNP markers to comprehensively survey genetic variation within and among naturally occurring guppy populations from a wide geographic range in Trinidad and Venezuela. Results from three different clustering methods, Neighbor-net, principal component analysis (PCA) and Bayesian analysis show that the population substructure agrees with geographic separation and largely with previously hypothesized patterns of historical colonization. Within major drainages (Caroni, Oropouche and Northern), populations are genetically similar, but those in different geographic regions are highly divergent from one another, with some indications of ancient shared polymorphisms. Clear genomic signatures of a previous introduction experiment were seen, and we detected additional potential admixture events. Headwater populations were significantly less heterozygous than downstream populations. Pairwise F(ST) values revealed marked differences in allele frequencies among populations from different regions, and also among populations within the same region. F(ST) outlier methods indicated some regions of the genome as being under directional selection. Overall, this study demonstrates the power of a genome-wide SNP data set to inform for studies on natural variation, adaptation and evolution of wild populations.
Prehistoric introduction of domestic pigs onto the Okinawa Islands: ancient mitochondrial DNA evidence.

PubMed

Watanobe, Takuma; Ishiguro, Naotaka; Nakano, Masuo; Takamiya, Hiroto; Matsui, Akira; Hongo, Hitomi

2002-08-01

Ancient DNAs of Sus scrofa specimens excavated from archaeological sites on the Okinawa islands were examined to clarify the genetic relationships among prehistoric Sus scrofa, modern wild boars and domestic pigs inhabiting the Ryukyu archipelago, the Japanese islands, and the Asian continent. We extracted remain DNA from 161 bone specimens excavated from 12 archaeological sites on the Okinawa islands and successfully amplified mitochondrial DNA control region fragments from 33 of 161 specimens. Pairwise difference between prehistoric and modern S. scrofa nucleotide sequences showed that haplotypes of the East Asian domestic pig lineage were found from archaeological specimens together with Ryukyu wild boars native to the Ryukyu archipelago. Phylogenetic analysis of 14 ancient sequences (11 haplotypes; 574 bp) indicated that S. scrofa specimens from two Yayoi-Heian sites (Kitahara and Ara shellmiddens) and two Recent Times sites (Wakuta Kiln and Kiyuna sites) are grouped with modern East Asian domestic pigs. Sus scrofa specimens from Shimizu shellmidden (Yayoi-Heian Period) were very closely related to modern Sus scrofa riukiuanus but had a unique nucleotide insertion, indicating that the population is genetically distinct from the lineage of modern Ryukyu wild boars. This genetic evidence suggests that domestic pigs from the Asian continent were introduced to the Okinawa islands in the early Yayoi-Heian period (1700-2000 BP), or earlier.
Molecular mechanisms of retroviral integration site selection

PubMed Central

Kvaratskhelia, Mamuka; Sharma, Amit; Larue, Ross C.; Serrao, Erik; Engelman, Alan

2014-01-01

Retroviral replication proceeds through an obligate integrated DNA provirus, making retroviral vectors attractive vehicles for human gene-therapy. Though most of the host cell genome is available for integration, the process of integration site selection is not random. Retroviruses differ in their choice of chromatin-associated features and also prefer particular nucleotide sequences at the point of insertion. Lentiviruses including HIV-1 preferentially integrate within the bodies of active genes, whereas the prototypical gammaretrovirus Moloney murine leukemia virus (MoMLV) favors strong enhancers and active gene promoter regions. Integration is catalyzed by the viral integrase protein, and recent research has demonstrated that HIV-1 and MoMLV targeting preferences are in large part guided by integrase-interacting host factors (LEDGF/p75 for HIV-1 and BET proteins for MoMLV) that tether viral intasomes to chromatin. In each case, the selectivity of epigenetic marks on histones recognized by the protein tether helps to determine the integration distribution. In contrast, nucleotide preferences at integration sites seem to be governed by the ability for the integrase protein to locally bend the DNA duplex for pairwise insertion of the viral DNA ends. We discuss approaches to alter integration site selection that could potentially improve the safety of retroviral vectors in the clinic. PMID:25147212
How Hot Are Drosophila Hotspots? Examining Recombination Rate Variation and Associations with Nucleotide Diversity, Divergence, and Maternal Age in Drosophila pseudoobscura

PubMed Central

Manzano-Winkler, Brenda; McGaugh, Suzanne E.; Noor, Mohamed A. F.

2013-01-01

Fine scale meiotic recombination maps have uncovered a large amount of variation in crossover rate across the genomes of many species, and such variation in mammalian and yeast genomes is concentrated to <5kb regions of highly elevated recombination rates (10–100x the background rate) called “hotspots.” Drosophila exhibit substantial recombination rate heterogeneity across their genome, but evidence for these highly-localized hotspots is lacking. We assayed recombination across a 40Kb region of Drosophila pseudoobscura chromosome 2, with one 20kb interval assayed every 5Kb and the adjacent 20kb interval bisected into 10kb pieces. We found that recombination events across the 40kb stretch were relatively evenly distributed across each of the 5kb and 10kb intervals, rather than concentrated in a single 5kb region. This, in combination with other recent work, indicates that the recombination landscape of Drosophila may differ from the punctate recombination pattern observed in many mammals and yeast. Additionally, we found no correlation of average pairwise nucleotide diversity and divergence with recombination rate across the 20kb intervals, nor any effect of maternal age in weeks on recombination rate in our sample. PMID:23967224
A combination of PhP typing and β-d-glucuronidase gene sequence variation analysis for differentiation of Escherichia coli from humans and animals.

PubMed

Masters, N; Christie, M; Katouli, M; Stratton, H

2015-06-01

We investigated the usefulness of the β-d-glucuronidase gene variance in Escherichia coli as a microbial source tracking tool using a novel algorithm for comparison of sequences from a prescreened set of host-specific isolates using a high-resolution PhP typing method. A total of 65 common biochemical phenotypes belonging to 318 E. coli strains isolated from humans and domestic and wild animals were analysed for nucleotide variations at 10 loci along a 518 bp fragment of the 1812 bp β-d-glucuronidase gene. Neighbour-joining analysis of loci variations revealed 86 (76.8%) human isolates and 91.2% of animal isolates were correctly identified. Pairwise hierarchical clustering improved assignment; where 92 (82.1%) human and 204 (99%) animal strains were assigned to their respective cluster. Our data show that initial typing of isolates and selection of common types from different hosts prior to analysis of the β-d-glucuronidase gene sequence improves source identification. We also concluded that numerical profiling of the nucleotide variations can be used as a valuable approach to differentiate human from animal E. coli. This study signifies the usefulness of the β-d-glucuronidase gene as a marker for differentiating human faecal pollution from animal sources.
[Genetic characterization of different populations of Rhopilema esculentum based on the mitochondrial COI sequence.

PubMed

Li, Yu Long; Dong, Jing; Wang, Bin; Li, Yi Ping; Yu, Xu Guang; Fu, Jie; Wang, Wen Bo

2016-07-01

To investigate the genetic characterization and population genetic structure of Rhopilema esculentum, we sequenced the mtDNA COI gene (624 bp) in 56 individuals collected from Liaodong Bay and the Ganghwado Island in the estuarine waters of the Han River. In addition, the homologous sequences of other 15 individuals which were sampled from the Bohai and Yellow seas and Sea of Japan were analyzed. A total of 28 polymorphic nucleotide sites were detected among the 71 individuals, which defined 32 haplotypes. Haplotype diversity levels were high (0.91±0.06-0.94±0.01) in R. esculentum populations, whereas those of nucleotide diversity were moderate to low [(0.60±0.34)%-(0.68±0.40)%]. Compared with several other giant jellyfish species, the variation level of R. esculentum was high. Phylogeographic analysis of the COI region revealed two lineages. The pairwise F ST comparison and hierarchical molecular variance analysis (AMOVA) showed that significant population structure existed throughout the range of R. esculentum. The results of this study indicated that the life-cycle characteristics, together with possible anthropogenic introduction such as stock enhancement and the prevailing ocean currents in this region, were proposed as the main factors that determined the genetic patterns of R. esculentum.
Lotka-Volterra pairwise modeling fails to capture diverse pairwise microbial interactions

PubMed Central

Momeni, Babak; Xie, Li; Shou, Wenying

2017-01-01

Pairwise models are commonly used to describe many-species communities. In these models, an individual receives additive fitness effects from pairwise interactions with each species in the community ('additivity assumption'). All pairwise interactions are typically represented by a single equation where parameters reflect signs and strengths of fitness effects ('universality assumption'). Here, we show that a single equation fails to qualitatively capture diverse pairwise microbial interactions. We build mechanistic reference models for two microbial species engaging in commonly-found chemical-mediated interactions, and attempt to derive pairwise models. Different equations are appropriate depending on whether a mediator is consumable or reusable, whether an interaction is mediated by one or more mediators, and sometimes even on quantitative details of the community (e.g. relative fitness of the two species, initial conditions). Our results, combined with potential violation of the additivity assumption in many-species communities, suggest that pairwise modeling will often fail to predict microbial dynamics. DOI: http://dx.doi.org/10.7554/eLife.25051.001 PMID:28350295

Intransitivity is infrequent and fails to promote annual plant coexistence without pairwise niche differences.

PubMed

Godoy, Oscar; Stouffer, Daniel B; Kraft, Nathan J B; Levine, Jonathan M

2017-05-01

Intransitive competition is often projected to be a widespread mechanism of species coexistence in ecological communities. However, it is unknown how much of the coexistence we observe in nature results from this mechanism when species interactions are also stabilized by pairwise niche differences. We combined field-parameterized models of competition among 18 annual plant species with tools from network theory to quantify the prevalence of intransitive competitive relationships. We then analyzed the predicted outcome of competitive interactions with and without pairwise niche differences. Intransitive competition was found for just 15-19% of the 816 possible triplets, and this mechanism was never sufficient to stabilize the coexistence of the triplet when the pair-wise niche differences between competitors were removed. Of the transitive and intransitive triplets, only four were predicted to coexist and these were more similar in multidimensional trait space defined by 11 functional traits than non-coexisting triplets. Our results argue that intransitive competition may be less frequent than recently posed, and that even when it does operate, pairwise niche differences may be key to possible coexistence. © 2017 by the Ecological Society of America.
Candida phyllophila sp. nov. and Candida vitiphila sp. nov., two novel yeast species from grape phylloplane in Thailand.

PubMed

Limtong, Savitree; Kaewwichian, Rungluk

2013-01-01

Three strains (K59(T), K60 and K70 (T)) representing two novel yeast species were isolated from the external surface of leaves of different wine grape (Vitis vinifera) plants, which were collected from the Kanchanaburi Research Station (N14°07'15.1″ E099°19'05.6″), Wang Dong Sub-district, Mueang District, Kanchanaburi Province, Thailand, by an enrichment technique. The sequences of the D1/D2 domain of the large subunit (LSU) rRNA gene of two strains (K59(T) and K60) were identical and differed from that of strain K70(T). In terms of pairwise sequence similarity of the D1/D2 domain, the closest species to the three strains was Candida asparagi but with 2.3% nucleotide substitutions for strains K59(T) and K60, and 2.1% nucleotide substitutions for strain K70(T). On the basis of morphological, biochemical, physiological and chemotaxonomic characteristics and the sequence analysis of the D1/D2 domain of the large subunit (LSU) rRNA gene, the three strains were assigned to be two novel Candida species. Two strains (K59(T) and K60) were assigned as Candida phyllophila sp. nov. (type strain K59(T)=BCC 42662(T)=NBRC 107776(T)=CBS 12671(T)). Candida vitiphila sp. nov. is proposed for strain K70(T) (=BCC 42663(T)=NBRC 107777(T)=CBS 12672(T)).
Single nucleotide polymorphisms unravel hierarchical divergence and signatures of selection among Alaskan sockeye salmon (Oncorhynchus nerka) populations.

PubMed

Gomez-Uchida, Daniel; Seeb, James E; Smith, Matt J; Habicht, Christopher; Quinn, Thomas P; Seeb, Lisa W

2011-02-18

Disentangling the roles of geography and ecology driving population divergence and distinguishing adaptive from neutral evolution at the molecular level have been common goals among evolutionary and conservation biologists. Using single nucleotide polymorphism (SNP) multilocus genotypes for 31 sockeye salmon (Oncorhynchus nerka) populations from the Kvichak River, Alaska, we assessed the relative roles of geography (discrete boundaries or continuous distance) and ecology (spawning habitat and timing) driving genetic divergence in this species at varying spatial scales within the drainage. We also evaluated two outlier detection methods to characterize candidate SNPs responding to environmental selection, emphasizing which mechanism(s) may maintain the genetic variation of outlier loci. For the entire drainage, Mantel tests suggested a greater role of geographic distance on population divergence than differences in spawn timing when each variable was correlated with pairwise genetic distances. Clustering and hierarchical analyses of molecular variance indicated that the largest genetic differentiation occurred between populations from distinct lakes or subdrainages. Within one population-rich lake, however, Mantel tests suggested a greater role of spawn timing than geographic distance on population divergence when each variable was correlated with pairwise genetic distances. Variable spawn timing among populations was linked to specific spawning habitats as revealed by principal coordinate analyses. We additionally identified two outlier SNPs located in the major histocompatibility complex (MHC) class II that appeared robust to violations of demographic assumptions from an initial pool of eight candidates for selection. First, our results suggest that geography and ecology have influenced genetic divergence between Alaskan sockeye salmon populations in a hierarchical manner depending on the spatial scale. Second, we found consistent evidence for diversifying selection in two loci located in the MHC class II by means of outlier detection methods; yet, alternative scenarios for the evolution of these loci were also evaluated. Both conclusions argue that historical contingency and contemporary adaptation have likely driven differentiation between Kvichak River sockeye salmon populations, as revealed by a suite of SNPs. Our findings highlight the need for conservation of complex population structure, because it provides resilience in the face of environmental change, both natural and anthropogenic.
Single nucleotide polymorphisms unravel hierarchical divergence and signatures of selection among Alaskan sockeye salmon (Oncorhynchus nerka) populations

PubMed Central

2011-01-01

Background Disentangling the roles of geography and ecology driving population divergence and distinguishing adaptive from neutral evolution at the molecular level have been common goals among evolutionary and conservation biologists. Using single nucleotide polymorphism (SNP) multilocus genotypes for 31 sockeye salmon (Oncorhynchus nerka) populations from the Kvichak River, Alaska, we assessed the relative roles of geography (discrete boundaries or continuous distance) and ecology (spawning habitat and timing) driving genetic divergence in this species at varying spatial scales within the drainage. We also evaluated two outlier detection methods to characterize candidate SNPs responding to environmental selection, emphasizing which mechanism(s) may maintain the genetic variation of outlier loci. Results For the entire drainage, Mantel tests suggested a greater role of geographic distance on population divergence than differences in spawn timing when each variable was correlated with pairwise genetic distances. Clustering and hierarchical analyses of molecular variance indicated that the largest genetic differentiation occurred between populations from distinct lakes or subdrainages. Within one population-rich lake, however, Mantel tests suggested a greater role of spawn timing than geographic distance on population divergence when each variable was correlated with pairwise genetic distances. Variable spawn timing among populations was linked to specific spawning habitats as revealed by principal coordinate analyses. We additionally identified two outlier SNPs located in the major histocompatibility complex (MHC) class II that appeared robust to violations of demographic assumptions from an initial pool of eight candidates for selection. Conclusions First, our results suggest that geography and ecology have influenced genetic divergence between Alaskan sockeye salmon populations in a hierarchical manner depending on the spatial scale. Second, we found consistent evidence for diversifying selection in two loci located in the MHC class II by means of outlier detection methods; yet, alternative scenarios for the evolution of these loci were also evaluated. Both conclusions argue that historical contingency and contemporary adaptation have likely driven differentiation between Kvichak River sockeye salmon populations, as revealed by a suite of SNPs. Our findings highlight the need for conservation of complex population structure, because it provides resilience in the face of environmental change, both natural and anthropogenic. PMID:21332997
Stability of Tandem Repeats in the Drosophila Melanogaster HSR-Omega Nuclear RNA

PubMed Central

Hogan, N. C.; Slot, F.; Traverse, K. L.; Garbe, J. C.; Bendena, W. G.; Pardue, M. L.

1995-01-01

The Drosophila melanogaster Hsr-omega locus produces a nuclear RNA containing >5 kb of tandem repeat sequences. These repeats are unique to Hsr-omega and show concerted evolution similar to that seen with classical satellite DNAs. In D. melanogaster the monomer is ~280 bp. Sequences of 191/2 monomers differ by 8 +/- 5% (mean +/- SD), when all pairwise comparisons are considered. Differences are single nucleotide substitutions and 1-3 nucleotide deletions/insertions. Changes appear to be randomly distributed over the repeat unit. Outer repeats do not show the decrease in monomer homogeneity that might be expected if homogeneity is maintained by recombination. However, just outside the last complete repeat at each end, there are a few fragments of sequence similar to the monomer. The sequences in these flanking regions are not those predicted for sequences decaying in the absence of recombination. Instead, the fragmentation of the sequence homology suggests that flanking regions have undergone more severe disruptions, possibly during an insertion or amplification event. Hsr-omega alleles differing in the number of repeats are detected and appear to be stable over a few thousand generations; however, both increases and decreases in repeat numbers have been observed. The new alleles appear to be as stable as their predecessors. No alleles of less than ~5 kb nor more than ~16 kb of repeats were seen in any stocks examined. The evidence that there is a limit on the minimum number of repeats is consistent with the suggestion that these repeats are important in the function of the unusual Hsr-omega nuclear RNA. PMID:7540581
Alignment of RNA molecules: Binding energy and statistical properties of random sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Valba, O. V., E-mail: valbaolga@gmail.com; Nechaev, S. K., E-mail: sergei.nechaev@gmail.com; Tamm, M. V., E-mail: thumm.m@gmail.com

2012-02-15

A new statistical approach to the problem of pairwise alignment of RNA sequences is proposed. The problem is analyzed for a pair of interacting polymers forming an RNA-like hierarchical cloverleaf structures. An alignment is characterized by the numbers of matches, mismatches, and gaps. A weight function is assigned to each alignment; this function is interpreted as a free energy taking into account both direct monomer-monomer interactions and a combinatorial contribution due to formation of various cloverleaf secondary structures. The binding free energy is determined for a pair of RNA molecules. Statistical properties are discussed, including fluctuations of the binding energymore » between a pair of RNA molecules and loop length distribution in a complex. Based on an analysis of the free energy per nucleotide pair complexes of random RNAs as a function of the number of nucleotide types c, a hypothesis is put forward about the exclusivity of the alphabet c = 4 used by nature.« less
Novel tetra-nucleotide microsatellite DNA markers for assessing the evolutionary genetics and demographics of Northern Snakehead (Channa argus) invading North America

USGS Publications Warehouse

King, Timothy L.; Johnson, Robin L.

2011-01-01

We document the isolation and characterization of 19 tetra-nucleotide microsatellite DNA markers in northern snakehead (Channa argus) fish that recently colonized Meadow Lake, New York City, New York. These markers displayed moderate levels of allelic diversity (averaging 6.8 alleles/locus) and heterozygosity (averaging 74.2%). Demographic analyses suggested that the Meadow Lake collection has not achieved mutation-drift equilibrium. These results were consistent with instances of deviations from Hardy–Weinberg equilibrium and the presence of some linkage disequilibrium. A comparison of individual pair-wise distances suggested the presence of multiple differentiated groups of related individuals. Results of all analyses are consistent with a pattern of multiple, recent introductions. The microsatellite markers developed for C. argus yielded sufficient genetic diversity to potentially: (1) delineate kinship; (2) elucidate fine-scale population structure; (3) define management (eradication) units; (4) estimate dispersal rates; (5) estimate population sizes; and (6) provide unique demographic perspectives of control or eradication effectiveness.
Molecular epidemiology of Epizootic haematopoietic necrosis virus (EHNV).

PubMed

Hick, Paul M; Subramaniam, Kuttichantran; Thompson, Patrick M; Waltzek, Thomas B; Becker, Joy A; Whittington, Richard J

2017-11-01

Low genetic diversity of Epizootic haematopoietic necrosis virus (EHNV) was determined for the complete genome of 16 isolates spanning the natural range of hosts, geography and time since the first outbreaks of disease. Genomes ranged from 125,591-127,487 nucleotides with 97.47% pairwise identity and 106-109 genes. All isolates shared 101 core genes with 121 potential genes predicted within the pan-genome of this collection. There was high conservation within 90,181 nucleotides of the core genes with isolates separated by average genetic distance of 3.43 × 10 -4 substitutions per site. Evolutionary analysis of the core genome strongly supported historical epidemiological evidence of iatrogenic spread of EHNV to naïve hosts and establishment of endemic status in discrete ecological niches. There was no evidence of structural genome reorganization, however, the complement of non-core genes and variation in repeat elements enabled fine scale molecular epidemiological investigation of this unpredictable pathogen of fish. Copyright © 2017 Elsevier Inc. All rights reserved.
Characterization of Foodborne Outbreaks of Salmonella enterica Serovar Enteritidis with Whole-Genome Sequencing Single Nucleotide Polymorphism-Based Analysis for Surveillance and Outbreak Detection.

PubMed

Taylor, Angela J; Lappi, Victoria; Wolfgang, William J; Lapierre, Pascal; Palumbo, Michael J; Medus, Carlota; Boxrud, David

2015-10-01

Salmonella enterica serovar Enteritidis is a significant cause of gastrointestinal illness in the United States; however, current molecular subtyping methods lack resolution for this highly clonal serovar. Advances in next-generation sequencing technologies have made it possible to examine whole-genome sequencing (WGS) as a potential molecular subtyping tool for outbreak detection and source trace back. Here, we conducted a retrospective analysis of S. Enteritidis isolates from seven epidemiologically confirmed foodborne outbreaks and sporadic isolates (not epidemiologically linked) to determine the utility of WGS to identify outbreaks. A collection of 55 epidemiologically characterized clinical and environmental S. Enteritidis isolates were sequenced. Single nucleotide polymorphism (SNP)-based cluster analysis of the S. Enteritidis genomes revealed well supported clades, with less than four-SNP pairwise diversity, that were concordant with epidemiologically defined outbreaks. Sporadic isolates were an average of 42.5 SNPs distant from the outbreak clusters. Isolates collected from the same patient over several weeks differed by only two SNPs. Our findings show that WGS provided greater resolution between outbreak, sporadic, and suspect isolates than the current gold standard subtyping method, pulsed-field gel electrophoresis (PFGE). Furthermore, results could be obtained in a time frame suitable for surveillance activities, supporting the use of WGS as an outbreak detection and characterization method for S. Enteritidis. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Shallow Population Genetic Structures of Thread-sail Filefish (Stephanolepis cirrhifer) Populations from Korean Coastal Waters.

PubMed

Yoon, M; Park, W; Nam, Y K; Kim, D S

2012-02-01

Genetic diversities, population genetic structures and demographic histories of the thread-sail filefish Stephanolepis cirrhifer were investigated by nucleotide sequencing of 336 base pairs of the mitochondrial DNA (mtDNA) control region in 111 individuals collected from six populations in Korean coastal waters. A total of 70 haplotypes were defined by 58 variable nucleotide sites. The neighbor-joining tree of the 70 haplotypes was shallow and did not provide evidence of geographical associations. Expansion of S. cirrhifer populations began approximate 51,000 to 102,000 years before present, correlating with the period of sea level rise since the late Pleistocene glacial maximum. High levels of haplotype diversities (0.974±0.029 to 1.000±0.076) and nucleotide diversities (0.014 to 0.019), and low levels of genetic differentiation among populations inferred from pairwise population F ST values (-0.007 to 0.107), support an expansion of the S. cirrhifer population. Hierarchical analysis of molecular variance (AMOVA) revealed weak but significant genetic structures among three groups (F CT = 0.028, p<0.05), and no genetic variation within groups (0.53%; F SC = 0.005, p = 0.23). These results may help establish appropriate fishery management strategies for stocks of S. cirrhifer and related species.
Shallow Population Genetic Structures of Thread-sail Filefish (Stephanolepis cirrhifer) Populations from Korean Coastal Waters

PubMed Central

Yoon, M.; Park, W.; Nam, Y. K.; Kim, D. S.

2012-01-01

Genetic diversities, population genetic structures and demographic histories of the thread-sail filefish Stephanolepis cirrhifer were investigated by nucleotide sequencing of 336 base pairs of the mitochondrial DNA (mtDNA) control region in 111 individuals collected from six populations in Korean coastal waters. A total of 70 haplotypes were defined by 58 variable nucleotide sites. The neighbor-joining tree of the 70 haplotypes was shallow and did not provide evidence of geographical associations. Expansion of S. cirrhifer populations began approximate 51,000 to 102,000 years before present, correlating with the period of sea level rise since the late Pleistocene glacial maximum. High levels of haplotype diversities (0.974±0.029 to 1.000±0.076) and nucleotide diversities (0.014 to 0.019), and low levels of genetic differentiation among populations inferred from pairwise population FST values (−0.007 to 0.107), support an expansion of the S. cirrhifer population. Hierarchical analysis of molecular variance (AMOVA) revealed weak but significant genetic structures among three groups (FCT = 0.028, p<0.05), and no genetic variation within groups (0.53%; FSC = 0.005, p = 0.23). These results may help establish appropriate fishery management strategies for stocks of S. cirrhifer and related species. PMID:25049547
Nucleotide sequence and phylogenetic analysis of Cucurbit yellow stunting disorder virus RNA 2.

PubMed

Livieratos, Ioannis C; Coutts, Robert H A

2002-06-01

The complete nucleotide sequence of Cucurbit yellow stunting disorder virus (CYSDV) RNA 2, a whitefly (Bemisia tabaci)-transmitted closterovirus with a bi-partite genome, is reported. CYSDV RNA 2 is 7,281 nucleotides long and contains the closterovirus hallmark gene array with a similar arrangement to the prototype member of the genus Crinivirus, Lettuce infectious yellows virus (LIYV). CYSDV RNA 2 contains open reading frames (ORFs) potentially encoding in a 5' to 3' direction for proteins of 5 kDa (ORF 1; hydrophobic protein), 62 kDa (ORF 2; heat shock protein 70 homolog, HSP70h), 59 kDa (ORF 3; protein of unknown function), 9 kDa (ORF 4; protein of unknown function), 28.5 kDa (ORF 5; coat protein, CP), 53 kDa (ORF 6; coat protein minor, CPm), and 26.5 kDa (ORF 7; protein of unknown function). Pairwise comparisons of CYSDV RNA 2-encoded proteins (HSP70h, p59 and CPm) among the closteroviruses showed that CYSDV is closely related to LIYV. Phylogenetic analysis based on the amino acid sequence of the HSP70h, indicated that CYSDV clusters with other members of the genus Crinivirus, and it is related to Little cherry virus-1 (LChV-1), but is distinct from the aphid- or mealybug-transmitted closteroviruses.
Haplotype diversity in 11 candidate genes across four populations.

PubMed

Beaty, T H; Fallin, M D; Hetmanski, J B; McIntosh, I; Chong, S S; Ingersoll, R; Sheng, X; Chakraborty, R; Scott, A F

2005-09-01

Analysis of haplotypes based on multiple single-nucleotide polymorphisms (SNP) is becoming common for both candidate gene and fine-mapping studies. Before embarking on studies of haplotypes from genetically distinct populations, however, it is important to consider variation both in linkage disequilibrium (LD) and in haplotype frequencies within and across populations, as both vary. Such diversity will influence the choice of "tagging" SNPs for candidate gene or whole-genome association studies because some markers will not be polymorphic in all samples and some haplotypes will be poorly represented or completely absent. Here we analyze 11 genes, originally chosen as candidate genes for oral clefts, where multiple markers were genotyped on individuals from four populations. Estimated haplotype frequencies, measures of pairwise LD, and genetic diversity were computed for 135 European-Americans, 57 Chinese-Singaporeans, 45 Malay-Singaporeans, and 46 Indian-Singaporeans. Patterns of pairwise LD were compared across these four populations and haplotype frequencies were used to assess genetic variation. Although these populations are fairly similar in allele frequencies and overall patterns of LD, both haplotype frequencies and genetic diversity varied significantly across populations. Such haplotype diversity has implications for designing studies of association involving samples from genetically distinct populations.
Genomic variation among populations of threatened coral: Acropora cervicornis.

PubMed

Drury, C; Dale, K E; Panlilio, J M; Miller, S V; Lirman, D; Larson, E A; Bartels, E; Crawford, D L; Oleksiak, M F

2016-04-13

Acropora cervicornis, a threatened, keystone reef-building coral has undergone severe declines (>90 %) throughout the Caribbean. These declines could reduce genetic variation and thus hamper the species' ability to adapt. Active restoration strategies are a common conservation approach to mitigate species' declines and require genetic data on surviving populations to efficiently respond to declines while maintaining the genetic diversity needed to adapt to changing conditions. To evaluate active restoration strategies for the staghorn coral, the genetic diversity of A. cervicornis within and among populations was assessed in 77 individuals collected from 68 locations along the Florida Reef Tract (FRT) and in the Dominican Republic. Genotyping by Sequencing (GBS) identified 4,764 single nucleotide polymorphisms (SNPs). Pairwise nucleotide differences (π) within a population are large (~37 %) and similar to π across all individuals. This high level of genetic diversity along the FRT is similar to the diversity within a small, isolated reef. Much of the genetic diversity (>90 %) exists within a population, yet GBS analysis shows significant variation along the FRT, including 300 SNPs with significant FST values and significant divergence relative to distance. There are also significant differences in SNP allele frequencies over small spatial scales, exemplified by the large FST values among corals collected within Miami-Dade county. Large standing diversity was found within each population even after recent declines in abundance, including significant, potentially adaptive divergence over short distances. The data here inform conservation and management actions by uncovering population structure and high levels of diversity maintained within coral collections among sites previously shown to have little genetic divergence. More broadly, this approach demonstrates the power of GBS to resolve differences among individuals and identify subtle genetic structure, informing conservation goals with evolutionary implications.
Brief Note :Variability in the cathelicidin 6 (CATHL-6) gene in Tianzhu white yak from Tibetan area in China.

PubMed

E, G X; Na, R S; Zhao, Y J; Chen, L P; Qiu, X Y; Huang, Y F

2015-04-10

Cathelicidins are a major family of antimicrobial peptides (AMPs), an important component of innate immune system, playing a critical role in host defense and disease resistance in virtually all living species. Polymorphism and functional studies on cathelicidin of Tianzhu white yak contribute to understanding the specific innate immune mechanism in animals living at high altitudes in comparison to cattle and domesticated white yak. Thirty-six individuals of Tianzhu white yak, originating from the area of three ecotypes (Gansu in China), were investigated. The total length of the aligned Yak cathelicidin 6 (CATHL-6) sequences was 1923 bp, including six single nucleotide polymorphisms and one indel. Ten haplotypes were identified, and phylogenetic analyses resolved those 10 haplotypes in two clusters. The results indicate that the white yak originated from two domestication sites. In addition, lack of significant pairwise difference between sequences (Tajima's D = 0.92865, P > 0.10) in the CATHL-6 region indicates absence of population size expansion in current white yak population.
Phylogeographical structure in mitochondrial DNA of eggplant fruit and shoot borer, Leucinodes orbonalis Guenée (Lepidoptera: Crambidae) in South and Southeast Asia.

PubMed

Chang, Jian-Cheng; Ponnath, Daniel W; Ramasamy, Srinivasan

2016-01-01

Leucinodes orbonalis is the most detrimental South and Southeast Asian insect pest of eggplant. To help reduce the impact of this pest, population genetic diversity and structure of L. orbonalis were examined in eight populations from six countries using mitochondrial cytochrome c oxidase subunit I DNA sequences. No correlation between genetic diversity and geographic distance was detected among populations. Low levels of haplotype and nucleotide diversities were observed in the Philippines population, suggesting recent colonization. No significant gene flow was found among local populations in different countries. The Vietnam population is highly differentiated, indicated by significant pairwise FST values, and may be ascribed to a new subspecies or race. India was confirmed to be the source of genetic variation in L. orbonalis populations. Our study showed that L. orbonalis formed subpopulations for each local region, and the corresponding pest management technology should be developed at the country scale.
SVM-dependent pairwise HMM: an application to protein pairwise alignments.

PubMed

Orlando, Gabriele; Raimondi, Daniele; Khan, Taushif; Lenaerts, Tom; Vranken, Wim F

2017-12-15

Methods able to provide reliable protein alignments are crucial for many bioinformatics applications. In the last years many different algorithms have been developed and various kinds of information, from sequence conservation to secondary structure, have been used to improve the alignment performances. This is especially relevant for proteins with highly divergent sequences. However, recent works suggest that different features may have different importance in diverse protein classes and it would be an advantage to have more customizable approaches, capable to deal with different alignment definitions. Here we present Rigapollo, a highly flexible pairwise alignment method based on a pairwise HMM-SVM that can use any type of information to build alignments. Rigapollo lets the user decide the optimal features to align their protein class of interest. It outperforms current state of the art methods on two well-known benchmark datasets when aligning highly divergent sequences. A Python implementation of the algorithm is available at http://ibsquare.be/rigapollo. wim.vranken@vub.be. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Automatic Camera Calibration Using Multiple Sets of Pairwise Correspondences.

PubMed

Vasconcelos, Francisco; Barreto, Joao P; Boyer, Edmond

2018-04-01

We propose a new method to add an uncalibrated node into a network of calibrated cameras using only pairwise point correspondences. While previous methods perform this task using triple correspondences, these are often difficult to establish when there is limited overlap between different views. In such challenging cases we must rely on pairwise correspondences and our solution becomes more advantageous. Our method includes an 11-point minimal solution for the intrinsic and extrinsic calibration of a camera from pairwise correspondences with other two calibrated cameras, and a new inlier selection framework that extends the traditional RANSAC family of algorithms to sampling across multiple datasets. Our method is validated on different application scenarios where a lack of triple correspondences might occur: addition of a new node to a camera network; calibration and motion estimation of a moving camera inside a camera network; and addition of views with limited overlap to a Structure-from-Motion model.
A comparison of somatic mutational spectra in healthy study populations from Russia, Sweden and USA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Noori, P; Hou, S; Jones, I M

Comparison of mutation spectra at the hypoxanthine-phosphoribosyl transferase (HPRT) gene of peripheral blood T lymphocytes may provide insight into the aetiology of somatic mutation contributing to carcinogenesis and other diseases. To increase knowledge of mutation spectra in healthy people, we have analyzed HPRT mutant T-cells of 50 healthy Russians originally recruited as controls for a study of Chernobyl clean-up workers (Jones et al. Radiation Res. 158, 2002, 424). Reverse transcriptase polymerase chain reactions and DNA sequencing identified 161 independent mutations among 176 thioguanine resistant mutants. Forty (40) mutations affected splicing mechanisms and 27 deletions or insertions of 1 to 60more » nucleotides were identified. Ninety four (94) single base substitutions were identified, including 62 different mutations at 55 different nucleotide positions, of which 19 had not previously been reported in human T-cells. Comparison of this base substitution spectrum with mutation spectra in a USA (Burkhart-Schultz et al. Carcinogenesis 17, 1996, 1871) and two Swedish populations (Podlutsky et al, Carcinogenesis 19, 1998, 557, Podlutsky et al. Mutation Res. 431, 1999, 325) revealed similarity in the type, frequency and distribution of mutations in the four spectra, consistent with aetiologies inherent in human metabolism. There were 15-19 identical mutations in the three pair-wise comparisons of Russian with USA and Swedish spectra. Intriguingly, there were 21 mutations unique to the Russian spectrum, and comparison by the Monte Carlo method of Adams and Skopek (J. Mol. Biol. 194, 1987, 391) indicated that the Russian spectrum was different from both Swedish spectra (P=0.007, 0.002) but not different from the USA spectrum (P=0.07), when Bonferroni correction for multiple comparisons was made (p < 0.008 required for significance). Age and smoking did not account for these differences. Other factors causing mutational differences need to be explored.« less
Characterization of Adelphocoris suturalis (Hemiptera: Miridae) Transcriptome from Different Developmental Stages

NASA Astrophysics Data System (ADS)

Tian, Caihong; Tek Tay, Wee; Feng, Hongqiang; Wang, Ying; Hu, Yongmin; Li, Guoping

2015-06-01

Adelphocoris suturalis is one of the most serious pest insects of Bt cotton in China, however its molecular genetics, biochemistry and physiology are poorly understood. We used high throughput sequencing platform to perform de novo transcriptome assembly and gene expression analyses across different developmental stages (eggs, 2nd and 5th instar nymphs, female and male adults). We obtained 20 GB of clean data and revealed 88,614 unigenes, including 23,830 clusters and 64,784 singletons. These unigene sequences were annotated and classified by Gene Ontology, Clusters of Orthologous Groups, and Kyoto Encyclopedia of Genes and Genomes databases. A large number of differentially expressed genes were discovered through pairwise comparisons between these developmental stages. Gene expression profiles were dramatically different between life stage transitions, with some of these most differentially expressed genes being associated with sex difference, metabolism and development. Quantitative real-time PCR results confirm deep-sequencing findings based on relative expression levels of nine randomly selected genes. Furthermore, over 791,390 single nucleotide polymorphisms and 2,682 potential simple sequence repeats were identified. Our study provided comprehensive transcriptional gene expression information for A. suturalis that will form the basis to better understanding of development pathways, hormone biosynthesis, sex differences and wing formation in mirid bugs.

Characterization of Adelphocoris suturalis (Hemiptera: Miridae) Transcriptome from Different Developmental Stages

PubMed Central

Tian, Caihong; Tek Tay, Wee; Feng, Hongqiang; Wang, Ying; Hu, Yongmin; Li, Guoping

2015-01-01

Adelphocoris suturalis is one of the most serious pest insects of Bt cotton in China, however its molecular genetics, biochemistry and physiology are poorly understood. We used high throughput sequencing platform to perform de novo transcriptome assembly and gene expression analyses across different developmental stages (eggs, 2nd and 5th instar nymphs, female and male adults). We obtained 20 GB of clean data and revealed 88,614 unigenes, including 23,830 clusters and 64,784 singletons. These unigene sequences were annotated and classified by Gene Ontology, Clusters of Orthologous Groups, and Kyoto Encyclopedia of Genes and Genomes databases. A large number of differentially expressed genes were discovered through pairwise comparisons between these developmental stages. Gene expression profiles were dramatically different between life stage transitions, with some of these most differentially expressed genes being associated with sex difference, metabolism and development. Quantitative real-time PCR results confirm deep-sequencing findings based on relative expression levels of nine randomly selected genes. Furthermore, over 791,390 single nucleotide polymorphisms and 2,682 potential simple sequence repeats were identified. Our study provided comprehensive transcriptional gene expression information for A. suturalis that will form the basis to better understanding of development pathways, hormone biosynthesis, sex differences and wing formation in mirid bugs. PMID:26047353
Sequence determination and analysis of the NSs genes of two tospoviruses.

PubMed

Hallwass, Mariana; Leastro, Mikhail O; Lima, Mirtes F; Inoue-Nagata, Alice K; Resende, Renato O

2012-03-01

The tospoviruses groundnut ringspot virus (GRSV) and zucchini lethal chlorosis virus (ZLCV) cause severe losses in many crops, especially in solanaceous and cucurbit species. In this study, the non-structural NSs gene and the 5'UTRs of these two biologically distinct tospoviruses were cloned and sequenced. The NSs sequence of GRSV and ZLCV were both 1,404 nucleotides long. Pairwise comparison showed that the NSs amino acid sequence of GRSV shared 69.6% identity with that of ZLCV and 75.9% identity with that of TSWV, while the NSs sequence of ZLCV and TSWV shared 67.9% identity. Phylogenetic analysis based on NSs sequences confirmed that these viruses cluster in the American clade.
Host switch during evolution of a genetically distinct hantavirus in the American shrew mole (Neurotrichus gibbsii)

PubMed Central

Kang, Hae Ji; Bennett, Shannon N.; Dizney, Laurie; Sumibcay, Laarni; Arai, Satoru; Ruedas, Luis A.; Song, Jin-Won; Yanagihara, Richard

2009-01-01

A genetically distinct hantavirus, designated Oxbow virus (OXBV), was detected in tissues of an American shrew mole (Neurotrichus gibbsii), captured in Gresham, Oregon, in September 2003. Pairwise analysis of full-length S- and M- and partial L-segment nucleotide and amino acid sequences of OXBV indicated low sequence similarity with rodent-borne hantaviruses. Phylogenetic analyses using maximum-likelihood and Bayesian methods, and host-parasite evolutionary comparisons, showed that OXBV and Asama virus, a hantavirus recently identified from the Japanese shrew mole (Urotrichus talpoides), were related to soricine shrew-borne hantaviruses from North America and Eurasia, respectively, suggesting parallel evolution associated with cross-species transmission. PMID:19394994
Divergent ancestral lineages of newfound hantaviruses harbored by phylogenetically related crocidurine shrew species in Korea

PubMed Central

Arai, Satoru; Gu, Se Hun; Baek, Luck Ju; Tabara, Kenji; Bennett, Shannon; Oh, Hong-Shik; Takada, Nobuhiro; Kang, Hae Ji; Tanaka-Taya, Keiko; Morikawa, Shigeru; Okabe, Nobuhiko; Yanagihara, Richard; Song, Jin-Won

2012-01-01

Spurred by the recent isolation of a novel hantavirus, named Imjin virus (MJNV), from the Ussuri white-toothed shrew (Crocidura lasiura), targeted trapping was conducted for the phylogenetically related Asian lesser white-toothed shrew (Crocidura shantungensis). Pair-wise alignment and comparison of the S, M and L segments of a newfound hantavirus, designated Jeju virus (JJUV), indicated remarkably low nucleotide and amino acid sequence similarity with MJNV. Phylogenetic analyses, using maximum likelihood and Bayesian methods, showed divergent ancestral lineages for JJUV and MJNV, despite the close phylogenetic relationship of their reservoir soricid hosts. Also, no evidence of host switching was apparent in tanglegrams, generated by TreeMap 2.0β. PMID:22230701
Historical DNA reveals the demographic history of Atlantic cod (Gadus morhua) in medieval and early modern Iceland

PubMed Central

Ólafsdóttir, Guðbjörg Ásta; Westfall, Kristen M.; Edvardsson, Ragnar; Pálsson, Snæbjörn

2014-01-01

Atlantic cod (Gadus morhua) vertebrae from archaeological sites were used to study the history of the Icelandic Atlantic cod population in the time period of 1500–1990. Specifically, we used coalescence modelling to estimate population size and fluctuations from the sequence diversity at the cytochrome b (cytb) and Pantophysin I (PanI) loci. The models are consistent with an expanding population during the warm medieval period, large historical effective population size (NE), a marked bottleneck event at 1400–1500 and a decrease in NE in early modern times. The model results are corroborated by the reduction of haplotype and nucleotide variation over time and pairwise population distance as a significant portion of nucleotide variation partitioned across the 1550 time mark. The mean age of the historical fished stock is high in medieval times with a truncation in age in early modern times. The population size crash coincides with a period of known cooling in the North Atlantic, and we conclude that the collapse may be related to climate or climate-induced ecosystem change. PMID:24403343
Historical DNA reveals the demographic history of Atlantic cod (Gadus morhua) in medieval and early modern Iceland.

PubMed

Ólafsdóttir, Guðbjörg Ásta; Westfall, Kristen M; Edvardsson, Ragnar; Pálsson, Snæbjörn

2014-02-22

Atlantic cod (Gadus morhua) vertebrae from archaeological sites were used to study the history of the Icelandic Atlantic cod population in the time period of 1500-1990. Specifically, we used coalescence modelling to estimate population size and fluctuations from the sequence diversity at the cytochrome b (cytb) and Pantophysin I (PanI) loci. The models are consistent with an expanding population during the warm medieval period, large historical effective population size (NE), a marked bottleneck event at 1400-1500 and a decrease in NE in early modern times. The model results are corroborated by the reduction of haplotype and nucleotide variation over time and pairwise population distance as a significant portion of nucleotide variation partitioned across the 1550 time mark. The mean age of the historical fished stock is high in medieval times with a truncation in age in early modern times. The population size crash coincides with a period of known cooling in the North Atlantic, and we conclude that the collapse may be related to climate or climate-induced ecosystem change.
iPARTS2: an improved tool for pairwise alignment of RNA tertiary structures, version 2.

PubMed

Yang, Chung-Han; Shih, Cheng-Ting; Chen, Kun-Tze; Lee, Po-Han; Tsai, Ping-Han; Lin, Jian-Cheng; Yen, Ching-Yu; Lin, Tiao-Yin; Lu, Chin Lung

2016-07-08

Since its first release in 2010, iPARTS has become a valuable tool for globally or locally aligning two RNA 3D structures. It was implemented by a structural alphabet (SA)-based approach, which uses an SA of 23 letters to reduce RNA 3D structures into 1D sequences of SA letters and applies traditional sequence alignment to these SA-encoded sequences for determining their global or local similarity. In this version, we have re-implemented iPARTS into a new web server iPARTS2 by constructing a totally new SA, which consists of 92 elements with each carrying both information of base and backbone geometry for a representative nucleotide. This SA is significantly different from the one used in iPARTS, because the latter consists of only 23 elements with each carrying only the backbone geometry information of a representative nucleotide. Our experimental results have shown that iPARTS2 outperforms its previous version iPARTS and also achieves better accuracy than other popular tools, such as SARA, SETTER and RASS, in RNA alignment quality and function prediction. iPARTS2 takes as input two RNA 3D structures in the PDB format and outputs their global or local alignments with graphical display. iPARTS2 is now available online at http://genome.cs.nthu.edu.tw/iPARTS2/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Molecular and morphological characterization of the tapeworm Taenia hydatigena (Pallas, 1766) in sheep from Iran.

PubMed

Rostami, S; Salavati, R; Beech, R N; Babaei, Z; Sharbatkhori, M; Baneshi, M R; Hajialilo, E; Shad, H; Harandi, M F

2015-03-01

Although Taenia hydatigena is one of the most prevalent taeniid species of livestock, very little molecular genetic information exists for this parasite. Up to 100 sheep isolates of T. hydatigena were collected from 19 abattoirs located in the provinces of Tehran, Alborz and Kerman. A calibrated microscope was used to measure the larval rostellar hook lengths. Following DNA extraction, fragments of cytochrome c oxidase 1 (CO1) and 12S rRNA genes were amplified by the polymerase chain reaction method and the amplicons were subjected to sequencing. The mean total length of large and small hooks was 203.4 μm and 135.9 μm, respectively. Forty CO1 and 39 12S rRNA sequence haplotypes were obtained in the study. The levels of pairwise nucleotide variation between individual haplotypes of CO1 and 12S rRNA genes were determined to be between 0.3-3.4% and 0.2-2.1%, respectively. The overall nucleotide variation among all the CO1 haplotypes was 9.7%, and for all the 12S rRNA haplotypes it was 10.1%. A significant difference was observed between rostellar hook morphometry and both CO1 and 12S rRNA sequence variability. A significantly high level of genetic variation was observed in the present study. The results showed that the 12S rRNA gene is more variable than CO1.
Genetic diversity analysis of the oriental river prawn (Macrobrachium nipponense) in Huaihe River.

PubMed

Cui, Feng; Yu, Yanyan; Bao, Fangyin; Wang, Song; Xiao, Ming Song

2018-04-19

The oriental river prawn (Macrobrachium nipponense) is an economically and nutritionally important species of decapod crustaceans in China. Genetic structure and demographic history of Macrobrachium nipponense were examined using sequence data from portions of the mitochondrial DNA cytochrome oxidase subunit I (COI) gene. Samples of 191 individuals were collected from 10 localities in the upper to middle reaches of the Huaihe River. Variability was detected at a total of 42 nucleotide sites along 684 bp length of homologous sequence (6.14%), and base substitutions occurred mostly at the second codon position. Haplotype diversity (h) and nucleotide diversity (π) of all populations were 0.9136 ± 0.0116 and 0.0078 ± 0.0042, respectively. Phylogenetic tree constructed using the maximum-likelihood (ML) method showed that the 44 haplotypes were assigned to two obvious clades associated with geographic regions. Moreover, the median-joining network was similar to the topology of the phylogenetic tree with 44 haplotypes. The pairwise F ST values between the populations varied from -0.0298 to 0.2994. Generally, moderate genetic differentiation (F ST = 0.1598, p = .0000) among different geographic populations was detected, with the significant differentiation between the Huaibin (HB) and other Macrobrachium nipponense populations. Both mismatch distribution analyses and neutrality tests suggested the early stage of Late Pleistocene population expansion 85,500 years before present for the species, which was consistent with the palaeoclimatic condition of the Huaihe River Basin.
Building-up of a DNA barcode library for true bugs (insecta: hemiptera: heteroptera) of Germany reveals taxonomic uncertainties and surprises.

PubMed

Raupach, Michael J; Hendrich, Lars; Küchler, Stefan M; Deister, Fabian; Morinière, Jérome; Gossner, Martin M

2014-01-01

During the last few years, DNA barcoding has become an efficient method for the identification of species. In the case of insects, most published DNA barcoding studies focus on species of the Ephemeroptera, Trichoptera, Hymenoptera and especially Lepidoptera. In this study we test the efficiency of DNA barcoding for true bugs (Hemiptera: Heteroptera), an ecological and economical highly important as well as morphologically diverse insect taxon. As part of our study we analyzed DNA barcodes for 1742 specimens of 457 species, comprising 39 families of the Heteroptera. We found low nucleotide distances with a minimum pairwise K2P distance <2.2% within 21 species pairs (39 species). For ten of these species pairs (18 species), minimum pairwise distances were zero. In contrast to this, deep intraspecific sequence divergences with maximum pairwise distances >2.2% were detected for 16 traditionally recognized and valid species. With a successful identification rate of 91.5% (418 species) our study emphasizes the use of DNA barcodes for the identification of true bugs and represents an important step in building-up a comprehensive barcode library for true bugs in Germany and Central Europe as well. Our study also highlights the urgent necessity of taxonomic revisions for various taxa of the Heteroptera, with a special focus on various species of the Miridae. In this context we found evidence for on-going hybridization events within various taxonomically challenging genera (e.g. Nabis Latreille, 1802 (Nabidae), Lygus Hahn, 1833 (Miridae), Phytocoris Fallén, 1814 (Miridae)) as well as the putative existence of cryptic species (e.g. Aneurus avenius (Duffour, 1833) (Aradidae) or Orius niger (Wolff, 1811) (Anthocoridae)).
Building-Up of a DNA Barcode Library for True Bugs (Insecta: Hemiptera: Heteroptera) of Germany Reveals Taxonomic Uncertainties and Surprises

PubMed Central

Raupach, Michael J.; Hendrich, Lars; Küchler, Stefan M.; Deister, Fabian; Morinière, Jérome; Gossner, Martin M.

2014-01-01

During the last few years, DNA barcoding has become an efficient method for the identification of species. In the case of insects, most published DNA barcoding studies focus on species of the Ephemeroptera, Trichoptera, Hymenoptera and especially Lepidoptera. In this study we test the efficiency of DNA barcoding for true bugs (Hemiptera: Heteroptera), an ecological and economical highly important as well as morphologically diverse insect taxon. As part of our study we analyzed DNA barcodes for 1742 specimens of 457 species, comprising 39 families of the Heteroptera. We found low nucleotide distances with a minimum pairwise K2P distance <2.2% within 21 species pairs (39 species). For ten of these species pairs (18 species), minimum pairwise distances were zero. In contrast to this, deep intraspecific sequence divergences with maximum pairwise distances >2.2% were detected for 16 traditionally recognized and valid species. With a successful identification rate of 91.5% (418 species) our study emphasizes the use of DNA barcodes for the identification of true bugs and represents an important step in building-up a comprehensive barcode library for true bugs in Germany and Central Europe as well. Our study also highlights the urgent necessity of taxonomic revisions for various taxa of the Heteroptera, with a special focus on various species of the Miridae. In this context we found evidence for on-going hybridization events within various taxonomically challenging genera (e.g. Nabis Latreille, 1802 (Nabidae), Lygus Hahn, 1833 (Miridae), Phytocoris Fallén, 1814 (Miridae)) as well as the putative existence of cryptic species (e.g. Aneurus avenius (Duffour, 1833) (Aradidae) or Orius niger (Wolff, 1811) (Anthocoridae)). PMID:25203616
Methanosarcina acetivorans 16S rRNA and transcription factor nucleotide fluctuation with implications in exobiology and pathology

NASA Astrophysics Data System (ADS)

Holden, Todd; Tremberger, G., Jr.; Cheung, E.; Subramaniam, R.; Sullivan, R.; Schneider, P.; Flamholz, A.; Marchese, P.; Hiciano, O.; Yao, H.; Lieberman, D.; Cheung, T.

2008-08-01

Cultures of the methane-producing archaea Methanosarcina, have recently been isolated from Alaskan sediments. It has been proposed that methanogens are strong candidates for exobiological life in extreme conditions. The spatial environmental gradients, such as those associated with the polygons on Mars' surface, could have been produced by past methanogenesis activity. The 16S rRNA gene has been used routinely to classify phenotypes. Using the fractal dimension of nucleotide fluctuation, a comparative study of the 16S rRNA nucleotide fluctuation in Methanosarcina acetivorans C2A, Deinococcus radiodurans, and E. coli was conducted. The results suggest that Methanosarcina acetivorans has the lowest fractal dimension, consistent with its ancestral position in evolution. Variation in fluctuation complexity was also detected in the transcription factors. The transcription factor B (TFB) was found to have a higher fractal dimension as compared to transcription factor E (TFE), consistent with the fact that a single TFB in Methanosarcina acetivorans can code three different TATA box proteins. The average nucleotide pair-wise free energy of the DNA repair genes was found to be highest for Methanosarcina acetivorans, suggesting a relatively weak bonding, which is consistent with its low prevalence in pathology. Multitasking capacity comparison of type-I and type-II topoisomerases has been shown to correlate with fractal dimension using the methicillin-resistant strain MRSA 252. The analysis suggests that gene adaptation in a changing chemical environment can be measured in terms of bioinformatics. Given that the radiation resistant Deinococcus radiodurans is a strong candidate for an extraterrestrial origin and that the cold temperature Psychrobacter cryohalolentis K5 can function in Siberian permafrost, the fractal dimension comparison in this study suggests that a chemical resistant methanogen could exist in extremely cold conditions (such as that which existed on early Mars) where demands on gene activity are low. In addition, the comparative study of the Methanococcoides burtonii cold shock domain sequence has provided further support for the correlation between multitasking capacity and fractal dimension.
Macrobenthic assemblages of the Changjiang River estuary (Yangtze River, China) and adjacent continental shelf relative to mild summer hypoxia

NASA Astrophysics Data System (ADS)

Liao, Yibo; Shou, Lu; Tang, Yanbin; Zeng, Jiangning; Gao, Aigen; Chen, Quanzhen; Yan, Xiaojun

2017-05-01

To assess the effects of hypoxia, macrobenthic communities along an estuarine gradient of the Changjiang estuary and adjacent continental shelf were analyzed. This revealed spatial variations in the communities and relationships with environmental variables during periods of reduced dissolved oxygen (DO) concentration in summer. Statistical analyses revealed significant differences in macrobenthic community composition among the three zones: estuarine zone (EZ), mildly hypoxic zone (MHZ) in the continental shelf, and normoxic zone (NZ) in the continental shelf (Global R =0.206, P =0.002). Pairwise tests showed that the macrobenthic community composition of the EZ was significantly different from the MHZ (pairwise test R =0.305, P =0.001) and the NZ (pairwise test R =0.259, P =0.001). There was no significant difference in macrobenthic communities between the MHZ and the NZ (pairwise test R =0.062, P =0.114). The taxa included small and typically opportunistic polychaetes, which made the greatest contribution to the dissimilarity between the zones. The effects of mild hypoxia on the macrobenthic communities are a result not only of reduced DO concentration but also of differences in environmental variables such as temperature, salinity, and nutrient concentrations caused by stratification.
Mitochondrial DNA markers reveal high genetic diversity and strong genetic differentiation in populations of Dendrolimus kikuchii Matsumura (Lepidoptera: Lasiocampidae).

PubMed

Men, Qiulei; Xue, Guoxi; Mu, Dan; Hu, Qingling; Huang, Minyi

2017-01-01

Dendrolimus kikuchii Matsumura, 1927 is a serious forest pest causing great damage to coniferous trees in China. Despite its economic importance, the population genetics of this pest are poorly known. We used three mitochondrial genes (COI, COII and Cytb) to investigate the genetic diversity and genetic differentiation of 15 populations collected from the main distribution regions of D. kikuchii in China. Populations show high haplotype and nucleotide diversity. Haplotype network and phylogenetic analysis divides the populations into three major clades, the central and southeastern China (CC+SEC) clade, the eastern China (EC) clade, and the southwestern China (SWC) clade. Populations collected from adjacent localities share the same clade, which is consistent with the strong relationship of isolation by distance (r = 0.74824, P = 0.00001). AMOVA analysis indicated that the major portion of this molecular genetic variation is found among the three groups of CC+SEC, EC and SWC (61.26%). Of 105 pairwise FST comparisons, 93 show high genetic differentiation. Populations of Puer (PE), Yangshuo (YS) and Leishan (LS) are separated from other populations by a larger genetic distance. Distributions of pairwise differences obtained with single and combined gene data from the overall populations are multimodal, suggesting these populations had no prior population expansion in southern China. The nonsignificant neutral test on the basis of Tajima' D and Fu's Fs, and the lack of a star-shaped haplotype network together with the multiple haplotypes support this hypothesis. Pleistocene climatic fluctuations, combined with the host specificity to Pinus species, made these regions of south China into a refuge for D. kikuchii. The high level of population genetic structuring is related to their weak flight capacity, their variations of life history and the geographic distance among populations.
Dynamics of prebiotic RNA reproduction illuminated by chemical game theory

PubMed Central

Yeates, Jessica A. M.; Hilbe, Christian; Zwick, Martin; Nowak, Martin A.; Lehman, Niles

2016-01-01

Many origins-of-life scenarios depict a situation in which there are common and potentially scarce resources needed by molecules that compete for survival and reproduction. The dynamics of RNA assembly in a complex mixture of sequences is a frequency-dependent process and mimics such scenarios. By synthesizing Azoarcus ribozyme genotypes that differ in their single-nucleotide interactions with other genotypes, we can create molecules that interact among each other to reproduce. Pairwise interplays between RNAs involve both cooperation and selfishness, quantifiable in a 2 × 2 payoff matrix. We show that a simple model of differential equations based on chemical kinetics accurately predicts the outcomes of these molecular competitions using simple rate inputs into these matrices. In some cases, we find that mixtures of different RNAs reproduce much better than each RNA type alone, reflecting a molecular form of reciprocal cooperation. We also demonstrate that three RNA genotypes can stably coexist in a rock–paper–scissors analog. Our experiments suggest a new type of evolutionary game dynamics, called prelife game dynamics or chemical game dynamics. These operate without template-directed replication, illustrating how small networks of RNAs could have developed and evolved in an RNA world. PMID:27091972
Dynamics of prebiotic RNA reproduction illuminated by chemical game theory.

PubMed

Yeates, Jessica A M; Hilbe, Christian; Zwick, Martin; Nowak, Martin A; Lehman, Niles

2016-05-03

Many origins-of-life scenarios depict a situation in which there are common and potentially scarce resources needed by molecules that compete for survival and reproduction. The dynamics of RNA assembly in a complex mixture of sequences is a frequency-dependent process and mimics such scenarios. By synthesizing Azoarcus ribozyme genotypes that differ in their single-nucleotide interactions with other genotypes, we can create molecules that interact among each other to reproduce. Pairwise interplays between RNAs involve both cooperation and selfishness, quantifiable in a 2 × 2 payoff matrix. We show that a simple model of differential equations based on chemical kinetics accurately predicts the outcomes of these molecular competitions using simple rate inputs into these matrices. In some cases, we find that mixtures of different RNAs reproduce much better than each RNA type alone, reflecting a molecular form of reciprocal cooperation. We also demonstrate that three RNA genotypes can stably coexist in a rock-paper-scissors analog. Our experiments suggest a new type of evolutionary game dynamics, called prelife game dynamics or chemical game dynamics. These operate without template-directed replication, illustrating how small networks of RNAs could have developed and evolved in an RNA world.
Slow but not low: genomic comparisons reveal slower evolutionary rate and higher dN/dS in conifers compared to angiosperms.

PubMed

Buschiazzo, Emmanuel; Ritland, Carol; Bohlmann, Jörg; Ritland, Kermit

2012-01-20

Comparative genomics can inform us about the processes of mutation and selection across diverse taxa. Among seed plants, gymnosperms have been lacking in genomic comparisons. Recent EST and full-length cDNA collections for two conifers, Sitka spruce (Picea sitchensis) and loblolly pine (Pinus taeda), together with full genome sequences for two angiosperms, Arabidopsis thaliana and poplar (Populus trichocarpa), offer an opportunity to infer the evolutionary processes underlying thousands of orthologous protein-coding genes in gymnosperms compared with an angiosperm orthologue set. Based upon pairwise comparisons of 3,723 spruce and pine orthologues, we found an average synonymous genetic distance (dS) of 0.191, and an average dN/dS ratio of 0.314. Using a fossil-established divergence time of 140 million years between spruce and pine, we extrapolated a nucleotide substitution rate of 0.68 × 10(-9) synonymous substitutions per site per year. When compared to angiosperms, this indicates a dramatically slower rate of nucleotide substitution rates in conifers: on average 15-fold. Coincidentally, we found a three-fold higher dN/dS for the spruce-pine lineage compared to the poplar-Arabidopsis lineage. This joint occurrence of a slower evolutionary rate in conifers with higher dN/dS, and possibly positive selection, showcases the uniqueness of conifer genome evolution. Our results are in line with documented reduced nucleotide diversity, conservative genome evolution and low rates of diversification in conifers on the one hand and numerous examples of local adaptation in conifers on the other hand. We propose that reduced levels of nucleotide mutation in large and long-lived conifer trees, coupled with large effective population size, were the main factors leading to slow substitution rates but retention of beneficial mutations.
Green turtles (Chelonia mydas) foraging at Arvoredo Island in Southern Brazil: Genetic characterization and mixed stock analysis through mtDNA control region haplotypes

PubMed Central

2009-01-01

We analyzed mtDNA control region sequences of green turtles (Chelonia mydas) from Arvoredo Island, a foraging ground in southern Brazil, and identified eight haplotypes. Of these, CM-A8 (64%) and CM-A5 (22%) were dominant, the remainder presenting low frequencies (< 5%). Haplotype (h) and nucleotide (π) diversities were 0.5570 ± 0.0697 and 0.0021 ± 0.0016, respectively. Exact tests of differentiation and AMOVA ΦST pairwise values between the study area and eight other Atlantic foraging grounds revealed significant differences in most areas, except Ubatuba and Rocas/Noronha, in Brazil (p > 0.05). Mixed Stock Analysis, incorporating eleven Atlantic and one Mediterranean rookery as possible sources of individuals, indicated Ascension and Aves islands as the main contributing stocks to the Arvoredo aggregation (68.01% and 22.96%, respectively). These results demonstrate the extensive relationships between Arvoredo Island and other Atlantic foraging and breeding areas. Such an understanding provides a framework for establishing adequate management and conservation strategies for this endangered species. PMID:21637527
Population genetic structure and genetic diversity of Chinese pomfret at the coast of the East China Sea and the South China Sea.

PubMed

Sun, Peng; Tang, Baojun; Yin, Fei

2018-05-01

The Chinese pomfret Pampus chinensis is one of the most economic and ecological important marine fish species in China. In the present study, the population genetic structure and genetic diversity of P. chinensis were evaluated from a total sample size of 180 individuals representing six populations from the East China Sea and the South China Sea using mitochondrial cytochrome c oxidase subunit I (COI) gene. A total of 24 variable sites (including 3 singleton sites and 21 parsimony information sites) were observed, and 18 haplotypes were defined. The haplotype diversity (Hd) of the populations ranged from 0.559 to 0.775, and the nucleotide diversity (π) ranged from 0.330 to 1.090%. Analysis of molecular variance (AMOVA) reveals that the main variation (66.02%) was among individuals within populations. The average pairwise differences and ϕ ST values indicated significant genetic differentiation between Dongxing population and the other populations. The results of the present study are helpful for the sustainable management and utilization of this species.
Phylogenetic Characterizations of Highly Mutated EV-B106 Recombinants Showing Extensive Genetic Exchanges with Other EV-B in Xinjiang, China.

PubMed

Song, Yang; Zhang, Yong; Fan, Qin; Cui, Hui; Yan, Dongmei; Zhu, Shuangli; Tang, Haishu; Sun, Qiang; Wang, Dongyan; Xu, Wenbo

2017-02-23

Human enterovirus B106 (EV-B106) is a new member of the enterovirus B species. To date, only three nucleotide sequences of EV-B106 have been published, and only one full-length genome sequence (the Yunnan strain 148/YN/CHN/12) is available in the GenBank database. In this study, we conducted phylogenetic characterisation of four EV-B106 strains isolated in Xinjiang, China. Pairwise comparisons of the nucleotide sequences and the deduced amino acid sequences revealed that the four Xinjiang EV-B106 strains had only 80.5-80.8% nucleotide identity and 95.4-97.3% amino acid identity with the Yunnan EV-B106 strain, indicating high mutagenicity. Similarity plots and bootscanning analyses revealed that frequent intertypic recombination occurred in all four Xinjiang EV-B106 strains in the non-structural region. These four strains may share a donor sequence with the EV-B85 strain, which circulated in Xinjiang in 2011, indicating extensive genetic exchanges between these strains. All Xinjiang EV-B106 strains were temperature-sensitive. An antibody seroprevalence study against EV-B106 in two Xinjiang prefectures also showed low titres of neutralizing antibodies, suggesting limited exposure and transmission in the population. This study contributes the whole genome sequences of EV-B106 to the GenBank database and provides valuable information regarding the molecular epidemiology of EV-B106 in China.

Phylogenetic Characterizations of Highly Mutated EV-B106 Recombinants Showing Extensive Genetic Exchanges with Other EV-B in Xinjiang, China

PubMed Central

Song, Yang; Zhang, Yong; Fan, Qin; Cui, Hui; Yan, Dongmei; Zhu, Shuangli; Tang, Haishu; Sun, Qiang; Wang, Dongyan; Xu, Wenbo

2017-01-01

Human enterovirus B106 (EV-B106) is a new member of the enterovirus B species. To date, only three nucleotide sequences of EV-B106 have been published, and only one full-length genome sequence (the Yunnan strain 148/YN/CHN/12) is available in the GenBank database. In this study, we conducted phylogenetic characterisation of four EV-B106 strains isolated in Xinjiang, China. Pairwise comparisons of the nucleotide sequences and the deduced amino acid sequences revealed that the four Xinjiang EV-B106 strains had only 80.5–80.8% nucleotide identity and 95.4–97.3% amino acid identity with the Yunnan EV-B106 strain, indicating high mutagenicity. Similarity plots and bootscanning analyses revealed that frequent intertypic recombination occurred in all four Xinjiang EV-B106 strains in the non-structural region. These four strains may share a donor sequence with the EV-B85 strain, which circulated in Xinjiang in 2011, indicating extensive genetic exchanges between these strains. All Xinjiang EV-B106 strains were temperature-sensitive. An antibody seroprevalence study against EV-B106 in two Xinjiang prefectures also showed low titres of neutralizing antibodies, suggesting limited exposure and transmission in the population. This study contributes the whole genome sequences of EV-B106 to the GenBank database and provides valuable information regarding the molecular epidemiology of EV-B106 in China. PMID:28230168
Complete nucleotide sequences of a new bipartite begomovirus from Malvastrum sp. plants with bright yellow mosaic symptoms in South Texas.

PubMed

Alabi, Olufemi J; Villegas, Cecilia; Gregg, Lori; Murray, K Daniel

2016-06-01

Two isolates of a novel bipartite begomovirus, tentatively named malvastrum bright yellow mosaic virus (MaBYMV), were molecularly characterized from naturally infected plants of the genus Malvastrum showing bright yellow mosaic disease symptoms in South Texas. Six complete DNA-A and five DNA-B genome sequences of MaBYMV obtained from the isolates ranged in length from 2,608 to 2,609 nucleotides (nt) and 2,578 to 2,605 nt, respectively. Both genome segments shared a 178- to 180-nt common region. In pairwise comparisons, the complete DNA-A and DNA-B sequences of MaBYMV were most similar (87-88 % and 79-81 % identity, respectively) and phylogenetically related to the corresponding sequences of sida mosaic Sinaloa virus-[MX-Gua-06]. Further analysis revealed that MaBYMV is a putative recombinant virus, thus supporting the notion that malvaceous hosts may be influencing the evolution of several begomoviruses. The design of new diagnostic primers enabled the detection of MaBYMV in cohorts of Bemisia tabaci collected from symptomatic Malvastrum sp. plants, thus implicating whiteflies as potential vectors of the virus.
Selective sweep at the Drosophila melanogaster Suppressor of Hairless locus and its association with the In(2L)t inversion polymorphism.

PubMed Central

Depaulis, F; Brazier, L; Veuille, M

1999-01-01

The hitchhiking model of population genetics predicts that an allele favored by Darwinian selection can replace haplotypes from the same locus previously established at a neutral mutation-drift equilibrium. This process, known as "selective sweep," was studied by comparing molecular variation between the polymorphic In(2L)t inversion and the standard chromosome. Sequence variation was recorded at the Suppressor of Hairless (Su[H]) gene in an African population of Drosophila melanogaster. We found 47 nucleotide polymorphisms among 20 sequences of 1.2 kb. Neutrality tests were nonsignificant at the nucleotide level. However, these sites were strongly associated, because 290 out of 741 observed pairwise combinations between them were in significant linkage disequilibrium. We found only seven haplotypes, two occurring in the 9 In(2L)t chromosomes, and five in the 11 standard chromosomes, with no shared haplotype. Two haplotypes, one in each chromosome arrangement, made up two-thirds of the sample. This low haplotype diversity departed from neutrality in a haplotype test. This pattern supports a selective sweep hypothesis for the Su(H) chromosome region. PMID:10388820
Oligonucleotide fingerprinting of rRNA genes for analysis of fungal community composition.

PubMed

Valinsky, Lea; Della Vedova, Gianluca; Jiang, Tao; Borneman, James

2002-12-01

Thorough assessments of fungal diversity are currently hindered by technological limitations. Here we describe a new method for identifying fungi, oligonucleotide fingerprinting of rRNA genes (OFRG). ORFG sorts arrayed rRNA gene (ribosomal DNA [rDNA]) clones into taxonomic clusters through a series of hybridization experiments, each using a single oligonucleotide probe. A simulated annealing algorithm was used to design an OFRG probe set for fungal rDNA. Analysis of 1,536 fungal rDNA clones derived from soil generated 455 clusters. A pairwise sequence analysis showed that clones with average sequence identities of 99.2% were grouped into the same cluster. To examine the accuracy of the taxonomic identities produced by this OFRG experiment, we determined the nucleotide sequences for 117 clones distributed throughout the tree. For all but two of these clones, the taxonomic identities generated by this OFRG experiment were consistent with those generated by a nucleotide sequence analysis. Eighty-eight percent of the clones were affiliated with Ascomycota, while 12% belonged to BASIDIOMYCOTA: A large fraction of the clones were affiliated with the genera Fusarium (404 clones) and Raciborskiomyces (176 clones). Smaller assemblages of clones had high sequence identities to the Alternaria, Ascobolus, Chaetomium, Cryptococcus, and Rhizoctonia clades.
Power and sample-size estimation for microbiome studies using pairwise distances and PERMANOVA.

PubMed

Kelly, Brendan J; Gross, Robert; Bittinger, Kyle; Sherrill-Mix, Scott; Lewis, James D; Collman, Ronald G; Bushman, Frederic D; Li, Hongzhe

2015-08-01

The variation in community composition between microbiome samples, termed beta diversity, can be measured by pairwise distance based on either presence-absence or quantitative species abundance data. PERMANOVA, a permutation-based extension of multivariate analysis of variance to a matrix of pairwise distances, partitions within-group and between-group distances to permit assessment of the effect of an exposure or intervention (grouping factor) upon the sampled microbiome. Within-group distance and exposure/intervention effect size must be accurately modeled to estimate statistical power for a microbiome study that will be analyzed with pairwise distances and PERMANOVA. We present a framework for PERMANOVA power estimation tailored to marker-gene microbiome studies that will be analyzed by pairwise distances, which includes: (i) a novel method for distance matrix simulation that permits modeling of within-group pairwise distances according to pre-specified population parameters; (ii) a method to incorporate effects of different sizes within the simulated distance matrix; (iii) a simulation-based method for estimating PERMANOVA power from simulated distance matrices; and (iv) an R statistical software package that implements the above. Matrices of pairwise distances can be efficiently simulated to satisfy the triangle inequality and incorporate group-level effects, which are quantified by the adjusted coefficient of determination, omega-squared (ω2). From simulated distance matrices, available PERMANOVA power or necessary sample size can be estimated for a planned microbiome study. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Efficient selection of tagging single-nucleotide polymorphisms in multiple populations.

PubMed

Howie, Bryan N; Carlson, Christopher S; Rieder, Mark J; Nickerson, Deborah A

2006-08-01

Common genetic polymorphism may explain a portion of the heritable risk for common diseases, so considerable effort has been devoted to finding and typing common single-nucleotide polymorphisms (SNPs) in the human genome. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), suggesting that only a subset of all SNPs (known as tagging SNPs, or tagSNPs) need to be genotyped for disease association studies. Based on the genetic differences that exist among human populations, most tagSNP sets are defined in a single population and applied only in populations that are closely related. To improve the efficiency of multi-population analyses, we have developed an algorithm called MultiPop-TagSelect that finds a near-minimal union of population-specific tagSNP sets across an arbitrary number of populations. We present this approach as an extension of LD-select, a tagSNP selection method that uses a greedy algorithm to group SNPs into bins based on their pairwise association patterns, although the MultiPop-TagSelect algorithm could be used with any SNP tagging approach that allows choices between nearly equivalent SNPs. We evaluate the algorithm by considering tagSNP selection in candidate-gene resequencing data and lower density whole-chromosome data. Our analysis reveals that an exhaustive search is often intractable, while the developed algorithm can quickly and reliably find near-optimal solutions even for difficult tagSNP selection problems. Using populations of African, Asian, and European ancestry, we also show that an optimal multi-population set of tagSNPs can be substantially smaller (up to 44%) than a typical set obtained through independent or sequential selection.
Statistical method to compare massive parallel sequencing pipelines.

PubMed

Elsensohn, M H; Leblay, N; Dimassi, S; Campan-Fournier, A; Labalme, A; Roucher-Boulez, F; Sanlaville, D; Lesca, G; Bardel, C; Roy, P

2017-03-01

Today, sequencing is frequently carried out by Massive Parallel Sequencing (MPS) that cuts drastically sequencing time and expenses. Nevertheless, Sanger sequencing remains the main validation method to confirm the presence of variants. The analysis of MPS data involves the development of several bioinformatic tools, academic or commercial. We present here a statistical method to compare MPS pipelines and test it in a comparison between an academic (BWA-GATK) and a commercial pipeline (TMAP-NextGENe®), with and without reference to a gold standard (here, Sanger sequencing), on a panel of 41 genes in 43 epileptic patients. This method used the number of variants to fit log-linear models for pairwise agreements between pipelines. To assess the heterogeneity of the margins and the odds ratios of agreement, four log-linear models were used: a full model, a homogeneous-margin model, a model with single odds ratio for all patients, and a model with single intercept. Then a log-linear mixed model was fitted considering the biological variability as a random effect. Among the 390,339 base-pairs sequenced, TMAP-NextGENe® and BWA-GATK found, on average, 2253.49 and 1857.14 variants (single nucleotide variants and indels), respectively. Against the gold standard, the pipelines had similar sensitivities (63.47% vs. 63.42%) and close but significantly different specificities (99.57% vs. 99.65%; p < 0.001). Same-trend results were obtained when only single nucleotide variants were considered (99.98% specificity and 76.81% sensitivity for both pipelines). The method allows thus pipeline comparison and selection. It is generalizable to all types of MPS data and all pipelines.
A Comparative Study of Pairwise Learning Methods Based on Kernel Ridge Regression.

PubMed

Stock, Michiel; Pahikkala, Tapio; Airola, Antti; De Baets, Bernard; Waegeman, Willem

2018-06-12

Many machine learning problems can be formulated as predicting labels for a pair of objects. Problems of that kind are often referred to as pairwise learning, dyadic prediction, or network inference problems. During the past decade, kernel methods have played a dominant role in pairwise learning. They still obtain a state-of-the-art predictive performance, but a theoretical analysis of their behavior has been underexplored in the machine learning literature. In this work we review and unify kernel-based algorithms that are commonly used in different pairwise learning settings, ranging from matrix filtering to zero-shot learning. To this end, we focus on closed-form efficient instantiations of Kronecker kernel ridge regression. We show that independent task kernel ridge regression, two-step kernel ridge regression, and a linear matrix filter arise naturally as a special case of Kronecker kernel ridge regression, implying that all these methods implicitly minimize a squared loss. In addition, we analyze universality, consistency, and spectral filtering properties. Our theoretical results provide valuable insights into assessing the advantages and limitations of existing pairwise learning methods.
Complete nucleotide sequence, genome organization, and biological properties of human immunodeficiency virus type 1 in vivo: evidence for limited defectiveness and complementation.

PubMed Central

Li, Y; Hui, H; Burgess, C J; Price, R W; Sharp, P M; Hahn, B H; Shaw, G M

1992-01-01

Previous studies of the genetic and biologic characteristics of human immunodeficiency virus type 1 (HIV-1) have by necessity used tissue culture-derived virus. We recently reported the molecular cloning of four full-length HIV-1 genomes directly from uncultured human brain tissue (Y. Li, J. C. Kappes, J. A. Conway, R. W. Price, G. M. Shaw, and B. H. Hahn, J. Virol. 65:3973-3985, 1991). In this report, we describe the biologic properties of these four clones and the complete nucleotide sequences and genome organization of two of them. Clones HIV-1YU-2 and HIV-1YU-10 were 9,174 and 9,176 nucleotides in length, differed by 0.26% in nucleotide sequence, and except for a frameshift mutation in the pol gene in HIV-1YU-10, contained open reading frames corresponding to 5'-gag-pol-vif-vpr-tat-rev-vpu-env-nef-3' flanked by long terminal repeats. HIV-1YU-2 was fully replication competent, while HIV-1YU-10 and two other clones, HIV-1YU-21 and HIV-1YU-32, were defective. All three defective clones, however, when transfected into Cos-1 cells in any pairwise combination, yielded virions that were replication competent and transmissible by cell-free passage. The cellular host range of HIV-1YU-2 was strictly limited to primary T lymphocytes and monocyte-macrophages, a property conferred by its external envelope glycoprotein. Phylogenetic analyses of HIV-1YU-2 gene sequences revealed this virus to be a member of the North American/European HIV-1 subgroup, with specific similarity to other monocyte-tropic viruses in its V3 envelope amino acid sequence. These results indicate that HIV-1 infection of brain is characterized by the persistence of mixtures of fully competent, minimally defective, and more substantially altered viral forms and that complementation among them is readily attainable. In addition, the limited degree of genotypic heterogeneity observed among HIV-1YU and other brain-derived viruses and their preferential tropism for monocyte-macrophages suggest that viral replication within the central nervous system may differ from that within the peripheral lymphoid compartment in significant and clinically important ways. The availability of genetically and biologically well characterized HIV-1 clones from uncultured human tissue should facilitate future studies of virus-cell interactions relevant to viral pathogenesis and drug and vaccine development. Images PMID:1404605
Shaped Ceria Nanocrystals Catalyze Efficient and Selective Para-Hydrogen-Enhanced Polarization.

PubMed

Zhao, Evan W; Zheng, Haibin; Zhou, Ronghui; Hagelin-Weaver, Helena E; Bowers, Clifford R

2015-11-23

Intense para-hydrogen-enhanced NMR signals are observed in the hydrogenation of propene and propyne over ceria nanocubes, nano-octahedra, and nanorods. The well-defined ceria shapes, synthesized by a hydrothermal method, expose different crystalline facets with various oxygen vacancy densities, which are known to play a role in hydrogenation and oxidation catalysis. While the catalytic activity of the hydrogenation of propene over ceria is strongly facet-dependent, the pairwise selectivity is low (2.4% at 375 °C), which is consistent with stepwise H atom transfer, and it is the same for all three nanocrystal shapes. Selective semi-hydrogenation of propyne over ceria nanocubes yields hyperpolarized propene with a similar pairwise selectivity of (2.7% at 300 °C), indicating product formation predominantly by a non-pairwise addition. Ceria is also shown to be an efficient pairwise replacement catalyst for propene. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Molecular epidemiology of Plum pox virus in Japan.

PubMed

Maejima, Kensaku; Himeno, Misako; Komatsu, Ken; Takinami, Yusuke; Hashimoto, Masayoshi; Takahashi, Shuichiro; Yamaji, Yasuyuki; Oshima, Kenro; Namba, Shigetou

2011-05-01

For a molecular epidemiological study based on complete genome sequences, 37 Plum pox virus (PPV) isolates were collected from the Kanto region in Japan. Pair-wise analyses revealed that all 37 Japanese isolates belong to the PPV-D strain, with low genetic diversity (less than 0.8%). In phylogenetic analysis of the PPV-D strain based on complete nucleotide sequences, the relationships of the PPV-D strain were reconstructed with high resolution: at the global level, the American, Canadian, and Japanese isolates formed their own distinct monophyletic clusters, suggesting that the routes of viral entry into these countries were independent; at the local level, the actual transmission histories of PPV were precisely reconstructed with high bootstrap support. This is the first description of the molecular epidemiology of PPV based on complete genome sequences.
Power and sample-size estimation for microbiome studies using pairwise distances and PERMANOVA

PubMed Central

Kelly, Brendan J.; Gross, Robert; Bittinger, Kyle; Sherrill-Mix, Scott; Lewis, James D.; Collman, Ronald G.; Bushman, Frederic D.; Li, Hongzhe

2015-01-01

Motivation: The variation in community composition between microbiome samples, termed beta diversity, can be measured by pairwise distance based on either presence–absence or quantitative species abundance data. PERMANOVA, a permutation-based extension of multivariate analysis of variance to a matrix of pairwise distances, partitions within-group and between-group distances to permit assessment of the effect of an exposure or intervention (grouping factor) upon the sampled microbiome. Within-group distance and exposure/intervention effect size must be accurately modeled to estimate statistical power for a microbiome study that will be analyzed with pairwise distances and PERMANOVA. Results: We present a framework for PERMANOVA power estimation tailored to marker-gene microbiome studies that will be analyzed by pairwise distances, which includes: (i) a novel method for distance matrix simulation that permits modeling of within-group pairwise distances according to pre-specified population parameters; (ii) a method to incorporate effects of different sizes within the simulated distance matrix; (iii) a simulation-based method for estimating PERMANOVA power from simulated distance matrices; and (iv) an R statistical software package that implements the above. Matrices of pairwise distances can be efficiently simulated to satisfy the triangle inequality and incorporate group-level effects, which are quantified by the adjusted coefficient of determination, omega-squared (ω2). From simulated distance matrices, available PERMANOVA power or necessary sample size can be estimated for a planned microbiome study. Availability and implementation: http://github.com/brendankelly/micropower. Contact: brendank@mail.med.upenn.edu or hongzhe@upenn.edu PMID:25819674
Dynamics of pairwise motions in the Cosmic Web

NASA Astrophysics Data System (ADS)

Hellwing, Wojciech A.

2016-10-01

We present results of analysis of the dark matter (DM) pairwise velocity statistics in different Cosmic Web environments. We use the DM velocity and density field from the Millennium 2 simulation together with the NEXUS+ algorithm to segment the simulation volume into voxels uniquely identifying one of the four possible environments: nodes, filaments, walls or cosmic voids. We show that the PDFs of the mean infall velocities v 12 as well as its spatial dependence together with the perpendicular and parallel velocity dispersions bear a significant signal of the large-scale structure environment in which DM particle pairs are embedded. The pairwise flows are notably colder and have smaller mean magnitude in wall and voids, when compared to much denser environments of filaments and nodes. We discuss on our results, indicating that they are consistent with a simple theoretical predictions for pairwise motions as induced by gravitational instability mechanism. Our results indicate that the Cosmic Web elements are coherent dynamical entities rather than just temporal geometrical associations. In addition it should be possible to observationally test various Cosmic Web finding algorithms by segmenting available peculiar velocity data and studying resulting pairwise velocity statistics.
Unraveling Haplotype Diversity of the Apical Membrane Antigen-1 Gene in Plasmodium falciparum Populations in Thailand

PubMed Central

Lumkul, Lalita; Sawaswong, Vorthon; Simpalipan, Phumin; Kaewthamasorn, Morakot; Harnyuttanakorn, Pongchai; Pattaradilokrat, Sittiporn

2018-01-01

Development of an effective vaccine is critically needed for the prevention of malaria. One of the key antigens for malaria vaccines is the apical membrane antigen 1 (AMA-1) of the human malaria parasite Plasmodium falciparum, the surface protein for erythrocyte invasion of the parasite. The gene encoding AMA-1 has been sequenced from populations of P. falciparum worldwide, but the haplotype diversity of the gene in P. falciparum populations in the Greater Mekong Subregion (GMS), including Thailand, remains to be characterized. In the present study, the AMA-1 gene was PCR amplified and sequenced from the genomic DNA of 65 P. falciparum isolates from 5 endemic areas in Thailand. The nearly full-length 1,848 nucleotide sequence of AMA-1 was subjected to molecular analyses, including nucleotide sequence diversity, haplotype diversity and deduced amino acid sequence diversity and neutrality tests. Phylogenetic analysis and pairwise population differentiation (Fst indices) were performed to infer the population structure. The analyses identified 60 single nucleotide polymorphic loci, predominately located in domain I of AMA-1. A total of 31 unique AMA-1 haplotypes were identified, which included 11 novel ones. The phylogenetic tree of the AMA-1 haplotypes revealed multiple clades of AMA-1, each of which contained parasites of multiple geographical origins, consistent with the Fst indices indicating genetic homogeneity or gene flow among geographically distinct populations of P. falciparum in Thailand’s borders with Myanmar, Laos and Cambodia. In summary, the study revealed novel haplotypes and population structure needed for the further advancement of AMA-1-based malaria vaccines in the GMS. PMID:29742870
Male Lineages in Brazil: Intercontinental Admixture and Stratification of the European Background.

PubMed

Resque, Rafael; Gusmão, Leonor; Geppert, Maria; Roewer, Lutz; Palha, Teresinha; Alvarez, Luis; Ribeiro-dos-Santos, Ândrea; Santos, Sidney

2016-01-01

The non-recombining nature of the Y chromosome and the well-established phylogeny of Y-specific Single Nucleotide Polymorphisms (Y-SNPs) make them useful for defining haplogroups with high geographical specificity; therefore, they are more apt than the Y-STRs to detect population stratification in admixed populations from diverse continental origins. Different Y-SNP typing strategies have been described to address issues of population history and movements within geographic territories of interest. In this study, we investigated a set of 41 Y-SNPs in 1217 unrelated males from the five Brazilian geopolitical regions, aiming to disclose the genetic structure of male lineages in the country. A population comparison based on pairwise FST genetic distances did not reveal statistically significant differences in haplogroup frequency distributions among populations from the different regions. The genetic differences observed among regions were, however, consistent with the colonization history of the country. The sample from the Northern region presented the highest Native American ancestry (8.4%), whereas the more pronounced African contribution could be observed in the Northeastern population (15.1%). The Central-Western and Southern samples showed the higher European contributions (95.7% and 93.6%, respectively). The Southeastern region presented significant European (86.1%) and African (12.0%) contributions. The subtyping of the most frequent European lineage in Brazil (R1b1a-M269) allowed differences in the genetic European background of the five Brazilian regions to be investigated for the first time.
Male Lineages in Brazil: Intercontinental Admixture and Stratification of the European Background

PubMed Central

Geppert, Maria; Roewer, Lutz; Palha, Teresinha; Alvarez, Luis; Ribeiro-dos-Santos, Ândrea; Santos, Sidney

2016-01-01

The non-recombining nature of the Y chromosome and the well-established phylogeny of Y-specific Single Nucleotide Polymorphisms (Y-SNPs) make them useful for defining haplogroups with high geographical specificity; therefore, they are more apt than the Y-STRs to detect population stratification in admixed populations from diverse continental origins. Different Y-SNP typing strategies have been described to address issues of population history and movements within geographic territories of interest. In this study, we investigated a set of 41 Y-SNPs in 1217 unrelated males from the five Brazilian geopolitical regions, aiming to disclose the genetic structure of male lineages in the country. A population comparison based on pairwise FST genetic distances did not reveal statistically significant differences in haplogroup frequency distributions among populations from the different regions. The genetic differences observed among regions were, however, consistent with the colonization history of the country. The sample from the Northern region presented the highest Native American ancestry (8.4%), whereas the more pronounced African contribution could be observed in the Northeastern population (15.1%). The Central-Western and Southern samples showed the higher European contributions (95.7% and 93.6%, respectively). The Southeastern region presented significant European (86.1%) and African (12.0%) contributions. The subtyping of the most frequent European lineage in Brazil (R1b1a-M269) allowed differences in the genetic European background of the five Brazilian regions to be investigated for the first time. PMID:27046235
Non-pairwise additivity of the leading-order dispersion energy

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hollett, Joshua W., E-mail: j.hollett@uwinnipeg.ca

2015-02-28

The leading-order (i.e., dipole-dipole) dispersion energy is calculated for one-dimensional (1D) and two-dimensional (2D) infinite lattices, and an infinite 1D array of infinitely long lines, of doubly occupied locally harmonic wells. The dispersion energy is decomposed into pairwise and non-pairwise additive components. By varying the force constant and separation of the wells, the non-pairwise additive contribution to the dispersion energy is shown to depend on the overlap of density between neighboring wells. As well separation is increased, the non-pairwise additivity of the dispersion energy decays. The different rates of decay for 1D and 2D lattices of wells is explained inmore » terms of a Jacobian effect that influences the number of nearest neighbors. For an array of infinitely long lines of wells spaced 5 bohrs apart, and an inter-well spacing of 3 bohrs within a line, the non-pairwise additive component of the leading-order dispersion energy is −0.11 kJ mol{sup −1} well{sup −1}, which is 7% of the total. The polarizability of the wells and the density overlap between them are small in comparison to that of the atomic densities that arise from the molecular density partitioning used in post-density-functional theory (DFT) damped dispersion corrections, or DFT-D methods. Therefore, the nonadditivity of the leading-order dispersion observed here is a conservative estimate of that in molecular clusters.« less
Balancing Selection on a Regulatory Region Exhibiting Ancient Variation That Predates Human–Neandertal Divergence

PubMed Central

Iskow, Rebecca C.; Austermann, Christian; Scharer, Christopher D.; Raj, Towfique; Boss, Jeremy M.; Sunyaev, Shamil; Price, Alkes; Stranger, Barbara; Simon, Viviana; Lee, Charles

2013-01-01

Ancient population structure shaping contemporary genetic variation has been recently appreciated and has important implications regarding our understanding of the structure of modern human genomes. We identified a ∼36-kb DNA segment in the human genome that displays an ancient substructure. The variation at this locus exists primarily as two highly divergent haplogroups. One of these haplogroups (the NE1 haplogroup) aligns with the Neandertal haplotype and contains a 4.6-kb deletion polymorphism in perfect linkage disequilibrium with 12 single nucleotide polymorphisms (SNPs) across diverse populations. The other haplogroup, which does not contain the 4.6-kb deletion, aligns with the chimpanzee haplotype and is likely ancestral. Africans have higher overall pairwise differences with the Neandertal haplotype than Eurasians do for this NE1 locus (p<10−15). Moreover, the nucleotide diversity at this locus is higher in Eurasians than in Africans. These results mimic signatures of recent Neandertal admixture contributing to this locus. However, an in-depth assessment of the variation in this region across multiple populations reveals that African NE1 haplotypes, albeit rare, harbor more sequence variation than NE1 haplotypes found in Europeans, indicating an ancient African origin of this haplogroup and refuting recent Neandertal admixture. Population genetic analyses of the SNPs within each of these haplogroups, along with genome-wide comparisons revealed significant FST (p = 0.00003) and positive Tajima's D (p = 0.00285) statistics, pointing to non-neutral evolution of this locus. The NE1 locus harbors no protein-coding genes, but contains transcribed sequences as well as sequences with putative regulatory function based on bioinformatic predictions and in vitro experiments. We postulate that the variation observed at this locus predates Human–Neandertal divergence and is evolving under balancing selection, especially among European populations. PMID:23593015
Polymorphism of LRP5, but not of TNFRSF11B, is associated with a decrease in bone mineral density in postmenopausal Maya-Mestizo women.

PubMed

Canto-Cetina, Thelma; Polanco Reyes, Lucila; González Herrera, Lizbeth; Rojano-Mejía, David; Coral-Vázquez, Ramón Mauricio; Coronel, Agustín; Canto, Patricia

2013-01-01

Osteoporosis is a complex disease characterized principally by low bone mineral density (BMD), which is determined by an interaction of genetic, metabolic, and environmental factors. The aim of this study was to analyze the possible association among one polymorphism of LRP5 and three polymorphisms of TNFRSF11B as well as their haplotypes with BMD variations in Maya-Mestizo postmenopausal women. We studied 583 postmenopausal women of Maya-Mestizo ethnic origin. A structured questionnaire for risk factors was applied and BMD was measured in lumbar spine (LS), total hip (TH), and femoral neck (FN) by dual-energy X-ray absorptiometry. DNA was obtained from blood leukocytes. One single-nucleotide polymorphism of LRP5 (rs3736228, p.A1330V) and three of TNFRSF11B (rs4355801, rs2073618, and rs6993813) were studied using real-time PCR allelic discrimination for genotyping. Differences between the means of the BMDs according to the genotype were analyzed with covariance. Deviations from Hardy-Weinberg equilibrium were tested. Pairwise linkage disequilibrium between single nucleotide polymorphisms was calculated by direct correlation r(2), and haplotype analysis of TNFRSF11B was conducted. The Val genotype of the rs3736228 (p.A1330V) of LRP5 was significantly associated with BMD variations at the LS, TH, and FN. None of the three polymorphisms of TNFRSF11B was associated with BMD variations. Our results show that p.A1330V was significantly associated with BMD variations at all three skeletal sites analyzed; the Val allele and the Val/Val genotype were those most frequently found in our population. Copyright © 2013 Wiley Periodicals, Inc.
In-silico Taxonomic Classification of 373 Genomes Reveals Species Misidentification and New Genospecies within the Genus Pseudomonas.

PubMed

Tran, Phuong N; Savka, Michael A; Gan, Han Ming

2017-01-01

The genus Pseudomonas has one of the largest diversity of species within the Bacteria kingdom. To date, its taxonomy is still being revised and updated. Due to the non-standardized procedure and ambiguous thresholds at species level, largely based on 16S rRNA gene or conventional biochemical assay, species identification of publicly available Pseudomonas genomes remains questionable. In this study, we performed a large-scale analysis of all Pseudomonas genomes with species designation (excluding the well-defined P. aeruginosa ) and re-evaluated their taxonomic assignment via in silico genome-genome hybridization and/or genetic comparison with valid type species. Three-hundred and seventy-three pseudomonad genomes were analyzed and subsequently clustered into 145 distinct genospecies. We detected 207 erroneous labels and corrected 43 to the proper species based on Average Nucleotide Identity Multilocus Sequence Typing (MLST) sequence similarity to the type strain. Surprisingly, more than half of the genomes initially designated as Pseudomonas syringae and Pseudomonas fluorescens should be classified either to a previously described species or to a new genospecies. Notably, high pairwise average nucleotide identity (>95%) indicating species-level similarity was observed between P. synxantha-P. libanensis, P. psychrotolerans - P. oryzihabitans , and P. kilonensis- P. brassicacearum , that were previously differentiated based on conventional biochemical tests and/or genome-genome hybridization techniques.

In Silico Analysis of Gene Expression Network Components Underlying Pigmentation Phenotypes in the Python Identified Evolutionarily Conserved Clusters of Transcription Factor Binding Sites

PubMed Central

2016-01-01

Color variation provides the opportunity to investigate the genetic basis of evolution and selection. Reptiles are less studied than mammals. Comparative genomics approaches allow for knowledge gained in one species to be leveraged for use in another species. We describe a comparative vertebrate analysis of conserved regulatory modules in pythons aimed at assessing bioinformatics evidence that transcription factors important in mammalian pigmentation phenotypes may also be important in python pigmentation phenotypes. We identified 23 python orthologs of mammalian genes associated with variation in coat color phenotypes for which we assessed the extent of pairwise protein sequence identity between pythons and mouse, dog, horse, cow, chicken, anole lizard, and garter snake. We next identified a set of melanocyte/pigment associated transcription factors (CREB, FOXD3, LEF-1, MITF, POU3F2, and USF-1) that exhibit relatively conserved sequence similarity within their DNA binding regions across species based on orthologous alignments across multiple species. Finally, we identified 27 evolutionarily conserved clusters of transcription factor binding sites within ~200-nucleotide intervals of the 1500-nucleotide upstream regions of AIM1, DCT, MC1R, MITF, MLANA, OA1, PMEL, RAB27A, and TYR from Python bivittatus. Our results provide insight into pigment phenotypes in pythons. PMID:27698666
In Silico Analysis of Gene Expression Network Components Underlying Pigmentation Phenotypes in the Python Identified Evolutionarily Conserved Clusters of Transcription Factor Binding Sites.

PubMed

Irizarry, Kristopher J L; Bryden, Randall L

2016-01-01

Color variation provides the opportunity to investigate the genetic basis of evolution and selection. Reptiles are less studied than mammals. Comparative genomics approaches allow for knowledge gained in one species to be leveraged for use in another species. We describe a comparative vertebrate analysis of conserved regulatory modules in pythons aimed at assessing bioinformatics evidence that transcription factors important in mammalian pigmentation phenotypes may also be important in python pigmentation phenotypes. We identified 23 python orthologs of mammalian genes associated with variation in coat color phenotypes for which we assessed the extent of pairwise protein sequence identity between pythons and mouse, dog, horse, cow, chicken, anole lizard, and garter snake. We next identified a set of melanocyte/pigment associated transcription factors (CREB, FOXD3, LEF-1, MITF, POU3F2, and USF-1) that exhibit relatively conserved sequence similarity within their DNA binding regions across species based on orthologous alignments across multiple species. Finally, we identified 27 evolutionarily conserved clusters of transcription factor binding sites within ~200-nucleotide intervals of the 1500-nucleotide upstream regions of AIM1, DCT, MC1R, MITF, MLANA, OA1, PMEL, RAB27A, and TYR from Python bivittatus . Our results provide insight into pigment phenotypes in pythons.
Molecular characterisation of Atlantic salmon paramyxovirus (ASPV): A novel paramyxovirus associated with proliferative gill inflammation

USGS Publications Warehouse

Falk, K.; Batts, W.N.; Kvellestad, A.; Kurath, G.; Wiik-Nielsen, J.; Winton, J.R.

2008-01-01

Atlantic salmon paramyxovirus (ASPV) was isolated in 1995 from gills of farmed Atlantic salmon suffering from proliferative gill inflammation. The complete genome sequence of ASPV was determined, revealing a genome 16,968 nucleotides in length consisting of six non-overlapping genes coding for the nucleo- (N), phospho- (P), matrix- (M), fusion- (F), haemagglutinin-neuraminidase- (HN) and large polymerase (L) proteins in the order 3???-N-P-M-F-HN-L-5???. The various conserved features related to virus replication found in most paramyxoviruses were also found in ASPV. These include: conserved and complementary leader and trailer sequences, tri-nucleotide intergenic regions and highly conserved transcription start and stop signal sequences. The P gene expression strategy of ASPV was like that of the respiro-, morbilli- and henipaviruses, which express the P and C proteins from the primary transcript and edit a portion of the mRNA to encode V and W proteins. Sequence similarities among various features related to virus replication, pairwise comparisons of all deduced ASPV protein sequences with homologous regions from other members of the family Paramyxoviridae, and phylogenetic analyses of these amino acid sequences suggested that ASPV was a novel member of the sub-family Paramyxovirinae, most closely related to the respiroviruses. ?? 2008 Elsevier B.V. All rights reserved.
Slow but not low: genomic comparisons reveal slower evolutionary rate and higher dN/dS in conifers compared to angiosperms

PubMed Central

2012-01-01

Background Comparative genomics can inform us about the processes of mutation and selection across diverse taxa. Among seed plants, gymnosperms have been lacking in genomic comparisons. Recent EST and full-length cDNA collections for two conifers, Sitka spruce (Picea sitchensis) and loblolly pine (Pinus taeda), together with full genome sequences for two angiosperms, Arabidopsis thaliana and poplar (Populus trichocarpa), offer an opportunity to infer the evolutionary processes underlying thousands of orthologous protein-coding genes in gymnosperms compared with an angiosperm orthologue set. Results Based upon pairwise comparisons of 3,723 spruce and pine orthologues, we found an average synonymous genetic distance (dS) of 0.191, and an average dN/dS ratio of 0.314. Using a fossil-established divergence time of 140 million years between spruce and pine, we extrapolated a nucleotide substitution rate of 0.68 × 10-9 synonymous substitutions per site per year. When compared to angiosperms, this indicates a dramatically slower rate of nucleotide substitution rates in conifers: on average 15-fold. Coincidentally, we found a three-fold higher dN/dS for the spruce-pine lineage compared to the poplar-Arabidopsis lineage. This joint occurrence of a slower evolutionary rate in conifers with higher dN/dS, and possibly positive selection, showcases the uniqueness of conifer genome evolution. Conclusions Our results are in line with documented reduced nucleotide diversity, conservative genome evolution and low rates of diversification in conifers on the one hand and numerous examples of local adaptation in conifers on the other hand. We propose that reduced levels of nucleotide mutation in large and long-lived conifer trees, coupled with large effective population size, were the main factors leading to slow substitution rates but retention of beneficial mutations. PMID:22264329
Genetic Divergence and Dispersal of Yellow Fever Virus, Brazil

PubMed Central

Bryant, Juliet E.; Travassos da Rosa, Amelia P.A.; Tesh, Robert B.; Rodrigues, Sueli G.; Barrett, Alan D.T.

2004-01-01

An analysis of 79 yellow fever virus (YFV) isolates collected from 1935 to 2001 in Brazil showed a single genotype (South America I) circulating in the country, with the exception of a single strain from Rondônia, which represented South America genotype II. Brazilian YFV strains have diverged into two clades; an older clade appears to have become extinct and another has become the dominant lineage in recent years. Pairwise nucleotide diversity between strains ranged from 0% to 7.4%, while amino acid divergence ranged from 0% to 4.6%. Phylogenetic analysis indicated traffic of virus variants through large geographic areas and suggested that migration of infected people may be an important mechanism of virus dispersal. Isolation of vaccine virus from a patient with a fatal case suggests that vaccine-related illness may have been misdiagnosed in the past. PMID:15498159
The recent emergence in hospitals of multidrug-resistant community-associated sequence type 1 and spa type t127 methicillin-resistant Staphylococcus aureus investigated by whole-genome sequencing: Implications for screening

PubMed Central

Earls, Megan R.; Kinnevey, Peter M.; Brennan, Gráinne I.; Lazaris, Alexandros; Skally, Mairead; O’Connell, Brian; Humphreys, Hilary; Shore, Anna C.

2017-01-01

Community-associated spa type t127/t922 methicillin-resistant Staphylococcus aureus (MRSA) prevalence increased from 1%-7% in Ireland between 2010–2015. This study tracked the spread of 89 such isolates from June 2013-June 2016. These included 78 healthcare-associated and 11 community associated-MRSA isolates from a prolonged hospital outbreak (H1) (n = 46), 16 other hospitals (n = 28), four other healthcare facilities (n = 4) and community-associated sources (n = 11). Isolates underwent antimicrobial susceptibility testing, DNA microarray profiling and whole-genome sequencing. Minimum spanning trees were generated following core-genome multilocus sequence typing and pairwise single nucleotide variation (SNV) analysis was performed. All isolates were sequence type 1 MRSA staphylococcal cassette chromosome mec type IV (ST1-MRSA-IV) and 76/89 were multidrug-resistant. Fifty isolates, including 40/46 from H1, were high-level mupirocin-resistant, carrying a conjugative 39 kb iles2-encoding plasmid. Two closely related ST1-MRSA-IV strains (I and II) and multiple sporadic strains were identified. Strain I isolates (57/89), including 43/46 H1 and all high-level mupirocin-resistant isolates, exhibited ≤80 SNVs. Two strain I isolates from separate H1 healthcare workers differed from other H1/strain I isolates by 7–47 and 12–53 SNVs, respectively, indicating healthcare worker involvement in this outbreak. Strain II isolates (19/89), including the remaining H1 isolates, exhibited ≤127 SNVs. For each strain, the pairwise SNVs exhibited by healthcare-associated and community-associated isolates indicated recent transmission of ST1-MRSA-IV within and between multiple hospitals, healthcare facilities and communities in Ireland. Given the interchange between healthcare-associated and community-associated isolates in hospitals, the risk factors that inform screening for MRSA require revision. PMID:28399151
Sequencing and Characterization of the Invasive Sycamore Lace Bug Corythucha ciliata (Hemiptera: Tingidae) Transcriptome

PubMed Central

Qu, Cheng; Fu, Ningning; Xu, Yihua

2016-01-01

The sycamore lace bug, Corythucha ciliata (Hemiptera: Tingidae), is an invasive forestry pest rapidly expanding in many countries. This pest poses a considerable threat to the urban forestry ecosystem, especially to Platanus spp. However, its molecular biology and biochemistry are poorly understood. This study reports the first C. ciliata transcriptome, encompassing three different life stages (Nymphs, adults female (AF) and adults male (AM)). In total, 26.53 GB of clean data and 60,879 unigenes were obtained from three RNA-seq libraries. These unigenes were annotated and classified by Nr (NCBI non-redundant protein sequences), Nt (NCBI non-redundant nucleotide sequences), Pfam (Protein family), KOG/COG (Clusters of Orthologous Groups of proteins), Swiss-Prot (A manually annotated and reviewed protein sequence database), and KO (KEGG Ortholog database). After all pairwise comparisons between these three different samples, a large number of differentially expressed genes were revealed. The dramatic differences in global gene expression profiles were found between distinct life stages (nymphs and AF, nymphs and AM) and sex difference (AF and AM), with some of the significantly differentially expressed genes (DEGs) being related to metamorphosis, digestion, immune and sex difference. The different express of unigenes were validated through quantitative Real-Time PCR (qRT-PCR) for 16 randomly selected unigenes. In addition, 17,462 potential simple sequence repeat molecular markers were identified in these transcriptome resources. These comprehensive C. ciliata transcriptomic information can be utilized to promote the development of environmentally friendly methodologies to disrupt the processes of metamorphosis, digestion, immune and sex differences. PMID:27494615
Weak Higher-Order Interactions in Macroscopic Functional Networks of the Resting Brain.

PubMed

Huang, Xuhui; Xu, Kaibin; Chu, Congying; Jiang, Tianzi; Yu, Shan

2017-10-25

Interactions among different brain regions are usually examined through functional connectivity (FC) analysis, which is exclusively based on measuring pairwise correlations in activities. However, interactions beyond the pairwise level, that is, higher-order interactions (HOIs), are vital in understanding the behavior of many complex systems. So far, whether HOIs exist among brain regions and how they can affect the brain's activities remains largely elusive. To address these issues, here, we analyzed blood oxygenation level-dependent (BOLD) signals recorded from six typical macroscopic functional networks of the brain in 100 human subjects (46 males and 54 females) during the resting state. Through examining the binarized BOLD signals, we found that HOIs within and across individual networks were both very weak regardless of the network size, topology, degree of spatial proximity, spatial scales, and whether the global signal was regressed. To investigate the potential mechanisms underlying the weak HOIs, we analyzed the dynamics of a network model and also found that HOIs were generally weak within a wide range of key parameters provided that the overall dynamic feature of the model was similar to the empirical data and it was operating close to a linear fluctuation regime. Our results suggest that weak HOI may be a general property of brain's macroscopic functional networks, which implies the dominance of pairwise interactions in shaping brain activities at such a scale and warrants the validity of widely used pairwise-based FC approaches. SIGNIFICANCE STATEMENT To explain how activities of different brain areas are coordinated through interactions is essential to revealing the mechanisms underlying various brain functions. Traditionally, such an interaction structure is commonly studied using pairwise-based functional network analyses. It is unclear whether the interactions beyond the pairwise level (higher-order interactions or HOIs) play any role in this process. Here, we show that HOIs are generally weak in macroscopic brain networks. We also suggest a possible dynamical mechanism that may underlie this phenomenon. These results provide plausible explanation for the effectiveness of widely used pairwise-based approaches in analyzing brain networks. More importantly, it reveals a previously unknown, simple organization of the brain's macroscopic functional systems. Copyright © 2017 the authors 0270-6474/17/3710481-17$15.00/0.
[Analysis of variance of repeated data measured by water maze with SPSS].

PubMed

Qiu, Hong; Jin, Guo-qin; Jin, Ru-feng; Zhao, Wei-kang

2007-01-01

To introduce the method of analyzing repeated data measured by water maze with SPSS 11.0, and offer a reference statistical method to clinical and basic medicine researchers who take the design of repeated measures. Using repeated measures and multivariate analysis of variance (ANOVA) process of the general linear model in SPSS and giving comparison among different groups and different measure time pairwise. Firstly, Mauchly's test of sphericity should be used to judge whether there were relations among the repeatedly measured data. If any (P
Hybrid pairwise likelihood analysis of animal behavior experiments.

PubMed

Cattelan, Manuela; Varin, Cristiano

2013-12-01

The study of the determinants of fights between animals is an important issue in understanding animal behavior. For this purpose, tournament experiments among a set of animals are often used by zoologists. The results of these tournament experiments are naturally analyzed by paired comparison models. Proper statistical analysis of these models is complicated by the presence of dependence between the outcomes of fights because the same animal is involved in different contests. This paper discusses two different model specifications to account for between-fights dependence. Models are fitted through the hybrid pairwise likelihood method that iterates between optimal estimating equations for the regression parameters and pairwise likelihood inference for the association parameters. This approach requires the specification of means and covariances only. For this reason, the method can be applied also when the computation of the joint distribution is difficult or inconvenient. The proposed methodology is investigated by simulation studies and applied to real data about adult male Cape Dwarf Chameleons. © 2013, The International Biometric Society.
Complete genome sequence of Fer-de-Lance Virus reveals a novel gene in reptilian Paramyxoviruses

USGS Publications Warehouse

Kurath, G.; Batts, W.N.; Ahne, W.; Winton, J.R.

2004-01-01

The complete RNA genome sequence of the archetype reptilian paramyxovirus, Fer-de-Lance virus (FDLV), has been determined. The genome is 15,378 nucleotides in length and consists of seven nonoverlapping genes in the order 3??? N-U-P-M-F-HN-L 5???, coding for the nucleocapsid, unknown, phospho-, matrix, fusion, hemagglutinin-neuraminidase, and large polymerase proteins, respectively. The gene junctions contain highly conserved transcription start and stop signal sequences and tri-nucleotide intergenic regions similar to those of other Paramyxoviridae. The FDLV P gene expression strategy is like that of rubulaviruses, which express the accessory V protein from the primary transcript and edit a portion of the mRNA to encode P and I proteins. There is also an overlapping open reading frame potentially encoding a small basic protein in the P gene. The gene designated U (unknown), encodes a deduced protein of 19.4 kDa that has no counterpart in other paramyxoviruses and has no similarity with sequences in the National Center for Biotechnology Information database. Active transcription of the U gene in infected cells was demonstrated by Northern blot analysis, and bicistronic N-U mRNA was also evident. The genomes of two other snake paramyxovirus genotypes were also found to have U genes, with 11 to 16% nucleotide divergence from the FDLV U gene. Pairwise comparisons of amino acid identities and phylogenetic analyses of all deduced FDLV protein sequences with homologous sequences from other Paramyxoviridae indicate that FDLV represents a new genus within the subfamily Paramyxovirinae. We suggest the name Ferlavirus for the new genus, with FDLV as the type species.
The heterogeneous levels of linkage disequilibrium in white spruce genes and comparative analysis with other conifers.

PubMed

Pavy, N; Namroud, M-C; Gagnon, F; Isabel, N; Bousquet, J

2012-03-01

In plants, knowledge about linkage disequilibrium (LD) is relevant for the design of efficient single-nucleotide polymorphism arrays in relation to their use in population and association genomics studies. Previous studies of conifer genes have shown LD to decay rapidly within gene limits, but exceptions have been reported. To evaluate the extent of heterogeneity of LD among conifer genes and its potential causes, we examined LD in 105 genes of white spruce (Picea glauca) by sequencing a panel of 48 haploid megagametophytes from natural populations and further compared it with LD in other conifer species. The average pairwise r(2) value was 0.19 (s.d.=0.19), and LD dropped quickly with a half-decay being reached at a distance of 65 nucleotides between sites. However, LD was significantly heterogeneous among genes. A first group of 29 genes had stronger LD (mean r(2)=0.28), and a second group of 38 genes had weaker LD (mean r(2)=0.12). While a strong relationship was found with the recombination rate, there was no obvious relationship between LD and functional classification. The level of nucleotide diversity, which was highly heterogeneous across genes, was also not significantly correlated with LD. A search for selection signatures highlighted significant deviations from the standard neutral model, which could be mostly attributed to recent demographic changes. Little evidence was seen for hitchhiking and clear relationships with LD. When compared among conifer species, on average, levels of LD were similar in genes from white spruce, Norway spruce and Scots pine, whereas loblolly pine and Douglas fir genes exhibited a significantly higher LD.
MIRNA-DISTILLER: A Stand-Alone Application to Compile microRNA Data from Databases.

PubMed

Rieger, Jessica K; Bodan, Denis A; Zanger, Ulrich M

2011-01-01

MicroRNAs (miRNA) are small non-coding RNA molecules of ∼22 nucleotides which regulate large numbers of genes by binding to seed sequences at the 3'-untranslated region of target gene transcripts. The target mRNA is then usually degraded or translation is inhibited, although thus resulting in posttranscriptional down regulation of gene expression at the mRNA and/or protein level. Due to the bioinformatic difficulties in predicting functional miRNA binding sites, several publically available databases have been developed that predict miRNA binding sites based on different algorithms. The parallel use of different databases is currently indispensable, but highly uncomfortable and time consuming, especially when working with numerous genes of interest. We have therefore developed a new stand-alone program, termed MIRNA-DISTILLER, which allows to compile miRNA data for given target genes from public databases. Currently implemented are TargetScan, microCosm, and miRDB, which may be queried independently, pairwise, or together to calculate the respective intersections. Data are stored locally for application of further analysis tools including freely definable biological parameter filters, customized output-lists for both miRNAs and target genes, and various graphical facilities. The software, a data example file and a tutorial are freely available at http://www.ikp-stuttgart.de/content/language1/html/10415.asp.
MIRNA-DISTILLER: A Stand-Alone Application to Compile microRNA Data from Databases

PubMed Central

Rieger, Jessica K.; Bodan, Denis A.; Zanger, Ulrich M.

2011-01-01

MicroRNAs (miRNA) are small non-coding RNA molecules of ∼22 nucleotides which regulate large numbers of genes by binding to seed sequences at the 3′-untranslated region of target gene transcripts. The target mRNA is then usually degraded or translation is inhibited, although thus resulting in posttranscriptional down regulation of gene expression at the mRNA and/or protein level. Due to the bioinformatic difficulties in predicting functional miRNA binding sites, several publically available databases have been developed that predict miRNA binding sites based on different algorithms. The parallel use of different databases is currently indispensable, but highly uncomfortable and time consuming, especially when working with numerous genes of interest. We have therefore developed a new stand-alone program, termed MIRNA-DISTILLER, which allows to compile miRNA data for given target genes from public databases. Currently implemented are TargetScan, microCosm, and miRDB, which may be queried independently, pairwise, or together to calculate the respective intersections. Data are stored locally for application of further analysis tools including freely definable biological parameter filters, customized output-lists for both miRNAs and target genes, and various graphical facilities. The software, a data example file and a tutorial are freely available at http://www.ikp-stuttgart.de/content/language1/html/10415.asp PMID:22303335
Range-Wide Sex-Chromosome Sequence Similarity Supports Occasional XY Recombination in European Tree Frogs (Hyla arborea)

PubMed Central

Brelsford, Alan; Perrin, Nicolas

2014-01-01

In contrast with mammals and birds, most poikilothermic vertebrates feature structurally undifferentiated sex chromosomes, which may result either from frequent turnovers, or from occasional events of XY recombination. The latter mechanism was recently suggested to be responsible for sex-chromosome homomorphy in European tree frogs (Hyla arborea). However, no single case of male recombination has been identified in large-scale laboratory crosses, and populations from NW Europe consistently display sex-specific allelic frequencies with male-diagnostic alleles, suggesting the absence of recombination in their recent history. To address this apparent paradox, we extended the phylogeographic scope of investigations, by analyzing the sequences of three sex-linked markers throughout the whole species distribution. Refugial populations (southern Balkans and Adriatic coast) show a mix of X and Y alleles in haplotypic networks, and no more within-individual pairwise nucleotide differences in males than in females, testifying to recurrent XY recombination. In contrast, populations of NW Europe, which originated from a recent postglacial expansion, show a clear pattern of XY differentiation; the X and Y gametologs of the sex-linked gene Med15 present different alleles, likely fixed by drift on the front wave of expansions, and kept differentiated since. Our results support the view that sex-chromosome homomorphy in H. arborea is maintained by occasional or historical events of recombination; whether the frequency of these events indeed differs between populations remains to be clarified. PMID:24892652
Lessons from the canine Oxtr gene: populations, variants and functional aspects.

PubMed

Bence, M; Marx, P; Szantai, E; Kubinyi, E; Ronai, Z; Banlaki, Z

2017-04-01

Oxytocin receptor (OXTR) acts as a key behavioral modulator of the central nervous system, affecting social behavior, stress, affiliation and cognitive functions. Variants of the Oxtr gene are known to influence behavior both in animals and humans; however, canine Oxtr polymorphisms are less characterized in terms of possible relevance to function, selection criteria in breeding and domestication. In this report, we provide a detailed characterization of common variants of the canine Oxtr gene. In particular (1) novel polymorphisms were identified by direct sequencing of wolf and dog samples, (2) allelic distributions and pairwise linkage disequilibrium patterns of several canine populations were compared, (3) neighbor joining (NJ) tree based on common single nucleotide polymorphisms (SNPs) was constructed, (4) mRNA expression features were assessed, (5) a novel splice variant was detected and (6) in vitro functional assays were performed. Results indicate marked differences regarding Oxtr variations between purebred dogs of different breeds, free-ranging dog populations, wolf subspecies and golden jackals. This, together with existence of explicitly dog-specific alleles and data obtained from the NJ tree implies that Oxtr could indeed have been a target gene during domestication and selection for human preferred aspects of temperament and social behavior. This assumption is further supported by the present observations on gene expression patterns within the brain and luciferase reporter experiments, providing a molecular level link between certain canine Oxtr polymorphisms and differences in nervous system function and behavior. © 2016 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.
Introducing difference recurrence relations for faster semi-global alignment of long sequences.

PubMed

Suzuki, Hajime; Kasahara, Masahiro

2018-02-19

The read length of single-molecule DNA sequencers is reaching 1 Mb. Popular alignment software tools widely used for analyzing such long reads often take advantage of single-instruction multiple-data (SIMD) operations to accelerate calculation of dynamic programming (DP) matrices in the Smith-Waterman-Gotoh (SWG) algorithm with a fixed alignment start position at the origin. Nonetheless, 16-bit or 32-bit integers are necessary for storing the values in a DP matrix when sequences to be aligned are long; this situation hampers the use of the full SIMD width of modern processors. We proposed a faster semi-global alignment algorithm, "difference recurrence relations," that runs more rapidly than the state-of-the-art algorithm by a factor of 2.1. Instead of calculating and storing all the values in a DP matrix directly, our algorithm computes and stores mainly the differences between the values of adjacent cells in the matrix. Although the SWG algorithm and our algorithm can output exactly the same result, our algorithm mainly involves 8-bit integer operations, enabling us to exploit the full width of SIMD operations (e.g., 32) on modern processors. We also developed a library, libgaba, so that developers can easily integrate our algorithm into alignment programs. Our novel algorithm and optimized library implementation will facilitate accelerating nucleotide long-read analysis algorithms that use pairwise alignment stages. The library is implemented in the C programming language and available at https://github.com/ocxtal/libgaba .
In-silico Taxonomic Classification of 373 Genomes Reveals Species Misidentification and New Genospecies within the Genus Pseudomonas

PubMed Central

Tran, Phuong N.; Savka, Michael A.; Gan, Han Ming

2017-01-01

The genus Pseudomonas has one of the largest diversity of species within the Bacteria kingdom. To date, its taxonomy is still being revised and updated. Due to the non-standardized procedure and ambiguous thresholds at species level, largely based on 16S rRNA gene or conventional biochemical assay, species identification of publicly available Pseudomonas genomes remains questionable. In this study, we performed a large-scale analysis of all Pseudomonas genomes with species designation (excluding the well-defined P. aeruginosa) and re-evaluated their taxonomic assignment via in silico genome-genome hybridization and/or genetic comparison with valid type species. Three-hundred and seventy-three pseudomonad genomes were analyzed and subsequently clustered into 145 distinct genospecies. We detected 207 erroneous labels and corrected 43 to the proper species based on Average Nucleotide Identity Multilocus Sequence Typing (MLST) sequence similarity to the type strain. Surprisingly, more than half of the genomes initially designated as Pseudomonas syringae and Pseudomonas fluorescens should be classified either to a previously described species or to a new genospecies. Notably, high pairwise average nucleotide identity (>95%) indicating species-level similarity was observed between P. synxantha-P. libanensis, P. psychrotolerans–P. oryzihabitans, and P. kilonensis- P. brassicacearum, that were previously differentiated based on conventional biochemical tests and/or genome-genome hybridization techniques. PMID:28747902
Population genetic structure of the mantis shrimp Oratosquilla oratoria (Crustacea: Squillidae) in the Yellow Sea and East China Sea

NASA Astrophysics Data System (ADS)

Yang, Mei; Li, Xinzheng

2017-09-01

The mantis shrimp Oratosquilla oratoria is an ecologically and economically important species in the Western Pacific. In present study, the population genetic structure of Oratosquilla oratoria from the Yellow Sea and East China Sea was examined with mitochondrial DNA control region sequences. In total, 394 samples were collected from 18 locations and 102 haplotypes were obtained. For the Yellow Sea, the overall nucleotide diversity and haplotype diversity were 0.006 9 and 0.946 8, respectively; while across all the East China Sea locations, the overall nucleotide diversity and haplotype diversity were 0.027 94 and 0.979 0, respectively. The results of AMOVA and pairwise F ST (0.145 2, P <0.001) revealed moderate differentiation between the Yellow Sea and East China Sea populations of O. oratoria. However, neither the neighbor-joining tree nor haplotype network showed clades with geographic pattern, which indicated considerable gene flow was existed between the Yellow Sea and East China Sea, and supporting the high larval dispersal ability in this species. Mismatch distribution analysis and neutrality tests suggested that O. oratoria has undergone population expansion event, and the Pleistocene glacial cycles might have an impact on the historical demography of O. oratoria. The genetic information obtained in this study can provide useful information for sustainable improvements for capture fisheries management strategies.
The structural basis of actinomycin D–binding induces nucleotide flipping out, a sharp bend and a left-handed twist in CGG triplet repeats

PubMed Central

Lo, Yu-Sheng; Tseng, Wen-Hsuan; Chuang, Chien-Ying; Hou, Ming-Hon

2013-01-01

The potent anticancer drug actinomycin D (ActD) functions by intercalating into DNA at GpC sites, thereby interrupting essential biological processes including replication and transcription. Certain neurological diseases are correlated with the expansion of (CGG)n trinucleotide sequences, which contain many contiguous GpC sites separated by a single G:G mispair. To characterize the binding of ActD to CGG triplet repeat sequences, the structural basis for the strong binding of ActD to neighbouring GpC sites flanking a G:G mismatch has been determined based on the crystal structure of ActD bound to ATGCGGCAT, which contains a CGG triplet sequence. The binding of ActD molecules to GCGGC causes many unexpected conformational changes including nucleotide flipping out, a sharp bend and a left-handed twist in the DNA helix via a two site-binding model. Heat denaturation, circular dichroism and surface plasmon resonance analyses showed that adjacent GpC sequences flanking a G:G mismatch are preferred ActD-binding sites. In addition, ActD was shown to bind the hairpin conformation of (CGG)16 in a pairwise combination and with greater stability than that of other DNA intercalators. Our results provide evidence of a possible biological consequence of ActD binding to CGG triplet repeat sequences. PMID:23408860

APOLLO: a quality assessment service for single and multiple protein models.

PubMed

Wang, Zheng; Eickholt, Jesse; Cheng, Jianlin

2011-06-15

We built a web server named APOLLO, which can evaluate the absolute global and local qualities of a single protein model using machine learning methods or the global and local qualities of a pool of models using a pair-wise comparison approach. Based on our evaluations on 107 CASP9 (Critical Assessment of Techniques for Protein Structure Prediction) targets, the predicted quality scores generated from our machine learning and pair-wise methods have an average per-target correlation of 0.671 and 0.917, respectively, with the true model quality scores. Based on our test on 92 CASP9 targets, our predicted absolute local qualities have an average difference of 2.60 Å with the actual distances to native structure. http://sysbio.rnet.missouri.edu/apollo/. Single and pair-wise global quality assessment software is also available at the site.
Adaptive multi-view clustering based on nonnegative matrix factorization and pairwise co-regularization

NASA Astrophysics Data System (ADS)

Zhang, Tianzhen; Wang, Xiumei; Gao, Xinbo

2018-04-01

Nowadays, several datasets are demonstrated by multi-view, which usually include shared and complementary information. Multi-view clustering methods integrate the information of multi-view to obtain better clustering results. Nonnegative matrix factorization has become an essential and popular tool in clustering methods because of its interpretation. However, existing nonnegative matrix factorization based multi-view clustering algorithms do not consider the disagreement between views and neglects the fact that different views will have different contributions to the data distribution. In this paper, we propose a new multi-view clustering method, named adaptive multi-view clustering based on nonnegative matrix factorization and pairwise co-regularization. The proposed algorithm can obtain the parts-based representation of multi-view data by nonnegative matrix factorization. Then, pairwise co-regularization is used to measure the disagreement between views. There is only one parameter to auto learning the weight values according to the contribution of each view to data distribution. Experimental results show that the proposed algorithm outperforms several state-of-the-arts algorithms for multi-view clustering.
Consistency-based rectification of nonrigid registrations

PubMed Central

Gass, Tobias; Székely, Gábor; Goksel, Orcun

2015-01-01

Abstract. We present a technique to rectify nonrigid registrations by improving their group-wise consistency, which is a widely used unsupervised measure to assess pair-wise registration quality. While pair-wise registration methods cannot guarantee any group-wise consistency, group-wise approaches typically enforce perfect consistency by registering all images to a common reference. However, errors in individual registrations to the reference then propagate, distorting the mean and accumulating in the pair-wise registrations inferred via the reference. Furthermore, the assumption that perfect correspondences exist is not always true, e.g., for interpatient registration. The proposed consistency-based registration rectification (CBRR) method addresses these issues by minimizing the group-wise inconsistency of all pair-wise registrations using a regularized least-squares algorithm. The regularization controls the adherence to the original registration, which is additionally weighted by the local postregistration similarity. This allows CBRR to adaptively improve consistency while locally preserving accurate pair-wise registrations. We show that the resulting registrations are not only more consistent, but also have lower average transformation error when compared to known transformations in simulated data. On clinical data, we show improvements of up to 50% target registration error in breathing motion estimation from four-dimensional MRI and improvements in atlas-based segmentation quality of up to 65% in terms of mean surface distance in three-dimensional (3-D) CT. Such improvement was observed consistently using different registration algorithms, dimensionality (two-dimensional/3-D), and modalities (MRI/CT). PMID:26158083
From pairwise to group interactions in games of cyclic dominance.

PubMed

Szolnoki, Attila; Vukov, Jeromos; Perc, Matjaž

2014-06-01

We study the rock-paper-scissors game in structured populations, where the invasion rates determine individual payoffs that govern the process of strategy change. The traditional version of the game is recovered if the payoffs for each potential invasion stem from a single pairwise interaction. However, the transformation of invasion rates to payoffs also allows the usage of larger interaction ranges. In addition to the traditional pairwise interaction, we therefore consider simultaneous interactions with all nearest neighbors, as well as with all nearest and next-nearest neighbors, thus effectively going from single pair to group interactions in games of cyclic dominance. We show that differences in the interaction range affect not only the stationary fractions of strategies but also their relations of dominance. The transition from pairwise to group interactions can thus decelerate and even revert the direction of the invasion between the competing strategies. Like in evolutionary social dilemmas, in games of cyclic dominance, too, the indirect multipoint interactions that are due to group interactions hence play a pivotal role. Our results indicate that, in addition to the invasion rates, the interaction range is at least as important for the maintenance of biodiversity among cyclically competing strategies.
A Proposed Genus Boundary for the Prokaryotes Based on Genomic Insights

PubMed Central

Qin, Qi-Long; Xie, Bin-Bin; Zhang, Xi-Ying; Chen, Xiu-Lan; Zhou, Bai-Cheng; Zhou, Jizhong; Oren, Aharon

2014-01-01

Genomic information has already been applied to prokaryotic species definition and classification. However, the contribution of the genome sequence to prokaryotic genus delimitation has been less studied. To gain insights into genus definition for the prokaryotes, we attempted to reveal the genus-level genomic differences in the current prokaryotic classification system and to delineate the boundary of a genus on the basis of genomic information. The average nucleotide sequence identity between two genomes can be used for prokaryotic species delineation, but it is not suitable for genus demarcation. We used the percentage of conserved proteins (POCP) between two strains to estimate their evolutionary and phenotypic distance. A comprehensive genomic survey indicated that the POCP can serve as a robust genomic index for establishing the genus boundary for prokaryotic groups. Basically, two species belonging to the same genus would share at least half of their proteins. In a specific lineage, the genus and family/order ranks showed slight or no overlap in terms of POCP values. A prokaryotic genus can be defined as a group of species with all pairwise POCP values higher than 50%. Integration of whole-genome data into the current taxonomy system can provide comprehensive information for prokaryotic genus definition and delimitation. PMID:24706738
Evaluation of genetic diversity of Panicum turgidum Forssk from Saudi Arabia.

PubMed

Assaeed, Abdulaziz M; Al-Faifi, Sulieman A; Migdadi, Hussein M; El-Bana, Magdy I; Al Qarawi, Abdulaziz A; Khan, Mohammad Altaf

2018-01-01

The genetic diversity of 177 accessions of Panicum turgidum Forssk, representing ten populations collected from four geographical regions in Saudi Arabia, was analyzed using amplified fragment length polymorphism (AFLP) markers. A set of four primer-pairs with two/three selective nucleotides scored 836 AFLP amplified fragments (putative loci/genome landmarks), all of which were polymorphic. Populations collected from the southern region of the country showed the highest genetic diversity parameters, whereas those collected from the central regions showed the lowest values. Analysis of molecular variance (AMOVA) revealed that 78% of the genetic variability was attributable to differences within populations. Pairwise values for population differentiation and genetic structure were statistically significant for all variances. The UPGMA dendrogram, validated by principal coordinate analysis-grouped accessions, corresponded to the geographical origin of the accessions. Mantel's test showed that there was a significant correlation between the genetic and geographical distances ( r = 0.35, P < 0.04). In summary, the AFLP assay demonstrated the existence of substantial genetic variation in P. turgidum . The relationship between the genetic diversity and geographical source of P. turgidum populations of Saudi Arabia, as revealed through this comprehensive study, will enable effective resource management and restoration of new areas without compromising adaptation and genetic diversity.
Genetic characterization and phylogenetic analysis of Eimeria arloingi in Iranian native kids.

PubMed

Khodakaram-Tafti, A; Hashemnia, M; Razavi, S M; Sharifiyazdi, H; Nazifi, S

2013-09-01

Among the 16 species of Eimeria from goats, Eimeria arloingi and Eimeria ninakohlyakimovae are regarded as the most pathogenic species in the world and cause clinical caprine coccidiosis. E. arloingi is known to be an important cause of coccidiosis in Iranian kids. Molecular analyses of two portions of nuclear ribosomal DNA (internal transcribed spacer1 (ITS1) and 18S rDNA) were used for the genetic characterization of the E. arloingi. Comparison of the sequencing data of E. arloingi obtained in the present study (ITS1: KC507793 and 18S rDNA: KC507792) with other Eimeria species in the GenBank database revealed a particularly close relationship between E. arloingi and Eimeria spp. from the cattle and sheep. The phylogram based on the ITS1 sequences shows that the E. arloingi, Eimeria bovis, and Eimeria zuernii formed a distinct group separate from the other remaining Eimeria spp. in cattle and poultry. In pairwise alignment, 18S rDNA sequence derived from E. arloingi showed 99% similarity to Eimeria ahsata with differences observed at only three nucleotides. This study showed that the ITS1 and 18S rDNA gene are useful genetic markers for the specific identification and differentiation of Eimeria spp. in ruminants.
The genetic relationship between extirpated and contemporary Atlantic salmon Salmo salar L. lines from the southern Baltic Sea.

PubMed

Bernaś, Rafał; Poćwierz-Kotus, Anita; Dębowski, Piotr; Wenne, Roman

2016-04-01

The genetic relationship between original Atlantic salmon populations that are now extinct in the southern Baltic Sea and the present-day populations has long been controversial. To investigate and clarify this issue, we successfully genotyped individuals of the historical populations from the Oder and Vistula Rivers using DNA extracted from dried scales with the Atlantic salmon single nucleotide polymorphism array. Our results showed a global F ST of 0.2515 for all pairs of loci, which indicates a high level of genetic differentiation among the groups analyzed in this study. Pairwise F ST values were significant for all comparisons and the highest values were found between present-day reintroduced Slupia River salmon and extinct Vistula River Atlantic salmon. Bayesian analysis of genetic structure revealed the existence of substructures in the extirpated Polish populations and three main clades among studied stocks. The historical salmon population from the Oder River was genetically closer to present-day salmon from the Neman River than to the historical salmon from the Vistula River. Vistula salmon clearly separated from all other analyzed salmon stocks. It is likely that the origins of the Atlantic salmon population from the Morrum River and the Polish historical native populations are different.
Causal analysis of ordinal treatments and binary outcomes under truncation by death.

PubMed

Wang, Linbo; Richardson, Thomas S; Zhou, Xiao-Hua

2017-06-01

It is common that in multi-arm randomized trials, the outcome of interest is "truncated by death," meaning that it is only observed or well-defined conditioning on an intermediate outcome. In this case, in addition to pairwise contrasts, the joint inference for all treatment arms is also of interest. Under a monotonicity assumption we present methods for both pairwise and joint causal analyses of ordinal treatments and binary outcomes in presence of truncation by death. We illustrate via examples the appropriateness of our assumptions in different scientific contexts.
Experimental characterization of pairwise correlations from triple quantum correlated beams generated by cascaded four-wave mixing processes

NASA Astrophysics Data System (ADS)

Wang, Wei; Cao, Leiming; Lou, Yanbo; Du, Jinjian; Jing, Jietai

2018-01-01

We theoretically and experimentally characterize the performance of the pairwise correlations from triple quantum correlated beams based on the cascaded four-wave mixing (FWM) processes. The pairwise correlations between any two of the beams are theoretically calculated and experimentally measured. The experimental and theoretical results are in good agreement. We find that two of the three pairwise correlations can be in the quantum regime. The other pairwise correlation is always in the classical regime. In addition, we also measure the triple-beam correlation which is always in the quantum regime. Such unbalanced and controllable pairwise correlation structures may be taken as advantages in practical quantum communications, for example, hierarchical quantum secret sharing. Our results also open the way for the classification and application of quantum states generated from the cascaded FWM processes.
Correlations and Functional Connections in a Population of Grid Cells

PubMed Central

Roudi, Yasser

2015-01-01

We study the statistics of spike trains of simultaneously recorded grid cells in freely behaving rats. We evaluate pairwise correlations between these cells and, using a maximum entropy kinetic pairwise model (kinetic Ising model), study their functional connectivity. Even when we account for the covariations in firing rates due to overlapping fields, both the pairwise correlations and functional connections decay as a function of the shortest distance between the vertices of the spatial firing pattern of pairs of grid cells, i.e. their phase difference. They take positive values between cells with nearby phases and approach zero or negative values for larger phase differences. We find similar results also when, in addition to correlations due to overlapping fields, we account for correlations due to theta oscillations and head directional inputs. The inferred connections between neurons in the same module and those from different modules can be both negative and positive, with a mean close to zero, but with the strongest inferred connections found between cells of the same module. Taken together, our results suggest that grid cells in the same module do indeed form a local network of interconnected neurons with a functional connectivity that supports a role for attractor dynamics in the generation of grid pattern. PMID:25714908
Extent of linkage disequilibrium, consistency of gametic phase, and imputation accuracy within and across Canadian dairy breeds.

PubMed

Larmer, S G; Sargolzaei, M; Schenkel, F S

2014-05-01

Genomic selection requires a large reference population to accurately estimate single nucleotide polymorphism (SNP) effects. In some Canadian dairy breeds, the available reference populations are not large enough for accurate estimation of SNP effects for traits of interest. If marker phase is highly consistent across multiple breeds, it is theoretically possible to increase the accuracy of genomic prediction for one or all breeds by pooling several breeds into a common reference population. This study investigated the extent of linkage disequilibrium (LD) in 5 major dairy breeds using a 50,000 (50K) SNP panel and 3 of the same breeds using the 777,000 (777K) SNP panel. Correlation of pair-wise SNP phase was also investigated on both panels. The level of LD was measured using the squared correlation of alleles at 2 loci (r(2)), and the consistency of SNP gametic phases was correlated using the signed square root of these values. Because of the high cost of the 777K panel, the accuracy of imputation from lower density marker panels [6,000 (6K) or 50K] was examined both within breed and using a multi-breed reference population in Holstein, Ayrshire, and Guernsey. Imputation was carried out using FImpute V2.2 and Beagle 3.3.2 software. Imputation accuracies were then calculated as both the proportion of correct SNP filled in (concordance rate) and allelic R(2). Computation time was also explored to determine the efficiency of the different algorithms for imputation. Analysis showed that LD values >0.2 were found in all breeds at distances at or shorter than the average adjacent pair-wise distance between SNP on the 50K panel. Correlations of r-values, however, did not reach high levels (<0.9) at these distances. High correlation values of SNP phase between breeds were observed (>0.94) when the average pair-wise distances using the 777K SNP panel were examined. High concordance rate (0.968-0.995) and allelic R(2) (0.946-0.991) were found for all breeds when imputation was carried out with FImpute from 50K to 777K. Imputation accuracy for Guernsey and Ayrshire was slightly lower when using the imputation method in Beagle. Computing time was significantly greater when using Beagle software, with all comparable procedures being 9 to 13 times less efficient, in terms of time, compared with FImpute. These findings suggest that use of a multi-breed reference population might increase prediction accuracy using the 777K SNP panel and that 777K genotypes can be efficiently and effectively imputed using the lower density 50K SNP panel. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Genetic structure of the Caribbean giant barrel sponge Xestospongia muta using the I3-M11 partition of COI

NASA Astrophysics Data System (ADS)

López-Legentil, S.; Pawlik, J. R.

2009-03-01

In recent years, reports of sponge bleaching, disease, and subsequent mortality have increased alarmingly. Population recovery may depend strongly on colonization capabilities of the affected species. The giant barrel sponge Xestospongia muta is a dominant reef constituent in the Caribbean. However, little is known about its population structure and gene flow. The 5'-end fragment of the mitochondrial gene cytochrome oxidase subunit I is often used to address these kinds of questions, but it presents very low intraspecific nucleotide variability in sponges. In this study, the usefulness of the I3-M11 partition of COI to determine the genetic structure of X. muta was tested for seven populations from Florida, the Bahamas and Belize. A total of 116 sequences of 544 bp were obtained for the I3-M11 partition corresponding to four haplotypes. In order to make a comparison with the 5'-end partition, 10 sequences per haplotype were analyzed for this fragment. The 40 resulting sequences were of 569 bp and corresponded to two haplotypes. The nucleotide diversity of the I3-M11 partition (π = 0.00386) was higher than that of the 5'-end partition (π = 0.00058), indicating better resolution at the intraspecific level. Sponges with the most divergent external morphologies (smooth vs. digitate surface) had different haplotypes, while those with the most common external morphology (rough surface) presented a mixture of haplotypes. Pairwise tests for genetic differentiation among geographic locations based on F ST values showed significant genetic divergence between most populations, but this genetic differentiation was not due to isolation by distance. While limited larval dispersal may have led to differentiation among some of the populations, the patterns of genetic structure appear to be most strongly related to patterns of ocean currents. Therefore, hydrological features may play a major role in sponge colonization and need to be considered in future plans for management and conservation of these important components of coral reef ecosystems.
Amino acid positions subject to multiple coevolutionary constraints can be robustly identified by their eigenvector network centrality scores.

PubMed

Parente, Daniel J; Ray, J Christian J; Swint-Kruse, Liskin

2015-12-01

As proteins evolve, amino acid positions key to protein structure or function are subject to mutational constraints. These positions can be detected by analyzing sequence families for amino acid conservation or for coevolution between pairs of positions. Coevolutionary scores are usually rank-ordered and thresholded to reveal the top pairwise scores, but they also can be treated as weighted networks. Here, we used network analyses to bypass a major complication of coevolution studies: For a given sequence alignment, alternative algorithms usually identify different, top pairwise scores. We reconciled results from five commonly-used, mathematically divergent algorithms (ELSC, McBASC, OMES, SCA, and ZNMI), using the LacI/GalR and 1,6-bisphosphate aldolase protein families as models. Calculations used unthresholded coevolution scores from which column-specific properties such as sequence entropy and random noise were subtracted; "central" positions were identified by calculating various network centrality scores. When compared among algorithms, network centrality methods, particularly eigenvector centrality, showed markedly better agreement than comparisons of the top pairwise scores. Positions with large centrality scores occurred at key structural locations and/or were functionally sensitive to mutations. Further, the top central positions often differed from those with top pairwise coevolution scores: instead of a few strong scores, central positions often had multiple, moderate scores. We conclude that eigenvector centrality calculations reveal a robust evolutionary pattern of constraints-detectable by divergent algorithms--that occur at key protein locations. Finally, we discuss the fact that multiple patterns coexist in evolutionary data that, together, give rise to emergent protein functions. © 2015 Wiley Periodicals, Inc.
SLC11A1 polymorphisms and host susceptibility to cutaneous leishmaniasis in Pakistan.

PubMed

Sophie, Mariam; Hameed, Abdul; Muneer, Akhtar; Samdani, Azam J; Saleem, Saima; Azhar, Abid

2017-01-07

The vector-borne cutaneous leishmaniasis (CL) is endemic in several regions of Pakistan mainly affecting poor populations. Host genetic factors, particularly SLC11A1 (solute carrier transmembrane protein) within macrophages, play a crucial role in disease pathology and susceptibility. Association of SLC11A1 with cutaneous leishmaniasis, a neglected tropical disease, is not well established. Inconsistencies have been observed within different populations worldwide with respect to genetic susceptibility. This study was designed to investigate genetic variation(s) in SLC11A1 and to assess possible association with cutaneous leishmaniasis in Pakistan. Eight polymorphisms (rs2276631, rs3731864, rs2290708, rs2695342, rs201565523, rs17215556, rs17235409, rs17235416) were genotyped across SLC11A1 in 274 patients and 119 healthy controls. Six polymorphisms were studied by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) and sequencing. Two single nucleotide polymorphisms were analyzed with newly designed semi-nested PCR assays. Case-control analysis showed no association between selected polymorphisms in SLC11A1 and cutaneous leishmaniasis. No significant difference was observed in the distribution of alleles between leishmaniasis patients and healthy individuals. Strong pairwise linkage disequilibrium was observed between rs2276631 and rs2290708 (r 2 = 64); and rs17235409 and rs17235416 (r 2 = 78). This study shows that genetic variations in the candidate gene SLC11A1 do not affect susceptibility to cutaneous leishmaniasis in the sample population from Pakistan.
Investigating the Genetic Architecture of the PR Interval Using Clinical Phenotypes.

PubMed

Mosley, Jonathan D; Shoemaker, M Benjamin; Wells, Quinn S; Darbar, Dawood; Shaffer, Christian M; Edwards, Todd L; Bastarache, Lisa; McCarty, Catherine A; Thompson, Will; Chute, Christopher G; Jarvik, Gail P; Crosslin, David R; Larson, Eric B; Kullo, Iftikhar J; Pacheco, Jennifer A; Peissig, Peggy L; Brilliant, Murray H; Linneman, James G; Witte, John S; Denny, Josh C; Roden, Dan M

2017-04-01

One potential use for the PR interval is as a biomarker of disease risk. We hypothesized that quantifying the shared genetic architectures of the PR interval and a set of clinical phenotypes would identify genetic mechanisms contributing to PR variability and identify diseases associated with a genetic predictor of PR variability. We used ECG measurements from the ARIC study (Atherosclerosis Risk in Communities; n=6731 subjects) and 63 genetically modulated diseases from the eMERGE network (Electronic Medical Records and Genomics; n=12 978). We measured pairwise genetic correlations (rG) between PR phenotypes (PR interval, PR segment, P-wave duration) and each of the 63 phenotypes. The PR segment was genetically correlated with atrial fibrillation (rG=-0.88; P =0.0009). An analysis of metabolic phenotypes in ARIC also showed that the P wave was genetically correlated with waist circumference (rG=0.47; P =0.02). A genetically predicted PR interval phenotype based on 645 714 single-nucleotide polymorphisms was associated with atrial fibrillation (odds ratio=0.89 per SD change; 95% confidence interval, 0.83-0.95; P =0.0006). The differing pattern of associations among the PR phenotypes is consistent with analyses that show that the genetic correlation between the P wave and PR segment was not significantly different from 0 (rG=-0.03 [0.16]). The genetic architecture of the PR interval comprises modulators of atrial fibrillation risk and obesity. © 2017 American Heart Association, Inc.
Manipulation of Karyotype in Caenorhabditis elegans Reveals Multiple Inputs Driving Pairwise Chromosome Synapsis During Meiosis

PubMed Central

Roelens, Baptiste; Schvarzstein, Mara; Villeneuve, Anne M.

2015-01-01

Meiotic chromosome segregation requires pairwise association between homologs, stabilized by the synaptonemal complex (SC). Here, we investigate factors contributing to pairwise synapsis by investigating meiosis in polyploid worms. We devised a strategy, based on transient inhibition of cohesin function, to generate polyploid derivatives of virtually any Caenorhabditis elegans strain. We exploited this strategy to investigate the contribution of recombination to pairwise synapsis in tetraploid and triploid worms. In otherwise wild-type polyploids, chromosomes first sort into homolog groups, then multipartner interactions mature into exclusive pairwise associations. Pairwise synapsis associations still form in recombination-deficient tetraploids, confirming a propensity for synapsis to occur in a strictly pairwise manner. However, the transition from multipartner to pairwise association was perturbed in recombination-deficient triploids, implying a role for recombination in promoting this transition when three partners compete for synapsis. To evaluate the basis of synapsis partner preference, we generated polyploid worms heterozygous for normal sequence and rearranged chromosomes sharing the same pairing center (PC). Tetraploid worms had no detectable preference for identical partners, indicating that PC-adjacent homology drives partner choice in this context. In contrast, triploid worms exhibited a clear preference for identical partners, indicating that homology outside the PC region can influence partner choice. Together, our findings, suggest a two-phase model for C. elegans synapsis: an early phase, in which initial synapsis interactions are driven primarily by recombination-independent assessment of homology near PCs and by a propensity for pairwise SC assembly, and a later phase in which mature synaptic interactions are promoted by recombination. PMID:26500263
Structure based alignment and clustering of proteins (STRALCP)

DOEpatents

Zemla, Adam T.; Zhou, Carol E.; Smith, Jason R.; Lam, Marisa W.

2013-06-18

Disclosed are computational methods of clustering a set of protein structures based on local and pair-wise global similarity values. Pair-wise local and global similarity values are generated based on pair-wise structural alignments for each protein in the set of protein structures. Initially, the protein structures are clustered based on pair-wise local similarity values. The protein structures are then clustered based on pair-wise global similarity values. For each given cluster both a representative structure and spans of conserved residues are identified. The representative protein structure is used to assign newly-solved protein structures to a group. The spans are used to characterize conservation and assign a "structural footprint" to the cluster.
Candida uthaithanina sp. nov., an anamorphic yeast species in Nakaseomyces clade isolated in Thailand.

PubMed

Limtong, Savitree; Jindamorakot, Sasitorn; Am-In, Somjit; Kaewwichian, Rungluk; Nitiyon, Sukanya; Yongmanitchai, Wichien; Nakase, Takashi

2011-05-01

Three yeast stains were isolated from two unknown fruits (strains DD2-22-1(T) and SK44) and moss (strain ST-449) in Thailand. Analysis of the D1/D2 domain of the large subunit (LSU) rRNA gene sequences of the three strains revealed that they belonged to the same species. In terms of pairwise sequence similarity, Candida cf. glabrata UWO(PS) 98-110.4 and Candida nivariensis were the closest undescribed and recognized taxa, but the levels of nucleotide substitutions were 1.7-1.9% and 2.0-2.2%, respectively. The levels of nucleotide substitutions were sufficient to justify the description of a separate species of Candida. In the phylogenetic tree based on the D1/D2 domain of the LSU rRNA gene the three strains were placed in a separate branch in the Nakaseomyces clade with C. cf. glabrata UWO(PS)98-110.4, C. nivariensis, Candida glabrata, Candida bracarensis, Candida kungkrabaensis and Nakaseomyces delphensis. Phenotypic characteristics of the three strains were similar which included proliferation by multilateral budding, absence of ascospores, arthrospores or ballistospores; negative for Diazonium blue B and urease tests. The major ubiquinone was Q-6. On the basis of the above findings, the three strains were assigned to a single novel species of Candida, for which the name Candida uthaithanina sp. nov is proposed. The type strain is DD2-22-1(T) (= BCC 29899(T) = NBRC 104876(T) = CBS 10932(T)).
Population genetic implications from sequence variation in four Y chromosome genes.

PubMed

Shen, P; Wang, F; Underhill, P A; Franco, C; Yang, W H; Roxas, A; Sung, R; Lin, A A; Hyman, R W; Vollrath, D; Davis, R W; Cavalli-Sforza, L L; Oefner, P J

2000-06-20

Some insight into human evolution has been gained from the sequencing of four Y chromosome genes. Primary genomic sequencing determined gene SMCY to be composed of 27 exons that comprise 4,620 bp of coding sequence. The unfinished sequencing of the 5' portion of gene UTY1 was completed by primer walking, and a total of 20 exons were found. By using denaturing HPLC, these two genes, as well as DBY and DFFRY, were screened for polymorphic sites in 53-72 representatives of the five continents. A total of 98 variants were found, yielding nucleotide diversity estimates of 2.45 x 10(-5), 5. 07 x 10(-5), and 8.54 x 10(-5) for the coding regions of SMCY, DFFRY, and UTY1, respectively, with no variant having been observed in DBY. In agreement with most autosomal genes, diversity estimates for the noncoding regions were about 2- to 3-fold higher and ranged from 9. 16 x 10(-5) to 14.2 x 10(-5) for the four genes. Analysis of the frequencies of derived alleles for all four genes showed that they more closely fit the expectation of a Luria-Delbrück distribution than a distribution expected under a constant population size model, providing evidence for exponential population growth. Pairwise nucleotide mismatch distributions date the occurrence of population expansion to approximately 28,000 years ago. This estimate is in accord with the spread of Aurignacian technology and the disappearance of the Neanderthals.

Genomic analysis of the Chinese genotype 1F rubella virus that disappeared after 2002 in China.

PubMed

Zhu, Zhen; Chen, Min-Hsin; Abernathy, Emily; Zhou, Shujie; Wang, Changyin; Icenogle, Joseph; Xu, Wenbo

2014-12-01

Genotype 1F was likely localized geographically to China as it has not been reported elsewhere. In this study, whole genome sequences of two rubella 1F virus isolates were completed. Both viruses contained 9,761 nt with a single nucleotide deletion in the intergenic region, compared to the NCBI rubella reference sequence (NC 001545). No evidence of recombination was found between 1F and other rubella viruses. The genetic distance between 1F viruses and 10 other rubella virus genotypes (1a, 1B, 1C, 1D, 1E, 1G, 1J 2A, 2B, and 2C) ranged from 3.9% to 8.6% by pairwise comparison. A region known to be hypervariable in other rubella genotypes was also the most variable region in the 1F genomes. Comparisons to all available rubella virus sequences from GenBank identified 22 nucleotide variations exclusively in 1F viruses. Among these unique variations, C9306U is located within the recommended molecular window for rubella virus genotyping assignment, could be useful to confirm 1F viruses. Using the Bayesian Markov Chain Monte Carlo (MCMC) method, the time of the most recent common ancestor for the genotype 1F was estimated between 1976 and 1995. Recent rubella molecular surveillance suggests that this indigenous strain may have circulated for less than three decades, as it has not been detected since 2002. © 2014 Wiley Periodicals, Inc.
MOSAIC: an online database dedicated to the comparative genomics of bacterial strains at the intra-species level.

PubMed

Chiapello, Hélène; Gendrault, Annie; Caron, Christophe; Blum, Jérome; Petit, Marie-Agnès; El Karoui, Meriem

2008-11-27

The recent availability of complete sequences for numerous closely related bacterial genomes opens up new challenges in comparative genomics. Several methods have been developed to align complete genomes at the nucleotide level but their use and the biological interpretation of results are not straightforward. It is therefore necessary to develop new resources to access, analyze, and visualize genome comparisons. Here we present recent developments on MOSAIC, a generalist comparative bacterial genome database. This database provides the bacteriologist community with easy access to comparisons of complete bacterial genomes at the intra-species level. The strategy we developed for comparison allows us to define two types of regions in bacterial genomes: backbone segments (i.e., regions conserved in all compared strains) and variable segments (i.e., regions that are either specific to or variable in one of the aligned genomes). Definition of these segments at the nucleotide level allows precise comparative and evolutionary analyses of both coding and non-coding regions of bacterial genomes. Such work is easily performed using the MOSAIC Web interface, which allows browsing and graphical visualization of genome comparisons. The MOSAIC database now includes 493 pairwise comparisons and 35 multiple maximal comparisons representing 78 bacterial species. Genome conserved regions (backbones) and variable segments are presented in various formats for further analysis. A graphical interface allows visualization of aligned genomes and functional annotations. The MOSAIC database is available online at http://genome.jouy.inra.fr/mosaic.
Bacillus wiedmannii sp. nov., a psychrotolerant and cytotoxic Bacillus cereus group species isolated from dairy foods and dairy environments

PubMed Central

Miller, Rachel A.; Beno, Sarah M.; Kent, David J.; Carroll, Laura M.; Martin, Nicole H.; Boor, Kathryn J.

2016-01-01

A facultatively anaerobic, spore-forming Bacillus strain, FSL W8-0169T, collected from raw milk stored in a silo at a dairy powder processing plant in the north-eastern USA was initially identified as a Bacillus cereus group species based on a partial sequence of the rpoB gene and 16S rRNA gene sequence. Analysis of core genome single nucleotide polymorphisms clustered this strain separately from known B. cereus group species. Pairwise average nucleotide identity blast values obtained for FSL W8-0169T compared to the type strains of existing B. cereus group species were <95 % and predicted DNA–DNA hybridization values were <70 %, suggesting that this strain represents a novel B. cereus group species. We characterized 10 additional strains with the same or closely related rpoB allelic type, by whole genome sequencing and phenotypic analyses. Phenotypic characterization identified a higher content of iso-C16 : 0 fatty acid and the combined inability to ferment sucrose or to hydrolyse arginine as the key characteristics differentiating FSL W8-0169T from other B. cereus group species. FSL W8-0169T is psychrotolerant, produces haemolysin BL and non-haemolytic enterotoxin, and is cytotoxic in a HeLa cell model. The name Bacillus wiedmannii sp. nov. is proposed for the novel species represented by the type strain FSL W8-0169T (=DSM 102050T=LMG 29269T). PMID:27520992
Criterion Predictability: Identifying Differences Between [r-squares

ERIC Educational Resources Information Center

Malgady, Robert G.

1976-01-01

An analysis of variance procedure for testing differences in r-squared, the coefficient of determination, across independent samples is proposed and briefly discussed. The principal advantage of the procedure is to minimize Type I error for follow-up tests of pairwise differences. (Author/JKS)
The contribution of individual and pairwise combinations of SNPs in the APOA1 and APOC3 genes to interindividual HDL-C variability.

PubMed

Brown, C M; Rea, T J; Hamon, S C; Hixson, J E; Boerwinkle, E; Clark, A G; Sing, C F

2006-07-01

Apolipoproteins (apo) A-I and C-III are components of high-density lipoprotein-cholesterol (HDL-C), a quantitative trait negatively correlated with risk of cardiovascular disease (CVD). We analyzed the contribution of individual and pairwise combinations of single nucleotide polymorphisms (SNPs) in the APOA1/APOC3 genes to HDL-C variability to evaluate (1) consistency of published single-SNP studies with our single-SNP analyses; (2) consistency of single-SNP and two-SNP phenotype-genotype relationships across race-, gender-, and geographical location-dependent contexts; and (3) the contribution of single SNPs and pairs of SNPs to variability beyond that explained by plasma apo A-I concentration. We analyzed 45 SNPs in 3,831 young African-American (N=1,858) and European-American (N=1,973) females and males ascertained by the Coronary Artery Risk Development in Young Adults (CARDIA) study. We found three SNPs that significantly impact HDL-C variability in both the literature and the CARDIA sample. Single-SNP analyses identified only one of five significant HDL-C SNP genotype relationships in the CARDIA study that was consistent across all race-, gender-, and geographical location-dependent contexts. The other four were consistent across geographical locations for a particular race-gender context. The portion of total phenotypic variance explained by single-SNP genotypes and genotypes defined by pairs of SNPs was less than 3%, an amount that is miniscule compared to the contribution explained by variability in plasma apo A-I concentration. Our findings illustrate the impact of context-dependence on SNP selection for prediction of CVD risk factor variability.
Revisiting the diffusion approximation to estimate evolutionary rates of gene family diversification.

PubMed

Gjini, Erida; Haydon, Daniel T; David Barry, J; Cobbold, Christina A

2014-01-21

Genetic diversity in multigene families is shaped by multiple processes, including gene conversion and point mutation. Because multi-gene families are involved in crucial traits of organisms, quantifying the rates of their genetic diversification is important. With increasing availability of genomic data, there is a growing need for quantitative approaches that integrate the molecular evolution of gene families with their higher-scale function. In this study, we integrate a stochastic simulation framework with population genetics theory, namely the diffusion approximation, to investigate the dynamics of genetic diversification in a gene family. Duplicated genes can diverge and encode new functions as a result of point mutation, and become more similar through gene conversion. To model the evolution of pairwise identity in a multigene family, we first consider all conversion and mutation events in a discrete manner, keeping track of their details and times of occurrence; second we consider only the infinitesimal effect of these processes on pairwise identity accounting for random sampling of genes and positions. The purely stochastic approach is closer to biological reality and is based on many explicit parameters, such as conversion tract length and family size, but is more challenging analytically. The population genetics approach is an approximation accounting implicitly for point mutation and gene conversion, only in terms of per-site average probabilities. Comparison of these two approaches across a range of parameter combinations reveals that they are not entirely equivalent, but that for certain relevant regimes they do match. As an application of this modelling framework, we consider the distribution of nucleotide identity among VSG genes of African trypanosomes, representing the most prominent example of a multi-gene family mediating parasite antigenic variation and within-host immune evasion. © 2013 Published by Elsevier Ltd. All rights reserved.
Clones or clans: the genetic structure of a deep-sea sponge, Aphrocallistes vastus, in unique sponge reefs of British Columbia, Canada.

PubMed

Brown, Rachel R; Davis, Corey S; Leys, Sally P

2017-02-01

Understanding patterns of reproduction, dispersal and recruitment in deep-sea communities is increasingly important with the need to manage resource extraction and conserve species diversity. Glass sponges are usually found in deep water (>1000 m) worldwide but form kilometre-long reefs on the continental shelf of British Columbia and Alaska that are under threat from trawling and resource exploration. Due to their deep-water habitat, larvae have not yet been found and the level of genetic connectivity between reefs and nonreef communities is unknown. The genetic structure of Aphrocallistes vastus, the primary reef-building species in the Strait of Georgia (SoG) British Columbia, was studied using single nucleotide polymorphisms (SNPs). Pairwise comparisons of multilocus genotypes were used to assess whether sexual reproduction is common. Structure was examined 1) between individuals in reefs, 2) between reefs and 3) between sites in and outside the SoG. Sixty-seven SNPs were genotyped in 91 samples from areas in and around the SoG, including four sponge reefs and nearby nonreef sites. The results show that sponge reefs are formed through sexual reproduction. Within a reef and across the SoG basin, the genetic distance between individuals does not vary with geographic distance (r = -0.005 to 0.014), but populations within the SoG basin are genetically distinct from populations in Barkley Sound, on the west coast of Vancouver Island. Population structure was seen across all sample sites (global F ST = 0.248), especially between SoG and non-SoG locations (average pairwise F ST = 0.251). Our results suggest that genetic mixing occurs across sponge reefs via larvae that disperse widely. © 2016 John Wiley & Sons Ltd.
Phylogenomic analysis of the species of the Mycobacterium tuberculosis complex demonstrates that Mycobacterium africanum, Mycobacterium bovis, Mycobacterium caprae, Mycobacterium microti and Mycobacterium pinnipedii are later heterotypic synonyms of Mycobacterium tuberculosis.

PubMed

Riojas, Marco A; McGough, Katya J; Rider-Riojas, Cristin J; Rastogi, Nalin; Hazbón, Manzour Hernando

2018-01-01

The species within the Mycobacterium tuberculosis Complex (MTBC) have undergone numerous taxonomic and nomenclatural changes, leaving the true structure of the MTBC in doubt. We used next-generation sequencing (NGS), digital DNA-DNA hybridization (dDDH), and average nucleotide identity (ANI) to investigate the relationship between these species. The type strains of Mycobacterium africanum, Mycobacterium bovis, Mycobacterium caprae, Mycobacterium microti and Mycobacterium pinnipedii were sequenced via NGS. Pairwise dDDH and ANI comparisons between these, previously sequenced MTBC type strain genomes (including 'Mycobacterium canettii', 'Mycobacterium mungi' and 'Mycobacterium orygis') and M. tuberculosis H37Rv T were performed. Further, all available genome sequences in GenBank for species in or putatively in the MTBC were compared to H37Rv T . Pairwise results indicated that all of the type strains of the species are extremely closely related to each other (dDDH: 91.2-99.2 %, ANI: 99.21-99.92 %), greatly exceeding the respective species delineation thresholds, thus indicating that they belong to the same species. Results from the GenBank genomes indicate that all the strains examined are within the circumscription of H37Rv T (dDDH: 83.5-100 %). We, therefore, formally propose a union of the species of the MTBC as M. tuberculosis. M. africanum, M. bovis, M. caprae, M. microti and M. pinnipedii are reclassified as later heterotypic synonyms of M. tuberculosis. 'M. canettii', 'M. mungi', and 'M. orygis' are classified as strains of the species M. tuberculosis. We further recommend use of the infrasubspecific term 'variant' ('var.') and infrasubspecific designations that generally retain the historical nomenclature associated with the groups or otherwise convey such characteristics, e.g. M. tuberculosis var. bovis.
HAPRAP: a haplotype-based iterative method for statistical fine mapping using GWAS summary statistics.

PubMed

Zheng, Jie; Rodriguez, Santiago; Laurin, Charles; Baird, Denis; Trela-Larsen, Lea; Erzurumluoglu, Mesut A; Zheng, Yi; White, Jon; Giambartolomei, Claudia; Zabaneh, Delilah; Morris, Richard; Kumari, Meena; Casas, Juan P; Hingorani, Aroon D; Evans, David M; Gaunt, Tom R; Day, Ian N M

2017-01-01

Fine mapping is a widely used approach for identifying the causal variant(s) at disease-associated loci. Standard methods (e.g. multiple regression) require individual level genotypes. Recent fine mapping methods using summary-level data require the pairwise correlation coefficients ([Formula: see text]) of the variants. However, haplotypes rather than pairwise [Formula: see text], are the true biological representation of linkage disequilibrium (LD) among multiple loci. In this article, we present an empirical iterative method, HAPlotype Regional Association analysis Program (HAPRAP), that enables fine mapping using summary statistics and haplotype information from an individual-level reference panel. Simulations with individual-level genotypes show that the results of HAPRAP and multiple regression are highly consistent. In simulation with summary-level data, we demonstrate that HAPRAP is less sensitive to poor LD estimates. In a parametric simulation using Genetic Investigation of ANthropometric Traits height data, HAPRAP performs well with a small training sample size (N < 2000) while other methods become suboptimal. Moreover, HAPRAP's performance is not affected substantially by single nucleotide polymorphisms (SNPs) with low minor allele frequencies. We applied the method to existing quantitative trait and binary outcome meta-analyses (human height, QTc interval and gallbladder disease); all previous reported association signals were replicated and two additional variants were independently associated with human height. Due to the growing availability of summary level data, the value of HAPRAP is likely to increase markedly for future analyses (e.g. functional prediction and identification of instruments for Mendelian randomization). The HAPRAP package and documentation are available at http://apps.biocompute.org.uk/haprap/ CONTACT: : jie.zheng@bristol.ac.uk or tom.gaunt@bristol.ac.ukSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Estrogen Receptor 1 ( ESR1) Gene Polymorphisms and Obesity Phenotypes in a Population of Young Adults.

PubMed

Correa-Rodríguez, María; Schmidt-RioValle, Jacqueline; González-Jiménez, Emilio; Rueda-Medina, Blanca

2017-06-01

Obesity is considered an increasingly serious health problem determined by multiple genetic and environmental factors. Estrogens have been found to play a major role in body weight and adiposity regulation through estrogen receptor 1 ( ESR1). The aim of this study was to determine whether genotype and haplotype frequencies of ESR1 polymorphisms are associated with body composition measures in a population of 572 young adults. A lack of significant association between genotypes of ESR1 gene polymorphisms and obesity phenotypes was seen after adjustment for confounding factors. Linkage disequilibrium (LD) analysis identified a single LD block for the ESR1 gene including PvuII and XbaI single-nucleotide polymorphisms (SNPs) (pairwise r 2 = .66). None of the haplotypes identified revealed statistically significant associations with any of the obesity phenotypes. Our results suggest that polymorphisms of the ESR1 gene do not contribute significantly to the genetic risk for obesity phenotypes in a population of young Caucasian adults.
Lactococcus petauri sp. nov., isolated from an abscess of a sugar glider

PubMed Central

Goodman, Laura B.; Lawton, Marie R.; Franklin-Guild, Rebecca J.; Anderson, Renee R.; Schaan, Lynn; Thachil, Anil J.; Wiedmann, Martin; Miller, Claire B.; Alcaine, Samuel D.; Kovac, Jasna

2017-01-01

A strain of lactic acid bacteria, designated 159469T, isolated from a facial abscess in a sugar glider, was characterized genetically and phenotypically. Cells of the strain were Gram-stain-positive, coccoid and catalase-negative. Morphological, physiological and phylogenetic data indicated that the isolate belongs to the genus Lactococcus. Strain 159469T was closely related to Lactococcus garvieae ATCC 43921T, showing 95.86 and 98.08 % sequence similarity in 16S rRNA gene and rpoB gene sequences, respectively. Furthermore, a pairwise average nucleotide identity blast (ANIb) value of 93.54 % and in silico DNA–DNA hybridization value of 50.7 % were determined for the genome of strain 159469T, when compared with the genome of the type strain of Lactococcus garvieae. Based on the data presented here, the isolate represents a novel species of the genus Lactococcus, for which the name Lactococcus petauri sp. nov. is proposed. The type strain is 159469T (=LMG 30040T=DSM 104842T). PMID:28945531
ExoLocator--an online view into genetic makeup of vertebrate proteins.

PubMed

Khoo, Aik Aun; Ogrizek-Tomas, Mario; Bulovic, Ana; Korpar, Matija; Gürler, Ece; Slijepcevic, Ivan; Šikic, Mile; Mihalek, Ivana

2014-01-01

ExoLocator (http://exolocator.eopsf.org) collects in a single place information needed for comparative analysis of protein-coding exons from vertebrate species. The main source of data--the genomic sequences, and the existing exon and homology annotation--is the ENSEMBL database of completed vertebrate genomes. To these, ExoLocator adds the search for ostensibly missing exons in orthologous protein pairs across species, using an extensive computational pipeline to narrow down the search region for the candidate exons and find a suitable template in the other species, as well as state-of-the-art implementations of pairwise alignment algorithms. The resulting complements of exons are organized in a way currently unique to ExoLocator: multiple sequence alignments, both on the nucleotide and on the peptide levels, clearly indicating the exon boundaries. The alignments can be inspected in the web-embedded viewer, downloaded or used on the spot to produce an estimate of conservation within orthologous sets, or functional divergence across paralogues.
Listeria costaricensis sp. nov.

PubMed

Núñez-Montero, Kattia; Leclercq, Alexandre; Moura, Alexandra; Vales, Guillaume; Peraza, Johnny; Pizarro-Cerdá, Javier; Lecuit, Marc

2018-03-01

A bacterial strain isolated from a food processing drainage system in Costa Rica fulfilled the criteria as belonging to the genus Listeria, but could not be assigned to any of the known species. Phylogenetic analysis based on the 16S rRNA gene revealed highest sequence similarity with the type strain of Listeria floridensis (98.7 %). Phylogenetic analysis based on Listeria core genomes placed the novel taxon within the Listeria fleishmannii, L. floridensis and Listeria aquatica clade (Listeria sensu lato). Whole-genome sequence analyses based on the average nucleotide blast identity (ANI<80 %) indicated that this isolate belonged to a novel species. Results of pairwise amino acid identity (AAI>70 %) and percentage of conserved proteins (POCP>68 %) with currently known Listeria species, as well as of biochemical characterization, confirmed that the strain constituted a novel species within the genus Listeria. The name Listeria costaricensis sp. nov. is proposed for the novel species, and is represented by the type strain CLIP 2016/00682 T (=CIP 111400 T =DSM 105474 T ).
A comparative genetic analysis of the Irish greyhound population using multilocus DNA fingerprinting, canine single locus minisatellites and canine microsatellites.

PubMed

Sutton, M D; Holmes, N G; Brennan, F B; Binns, M M; Kelly, E P; Duke, E J

1998-06-01

Pairwise analysis of HinfI/33.6 DNA fingerprints from a total of one hundred and fifty-three Irish greyhounds of known pedigree were used to determine band-share estimates of unrelated, first-degree and second-degree relationships. Forty-eight unrelated Irish greyhounds were used to determine allele frequencies for three single-locus minisatellites, and following a preliminary screen, eight of the most polymorphic tetra-nucleotide microsatellites from a panel of 15. The results indicated that both band-share estimates by DNA fingerprinting and microsatellite allele frequencies are highly effective in resolving parentage in this greyhound population, while single-locus minisatellites showed limited polymorphism and could not be used alone for routine parentage testing in this breed. The present study also demonstrated that, to obtain optimal resolution of parentage, sample sets of known pedigree status are required to determine the band-share distribution and/or microsatellite allele frequencies.
Lactococcus petauri sp. nov., isolated from an abscess of a sugar glider.

PubMed

Goodman, Laura B; Lawton, Marie R; Franklin-Guild, Rebecca J; Anderson, Renee R; Schaan, Lynn; Thachil, Anil J; Wiedmann, Martin; Miller, Claire B; Alcaine, Samuel D; Kovac, Jasna

2017-11-01

A strain of lactic acid bacteria, designated 159469 T , isolated from a facial abscess in a sugar glider, was characterized genetically and phenotypically. Cells of the strain were Gram-stain-positive, coccoid and catalase-negative. Morphological, physiological and phylogenetic data indicated that the isolate belongs to the genus Lactococcus. Strain 159469 T was closely related to Lactococcus garvieae ATCC 43921 T , showing 95.86 and 98.08 % sequence similarity in 16S rRNA gene and rpoB gene sequences, respectively. Furthermore, a pairwise average nucleotide identity blast (ANIb) value of 93.54 % and in silico DNA-DNA hybridization value of 50.7 % were determined for the genome of strain 159469 T , when compared with the genome of the type strain of Lactococcus garvieae. Based on the data presented here, the isolate represents a novel species of the genus Lactococcus, for which the name Lactococcus petauri sp. nov. is proposed. The type strain is 159469 T (=LMG 30040 T =DSM 104842 T ).
Molecular analysis of Aspergillus section Flavi isolated from Brazil nuts.

PubMed

Gonçalves, Juliana Soares; Ferracin, Lara Munique; Carneiro Vieira, Maria Lucia; Iamanaka, Beatriz Thie; Taniwaki, Marta Hiromi; Pelegrinelli Fungaro, Maria Helena

2012-04-01

Brazil nuts are an important export market in its main producing countries, including Brazil, Bolivia, and Peru. Approximately 30,000 tons of Brazil nuts are harvested each year. However, substantial nut contamination by Aspergillus section Flavi occurs with subsequent production of aflatoxins. In our study, Aspergillus section Flavi were isolated from Brazil nuts (Bertholletia excelsa), and identified by morphological and molecular means. We obtained 241 isolates from nut samples, 41% positive for aflatoxin production. Eighty-one isolates were selected for molecular investigation. Pairwise genetic distances among isolates and phylogenetic relationships were assessed. The following Aspergillus species were identified: A. flavus, A. caelatus, A. nomius, A. tamarii, A. bombycis, and A. arachidicola. Additionally, molecular profiles indicated a high level of nucleotide variation within β-tubulin and calmodulin gene sequences associated with high genetic divergence from RAPD data. Among the 81 isolates analyzed by molecular means, three of them were phylogenetically distinct from all other isolates representing the six species of section Flavi. A putative novel species was identified based on molecular profiles.
Mitochondrial DNA diversity of orchid bee Euglossa fimbriata (Hymenoptera: Apidae) populations assessed by PCR-RFLP.

PubMed

Suzuki, Karen M; Arias, Maria C; Giangarelli, Douglas C; Freiria, Gabriele A; Sofia, Silvia H

2010-04-01

Euglossa fimbriata is a euglossine species widely distributed in Brazil and occurring primarily in Atlantic Forest remnants. In this study, the genetic mitochondrial structure of E. fimbriata from six Atlantic Forest fragments was studied by RFLP analysis of three PCR-amplified mtDNA gene segments (16S, COI-COII, and cyt b). Ten composite haplotypes were identified, six of which were exclusive and represented singleton mitotypes. Low haplotype diversity (0.085-0.289) and nucleotide diversity (0.000-0.002) were detected within samples. AMOVA partitioned 91.13% of the overall genetic variation within samples and 8.87% (phi(st) = 0.089; P < 0.05) among samples. Pairwise comparisons indicated high levels of differentiation among some pairs of samples (phi(st) = 0.161-0.218; P < 0.05). These high levels indicate that these populations of E. fimbriata, despite their highly fragmented landscape, apparently have not suffered loss of genetic variation, suggesting that this particular population is not currently endangered.
Mobile phones and computer keyboards: unlikely reservoirs of multidrug-resistant organisms in the tertiary intensive care unit.

PubMed

Smibert, O C; Aung, A K; Woolnough, E; Carter, G P; Schultz, M B; Howden, B P; Seemann, T; Spelman, D; McGloughlin, S; Peleg, A Y

2018-03-02

Few studies have used molecular epidemiological methods to study transmission links to clinical isolates in intensive care units. Ninety-four multidrug-resistant organisms (MDROs) cultured from routine specimens from intensive care unit (ICU) patients over 13 weeks were stored (11 meticillin-resistant Staphylococcus aureus (MRSA), two vancomycin-resistant enterococci and 81 Gram-negative bacteria). Medical staff personal mobile phones, departmental phones, and ICU keyboards were swabbed and cultured for MDROs; MRSA was isolated from two phones. Environmental and patient isolates of the same genus were selected for whole genome sequencing. On whole genome sequencing, the mobile phone isolates had a pairwise single nucleotide polymorphism (SNP) distance of 183. However, >15,000 core genome SNPs separated the mobile phone and clinical isolates. In a low-endemic setting, mobile phones and keyboards appear unlikely to contribute to hospital-acquired MDROs. Copyright © 2018 The Healthcare Infection Society. Published by Elsevier Ltd. All rights reserved.
DNA sequence variation and selection of tag single-nucleotide polymorphisms at candidate genes for drought-stress response in Pinus taeda L.

PubMed

González-Martínez, Santiago C; Ersoz, Elhan; Brown, Garth R; Wheeler, Nicholas C; Neale, David B

2006-03-01

Genetic association studies are rapidly becoming the experimental approach of choice to dissect complex traits, including tolerance to drought stress, which is the most common cause of mortality and yield losses in forest trees. Optimization of association mapping requires knowledge of the patterns of nucleotide diversity and linkage disequilibrium and the selection of suitable polymorphisms for genotyping. Moreover, standard neutrality tests applied to DNA sequence variation data can be used to select candidate genes or amino acid sites that are putatively under selection for association mapping. In this article, we study the pattern of polymorphism of 18 candidate genes for drought-stress response in Pinus taeda L., an important tree crop. Data analyses based on a set of 21 putatively neutral nuclear microsatellites did not show population genetic structure or genomewide departures from neutrality. Candidate genes had moderate average nucleotide diversity at silent sites (pi(sil) = 0.00853), varying 100-fold among single genes. The level of within-gene LD was low, with an average pairwise r2 of 0.30, decaying rapidly from approximately 0.50 to approximately 0.20 at 800 bp. No apparent LD among genes was found. A selective sweep may have occurred at the early-response-to-drought-3 (erd3) gene, although population expansion can also explain our results and evidence for selection was not conclusive. One other gene, ccoaomt-1, a methylating enzyme involved in lignification, showed dimorphism (i.e., two highly divergent haplotype lineages at equal frequency), which is commonly associated with the long-term action of balancing selection. Finally, a set of haplotype-tagging SNPs (htSNPs) was selected. Using htSNPs, a reduction of genotyping effort of approximately 30-40%, while sampling most common allelic variants, can be gained in our ongoing association studies for drought tolerance in pine.
Scale dependence in species turnover reflects variance in species occupancy.

PubMed

McGlinn, Daniel J; Hurlbert, Allen H

2012-02-01

Patterns of species turnover may reflect the processes driving community dynamics across scales. While the majority of studies on species turnover have examined pairwise comparison metrics (e.g., the average Jaccard dissimilarity), it has been proposed that the species-area relationship (SAR) also offers insight into patterns of species turnover because these two patterns may be analytically linked. However, these previous links only apply in a special case where turnover is scale invariant, and we demonstrate across three different plant communities that over 90% of the pairwise turnover values are larger than expected based on scale-invariant predictions from the SAR. Furthermore, the degree of scale dependence in turnover was negatively related to the degree of variance in the occupancy frequency distribution (OFD). These findings suggest that species turnover diverges from scale invariance, and as such pairwise turnover and the slope of the SAR are not redundant. Furthermore, models developed to explain the OFD should be linked with those developed to explain species turnover to achieve a more unified understanding of community structure.

CombAlign: a code for generating a one-to-many sequence alignment from a set of pairwise structure-based sequence alignments.

PubMed

Zhou, Carol L Ecale

2015-01-01

In order to better define regions of similarity among related protein structures, it is useful to identify the residue-residue correspondences among proteins. Few codes exist for constructing a one-to-many multiple sequence alignment derived from a set of structure or sequence alignments, and a need was evident for creating such a tool for combining pairwise structure alignments that would allow for insertion of gaps in the reference structure. This report describes a new Python code, CombAlign, which takes as input a set of pairwise sequence alignments (which may be structure based) and generates a one-to-many, gapped, multiple structure- or sequence-based sequence alignment (MSSA). The use and utility of CombAlign was demonstrated by generating gapped MSSAs using sets of pairwise structure-based sequence alignments between structure models of the matrix protein (VP40) and pre-small/secreted glycoprotein (sGP) of Reston Ebolavirus and the corresponding proteins of several other filoviruses. The gapped MSSAs revealed structure-based residue-residue correspondences, which enabled identification of structurally similar versus differing regions in the Reston proteins compared to each of the other corresponding proteins. CombAlign is a new Python code that generates a one-to-many, gapped, multiple structure- or sequence-based sequence alignment (MSSA) given a set of pairwise sequence alignments (which may be structure based). CombAlign has utility in assisting the user in distinguishing structurally conserved versus divergent regions on a reference protein structure relative to other closely related proteins. CombAlign was developed in Python 2.6, and the source code is available for download from the GitHub code repository.
Improving prediction of heterodimeric protein complexes using combination with pairwise kernel.

PubMed

Ruan, Peiying; Hayashida, Morihiro; Akutsu, Tatsuya; Vert, Jean-Philippe

2018-02-19

Since many proteins become functional only after they interact with their partner proteins and form protein complexes, it is essential to identify the sets of proteins that form complexes. Therefore, several computational methods have been proposed to predict complexes from the topology and structure of experimental protein-protein interaction (PPI) network. These methods work well to predict complexes involving at least three proteins, but generally fail at identifying complexes involving only two different proteins, called heterodimeric complexes or heterodimers. There is however an urgent need for efficient methods to predict heterodimers, since the majority of known protein complexes are precisely heterodimers. In this paper, we use three promising kernel functions, Min kernel and two pairwise kernels, which are Metric Learning Pairwise Kernel (MLPK) and Tensor Product Pairwise Kernel (TPPK). We also consider the normalization forms of Min kernel. Then, we combine Min kernel or its normalization form and one of the pairwise kernels by plugging. We applied kernels based on PPI, domain, phylogenetic profile, and subcellular localization properties to predicting heterodimers. Then, we evaluate our method by employing C-Support Vector Classification (C-SVC), carrying out 10-fold cross-validation, and calculating the average F-measures. The results suggest that the combination of normalized-Min-kernel and MLPK leads to the best F-measure and improved the performance of our previous work, which had been the best existing method so far. We propose new methods to predict heterodimers, using a machine learning-based approach. We train a support vector machine (SVM) to discriminate interacting vs non-interacting protein pairs, based on informations extracted from PPI, domain, phylogenetic profiles and subcellular localization. We evaluate in detail new kernel functions to encode these data, and report prediction performance that outperforms the state-of-the-art.
Genetic Population Structure of Dastarcus helophoroides (Coleoptera: Bothrideridae) From Different Long-Horned Beetle Hosts Based on Complete Sequences of Mitochondrial COI.

PubMed

Zhang, Zhengqing; Chang, Yong; Li, Menglou

2017-06-01

Dastarcus helophoroides (Fairmaire) (Coleoptera: Bothrideridae) is an important natural enemy of long-horned beetles in China, Japan, and Korea. In this study, the genetic sequence of cytochrome oxidase subunit Ι was used to investigate the genetics and relationships within and among D. helophoroides populations collected from five different geographic locations. We used principal component analysis, heatmap, and Venn diagram results to determine the relationship between haplotypes and populations. In total, 26 haplotypes with 51 nucleotide polymorphic sites were defined, and low genetic diversity was found among the different populations. Significant genetic variations were observed mainly within populations, and no correlation was found between genetic distribution and geographical distance. Low pairwise fixation index values (-0.01424 to 0.04896) and high gene flows show that there was high gene exchange between populations. The codistributed haplotype DH01 was suggested to be the most ancestral haplotype, and other haplotypes were thought to have evolved from it through several mutations. In four of the populations, both common haplotypes (DH01, DH03, and DH22) and unique haplotypes were found. Low genetic diversity among different populations is related to a relatively high flight capacity, host movement, and human-aided dispersal of D. helophoroides. The high gene exchange and typically weak population genetic structure among five populations, especially among populations of Anoplophora glabripennis (Motschulsky), Monochamus alternatus (Hope), and Massicus raddei (Blessig), may suggest that these populations cross naturally in the field. © The Authors 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
A custom correlation coefficient (CCC) approach for fast identification of multi-SNP association patterns in genome-wide SNPs data.

PubMed

Climer, Sharlee; Yang, Wei; de las Fuentes, Lisa; Dávila-Román, Victor G; Gu, C Charles

2014-11-01

Complex diseases are often associated with sets of multiple interacting genetic factors and possibly with unique sets of the genetic factors in different groups of individuals (genetic heterogeneity). We introduce a novel concept of custom correlation coefficient (CCC) between single nucleotide polymorphisms (SNPs) that address genetic heterogeneity by measuring subset correlations autonomously. It is used to develop a 3-step process to identify candidate multi-SNP patterns: (1) pairwise (SNP-SNP) correlations are computed using CCC; (2) clusters of so-correlated SNPs identified; and (3) frequencies of these clusters in disease cases and controls compared to identify disease-associated multi-SNP patterns. This method identified 42 candidate multi-SNP associations with hypertensive heart disease (HHD), among which one cluster of 22 SNPs (six genes) included 13 in SLC8A1 (aka NCX1, an essential component of cardiac excitation-contraction coupling) and another of 32 SNPs had 29 from a different segment of SLC8A1. While allele frequencies show little difference between cases and controls, the cluster of 22 associated alleles were found in 20% of controls but no cases and the other in 3% of controls but 20% of cases. These suggest that both protective and risk effects on HHD could be exerted by combinations of variants in different regions of SLC8A1, modified by variants from other genes. The results demonstrate that this new correlation metric identifies disease-associated multi-SNP patterns overlooked by commonly used correlation measures. Furthermore, computation time using CCC is a small fraction of that required by other methods, thereby enabling the analyses of large GWAS datasets. © 2014 WILEY PERIODICALS, INC.
A custom correlation coefficient (CCC) approach for fast identification of multi-SNP association patterns in genome-wide SNPs data

PubMed Central

Climer, Sharlee; Yang, Wei; de las Fuentes, Lisa; Dávila-Román, Victor G.; Gu, C. Charles

2014-01-01

Complex diseases are often associated with sets of multiple interacting genetic factors and possibly with unique sets of the genetic factors in different groups of individuals (genetic heterogeneity). We introduce a novel concept of Custom Correlation Coefficient (CCC) between single nucleotide polymorphisms (SNPs) that address genetic heterogeneity by measuring subset correlations autonomously. It is used to develop a 3-step process to identify candidate multi-SNP patterns: (1) pairwise (SNP-SNP) correlations are computed using CCC; (2) clusters of so-correlated SNPs identified; and (3) frequencies of these clusters in disease cases and controls compared to identify disease-associated multi-SNP patterns. This method identified 42 candidate multi-SNP associations with hypertensive heart disease (HHD), among which one cluster of 22 SNPs (6 genes) included 13 in SLC8A1 (aka NCX1, an essential component of cardiac excitation-contraction coupling) and another of 32 SNPs had 29 from a different segment of SLC8A1. While allele frequencies show little difference between cases and controls, the cluster of 22 associated alleles were found in 20% of controls but no cases and the other in 3% of controls but 20% of cases. These suggest that both protective and risk effects on HHD could be exerted by combinations of variants in different regions of SLC8A1, modified by variants from other genes. The results demonstrate that this new correlation metric identifies disease-associated multi-SNP patterns overlooked by commonly used correlation measures. Furthermore, computation time using CCC is a small fraction of that required by other methods, thereby enabling the analyses of large GWAS datasets. PMID:25168954
Gene ontology analysis of pairwise genetic associations in two genome-wide studies of sporadic ALS.

PubMed

Kim, Nora Chung; Andrews, Peter C; Asselbergs, Folkert W; Frost, H Robert; Williams, Scott M; Harris, Brent T; Read, Cynthia; Askland, Kathleen D; Moore, Jason H

2012-07-28

It is increasingly clear that common human diseases have a complex genetic architecture characterized by both additive and nonadditive genetic effects. The goal of the present study was to determine whether patterns of both additive and nonadditive genetic associations aggregate in specific functional groups as defined by the Gene Ontology (GO). We first estimated all pairwise additive and nonadditive genetic effects using the multifactor dimensionality reduction (MDR) method that makes few assumptions about the underlying genetic model. Statistical significance was evaluated using permutation testing in two genome-wide association studies of ALS. The detection data consisted of 276 subjects with ALS and 271 healthy controls while the replication data consisted of 221 subjects with ALS and 211 healthy controls. Both studies included genotypes from approximately 550,000 single-nucleotide polymorphisms (SNPs). Each SNP was mapped to a gene if it was within 500 kb of the start or end. Each SNP was assigned a p-value based on its strongest joint effect with the other SNPs. We then used the Exploratory Visual Analysis (EVA) method and software to assign a p-value to each gene based on the overabundance of significant SNPs at the α = 0.05 level in the gene. We also used EVA to assign p-values to each GO group based on the overabundance of significant genes at the α = 0.05 level. A GO category was determined to replicate if that category was significant at the α = 0.05 level in both studies. We found two GO categories that replicated in both studies. The first, 'Regulation of Cellular Component Organization and Biogenesis', a GO Biological Process, had p-values of 0.010 and 0.014 in the detection and replication studies, respectively. The second, 'Actin Cytoskeleton', a GO Cellular Component, had p-values of 0.040 and 0.046 in the detection and replication studies, respectively. Pathway analysis of pairwise genetic associations in two GWAS of sporadic ALS revealed a set of genes involved in cellular component organization and actin cytoskeleton, more specifically, that were not reported by prior GWAS. However, prior biological studies have implicated actin cytoskeleton in ALS and other motor neuron diseases. This study supports the idea that pathway-level analysis of GWAS data may discover important associations not revealed using conventional one-SNP-at-a-time approaches.
Pairwise contact energy statistical potentials can help to find probability of point mutations.

PubMed

Saravanan, K M; Suvaithenamudhan, S; Parthasarathy, S; Selvaraj, S

2017-01-01

To adopt a particular fold, a protein requires several interactions between its amino acid residues. The energetic contribution of these residue-residue interactions can be approximated by extracting statistical potentials from known high resolution structures. Several methods based on statistical potentials extracted from unrelated proteins are found to make a better prediction of probability of point mutations. We postulate that the statistical potentials extracted from known structures of similar folds with varying sequence identity can be a powerful tool to examine probability of point mutation. By keeping this in mind, we have derived pairwise residue and atomic contact energy potentials for the different functional families that adopt the (α/β) 8 TIM-Barrel fold. We carried out computational point mutations at various conserved residue positions in yeast Triose phosphate isomerase enzyme for which experimental results are already reported. We have also performed molecular dynamics simulations on a subset of point mutants to make a comparative study. The difference in pairwise residue and atomic contact energy of wildtype and various point mutations reveals probability of mutations at a particular position. Interestingly, we found that our computational prediction agrees with the experimental studies of Silverman et al. (Proc Natl Acad Sci 2001;98:3092-3097) and perform better prediction than i Mutant and Cologne University Protein Stability Analysis Tool. The present work thus suggests deriving pairwise contact energy potentials and molecular dynamics simulations of functionally important folds could help us to predict probability of point mutations which may ultimately reduce the time and cost of mutation experiments. Proteins 2016; 85:54-64. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
A spreadsheet template compatible with Microsoft Excel and iWork Numbers that returns the simultaneous confidence intervals for all pairwise differences between multiple sample means.

PubMed

Brown, Angus M

2010-04-01

The objective of the method described in this paper is to develop a spreadsheet template for the purpose of comparing multiple sample means. An initial analysis of variance (ANOVA) test on the data returns F--the test statistic. If F is larger than the critical F value drawn from the F distribution at the appropriate degrees of freedom, convention dictates rejection of the null hypothesis and allows subsequent multiple comparison testing to determine where the inequalities between the sample means lie. A variety of multiple comparison methods are described that return the 95% confidence intervals for differences between means using an inclusive pairwise comparison of the sample means. 2009 Elsevier Ireland Ltd. All rights reserved.
A greedy, graph-based algorithm for the alignment of multiple homologous gene lists.

PubMed

Fostier, Jan; Proost, Sebastian; Dhoedt, Bart; Saeys, Yvan; Demeester, Piet; Van de Peer, Yves; Vandepoele, Klaas

2011-03-15

Many comparative genomics studies rely on the correct identification of homologous genomic regions using accurate alignment tools. In such case, the alphabet of the input sequences consists of complete genes, rather than nucleotides or amino acids. As optimal multiple sequence alignment is computationally impractical, a progressive alignment strategy is often employed. However, such an approach is susceptible to the propagation of alignment errors in early pairwise alignment steps, especially when dealing with strongly diverged genomic regions. In this article, we present a novel accurate and efficient greedy, graph-based algorithm for the alignment of multiple homologous genomic segments, represented as ordered gene lists. Based on provable properties of the graph structure, several heuristics are developed to resolve local alignment conflicts that occur due to gene duplication and/or rearrangement events on the different genomic segments. The performance of the algorithm is assessed by comparing the alignment results of homologous genomic segments in Arabidopsis thaliana to those obtained by using both a progressive alignment method and an earlier graph-based implementation. Especially for datasets that contain strongly diverged segments, the proposed method achieves a substantially higher alignment accuracy, and proves to be sufficiently fast for large datasets including a few dozens of eukaryotic genomes. http://bioinformatics.psb.ugent.be/software. The algorithm is implemented as a part of the i-ADHoRe 3.0 package.
Spirocerca vulpis sp. nov. (Spiruridae: Spirocercidae): description of a new nematode species of the red fox, Vulpes vulpes (Carnivora: Canidae).

PubMed

Rojas, Alicia; Sanchis-Monsonís, Gloria; Alić, Amer; Hodžić, Adnan; Otranto, Domenico; Yasur-Landau, Daniel; Martínez-Carrasco, Carlos; Baneth, Gad

2018-05-21

Previous studies have reported nematodes of the Spirocercidae family in the stomach nodules of red foxes (Vulpes vulpes) described as Spirocerca sp. or Spirocerca lupi (Rudolphi, 1819). We characterized spirurid worms collected from red foxes and compared them to S. lupi from domestic dogs by morphometric and phylogenetic analyses. Nematodes from red foxes differed from S. lupi by the presence of six triangular teeth-like buccal capsule structures, which are absent in the latter. Additionally, in female worms from red foxes, the distance of the vulva opening to the anterior end and the ratio of the glandular-to-muscular oesophagus lengths were larger than those of S. lupi (P < 0.006). In males, the lengths of the whole oesophagus and glandular part, the ratio of the glandular-to-muscular oesophagus and the comparison of the oesophagus to the total body length were smaller in S. lupi (all P < 0.044). Phylogenetic analyses revealed that S. lupi and the red foxes spirurid represent monophyletic sister groups with pairwise nucleotide distances of 9.2 and 0.2% in the cytochrome oxidase 1 and 18S genes, respectively. Based on these comparisons, the nematodes from red foxes were considered to belong to a separate species, for which the name Spirocerca vulpis sp. nov. is proposed.
Genetic variability in Melipona quinquefasciata (Hymenoptera, Apidae, Meliponini) from northeastern Brazil determined using the first internal transcribed spacer (ITS1).

PubMed

Pereira, J O P; Freitas, B M; Jorge, D M M; Torres, D C; Soares, C E A; Grangeiro, T B

2009-01-01

Melipona quinquefasciata is a ground-nesting South American stingless bee whose geographic distribution was believed to comprise only the central and southern states of Brazil. We obtained partial sequences (about 500-570 bp) of first internal transcribed spacer (ITS1) nuclear ribosomal DNA from Melipona specimens putatively identified as M. quinquefasciata collected from different localities in northeastern Brazil. To confirm the taxonomic identity of the northeastern samples, specimens from the state of Goiás (Central region of Brazil) were included for comparison. All sequences were deposited in GenBank (accession numbers EU073751-EU073759). The mean nucleotide divergence (excluding sites with insertions/deletions) in the ITS1 sequences was only 1.4%, ranging from 0 to 4.1%. When the sites with insertions/deletions were also taken into account, sequence divergences varied from 0 to 5.3%. In all pairwise comparisons, the ITS1 sequence from the specimens collected in Goiás was most divergent compared to the ITS1 sequences of the bees from the other locations. However, neighbor-joining phylogenetic analysis showed that all ITS1 sequences from northeastern specimens along with the sample of Goiás were resolved in a single clade with a bootstrap support of 100%. The ITS1 sequencing data thus support the occurrence of M. quinquefasciata in northeast Brazil.
Molecular analysis of carbon monoxide-oxidizing bacteria associated with recent Hawaiian volcanic deposits.

PubMed

Dunfield, Kari E; King, Gary M

2004-07-01

Genomic DNA extracts from four sites at Kilauea Volcano were used as templates for PCR amplification of the large subunit (coxL) of aerobic carbon monoxide dehydrogenase. The sites included a 42-year-old tephra deposit, a 108-year-old lava flow, a 212-year-old partially vegetated ash-and-tephra deposit, and an approximately 300-year-old forest. PCR primers amplified coxL sequences from the OMP clade of CO oxidizers, which includes isolates such as Oligotropha carboxidovorans, Mycobacterium tuberculosis, and Pseudomonas thermocarboxydovorans. PCR products were used to create clone libraries that provide the first insights into the diversity and phylogenetic affiliations of CO oxidizers in situ. On the basis of phylogenetic and statistical analyses, clone libraries for each site were distinct. Although some clone sequences were similar to coxL sequences from known organisms, many sequences appeared to represent phylogenetic lineages not previously known to harbor CO oxidizers. On the basis of average nucleotide diversity and average pairwise difference, a forested site supported the most diverse CO-oxidizing populations, while an 1894 lava flow supported the least diverse populations. Neither parameter correlated with previous estimates of atmospheric CO uptake rates, but both parameters correlated positively with estimates of microbial biomass and respiration. Collectively, the results indicate that the CO oxidizer functional group associated with recent volcanic deposits of the remote Hawaiian Islands contains substantial and previously unsuspected diversity.
Molecular Analysis of Carbon Monoxide-Oxidizing Bacteria Associated with Recent Hawaiian Volcanic Deposits†

PubMed Central

Dunfield, Kari E.; King, Gary M.

2004-01-01

Genomic DNA extracts from four sites at Kilauea Volcano were used as templates for PCR amplification of the large subunit (coxL) of aerobic carbon monoxide dehydrogenase. The sites included a 42-year-old tephra deposit, a 108-year-old lava flow, a 212-year-old partially vegetated ash-and-tephra deposit, and an approximately 300-year-old forest. PCR primers amplified coxL sequences from the OMP clade of CO oxidizers, which includes isolates such as Oligotropha carboxidovorans, Mycobacterium tuberculosis, and Pseudomonas thermocarboxydovorans. PCR products were used to create clone libraries that provide the first insights into the diversity and phylogenetic affiliations of CO oxidizers in situ. On the basis of phylogenetic and statistical analyses, clone libraries for each site were distinct. Although some clone sequences were similar to coxL sequences from known organisms, many sequences appeared to represent phylogenetic lineages not previously known to harbor CO oxidizers. On the basis of average nucleotide diversity and average pairwise difference, a forested site supported the most diverse CO-oxidizing populations, while an 1894 lava flow supported the least diverse populations. Neither parameter correlated with previous estimates of atmospheric CO uptake rates, but both parameters correlated positively with estimates of microbial biomass and respiration. Collectively, the results indicate that the CO oxidizer functional group associated with recent volcanic deposits of the remote Hawaiian Islands contains substantial and previously unsuspected diversity. PMID:15240307
A new species of masked-owl (Aves: Strigiformes: Tytonidae) from Seram, Indonesia.

PubMed

Jønsson, Knud Andreas; Poulsen, Michael Køie; Haryoko, Tri; Reeve, Andrew Hart; Fabre, Pierre-Henri

2013-01-01

We describe a new species of masked-owl from the lower montane forest of Seram, one of the largest islands in the Moluccas of eastern Indonesia, for which we propose the name Tyto almae (Seram Masked-Owl), sp. nov. Molecular (mitochondrial cyt-b) differences show that Tyto sororcula of Buru and Tanimbar is closely related to T novaehollandiae of Australia and New Guinea (-1% uncorrected pairwise distance), and that Tyto almae of Seram differs by -3% (uncorrected pairwise distance) from both of them. These differences are further corroborated by morphology and colouration. Although a photograph from Seram published in 1987 had already established the presence of a Tyto owl on the island, ours represents the first specimen of this species. The bird was mist-netted in wet, mossy lower montane forest at an elevation of 1,350 m. No further observations of the owl were made during four weeks of fieldwork in Seram.
Pairwise-Comparison Software

NASA Technical Reports Server (NTRS)

Ricks, Wendell R.

1995-01-01

Pairwise comparison (PWC) is computer program that collects data for psychometric scaling techniques now used in cognitive research. It applies technique of pairwise comparisons, which is one of many techniques commonly used to acquire the data necessary for analyses. PWC administers task, collects data from test subject, and formats data for analysis. Written in Turbo Pascal v6.0.
Multilinguals' Perceptions of Feeling Different When Switching Languages

ERIC Educational Resources Information Center

Dewaele, Jean-Marc; Nakano, Seiji

2013-01-01

Research into multilingualism and personality has shown that a majority of multilinguals report feeling different when they switch from one language to another. The present study looks at perceived shifts on five scales of feelings (feeling logical, serious, emotional, fake and different) in pair-wise comparisons between languages following the…
Transient Classifier Systems and Man-Machine Interface Research.

DTIC Science & Technology

1987-08-31

different timbre from two different resonant sources, i.e., like a violin and oboe emitting nearly the same fundamental mode fre- quency, but each with its...the subjects by examing both hits and misses for signal and noise stimuli. A pairwise com- parison of the means resulted in significant differences (at
Galilean-invariant Nosé-Hoover-type thermostats.

PubMed

Pieprzyk, S; Heyes, D M; Maćkowiak, Sz; Brańka, A C

2015-03-01

A new pairwise Nosé-Hoover type thermostat for molecular dynamics (MD) simulations which is similar in construction to the pair-velocity thermostat of Allen and Schmid, [Mol. Simul. 33, 21 (2007)] (AS) but is based on the configurational thermostat is proposed and tested. Both thermostats generate the canonical velocity distribution, are Galilean invariant, and conserve linear and angular momentum. The unique feature of the pairwise thermostats is an unconditional conservation of the total angular momentum, which is important for thermalizing isolated systems and those nonequilibrium bulk systems manifesting local rotating currents. These thermostats were benchmarked against the corresponding Nosé-Hoover (NH) and Braga-Travis prescriptions, being based on the kinetic and configurational definitions of temperature, respectively. Some differences between the shear-rate-dependent shear viscosity from Sllod nonequilibrium MD are observed at high shear rates using the different thermostats. The thermostats based on the configurational temperature produced very similar monotically decaying shear viscosity (shear thinning) with increasing shear rate, while the NH method showed discontinuous shear thinning into a string phase, and the AS method produced a continuous increase of viscosity (shear thickening), after a shear thinning region at lower shear rates. Both pairwise additive thermostats are neither purely kinetic nor configurational in definition, and possible directions for further improvement in certain aspects are discussed.
Population and forensic genetic analyses of mitochondrial DNA control region variation from six major provinces in the Korean population.

PubMed

Hong, Seung Beom; Kim, Ki Cheol; Kim, Wook

2015-07-01

We generated complete mitochondrial DNA (mtDNA) control region sequences from 704 unrelated individuals residing in six major provinces in Korea. In addition to our earlier survey of the distribution of mtDNA haplogroup variation, a total of 560 different haplotypes characterized by 271 polymorphic sites were identified, of which 473 haplotypes were unique. The gene diversity and random match probability were 0.9989 and 0.0025, respectively. According to the pairwise comparison of the 704 control region sequences, the mean number of pairwise differences between individuals was 13.47±6.06. Based on the result of mtDNA control region sequences, pairwise FST genetic distances revealed genetic homogeneity of the Korean provinces on a peninsular level, except in samples from Jeju Island. This result indicates there may be a need to formulate a local mtDNA database for Jeju Island, to avoid bias in forensic parameter estimates caused by genetic heterogeneity of the population. Thus, the present data may help not only in personal identification but also in determining maternal lineages to provide an expanded and reliable Korean mtDNA database. These data will be available on the EMPOP database via accession number EMP00661. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Galilean-invariant Nosé-Hoover-type thermostats

NASA Astrophysics Data System (ADS)

Pieprzyk, S.; Heyes, D. M.; Maćkowiak, Sz.; Brańka, A. C.

2015-03-01

A new pairwise Nosé-Hoover type thermostat for molecular dynamics (MD) simulations which is similar in construction to the pair-velocity thermostat of Allen and Schmid, [Mol. Simul. 33, 21 (2007), 10.1080/08927020601052856] (AS) but is based on the configurational thermostat is proposed and tested. Both thermostats generate the canonical velocity distribution, are Galilean invariant, and conserve linear and angular momentum. The unique feature of the pairwise thermostats is an unconditional conservation of the total angular momentum, which is important for thermalizing isolated systems and those nonequilibrium bulk systems manifesting local rotating currents. These thermostats were benchmarked against the corresponding Nosé-Hoover (NH) and Braga-Travis prescriptions, being based on the kinetic and configurational definitions of temperature, respectively. Some differences between the shear-rate-dependent shear viscosity from Sllod nonequilibrium MD are observed at high shear rates using the different thermostats. The thermostats based on the configurational temperature produced very similar monotically decaying shear viscosity (shear thinning) with increasing shear rate, while the NH method showed discontinuous shear thinning into a string phase, and the AS method produced a continuous increase of viscosity (shear thickening), after a shear thinning region at lower shear rates. Both pairwise additive thermostats are neither purely kinetic nor configurational in definition, and possible directions for further improvement in certain aspects are discussed.

Genome sequences of a mouse-avirulent and a mouse-virulent strain of Ross River virus.

PubMed

Faragher, S G; Meek, A D; Rice, C M; Dalgarno, L

1988-04-01

The nucleotide sequence of the genomic RNA of a mouse-avirulent strain of Ross River virus, RRV NB5092 (isolated in 1969), has been determined and the corresponding sequence for the prototype mouse-virulent strain, RRV T48 (isolated in 1959), has been completed. The RRV NB5092 genome is approximately 11,674 nucleotides in length, compared with 11,853 nucleotides for RRV T48. RRV NB5092 and RRV T48 have the same genome organization. For both viruses an untranslated region of 80 nucleotides at the 5' end of the genome is followed by a 7440-nucleotide open reading frame which is interrupted after 5586 nucleotides by a single opal termination codon. By homology with other alphaviruses, the 5586-nucleotide open reading frame encodes the nonstructural proteins nsP1, nsP2, and nsP3; a fourth nonstructural protein, nsP4, is produced by read-through of the opal codon. The RRV nonstructural proteins show strong homology with the corresponding proteins of Sindbis virus and Semliki Forest virus in terms of size, net charge, and hydropathy characteristics. However, homology is not uniform between or within the proteins; nsP1, nsP2, and nsP4 contain extended domains which are highly conserved between alphaviruses, while the C-terminal region of nsP3 shows little conservation in sequence or length between alphaviruses. An untranslated "junction" region of 44 nucleotides (for RRV NB5092) or 47 nucleotides (for RRV T48) separates the nonstructural and structural protein coding regions. The structural proteins (capsid-E3-E2-6K-E1) are translated from an open reading frame of 3762 nucleotides which is followed by a 3'-untranslated region of approximately 348 nucleotides (for RRV NB5092) or 524 nucleotides (for RRV T48). Excluding deletions and insertions, the genomes of RRV NB5092 and RRV T48 differ at 284 nucleotides, representing a sequence divergence of 2.38%. Sequence deletions or insertions were found only in the noncoding regions and include a 173-nucleotide deletion in the 3'-untranslated region of RRV NB5092, compared with RRV T48. In the coding regions, most of the nucleotide differences are silent; there are 36 amino acid differences in the nonstructural proteins and 12 in the structural proteins. The distribution of amino acid differences between the two RRV strains correlates with the location of domains which are poorly conserved in sequence between alphaviruses. The possible role of amino acid differences in envelope glycoproteins E1 and E2 in determining the different antigenic and biological properties of RRV NB5092 and RRV T48 is discussed.
Combined hairpin-antisense compositions and methods for modulating expression

DOEpatents

Shanklin, John; Nguyen, Tam

2014-08-05

A nucleotide construct comprising a nucleotide sequence that forms a stem and a loop, wherein the loop comprises a nucleotide sequence that modulates expression of a target, wherein the stem comprises a nucleotide sequence that modulates expression of a target, and wherein the target modulated by the nucleotide sequence in the loop and the target modulated by the nucleotide sequence in the stem may be the same or different. Vectors, methods of regulating target expression, methods of providing a cell, and methods of treating conditions comprising the nucleotide sequence are also disclosed.
Combined hairpin-antisense compositions and methods for modulating expression

DOEpatents

Shanklin, John; Nguyen, Tam Huu

2015-11-24

A nucleotide construct comprising a nucleotide sequence that forms a stem and a loop, wherein the loop comprises a nucleotide sequence that modulates expression of a target, wherein the stem comprises a nucleotide sequence that modulates expression of a target, and wherein the target modulated by the nucleotide sequence in the loop and the target modulated by the nucleotide sequence in the stem may be the same or different. Vectors, methods of regulating target expression, methods of providing a cell, and methods of treating conditions comprising the nucleotide sequence are also disclosed.
Genomic differentiation among wild cyanophages despite widespread horizontal gene transfer

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gregory, Ann C.; Solonenko, Sergei A.; Ignacio-Espinoza, J. Cesar

Genetic recombination is a driving force in genome evolution. Among viruses it has a dual role. For genomes with higher fitness, it maintains genome integrity in the face of high mutation rates. Conversely, for genomes with lower fitness, it provides immediate access to sequence space that cannot be reached by mutation alone. Understanding how recombination impacts the cohesion and dissolution of individual whole genomes within viral sequence space is poorly understood across double-stranded DNA bacteriophages (a.k.a phages) due to the challenges of obtaining appropriately scaled genomic datasets. Here in this study we explore the role of recombination in both maintainingmore » and differentiating whole genomes of 142 wild double-stranded DNA marine cyanophages. Phylogenomic analysis across the 51 core genes revealed ten lineages, six of which were well represented. These phylogenomic lineages represent discrete genotypic populations based on comparisons of intra- and inter- lineage shared gene content, genome-wide average nucleotide identity, as well as detected gaps in the distribution of pairwise differences between genomes. McDonald-Kreitman selection tests identified putative niche-differentiating genes under positive selection that differed across the six well-represented genotypic populations and that may have driven initial divergence. Concurrent with patterns of recombination of discrete populations, recombination analyses of both genic and intergenic regions largely revealed decreased genetic exchange across individual genomes between relative to within populations. Lastly, these findings suggest that discrete double-stranded DNA marine cyanophage populations occur in nature and are maintained by patterns of recombination akin to those observed in bacteria, archaea and in sexual eukaryotes.« less
Genomic differentiation among wild cyanophages despite widespread horizontal gene transfer

DOE PAGES

Gregory, Ann C.; Solonenko, Sergei A.; Ignacio-Espinoza, J. Cesar; ...

2016-11-16

Genetic recombination is a driving force in genome evolution. Among viruses it has a dual role. For genomes with higher fitness, it maintains genome integrity in the face of high mutation rates. Conversely, for genomes with lower fitness, it provides immediate access to sequence space that cannot be reached by mutation alone. Understanding how recombination impacts the cohesion and dissolution of individual whole genomes within viral sequence space is poorly understood across double-stranded DNA bacteriophages (a.k.a phages) due to the challenges of obtaining appropriately scaled genomic datasets. Here in this study we explore the role of recombination in both maintainingmore » and differentiating whole genomes of 142 wild double-stranded DNA marine cyanophages. Phylogenomic analysis across the 51 core genes revealed ten lineages, six of which were well represented. These phylogenomic lineages represent discrete genotypic populations based on comparisons of intra- and inter- lineage shared gene content, genome-wide average nucleotide identity, as well as detected gaps in the distribution of pairwise differences between genomes. McDonald-Kreitman selection tests identified putative niche-differentiating genes under positive selection that differed across the six well-represented genotypic populations and that may have driven initial divergence. Concurrent with patterns of recombination of discrete populations, recombination analyses of both genic and intergenic regions largely revealed decreased genetic exchange across individual genomes between relative to within populations. Lastly, these findings suggest that discrete double-stranded DNA marine cyanophage populations occur in nature and are maintained by patterns of recombination akin to those observed in bacteria, archaea and in sexual eukaryotes.« less
Analysis of Facultative Lithotroph Distribution and Diversity on Volcanic Deposits by Use of the Large Subunit of Ribulose 1,5-Bisphosphate Carboxylase/Oxygenase†

PubMed Central

Nanba, K.; King, G. M.; Dunfield, K.

2004-01-01

A 492- to 495-bp fragment of the gene coding for the large subunit of the form I ribulose 1,5-bisphosphate carboxylase/oxygenase (RubisCO) (rbcL) was amplified by PCR from facultatively lithotrophic aerobic CO-oxidizing bacteria, colorless and purple sulfide-oxidizing microbial mats, and genomic DNA extracts from tephra and ash deposits from Kilauea volcano, for which atmospheric CO and hydrogen have been previously documented as important substrates. PCR products from the mats and volcanic sites were used to construct rbcL clone libraries. Phylogenetic analyses showed that the rbcL sequences from all isolates clustered with form IC rbcL sequences derived from facultative lithotrophs. In contrast, the microbial mat clone sequences clustered with sequences from obligate lithotrophs representative of form IA rbcL. Clone sequences from volcanic sites fell within the form IC clade, suggesting that these sites were dominated by facultative lithotrophs, an observation consistent with biogeochemical patterns at the sites. Based on phylogenetic and statistical analyses, clone libraries differed significantly among volcanic sites, indicating that they support distinct lithotrophic assemblages. Although some of the clone sequences were similar to known rbcL sequences, most were novel. Based on nucleotide diversity and average pairwise difference, a forested site and an 1894 lava flow were found to support the most diverse and least diverse lithotrophic populations, respectively. These indices of diversity were not correlated with rates of atmospheric CO and hydrogen uptake but were correlated with estimates of respiration and microbial biomass. PMID:15066819
Analysis of facultative lithotroph distribution and diversity on volcanic deposits by use of the large subunit of ribulose 1,5-bisphosphate carboxylase/oxygenase.

PubMed

Nanba, K; King, G M; Dunfield, K

2004-04-01

A 492- to 495-bp fragment of the gene coding for the large subunit of the form I ribulose 1,5-bisphosphate carboxylase/oxygenase (RubisCO) (rbcL) was amplified by PCR from facultatively lithotrophic aerobic CO-oxidizing bacteria, colorless and purple sulfide-oxidizing microbial mats, and genomic DNA extracts from tephra and ash deposits from Kilauea volcano, for which atmospheric CO and hydrogen have been previously documented as important substrates. PCR products from the mats and volcanic sites were used to construct rbcL clone libraries. Phylogenetic analyses showed that the rbcL sequences from all isolates clustered with form IC rbcL sequences derived from facultative lithotrophs. In contrast, the microbial mat clone sequences clustered with sequences from obligate lithotrophs representative of form IA rbcL. Clone sequences from volcanic sites fell within the form IC clade, suggesting that these sites were dominated by facultative lithotrophs, an observation consistent with biogeochemical patterns at the sites. Based on phylogenetic and statistical analyses, clone libraries differed significantly among volcanic sites, indicating that they support distinct lithotrophic assemblages. Although some of the clone sequences were similar to known rbcL sequences, most were novel. Based on nucleotide diversity and average pairwise difference, a forested site and an 1894 lava flow were found to support the most diverse and least diverse lithotrophic populations, respectively. These indices of diversity were not correlated with rates of atmospheric CO and hydrogen uptake but were correlated with estimates of respiration and microbial biomass.
Molecular identification and first description of the male of Neoechinorhynchus schmidti (Acanthocephala: Neoechinorhynchidae), a parasite of Trachemys scripta (Testudines) in México.

PubMed

García-Varela, Martín; García-Prieto, Luís; Rodríguez, Rodolfo Pérez

2011-12-01

The morphology of the males of Neoechinorhynchus schmidti (Acanthocephala: Neoechinorhynchidae) is unknown, because this species was described based exclusively on females. However, recently we collected 2 common slider turtles Trachemys scripta in Centla swamps, Tabasco, Mexico, parasitized by 27 specimens of an acanthocephalan whose females were morphologically identical to N. schmidti. The domains D2 and D3 of the large subunit of the nuclear ribosomal RNA (LSU) of 3 males and 2 females of this material were sequenced. The sequences of both sexes were identical, and based on this result, we described for the first time the morphology of the males of N. schmidti. In addition, 6 sequences of a congeneric species, also parasite of turtles (Neoechinorhynchus emyditoides) were generated in the current research. The 11 sequences of these 2 species were aligned with 13 sequences of another 4 species of the same genus, producing a data set of 24 taxa with 674 nucleotides. The genetic divergence between N. schmidti and N. emyditoides was 4% and intraspecific differences ranged from 0.01 to 0.02%. Pairwise differences between either of these species and 4 other congeners parasitic in fresh and brackish water fishes (Neoechinorhynchus golvani, Neoechinorhynchus roseum, Neoechinorhynchus saginatus, and Neoechinorhynchus sp.) varied from 9.5 to 33%. Maximum likelihood and maximum parsimony analyses show that N. schmidti and N. emyditoides are sister taxa. Bootstrap analysis also indicates that the sister relationship is reliably supported. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
related: an R package for analysing pairwise relatedness from codominant molecular markers.

PubMed

Pew, Jack; Muir, Paul H; Wang, Jinliang; Frasier, Timothy R

2015-05-01

Analyses of pairwise relatedness represent a key component to addressing many topics in biology. However, such analyses have been limited because most available programs provide a means to estimate relatedness based on only a single estimator, making comparison across estimators difficult. Second, all programs to date have been platform specific, working only on a specific operating system. This has the undesirable outcome of making choice of relatedness estimator limited by operating system preference, rather than being based on scientific rationale. Here, we present a new R package, called related, that can calculate relatedness based on seven estimators, can account for genotyping errors, missing data and inbreeding, and can estimate 95% confidence intervals. Moreover, simulation functions are provided that allow for easy comparison of the performance of different estimators and for analyses of how much resolution to expect from a given data set. Because this package works in R, it is platform independent. Combined, this functionality should allow for more appropriate analyses and interpretation of pairwise relatedness and will also allow for the integration of relatedness data into larger R workflows. © 2014 John Wiley & Sons Ltd.
Differential Item Functioning Detection across Two Methods of Defining Group Comparisons: Pairwise and Composite Group Comparisons

ERIC Educational Resources Information Center

Sari, Halil Ibrahim; Huggins, Anne Corinne

2015-01-01

This study compares two methods of defining groups for the detection of differential item functioning (DIF): (a) pairwise comparisons and (b) composite group comparisons. We aim to emphasize and empirically support the notion that the choice of pairwise versus composite group definitions in DIF is a reflection of how one defines fairness in DIF…
Metabolic network prediction through pairwise rational kernels.

PubMed

Roche-Lima, Abiel; Domaratzki, Michael; Fristensky, Brian

2014-09-26

Metabolic networks are represented by the set of metabolic pathways. Metabolic pathways are a series of biochemical reactions, in which the product (output) from one reaction serves as the substrate (input) to another reaction. Many pathways remain incompletely characterized. One of the major challenges of computational biology is to obtain better models of metabolic pathways. Existing models are dependent on the annotation of the genes. This propagates error accumulation when the pathways are predicted by incorrectly annotated genes. Pairwise classification methods are supervised learning methods used to classify new pair of entities. Some of these classification methods, e.g., Pairwise Support Vector Machines (SVMs), use pairwise kernels. Pairwise kernels describe similarity measures between two pairs of entities. Using pairwise kernels to handle sequence data requires long processing times and large storage. Rational kernels are kernels based on weighted finite-state transducers that represent similarity measures between sequences or automata. They have been effectively used in problems that handle large amount of sequence information such as protein essentiality, natural language processing and machine translations. We create a new family of pairwise kernels using weighted finite-state transducers (called Pairwise Rational Kernel (PRK)) to predict metabolic pathways from a variety of biological data. PRKs take advantage of the simpler representations and faster algorithms of transducers. Because raw sequence data can be used, the predictor model avoids the errors introduced by incorrect gene annotations. We then developed several experiments with PRKs and Pairwise SVM to validate our methods using the metabolic network of Saccharomyces cerevisiae. As a result, when PRKs are used, our method executes faster in comparison with other pairwise kernels. Also, when we use PRKs combined with other simple kernels that include evolutionary information, the accuracy values have been improved, while maintaining lower construction and execution times. The power of using kernels is that almost any sort of data can be represented using kernels. Therefore, completely disparate types of data can be combined to add power to kernel-based machine learning methods. When we compared our proposal using PRKs with other similar kernel, the execution times were decreased, with no compromise of accuracy. We also proved that by combining PRKs with other kernels that include evolutionary information, the accuracy can also also be improved. As our proposal can use any type of sequence data, genes do not need to be properly annotated, avoiding accumulation errors because of incorrect previous annotations.
Testing hypotheses for differences between linear regression lines

Treesearch

Stanley J. Zarnoch

2009-01-01

Five hypotheses are identified for testing differences between simple linear regression lines. The distinctions between these hypotheses are based on a priori assumptions and illustrated with full and reduced models. The contrast approach is presented as an easy and complete method for testing for overall differences between the regressions and for making pairwise...
Isolation Driven Divergence in Osmoregulation in Galaxias maculatus (Jenyns, 1848) (Actinopterygii: Osmeriformes).

PubMed

Ruiz-Jarabo, Ignacio; González-Wevar, Claudio A; Oyarzún, Ricardo; Fuentes, Juan; Poulin, Elie; Bertrán, Carlos; Vargas-Chacoff, Luis

2016-01-01

Marine species have colonized extreme environments during evolution such as freshwater habitats. The amphidromous teleost fish, Galaxias maculatus is found mainly migrating between estuaries and rivers, but some landlocked populations have been described in lakes formed during the last deglaciation process in the Andes. In the present study we use mtDNA sequences to reconstruct the historical scenario of colonization of such a lake and evaluated the osmoregulatory shift associated to changes in habitat and life cycle between amphidromous and landlocked populations. Standard diversity indices including the average number of nucleotide differences (Π) and the haplotype diversity index (H) indicated that both populations were, as expected, genetically distinctive, being the landlocked population less diverse than the diadromous one. Similarly, pairwise GST and NST comparison detected statistically significant differences between both populations, while genealogy of haplotypes evidenced a recent founder effect from the diadromous stock, followed by an expansion process in the lake. To test for physiological differences, individuals of both populations were challenged with a range of salinities from 0 to 30 ppt for 8 days following a period of progressive acclimation. The results showed that the landlocked population had a surprisingly wider tolerance to salinity, as landlocked fish survival was 100% from 0 to 20 ppt, whereas diadromous fish survival was 100% only from 10 to 15 ppt. The activity of ATPase enzymes, including Na+/K+-ATPase (NKA), and H+-ATPase (HA) was measured in gills and intestine. Activity differences were detected between the populations at the lowest salinities, including differences in ATPases other than NKA and HA. Population differences in mortality are not reflected in enzyme activity differences, suggesting divergence in other processes. These results clearly demonstrate the striking adaptive changes of G. maculatus osmoregulatory system, especially at hyposmotic environments, associated to a drastic shift in habitat and life cycle at a scale of a few thousand years.
Isolation Driven Divergence in Osmoregulation in Galaxias maculatus (Jenyns, 1848) (Actinopterygii: Osmeriformes)

PubMed Central

Ruiz-Jarabo, Ignacio; Oyarzún, Ricardo; Fuentes, Juan; Poulin, Elie; Bertrán, Carlos; Vargas-Chacoff, Luis

2016-01-01

Background Marine species have colonized extreme environments during evolution such as freshwater habitats. The amphidromous teleost fish, Galaxias maculatus is found mainly migrating between estuaries and rivers, but some landlocked populations have been described in lakes formed during the last deglaciation process in the Andes. In the present study we use mtDNA sequences to reconstruct the historical scenario of colonization of such a lake and evaluated the osmoregulatory shift associated to changes in habitat and life cycle between amphidromous and landlocked populations. Results Standard diversity indices including the average number of nucleotide differences (Π) and the haplotype diversity index (H) indicated that both populations were, as expected, genetically distinctive, being the landlocked population less diverse than the diadromous one. Similarly, pairwise GST and NST comparison detected statistically significant differences between both populations, while genealogy of haplotypes evidenced a recent founder effect from the diadromous stock, followed by an expansion process in the lake. To test for physiological differences, individuals of both populations were challenged with a range of salinities from 0 to 30 ppt for 8 days following a period of progressive acclimation. The results showed that the landlocked population had a surprisingly wider tolerance to salinity, as landlocked fish survival was 100% from 0 to 20 ppt, whereas diadromous fish survival was 100% only from 10 to 15 ppt. The activity of ATPase enzymes, including Na+/K+-ATPase (NKA), and H+-ATPase (HA) was measured in gills and intestine. Activity differences were detected between the populations at the lowest salinities, including differences in ATPases other than NKA and HA. Population differences in mortality are not reflected in enzyme activity differences, suggesting divergence in other processes. Conclusions These results clearly demonstrate the striking adaptive changes of G. maculatus osmoregulatory system, especially at hyposmotic environments, associated to a drastic shift in habitat and life cycle at a scale of a few thousand years. PMID:27168069
Exact p-values for pairwise comparison of Friedman rank sums, with application to comparing classifiers.

PubMed

Eisinga, Rob; Heskes, Tom; Pelzer, Ben; Te Grotenhuis, Manfred

2017-01-25

The Friedman rank sum test is a widely-used nonparametric method in computational biology. In addition to examining the overall null hypothesis of no significant difference among any of the rank sums, it is typically of interest to conduct pairwise comparison tests. Current approaches to such tests rely on large-sample approximations, due to the numerical complexity of computing the exact distribution. These approximate methods lead to inaccurate estimates in the tail of the distribution, which is most relevant for p-value calculation. We propose an efficient, combinatorial exact approach for calculating the probability mass distribution of the rank sum difference statistic for pairwise comparison of Friedman rank sums, and compare exact results with recommended asymptotic approximations. Whereas the chi-squared approximation performs inferiorly to exact computation overall, others, particularly the normal, perform well, except for the extreme tail. Hence exact calculation offers an improvement when small p-values occur following multiple testing correction. Exact inference also enhances the identification of significant differences whenever the observed values are close to the approximate critical value. We illustrate the proposed method in the context of biological machine learning, were Friedman rank sum difference tests are commonly used for the comparison of classifiers over multiple datasets. We provide a computationally fast method to determine the exact p-value of the absolute rank sum difference of a pair of Friedman rank sums, making asymptotic tests obsolete. Calculation of exact p-values is easy to implement in statistical software and the implementation in R is provided in one of the Additional files and is also available at http://www.ru.nl/publish/pages/726696/friedmanrsd.zip .
Caveats for the spatial arrangement method: Comment on Hout, Goldinger, and Ferguson (2013).

PubMed

Verheyen, Steven; Voorspoels, Wouter; Vanpaemel, Wolf; Storms, Gert

2016-03-01

The gold standard among proximity data collection methods for multidimensional scaling is the (dis)similarity rating of pairwise presented stimuli. A drawback of the pairwise method is its lengthy duration, which may cause participants to change their strategy over time, become fatigued, or disengage altogether. Hout, Goldinger, and Ferguson (2013) recently made a case for the Spatial Arrangement Method (SpAM) as an alternative to the pairwise method, arguing that it is faster and more engaging. SpAM invites participants to directly arrange stimuli on a computer screen such that the interstimuli distances are proportional to psychological proximity. Based on a reanalysis of the Hout et al. (2013), data we identify three caveats for SpAM. An investigation of the distributional characteristics of the SpAM proximity data reveals that the spatial nature of SpAM imposes structure on the data, invoking a bias against featural representations. Individual-differences scaling of the SpAM proximity data reveals that the two-dimensional nature of SpAM allows individuals to only communicate two dimensions of variation among stimuli properly, invoking a bias against high-dimensional scaling representations. Monte Carlo simulations indicate that in order to obtain reliable estimates of the group average, SpAM requires more individuals to be tested. We conclude with an overview of considerations that can inform the choice between SpAM and the pairwise method and offer suggestions on how to overcome their respective limitations. (c) 2016 APA, all rights reserved).
Dietary nitrogen alters codon bias and genome composition in parasitic microorganisms.

PubMed

Seward, Emily A; Kelly, Steven

2016-11-15

Genomes are composed of long strings of nucleotide monomers (A, C, G and T) that are either scavenged from the organism's environment or built from metabolic precursors. The biosynthesis of each nucleotide differs in atomic requirements with different nucleotides requiring different quantities of nitrogen atoms. However, the impact of the relative availability of dietary nitrogen on genome composition and codon bias is poorly understood. Here we show that differential nitrogen availability, due to differences in environment and dietary inputs, is a major determinant of genome nucleotide composition and synonymous codon use in both bacterial and eukaryotic microorganisms. Specifically, low nitrogen availability species use nucleotides that require fewer nitrogen atoms to encode the same genes compared to high nitrogen availability species. Furthermore, we provide a novel selection-mutation framework for the evaluation of the impact of metabolism on gene sequence evolution and show that it is possible to predict the metabolic inputs of related organisms from an analysis of the raw nucleotide sequence of their genes. Taken together, these results reveal a previously hidden relationship between cellular metabolism and genome evolution and provide new insight into how genome sequence evolution can be influenced by adaptation to different diets and environments.
Nucleotide Selectivity in Abiotic RNA Polymerization Reactions.

PubMed

Coari, Kristin M; Martin, Rebecca C; Jain, Kopal; McGown, Linda B

2017-09-01

In order to establish an RNA world on early Earth, the nucleotides must form polymers through chemical rather than biochemical reactions. The polymerization products must be long enough to perform catalytic functions, including self-replication, and to preserve genetic information. These functions depend not only on the length of the polymers, but also on their sequences. To date, studies of abiotic RNA polymerization generally have focused on routes to polymerization of a single nucleotide and lengths of the homopolymer products. Less work has been done the selectivity of the reaction toward incorporation of some nucleotides over others in nucleotide mixtures. Such information is an essential step toward understanding the chemical evolution of RNA. To address this question, in the present work RNA polymerization reactions were performed in the presence of montmorillonite clay catalyst. The nucleotides included the monophosphates of adenosine, cytosine, guanosine, uridine and inosine. Experiments included reactions of mixtures of an imidazole-activated nucleotide (ImpX) with one or more unactivated nucleotides (XMP), of two or more ImpX, and of XMP that were activated in situ in the polymerization reaction itself. The reaction products were analyzed using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) to identify the lengths and nucleotide compositions of the polymerization products. The results show that the extent of polymerization, the degree of heteropolymerization vs. homopolymerization, and the composition of the polymeric products all vary among the different nucleotides and depend upon which nucleotides and how many different nucleotides are present in the mixture.
Nucleotide Selectivity in Abiotic RNA Polymerization Reactions

NASA Astrophysics Data System (ADS)

Coari, Kristin M.; Martin, Rebecca C.; Jain, Kopal; McGown, Linda B.

2017-09-01

In order to establish an RNA world on early Earth, the nucleotides must form polymers through chemical rather than biochemical reactions. The polymerization products must be long enough to perform catalytic functions, including self-replication, and to preserve genetic information. These functions depend not only on the length of the polymers, but also on their sequences. To date, studies of abiotic RNA polymerization generally have focused on routes to polymerization of a single nucleotide and lengths of the homopolymer products. Less work has been done the selectivity of the reaction toward incorporation of some nucleotides over others in nucleotide mixtures. Such information is an essential step toward understanding the chemical evolution of RNA. To address this question, in the present work RNA polymerization reactions were performed in the presence of montmorillonite clay catalyst. The nucleotides included the monophosphates of adenosine, cytosine, guanosine, uridine and inosine. Experiments included reactions of mixtures of an imidazole-activated nucleotide (ImpX) with one or more unactivated nucleotides (XMP), of two or more ImpX, and of XMP that were activated in situ in the polymerization reaction itself. The reaction products were analyzed using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) to identify the lengths and nucleotide compositions of the polymerization products. The results show that the extent of polymerization, the degree of heteropolymerization vs. homopolymerization, and the composition of the polymeric products all vary among the different nucleotides and depend upon which nucleotides and how many different nucleotides are present in the mixture.
A water market simulator considering pair-wise trades between agents

NASA Astrophysics Data System (ADS)

Huskova, I.; Erfani, T.; Harou, J. J.

2012-04-01

In many basins in England no further water abstraction licences are available. Trading water between water rights holders has been recognized as a potentially effective and economically efficient strategy to mitigate increasing scarcity. A screening tool that could assess the potential for trade through realistic simulation of individual water rights holders would help assess the solution's potential contribution to local water management. We propose an optimisation-driven water market simulator that predicts pair-wise trade in a catchment and represents its interaction with natural hydrology and engineered infrastructure. A model is used to emulate licence-holders' willingness to engage in short-term trade transactions. In their simplest form agents are represented using an economic benefit function. The working hypothesis is that trading behaviour can be partially predicted based on differences in marginal values of water over space and time and estimates of transaction costs on pair-wise trades. We discuss the further possibility of embedding rules, norms and preferences of the different water user sectors to more realistically represent the behaviours, motives and constraints of individual licence holders. The potential benefits and limitations of such a social simulation (agent-based) approach is contrasted with our simulator where agents are driven by economic optimization. A case study based on the Dove River Basin (UK) demonstrates model inputs and outputs. The ability of the model to suggest impacts of water rights policy reforms on trading is discussed.

Pair-Wise Trajectory Management-Oceanic (PTM-O) . [Concept of Operations—Version 3.9

NASA Technical Reports Server (NTRS)

Jones, Kenneth M.

2014-01-01

This document describes the Pair-wise Trajectory Management-Oceanic (PTM-O) Concept of Operations (ConOps). Pair-wise Trajectory Management (PTM) is a concept that includes airborne and ground-based capabilities designed to enable and to benefit from, airborne pair-wise distance-monitoring capability. PTM includes the capabilities needed for the controller to issue a PTM clearance that resolves a conflict for a specific pair of aircraft. PTM avionics include the capabilities needed for the flight crew to manage their trajectory relative to specific designated aircraft. Pair-wise Trajectory Management PTM-Oceanic (PTM-O) is a regional specific application of the PTM concept. PTM is sponsored by the National Aeronautics and Space Administration (NASA) Concept and Technology Development Project (part of NASA's Airspace Systems Program). The goal of PTM is to use enhanced and distributed communications and surveillance along with airborne tools to permit reduced separation standards for given aircraft pairs, thereby increasing the capacity and efficiency of aircraft operations at a given altitude or volume of airspace.
A pairwise maximum entropy model accurately describes resting-state human brain networks

PubMed Central

Watanabe, Takamitsu; Hirose, Satoshi; Wada, Hiroyuki; Imai, Yoshio; Machida, Toru; Shirouzu, Ichiro; Konishi, Seiki; Miyashita, Yasushi; Masuda, Naoki

2013-01-01

The resting-state human brain networks underlie fundamental cognitive functions and consist of complex interactions among brain regions. However, the level of complexity of the resting-state networks has not been quantified, which has prevented comprehensive descriptions of the brain activity as an integrative system. Here, we address this issue by demonstrating that a pairwise maximum entropy model, which takes into account region-specific activity rates and pairwise interactions, can be robustly and accurately fitted to resting-state human brain activities obtained by functional magnetic resonance imaging. Furthermore, to validate the approximation of the resting-state networks by the pairwise maximum entropy model, we show that the functional interactions estimated by the pairwise maximum entropy model reflect anatomical connexions more accurately than the conventional functional connectivity method. These findings indicate that a relatively simple statistical model not only captures the structure of the resting-state networks but also provides a possible method to derive physiological information about various large-scale brain networks. PMID:23340410
High-throughput discovery of rare human nucleotide polymorphisms by Ecotilling

PubMed Central

Till, Bradley J.; Zerr, Troy; Bowers, Elisabeth; Greene, Elizabeth A.; Comai, Luca; Henikoff, Steven

2006-01-01

Human individuals differ from one another at only ∼0.1% of nucleotide positions, but these single nucleotide differences account for most heritable phenotypic variation. Large-scale efforts to discover and genotype human variation have been limited to common polymorphisms. However, these efforts overlook rare nucleotide changes that may contribute to phenotypic diversity and genetic disorders, including cancer. Thus, there is an increasing need for high-throughput methods to robustly detect rare nucleotide differences. Toward this end, we have adapted the mismatch discovery method known as Ecotilling for the discovery of human single nucleotide polymorphisms. To increase throughput and reduce costs, we developed a universal primer strategy and implemented algorithms for automated band detection. Ecotilling was validated by screening 90 human DNA samples for nucleotide changes in 5 gene targets and by comparing results to public resequencing data. To increase throughput for discovery of rare alleles, we pooled samples 8-fold and found Ecotilling to be efficient relative to resequencing, with a false negative rate of 5% and a false discovery rate of 4%. We identified 28 new rare alleles, including some that are predicted to damage protein function. The detection of rare damaging mutations has implications for models of human disease. PMID:16893952
Compositions and methods for detecting single nucleotide polymorphisms

DOEpatents

Yeh, Hsin-Chih; Werner, James; Martinez, Jennifer S.

2016-11-22

Described herein are nucleic acid based probes and methods for discriminating and detecting single nucleotide variants in nucleic acid molecules (e.g., DNA). The methods include use of a pair of probes can be used to detect and identify polymorphisms, for example single nucleotide polymorphism in DNA. The pair of probes emit a different fluorescent wavelength of light depending on the association and alignment of the probes when hybridized to a target nucleic acid molecule. Each pair of probes is capable of discriminating at least two different nucleic acid molecules that differ by at least a single nucleotide difference. The methods can probes can be used, for example, for detection of DNA polymorphisms that are indicative of a particular disease or condition.
The tapeworm Atractolytocestus tenuicollis (Cestoda: Caryophyllidea)--a sister species or ancestor of an invasive A. huronensis?

PubMed

Králová-Hromadová, Ivica; Štefka, Jan; Bazsalovicsová, Eva; Bokorová, Silvia; Oros, Mikuláš

2013-10-01

Atractolytocestus tenuicollis (Li, 1964) Xi, Wang, Wu, Gao et Nie, 2009 is a monozoic, non-segmented tapeworm of the order Caryophyllidea, parasitizing exclusively common carp (Cyprinus carpio L.). In the current work, the first molecular data, in particular complete ribosomal internal transcribed spacer 2 (ITS2) and partial mitochondrial cytochrome c oxidase subunit I (cox1) on A. tenuicollis from Niushan Lake, Wuhan, China, are provided. In order to evaluate molecular interrelationships within Atractolytocestus, the data on A. tenuicollis were compared with relevant data on two other congeners, Atractolytocestus huronensis and Atractolytocestus sagittatus. Divergent intragenomic copies (ITS2 paralogues) were detected in the ITS2 ribosomal spacer of A. tenuicollis; the same phenomenon has previously been observed also in two other congeners. ITS2 structure of A. tenuicollis was very similar to that of A. huronensis from Slovakia, USA and UK; overall pairwise sequence identity was 91.7-95.2%. On the other hand, values of sequence identity between A. tenuicollis and A. sagittatus were lower, 69.7-70.9%. Cox1 sequence, analysed in five A. tenuicollis individuals, were 100 % identical and no intraspecific variation was observed. Comparison of A. tenuicollis cox1 with respective sequences of two other Atractolytocestus species showed that the mitochondrial haplotype found in Chinese A. tenuicollis is structurally specific (haplotype 4; Ha4) and differs from all so far determined Atractolytocestus haplotypes (Ha1 and Ha2 for A. huronensis; Ha3 for A. sagittatus). Pairwise sequence identity between A. tenuicollis cox1 haplotype and remaining three haplotypes followed the same pattern as in ITS2. The nucleotide and amino acide (aa) sequence comparison with A. huronensis Ha1 and Ha2 revealed higher sequence identity, 90.3-90.8% (96.9% in aa), while lower values were achieved between A. tenuicollis haplotype and Ha3 of Japanese A. sagittatus-75.2 % (81.9 % in aa). The phylogenetic analyses using cox1, ITS2 and combined cox1 + ITS2 sequences revealed close genetic interrelationship between A. tenuicollis and A. huronensis. Independently of a type of analysis and DNA region used, the topology of obtained trees was always identical; A. tenuicollis formed separate clade with A. huronensis forming a closely related sister group.
Identification of a Herbal Powder by Deoxyribonucleic Acid Barcoding and Structural Analyses.

PubMed

Sheth, Bhavisha P; Thaker, Vrinda S

2015-10-01

Authentic identification of plants is essential for exploiting their medicinal properties as well as to stop the adulteration and malpractices with the trade of the same. To identify a herbal powder obtained from a herbalist in the local vicinity of Rajkot, Gujarat, using deoxyribonucleic acid (DNA) barcoding and molecular tools. The DNA was extracted from a herbal powder and selected Cassia species, followed by the polymerase chain reaction (PCR) and sequencing of the rbcL barcode locus. Thereafter the sequences were subjected to National Center for Biotechnology Information (NCBI) basic local alignment search tool (BLAST) analysis, followed by the protein three-dimension structure determination of the rbcL protein from the herbal powder and Cassia species namely Cassia fistula, Cassia tora and Cassia javanica (sequences obtained in the present study), Cassia Roxburghii, and Cassia abbreviata (sequences retrieved from Genbank). Further, the multiple and pairwise structural alignment were carried out in order to identify the herbal powder. The nucleotide sequences obtained from the selected species of Cassia were submitted to Genbank (Accession No. JX141397, JX141405, JX141420). The NCBI BLAST analysis of the rbcL protein from the herbal powder showed an equal sequence similarity (with reference to different parameters like E value, maximum identity, total score, query coverage) to C. javanica and C. roxburghii. In order to solve the ambiguities of the BLAST result, a protein structural approach was implemented. The protein homology models obtained in the present study were submitted to the protein model database (PM0079748-PM0079753). The pairwise structural alignment of the herbal powder (as template) and C. javanica and C. roxburghii (as targets individually) revealed a close similarity of the herbal powder with C. javanica. A strategy as used here, incorporating the integrated use of DNA barcoding and protein structural analyses could be adopted, as a novel rapid and economic procedure, especially in cases when protein coding loci are considered. Authentic identification of plants is essential for exploiting their medicinal properties as well as to stop the adulteration and malpractices with the trade of the same. A herbal powder was obtained from a herbalist in the local vicinity of Rajkot, Gujarat. An integrated approach using DNA barcoding and structural analyses was carried out to identify the herbal powder. The herbal powder was identified as Cassia javanica L.
Why rate when you could compare? Using the "EloChoice" package to assess pairwise comparisons of perceived physical strength.

PubMed

Clark, Andrew P; Howard, Kate L; Woods, Andy T; Penton-Voak, Ian S; Neumann, Christof

2018-01-01

We introduce "EloChoice", a package for R which uses Elo rating to assess pairwise comparisons between stimuli in order to measure perceived stimulus characteristics. To demonstrate the package and compare results from forced choice pairwise comparisons to those from more standard single stimulus rating tasks using Likert (or Likert-type) items, we investigated perceptions of physical strength from images of male bodies. The stimulus set comprised images of 82 men standing on a raised platform with minimal clothing. Strength-related anthropometrics and grip strength measurements were available for each man in the set. UK laboratory participants (Study 1) and US online participants (Study 2) viewed all images in both a Likert rating task, to collect mean Likert scores, and a pairwise comparison task, to calculate Elo, mean Elo (mElo), and Bradley-Terry scores. Within both studies, Likert, Elo and Bradley-Terry scores were closely correlated to mElo scores (all rs > 0.95), and all measures were correlated with stimulus grip strength (all rs > 0.38) and body size (all rs > 0.59). However, mElo scores were less variable than Elo scores and were hundreds of times quicker to compute than Bradley-Terry scores. Responses in pairwise comparison trials were 2/3 quicker than in Likert tasks, indicating that participants found pairwise comparisons to be easier. In addition, mElo scores generated from a data set with half the participants randomly excluded produced very comparable results to those produced with Likert scores from the full participant set, indicating that researchers require fewer participants when using pairwise comparisons.
A general transformation to canonical form for potentials in pairwise interatomic interactions.

PubMed

Walton, Jay R; Rivera-Rivera, Luis A; Lucchese, Robert R; Bevan, John W

2015-06-14

A generalized formulation of explicit force-based transformations is introduced to investigate the concept of a canonical potential in both fundamental chemical and intermolecular bonding. Different classes of representative ground electronic state pairwise interatomic interactions are referenced to a chosen canonical potential illustrating application of such transformations. Specifically, accurately determined potentials of the diatomic molecules H2, H2(+), HF, LiH, argon dimer, and one-dimensional dissociative coordinates in Ar-HBr, OC-HF, and OC-Cl2 are investigated throughout their bound potentials. Advantages of the current formulation for accurately evaluating equilibrium dissociation energies and a fundamentally different unified perspective on nature of intermolecular interactions will be emphasized. In particular, this canonical approach has significance to previous assertions that there is no very fundamental distinction between van der Waals bonding and covalent bonding or for that matter hydrogen and halogen bonds.
Beyond pairwise strategy updating in the prisoner's dilemma game

NASA Astrophysics Data System (ADS)

Wang, Xiaofeng; Perc, Matjaž; Liu, Yongkui; Chen, Xiaojie; Wang, Long

2012-10-01

In spatial games players typically alter their strategy by imitating the most successful or one randomly selected neighbor. Since a single neighbor is taken as reference, the information stemming from other neighbors is neglected, which begets the consideration of alternative, possibly more realistic approaches. Here we show that strategy changes inspired not only by the performance of individual neighbors but rather by entire neighborhoods introduce a qualitatively different evolutionary dynamics that is able to support the stable existence of very small cooperative clusters. This leads to phase diagrams that differ significantly from those obtained by means of pairwise strategy updating. In particular, the survivability of cooperators is possible even by high temptations to defect and over a much wider uncertainty range. We support the simulation results by means of pair approximations and analysis of spatial patterns, which jointly highlight the importance of local information for the resolution of social dilemmas.
Effectiveness of oral hydration in preventing contrast-induced acute kidney injury in patients undergoing coronary angiography or intervention: a pairwise and network meta-analysis.

PubMed

Zhang, Weidai; Zhang, Jiawei; Yang, Baojun; Wu, Kefei; Lin, Hanfei; Wang, Yanping; Zhou, Lihong; Wang, Huatao; Zeng, Chujuan; Chen, Xiao; Wang, Zhixing; Zhu, Junxing; Songming, Chen

2018-06-01

The effectiveness of oral hydration in preventing contrast-induced acute kidney injury (CI-AKI) in patients undergoing coronary angiography or intervention has not been well established. This study aims to evaluate the efficacy of oral hydration compared with intravenous hydration and other frequently used hydration strategies. PubMed, Embase, Web of Science, and the Cochrane central register of controlled trials were searched from inception to 8 October 2017. To be eligible for analysis, studies had to evaluate the relative efficacy of different prophylactic hydration strategies. We selected and assessed the studies that fulfilled the inclusion criteria and carried out a pairwise and network meta-analysis using RevMan5.2 and Aggregate Data Drug Information System 1.16.8 software. A total of four studies (538 participants) were included in our pairwise meta-analysis and 1754 participants from eight studies with four frequently used hydration strategies were included in a network meta-analysis. Pairwise meta-analysis indicated that oral hydration was as effective as intravenous hydration for the prevention of CI-AKI (5.88 vs. 8.43%; odds ratio: 0.73; 95% confidence interval: 0.36-1.47; P>0.05), with no significant heterogeneity between studies. Network meta-analysis showed that there was no significant difference in the prevention of CI-AKI. However, the rank probability plot suggested that oral plus intravenous hydration had a higher probability (51%) of being the best strategy, followed by diuretic plus intravenous hydration (39%) and oral hydration alone (10%). Intravenous hydration alone was the strategy with the highest probability (70%) of being the worst hydration strategy. Our study shows that oral hydration is not inferior to intravenous hydration for the prevention of CI-AKI in patients with normal or mild-to-moderate renal dysfunction undergoing coronary angiography or intervention.
Bioinformatic mining of EST-SSR loci in the Pacific oyster, Crassostrea gigas.

PubMed

Wang, Y; Ren, R; Yu, Z

2008-06-01

A set of expressed sequence tag-simple sequence repeat (EST-SSR) markers of the Pacific oyster, Crassostrea gigas, was developed through bioinformatic mining of the GenBank public database. As of June 30, 2007, a total of 5132 EST sequences from GenBank were downloaded and screened for di-, tri- and tetra-nucleotide repeats, with criteria set at a minimum of 5, 4 and 4 repeats for the three categories of SSRs respectively. Seventeen polymorphic microsatellite markers were characterized. Allele numbers ranged from 3 to 10, and the observed and expected heterozygosity values varied from 0.125 to 0.770 and from 0.113 to 0.732 respectively. Eleven loci were at Hardy-Weinberg equilibrium (HWE); the other six loci showed significant departure from HWE (P < 0.01), suggesting possible presence of null alleles. Pairwise check of linkage disequilibrium (LD) indicated that 11 of 136 pairs of loci showed significant LD (P < 0.01), likely due to HWE present in single markers. Cross-species amplification was examined for five other Crassostrea species and reasonable results were obtained, promising usefulness of these markers in oyster genetics.
Weak genetic differentiation in cobia, Rachycentron canadum from Indian waters as inferred from mitochondrial DNA ATPase 6 and 8 genes.

PubMed

Joy, Linu; Mohitha, C; Divya, P R; Gopalakrishnan, A; Basheer, V S; Jena, J K

2016-07-01

Cobia, Rachycentron canadum, is an economically important migratory fish distributed in tropical waters worldwide and is a candidate fish species for aquaculture practices. The genetic stock structure of R. canadum distributed along the Indian waters was identified using mitochondrial ATPase 6 and 8 genes. A total of 842 bp sequence of ATPase 6/8 genes obtained in this study revealed 15 haplotypes with mean low nucleotide diversity (π = 0.001) and high haplotype diversity (h = 0.785). AMOVA indicated the genetic differentiation of 90.47% for individuals within the population. This is well supported by co-efficient of genetic differentiation (FST) values obtained for pairwise populations that were low and non-significant with an overall value of 0.002. The parsimony network tree revealed star-like phylogeny and all the haplotypes were connected with each other by single mutational event. The findings of the present study indicated the panmixia nature of the species which can be managed as a unit stock in Indian waters.
LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants.

PubMed

Machiela, Mitchell J; Chanock, Stephen J

2015-11-01

Assessing linkage disequilibrium (LD) across ancestral populations is a powerful approach for investigating population-specific genetic structure as well as functionally mapping regions of disease susceptibility. Here, we present LDlink, a web-based collection of bioinformatic modules that query single nucleotide polymorphisms (SNPs) in population groups of interest to generate haplotype tables and interactive plots. Modules are designed with an emphasis on ease of use, query flexibility, and interactive visualization of results. Phase 3 haplotype data from the 1000 Genomes Project are referenced for calculating pairwise metrics of LD, searching for proxies in high LD, and enumerating all observed haplotypes. LDlink is tailored for investigators interested in mapping common and uncommon disease susceptibility loci by focusing on output linking correlated alleles and highlighting putative functional variants. LDlink is a free and publically available web tool which can be accessed at http://analysistools.nci.nih.gov/LDlink/. mitchell.machiela@nih.gov. Published by Oxford University Press 2015. This work is written by US Government employees and is in the public domain in the US.
The complete genome sequence of the Atlantic salmon paramyxovirus (ASPV)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nylund, Stian; Karlsen, Marius; Nylund, Are

2008-03-30

The complete RNA genome of the Atlantic salmon paramyxovirus (ASPV), isolated from Atlantic salmon suffering from proliferative gill inflammation (PGI), has been determined. The genome is 16,965 nucleotides in length and consists of six nonoverlapping genes in the order 3'- N - P/C/V - M - F - HN - L -5', coding for the nucleocapsid, phospho-, matrix, fusion, hemagglutinin-neuraminidase and large polymerase proteins, respectively. The gene junctions contain highly conserved transcription start and stop signal sequences and trinucleotide intergenic regions similar to those of other Paramyxoviridae. The ASPV P-gene expression strategy is like that of the respiro- and morbilliviruses,more » which express the phosphoprotein from the primary transcript, and edit a portion of the mRNA to encode the accessory proteins V and W. It also encodes the C-protein by ribosomal choice of translation initiation. Pairwise comparisons of amino acid identities, and phylogenetic analysis of deduced ASPV protein sequences with homologous sequences from other Paramyxoviridae, show that ASPV has an affinity for the genus Respirovirus, but may represent a new genus within the subfamily Paramyxovirinae.« less
Transcription Factor Map Alignment of Promoter Regions

PubMed Central

Blanco, Enrique; Messeguer, Xavier; Smith, Temple F; Guigó, Roderic

2006-01-01

We address the problem of comparing and characterizing the promoter regions of genes with similar expression patterns. This remains a challenging problem in sequence analysis, because often the promoter regions of co-expressed genes do not show discernible sequence conservation. In our approach, thus, we have not directly compared the nucleotide sequence of promoters. Instead, we have obtained predictions of transcription factor binding sites, annotated the predicted sites with the labels of the corresponding binding factors, and aligned the resulting sequences of labels—to which we refer here as transcription factor maps (TF-maps). To obtain the global pairwise alignment of two TF-maps, we have adapted an algorithm initially developed to align restriction enzyme maps. We have optimized the parameters of the algorithm in a small, but well-curated, collection of human–mouse orthologous gene pairs. Results in this dataset, as well as in an independent much larger dataset from the CISRED database, indicate that TF-map alignments are able to uncover conserved regulatory elements, which cannot be detected by the typical sequence alignments. PMID:16733547
Prokaryotic Nucleotide Composition Is Shaped by Both Phylogeny and the Environment

DOE PAGES

Reichenberger, Erin R.; Rosen, Gail; Hershberg, Uri; ...

2015-04-09

Here, the causes of the great variation in nucleotide composition of prokaryotic genomes have long been disputed. Here, we use extensive metagenomic and whole-genome data to demonstrate that both phylogeny and the environment shape prokaryotic nucleotide content. We show that across environments, various phyla are characterized by different mean guanine and cytosine (GC) values as well as by the extent of variation on that mean value. At the same time, we show that GC-content varies greatly as a function of environment, in a manner that cannot be entirely explained by disparities in phylogenetic composition. We find environmentally driven differences inmore » nucleotide content not only between highly diverged environments (e.g., soil, vs. aquatic vs. human gut) but also within a single type of environment. More specifically, we demonstrate that some human guts are associated with a microbiome that is consistently more GC-rich across phyla, whereas others are associated with a more AT-rich microbiome. These differences appear to be driven both by variations in phylogenetic composition and by environmental differences—which are independent of these phylogenetic composition differences. Combined, our results demonstrate that both phylogeny and the environment significantly affect nucleotide composition and that the environmental differences affecting nucleotide composition are far subtler than previously appreciated.« less
Development and application of a 6.5 million feature Affymetrix Genechip® for massively parallel discovery of single position polymorphisms in lettuce (Lactuca spp.)

PubMed Central

2012-01-01

Background High-resolution genetic maps are needed in many crops to help characterize the genetic diversity that determines agriculturally important traits. Hybridization to microarrays to detect single feature polymorphisms is a powerful technique for marker discovery and genotyping because of its highly parallel nature. However, microarrays designed for gene expression analysis rarely provide sufficient gene coverage for optimal detection of nucleotide polymorphisms, which limits utility in species with low rates of polymorphism such as lettuce (Lactuca sativa). Results We developed a 6.5 million feature Affymetrix GeneChip® for efficient polymorphism discovery and genotyping, as well as for analysis of gene expression in lettuce. Probes on the microarray were designed from 26,809 unigenes from cultivated lettuce and an additional 8,819 unigenes from four related species (L. serriola, L. saligna, L. virosa and L. perennis). Where possible, probes were tiled with a 2 bp stagger, alternating on each DNA strand; providing an average of 187 probes covering approximately 600 bp for each of over 35,000 unigenes; resulting in up to 13 fold redundancy in coverage per nucleotide. We developed protocols for hybridization of genomic DNA to the GeneChip® and refined custom algorithms that utilized coverage from multiple, high quality probes to detect single position polymorphisms in 2 bp sliding windows across each unigene. This allowed us to detect greater than 18,000 polymorphisms between the parental lines of our core mapping population, as well as numerous polymorphisms between cultivated lettuce and wild species in the lettuce genepool. Using marker data from our diversity panel comprised of 52 accessions from the five species listed above, we were able to separate accessions by species using both phylogenetic and principal component analyses. Additionally, we estimated the diversity between different types of cultivated lettuce and distinguished morphological types. Conclusion By hybridizing genomic DNA to a custom oligonucleotide array designed for maximum gene coverage, we were able to identify polymorphisms using two approaches for pair-wise comparisons, as well as a highly parallel method that compared all 52 genotypes simultaneously. PMID:22583801
Development and application of a 6.5 million feature Affymetrix Genechip® for massively parallel discovery of single position polymorphisms in lettuce (Lactuca spp.).

PubMed

Stoffel, Kevin; van Leeuwen, Hans; Kozik, Alexander; Caldwell, David; Ashrafi, Hamid; Cui, Xinping; Tan, Xiaoping; Hill, Theresa; Reyes-Chin-Wo, Sebastian; Truco, Maria-Jose; Michelmore, Richard W; Van Deynze, Allen

2012-05-14

High-resolution genetic maps are needed in many crops to help characterize the genetic diversity that determines agriculturally important traits. Hybridization to microarrays to detect single feature polymorphisms is a powerful technique for marker discovery and genotyping because of its highly parallel nature. However, microarrays designed for gene expression analysis rarely provide sufficient gene coverage for optimal detection of nucleotide polymorphisms, which limits utility in species with low rates of polymorphism such as lettuce (Lactuca sativa). We developed a 6.5 million feature Affymetrix GeneChip® for efficient polymorphism discovery and genotyping, as well as for analysis of gene expression in lettuce. Probes on the microarray were designed from 26,809 unigenes from cultivated lettuce and an additional 8,819 unigenes from four related species (L. serriola, L. saligna, L. virosa and L. perennis). Where possible, probes were tiled with a 2 bp stagger, alternating on each DNA strand; providing an average of 187 probes covering approximately 600 bp for each of over 35,000 unigenes; resulting in up to 13 fold redundancy in coverage per nucleotide. We developed protocols for hybridization of genomic DNA to the GeneChip® and refined custom algorithms that utilized coverage from multiple, high quality probes to detect single position polymorphisms in 2 bp sliding windows across each unigene. This allowed us to detect greater than 18,000 polymorphisms between the parental lines of our core mapping population, as well as numerous polymorphisms between cultivated lettuce and wild species in the lettuce genepool. Using marker data from our diversity panel comprised of 52 accessions from the five species listed above, we were able to separate accessions by species using both phylogenetic and principal component analyses. Additionally, we estimated the diversity between different types of cultivated lettuce and distinguished morphological types. By hybridizing genomic DNA to a custom oligonucleotide array designed for maximum gene coverage, we were able to identify polymorphisms using two approaches for pair-wise comparisons, as well as a highly parallel method that compared all 52 genotypes simultaneously.
Intercenter Differences in Bronchopulmonary Dysplasia or Death Among Very Low Birth Weight Infants

PubMed Central

Walsh, Michele; Bobashev, Georgiy; Das, Abhik; Levine, Burton; Carlo, Waldemar A.; Higgins, Rosemary D.

2011-01-01

OBJECTIVES: To determine (1) the magnitude of clustering of bronchopulmonary dysplasia (36 weeks) or death (the outcome) across centers of the Eunice Kennedy Shriver National Institute of Child and Human Development National Research Network, (2) the infant-level variables associated with the outcome and estimate their clustering, and (3) the center-specific practices associated with the differences and build predictive models. METHODS: Data on neonates with a birth weight of <1250 g from the cluster-randomized benchmarking trial were used to determine the magnitude of clustering of the outcome according to alternating logistic regression by using pairwise odds ratio and predictive modeling. Clinical variables associated with the outcome were identified by using multivariate analysis. The magnitude of clustering was then evaluated after correction for infant-level variables. Predictive models were developed by using center-specific and infant-level variables for data from 2001 2004 and projected to 2006. RESULTS: In 2001–2004, clustering of bronchopulmonary dysplasia/death was significant (pairwise odds ratio: 1.3; P < .001) and increased in 2006 (pairwise odds ratio: 1.6; overall incidence: 52%; range across centers: 32%–74%); center rates were relatively stable over time. Variables that varied according to center and were associated with increased risk of outcome included lower body temperature at NICU admission, use of prophylactic indomethacin, specific drug therapy on day 1, and lack of endotracheal intubation. Center differences remained significant even after correction for clustered variables. CONCLUSION: Bronchopulmonary dysplasia/death rates demonstrated moderate clustering according to center. Clinical variables associated with the outcome were also clustered. Center differences after correction of clustered variables indicate presence of as-yet unmeasured center variables. PMID:21149431
Does technique matter; a pilot study exploring weighting techniques for a multi-criteria decision support framework.

PubMed

van Til, Janine; Groothuis-Oudshoorn, Catharina; Lieferink, Marijke; Dolan, James; Goetghebeur, Mireille

2014-01-01

There is an increased interest in the use of multi-criteria decision analysis (MCDA) to support regulatory and reimbursement decision making. The EVIDEM framework was developed to provide pragmatic multi-criteria decision support in health care, to estimate the value of healthcare interventions, and to aid in priority-setting. The objectives of this study were to test 1) the influence of different weighting techniques on the overall outcome of an MCDA exercise, 2) the discriminative power in weighting different criteria of such techniques, and 3) whether different techniques result in similar weights in weighting the criteria set proposed by the EVIDEM framework. A sample of 60 Dutch and Canadian students participated in the study. Each student used an online survey to provide weights for 14 criteria with two different techniques: a five-point rating scale and one of the following techniques selected randomly: ranking, point allocation, pairwise comparison and best worst scaling. The results of this study indicate that there is no effect of differences in weights on value estimates at the group level. On an individual level, considerable differences in criteria weights and rank order occur as a result of the weight elicitation method used, and the ability of different techniques to discriminate in criteria importance. Of the five techniques tested, the pair-wise comparison of criteria has the highest ability to discriminate in weights when fourteen criteria are compared. When weights are intended to support group decisions, the choice of elicitation technique has negligible impact on criteria weights and the overall value of an innovation. However, when weights are used to support individual decisions, the choice of elicitation technique influences outcome and studies that use dissimilar techniques cannot be easily compared. Weight elicitation through pairwise comparison of criteria is preferred when taking into account its superior ability to discriminate between criteria and respondents' preferences.

Prokaryotic nucleotide composition is shaped by both phylogeny and the environment.

PubMed

Reichenberger, Erin R; Rosen, Gail; Hershberg, Uri; Hershberg, Ruth

2015-04-09

The causes of the great variation in nucleotide composition of prokaryotic genomes have long been disputed. Here, we use extensive metagenomic and whole-genome data to demonstrate that both phylogeny and the environment shape prokaryotic nucleotide content. We show that across environments, various phyla are characterized by different mean guanine and cytosine (GC) values as well as by the extent of variation on that mean value. At the same time, we show that GC-content varies greatly as a function of environment, in a manner that cannot be entirely explained by disparities in phylogenetic composition. We find environmentally driven differences in nucleotide content not only between highly diverged environments (e.g., soil, vs. aquatic vs. human gut) but also within a single type of environment. More specifically, we demonstrate that some human guts are associated with a microbiome that is consistently more GC-rich across phyla, whereas others are associated with a more AT-rich microbiome. These differences appear to be driven both by variations in phylogenetic composition and by environmental differences-which are independent of these phylogenetic composition differences. Combined, our results demonstrate that both phylogeny and the environment significantly affect nucleotide composition and that the environmental differences affecting nucleotide composition are far subtler than previously appreciated. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Performance analysis of model based iterative reconstruction with dictionary learning in transportation security CT

NASA Astrophysics Data System (ADS)

Haneda, Eri; Luo, Jiajia; Can, Ali; Ramani, Sathish; Fu, Lin; De Man, Bruno

2016-05-01

In this study, we implement and compare model based iterative reconstruction (MBIR) with dictionary learning (DL) over MBIR with pairwise pixel-difference regularization, in the context of transportation security. DL is a technique of sparse signal representation using an over complete dictionary which has provided promising results in image processing applications including denoising,1 as well as medical CT reconstruction.2 It has been previously reported that DL produces promising results in terms of noise reduction and preservation of structural details, especially for low dose and few-view CT acquisitions.2 A distinguishing feature of transportation security CT is that scanned baggage may contain items with a wide range of material densities. While medical CT typically scans soft tissues, blood with and without contrast agents, and bones, luggage typically contains more high density materials (i.e. metals and glass), which can produce severe distortions such as metal streaking artifacts. Important factors of security CT are the emphasis on image quality such as resolution, contrast, noise level, and CT number accuracy for target detection. While MBIR has shown exemplary performance in the trade-off of noise reduction and resolution preservation, we demonstrate that DL may further improve this trade-off. In this study, we used the KSVD-based DL3 combined with the MBIR cost-minimization framework and compared results to Filtered Back Projection (FBP) and MBIR with pairwise pixel-difference regularization. We performed a parameter analysis to show the image quality impact of each parameter. We also investigated few-view CT acquisitions where DL can show an additional advantage relative to pairwise pixel difference regularization.
Population genetic data and forensic parameters of 30 autosomal InDel markers in Santa Catarina State population, Southern Brazil.

PubMed

Torres, Sandra Regina Rachadel; Uehara, Clineu Julien Seki; Sutter-Latorre, Ana Frederica; de Almeida, Bibiana Sgorla; Sauerbier, Tania Streck; Muniz, Yara Costa Netto; Marrero, Andrea Rita; de Souza, Ilíada Rainha

2014-08-01

The application of DNA technology in forensic investigations has grown rapidly in the last 25 years and with an exponential increase of short tandem repeats (STRs) data, usually presented as allele frequencies, that may be later used as databases for forensic and population genetics purposes. Thereby, classes of molecular markers such as single nucleotide polymorphisms and insertions/deletions (InDels) have been presented as another option of genetic marker sets. These markers can be used in paternity cases, when mutations in STR polymorphisms are present, as well as in highly degraded DNA analysis. In the present study, the allele frequencies and heterozygosity (H) of a 30 InDel markers set were determined and the forensic efficacy was evaluated through estimation of discrimination power (DP), match probability, typical paternity index and power of paternity exclusion in 108 unrelated volunteers from the State of Santa Catarina (South Brazil). The observed H per locus showed a range between 0.370 and 0.574 (mean = 0.479). HLD128 was the locus with the highest DP (DP = 0.656). DP for all markers combined was greater than 99.9999999999646 % which provides satisfactory levels of information for forensic demands. Genetic comparisons (exact tests of population differentiation and pairwise genetic distances) revealed that the population of Santa Catarina State differs from Korea and USA Afro-American populations but is similar to the Portuguese, German, Polish, Spanish and Basque populations.
Performing monkeys of Bangladesh: characterizing their source and genetic variation.

PubMed

Hasan, M Kamrul; Feeroz, M Mostafa; Jones-Engel, Lisa; Engel, Gregory A; Akhtar, Sharmin; Kanthaswamy, Sree; Smith, David Glenn

2016-04-01

The acquisition and training of monkeys to perform is a centuries-old tradition in South Asia, resulting in a large number of rhesus macaques kept in captivity for this purpose. The performing monkeys are reportedly collected from free-ranging populations, and may escape from their owners or may be released into other populations. In order to determine whether this tradition involving the acquisition and movement of animals has influenced the population structure of free-ranging rhesus macaques in Bangladesh, we first characterized the source of these monkeys. Biological samples from 65 performing macaques collected between January 2010 and August 2013 were analyzed for genetic variation using 716 base pairs of mitochondrial DNA. Performing monkey sequences were compared with those of free-ranging rhesus macaque populations in Bangladesh, India and Myanmar. Forty-five haplotypes with 116 (16 %) polymorphic nucleotide sites were detected among the performing monkeys. As for the free-ranging rhesus population, most of the substitutions (89 %) were transitions, and no indels (insertion/deletion) were observed. The estimate of the mean number of pair-wise differences for the performing monkey population was 10.1264 ± 4.686, compared to 14.076 ± 6.363 for the free-ranging population. Fifteen free-ranging rhesus macaque populations were identified as the source of performing monkeys in Bangladesh; several of these populations were from areas where active provisioning has resulted in a large number of macaques. The collection of performing monkeys from India was also evident.
Multiple major disease-associated clones of Legionella pneumophila have emerged recently and independently

PubMed Central

David, Sophia; Rusniok, Christophe; Mentasti, Massimo; Gomez-Valero, Laura; Harris, Simon R.; Lechat, Pierre; Lees, John; Ginevra, Christophe; Glaser, Philippe; Ma, Laurence; Bouchier, Christiane; Underwood, Anthony; Jarraud, Sophie; Harrison, Timothy G.; Parkhill, Julian; Buchrieser, Carmen

2016-01-01

Legionella pneumophila is an environmental bacterium and the leading cause of Legionnaires’ disease. Just five sequence types (ST), from more than 2000 currently described, cause nearly half of disease cases in northwest Europe. Here, we report the sequence and analyses of 364 L. pneumophila genomes, including 337 from the five disease-associated STs and 27 representative of the species diversity. Phylogenetic analyses revealed that the five STs have independent origins within a highly diverse species. The number of de novo mutations is extremely low with maximum pairwise single-nucleotide polymorphisms (SNPs) ranging from 19 (ST47) to 127 (ST1), which suggests emergences within the last century. Isolates sampled geographically far apart differ by only a few SNPs, demonstrating rapid dissemination. These five STs have been recombining recently, leading to a shared pool of allelic variants potentially contributing to their increased disease propensity. The oldest clone, ST1, has spread globally; between 1940 and 2000, four new clones have emerged in Europe, which show long-distance, rapid dispersal. That a large proportion of clinical cases is caused by recently emerged and internationally dispersed clones, linked by convergent evolution, is surprising for an environmental bacterium traditionally considered to be an opportunistic pathogen. To simultaneously explain recent emergence, rapid spread and increased disease association, we hypothesize that these STs have adapted to new man-made environmental niches, which may be linked by human infection and transmission. PMID:27662900
Performing monkeys of Bangladesh: characterizing their source and genetic variation

PubMed Central

Hasan, M Kamrul; Feeroz, M Mostafa; Jones-Engel, Lisa; Engel, Gregory A; Akhtar, Sharmin; Kanthaswamy, Sree; Smith, David Glenn

2016-01-01

The acquisition and training of monkeys to perform is a century's old tradition in South Asia, resulting in a large number of rhesus macaques kept in captivity for this purpose. The performing monkeys are reportedly collected from free-ranging populations and may escape from their owners or be released into other populations. In order to determine whether this tradition, that involves the acquisition and movement of animals, has influenced the population structure of free-ranging rhesus macaques in Bangladesh we first characterized the source of these monkeys. Biological samples from 65 performing macaques, collected between January 2010 and August 2013 were analyzed for genetic variation using 716 base pairs of mitochondrial DNA. Performing monkey sequences were compared with those of free-ranging rhesus macaque populations in Bangladesh, India and Myanmar. Forty-five haplotypes with 116 (16%) polymorphic nucleotide sites were detected among the performing monkeys. As for the free-ranging rhesus population, most of the substitutions (89%) were transitions and no indels (insertion/deletion) were observed. The estimate of the mean number of pair-wise difference for the performing monkey population was 10.1264 ± 4.686, compared to 14.076 ± 6.363 for the free-ranging population. Fifteen free-ranging rhesus macaque populations were identified as the source of performing monkeys in Bangladesh; several of these populations were from areas where active provisioning has resulted in a large number of macaques. Collection of performing monkeys from India was also evident. PMID:26758818
Multiple major disease-associated clones of Legionella pneumophila have emerged recently and independently.

PubMed

David, Sophia; Rusniok, Christophe; Mentasti, Massimo; Gomez-Valero, Laura; Harris, Simon R; Lechat, Pierre; Lees, John; Ginevra, Christophe; Glaser, Philippe; Ma, Laurence; Bouchier, Christiane; Underwood, Anthony; Jarraud, Sophie; Harrison, Timothy G; Parkhill, Julian; Buchrieser, Carmen

2016-11-01

Legionella pneumophila is an environmental bacterium and the leading cause of Legionnaires' disease. Just five sequence types (ST), from more than 2000 currently described, cause nearly half of disease cases in northwest Europe. Here, we report the sequence and analyses of 364 L. pneumophila genomes, including 337 from the five disease-associated STs and 27 representative of the species diversity. Phylogenetic analyses revealed that the five STs have independent origins within a highly diverse species. The number of de novo mutations is extremely low with maximum pairwise single-nucleotide polymorphisms (SNPs) ranging from 19 (ST47) to 127 (ST1), which suggests emergences within the last century. Isolates sampled geographically far apart differ by only a few SNPs, demonstrating rapid dissemination. These five STs have been recombining recently, leading to a shared pool of allelic variants potentially contributing to their increased disease propensity. The oldest clone, ST1, has spread globally; between 1940 and 2000, four new clones have emerged in Europe, which show long-distance, rapid dispersal. That a large proportion of clinical cases is caused by recently emerged and internationally dispersed clones, linked by convergent evolution, is surprising for an environmental bacterium traditionally considered to be an opportunistic pathogen. To simultaneously explain recent emergence, rapid spread and increased disease association, we hypothesize that these STs have adapted to new man-made environmental niches, which may be linked by human infection and transmission. © 2016 David et al.; Published by Cold Spring Harbor Laboratory Press.
Sex steroid-related genes and male-to-female transsexualism.

PubMed

Henningsson, Susanne; Westberg, Lars; Nilsson, Staffan; Lundström, Bengt; Ekselius, Lisa; Bodlund, Owe; Lindström, Eva; Hellstrand, Monika; Rosmond, Roland; Eriksson, Elias; Landén, Mikael

2005-08-01

Transsexualism is characterised by lifelong discomfort with the assigned sex and a strong identification with the opposite sex. The cause of transsexualism is unknown, but it has been suggested that an aberration in the early sexual differentiation of various brain structures may be involved. Animal experiments have revealed that the sexual differentiation of the brain is mainly due to an influence of testosterone, acting both via androgen receptors (ARs) and--after aromatase-catalyzed conversion to estradiol--via estrogen receptors (ERs). The present study examined the possible importance of three polymorphisms and their pairwise interactions for the development of male-to-female transsexualism: a CAG repeat sequence in the first exon of the AR gene, a tetra nucleotide repeat polymorphism in intron 4 of the aromatase gene, and a CA repeat polymorphism in intron 5 of the ERbeta gene. Subjects were 29 Caucasian male-to-female transsexuals and 229 healthy male controls. Transsexuals differed from controls with respect to the mean length of the ERbeta repeat polymorphism, but not with respect to the length of the other two studied polymorphisms. However, binary logistic regression analysis revealed significant partial effects for all three polymorphisms, as well as for the interaction between the AR and aromatase gene polymorphisms, on the risk of developing transsexualism. Given the small number of transsexuals in the study, the results should be interpreted with the utmost caution. Further study of the putative role of these and other sex steroid-related genes for the development of transsexualism may, however, be worthwhile.
Genetic diversity of three surface protein genes in Plasmodium malariae from three Asian countries.

PubMed

Srisutham, Suttipat; Saralamba, Naowarat; Sriprawat, Kanlaya; Mayxay, Mayfong; Smithuis, Frank; Nosten, Francois; Pukrittayakamee, Sasithon; Day, Nicholas P J; Dondorp, Arjen M; Imwong, Mallika

2018-01-11

Genetic diversity of the three important antigenic proteins, namely thrombospondin-related anonymous protein (TRAP), apical membrane antigen 1 (AMA1), and 6-cysteine protein (P48/45), all of which are found in various developmental stages of Plasmodium parasites is crucial for targeted vaccine development. While studies related to the genetic diversity of these proteins are available for Plasmodium falciparum and Plasmodium vivax, barely enough information exists regarding Plasmodium malariae. The present study aims to demonstrate the genetic variations existing among these three genes in P. malariae by analysing their diversity at nucleotide and protein levels. Three surface protein genes were isolated from 45 samples collected in Thailand (N = 33), Myanmar (N = 8), and Lao PDR (N = 4), using conventional polymerase chain reaction (PCR) assay. Then, the PCR products were sequenced and analysed using BioEdit, MEGA6, and DnaSP programs. The average pairwise nucleotide diversities (π) of P. malariae trap, ama1, and p48/45 were 0.00169, 0.00413, and 0.00029, respectively. The haplotype diversities (Hd) of P. malariae trap, ama1, and p48/45 were 0.919, 0.946, and 0.130, respectively. Most of the nucleotide substitutions were non-synonymous, which indicated that the genetic variations of these genes were maintained by positive diversifying selection, thus, suggesting their role as a potential target of protective immune response. Amino acid substitutions of P. malariae TRAP, AMA1, and P48/45 could be categorized to 17, 20, and 2 unique amino-acid variants, respectively. For further vaccine development, carboxyl terminal of P48/45 would be a good candidate according to conserved amino acid at low genetic diversity (π = 0.2-0.3). High mutational diversity was observed in P. malariae trap and ama1 as compared to p48/45 in P. malariae samples isolated from Thailand, Myanmar, and Lao PDR. Taken together, these results suggest that P48/45 might be a good vaccine candidate against P. malariae infection because of its sufficiently low genetic diversity and highly conserved amino acids especially on the carboxyl end.
Why rate when you could compare? Using the “EloChoice” package to assess pairwise comparisons of perceived physical strength

PubMed Central

Howard, Kate L.; Woods, Andy T.; Penton-Voak, Ian S.; Neumann, Christof

2018-01-01

We introduce “EloChoice”, a package for R which uses Elo rating to assess pairwise comparisons between stimuli in order to measure perceived stimulus characteristics. To demonstrate the package and compare results from forced choice pairwise comparisons to those from more standard single stimulus rating tasks using Likert (or Likert-type) items, we investigated perceptions of physical strength from images of male bodies. The stimulus set comprised images of 82 men standing on a raised platform with minimal clothing. Strength-related anthropometrics and grip strength measurements were available for each man in the set. UK laboratory participants (Study 1) and US online participants (Study 2) viewed all images in both a Likert rating task, to collect mean Likert scores, and a pairwise comparison task, to calculate Elo, mean Elo (mElo), and Bradley-Terry scores. Within both studies, Likert, Elo and Bradley-Terry scores were closely correlated to mElo scores (all rs > 0.95), and all measures were correlated with stimulus grip strength (all rs > 0.38) and body size (all rs > 0.59). However, mElo scores were less variable than Elo scores and were hundreds of times quicker to compute than Bradley-Terry scores. Responses in pairwise comparison trials were 2/3 quicker than in Likert tasks, indicating that participants found pairwise comparisons to be easier. In addition, mElo scores generated from a data set with half the participants randomly excluded produced very comparable results to those produced with Likert scores from the full participant set, indicating that researchers require fewer participants when using pairwise comparisons. PMID:29293615
Random Partition Distribution Indexed by Pairwise Information

PubMed Central

Dahl, David B.; Day, Ryan; Tsai, Jerry W.

2017-01-01

We propose a random partition distribution indexed by pairwise similarity information such that partitions compatible with the similarities are given more probability. The use of pairwise similarities, in the form of distances, is common in some clustering algorithms (e.g., hierarchical clustering), but we show how to use this type of information to define a prior partition distribution for flexible Bayesian modeling. A defining feature of the distribution is that it allocates probability among partitions within a given number of subsets, but it does not shift probability among sets of partitions with different numbers of subsets. Our distribution places more probability on partitions that group similar items yet keeps the total probability of partitions with a given number of subsets constant. The distribution of the number of subsets (and its moments) is available in closed-form and is not a function of the similarities. Our formulation has an explicit probability mass function (with a tractable normalizing constant) so the full suite of MCMC methods may be used for posterior inference. We compare our distribution with several existing partition distributions, showing that our formulation has attractive properties. We provide three demonstrations to highlight the features and relative performance of our distribution. PMID:29276318
Process perspective on image quality evaluation

NASA Astrophysics Data System (ADS)

Leisti, Tuomas; Halonen, Raisa; Kokkonen, Anna; Weckman, Hanna; Mettänen, Marja; Lensu, Lasse; Ritala, Risto; Oittinen, Pirkko; Nyman, Göte

2008-01-01

The psychological complexity of multivariate image quality evaluation makes it difficult to develop general image quality metrics. Quality evaluation includes several mental processes and ignoring these processes and the use of a few test images can lead to biased results. By using a qualitative/quantitative (Interpretation Based Quality, IBQ) methodology, we examined the process of pair-wise comparison in a setting, where the quality of the images printed by laser printer on different paper grades was evaluated. Test image consisted of a picture of a table covered with several objects. Three other images were also used, photographs of a woman, cityscape and countryside. In addition to the pair-wise comparisons, observers (N=10) were interviewed about the subjective quality attributes they used in making their quality decisions. An examination of the individual pair-wise comparisons revealed serious inconsistencies in observers' evaluations on the test image content, but not on other contexts. The qualitative analysis showed that this inconsistency was due to the observers' focus of attention. The lack of easily recognizable context in the test image may have contributed to this inconsistency. To obtain reliable knowledge of the effect of image context or attention on subjective image quality, a qualitative methodology is needed.
Pairwise registration of TLS point clouds using covariance descriptors and a non-cooperative game

NASA Astrophysics Data System (ADS)

Zai, Dawei; Li, Jonathan; Guo, Yulan; Cheng, Ming; Huang, Pengdi; Cao, Xiaofei; Wang, Cheng

2017-12-01

It is challenging to automatically register TLS point clouds with noise, outliers and varying overlap. In this paper, we propose a new method for pairwise registration of TLS point clouds. We first generate covariance matrix descriptors with an adaptive neighborhood size from point clouds to find candidate correspondences, we then construct a non-cooperative game to isolate mutual compatible correspondences, which are considered as true positives. The method was tested on three models acquired by two different TLS systems. Experimental results demonstrate that our proposed adaptive covariance (ACOV) descriptor is invariant to rigid transformation and robust to noise and varying resolutions. The average registration errors achieved on three models are 0.46 cm, 0.32 cm and 1.73 cm, respectively. The computational times cost on these models are about 288 s, 184 s and 903 s, respectively. Besides, our registration framework using ACOV descriptors and a game theoretic method is superior to the state-of-the-art methods in terms of both registration error and computational time. The experiment on a large outdoor scene further demonstrates the feasibility and effectiveness of our proposed pairwise registration framework.
Impaired inference in a case of developmental amnesia.

PubMed

D'Angelo, Maria C; Rosenbaum, R Shayna; Ryan, Jennifer D

2016-10-01

Amnesia is associated with impairments in relational memory, which is critically supported by the hippocampus. By adapting the transitivity paradigm, we previously showed that age-related impairments in inference were mitigated when judgments could be predicated on known pairwise relations, however, such advantages were not observed in the adult-onset amnesic case D.A. Here, we replicate and extend this finding in a developmental amnesic case (N.C.), who also shows impaired relational learning and transitive expression. Unlike D.A., N.C.'s damage affected the extended hippocampal system and diencephalic structures, and does not extend to neocortical areas that are affected in D.A. Critically, despite their differences in etiology and affected structures, N.C. and D.A. perform similarly on the task. N.C. showed intact pairwise knowledge, suggesting that he is able to use existing semantic information, but this semantic knowledge was insufficient to support transitive expression. The present results suggest a critical role for regions connected to the hippocampus and/or medial prefrontal cortex in inference beyond learning of pairwise relations. © 2016 The Authors Hippocampus Published by Wiley Periodicals, Inc. © 2016 The Authors. Wiley Periodicals, Inc.
Detecting and Removing Ascertainment Bias in Microsatellites from the HGDP-CEPH Panel

PubMed Central

Eriksson, Anders; Manica, Andrea

2011-01-01

Although ascertainment bias in single nucleotide polymorphisms is a well-known problem, it is generally accepted that microsatellites have mutation rates too high for bias to be a concern. Here, we analyze in detail the large set of microsatellites typed for the Human Genetic Diversity Panel (HGDP)-CEPH panel. We develop a novel framework based on rarefaction to compare heterozygosity across markers with different mutation rates. We find that, whereas di- and tri-nucleotides show similar patterns of within- and between-population heterozygosity, tetra-nucleotides are inconsistent with the other two motifs. In addition, di- and tri-nucleotides are consistent with 16 unbiased tetra-nucleotide markers, whereas the HPGP-CEPH tetra-nucleotides are significantly different. This discrepancy is due to the HGDP-CEPH tetra-nucleotides being too homogeneous across Eurasia, even after their slower mutation rate is taken into account by rarefying the other markers. The most likely explanation for this pattern is ascertainment bias. We strongly advocate the exclusion of tetra-nucleotides from future population genetics analysis of this dataset, and we argue that other microsatellite datasets should be investigated for the presence of bias using the approach outlined in this article. PMID:22384358
Sequence analysis of the internal transcribed spacer (ITS) region reveals a novel clade of Ichthyophonus sp. from rainbow trout

USGS Publications Warehouse

Rasmussen, C.; Purcell, M.K.; Gregg, J.L.; LaPatra, S.E.; Winton, J.R.; Hershberger, P.K.

2010-01-01

The mesomycetozoean parasite Ichthyophonus hoferi is most commonly associated with marine fish hosts but also occurs in some components of the freshwater rainbow trout Oncorhynchus mykiss aquaculture industry in Idaho, USA. It is not certain how the parasite was introduced into rainbow trout culture, but it might have been associated with the historical practice of feeding raw, ground common carp Cyprinus carpio that were caught by commercial fisherman. Here, we report a major genetic division between west coast freshwater and marine isolates of Ichthyophonus hoferi. Sequence differences were not detected in 2 regions of the highly conserved small subunit (18S) rDNA gene; however, nucleotide variation was seen in internal transcribed spacer loci (ITS1 and ITS2), both within and among the isolates. Intra-isolate variation ranged from 2.4 to 7.6 nucleotides over a region consisting of ~740 bp. Majority consensus sequences from marine/anadromous hosts differed in only 0 to 3 nucleotides (99.6 to 100% nucleotide identity), while those derived from freshwater rainbow trout had no nucleotide substitutions relative to each other. However, the consensus sequences between isolates from freshwater rainbow trout and those from marine/anadromous hosts differed in 13 to 16 nucleotides (97.8 to 98.2% nucleotide identity).
Seasonal changes of nucleotides in mussel (Mytilus galloprovincialis) mantle tissue.

PubMed

Blanco, S L; Suárez, M P; San Juan, F

2006-03-01

Seasonal variations of nucleotides in Mytilus galloprovincialis mantle tissue were analyzed. Separation and quantification was achieved by reversed-phase high-performance liquid chromatography. Total nucleotides show a pronounced seasonal variation with maximum and minimum values in autumn and spring, respectively. Adenine nucleotides accounted for the major part in spring and summer, guanosine and cytidine nucleotides in winter; uridine nucleotides were relatively constant throughout the year. Their inverse variation suggests inter-conversion among them and the maintenance of the potential cell energy in winter by other triphosphate nucleotides different from ATP. These results reflect environmental and nutritional conditions, and also the reserves and gametogenic cycles taking place in M. galloprovincialis mantle tissue.
Smile attractiveness related to buccal corridor space in 3 different facial types: A perception of 3 ethnic groups of Malaysians.

PubMed

Nimbalkar, Smita; Oh, Yih Y; Mok, Reei Y; Tioh, Jing Y; Yew, Kai J; Patil, Pravinkumar G

2018-03-16

Buccal corridor space and its variations greatly influence smile attractiveness. Facial types are different for different ethnic populations, and so is smile attractiveness. The subjective perception of smile attractiveness of different populations may vary in regard to different buccal corridor spaces and facial patterns. The purpose of this study was to determine esthetic perceptions of the Malaysian population regarding the width of buccal corridor spaces and their effect on smile esthetics in individuals with short, normal, and long faces. The image of a smiling individual with a mesofacial face was modified to create 2 different facial types (brachyfacial and dolicofacial). Each face form was further modified into 5 different buccal corridors (2%, 10%, 15%, 22%, and 28%). The images were submitted to 3 different ethnic groups of evaluators (Chinese, Malay, Indian; 100 each), ranging between 17 and 21 years of age. A visual analog scale (50 mm in length) was used for assessment. The scores given to each image were compared with the Kruskal-Wallis test, and pairwise comparison was performed using the Mann-Whitney U test (α=.05). All 3 groups of evaluators could distinguish gradations of dark spaces in the buccal corridor at 2%, 10%, and 28%. Statistically significant differences were observed among 3 groups of evaluators in esthetic perception when pairwise comparisons were performed. A 15% buccal corridor was found to score esthetically equally within 3 face types by all 3 groups of evaluators. The Indian population was more critical in evaluation than the Chinese or Malay populations. In a pairwise comparison, more significant differences were found between long and short faces and the normal face; the normal face was compared with long and short faces separately. The width of the buccal corridor space influences smile attractiveness in different facial types. A medium buccal corridor (15%) is the esthetic characteristic preferred by all groups of evaluators in short, normal, and long face types. Copyright © 2017 Editorial Council for the Journal of Prosthetic Dentistry. Published by Elsevier Inc. All rights reserved.
Extent of Linkage Disequilibrium in the Domestic Cat, Felis silvestris catus, and Its Breeds

PubMed Central

Alhaddad, Hasan; Khan, Razib; Grahn, Robert A.; Gandolfi, Barbara; Mullikin, James C.; Cole, Shelley A.; Gruffydd-Jones, Timothy J.; Häggström, Jens; Lohi, Hannes; Longeri, Maria; Lyons, Leslie A.

2013-01-01

Domestic cats have a unique breeding history and can be used as models for human hereditary and infectious diseases. In the current era of genome-wide association studies, insights regarding linkage disequilibrium (LD) are essential for efficient association studies. The objective of this study is to investigate the extent of LD in the domestic cat, Felis silvestris catus, particularly within its breeds. A custom illumina GoldenGate Assay consisting of 1536 single nucleotide polymorphisms (SNPs) equally divided over ten 1 Mb chromosomal regions was developed, and genotyped across 18 globally recognized cat breeds and two distinct random bred populations. The pair-wise LD descriptive measure (r 2) was calculated between the SNPs in each region and within each population independently. LD decay was estimated by determining the non-linear least-squares of all pair-wise estimates as a function of distance using established models. The point of 50% decay of r2 was used to compare the extent of LD between breeds. The longest extent of LD was observed in the Burmese breed, where the distance at which r2 ≈ 0.25 was ∼380 kb, comparable to several horse and dog breeds. The shortest extent of LD was found in the Siberian breed, with an r2 ≈ 0.25 at approximately 17 kb, comparable to random bred cats and human populations. A comprehensive haplotype analysis was also conducted. The haplotype structure of each region within each breed mirrored the LD estimates. The LD of cat breeds largely reflects the breeds’ population history and breeding strategies. Understanding LD in diverse populations will contribute to an efficient use of the newly developed SNP array for the cat in the design of genome-wide association studies, as well as to the interpretation of results for the fine mapping of disease and phenotypic traits. PMID:23308248
Molecular characterization of a novel orthomyxovirus from rainbow and steelhead trout (Oncorhynchus mykiss)

USGS Publications Warehouse

Batts, William N.; LaPatra, Scott E.; Katona, Ryan; Leis, Eric; Fei Fan Ng, Terry; Bruieuc, Marine S.O.; Breyta, Rachel; Purcell, Maureen; Waltzek, Thomas B.; Delwart, Eric; Winton, James

2017-01-01

A novel virus, rainbow trout orthomyxovirus (RbtOV), was isolated in 1997 and again in 2000 from commercially-reared rainbow trout (Oncorhynchus mykiss) in Idaho, USA. The virus grew optimally in the CHSE-214 cell line at 15°C producing a diffuse cytopathic effect; however, juvenile rainbow trout exposed to cell culture-grown virus showed no mortality or gross pathology. Electron microscopy of preparations from infected cell cultures revealed the presence of typical orthomyxovirus particles. The complete genome of RbtOV is comprised of eight linear segments of single-stranded, negative-sense RNA having highly conserved 5′ and 3′-terminal nucleotide sequences. Another virus isolated in 2014 from steelhead trout (also O. mykiss) in Wisconsin, USA, and designated SttOV was found to have eight genome segments with high amino acid sequence identities (89–99%) to the corresponding genes of RbtOV, suggesting these new viruses are isolates of the same virus species and may be more widespread than currently realized. The new isolates had the same genome segment order and the closest pairwise amino acid sequence identities of 16–42% with Infectious salmon anemia virus (ISAV), the type species and currently only member of the genus Isavirus in the family Orthomyxoviridae. However, pairwise comparisons of the predicted amino acid sequences of the 10 RbtOV and SttOV proteins with orthologs from representatives of the established orthomyxoviral genera and a phylogenetic analysis using the PB1 protein showed that while RbtOV and SttOV clustered most closely with ISAV, they diverged sufficiently to merit consideration as representatives of a novel genus. A set of PCR primers was designed using conserved regions of the PB1 gene to produce amplicons that may be sequenced for identification of similar fish orthomyxoviruses in the future.

Extent of linkage disequilibrium in the domestic cat, Felis silvestris catus, and its breeds.

PubMed

Alhaddad, Hasan; Khan, Razib; Grahn, Robert A; Gandolfi, Barbara; Mullikin, James C; Cole, Shelley A; Gruffydd-Jones, Timothy J; Häggström, Jens; Lohi, Hannes; Longeri, Maria; Lyons, Leslie A

2013-01-01

Domestic cats have a unique breeding history and can be used as models for human hereditary and infectious diseases. In the current era of genome-wide association studies, insights regarding linkage disequilibrium (LD) are essential for efficient association studies. The objective of this study is to investigate the extent of LD in the domestic cat, Felis silvestris catus, particularly within its breeds. A custom illumina GoldenGate Assay consisting of 1536 single nucleotide polymorphisms (SNPs) equally divided over ten 1 Mb chromosomal regions was developed, and genotyped across 18 globally recognized cat breeds and two distinct random bred populations. The pair-wise LD descriptive measure (r(2)) was calculated between the SNPs in each region and within each population independently. LD decay was estimated by determining the non-linear least-squares of all pair-wise estimates as a function of distance using established models. The point of 50% decay of r(2) was used to compare the extent of LD between breeds. The longest extent of LD was observed in the Burmese breed, where the distance at which r(2) ≈ 0.25 was ∼380 kb, comparable to several horse and dog breeds. The shortest extent of LD was found in the Siberian breed, with an r(2) ≈ 0.25 at approximately 17 kb, comparable to random bred cats and human populations. A comprehensive haplotype analysis was also conducted. The haplotype structure of each region within each breed mirrored the LD estimates. The LD of cat breeds largely reflects the breeds' population history and breeding strategies. Understanding LD in diverse populations will contribute to an efficient use of the newly developed SNP array for the cat in the design of genome-wide association studies, as well as to the interpretation of results for the fine mapping of disease and phenotypic traits.
Fast and accurate estimation of the covariance between pairwise maximum likelihood distances.

PubMed

Gil, Manuel

2014-01-01

Pairwise evolutionary distances are a model-based summary statistic for a set of molecular sequences. They represent the leaf-to-leaf path lengths of the underlying phylogenetic tree. Estimates of pairwise distances with overlapping paths covary because of shared mutation events. It is desirable to take these covariance structure into account to increase precision in any process that compares or combines distances. This paper introduces a fast estimator for the covariance of two pairwise maximum likelihood distances, estimated under general Markov models. The estimator is based on a conjecture (going back to Nei & Jin, 1989) which links the covariance to path lengths. It is proven here under a simple symmetric substitution model. A simulation shows that the estimator outperforms previously published ones in terms of the mean squared error.
Fast and accurate estimation of the covariance between pairwise maximum likelihood distances

PubMed Central

2014-01-01

Pairwise evolutionary distances are a model-based summary statistic for a set of molecular sequences. They represent the leaf-to-leaf path lengths of the underlying phylogenetic tree. Estimates of pairwise distances with overlapping paths covary because of shared mutation events. It is desirable to take these covariance structure into account to increase precision in any process that compares or combines distances. This paper introduces a fast estimator for the covariance of two pairwise maximum likelihood distances, estimated under general Markov models. The estimator is based on a conjecture (going back to Nei & Jin, 1989) which links the covariance to path lengths. It is proven here under a simple symmetric substitution model. A simulation shows that the estimator outperforms previously published ones in terms of the mean squared error. PMID:25279263
Cyclic nucleotide binding proteins in the Arabidopsis thaliana and Oryza sativa genomes

PubMed Central

Bridges, Dave; Fraser, Marie E; Moorhead, Greg BG

2005-01-01

Background Cyclic nucleotides are ubiquitous intracellular messengers. Until recently, the roles of cyclic nucleotides in plant cells have proven difficult to uncover. With an understanding of the protein domains which can bind cyclic nucleotides (CNB and GAF domains) we scanned the completed genomes of the higher plants Arabidopsis thaliana (mustard weed) and Oryza sativa (rice) for the effectors of these signalling molecules. Results Our analysis found that several ion channels and a class of thioesterases constitute the possible cyclic nucleotide binding proteins in plants. Contrary to some reports, we found no biochemical or bioinformatic evidence for a plant cyclic nucleotide regulated protein kinase, suggesting that cyclic nucleotide functions in plants have evolved differently than in mammals. Conclusion This paper provides a molecular framework for the discussion of cyclic nucleotide function in plants, and resolves a longstanding debate about the presence of a cyclic nucleotide dependent kinase in plants. PMID:15644130
Genetic Variation within a Lotic Population of Janthinobacterium lividum

PubMed Central

Saeger, Jennifer L.; Hale, Alan B.

1993-01-01

An understanding of the genetic variation within and between populations should allow scientists to address many problems, including those associated with endangered species and the release of genetically modified organisms into the environment. With respect to microorganisms, the release of genetically engineered microorganisms is likely to increase dramatically given the current growth in the bioremediation industry. In this study, genetic variation within a lotic, bacterial population of Janthinobacterium lividum was measured with restriction fragment length polymorphism analysis. Chromosomal DNA from 10 Kettle Creek (Hawk Mountain Sanctuary, Kempton, Pa.) J. lividum isolates was digested with six restriction endonucleases and probed with a 7.5-kb pKK3535 fragment containing the E. coli rrnB rRNA operon. Genetic variation, as measured in terms of nucleotide diversity, was high within the population. The 0.0781 value for genetic variation was especially high given the conservative nature of the genetic probe. The average percent similarity among isolates within the population was 67.25%. Pairwise comparisons of nucleotide diversity values (π) and similarity coefficients (F) yielded values ranging from 0.0032 to 0.1816 and 0.3363 to 0.9808, respectively. Putative clonemates were not present within the group of isolates; however, all isolates shared 14 fragments across a spectrum of six restriction enzymes. The presence of these common fragments indicates that restriction fragment length polymorphism analysis may provide population- or species-specific diagnostic markers for J. lividum. Data that suggest a plume effect with respect to the downstream movement of J. lividum are also presented. An increase in genetic variation within groups of isolates along the longitudinal gradient of Kettle Creek is also suggested. PMID:16348995
Genetic Variation within a Lotic Population of Janthinobacterium lividum.

PubMed

Saeger, J L; Hale, A B

1993-07-01

An understanding of the genetic variation within and between populations should allow scientists to address many problems, including those associated with endangered species and the release of genetically modified organisms into the environment. With respect to microorganisms, the release of genetically engineered microorganisms is likely to increase dramatically given the current growth in the bioremediation industry. In this study, genetic variation within a lotic, bacterial population of Janthinobacterium lividum was measured with restriction fragment length polymorphism analysis. Chromosomal DNA from 10 Kettle Creek (Hawk Mountain Sanctuary, Kempton, Pa.) J. lividum isolates was digested with six restriction endonucleases and probed with a 7.5-kb pKK3535 fragment containing the E. coli rrnB rRNA operon. Genetic variation, as measured in terms of nucleotide diversity, was high within the population. The 0.0781 value for genetic variation was especially high given the conservative nature of the genetic probe. The average percent similarity among isolates within the population was 67.25%. Pairwise comparisons of nucleotide diversity values (pi) and similarity coefficients (F) yielded values ranging from 0.0032 to 0.1816 and 0.3363 to 0.9808, respectively. Putative clonemates were not present within the group of isolates; however, all isolates shared 14 fragments across a spectrum of six restriction enzymes. The presence of these common fragments indicates that restriction fragment length polymorphism analysis may provide population- or species-specific diagnostic markers for J. lividum. Data that suggest a plume effect with respect to the downstream movement of J. lividum are also presented. An increase in genetic variation within groups of isolates along the longitudinal gradient of Kettle Creek is also suggested.
Molecular epizootiology and evolution of the glycoprotein and non-virion protein genes of infectious hematopoietic necrosis virus, a fish rhabdovirus

USGS Publications Warehouse

Nichol, Stuart T.; Rowe, Joan E.; Winton, James R.

1995-01-01

Infectious hematopoietic necrosis virus (IHNV) causes a highly lethal, economically important disease of salmon and trout. The virus is enzootic throughout western North America, and has been spread to Asia and Europe. The nucleotide sequences of the glycoprotein (G) and non-virion (NV) genes of 12 diverse IHNV isolates were determined in order to examine the molecular epizootiology of IHN, the primary structure and conservation of NV, and the evolution of the virus. The G and NV genes and their encoded proteins were highly conserved, with a maximum pairwise nucleotide divergence of 3.6 and 4.4.%, and amino acid divergence of 3.7 and 6.2%, respectively. Conservation of NV protein sequence (111 amino acids in length) confirms that the protein is functional and plays an important role in virus replication. The phylogenetic relationship of viruses was found to correlate with the geographic origin of virus isolates rather than with host species or time of isolation. These data are consistent with stable maintenance of virus in enzootic foci. Two main IHNV genetic lineages were identified; one in the Columbia River Basin (Oregon, Washington and Idaho), the other in the Sacramento River Basin (California). The first major IHNV outbreak in chinook salmon in 1973 in the Columbia River was genetically linked to importation of virus-infected fish eggs from the Sacramento River where outbreaks in chinook salmon are common. However, the introduced virus apparently did not persist, subsequent virus outbreaks in Columbia River chinook salmon being associated with Columbia River genetic lineages. In general, virus monoclonal antibody reactivity profiles and phylogenetic relationships correlated well.
Multilocus sequence analysis of Thermoanaerobacter isolates reveals recombining, but differentiated, populations from geothermal springs of the Uzon Caldera, Kamchatka, Russia

PubMed Central

Wagner, Isaac D.; Varghese, Litty B.; Hemme, Christopher L.; Wiegel, Juergen

2013-01-01

Thermal environments have island-like characteristics and provide a unique opportunity to study population structure and diversity patterns of microbial taxa inhabiting these sites. Strains having ≥98% 16S rRNA gene sequence similarity to the obligately anaerobic Firmicutes Thermoanaerobacter uzonensis were isolated from seven geothermal springs, separated by up to 1600 m, within the Uzon Caldera (Kamchatka, Russian Far East). The intraspecies variation and spatial patterns of diversity for this taxon were assessed by multilocus sequence analysis (MLSA) of 106 strains. Analysis of eight protein-coding loci (gyrB, lepA, leuS, pyrG, recA, recG, rplB, and rpoB) revealed that all loci were polymorphic and that nucleotide substitutions were mostly synonymous. There were 148 variable nucleotide sites across 8003 bp concatenates of the protein-coding loci. While pairwise FST values indicated a small but significant level of genetic differentiation between most subpopulations, there was a negligible relationship between genetic divergence and spatial separation. Strains with the same allelic profile were only isolated from the same hot spring, occasionally from consecutive years, and single locus variant (SLV) sequence types were usually derived from the same spring. While recombination occurred, there was an “epidemic” population structure in which a particular T. uzonensis sequence type rose in frequency relative to the rest of the population. These results demonstrate spatial diversity patterns for an anaerobic bacterial species in a relative small geographic location and reinforce the view that terrestrial geothermal springs are excellent places to look for biogeographic diversity patterns regardless of the involved distances. PMID:23801987
Spontaneous nucleotide exchange in low molecular weight GTPases by fluorescently labeled γ-phosphate-linked GTP analogs

PubMed Central

Korlach, Jonas; Baird, Daniel W.; Heikal, Ahmed A.; Gee, Kyle R.; Hoffman, Gregory R.; Webb, Watt W.

2004-01-01

Regulated guanosine nucleotide exchange and hydrolysis constitute the fundamental activities of low molecular weight GTPases. We show that three guanosine 5′-triphosphate analogs with BODIPY fluorophores coupled via the gamma phosphate bind to the GTPases Cdc42, Rac1, RhoA, and Ras and displace guanosine 5′-diphosphate with high intrinsic exchange rates in the presence of Mg2+ ions, thereby acting as synthetic, low molecular weight guanine nucleotide exchange factors. The accompanying large fluorescence enhancements (as high as 12-fold), caused by a reduction in guanine quenching of the environmentally sensitive BODIPY dye fluorescence on protein binding, allow for real-time monitoring of this spontaneous nucleotide exchange in the visible spectrum with high signal-to-noise ratios. Binding affinities increased with longer aliphatic linkers connecting the nucleotide and BODIPY fluorophore and were in the 10–100 nM range. Steady-state and time-resolved fluorescence spectroscopy showed an inverse relationship between linker length and fluorescence enhancement factors and differences in protein-bound fluorophore mobilities, providing optimization criteria for future applications of such compounds as efficient elicitors and reporters of nucleotide exchange. EDTA markedly enhanced nucleotide exchange, enabling rapid loading of GTPases with these probes. Differences in active site geometries, in the absence of Mg2+, caused qualitatively different reporting of the bound state by the different analogs. The BODIPY analogs also prevented the interaction of Cdc42 with p21 activated kinase. Together, these results validate the use of these analogs as valuable tools for studying GTPase functions and for developing potent synthetic nucleotide exchange factors for this important class of signaling molecules. PMID:14973186
Aspergillus and Penicillium identification using DNA sequences: Barcode or MLST?

USDA-ARS?s Scientific Manuscript database

Current methods in DNA technology can detect single nucleotide polymorphisms with measurable accuracy using several different approaches appropriate for different uses. If there are even single nucleotide differences that are invariant markers of the species, we can accomplish identification through...
The Medicago sativa gene index 1.2: a web-accessible gene expression atlas for investigating expression differences between Medicago sativa subspecies.

PubMed

O'Rourke, Jamie A; Fu, Fengli; Bucciarelli, Bruna; Yang, S Sam; Samac, Deborah A; Lamb, JoAnn F S; Monteros, Maria J; Graham, Michelle A; Gronwald, John W; Krom, Nick; Li, Jun; Dai, Xinbin; Zhao, Patrick X; Vance, Carroll P

2015-07-07

Alfalfa (Medicago sativa L.) is the primary forage legume crop species in the United States and plays essential economic and ecological roles in agricultural systems across the country. Modern alfalfa is the result of hybridization between tetraploid M. sativa ssp. sativa and M. sativa ssp. falcata. Due to its large and complex genome, there are few genomic resources available for alfalfa improvement. A de novo transcriptome assembly from two alfalfa subspecies, M. sativa ssp. sativa (B47) and M. sativa ssp. falcata (F56) was developed using Illumina RNA-seq technology. Transcripts from roots, nitrogen-fixing root nodules, leaves, flowers, elongating stem internodes, and post-elongation stem internodes were assembled into the Medicago sativa Gene Index 1.2 (MSGI 1.2) representing 112,626 unique transcript sequences. Nodule-specific and transcripts involved in cell wall biosynthesis were identified. Statistical analyses identified 20,447 transcripts differentially expressed between the two subspecies. Pair-wise comparisons of each tissue combination identified 58,932 sequences differentially expressed in B47 and 69,143 sequences differentially expressed in F56. Comparing transcript abundance in floral tissues of B47 and F56 identified expression differences in sequences involved in anthocyanin and carotenoid synthesis, which determine flower pigmentation. Single nucleotide polymorphisms (SNPs) unique to each M. sativa subspecies (110,241) were identified. The Medicago sativa Gene Index 1.2 increases the expressed sequence data available for alfalfa by ninefold and can be expanded as additional experiments are performed. The MSGI 1.2 transcriptome sequences, annotations, expression profiles, and SNPs were assembled into the Alfalfa Gene Index and Expression Database (AGED) at http://plantgrn.noble.org/AGED/ , a publicly available genomic resource for alfalfa improvement and legume research.
Two Distinct Patterns of Clostridium Difficile Diversity Across Europe Indicates Contrasting Routes of Spread.

PubMed

Eyre, David W; Davies, Kerrie A; Davis, Georgina; Fawley, Warren N; Dingle, Kate E; De Maio, Nicola; Karas, Andreas; Crook, Derrick W; Peto, Tim E A; Walker, A Sarah; Wilcox, Mark H

2018-04-06

Rates of Clostridium difficile infection vary widely across Europe, as do prevalent ribotypes. The extent of Europe-wide diversity within each ribotype is however unknown. Inpatient diarrhoeal faecal samples submitted on one day in summer and winter (2012-2013) to laboratories in 482 European hospitals were cultured for C. difficile, and isolates ribotyped; those from the 10 most prevalent ribotypes were Illumina whole-genome sequenced. Pairwise single nucleotide differences (SNPs) were obtained from recombination-corrected maximum-likelihood phylogenies. Within each ribotype, country-based sequence clustering was assessed using the ratio of the median SNPs between isolates within versus across different countries using permutation tests. Time-scaled Bayesian phylogenies where used to reconstruct the historic location of each lineage. Sequenced isolates (n=624) were from 19 countries. Five ribotypes had within-country clustering: ribotype-356, only in Italy; ribotype-018, predominantly in Italy; ribotype-176, with distinct Czech and German clades; ribotype-001/072, including distinct German, Slovakian, and Spanish clades; and ribotype-027, with multiple predominantly country-specific clades including in Hungary, Italy, Germany, Romania and Poland. By contrast, we found no within-country clustering for ribotypes 078, 015, 002, 014, and 020, consistent with a Europe-wide distribution. Fluoroquinolone-resistance was significantly more common in within-country clustered ribotypes (p=0.009). Fluoroquinolone-resistant isolates were also more tightly geographically clustered, median (IQR) 43 (0-213) miles between each isolate and the most closely genetically-related isolate vs. 421 (204-680) in non-resistant pairs (p<0.001). Two distinct patterns of C. difficile ribotype spread were observed, consistent with either predominantly healthcare-associated acquisition or Europe-wide dissemination via other routes/sources, e.g. the food chain.
Characterization of transcriptome in the Indian meal moth Plodia interpunctella (Lepidoptera: Pyralidae) and gene expression analysis during developmental stages.

PubMed

Tang, Pei-An; Wu, Hai-Jing; Xue, Hao; Ju, Xing-Rong; Song, Wei; Zhang, Qi-Lin; Yuan, Ming-Long

2017-07-30

The Indian meal moth Plodia interpunctella (Lepidoptera: Pyralidae) is a worldwide pest that causes serious damage to stored foods. Although many efforts have been conducted on this species due to its economic importance, the study of genetic basis of development, behavior and insecticide resistance has been greatly hampered due to lack of genomic information. In this study, we used high throughput sequencing platform to perform a de novo transcriptome assembly and tag-based digital gene expression profiling (DGE) analyses across four different developmental stages of P. interpunctella (egg, third-instar larvae, pupae and adult). We obtained approximate 9gigabyte (GB) of clean data and recovered 84,938 unigenes, including 37,602 clusters and 47,336 singletons. These unigenes were annotated using BLAST against the non-redundant protein databases and then functionally classified based on Gene Ontology (GO), Clusters of Orthologous Groups (COG), and Kyoto Encyclopedia of Genes and Genomes databases (KEGG). A large number of differentially expressed genes were identified by pairwise comparisons among different developmental stages. Gene expression profiles dramatically changed between developmental stage transitions. Some of these differentially expressed genes were related to digestion and cuticularization. Quantitative real-time PCR results of six randomly selected genes conformed the findings in the DGEs. Furthermore, we identified over 8000 microsatellite markers and 97,648 single nucleotide polymorphisms which will be useful for population genetics studies of P. interpunctella. This transcriptomic information provided insight into the developmental basis of P. interpunctella and will be helpful for establishing integrated management strategies and developing new targets of insecticides for this serious pest. Copyright © 2017 Elsevier B.V. All rights reserved.
Non-parallel divergence across freshwater and marine three-spined stickleback Gasterosteus aculeatus populations.

PubMed

Pujolar, J M; Ferchaud, A L; Bekkevold, D; Hansen, M M

2017-07-01

This work investigated whether multiple freshwater populations of three-spined stickleback Gasterosteus aculeatus in different freshwater catchments in the Jutland Peninsula, Denmark, derived from the same marine populations show repeated adaptive responses. A total of 327 G. aculeatus collected at 13 sampling locations were screened for genetic variation using a combination of 70 genes putatively under selection and 26 neutral genes along with a marker linked to the ectodysplasin gene (eda), which is strongly correlated with plate armour morphs in the species. A highly significant genetic differentiation was found that was higher among different freshwater samples than between marine-freshwater samples. Tests for selection between marine and freshwater populations showed a very low degree of parallelism and no single nucleotide polymorphism was detected as outlier in all freshwater-marine pairwise comparisons, including the eda. This suggests that G. aculeatus is not necessarily the prime example of parallel local adaptation suggested in much of the literature and that important exceptions exist (i.e. the Jutland Peninsula). While marine populations in the results described here showed a high phenotype-genotype correlation at eda, a low association was found for most of the freshwater populations. The most extreme case was found in the freshwater Lake Hald where all low-plated phenotypes were either homozygotes for the allele supposed to be associated with completely plated morphs or heterozygotes, but none were homozygotes for the putative low-plated allele. Re-examination of data from seven G. aculeatus studies agrees in showing a high but partial association between phenotype-genotype at eda in G. aculeatus freshwater populations and that mismatches occur everywhere in the European regions studied (higher in some areas, i.e. Denmark). This is independent of the eda marker used. © 2017 The Fisheries Society of the British Isles.
Partial molecular characterisation of New World non-human primate lymphocryptoviruses.

PubMed

Lavergne, Anne; de Thoisy, Benoît; Pouliquen, Jean-François; Ruiz-García, Manuel; Lacoste, Vincent

2011-10-01

The description of numerous viruses belonging to the Lymphocryptovirus genus from different Old and New World non-human primate species during the past 10 years has led to developing and supporting co-speciational evolution hypotheses for these viruses and their hosts. Among the different primate species tested, only a few were from the New World. This study attempted to achieve a better understanding of the evolutionary processes within the Platyrrhini branch. Molecular screening of 253 blood DNA samples from 20 New World non-human primate species from Central and South America was carried out using polymerase chain reaction amplification with degenerate consensus primers targeting highly conserved amino acid motifs of the herpesvirus DNA polymerase gene. In addition to the 33 samples from which we have already described three lymphocryptoviruses, amplification products were detected in 17 other samples originating from 11 species (13 sub-species). BLAST searches, pairwise nucleotide and amino acid sequence comparisons, and phylogenetic analyses confirm that they all belong to the Lymphocryptovirus genus. Fourteen distinct Lymphocryptovirus sequences were detected, of which nine have never been reported. Phylogenetic analyses showed that, as expected, the New World virus lineage formed a sister clade to that of the Old World viruses. The parallel determination of the host taxa has demonstrated a good correlation between the distinct monophyletic clades of viruses and the infected primates at the sub-family level. In addition, these results further suggest the existence of two distinct groups within the Cebidae for Saimirinae and Cebinae primates. Nevertheless, based on the current genetic data, this study fell short of achieving a tree that was completely resolved within the lineage of Platyrrhini viruses. Further studies will be needed to better assess the evolutionary relationships between these viruses. Copyright © 2011 Elsevier B.V. All rights reserved.
The alignment of enzymatic steps reveals similar metabolic pathways and probable recruitment events in Gammaproteobacteria.

PubMed

Poot-Hernandez, Augusto Cesar; Rodriguez-Vazquez, Katya; Perez-Rueda, Ernesto

2015-11-17

It is generally accepted that gene duplication followed by functional divergence is one of the main sources of metabolic diversity. In this regard, there is an increasing interest in the development of methods that allow the systematic identification of these evolutionary events in metabolism. Here, we used a method not based on biomolecular sequence analysis to compare and identify common and variable routes in the metabolism of 40 Gammaproteobacteria species. The metabolic maps deposited in the KEGG database were transformed into linear Enzymatic Step Sequences (ESS) by using the breadth-first search algorithm. These ESS represent subsequent enzymes linked to each other, where their catalytic activities are encoded in the Enzyme Commission numbers. The ESS were compared in an all-against-all (pairwise comparisons) approach by using a dynamic programming algorithm, leaving only a set of significant pairs. From these comparisons, we identified a set of functionally conserved enzymatic steps in different metabolic maps, in which cell wall components and fatty acid and lysine biosynthesis were included. In addition, we found that pathways associated with biosynthesis share a higher proportion of similar ESS than degradation pathways and secondary metabolism pathways. Also, maps associated with the metabolism of similar compounds contain a high proportion of similar ESS, such as those maps from nucleotide metabolism pathways, in particular the inosine monophosphate pathway. Furthermore, diverse ESS associated with the low part of the glycolysis pathway were identified as functionally similar to multiple metabolic pathways. In summary, our comparisons may help to identify similar reactions in different metabolic pathways and could reinforce the patchwork model in the evolution of metabolism in Gammaproteobacteria.
Defining a Contemporary Ischemic Heart Disease Genetic Risk Profile Using Historical Data.

PubMed

Mosley, Jonathan D; van Driest, Sara L; Wells, Quinn S; Shaffer, Christian M; Edwards, Todd L; Bastarache, Lisa; McCarty, Catherine A; Thompson, Will; Chute, Christopher G; Jarvik, Gail P; Crosslin, David R; Larson, Eric B; Kullo, Iftikhar J; Pacheco, Jennifer A; Peissig, Peggy L; Brilliant, Murray H; Linneman, James G; Denny, Josh C; Roden, Dan M

2016-12-01

Continued reductions in morbidity and mortality attributable to ischemic heart disease (IHD) require an understanding of the changing epidemiology of this disease. We hypothesized that we could use genetic correlations, which quantify the shared genetic architectures of phenotype pairs and extant risk factors from a historical prospective study to define the risk profile of a contemporary IHD phenotype. We used 37 phenotypes measured in the ARIC study (Atherosclerosis Risk in Communities; n=7716, European ancestry subjects) and clinical diagnoses from an electronic health record (EHR) data set (n=19 093). All subjects had genome-wide single-nucleotide polymorphism genotyping. We measured pairwise genetic correlations (rG) between the ARIC and EHR phenotypes using linear mixed models. The genetic correlation estimates between the ARIC risk factors and the EHR IHD were modestly linearly correlated with hazards ratio estimates for incident IHD in ARIC (Pearson correlation [r]=0.62), indicating that the 2 IHD phenotypes had differing risk profiles. For comparison, this correlation was 0.80 when comparing EHR and ARIC type 2 diabetes mellitus phenotypes. The EHR IHD phenotype was most strongly correlated with ARIC metabolic phenotypes, including total:high-density lipoprotein cholesterol ratio (rG=-0.44, P=0.005), high-density lipoprotein (rG=-0.48, P=0.005), systolic blood pressure (rG=0.44, P=0.02), and triglycerides (rG=0.38, P=0.02). EHR phenotypes related to type 2 diabetes mellitus, atherosclerotic, and hypertensive diseases were also genetically correlated with these ARIC risk factors. The EHR IHD risk profile differed from ARIC and indicates that treatment and prevention efforts in this population should target hypertensive and metabolic disease. © 2016 American Heart Association, Inc.
A Procedure for Testing the Difference between Effect Sizes.

ERIC Educational Resources Information Center

Lambert, Richard G.; Flowers, Claudia

A special case of the homogeneity of effect size test, as applied to pairwise comparisons of standardized mean differences, was evaluated. Procedures for comparing pairs of pretest to posttest effect sizes, as well as pairs of treatment versus control group effect sizes, were examined. Monte Carlo simulation was used to generate Type I error rates…
Surveying alignment-free features for Ortholog detection in related yeast proteomes by using supervised big data classifiers.

PubMed

Galpert, Deborah; Fernández, Alberto; Herrera, Francisco; Antunes, Agostinho; Molina-Ruiz, Reinaldo; Agüero-Chapin, Guillermin

2018-05-03

The development of new ortholog detection algorithms and the improvement of existing ones are of major importance in functional genomics. We have previously introduced a successful supervised pairwise ortholog classification approach implemented in a big data platform that considered several pairwise protein features and the low ortholog pair ratios found between two annotated proteomes (Galpert, D et al., BioMed Research International, 2015). The supervised models were built and tested using a Saccharomycete yeast benchmark dataset proposed by Salichos and Rokas (2011). Despite several pairwise protein features being combined in a supervised big data approach; they all, to some extent were alignment-based features and the proposed algorithms were evaluated on a unique test set. Here, we aim to evaluate the impact of alignment-free features on the performance of supervised models implemented in the Spark big data platform for pairwise ortholog detection in several related yeast proteomes. The Spark Random Forest and Decision Trees with oversampling and undersampling techniques, and built with only alignment-based similarity measures or combined with several alignment-free pairwise protein features showed the highest classification performance for ortholog detection in three yeast proteome pairs. Although such supervised approaches outperformed traditional methods, there were no significant differences between the exclusive use of alignment-based similarity measures and their combination with alignment-free features, even within the twilight zone of the studied proteomes. Just when alignment-based and alignment-free features were combined in Spark Decision Trees with imbalance management, a higher success rate (98.71%) within the twilight zone could be achieved for a yeast proteome pair that underwent a whole genome duplication. The feature selection study showed that alignment-based features were top-ranked for the best classifiers while the runners-up were alignment-free features related to amino acid composition. The incorporation of alignment-free features in supervised big data models did not significantly improve ortholog detection in yeast proteomes regarding the classification qualities achieved with just alignment-based similarity measures. However, the similarity of their classification performance to that of traditional ortholog detection methods encourages the evaluation of other alignment-free protein pair descriptors in future research.
Building dynamic population graph for accurate correspondence detection.

PubMed

Du, Shaoyi; Guo, Yanrong; Sanroma, Gerard; Ni, Dong; Wu, Guorong; Shen, Dinggang

2015-12-01

In medical imaging studies, there is an increasing trend for discovering the intrinsic anatomical difference across individual subjects in a dataset, such as hand images for skeletal bone age estimation. Pair-wise matching is often used to detect correspondences between each individual subject and a pre-selected model image with manually-placed landmarks. However, the large anatomical variability across individual subjects can easily compromise such pair-wise matching step. In this paper, we present a new framework to simultaneously detect correspondences among a population of individual subjects, by propagating all manually-placed landmarks from a small set of model images through a dynamically constructed image graph. Specifically, we first establish graph links between models and individual subjects according to pair-wise shape similarity (called as forward step). Next, we detect correspondences for the individual subjects with direct links to any of model images, which is achieved by a new multi-model correspondence detection approach based on our recently-published sparse point matching method. To correct those inaccurate correspondences, we further apply an error detection mechanism to automatically detect wrong correspondences and then update the image graph accordingly (called as backward step). After that, all subject images with detected correspondences are included into the set of model images, and the above two steps of graph expansion and error correction are repeated until accurate correspondences for all subject images are established. Evaluations on real hand X-ray images demonstrate that our proposed method using a dynamic graph construction approach can achieve much higher accuracy and robustness, when compared with the state-of-the-art pair-wise correspondence detection methods as well as a similar method but using static population graph. Copyright © 2015 Elsevier B.V. All rights reserved.

Breaking the computational barriers of pairwise genome comparison.

PubMed

Torreno, Oscar; Trelles, Oswaldo

2015-08-11

Conventional pairwise sequence comparison software algorithms are being used to process much larger datasets than they were originally designed for. This can result in processing bottlenecks that limit software capabilities or prevent full use of the available hardware resources. Overcoming the barriers that limit the efficient computational analysis of large biological sequence datasets by retrofitting existing algorithms or by creating new applications represents a major challenge for the bioinformatics community. We have developed C libraries for pairwise sequence comparison within diverse architectures, ranging from commodity systems to high performance and cloud computing environments. Exhaustive tests were performed using different datasets of closely- and distantly-related sequences that span from small viral genomes to large mammalian chromosomes. The tests demonstrated that our solution is capable of generating high quality results with a linear-time response and controlled memory consumption, being comparable or faster than the current state-of-the-art methods. We have addressed the problem of pairwise and all-versus-all comparison of large sequences in general, greatly increasing the limits on input data size. The approach described here is based on a modular out-of-core strategy that uses secondary storage to avoid reaching memory limits during the identification of High-scoring Segment Pairs (HSPs) between the sequences under comparison. Software engineering concepts were applied to avoid intermediate result re-calculation, to minimise the performance impact of input/output (I/O) operations and to modularise the process, thus enhancing application flexibility and extendibility. Our computationally-efficient approach allows tasks such as the massive comparison of complete genomes, evolutionary event detection, the identification of conserved synteny blocks and inter-genome distance calculations to be performed more effectively.
Bispectral pairwise interacting source analysis for identifying systems of cross-frequency interacting brain sources from electroencephalographic or magnetoencephalographic signals

NASA Astrophysics Data System (ADS)

Chella, Federico; Pizzella, Vittorio; Zappasodi, Filippo; Nolte, Guido; Marzetti, Laura

2016-05-01

Brain cognitive functions arise through the coordinated activity of several brain regions, which actually form complex dynamical systems operating at multiple frequencies. These systems often consist of interacting subsystems, whose characterization is of importance for a complete understanding of the brain interaction processes. To address this issue, we present a technique, namely the bispectral pairwise interacting source analysis (biPISA), for analyzing systems of cross-frequency interacting brain sources when multichannel electroencephalographic (EEG) or magnetoencephalographic (MEG) data are available. Specifically, the biPISA makes it possible to identify one or many subsystems of cross-frequency interacting sources by decomposing the antisymmetric components of the cross-bispectra between EEG or MEG signals, based on the assumption that interactions are pairwise. Thanks to the properties of the antisymmetric components of the cross-bispectra, biPISA is also robust to spurious interactions arising from mixing artifacts, i.e., volume conduction or field spread, which always affect EEG or MEG functional connectivity estimates. This method is an extension of the pairwise interacting source analysis (PISA), which was originally introduced for investigating interactions at the same frequency, to the study of cross-frequency interactions. The effectiveness of this approach is demonstrated in simulations for up to three interacting source pairs and for real MEG recordings of spontaneous brain activity. Simulations show that the performances of biPISA in estimating the phase difference between the interacting sources are affected by the increasing level of noise rather than by the number of the interacting subsystems. The analysis of real MEG data reveals an interaction between two pairs of sources of central mu and beta rhythms, localizing in the proximity of the left and right central sulci.
Population Expansion and Genetic Structure in Carcharhinus brevipinna in the Southern Indo-Pacific

PubMed Central

Geraghty, Pascal T.; Williamson, Jane E.; Macbeth, William G.; Wintner, Sabine P.; Harry, Alastair V.; Ovenden, Jennifer R.; Gillings, Michael R.

2013-01-01

Background Quantifying genetic diversity and metapopulation structure provides insights into the evolutionary history of a species and helps develop appropriate management strategies. We provide the first assessment of genetic structure in spinner sharks (Carcharhinus brevipinna), a large cosmopolitan carcharhinid, sampled from eastern and northern Australia and South Africa. Methods and Findings Sequencing of the mitochondrial DNA NADH dehydrogenase subunit 4 gene for 430 individuals revealed 37 haplotypes and moderately high haplotype diversity (h = 0.6770 ±0.025). While two metrics of genetic divergence (ΦST and F ST) revealed somewhat different results, subdivision was detected between South Africa and all Australian locations (pairwise ΦST, range 0.02717–0.03508, p values ≤ 0.0013; pairwise F ST South Africa vs New South Wales = 0.04056, p = 0.0008). Evidence for fine-scale genetic structuring was also detected along Australia’s east coast (pairwise ΦST = 0.01328, p < 0.015), and between south-eastern and northern locations (pairwise ΦST = 0.00669, p < 0.04). Conclusions The Indian Ocean represents a robust barrier to contemporary gene flow in C. brevipinna between Australia and South Africa. Gene flow also appears restricted along a continuous continental margin in this species, with data tentatively suggesting the delineation of two management units within Australian waters. Further sampling, however, is required for a more robust evaluation of the latter finding. Evidence indicates that all sampled populations were shaped by a substantial demographic expansion event, with the resultant high genetic diversity being cause for optimism when considering conservation of this commercially-targeted species in the southern Indo-Pacific. PMID:24086462
Generalized priority-queue network dynamics: Impact of team and hierarchy

NASA Astrophysics Data System (ADS)

Cho, Won-Kuk; Min, Byungjoon; Goh, K.-I.; Kim, I.-M.

2010-06-01

We study the effect of team and hierarchy on the waiting-time dynamics of priority-queue networks. To this end, we introduce generalized priority-queue network models incorporating interaction rules based on team-execution and hierarchy in decision making, respectively. It is numerically found that the waiting-time distribution exhibits a power law for long waiting times in both cases, yet with different exponents depending on the team size and the position of queue nodes in the hierarchy, respectively. The observed power-law behaviors have in many cases a corresponding single or pairwise-interacting queue dynamics, suggesting that the pairwise interaction may constitute a major dynamic consequence in the priority-queue networks. It is also found that the reciprocity of influence is a relevant factor for the priority-queue network dynamics.
Effect of interacting second- and third-order stimulus-dependent correlations on population-coding asymmetries.

PubMed

Montangie, Lisandro; Montani, Fernando

2016-10-01

Spike correlations among neurons are widely encountered in the brain. Although models accounting for pairwise interactions have proved able to capture some of the most important features of population activity at the level of the retina, the evidence shows that pairwise neuronal correlation analysis does not resolve cooperative population dynamics by itself. By means of a series expansion for short time scales of the mutual information conveyed by a population of neurons, the information transmission can be broken down into firing rate and correlational components. In a proposed extension of this framework, we investigate the information components considering both second- and higher-order correlations. We show that the existence of a mixed stimulus-dependent correlation term defines a new scenario for the interplay between pairwise and higher-than-pairwise interactions in noise and signal correlations that would lead either to redundancy or synergy in the information-theoretic sense.
Pairwise Force Smoothed Particle Hydrodynamics model for multiphase flow: Surface tension and contact line dynamics

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tartakovsky, Alexandre M.; Panchenko, Alexander

2016-01-01

We present a novel formulation of the Pairwise Force Smoothed Particle Hydrodynamics Model (PF-SPH) and use it to simulate two- and three-phase flows in bounded domains. In the PF-SPH model, the Navier-Stokes equations are discretized with the Smoothed Particle Hydrodynamics (SPH) method and the Young-Laplace boundary condition at the fluid-fluid interface and the Young boundary condition at the fluid-fluid-solid interface are replaced with pairwise forces added into the Navier-Stokes equations. We derive a relationship between the parameters in the pairwise forces and the surface tension and static contact angle. Next, we demonstrate the accuracy of the model under static andmore » dynamic conditions. Finally, to demonstrate the capabilities and robustness of the model we use it to simulate flow of three fluids in a porous material.« less
Cortical Dynamics in Presence of Assemblies of Densely Connected Weight-Hub Neurons

PubMed Central

Setareh, Hesam; Deger, Moritz; Petersen, Carl C. H.; Gerstner, Wulfram

2017-01-01

Experimental measurements of pairwise connection probability of pyramidal neurons together with the distribution of synaptic weights have been used to construct randomly connected model networks. However, several experimental studies suggest that both wiring and synaptic weight structure between neurons show statistics that differ from random networks. Here we study a network containing a subset of neurons which we call weight-hub neurons, that are characterized by strong inward synapses. We propose a connectivity structure for excitatory neurons that contain assemblies of densely connected weight-hub neurons, while the pairwise connection probability and synaptic weight distribution remain consistent with experimental data. Simulations of such a network with generalized integrate-and-fire neurons display regular and irregular slow oscillations akin to experimentally observed up/down state transitions in the activity of cortical neurons with a broad distribution of pairwise spike correlations. Moreover, stimulation of a model network in the presence or absence of assembly structure exhibits responses similar to light-evoked responses of cortical layers in optogenetically modified animals. We conclude that a high connection probability into and within assemblies of excitatory weight-hub neurons, as it likely is present in some but not all cortical layers, changes the dynamics of a layer of cortical microcircuitry significantly. PMID:28690508
Enzymatic Incorporation of Modified Purine Nucleotides in DNA.

PubMed

Abu El Asrar, Rania; Margamuljana, Lia; Abramov, Mikhail; Bande, Omprakash; Agnello, Stefano; Jang, Miyeon; Herdewijn, Piet

2017-12-14

A series of nucleotide analogues, with a hypoxanthine base moiety (8-aminohypoxanthine, 1-methyl-8-aminohypoxanthine, and 8-oxohypoxanthine), together with 5-methylisocytosine were tested as potential pairing partners of N 8 -glycosylated nucleotides with an 8-azaguanine or 8-aza-9-deazaguanine base moiety by using DNA polymerases (incorporation studies). The best results were obtained with the 5-methylisocytosine nucleotide followed by the 1-methyl-8-aminohypoxanthine nucleotide. The experiments demonstrated that small differences in the structure (8-azaguanine versus 8-aza-9-deazaguanine) might lead to significant differences in recognition efficiency and selectivity, base pairing by Hoogsteen recognition at the polymerase level is possible, 8-aza-9-deazaguanine represents a self-complementary base pair, and a correlation exists between in vitro incorporation studies and in vivo recognition by natural bases in Escherichia coli, but this recognition is not absolute (exceptions were observed). © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Using Analytic Hierarchy Process in Textbook Evaluation

ERIC Educational Resources Information Center

Kato, Shigeo

2014-01-01

This study demonstrates the application of the analytic hierarchy process (AHP) in English language teaching materials evaluation, focusing in particular on its potential for systematically integrating different components of evaluation criteria in a variety of teaching contexts. AHP is a measurement procedure wherein pairwise comparisons are made…
Application of whole genome sequence data in analyzing the molecular epidemiology of Shiga toxin-producing Escherichia coli O157:H7/H.

PubMed

Yokoyama, Eiji; Hirai, Shinichiro; Ishige, Taichiro; Murakami, Satoshi

2018-01-02

Seventeen clusters of Shiga toxin-producing Escherichia coli O157:H7/- (O157) strains, determined by cluster analysis of pulsed-field gel electrophoresis patterns, were analyzed using whole genome sequence (WGS) data to investigate this pathogen's molecular epidemiology. The 17 clusters included 136 strains containing strains from nine outbreaks, with each outbreak caused by a single source contaminated with the organism, as shown by epidemiological contact surveys. WGS data of these strains were used to identify single nucleotide polymorphisms (SNPs) by two methods: short read data were directly mapped to a reference genome (mapping derived SNPs) and common SNPs between the mapping derived SNPs and SNPs in assembled data of short read data (common SNPs). Among both SNPs, those that were detected in genes with a gap were excluded to remove ambiguous SNPs from further analysis. The effectiveness of both SNPs was investigated among all the concatenated SNPs that were detected (whole SNP set); SNPs were divided into three categories based on the genes in which they were located (i.e., backbone SNP set, O-island SNP set, and mobile element SNP set); and SNPs in non-coding regions (intergenic region SNP set). When SNPs from strains isolated from the nine single source derived outbreaks were analyzed using an unweighted pair group method with arithmetic mean tree (UPGMA) and a minimum spanning tree (MST), the maximum pair-wise distances of the backbone SNP set of the mapping derived SNPs were significantly smaller than those of the whole and intergenic region SNP set on both UPGMAs and MSTs. This significant difference was also observed when the backbone SNP set of the common SNPs were examined (Steel-Dwass test, P≤0.01). When the maximum pair-wise distances were compared between the mapping derived and common SNPs, significant differences were observed in those of the whole, mobile element, and intergenic region SNP set (Wilcoxon signed rank test, P≤0.01). When all the strains included in one complex on an MST or one cluster on a UPGMA were designated as the same genotype, the values of the Hunter-Gaston Discriminatory Power Index for the backbone SNP set of the mapping derived and common SNPs were higher than those of other SNP sets. In contrast, the mobile element SNP set could not robustly subdivide lineage I strains of tested O157 strains using both the mapping derived and common SNPs. These results suggested that the backbone SNP set were the most effective for analysis of WGS data for O157 in enabling an appropriation of its molecular epidemiology. Copyright © 2017 Elsevier B.V. All rights reserved.
Differences between high-affinity forskolin binding sites in dopamine-riche and other regions of rat brain

DOE Office of Scientific and Technical Information (OSTI.GOV)

Poat, J.A.; Cripps, H.E.; Iversen, L.L.

1988-05-01

Forskolin labelled with (/sup 3/H) bound to high- and low-affinity sites in the rat brain. The high-affinity site was discretely located, with highest densities in the striatum, nucleus accumbens, olfactory tubercule, substantia nigra, hippocampus, and the molecular layers of the cerebellum. This site did not correlate well with the distribution of adenylate cyclase. The high-affinity striatal binding site may be associated with a stimulatory guanine nucleotide-binding protein. Thus, the number of sites was increased by the addition of Mg/sup 2 +/ and guanylyl imidodiphosphate. Cholera toxin stereotaxically injected into rat striatum increased the number of binding sites, and no furthermore » increase was noted following the subsequent addition of guanyl nucleotide. High-affinity forskolin binding sites in non-dopamine-rich brain areas (hippocampus and cerebullum) were modulated in a qualitatively different manner by guanyl nucleotides. In these areas the number of binding sites was significantly reduced by the addition of guanyl nucleotide. These results suggest that forskolin may have a potential role in identifying different functional/structural guanine nucleotide-binding proteins.« less
DNA Nucleotides Detection via capacitance properties of Graphene

NASA Astrophysics Data System (ADS)

Khadempar, Nahid; Berahman, Masoud; Yazdanpanah, Arash

2016-05-01

In the present paper a new method is suggested to detect the DNA nucleotides on a first-principles calculation of the electronic features of DNA bases which chemisorbed to a graphene sheet placed between two gold electrodes in a contact-channel-contact system. The capacitance properties of graphene in the channel are surveyed using non-equilibrium Green's function coupled with the Density Functional Theory. Thus, the capacitance properties of graphene are theoretically investigated in a biological environment, and, using a novel method, the effect of the chemisorbed DNA nucleotides on electrical charges on the surface of graphene is deciphered. Several parameters in this method are also extracted including Electrostatic energy, Induced density, induced electrostatic potential, Electron difference potential and Electron difference density. The qualitative and quantitative differences among these parameters can be used to identify DNA nucleotides. Some of the advantages of this approach include its ease and high accuracy. What distinguishes the current research is that it is the first experiment to investigate the capacitance properties of gaphene changes in the biological environment and the effect of chemisorbed DNA nucleotides on the surface of graphene on the charge.
OmpF, a nucleotide-sensing nanoprobe, computational evaluation of single channel activities

NASA Astrophysics Data System (ADS)

Abdolvahab, R. H.; Mobasheri, H.; Nikouee, A.; Ejtehadi, M. R.

2016-09-01

The results of highthroughput practical single channel experiments should be formulated and validated by signal analysis approaches to increase the recognition precision of translocating molecules. For this purpose, the activities of the single nano-pore forming protein, OmpF, in the presence of nucleotides were recorded in real time by the voltage clamp technique and used as a means for nucleotide recognition. The results were analyzed based on the permutation entropy of current Time Series (TS), fractality, autocorrelation, structure function, spectral density, and peak fraction to recognize each nucleotide, based on its signature effect on the conductance, gating frequency and voltage sensitivity of channel at different concentrations and membrane potentials. The amplitude and frequency of ion current fluctuation increased in the presence of Adenine more than Cytosine and Thymine in milli-molar (0.5 mM) concentrations. The variance of the current TS at various applied voltages showed a non-monotonic trend whose initial increasing slope in the presence of Thymine changed to a decreasing one in the second phase and was different from that of Adenine and Cytosine; e.g., by increasing the voltage from 40 to 140 mV in the 0.5 mM concentration of Adenine or Cytosine, the variance decreased by one third while for the case of Thymine it was doubled. Moreover, according to the structure function of TS, the fractality of current TS differed as a function of varying membrane potentials (pd) and nucleotide concentrations. Accordingly, the calculated permutation entropy of the TS, validated the biophysical approach defined for the recognition of different nucleotides at various concentrations, pd's and polarities. Thus, the promising outcomes of the combined experimental and theoretical methodologies presented here can be implemented as a complementary means in pore-based nucleotide recognition approaches.
Adverse events and treatment failure leading to discontinuation of recently approved antipsychotic drugs in schizophrenia: A network meta-analysis.

PubMed

Tonin, Fernanda S; Piazza, Thais; Wiens, Astrid; Fernandez-Llimos, Fernando; Pontarolo, Roberto

2015-12-01

Objective:We aimed to gather evidence of the discontinuation rates owing to adverse events or treatment failure for four recently approved antipsychotics (asenapine, blonanserin, iloperidone, and lurasidone).Methods: A systematic review followed by pairwise meta-analysis and mixed treatment comparison meta analysis(MTC) was performed, including randomized controlled trials (RCTs) that compared the use of the above-mentioned drugs versus placebo in patients with schizophrenia. An electronic search was conducted in PubMed, Scopus, Science Direct, Scielo, the Cochrane Library, and International Pharmaceutical Abstracts(January 2015). The included trials were at least single blinded. The main outcome measures extracted were discontinuation owing to adverse events and discontinuation owing to treatment failure.Results: Fifteen RCTs were identified (n = 5400 participants) and 13 of them were amenable for use in our meta-analyses. No significant differences were observed between any of the four drugs and placebo as regards discontinuation owing to adverse events, whether in pairwise meta-analysis or in MTC. All drugs presented a better profile than placebo on discontinuation owing to treatment failure, both in pairwise meta-analysis and MTC. Asenapine was found to be the best therapy in terms of tolerability owing to failure,while lurasidone was the worst treatment in terms of adverse events. The evidence around blonanserin is weak.Conclusion: MTCs allowed the creation of two different rank orders of these four antipsychotic drugs in two outcome measures. This evidence-generating method allows direct and indirect comparisons, supporting approval and pricing decisions when lacking sufficient, direct, head-to-head trials.
Reporting of analyses from randomized controlled trials with multiple arms: a systematic review.

PubMed

Baron, Gabriel; Perrodeau, Elodie; Boutron, Isabelle; Ravaud, Philippe

2013-03-27

Multiple-arm randomized trials can be more complex in their design, data analysis, and result reporting than two-arm trials. We conducted a systematic review to assess the reporting of analyses in reports of randomized controlled trials (RCTs) with multiple arms. The literature in the MEDLINE database was searched for reports of RCTs with multiple arms published in 2009 in the core clinical journals. Two reviewers extracted data using a standardized extraction form. In total, 298 reports were identified. Descriptions of the baseline characteristics and outcomes per group were missing in 45 reports (15.1%) and 48 reports (16.1%), respectively. More than half of the articles (n = 171, 57.4%) reported that a planned global test comparison was used (that is, assessment of the global differences between all groups), but 67 (39.2%) of these 171 articles did not report details of the planned analysis. Of the 116 articles reporting a global comparison test, 12 (10.3%) did not report the analysis as planned. In all, 60% of publications (n = 180) described planned pairwise test comparisons (that is, assessment of the difference between two groups), but 20 of these 180 articles (11.1%) did not report the pairwise test comparisons. Of the 204 articles reporting pairwise test comparisons, the comparisons were not planned for 44 (21.6%) of them. Less than half the reports (n = 137; 46%) provided baseline and outcome data per arm and reported the analysis as planned. Our findings highlight discrepancies between the planning and reporting of analyses in reports of multiple-arm trials.
Online Pairwise Learning Algorithms.

PubMed

Ying, Yiming; Zhou, Ding-Xuan

2016-04-01

Pairwise learning usually refers to a learning task that involves a loss function depending on pairs of examples, among which the most notable ones are bipartite ranking, metric learning, and AUC maximization. In this letter we study an online algorithm for pairwise learning with a least-square loss function in an unconstrained setting of a reproducing kernel Hilbert space (RKHS) that we refer to as the Online Pairwise lEaRning Algorithm (OPERA). In contrast to existing works (Kar, Sriperumbudur, Jain, & Karnick, 2013 ; Wang, Khardon, Pechyony, & Jones, 2012 ), which require that the iterates are restricted to a bounded domain or the loss function is strongly convex, OPERA is associated with a non-strongly convex objective function and learns the target function in an unconstrained RKHS. Specifically, we establish a general theorem that guarantees the almost sure convergence for the last iterate of OPERA without any assumptions on the underlying distribution. Explicit convergence rates are derived under the condition of polynomially decaying step sizes. We also establish an interesting property for a family of widely used kernels in the setting of pairwise learning and illustrate the convergence results using such kernels. Our methodology mainly depends on the characterization of RKHSs using its associated integral operators and probability inequalities for random variables with values in a Hilbert space.
Cosmology with the pairwise kinematic SZ effect: Calibration and validation using hydrodynamical simulations

NASA Astrophysics Data System (ADS)

Soergel, Bjoern; Saro, Alexandro; Giannantonio, Tommaso; Efstathiou, George; Dolag, Klaus

2018-05-01

We study the potential of the kinematic SZ effect as a probe for cosmology, focusing on the pairwise method. The main challenge is disentangling the cosmologically interesting mean pairwise velocity from the cluster optical depth and the associated uncertainties on the baryonic physics in clusters. Furthermore, the pairwise kSZ signal might be affected by internal cluster motions or correlations between velocity and optical depth. We investigate these effects using the Magneticum cosmological hydrodynamical simulations, one of the largest simulations of this kind performed to date. We produce tSZ and kSZ maps with an area of ≃ 1600 deg2, and the corresponding cluster catalogues with M500c ≳ 3 × 1013 h-1M⊙ and z ≲ 2. From these data sets we calibrate a scaling relation between the average Compton-y parameter and optical depth. We show that this relation can be used to recover an accurate estimate of the mean pairwise velocity from the kSZ effect, and that this effect can be used as an important probe of cosmology. We discuss the impact of theoretical and observational systematic effects, and find that further work on feedback models is required to interpret future high-precision measurements of the kSZ effect.
"New turns from old STaRs": enhancing the capabilities of forensic short tandem repeat analysis.

PubMed

Phillips, Christopher; Gelabert-Besada, Miguel; Fernandez-Formoso, Luis; García-Magariños, Manuel; Santos, Carla; Fondevila, Manuel; Ballard, David; Syndercombe Court, Denise; Carracedo, Angel; Lareu, Maria Victoria

2014-11-01

The field of research and development of forensic STR genotyping remains active, innovative, and focused on continuous improvements. A series of recent developments including the introduction of a sixth dye have brought expanded STR multiplex sizes while maintaining sensitivity to typical forensic DNA. New supplementary kits complimenting the core STRs have also helped improve analysis of challenging identification cases such as distant pairwise relationships in deficient pedigrees. This article gives an overview of several recent key developments in forensic STR analysis: availability of expanded core STR kits and supplementary STRs, short-amplicon mini-STRs offering practical options for highly degraded DNA, Y-STR enhancements made from the identification of rapidly mutating loci, and enhanced analysis of genetic ancestry by analyzing 32-STR profiles with a Bayesian forensic classifier originally developed for SNP population data. As well as providing scope for genotyping larger numbers of STRs optimized for forensic applications, the launch of compact next-generation sequencing systems provides considerable potential for genotyping the sizeable proportion of nucleotide variation existing in forensic STRs, which currently escapes detection with CE. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Minimap2: pairwise alignment for nucleotide sequences.

PubMed

Li, Heng

2018-05-10

Recent advances in sequencing technologies promise ultra-long reads of ∼100 kilo bases (kb) in average, full-length mRNA or cDNA reads in high throughput and genomic contigs over 100 mega bases (Mb) in length. Existing alignment programs are unable or inefficient to process such data at scale, which presses for the development of new alignment algorithms. Minimap2 is a general-purpose alignment program to map DNA or long mRNA sequences against a large reference database. It works with accurate short reads of ≥ 100bp in length, ≥1kb genomic reads at error rate ∼15%, full-length noisy Direct RNA or cDNA reads, and assembly contigs or closely related full chromosomes of hundreds of megabases in length. Minimap2 does split-read alignment, employs concave gap cost for long insertions and deletions (INDELs) and introduces new heuristics to reduce spurious alignments. It is 3-4 times as fast as mainstream short-read mappers at comparable accuracy, and is ≥30 times faster than long-read genomic or cDNA mappers at higher accuracy, surpassing most aligners specialized in one type of alignment. https://github.com/lh3/minimap2. hengli@broadinstitute.org.
Genetic diversity and population genetic analysis of Donax vittatus (Mollusca: Bivalvia) and phylogeny of the genus with mitochondrial and nuclear markers

NASA Astrophysics Data System (ADS)

Fernández-Pérez, Jenyfer; Froufe, Elsa; Nantón, Ana; Gaspar, Miguel B.; Méndez, Josefina

2017-10-01

In this study, the genetic diversity of Donax vittatus across the Iberian Peninsula was investigated using four mitochondrial (COI, Cytb, 16S F and M types) and three nuclear (H3, 18S and 28S) genes. These same molecular markers were also sequenced in D. semistriatus and D variegatus to address the phylogenetic relationships of the species of the genus Donax common along the European coasts. Our results showed high haplotype diversity in combination with a low nucleotide diversity and a star-shaped network with a predominant haplotype, indicating a recent population expansion for the examined sampling sites of D. vittatus. Furthermore, analyses of population differentiation performed with COI mitochondrial marker, including global FST estimation and pairwise FST values, indicated the non-existence of significant genetic structure in D. vittatus of Northwest Iberian populations. Because these localities show a high genetic similarity, we suggest that D. vittatus could be a potentially alternative exploitable resource, as complement to the D. trunculus fisheries, whose natural stocks have decreased dramatically in some areas. Furthermore, we present for the first time, evidence of DUI in the clams D. vittatus and D. semistriatus.

Further Improvements to Linear Mixed Models for Genome-Wide Association Studies

PubMed Central

Widmer, Christian; Lippert, Christoph; Weissbrod, Omer; Fusi, Nicolo; Kadie, Carl; Davidson, Robert; Listgarten, Jennifer; Heckerman, David

2014-01-01

We examine improvements to the linear mixed model (LMM) that better correct for population structure and family relatedness in genome-wide association studies (GWAS). LMMs rely on the estimation of a genetic similarity matrix (GSM), which encodes the pairwise similarity between every two individuals in a cohort. These similarities are estimated from single nucleotide polymorphisms (SNPs) or other genetic variants. Traditionally, all available SNPs are used to estimate the GSM. In empirical studies across a wide range of synthetic and real data, we find that modifications to this approach improve GWAS performance as measured by type I error control and power. Specifically, when only population structure is present, a GSM constructed from SNPs that well predict the phenotype in combination with principal components as covariates controls type I error and yields more power than the traditional LMM. In any setting, with or without population structure or family relatedness, a GSM consisting of a mixture of two component GSMs, one constructed from all SNPs and another constructed from SNPs that well predict the phenotype again controls type I error and yields more power than the traditional LMM. Software implementing these improvements and the experimental comparisons are available at http://microsoft.com/science. PMID:25387525
Population-genetic properties of differentiated copy number variations in cattle.

PubMed

Xu, Lingyang; Hou, Yali; Bickhart, Derek M; Zhou, Yang; Hay, El Hamidi Abdel; Song, Jiuzhou; Sonstegard, Tad S; Van Tassell, Curtis P; Liu, George E

2016-03-23

While single nucleotide polymorphism (SNP) is typically the variant of choice for population genetics, copy number variation (CNV) which comprises insertion, deletion and duplication of genomic sequence, is an informative type of genetic variation. CNVs have been shown to be both common in mammals and important for understanding the relationship between genotype and phenotype. However, CNV differentiation, selection and its population genetic properties are not well understood across diverse populations. We performed a population genetics survey based on CNVs derived from the BovineHD SNP array data of eight distinct cattle breeds. We generated high resolution results that show geographical patterns of variations and genome-wide admixture proportions within and among breeds. Similar to the previous SNP-based studies, our CNV-based results displayed a strong correlation of population structure and geographical location. By conducting three pairwise comparisons among European taurine, African taurine, and indicine groups, we further identified 78 unique CNV regions that were highly differentiated, some of which might be due to selection. These CNV regions overlapped with genes involved in traits related to parasite resistance, immunity response, body size, fertility, and milk production. Our results characterize CNV diversity among cattle populations and provide a list of lineage-differentiated CNVs.
Molecular characterization of novel mucosotropic papillomaviruses from a Florida manatee (Trichechus manatus latirostris).

PubMed

2015-12-01

We isolated two new manatee papillomavirus (PV) types, TmPV3 and TmPV4, from a Florida manatee (Trichechus manatus latirostris). Two PV types were previously isolated from this species. TmPV1 is widely dispersed amongst manatees and a close-to-root PV; not much is known about TmPV2. The genomes of TmPV3 and TmPV4 were 7622 and 7771 bp in size, respectively. Both PVs had a genomic organization characteristic of all PVs, with one non-coding region and seven ORFs, including the E7 ORF that is absent in other cetacean PVs. Although these PVs were isolated from separate genital lesions of the same manatee, an enlarged E2/E4 ORF was found only in the TmPV4 genome. The full genome and L1 sequence similarities between TmPV3 and TmPV4 were 63.2 and 70.3 %, respectively. These genomes shared only 49.1 and 50.2 % similarity with TmPV1. The pairwise alignment of L1 nucleotide sequences indicated that the two new PVs nested in a monophyletic group of the genus Rhopapillomavirus, together with the cutaneotropic TmPV1 and TmPV2.
Further Improvements to Linear Mixed Models for Genome-Wide Association Studies

NASA Astrophysics Data System (ADS)

Widmer, Christian; Lippert, Christoph; Weissbrod, Omer; Fusi, Nicolo; Kadie, Carl; Davidson, Robert; Listgarten, Jennifer; Heckerman, David

2014-11-01

We examine improvements to the linear mixed model (LMM) that better correct for population structure and family relatedness in genome-wide association studies (GWAS). LMMs rely on the estimation of a genetic similarity matrix (GSM), which encodes the pairwise similarity between every two individuals in a cohort. These similarities are estimated from single nucleotide polymorphisms (SNPs) or other genetic variants. Traditionally, all available SNPs are used to estimate the GSM. In empirical studies across a wide range of synthetic and real data, we find that modifications to this approach improve GWAS performance as measured by type I error control and power. Specifically, when only population structure is present, a GSM constructed from SNPs that well predict the phenotype in combination with principal components as covariates controls type I error and yields more power than the traditional LMM. In any setting, with or without population structure or family relatedness, a GSM consisting of a mixture of two component GSMs, one constructed from all SNPs and another constructed from SNPs that well predict the phenotype again controls type I error and yields more power than the traditional LMM. Software implementing these improvements and the experimental comparisons are available at http://microsoft.com/science.
Further improvements to linear mixed models for genome-wide association studies.

PubMed

Widmer, Christian; Lippert, Christoph; Weissbrod, Omer; Fusi, Nicolo; Kadie, Carl; Davidson, Robert; Listgarten, Jennifer; Heckerman, David

2014-11-12

We examine improvements to the linear mixed model (LMM) that better correct for population structure and family relatedness in genome-wide association studies (GWAS). LMMs rely on the estimation of a genetic similarity matrix (GSM), which encodes the pairwise similarity between every two individuals in a cohort. These similarities are estimated from single nucleotide polymorphisms (SNPs) or other genetic variants. Traditionally, all available SNPs are used to estimate the GSM. In empirical studies across a wide range of synthetic and real data, we find that modifications to this approach improve GWAS performance as measured by type I error control and power. Specifically, when only population structure is present, a GSM constructed from SNPs that well predict the phenotype in combination with principal components as covariates controls type I error and yields more power than the traditional LMM. In any setting, with or without population structure or family relatedness, a GSM consisting of a mixture of two component GSMs, one constructed from all SNPs and another constructed from SNPs that well predict the phenotype again controls type I error and yields more power than the traditional LMM. Software implementing these improvements and the experimental comparisons are available at http://microsoft.com/science.
Capturing pair-wise epistatic effects associated with three agronomic traits in barley.

PubMed

Xu, Yi; Wu, Yajun; Wu, Jixiang

2018-04-01

Genetic association mapping has been widely applied to determine genetic markers favorably associated with a trait of interest and provide information for marker-assisted selection. Many association mapping studies commonly focus on main effects due to intolerable computing intensity. This study aims to select several sets of DNA markers with potential epistasis to maximize genetic variations of some key agronomic traits in barley. By doing so, we integrated a MDR (multifactor dimensionality reduction) method with a forward variable selection approach. This integrated approach was used to determine single nucleotide polymorphism pairs with epistasis effects associated with three agronomic traits: heading date, plant height, and grain yield in barley from the barley Coordinated Agricultural Project. Our results showed that four, seven, and five SNP pairs accounted for 51.06, 45.66 and 40.42% for heading date, plant height, and grain yield, respectively with epistasis being considered, while corresponding contributions to these three traits were 45.32, 31.39, 31.31%, respectively without epistasis being included. The results suggested that epistasis model was more effective than non-epistasis model in this study and can be more preferred for other applications.
Discovery and identification of a series of alkyl decalin isomers in petroleum geological samples.

PubMed

Wang, Huitong; Zhang, Shuichang; Weng, Na; Zhang, Bin; Zhu, Guangyou; Liu, Lingyan

2015-07-07

The comprehensive two-dimensional gas chromatography/time-of-flight mass spectrometry (GC × GC/TOFMS) has been used to characterize a crude oil and a source rock extract sample. During the process, a series of pairwise components between monocyclic alkanes and mono-aromatics have been discovered. After tentative assignments of decahydronaphthalene isomers, a series of alkyl decalin isomers have been synthesized and used for identification and validation of these petroleum compounds. From both the MS and chromatography information, these pairwise compounds were identified as 2-alkyl-decahydronaphthalenes and 1-alkyl-decahydronaphthalenes. The polarity of 1-alkyl-decahydronaphthalenes was stronger. Their long chain alkyl substituent groups may be due to bacterial transformation or different oil cracking events. This systematic profiling of alkyl-decahydronaphthalene isomers provides further understanding and recognition of these potential petroleum biomarkers.
Non-rigid multi-frame registration of cell nuclei in live cell fluorescence microscopy image data.

PubMed

Tektonidis, Marco; Kim, Il-Han; Chen, Yi-Chun M; Eils, Roland; Spector, David L; Rohr, Karl

2015-01-01

The analysis of the motion of subcellular particles in live cell microscopy images is essential for understanding biological processes within cells. For accurate quantification of the particle motion, compensation of the motion and deformation of the cell nucleus is required. We introduce a non-rigid multi-frame registration approach for live cell fluorescence microscopy image data. Compared to existing approaches using pairwise registration, our approach exploits information from multiple consecutive images simultaneously to improve the registration accuracy. We present three intensity-based variants of the multi-frame registration approach and we investigate two different temporal weighting schemes. The approach has been successfully applied to synthetic and live cell microscopy image sequences, and an experimental comparison with non-rigid pairwise registration has been carried out. Copyright © 2014 Elsevier B.V. All rights reserved.
Efficacy of Proton Pump Inhibitors for Patients with Duodenal Ulcers: A Pairwise and Network Meta-Analysis of Randomized Controlled Trials

PubMed Central

Hu, Zhan-Hong; Shi, Ai-Ming; Hu, Duan-Min; Bao, Jun-Jie

2017-01-01

Background/Aim: To compare the efficacy and tolerance of different proton pump inhibitors (PPIs) in different doses for patients with duodenal ulcers. Materials and Methods: An electronic database was searched to collect all randomized clinical trials (RCTs), and a pairwise and network meta-analysis were performed. Results: A total of 24 RCTs involving 6188 patients were included. The network meta-analysis showed that there were no significant differences for the 4-week healing rate of duodenal ulcer treated with different PPI regimens except pantoprazle 40 mg/d versus lansoprazole 15 mg/d [Relative risk (RR) = 3.57; 95% confidence interval (CI) = 1.36–10.31)] and lansoprazole 30 mg/d versus lansoprazole 15 mg/d (RR = 2.45; 95% CI = 1.01–6.14). In comparison with H2 receptor antagonists (H2 RA), pantoprazole 40 mg/d and lansoprazole 30 mg/d significantly increase the healing rate (RR = 2.96; 95% CI = 1.78–5.14 and RR = 2.04; 95% CI = 1.13–3.53, respectively). There was no significant difference for the rate of adverse events between different regimens, including H2 RA for a duration of 4-week of follow up. Conclusion: There was no significant difference for the efficacy and tolerance between the ordinary doses of different PPIs with the exception of lansoprazle 15 mg/d. PMID:28139495
Analyzing Longitudinal Item Response Data via the Pairwise Fitting Method

ERIC Educational Resources Information Center

Fu, Zhi-Hui; Tao, Jian; Shi, Ning-Zhong; Zhang, Ming; Lin, Nan

2011-01-01

Multidimensional item response theory (MIRT) models can be applied to longitudinal educational surveys where a group of individuals are administered different tests over time with some common items. However, computational problems typically arise as the dimension of the latent variables increases. This is especially true when the latent variable…
Electrical detection and quantification of single and mixed DNA nucleotides in suspension

NASA Astrophysics Data System (ADS)

Ahmad, Mahmoud Al; Panicker, Neena G.; Rizvi, Tahir A.; Mustafa, Farah

2016-09-01

High speed sequential identification of the building blocks of DNA, (deoxyribonucleotides or nucleotides for short) without labeling or processing in long reads of DNA is the need of the hour. This can be accomplished through exploiting their unique electrical properties. In this study, the four different types of nucleotides that constitute a DNA molecule were suspended in a buffer followed by performing several types of electrical measurements. These electrical parameters were then used to quantify the suspended DNA nucleotides. Thus, we present a purely electrical counting scheme based on the semiconductor theory that allows one to determine the number of nucleotides in a solution by measuring their capacitance-voltage dependency. The nucleotide count was observed to be similar to the multiplication of the corresponding dopant concentration and debye volume after de-embedding the buffer contribution. The presented approach allows for a fast and label-free quantification of single and mixed nucleotides in a solution.
Transcriptome sequencing reveals genome-wide variation in molecular evolutionary rate among ferns.

PubMed

Grusz, Amanda L; Rothfels, Carl J; Schuettpelz, Eric

2016-08-30

Transcriptomics in non-model plant systems has recently reached a point where the examination of nuclear genome-wide patterns in understudied groups is an achievable reality. This progress is especially notable in evolutionary studies of ferns, for which molecular resources to date have been derived primarily from the plastid genome. Here, we utilize transcriptome data in the first genome-wide comparative study of molecular evolutionary rate in ferns. We focus on the ecologically diverse family Pteridaceae, which comprises about 10 % of fern diversity and includes the enigmatic vittarioid ferns-an epiphytic, tropical lineage known for dramatically reduced morphologies and radically elongated phylogenetic branch lengths. Using expressed sequence data for 2091 loci, we perform pairwise comparisons of molecular evolutionary rate among 12 species spanning the three largest clades in the family and ask whether previously documented heterogeneity in plastid substitution rates is reflected in their nuclear genomes. We then inquire whether variation in evolutionary rate is being shaped by genes belonging to specific functional categories and test for differential patterns of selection. We find significant, genome-wide differences in evolutionary rate for vittarioid ferns relative to all other lineages within the Pteridaceae, but we recover few significant correlations between faster/slower vittarioid loci and known functional gene categories. We demonstrate that the faster rates characteristic of the vittarioid ferns are likely not driven by positive selection, nor are they unique to any particular type of nucleotide substitution. Our results reinforce recently reviewed mechanisms hypothesized to shape molecular evolutionary rates in vittarioid ferns and provide novel insight into substitution rate variation both within and among fern nuclear genomes.
Global occurrence and heterogeneity of the Roseobacter-clade species Ruegeria mobilis

PubMed Central

Sonnenschein, Eva C; Nielsen, Kristian F; D'Alvise, Paul; Porsby, Cisse H; Melchiorsen, Jette; Heilmann, Jens; Kalatzis, Panos G; López-Pérez, Mario; Bunk, Boyke; Spröer, Cathrin; Middelboe, Mathias; Gram, Lone

2017-01-01

Tropodithietic acid (TDA)-producing Ruegeria mobilis strains of the Roseobacter clade have primarily been isolated from marine aquaculture and have probiotic potential due to inhibition of fish pathogens. We hypothesized that TDA producers with additional novel features are present in the oceanic environment. We isolated 42 TDA-producing R. mobilis strains during a global marine research cruise. While highly similar on the 16S ribosomal RNA gene level (99–100% identity), the strains separated into four sub-clusters in a multilocus sequence analysis. They were further differentiated to the strain level by average nucleotide identity using pairwise genome comparison. The four sub-clusters could not be associated with a specific environmental niche, however, correlated with the pattern of sub-typing using co-isolated phages, the number of prophages in the genomes and the distribution in ocean provinces. Major genomic differences within the sub-clusters include prophages and toxin-antitoxin systems. In general, the genome of R. mobilis revealed adaptation to a particle-associated life style and querying TARA ocean data confirmed that R. mobilis is more abundant in the particle-associated fraction than in the free-living fraction occurring in 40% and 6% of the samples, respectively. Our data and the TARA data, although lacking sufficient data from the polar regions, demonstrate that R. mobilis is a globally distributed marine bacterial species found primarily in the upper open oceans. It has preserved key phenotypic behaviors such as the production of TDA, but contains diverse sub-clusters, which could provide new capabilities for utilization in aquaculture. PMID:27552638
A method for integrating neuroimaging into genetic models of learning performance.

PubMed

Mehta, Chintan M; Gruen, Jeffrey R; Zhang, Heping

2017-01-01

Specific learning disorders (SLD) are an archetypal example of how clinical neuropsychological (NP) traits can differ from underlying genetic and neurobiological risk factors. Disparate environmental influences and pathologies impact learning performance assessed through cognitive examinations and clinical evaluations, the primary diagnostic tools for SLD. We propose a neurobiological risk for SLD with neuroimaging biomarkers, which is integrated into a genome-wide association study (GWAS) of learning performance in a cohort of 479 European individuals between 8 and 21 years of age. We first identified six regions of interest (ROIs) in temporal and anterior cingulate regions where the group diagnosed with learning disability has the least overall variation, relative to the other group, in thickness, area, and volume measurements. Although we used the three imaging measures, the thickness was the leading contributor. Hence, we calculated the Euclidean distances between any two individuals based on their thickness measures in the six ROIs. Then, we defined the relative similarity of one individual according to the averaged ranking of pairwise distances from the individuals to those in the SLD group. The inverse of this relative similarity is called the neurobiological risk for the individual. Single nucleotide polymorphisms in the AGBL1 gene on chromosome 15 had a significant association with learning performance at a genome-wide level. This finding was supported in an independent cohort of 2,327 individuals of the same demographic profile. Our statistical approach for integrating genetic and neuroimaging biomarkers can be extended into studying the biological basis of other NP traits. © 2016 WILEY PERIODICALS, INC.
Phylogeography of haplotypes of five microsatellites located in a low-recombination region of the X chromosome: studies worldwide and in Brazilian populations.

PubMed

Pereira, Rinaldo Wellerson; Pena, Sérgio D J

2006-01-01

We studied five microsatellites (DXS995, DXS8076, DXS8114, DXS1002 and DXS1050) located in a region of very low recombination rate in the long arm of the human X chromosome (Xq13.3-Xq21.3). No recombination was seen in 291 meioses in CEPH families. To test whether haplotypes composed of the five microsatellites could differentiate among distinct human continental populations, we studied an international panel containing 72 males from Africa, Europe, Asia and the America. Haplotypic diversity was very high within these groups and no haplotypes were shared among them. This led to the hope that we might be able to identify continent-specific lineages. However, in a median joining network there was no clear discrimination of the different continental groups. We then tested whether we could identify X chromosomal lineages from different continental origins in Brazilians. We typed 180 white Brazilians from four different geographical regions and examined their proportions of haplotype sharing with Africans, Asians, Europeans and Amerindians. No phylogeographical patterns emerged from the data. Moreover, there were several instances of the same haplotype being shared by many (and in one instance all) groups, suggesting that recombination might be occurring. We thus studied pairwise the level of linkage disequilibrium (LD) between the microsatellites. No detectable linkage disequilibrium between the most external loci DXS995 and DXS1050 was observed. Thus, even though recombination may be absent on short time spans, as seen in the CEPH pedigrees, on a long term basis it occurs often enough to dissipate all linkage disequilibrium. On the other hand, we observed very strong linkage disequilibrium between the pairs DXS995/DXS8076 and DXS1002/DXS8114, raising the possibility of resequencing the segment between them to identify single nucleotide polymorphisms (SNPs) in their intervals. The combination of X-linked microsatellites and SNPs in strong linkage disequilibrium might provide a powerful new tool to investigate human demographic history.
Two-dimensional cross correlation analysis of protein unfolding: Portrayal of the thermal denaturation of CMP kinases in the absence and presence of substrates

NASA Astrophysics Data System (ADS)

Schultz, Christian P.; Bârzu, Octavian; Mantsch, Henry H.

2000-03-01

The functional role of CMP kinases is to regenerate mono-phosphate nucleotides in cells by transferring phosphate residues from tri-phosphorylated nucleotides to monophosphorylated nucleotides. These enzymes possess two binding sites and maintain a highly conserved secondary structure. They are essential for cell survival. Herein we compare the infrared spectra of two similar, but not identical enzymes, the CMP kinases from Escherichia coli and Bacillus subtilis. A two-dimensional cross correlation analysis of the infrared spectra reveals differences in the denaturation behavior of the two proteins. Different secondary structure elements show different time-delayed or advanced unfolding events in the two enzymes. When bound to the active sites, the two nucleotide-substrates CMP and ATP exert a stabilizing effect on the structure of both proteins. The changes observed upon thermal denaturation are different for the two enzymes. Model 2D correlations are used to simulate the different denaturation of the two enzymes. Thermal denaturation and aggregation can be distinguished as two processes separated in time.
PCR/LDR/capillary electrophoresis for detection of single-nucleotide differences between fetal and maternal DNA in maternal plasma.

PubMed

Yi, Ping; Chen, Zhuqin; Zhao, Yan; Guo, Jianxin; Fu, Huabin; Zhou, Yuanguo; Yu, Lili; Li, Li

2009-03-01

The discovery of fetal DNA in maternal plasma has opened up an approach for noninvasive diagnosis. We have now assessed the possibility of detecting single-nucleotide differences between fetal and maternal DNA in maternal plasma by polymerase chain reaction (PCR)/ligase detection reaction((LDR)/capillary electrophoresis. PCR/LDR/capillary electrophoresis was applied to detect the genotype of c.454-397T>gene (ESR1) from experimental DNA models of maternal plasma at different sensitivity levels and 13 maternal plasma samples.alphaC in estrogen receptor. (1) Our results demonstrated that the technique could discriminate low abundance single-nucleotide mutation with a mutant/normal allele ratio up to 1:10 000. (2) Examination of ESR1 c.454-397T>C genotypes by using the method of restriction fragment length analysis was performed in 25 pregnant women, of whom 13 pregnant women had homozygous genotypes. The c.454-397T>C genotypes of paternally inherited fetal DNA in maternal plasma of these 13 women were detected by PCR/LDR/capillary electrophoresis, which were accordant with the results of umbilical cord blood. PCR/LDR/capillary electrophoresis has very high sensitivity to distinguish low abundance single nucleotide differences and can discriminate point mutations and single-nucleotide polymorphisms(SNPs) of paternally inherited fetal DNA in maternal plasma.
Real-time single-molecule electronic DNA sequencing by synthesis using polymer-tagged nucleotides on a nanopore array

PubMed Central

Fuller, Carl W.; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Bibillo, Arek; Stranges, P. Benjamin; Dorwart, Michael; Tao, Chuanjuan; Li, Zengmin; Guo, Wenjing; Shi, Shundi; Korenblum, Daniel; Trans, Andrew; Aguirre, Anne; Liu, Edward; Harada, Eric T.; Pollard, James; Bhat, Ashwini; Cech, Cynthia; Yang, Alexander; Arnold, Cleoma; Palla, Mirkó; Hovis, Jennifer; Chen, Roger; Morozova, Irina; Kalachikov, Sergey; Russo, James J.; Kasianowicz, John J.; Davis, Randy; Roever, Stefan; Church, George M.; Ju, Jingyue

2016-01-01

DNA sequencing by synthesis (SBS) offers a robust platform to decipher nucleic acid sequences. Recently, we reported a single-molecule nanopore-based SBS strategy that accurately distinguishes four bases by electronically detecting and differentiating four different polymer tags attached to the 5′-phosphate of the nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. Further developing this approach, we report here the use of nucleotides tagged at the terminal phosphate with oligonucleotide-based polymers to perform nanopore SBS on an α-hemolysin nanopore array platform. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primer/template and polymerase, the tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. The use of these polymer-tagged nucleotides, combined with polymerase tethering to nanopores and multiplexed nanopore sensors, should lead to new high-throughput sequencing methods. PMID:27091962
Evidence of codon usage in the nearest neighbor spacing distribution of bases in bacterial genomes

NASA Astrophysics Data System (ADS)

Higareda, M. F.; Geiger, O.; Mendoza, L.; Méndez-Sánchez, R. A.

2012-02-01

Statistical analysis of whole genomic sequences usually assumes a homogeneous nucleotide density throughout the genome, an assumption that has been proved incorrect for several organisms since the nucleotide density is only locally homogeneous. To avoid giving a single numerical value to this variable property, we propose the use of spectral statistics, which characterizes the density of nucleotides as a function of its position in the genome. We show that the cumulative density of bases in bacterial genomes can be separated into an average (or secular) plus a fluctuating part. Bacterial genomes can be divided into two groups according to the qualitative description of their secular part: linear and piecewise linear. These two groups of genomes show different properties when their nucleotide spacing distribution is studied. In order to analyze genomes having a variable nucleotide density, statistically, the use of unfolding is necessary, i.e., to get a separation between the secular part and the fluctuations. The unfolding allows an adequate comparison with the statistical properties of other genomes. With this methodology, four genomes were analyzed Burkholderia, Bacillus, Clostridium and Corynebacterium. Interestingly, the nearest neighbor spacing distributions or detrended distance distributions are very similar for species within the same genus but they are very different for species from different genera. This difference can be attributed to the difference in the codon usage.
Maximally informative pairwise interactions in networks

PubMed Central

Fitzgerald, Jeffrey D.; Sharpee, Tatyana O.

2010-01-01

Several types of biological networks have recently been shown to be accurately described by a maximum entropy model with pairwise interactions, also known as the Ising model. Here we present an approach for finding the optimal mappings between input signals and network states that allow the network to convey the maximal information about input signals drawn from a given distribution. This mapping also produces a set of linear equations for calculating the optimal Ising-model coupling constants, as well as geometric properties that indicate the applicability of the pairwise Ising model. We show that the optimal pairwise interactions are on average zero for Gaussian and uniformly distributed inputs, whereas they are nonzero for inputs approximating those in natural environments. These nonzero network interactions are predicted to increase in strength as the noise in the response functions of each network node increases. This approach also suggests ways for how interactions with unmeasured parts of the network can be inferred from the parameters of response functions for the measured network nodes. PMID:19905153

Threesomes destabilise certain relationships: multispecies interactions between wood decay fungi in natural resources

PubMed Central

Savoury, Melanie; Toledo, Selin; Kingscott-Edmunds, James; Bettridge, Aimee; Waili, Nasra Al; Boddy, Lynne

2017-01-01

Abstract Understanding interspecific interactions is key to explaining and modelling community development and associated ecosystem function. Most interactions research has focused on pairwise combinations, overlooking the complexity of multispecies communities. This study investigated three-way interactions between saprotrophic fungi in wood and across soil, and indicated that pairwise combinations are often inaccurate predictors of the outcomes of multispecies competition in wood block interactions. This inconsistency was especially true of intransitive combinations, resulting in increased species coexistence within the resource. Furthermore, the addition of a third competitor frequently destabilised the otherwise consistent outcomes of pairwise combinations in wood blocks, which occasionally resulted in altered resource decomposition rates, depending on the relative decay abilities of the species involved. Conversely, interaction outcomes in soil microcosms were unaffected by the presence of a third combatant. Multispecies interactions promoted species diversity within natural resources, and made community dynamics less consistent than could be predicted from pairwise interaction studies. PMID:28175239
Development of a method for the analysis of nucleotides from the mantle tissue of the mussel Mytilus galloprovincialis.

PubMed

Blanco López, S L; Moal, J; San Juan Serrano, F

2000-09-01

Reversed-phase HPLC was applied to obtain a sensitive and efficient means for quantitating nucleotides in the mussel Mytilus galloprovincialis. We obtained a good separation of adenylic, guanylic, uridylic and cytidylic nucleotides. Adenine nucleotides play a critical role in the regulation and integration of cellular metabolism; particularly in the mantle tissue in the mussel, they are involved in the regulation of the enzyme glycogen phosphorylase, a key enzyme in the transfer of bioenergetic reserves (glycogen) to gametogenic development; it is of great importance to have a measure of the concentrations in vivo during the reproductive cycle of the organism. Different elution conditions were tested: isocratic versus step gradient elution, different mobile phase pH and the type and proportion of ion-pairing agent added to the mobile phase. The best method was selected and the separation and accurate determination of adenine, citidine, guanine and uridine nucleotides was accomplished within a 20-min run, with UV-Vis detection (254 nm).
Interactions of praseodymium and neodymium with nucleosides and nucleotides: absorption difference and comparative absorption spectral study.

PubMed

Misra, S N; Anjaiah, K; Joseph, G; Abdi, S H

1992-02-01

The interactions of praseodymium(III) and neodymium(III) with nucleosides and nucleotides have been studied in different stoichiometry in water and water-DMF mixtures by employing absorption difference and comparative absorption spectrophotometry. The 4f-4f bands were analysed by linear curve analysis followed by gaussian curve analysis, and various spectral parameters were computed, using partial and multiple regression method. The magnitude of changes in both energy interaction and intensity were used to explore the degree of outer and inner sphere coordination, incidence of covalency and the extent of metal 4f-orbital involvement in chemical bonding. Crystalline complexes of the type [Ln(nucleotide)2(H2O)2]- (where nucleotide--GMP or IMP) were characterized by IR, 1H NMR, 31P NMR data. These studies indicated that the binding of the nucleotide is through phosphate oxygen in a bidentate manner and the complexes undergo substantial ionisation in aqueous medium, thereby supporting the observed weak 4f-4f bands and lower values for nephelauxetic effect (1-beta), bonding (b) and covalency (delta) parameters derived from coulombic and spin orbit interaction parameters.
Effects of nutrition (herbivore vs carnivore) on energy charge and nucleotide composition in Hyas araneus larvae

NASA Astrophysics Data System (ADS)

Harms, J.

1992-03-01

Growth rate expressed as dry weight, elemetnal composition (C, N, H), protein content and nucleotide composition (ATP, ADP, AMP, CTP, GTP and UTP) as well as adenosine were measured in laboratory cultured Hyas araneus larvae fed two different diets. One group was fed freshly hatched Artemia sp. nauplii, the other the diatom Odontella (Biddulphia) sinensis. Growth rate was reduced in the O. sinensis-fed group, reaching 20 to 50% of the growth rate of Artemia-fed larvae. In all cases, some further development to the next instar occurred when larvae were fed O. sinensis, although at reduced levels compared to Artemia-fed larvae. The adenylic energy charge was quite similar for the two nutritional conditions tested and therefore does not reflect the reduced growth rate in O. sinensis-fed larvae. The individual nucleotide content was clearly reduced in O. sinensis-fed larvae, reflecting the nutritional conditions already during early developmental periods. These reduced amount of nucleotides in O. sinensis-fed larvae were most obvious when adenylic nucleotide contents were pooled. Pooled adenylic nucleotides were found to be correlated with the individual content of carbon and protein, showing significant differences at both nutritional conditions tested.
Interaction between macrocyclic nickel complexes and the nucleotides GMP, AMP and ApG.

PubMed

Liu, Yangzhong; Sletten, Einar

2003-01-15

Reactions between the nucleotides GMP, AMP and ApG and the complexes Ni(tren), Ni(cyclam) and NiCR in aqueous solution have been monitored by (1)H, (15)N NMR and UV spectroscopy. The three nickel complexes display different properties in reactions with nucleotides. Ni(tren) which has a pseudo-octahedral coordination geometry was shown to bind to all three nucleotides. Ni(cyclam) and NiCR, both with four nitrogen atoms in a square planar arrangement are not able to bind to nucleotides efficiently because of steric hindrance. Oxidation of Ni(cyclam) by KHSO(5) to produce trivalent Ni(III)(cyclam) improves the coordination capacity, while oxidation of NiCR does not produce a similar effect. The nucleotides interact with trivalent nickel complexes to different extent. Ni(III)CR is seen to oxidize GMP gradually but does not affect AMP significantly. Ni(III)(cyclam), on the other hand, does not oxidize either GMP or AMP at the 1:1 concentration of oxidant used. This result is probably due to the lower redox potential of Ni(cyclam). ApG binds less efficiently to the Ni complexes but is easier oxidized than the mononucleotides.
Effect of the nucleotides surrounding the start codon on the translation of foot-and-mouth disease virus RNA.

PubMed

Ma, X X; Feng, Y P; Gu, Y X; Zhou, J H; Ma, Z R

2016-06-01

As for the alternative AUGs in foot-and-mouth disease virus (FMDV), nucleotide bias of the context flanking the AUG(2nd) could be used as a strong signal to initiate translation. To determine the role of the specific nucleotide context, dicistronic reporter constructs were engineered to contain different versions of nucleotide context linking between internal ribosome entry site (IRES) and downstream gene. The results indicate that under FMDV IRES-dependent mechanism, the nucleotide contexts flanking start codon can influence the translation initiation efficiencies. The most optimal sequences for both start codons have proved to be UUU AUG(1st) AAC and AAG AUG(2nd) GAA.
Acid-soluble nucleotides of pinto bean leaves at different stages of development.

PubMed

Weinstein, L H; McCune, D C; Mancini, J F; van Leuken, P

1969-11-01

Acid-soluble nucleotides of unifoliate leaves of Pinto bean plants (Phaseolus vulgaris L.) were determined at young, mature, and senescent stages of development. At least 25 components could be distinguished on the basis of inorganic phosphorus determinations and 37 or more fractions on the basis of (32)P labeling, with adenosine di- and triphosphates accounting for 60% of the total moles of nucleotide. The total nucleotide P and inorganic P, on a fresh weight basis, decreased about 44% between each stage of leaf development, but decrements in the levels of individual nucleotides varied from this over-all pattern.Minor changes in the relative abundance of the individual nucleotides accompanied aging although the percentage of purine-containing nucleotides decreased with age. Total (32)P activity per leaf in the nucleotide pool increased about 3-fold between the young and mature leaves and decreased slightly as leaves became senescent. In general, the specific activities of the nucleotides increased with increased age and adenosine-, guanosine-, uridine-, and cytidine triphosphates and adenosine diphosphate accounted for approximately 90% of the total activity. The changes in the relative sizes and energy status of the nucleotide pools were not so obvious as the changes in other metabolites that have been reported to accompany aging in leaf tissue.
Blazing Signature Filter: a library for fast pairwise similarity comparisons

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lee, Joon-Yong; Fujimoto, Grant M.; Wilson, Ryan

Identifying similarities between datasets is a fundamental task in data mining and has become an integral part of modern scientific investigation. Whether the task is to identify co-expressed genes in large-scale expression surveys or to predict combinations of gene knockouts which would elicit a similar phenotype, the underlying computational task is often a multi-dimensional similarity test. As datasets continue to grow, improvements to the efficiency, sensitivity or specificity of such computation will have broad impacts as it allows scientists to more completely explore the wealth of scientific data. A significant practical drawback of large-scale data mining is the vast majoritymore » of pairwise comparisons are unlikely to be relevant, meaning that they do not share a signature of interest. It is therefore essential to efficiently identify these unproductive comparisons as rapidly as possible and exclude them from more time-intensive similarity calculations. The Blazing Signature Filter (BSF) is a highly efficient pairwise similarity algorithm which enables extensive data mining within a reasonable amount of time. The algorithm transforms datasets into binary metrics, allowing it to utilize the computationally efficient bit operators and provide a coarse measure of similarity. As a result, the BSF can scale to high dimensionality and rapidly filter unproductive pairwise comparison. Two bioinformatics applications of the tool are presented to demonstrate the ability to scale to billions of pairwise comparisons and the usefulness of this approach.« less
The structure of pairwise correlation in mouse primary visual cortex reveals functional organization in the absence of an orientation map.

PubMed

Denman, Daniel J; Contreras, Diego

2014-10-01

Neural responses to sensory stimuli are not independent. Pairwise correlation can reduce coding efficiency, occur independent of stimulus representation, or serve as an additional channel of information, depending on the timescale of correlation and the method of decoding. Any role for correlation depends on its magnitude and structure. In sensory areas with maps, like the orientation map in primary visual cortex (V1), correlation is strongly related to the underlying functional architecture, but it is unclear whether this correlation structure is an essential feature of the system or arises from the arrangement of cells in the map. We assessed the relationship between functional architecture and pairwise correlation by measuring both synchrony and correlated spike count variability in mouse V1, which lacks an orientation map. We observed significant pairwise synchrony, which was organized by distance and relative orientation preference between cells. We also observed nonzero correlated variability in both the anesthetized (0.16) and awake states (0.18). Our results indicate that the structure of pairwise correlation is maintained in the absence of an underlying anatomical organization and may be an organizing principle of the mammalian visual system preserved by nonrandom connectivity within local networks. © The Author 2013. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Pairwise Maximum Entropy Models for Studying Large Biological Systems: When They Can Work and When They Can't

PubMed Central

Roudi, Yasser; Nirenberg, Sheila; Latham, Peter E.

2009-01-01

One of the most critical problems we face in the study of biological systems is building accurate statistical descriptions of them. This problem has been particularly challenging because biological systems typically contain large numbers of interacting elements, which precludes the use of standard brute force approaches. Recently, though, several groups have reported that there may be an alternate strategy. The reports show that reliable statistical models can be built without knowledge of all the interactions in a system; instead, pairwise interactions can suffice. These findings, however, are based on the analysis of small subsystems. Here, we ask whether the observations will generalize to systems of realistic size, that is, whether pairwise models will provide reliable descriptions of true biological systems. Our results show that, in most cases, they will not. The reason is that there is a crossover in the predictive power of pairwise models: If the size of the subsystem is below the crossover point, then the results have no predictive power for large systems. If the size is above the crossover point, then the results may have predictive power. This work thus provides a general framework for determining the extent to which pairwise models can be used to predict the behavior of large biological systems. Applied to neural data, the size of most systems studied so far is below the crossover point. PMID:19424487
Genomic single-nucleotide polymorphisms confirm that Gunnison and Greater sage-grouse are genetically well differentiated and that the Bi-State population is distinct

USGS Publications Warehouse

Oyler-McCance, Sara J.; Cornman, Robert S.; Jones, Kenneth L.; Fike, Jennifer

2015-01-01

Sage-grouse are iconic, declining inhabitants of sagebrush habitats in western North America, and their management depends on an understanding of genetic variation across the landscape. Two distinct species of sage-grouse have been recognized, Greater (Centrocercus urophasianus) and Gunnison sage-grouse (C. minimus), based on morphology, behavior, and variation at neutral genetic markers. A parapatric group of Greater Sage-Grouse along the border of California and Nevada ("Bi-State") is also genetically distinct at the same neutral genetic markers, yet not different in behavior or morphology. Because delineating taxonomic boundaries and defining conservation units is often difficult in recently diverged taxa and can be further complicated by highly skewed mating systems, we took advantage of new genomic methods that improve our ability to characterize genetic variation at a much finer resolution. We identified thousands of single-nucleotide polymorphisms (SNPs) among Gunnison, Greater, and Bi-State sage-grouse and used them to comprehensively examine levels of genetic diversity and differentiation among these groups. The pairwise multilocus fixation index (FST) was high (0.49) between Gunnison and Greater sage-grouse, and both principal coordinates analysis and model-based clustering grouped samples unequivocally by species. Standing genetic variation was lower within the Gunnison Sage-Grouse. The Bi-State population was also significantly differentiated from Greater Sage-Grouse, albeit more weakly (FST = 0.09), and genetic clustering results were consistent with reduced gene flow with Greater Sage-Grouse. No comparable genetic divisions were found within the Greater Sage-Grouse sample, which spanned the southern half of the range. Thus, we provide much stronger genetic evidence supporting the recognition of Gunnison Sage-Grouse as a distinct species with low genetic diversity. Further, our work confirms that the Bi-State population is differentiated from other Greater Sage-Grouse. The level of differentiation is much less than the divergence between Greater and Gunnison sage-grouse, supporting the idea that the Bi-State represents a unique population within the Greater Sage-Grouse. New genomic methods like the restriction-site-associated DNA (RAD-tag) method used here illustrate how increasing the number of markers and coverage of the genome can better characterize patterns of genetic variation, particularly among recently diverged taxa, providing vital information for conservation and management.
Evaluating the Quality of Evidence from a Network Meta-Analysis

PubMed Central

Salanti, Georgia; Del Giovane, Cinzia; Chaimani, Anna; Caldwell, Deborah M.; Higgins, Julian P. T.

2014-01-01

Systematic reviews that collate data about the relative effects of multiple interventions via network meta-analysis are highly informative for decision-making purposes. A network meta-analysis provides two types of findings for a specific outcome: the relative treatment effect for all pairwise comparisons, and a ranking of the treatments. It is important to consider the confidence with which these two types of results can enable clinicians, policy makers and patients to make informed decisions. We propose an approach to determining confidence in the output of a network meta-analysis. Our proposed approach is based on methodology developed by the Grading of Recommendations Assessment, Development and Evaluation (GRADE) Working Group for pairwise meta-analyses. The suggested framework for evaluating a network meta-analysis acknowledges (i) the key role of indirect comparisons (ii) the contributions of each piece of direct evidence to the network meta-analysis estimates of effect size; (iii) the importance of the transitivity assumption to the validity of network meta-analysis; and (iv) the possibility of disagreement between direct evidence and indirect evidence. We apply our proposed strategy to a systematic review comparing topical antibiotics without steroids for chronically discharging ears with underlying eardrum perforations. The proposed framework can be used to determine confidence in the results from a network meta-analysis. Judgements about evidence from a network meta-analysis can be different from those made about evidence from pairwise meta-analyses. PMID:24992266
Untargeted analysis of chromatographic data for green and fermented rooibos: Problem with size effect removal.

PubMed

Tobin, Jade; Walach, Jan; de Beer, Dalene; Williams, Paul J; Filzmoser, Peter; Walczak, Beata

2017-11-24

While analyzing chromatographic data, it is necessary to preprocess it properly before exploration and/or supervised modeling. To make chromatographic signals comparable, it is crucial to remove the scaling effect, caused by differences in overall sample concentrations. One of the efficient methods of signal scaling is Probabilistic Quotient Normalization (PQN) [1]. However, it can be applied only to data for which the majority of features do not vary systematically among the studied classes of signals. When studying the influence of the traditional "fermentation" (oxidation) process on the concentration of 56 individual peaks detected in rooibos plant material, this assumption is not fulfilled. In this case, the only possible solution is the analysis of pairwise log-ratios, which are not influenced by the scaling constant. To estimate significant features, i.e., peaks differentiating the studied classes of samples (green and fermented rooibos plant material), we propose the application of rPLR (robust pair-wise log-ratios) as proposed by Walach et al. [2]. It allows for fast computation and identification of the significant features in terms of original variables (peaks) which is problematic, while working with the unfolded pair-wise log ratios. As demonstrated, it can be applied to designed data sets and in the case of contaminated data, it allows proper conclusions. Copyright © 2017 Elsevier B.V. All rights reserved.
Score distributions of gapped multiple sequence alignments down to the low-probability tail

NASA Astrophysics Data System (ADS)

Fieth, Pascal; Hartmann, Alexander K.

2016-08-01

Assessing the significance of alignment scores of optimally aligned DNA or amino acid sequences can be achieved via the knowledge of the score distribution of random sequences. But this requires obtaining the distribution in the biologically relevant high-scoring region, where the probabilities are exponentially small. For gapless local alignments of infinitely long sequences this distribution is known analytically to follow a Gumbel distribution. Distributions for gapped local alignments and global alignments of finite lengths can only be obtained numerically. To obtain result for the small-probability region, specific statistical mechanics-based rare-event algorithms can be applied. In previous studies, this was achieved for pairwise alignments. They showed that, contrary to results from previous simple sampling studies, strong deviations from the Gumbel distribution occur in case of finite sequence lengths. Here we extend the studies to multiple sequence alignments with gaps, which are much more relevant for practical applications in molecular biology. We study the distributions of scores over a large range of the support, reaching probabilities as small as 10-160, for global and local (sum-of-pair scores) multiple alignments. We find that even after suitable rescaling, eliminating the sequence-length dependence, the distributions for multiple alignment differ from the pairwise alignment case. Furthermore, we also show that the previously discussed Gaussian correction to the Gumbel distribution needs to be refined, also for the case of pairwise alignments.
Molecular markers for establishing distinctness in vegetatively propagated crops: a case study in grapevine.

PubMed

Ibáñez, Javier; Vélez, M Dolores; de Andrés, M Teresa; Borrego, Joaquín

2009-11-01

Distinctness, uniformity and stability (DUS) testing of varieties is usually required to apply for Plant Breeders' Rights. This exam is currently carried out using morphological traits, where the establishment of distinctness through a minimum distance is the key issue. In this study, the possibility of using microsatellite markers for establishing the minimum distance in a vegetatively propagated crop (grapevine) has been evaluated. A collection of 991 accessions have been studied with nine microsatellite markers and pair-wise compared, and the highest intra-variety distance and the lowest inter-variety distance determined. The collection included 489 different genotypes, and synonyms and sports. Average values for number of alleles per locus (19), Polymorphic Information Content (0.764) and heterozygosities observed (0.773) and expected (0.785) indicated the high level of polymorphism existing in grapevine. The maximum intra-variety variability found was one allele between two accessions of the same variety, of a total of 3,171 pair-wise comparisons. The minimum inter-variety variability found was two alleles between two pairs of varieties, of a total of 119,316 pair-wise comparisons. In base to these results, the minimum distance required to set distinctness in grapevine with the nine microsatellite markers used could be established in two alleles. General rules for the use of the system as a support for establishing distinctness in vegetatively propagated crops are discussed.
Saving the Best for Last? A Cross-Species Analysis of Choices between Reinforcer Sequences

ERIC Educational Resources Information Center

Andrade, Leonardo F.; Hackenberg, Timothy D.

2012-01-01

Two experiments were conducted to compare choices between sequences of reinforcers in pigeon (Experiment 1) and human (Experiment 2) subjects, using functionally analogous procedures. The subjects made pairwise choices among 3 sequence types, all of which provided the same overall reinforcement rate, but differed in their temporal patterning.…
A New Algorithm to Create Balanced Teams Promoting More Diversity

ERIC Educational Resources Information Center

Dias, Teresa Galvão; Borges, José

2017-01-01

The problem of assigning students to teams can be described as maximising their profiles diversity within teams while minimising the differences among teams. This problem is commonly known as the maximally diverse grouping problem and it is usually formulated as maximising the sum of the pairwise distances among students within teams. We propose…
Effect of congenital blindness on the semantic representation of some everyday concepts.

PubMed

Connolly, Andrew C; Gleitman, Lila R; Thompson-Schill, Sharon L

2007-05-15

This study explores how the lack of first-hand experience with color, as a result of congenital blindness, affects implicit judgments about "higher-order" concepts, such as "fruits and vegetables" (FV), but not others, such as "household items" (HHI). We demonstrate how the differential diagnosticity of color across our test categories interacts with visual experience to produce, in effect, a category-specific difference in implicit similarity. Implicit pair-wise similarity judgments were collected by using an odd-man-out triad task. Pair-wise similarities for both FV and for HHI were derived from this task and were compared by using cluster analysis and regression analyses. Color was found to be a significant component in the structure of implicit similarity for FV for sighted participants but not for blind participants; and this pattern remained even when the analysis was restricted to blind participants who had good explicit color knowledge of the stimulus items. There was also no evidence that either subject group used color knowledge in making decisions about HHI, nor was there an indication of any qualitative differences between blind and sighted subjects' judgments on HHI.
DIALIGN P: fast pair-wise and multiple sequence alignment using parallel processors.

PubMed

Schmollinger, Martin; Nieselt, Kay; Kaufmann, Michael; Morgenstern, Burkhard

2004-09-09

Parallel computing is frequently used to speed up computationally expensive tasks in Bioinformatics. Herein, a parallel version of the multi-alignment program DIALIGN is introduced. We propose two ways of dividing the program into independent sub-routines that can be run on different processors: (a) pair-wise sequence alignments that are used as a first step to multiple alignment account for most of the CPU time in DIALIGN. Since alignments of different sequence pairs are completely independent of each other, they can be distributed to multiple processors without any effect on the resulting output alignments. (b) For alignments of large genomic sequences, we use a heuristics by splitting up sequences into sub-sequences based on a previously introduced anchored alignment procedure. For our test sequences, this combined approach reduces the program running time of DIALIGN by up to 97%. By distributing sub-routines to multiple processors, the running time of DIALIGN can be crucially improved. With these improvements, it is possible to apply the program in large-scale genomics and proteomics projects that were previously beyond its scope.
Characterization of demographic expansions from pairwise comparisons of linked microsatellite haplotypes.

PubMed

Navascués, Miguel; Hardy, Olivier J; Burgarella, Concetta

2009-03-01

This work extends the methods of demographic inference based on the distribution of pairwise genetic differences between individuals (mismatch distribution) to the case of linked microsatellite data. Population genetics theory describes the distribution of mutations among a sample of genes under different demographic scenarios. However, the actual number of mutations can rarely be deduced from DNA polymorphisms. The inclusion of mutation models in theoretical predictions can improve the performance of statistical methods. We have developed a maximum-pseudolikelihood estimator for the parameters that characterize a demographic expansion for a series of linked loci evolving under a stepwise mutation model. Those loci would correspond to DNA polymorphisms of linked microsatellites (such as those found on the Y chromosome or the chloroplast genome). The proposed method was evaluated with simulated data sets and with a data set of chloroplast microsatellites that showed signal for demographic expansion in a previous study. The results show that inclusion of a mutational model in the analysis improves the estimates of the age of expansion in the case of older expansions.

Design, Implementation and Deployment of PAIRwise

ERIC Educational Resources Information Center

Knight, Allan; Almeroth, Kevin; Bimber, Bruce

2008-01-01

Increased access to the Internet has dramatically increased the sources from which students can deliberately or accidentally copy information. This article discusses our motivation to design, implement, and deploy an Internet based plagiarism detection system, called PAIRwise, to address this growing problem. We give details as to how we detect…
From micro-correlations to macro-correlations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Eliazar, Iddo, E-mail: iddo.eliazar@intel.com

2016-11-15

Random vectors with a symmetric correlation structure share a common value of pair-wise correlation between their different components. The symmetric correlation structure appears in a multitude of settings, e.g. mixture models. In a mixture model the components of the random vector are drawn independently from a general probability distribution that is determined by an underlying parameter, and the parameter itself is randomized. In this paper we study the overall correlation of high-dimensional random vectors with a symmetric correlation structure. Considering such a random vector, and terming its pair-wise correlation “micro-correlation”, we use an asymptotic analysis to derive the random vector’smore » “macro-correlation” : a score that takes values in the unit interval, and that quantifies the random vector’s overall correlation. The method of obtaining macro-correlations from micro-correlations is then applied to a diverse collection of frameworks that demonstrate the method’s wide applicability.« less
Upscaling of fungal-bacterial interactions: from the lab to the field.

PubMed

de Boer, Wietse

2017-06-01

Fungal-bacterial interactions (FBI) are an integral component of microbial community networks in terrestrial ecosystems. During the last decade, the attention for FBI has increased tremendously. For a wide variety of FBI, information has become available on the mechanisms and functional responses. Yet, most studies have focused on pairwise interactions under controlled conditions. The question to what extent such studies are relevant to assess the importance of FBI for functioning of natural microbial communities in real ecosystems remains largely unanswered. Here, the information obtained by studying a type of FBI, namely antagonistic interactions between bacteria and plant pathogenic fungi, is discussed for different levels of community complexity. Based on this, general recommendations are given to integrate pairwise and ecosystem FBI studies. This approach could lead to the development of novel strategies to steer terrestrial ecosystem functioning. Copyright © 2017 Elsevier Ltd. All rights reserved.
Dynamical pairwise entanglement and two-point correlations in the three-ligand spin-star structure

NASA Astrophysics Data System (ADS)

Motamedifar, M.

2017-10-01

We consider the three-ligand spin-star structure through homogeneous Heisenberg interactions (XXX-3LSSS) in the framework of dynamical pairwise entanglement. It is shown that the time evolution of the central qubit ;one-particle; state (COPS) brings about the generation of quantum W states at periodical time instants. On the contrary, W states cannot be generated from the time evolution of a ligand ;one-particle; state (LOPS). We also investigate the dynamical behavior of two-point quantum correlations as well as the expectation values of the different spin-components for each element in the XXX-3LSSS. It is found that when a W state is generated, the same value of the concurrence between any two arbitrary qubits arises from the xx and yy two-point quantum correlations. On the opposite, zz quantum correlation between any two qubits vanishes at these time instants.
The rise and fall of a challenger: the Bullet Cluster in Λ cold dark matter simulations

NASA Astrophysics Data System (ADS)

Thompson, Robert; Davé, Romeel; Nagamine, Kentaro

2015-09-01

The Bullet Cluster has provided some of the best evidence for the Λ cold dark matter (ΛCDM) model via direct empirical proof of the existence of collisionless dark matter, while posing a serious challenge owing to the unusually high inferred pairwise velocities of its progenitor clusters. Here, we investigate the probability of finding such a high-velocity pair in large-volume N-body simulations, particularly focusing on differences between halo-finding algorithms. We find that algorithms that do not account for the kinematics of infalling groups yield vastly different statistics and probabilities. When employing the ROCKSTAR halo finder that considers particle velocities, we find numerous Bullet-like pair candidates that closely match not only the high pairwise velocity, but also the mass, mass ratio, separation distance, and collision angle of the initial conditions that have been shown to produce the Bullet Cluster in non-cosmological hydrodynamic simulations. The probability of finding a high pairwise velocity pair among haloes with Mhalo ≥ 1014 M⊙ is 4.6 × 10-4 using ROCKSTAR, while it is ≈34 × lower using a friends-of-friends (FoF)-based approach as in previous studies. This is because the typical spatial extent of Bullet progenitors is such that FoF tends to group them into a single halo despite clearly distinct kinematics. Further requiring an appropriately high average mass among the two progenitors, we find the comoving number density of potential Bullet-like candidates to be of the order of ≈10-10 Mpc-3. Our findings suggest that ΛCDM straightforwardly produces massive, high relative velocity halo pairs analogous to Bullet Cluster progenitors, and hence the Bullet Cluster does not present a challenge to the ΛCDM model.
Pairwise gene GO-based measures for biclustering of high-dimensional expression data.

PubMed

Nepomuceno, Juan A; Troncoso, Alicia; Nepomuceno-Chamorro, Isabel A; Aguilar-Ruiz, Jesús S

2018-01-01

Biclustering algorithms search for groups of genes that share the same behavior under a subset of samples in gene expression data. Nowadays, the biological knowledge available in public repositories can be used to drive these algorithms to find biclusters composed of groups of genes functionally coherent. On the other hand, a distance among genes can be defined according to their information stored in Gene Ontology (GO). Gene pairwise GO semantic similarity measures report a value for each pair of genes which establishes their functional similarity. A scatter search-based algorithm that optimizes a merit function that integrates GO information is studied in this paper. This merit function uses a term that addresses the information through a GO measure. The effect of two possible different gene pairwise GO measures on the performance of the algorithm is analyzed. Firstly, three well known yeast datasets with approximately one thousand of genes are studied. Secondly, a group of human datasets related to clinical data of cancer is also explored by the algorithm. Most of these data are high-dimensional datasets composed of a huge number of genes. The resultant biclusters reveal groups of genes linked by a same functionality when the search procedure is driven by one of the proposed GO measures. Furthermore, a qualitative biological study of a group of biclusters show their relevance from a cancer disease perspective. It can be concluded that the integration of biological information improves the performance of the biclustering process. The two different GO measures studied show an improvement in the results obtained for the yeast dataset. However, if datasets are composed of a huge number of genes, only one of them really improves the algorithm performance. This second case constitutes a clear option to explore interesting datasets from a clinical point of view.
Tumor segmentation on FDG-PET: usefulness of locally connected conditional random fields

NASA Astrophysics Data System (ADS)

Nishio, Mizuho; Kono, Atsushi K.; Koyama, Hisanobu; Nishii, Tatsuya; Sugimura, Kazuro

2015-03-01

This study aimed to develop software for tumor segmentation on 18F-fluorodeoxyglucose (FDG) positron emission tomography (PET). To segment the tumor from the background, we used graph cut, whose segmentation energy was generally divided into two terms: the unary and pairwise terms. Locally connected conditional random fields (LCRF) was proposed for the pairwise term. In LCRF, a three-dimensional cubic window with length L was set for each voxel, and voxels within the window were considered for the pairwise term. To evaluate our method, 64 clinically suspected metastatic bone tumors were tested, which were revealed by FDG-PET. To obtain ground truth, the tumors were manually delineated via consensus of two board-certified radiologists. To compare the LCRF accuracy, other types of segmentation were also applied such as region-growing based on 35%, 40%, and 45% of the tumor maximum standardized uptake value (RG35, RG40, and RG45, respectively), SLIC superpixels (SS), and region-based active contour models (AC). To validate the tumor segmentation accuracy, a dice similarity coefficient (DSC) was calculated between manual segmentation and result of each technique. The DSC difference was tested using the Wilcoxon signed rank test. The mean DSCs of LCRF at L = 3, 5, 7, and 9 were 0.784, 0.801, 0.809, and 0.812, respectively. The mean DSCs of other techniques were RG35, 0.633; RG40, 0.675; RG45, 0.689; SS, 0.709; and AC, 0.758. The DSC differences between LCRF and other techniques were statistically significant (p <0.05). In conclusion, tumor segmentation was more reliably performed with LCRF relative to other techniques.
Phosphate-Modified Nucleotides for Monitoring Enzyme Activity.

PubMed

Ermert, Susanne; Marx, Andreas; Hacker, Stephan M

2017-04-01

Nucleotides modified at the terminal phosphate position have been proven to be interesting entities to study the activity of a variety of different protein classes. In this chapter, we present various types of modifications that were attached as reporter molecules to the phosphate chain of nucleotides and briefly describe the chemical reactions that are frequently used to synthesize them. Furthermore, we discuss a variety of applications of these molecules. Kinase activity, for instance, was studied by transfer of a phosphate modified with a reporter group to the target proteins. This allows not only studying the activity of kinases, but also identifying their target proteins. Moreover, kinases can also be directly labeled with a reporter at a conserved lysine using acyl-phosphate probes. Another important application for phosphate-modified nucleotides is the study of RNA and DNA polymerases. In this context, single-molecule sequencing is made possible using detection in zero-mode waveguides, nanopores or by a Förster resonance energy transfer (FRET)-based mechanism between the polymerase and a fluorophore-labeled nucleotide. Additionally, fluorogenic nucleotides that utilize an intramolecular interaction between a fluorophore and the nucleobase or an intramolecular FRET effect have been successfully developed to study a variety of different enzymes. Finally, also some novel techniques applying electron paramagnetic resonance (EPR)-based detection of nucleotide cleavage or the detection of the cleavage of fluorophosphates are discussed. Taken together, nucleotides modified at the terminal phosphate position have been applied to study the activity of a large diversity of proteins and are valuable tools to enhance the knowledge of biological systems.
PWC - PAIRWISE COMPARISON SOFTWARE: SOFTWARE PROGRAM FOR PAIRWISE COMPARISON TASK FOR PSYCHOMETRIC SCALING AND COGNITIVE RESEARCH

NASA Technical Reports Server (NTRS)

Ricks, W. R.

1994-01-01

PWC is used for pair-wise comparisons in both psychometric scaling techniques and cognitive research. The cognitive tasks and processes of a human operator of automated systems are now prominent considerations when defining system requirements. Recent developments in cognitive research have emphasized the potential utility of psychometric scaling techniques, such as multidimensional scaling, for representing human knowledge and cognitive processing structures. Such techniques involve collecting measurements of stimulus-relatedness from human observers. When data are analyzed using this scaling approach, an n-dimensional representation of the stimuli is produced. This resulting representation is said to describe the subject's cognitive or perceptual view of the stimuli. PWC applies one of the many techniques commonly used to acquire the data necessary for these types of analyses: pair-wise comparisons. PWC administers the task, collects the data from the test subject, and formats the data for analysis. It therefore addresses many of the limitations of the traditional "pen-and-paper" methods. By automating the data collection process, subjects are prevented from going back to check previous responses, the possibility of erroneous data transfer is eliminated, and the burden of the administration and taking of the test is eased. By using randomization, PWC ensures that subjects see the stimuli pairs presented in random order, and that each subject sees pairs in a different random order. PWC is written in Turbo Pascal v6.0 for IBM PC compatible computers running MS-DOS. The program has also been successfully compiled with Turbo Pascal v7.0. A sample executable is provided. PWC requires 30K of RAM for execution. The standard distribution medium for this program is a 5.25 inch 360K MS-DOS format diskette. Two electronic versions of the documentation are included on the diskette: one in ASCII format and one in MS Word for Windows format. PWC was developed in 1993.
Nucleotide Catabolism on the Surface of Aortic Valve Xenografts; Effects of Different Decellularization Strategies.

PubMed

Kutryb-Zajac, Barbara; Yuen, Ada H Y; Khalpey, Zain; Zukowska, Paulina; Slominska, Ewa M; Taylor, Patricia M; Goldstein, Steven; Heacox, Albert E; Lavitrano, Marialuisa; Chester, Adrian H; Yacoub, Magdi H; Smolenski, Ryszard T

2016-04-01

Extracellular nucleotide metabolism controls thrombosis and inflammation and may affect degeneration and calcification of aortic valve prostheses. We evaluated the effect of different decellularization strategies on enzyme activities involved in extracellular nucleotide metabolism. Porcine valves were tested intact or decellularized either by detergent treatment or hypotonic lysis and nuclease digestion. The rates of ATP hydrolysis, AMP hydrolysis, and adenosine deamination were estimated by incubation of aorta or valve leaflet sections with substrates followed by HPLC analysis. We demonstrated relatively high activities of ecto-enzymes on porcine valve as compared to the aortic wall. Hypotonic lysis/nuclease digestion preserved >80 % of ATP and AMP hydrolytic activity but reduced adenosine deamination to <10 %. Detergent decellularization completely removed (<5 %) all these activities. These results demonstrate high intensity of extracellular nucleotide metabolism on valve surface and indicate that various valve decellularization techniques differently affect ecto-enzyme activities that could be important in the development of improved valve prostheses.
Does the choice of nucleotide substitution models matter topologically?

PubMed

Hoff, Michael; Orf, Stefan; Riehm, Benedikt; Darriba, Diego; Stamatakis, Alexandros

2016-03-24

In the context of a master level programming practical at the computer science department of the Karlsruhe Institute of Technology, we developed and make available an open-source code for testing all 203 possible nucleotide substitution models in the Maximum Likelihood (ML) setting under the common Akaike, corrected Akaike, and Bayesian information criteria. We address the question if model selection matters topologically, that is, if conducting ML inferences under the optimal, instead of a standard General Time Reversible model, yields different tree topologies. We also assess, to which degree models selected and trees inferred under the three standard criteria (AIC, AICc, BIC) differ. Finally, we assess if the definition of the sample size (#sites versus #sites × #taxa) yields different models and, as a consequence, different tree topologies. We find that, all three factors (by order of impact: nucleotide model selection, information criterion used, sample size definition) can yield topologically substantially different final tree topologies (topological difference exceeding 10 %) for approximately 5 % of the tree inferences conducted on the 39 empirical datasets used in our study. We find that, using the best-fit nucleotide substitution model may change the final ML tree topology compared to an inference under a default GTR model. The effect is less pronounced when comparing distinct information criteria. Nonetheless, in some cases we did obtain substantial topological differences.
Gause's Principle and the Effect of Resource Partitioning on the Dynamical Coexistence of Replicating Templates

PubMed Central

Szilágyi, András; Zachar, István; Szathmáry, Eörs

2013-01-01

Models of competitive template replication, although basic for replicator dynamics and primordial evolution, have not yet taken different sequences explicitly into account, neither have they analyzed the effect of resource partitioning (feeding on different resources) on coexistence. Here we show by analytical and numerical calculations that Gause's principle of competitive exclusion holds for template replicators if resources (nucleotides) affect growth linearly and coexistence is at fixed point attractors. Cases of complementary or homologous pairing between building blocks with parallel or antiparallel strands show no deviation from the rule that the nucleotide compositions of stably coexisting species must be different and there cannot be more coexisting replicator species than nucleotide types. Besides this overlooked mechanism of template coexistence we show also that interesting sequence effects prevail as parts of sequences that are copied earlier affect coexistence more strongly due to the higher concentration of the corresponding replication intermediates. Template and copy always count as one species due their constraint of strict stoichiometric coupling. Stability of fixed-point coexistence tends to decrease with the length of sequences, although this effect is unlikely to be detrimental for sequences below 100 nucleotides. In sum, resource partitioning (niche differentiation) is the default form of competitive coexistence for replicating templates feeding on a cocktail of different nucleotides, as it may have been the case in the RNA world. Our analysis of different pairing and strand orientation schemes is relevant for artificial and potentially astrobiological genetics. PMID:23990769
Transcripts of the NADH-dehydrogenase subunit 3 gene are differentially edited in Oenothera mitochondria.

PubMed Central

Schuster, W; Wissinger, B; Unseld, M; Brennicke, A

1990-01-01

A number of cytosines are altered to be recognized as uridines in transcripts of the nad3 locus in mitochondria of the higher plant Oenothera. Such nucleotide modifications can be found at 16 different sites within the nad3 coding region. Most of these alterations in the mRNA sequence change codon identities to specify amino acids better conserved in evolution. Individual cDNA clones differ in their degree of editing at five nucleotide positions, three of which are silent, while two lead to codon alterations specifying different amino acids. None of the cDNA clones analysed is maximally edited at all possible sites, suggesting slow processing or lowered stringency of editing at these nucleotides. Differentially edited transcripts could be editing intermediates or could code for differing polypeptides. Two edited nucleotides in an open reading frame located upstream of nad3 change two amino acids in the deduced polypeptide. Part of the well-conserved ribosomal protein gene rps12 also encoded downstream of nad3 in other plants, is lost in Oenothera mitochondria by recombination events. The functional rps12 protein must be imported from the cytoplasm since the deleted sequences of this gene are not found in the Oenothera mitochondrial genome. The pseudogene sequence is not edited at any nucleotide position. Images Fig. 3. Fig. 4. Fig. 7. PMID:1688531
Comparison of type 2 diabetes mellitus incidence in different phases of hepatitis B virus infection: A meta-analysis.

PubMed

Shen, Yi; Zhang, Sheng; Wang, Xulin; Wang, Yuanyuan; Zhang, Jian; Qin, Gang; Li, Wenchao; Ding, Kun; Zhang, Lei; Liang, Feng

2017-10-01

Because whether hepatitis B virus infection increases the risk of type 2 diabetes mellitus has been a controversial topic, pair-wise and network meta-analyses of published literature were carried out to accurately evaluate the association between different phases of hepatitis B virus infection and the risk of type 2 diabetes mellitus. A comprehensive literature retrieval was conducted from the PubMed, Embase, Cochrane Library and Chinese Database to identify epidemiological studies on the association between hepatitis B virus infection and the risk of type 2 diabetes mellitus that were published from 1999 to 2015. A pair-wise meta-analysis of direct evidence was performed to estimate the pooled odds ratios and 95% confidence intervals. A network meta-analysis was conducted, including the construction of a network plot, inconsistency plot, predictive interval plot, comparison-adjusted funnel plot and rank diagram, to graphically link the direct and indirect comparisons between different hepatitis B virus infective phases. Eighteen publications (n=113 639) describing 32 studies were included in this meta-analysis. In the pair-wise meta-analysis, the pooled odds ratio for type 2 diabetes mellitus in chronic hepatitis B cirrhosis patients was 1.76 (95% confidence interval: 1.44-2.14) when compared with non-cirrhotic chronic hepatitis B patients. In the network meta-analysis, six comparisons of four hepatitis B virus infectious states indicated the following descending order for the risk of type 2 diabetes mellitus: hepatitis B cirrhosis patients, non-cirrhotic chronic hepatitis B patients, hepatitis B virus carriers and non-hepatitis B virus controls. This study suggests that hepatitis B virus infection is not an independent risk factor for type 2 diabetes mellitus, but the development of cirrhosis may increase the incidence of type 2 diabetes mellitus cirrhosis. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Generation of Synthetic Spike Trains with Defined Pairwise Correlations

PubMed Central

Niebur, Ernst

2008-01-01

Recent technological advances as well as progress in theoretical understanding of neural systems have created a need for synthetic spike trains with controlled mean rate and pairwise cross-correlation. This report introduces and analyzes a novel algorithm for the generation of discretized spike trains with arbitrary mean rates and controlled cross correlation. Pairs of spike trains with any pairwise correlation can be generated, and higher-order correlations are compatible with common synaptic input. Relations between allowable mean rates and correlations within a population are discussed. The algorithm is highly efficient, its complexity increasing linearly with the number of spike trains generated and therefore inversely with the number of cross-correlated pairs. PMID:17521277
Nucleotide sequence analysis of the recA gene and discrimination of the three isolates of urease-positive thermophilic Campylobacter (UPTC) isolated from seagulls (Larus spp.) in Northern Ireland.

PubMed

Matsuda, M; Tai, K; Moore, J E; Millar, B C; Murayama, O

2004-01-01

Nucleotide sequencing after TA cloning of the amplicon of the almost-full length recA gene from three strains of UPTC (A1, A2, and A3) isolated from seagulls in Northern Ireland, the phenotypical and genotypical characteristics of which have been demonstrated to be indistinguishable, clarified nucleotide differences at three nucleotide positions among the three strains. In conclusion, the nucleotide sequences of the recA gene were found to discriminate among the three strains of UPTC, A1, A2, and A3, which are indistinguishable phenotypically and genotypically. Thus, the present study strongly suggests that nucleotide sequence data of the amplicon of a suitable gene or region could aid in discriminating among isolates of the UPTC group, which are indistinguishable phenotypically and genotypically. Copyright 2004 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
ParallABEL: an R library for generalized parallelization of genome-wide association studies.

PubMed

Sangket, Unitsa; Mahasirimongkol, Surakameth; Chantratita, Wasun; Tandayya, Pichaya; Aulchenko, Yurii S

2010-04-29

Genome-Wide Association (GWA) analysis is a powerful method for identifying loci associated with complex traits and drug response. Parts of GWA analyses, especially those involving thousands of individuals and consuming hours to months, will benefit from parallel computation. It is arduous acquiring the necessary programming skills to correctly partition and distribute data, control and monitor tasks on clustered computers, and merge output files. Most components of GWA analysis can be divided into four groups based on the types of input data and statistical outputs. The first group contains statistics computed for a particular Single Nucleotide Polymorphism (SNP), or trait, such as SNP characterization statistics or association test statistics. The input data of this group includes the SNPs/traits. The second group concerns statistics characterizing an individual in a study, for example, the summary statistics of genotype quality for each sample. The input data of this group includes individuals. The third group consists of pair-wise statistics derived from analyses between each pair of individuals in the study, for example genome-wide identity-by-state or genomic kinship analyses. The input data of this group includes pairs of SNPs/traits. The final group concerns pair-wise statistics derived for pairs of SNPs, such as the linkage disequilibrium characterisation. The input data of this group includes pairs of individuals. We developed the ParallABEL library, which utilizes the Rmpi library, to parallelize these four types of computations. ParallABEL library is not only aimed at GenABEL, but may also be employed to parallelize various GWA packages in R. The data set from the North American Rheumatoid Arthritis Consortium (NARAC) includes 2,062 individuals with 545,080, SNPs' genotyping, was used to measure ParallABEL performance. Almost perfect speed-up was achieved for many types of analyses. For example, the computing time for the identity-by-state matrix was linearly reduced from approximately eight hours to one hour when ParallABEL employed eight processors. Executing genome-wide association analysis using the ParallABEL library on a computer cluster is an effective way to boost performance, and simplify the parallelization of GWA studies. ParallABEL is a user-friendly parallelization of GenABEL.
A universal genomic coordinate translator for comparative genomics

PubMed Central

2014-01-01

Background Genomic duplications constitute major events in the evolution of species, allowing paralogous copies of genes to take on fine-tuned biological roles. Unambiguously identifying the orthology relationship between copies across multiple genomes can be resolved by synteny, i.e. the conserved order of genomic sequences. However, a comprehensive analysis of duplication events and their contributions to evolution would require all-to-all genome alignments, which increases at N2 with the number of available genomes, N. Results Here, we introduce Kraken, software that omits the all-to-all requirement by recursively traversing a graph of pairwise alignments and dynamically re-computing orthology. Kraken scales linearly with the number of targeted genomes, N, which allows for including large numbers of genomes in analyses. We first evaluated the method on the set of 12 Drosophila genomes, finding that orthologous correspondence computed indirectly through a graph of multiple synteny maps comes at minimal cost in terms of sensitivity, but reduces overall computational runtime by an order of magnitude. We then used the method on three well-annotated mammalian genomes, human, mouse, and rat, and show that up to 93% of protein coding transcripts have unambiguous pairwise orthologous relationships across the genomes. On a nucleotide level, 70 to 83% of exons match exactly at both splice junctions, and up to 97% on at least one junction. We last applied Kraken to an RNA-sequencing dataset from multiple vertebrates and diverse tissues, where we confirmed that brain-specific gene family members, i.e. one-to-many or many-to-many homologs, are more highly correlated across species than single-copy (i.e. one-to-one homologous) genes. Not limited to protein coding genes, Kraken also identifies thousands of newly identified transcribed loci, likely non-coding RNAs that are consistently transcribed in human, chimpanzee and gorilla, and maintain significant correlation of expression levels across species. Conclusions Kraken is a computational genome coordinate translator that facilitates cross-species comparisons, distinguishes orthologs from paralogs, and does not require costly all-to-all whole genome mappings. Kraken is freely available under LPGL from http://github.com/nedaz/kraken. PMID:24976580
A universal genomic coordinate translator for comparative genomics.

PubMed

Zamani, Neda; Sundström, Görel; Meadows, Jennifer R S; Höppner, Marc P; Dainat, Jacques; Lantz, Henrik; Haas, Brian J; Grabherr, Manfred G

2014-06-30

Genomic duplications constitute major events in the evolution of species, allowing paralogous copies of genes to take on fine-tuned biological roles. Unambiguously identifying the orthology relationship between copies across multiple genomes can be resolved by synteny, i.e. the conserved order of genomic sequences. However, a comprehensive analysis of duplication events and their contributions to evolution would require all-to-all genome alignments, which increases at N2 with the number of available genomes, N. Here, we introduce Kraken, software that omits the all-to-all requirement by recursively traversing a graph of pairwise alignments and dynamically re-computing orthology. Kraken scales linearly with the number of targeted genomes, N, which allows for including large numbers of genomes in analyses. We first evaluated the method on the set of 12 Drosophila genomes, finding that orthologous correspondence computed indirectly through a graph of multiple synteny maps comes at minimal cost in terms of sensitivity, but reduces overall computational runtime by an order of magnitude. We then used the method on three well-annotated mammalian genomes, human, mouse, and rat, and show that up to 93% of protein coding transcripts have unambiguous pairwise orthologous relationships across the genomes. On a nucleotide level, 70 to 83% of exons match exactly at both splice junctions, and up to 97% on at least one junction. We last applied Kraken to an RNA-sequencing dataset from multiple vertebrates and diverse tissues, where we confirmed that brain-specific gene family members, i.e. one-to-many or many-to-many homologs, are more highly correlated across species than single-copy (i.e. one-to-one homologous) genes. Not limited to protein coding genes, Kraken also identifies thousands of newly identified transcribed loci, likely non-coding RNAs that are consistently transcribed in human, chimpanzee and gorilla, and maintain significant correlation of expression levels across species. Kraken is a computational genome coordinate translator that facilitates cross-species comparisons, distinguishes orthologs from paralogs, and does not require costly all-to-all whole genome mappings. Kraken is freely available under LPGL from http://github.com/nedaz/kraken.
Hierarchical semi-numeric method for pairwise fuzzy group decision making.

PubMed

Marimin, M; Umano, M; Hatono, I; Tamura, H

2002-01-01

Gradual improvements to a single-level semi-numeric method, i.e., linguistic labels preference representation by fuzzy sets computation for pairwise fuzzy group decision making are summarized. The method is extended to solve multiple criteria hierarchical structure pairwise fuzzy group decision-making problems. The problems are hierarchically structured into focus, criteria, and alternatives. Decision makers express their evaluations of criteria and alternatives based on each criterion by using linguistic labels. The labels are converted into and processed in triangular fuzzy numbers (TFNs). Evaluations of criteria yield relative criteria weights. Evaluations of the alternatives, based on each criterion, yield a degree of preference for each alternative or a degree of satisfaction for each preference value. By using a neat ordered weighted average (OWA) or a fuzzy weighted average operator, solutions obtained based on each criterion are aggregated into final solutions. The hierarchical semi-numeric method is suitable for solving a larger and more complex pairwise fuzzy group decision-making problem. The proposed method has been verified and applied to solve some real cases and is compared to Saaty's (1996) analytic hierarchy process (AHP) method.

A new method of content based medical image retrieval and its applications to CT imaging sign retrieval.

PubMed

Ma, Ling; Liu, Xiabi; Gao, Yan; Zhao, Yanfeng; Zhao, Xinming; Zhou, Chunwu

2017-02-01

This paper proposes a new method of content based medical image retrieval through considering fused, context-sensitive similarity. Firstly, we fuse the semantic and visual similarities between the query image and each image in the database as their pairwise similarities. Then, we construct a weighted graph whose nodes represent the images and edges measure their pairwise similarities. By using the shortest path algorithm over the weighted graph, we obtain a new similarity measure, context-sensitive similarity measure, between the query image and each database image to complete the retrieval process. Actually, we use the fused pairwise similarity to narrow down the semantic gap for obtaining a more accurate pairwise similarity measure, and spread it on the intrinsic data manifold to achieve the context-sensitive similarity for a better retrieval performance. The proposed method has been evaluated on the retrieval of the Common CT Imaging Signs of Lung Diseases (CISLs) and achieved not only better retrieval results but also the satisfactory computation efficiency. Copyright Â© 2017 Elsevier Inc. All rights reserved.
Frequency-Dependent Selection: The High Potential for Permanent Genetic Variation in the Diallelic, Pairwise Interaction Model

PubMed Central

Asmussen, M. A.; Basnayake, E.

1990-01-01

A detailed analytic and numerical study is made of the potential for permanent genetic variation in frequency-dependent models based on pairwise interactions among genotypes at a single diallelic locus. The full equilibrium structure and qualitative gene-frequency dynamics are derived analytically for a symmetric model, in which pairwise fitnesses are chiefly determined by the genetic similarity of the individuals involved. This is supplemented by an extensive numerical investigation of the general model, the symmetric model, and nine other special cases. Together the results show that there is a high potential for permanent genetic diversity in the pairwise interaction model, and provide insight into the extent to which various forms of genotypic interactions enhance or reduce this potential. Technically, although two stable polymorphic equilibria are possible, the increased likelihood of maintaining both alleles, and the poor performance of protected polymorphism conditions as a measure of this likelihood, are primarily due to a greater variety and frequency of equilibrium patterns with one stable polymorphic equilibrium, in conjunction with a disproportionately large domain of attraction for stable internal equilibria. PMID:2341034
Evidence for the role of hydrophobic forces on the interactions of nucleotide-monophosphates with cationic liposomes.

PubMed

Cuomo, Francesca; Mosca, Monica; Murgia, Sergio; Avino, Pasquale; Ceglie, Andrea; Lopez, Francesco

2013-11-15

In this work, the interaction of nucleotide-monophosphates (NMPs) with unilamellar liposomes made of 1,2-Dioleoyl-3-Trimethylammonium-Propane (DOTAP) and 1,2-Dioleoyl-sn-Glycero-3-Phosphoethanolamine (DOPE) was investigated. Here, we demonstrate how adsorption is affected by the type of nucleotide-monophosphate. Dynamic light scattering (DLS) results revealed, for each NMP, that a distinguishable concentration exists at which a significant growth of the aggregates occurs. Adenosine 5'-monophosphate (AMP) and guanosine 5'-monophosphate (GMP) have shown a higher propensity to induce liposome aggregation process and in particular GMP appears to be the most effective. From ζ-potential experiments we found that liposomes loaded with purine based nucleotides (AMP and GMP) are able to decrease the ζ-potential values to a greater extent in comparison with the pyrimidine based nucleotides thimydine 5'-monophosphate (TMP) and uridine 5'-monophosphate (UMP). Moreover, a careful analysis of nucleotide-liposome interactions revealed that nucleotides have different capacity to induce the formation of nucleotide-liposome complexes, and purine based nucleotides have higher affinities with lipid membranes. On the whole, the data emphasize that the mechanisms driving the interactions between liposomes and NMPs are also influenced by the existence of hydrophobic forces. Copyright © 2013 Elsevier Inc. All rights reserved.
Statistical properties of the Jukes-Holmquist method of estimating the number of nucleotide substitutions: reply to Holmquist and Conroy's criticism.

PubMed

Nei, M; Tateno, Y

1981-01-01

Conducting computer simulations, Nei and Tateno (1978) have shown that Jukes and Holmquist's (1972) method of estimating the number of nucleotide substitutions tends to give an overestimate and the estimate obtained has a large variance. Holmquist and Conroy (1980) repeated some parts of our simulation and claim that the overestimation of nucleotide substitutions in our paper occurred mainly because we used selected data. Examination of Holmquist and Conroy's simulation indicates that their results are essentially the same as ours when the Jukes-Holmquist method is used, but since they used a different method of computation their estimates of nucleotide substitutions differed substantially from ours. Another problem in Holmquist and Conroy's Letter is that they confused the expected number of nucleotide substitution with the number in a sample. This confusion has resulted in a number of unnecessary arguments. They also criticized our X2 measure, but this criticism is apparently due to a misunderstanding of the assumptions of our method and a failure to use our method in the way we described. We believe that our earlier conclusions remain unchanged.
The C-terminal Helix of Pseudomonas aeruginosa Elongation Factor Ts Tunes EF-Tu Dynamics to Modulate Nucleotide Exchange.

PubMed

De Laurentiis, Evelina Ines; Mercier, Evan; Wieden, Hans-Joachim

2016-10-28

Little is known about the conservation of critical kinetic parameters and the mechanistic strategies of elongation factor (EF) Ts-catalyzed nucleotide exchange in EF-Tu in bacteria and particularly in clinically relevant pathogens. EF-Tu from the clinically relevant pathogen Pseudomonas aeruginosa shares over 84% sequence identity with the corresponding elongation factor from Escherichia coli Interestingly, the functionally closely linked EF-Ts only shares 55% sequence identity. To identify any differences in the nucleotide binding properties, as well as in the EF-Ts-mediated nucleotide exchange reaction, we performed a comparative rapid kinetics and mutagenesis analysis of the nucleotide exchange mechanism for both the E. coli and P. aeruginosa systems, identifying helix 13 of EF-Ts as a previously unnoticed regulatory element in the nucleotide exchange mechanism with species-specific elements. Our findings support the base side-first entry of the nucleotide into the binding pocket of the EF-Tu·EF-Ts binary complex, followed by displacement of helix 13 and rapid binding of the phosphate side of the nucleotide, ultimately leading to the release of EF-Ts. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Array of nucleic acid probes on biological chips for diagnosis of HIV and methods of using the same

DOEpatents

Chee, Mark; Gingeras, Thomas R.; Fodor, Stephen P. A.; Hubble, Earl A.; Morris, MacDonald S.

1999-01-19

The invention provides an array of oligonucleotide probes immobilized on a solid support for analysis of a target sequence from a human immunodeficiency virus. The array comprises at least four sets of oligonucleotide probes 9 to 21 nucleotides in length. A first probe set has a probe corresponding to each nucleotide in a reference sequence from a human immunodeficiency virus. A probe is related to its corresponding nucleotide by being exactly complementary to a subsequence of the reference sequence that includes the corresponding nucleotide. Thus, each probe has a position, designated an interrogation position, that is occupied by a complementary nucleotide to the corresponding nucleotide. The three additional probe sets each have a corresponding probe for each probe in the first probe set. Thus, for each nucleotide in the reference sequence, there are four corresponding probes, one from each of the probe sets. The three corresponding probes in the three additional probe sets are identical to the corresponding probe from the first probe or a subsequence thereof that includes the interrogation position, except that the interrogation position is occupied by a different nucleotide in each of the four corresponding probes.
Restructurable VLSI Program

DTIC Science & Technology

1981-03-31

logic testing element and a concomitant testability criterion ideally suited to dynamic circuit applications and appro- priate for automatic computer...making connections automatically . PF is an experimental feature which provides users with only four different chip sizes (full, half, quarter, and eighth...initial solution is found constructively which is improved by pair-wise swapping. Results show, however, that the constructive initial sorter , which
Amphibian Communities Under Diverse Forest Management In The Ouachita Mountains, Arkansas

Treesearch

Stanley F. Fox; Paul A. Shipman; Ronald E. Thill; Joseph P. Phelps; David M. Leslie

2004-01-01

Abstract - From May 1995 to March 1999, we censused amphibians in the Ouachita Mountains, Arkansas, on 60 plots on each of four forested watersheds five times per year, with new plots each year. We found negligible differences in species richness among watersheds, and community similarities were high, even though most pairwise comparisons were...
MRSA Transmission Dynamics Among Interconnected Acute, Intermediate-Term, and Long-Term Healthcare Facilities in Singapore.

PubMed

Chow, Angela; Lim, Vanessa W; Khan, Ateeb; Pettigrew, Kerry; Lye, David C B; Kanagasabai, Kala; Phua, Kelvin; Krishnan, Prabha; Ang, Brenda; Marimuthu, Kalisvar; Hon, Pei-Yun; Koh, Jocelyn; Leong, Ian; Parkhill, Julian; Hsu, Li-Yang; Holden, Matthew T G

2017-05-15

Methicillin-resistant Staphylococcus aureus (MRSA) is the most common healthcare-associated multidrug-resistant organism. Despite the interconnectedness between acute care hospitals (ACHs) and intermediate- and long-term care facilities (ILTCFs), the transmission dynamics of MRSA between healthcare settings is not well understood. We conducted a cross-sectional study in a network comprising an ACH and 5 closely affiliated ILTCFs in Singapore. A total of 1700 inpatients were screened for MRSA over a 6-week period in 2014. MRSA isolates underwent whole-genome sequencing, with a pairwise single-nucleotide polymorphism (Hamming distance) cutoff of 60 core genome single-nucleotide polymorphisms used to define recent transmission clusters (clades) for the 3 major clones. MRSA prevalence was significantly higher in intermediate-term (29.9%) and long-term (20.4%) care facilities than in the ACH (11.8%) (P < .001). The predominant clones were sequence type [ST] 22 (n = 183; 47.8%), ST45 (n = 129; 33.7%), and ST239 (n = 26; 6.8%), with greater diversity of STs in ILTCFs relative to the ACH. A large proportion of the clades in ST22 (14 of 21 clades; 67%) and ST45 (7 of 13; 54%) included inpatients from the ACH and ILTCFs. The most frequent source of the interfacility transmissions was the ACH (n = 28 transmission events; 36.4%). MRSA transmission dynamics between the ACH and ILTCFs were complex. The greater diversity of STs in ILTCFs suggests that the ecosystem in such settings might be more conducive for intrafacility transmission events. ST22 and ST45 have successfully established themselves in ILTCFs. The importance of interconnected infection prevention and control measures and strategies cannot be overemphasized. © The Author 2017. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail: journals.permissions@oup.com.
Genetic diversity and relationships among six local cattle populations in semi-arid areas assessed by a bovine medium-density single nucleotide polymorphism data.

PubMed

Boushaba, N; Boujenane, I; Moazami-Goudarzi, K; Flori, L; Saïdi-Mehtar, N; Tabet-Aoul, N; Laloë, D

2018-06-18

The local cattle populations belonging to the 'Brune de l'Atlas' cattle in Algeria and Morocco are potential resources in terms of genetic diversity and socioeconomic prevalence and their characterization is an essential step in any program designed to conserve genetic diversity. Our objectives were to assess the genetic diversity, the population structure and relationships among four Algerian cattle breeds, the Biskra, Cheurfa, Chelifienne and Guelmoise and of two Moroccan, the Oulmès-Zaër and Tidili by genotyping 50 309 single nucleotide polymorphism in 203 unrelated animals. A low population structure was observed across breeds with pairwise F ST values ranging from 0.008 to 0.043, suggesting a high level of gene flow. These data were combined with the available data on cattle populations representative of Europe (EUT), West African taurine (WAT) and zebu (ZEB). Principle Components Analysis was carried out which revealed that the Maghrebin populations are closer to the EUT/ZEB population than to the WAT. Structure analysis confirmed this mixed origin of the Maghrebin cattle populations. We also detected the influence of zebu breeds in Cheurfa and Guelmoise populations. This study provides the first information about genetic diversity within and between Algerian and Moroccan cattle populations and gives a detailed description of their genetic structure and relationships according to their historical origins. This study revealed that several combined effects contributed to shape the genetic diversity of the six Maghrebin populations studied: (i) gene flow among local breeds, (ii) the recent introgression of European breeds in local Algerian breeds and (iii) the traditional management systems. The results of this study will primarily assist policy makers and livestock keepers to make useful decisions for improvement of genetic resources while ensuring the preservation and conservation of local breeds in Algeria and Morocco.
Genome-wide diversity and differentiation in New World populations of the human malaria parasite Plasmodium vivax

PubMed Central

de Oliveira, Thais C.; Rodrigues, Priscila T.; Menezes, Maria José; Gonçalves-Lopes, Raquel M.; Bastos, Melissa S.; Lima, Nathália F.; Barbosa, Susana; Gerber, Alexandra L.; Loss de Morais, Guilherme; Berná, Luisa; Phelan, Jody; Robello, Carlos; de Vasconcelos, Ana Tereza R.

2017-01-01

Background The Americas were the last continent colonized by humans carrying malaria parasites. Plasmodium falciparum from the New World shows very little genetic diversity and greater linkage disequilibrium, compared with its African counterparts, and is clearly subdivided into local, highly divergent populations. However, limited available data have revealed extensive genetic diversity in American populations of another major human malaria parasite, P. vivax. Methods We used an improved sample preparation strategy and next-generation sequencing to characterize 9 high-quality P. vivax genome sequences from northwestern Brazil. These new data were compared with publicly available sequences from recently sampled clinical P. vivax isolates from Brazil (BRA, total n = 11 sequences), Peru (PER, n = 23), Colombia (COL, n = 31), and Mexico (MEX, n = 19). Principal findings/Conclusions We found that New World populations of P. vivax are as diverse (nucleotide diversity π between 5.2 × 10−4 and 6.2 × 10−4) as P. vivax populations from Southeast Asia, where malaria transmission is substantially more intense. They display several non-synonymous nucleotide substitutions (some of them previously undescribed) in genes known or suspected to be involved in antimalarial drug resistance, such as dhfr, dhps, mdr1, mrp1, and mrp-2, but not in the chloroquine resistance transporter ortholog (crt-o) gene. Moreover, P. vivax in the Americas is much less geographically substructured than local P. falciparum populations, with relatively little between-population genome-wide differentiation (pairwise FST values ranging between 0.025 and 0.092). Finally, P. vivax populations show a rapid decline in linkage disequilibrium with increasing distance between pairs of polymorphic sites, consistent with very frequent outcrossing. We hypothesize that the high diversity of present-day P. vivax lineages in the Americas originated from successive migratory waves and subsequent admixture between parasite lineages from geographically diverse sites. Further genome-wide analyses are required to test the demographic scenario suggested by our data. PMID:28759591
Partial sequencing analysis of the NS5B region confirmed the predominance of hepatitis C virus genotype 1 infection in Jeddah, Saudi Arabia.

PubMed

El Hadad, Sahar; Al-Hamdan, Hesa; Linjawi, Sabah

2017-01-01

Chronic hepatitis C virus (HCV) infection and its progression are major health problems that many countries including Saudi Arabia are facing. Determination of HCV genotypes and subgenotypes is critical for epidemiological and clinical analysis and aids in the determination of the ideal treatment strategy that needs to be followed and the expected therapy response. Although HCV infection has been identified as the second most predominant type of hepatitis in Saudi Arabia, little is known about the molecular epidemiology and genetic variability of HCV circulating in the Jeddah province of Saudi Arabia. The aim of this study was to determine the dominance of various HCV genotypes and subgenotypes circulating in Jeddah using partial sequencing of the NS5B region. To the best of our knowledge, this is the first study of its kind in Saudi Arabia. To characterize HCV genotypes and subgenotypes, serum samples from 56 patients with chronic HCV infection were collected and subjected to partial NS5B gene amplification and sequence analysis. Phylogenetic analysis of the NS5B partial sequences revealed that HCV/1 was the predominant genotype (73%), followed by HCV/4 (24.49%) and HCV/3 (2.04%). Moreover, pairwise analysis also confirmed these results based on the average specific nucleotide distance identity: ±0.112, ±0.112, and ±0.179 for HCV/1, HCV/4, and HCV/3, respectively, without any interference between genotypes. Notably, the phylogenetic tree of the HCV/1 subgenotypes revealed that all the isolates (100%) from the present study belonged to the HCV/1a subgenotype. Our findings also revealed similarities in the nucleotide sequences between HCV circulating in Saudi Arabia and those circulating in countries such as Morocco, Egypt, Canada, India, Pakistan, and France. These results indicated that determination of HCV genotypes and subgenotypes based on partial sequence analysis of the NS5B region is accurate and reliable for HCV subtype determination.
Genetic diversity and epidemiology of infectious hematopoietic necrosis virus in Alaska

USGS Publications Warehouse

Emmenegger, E.G; Meyers, T.R.; Burton, T.O.; Kurath, G.

2000-01-01

Forty-two infectious hematopoietic necrosis virus (IHNV) isolates from Alaska were analyzed using the ribonuclease protection assay (RPA) and nucleotide sequencing. RPA analyses, utilizing 4 probes, N5, N3 (N gene), GF (G gene), and NV (NV gene), determined that the haplotypes of all 3 genes demonstrated a consistent spatial pattern. Virus isolates belonging to the most common haplotype groups were distributed throughout Alaska, whereas isolates in small haplotype groups were obtained from only 1 site (hatchery, lake, etc.). The temporal pattern of the GF haplotypes suggested a 'genetic acclimation' of the G gene, possibly due to positive selection on the glycoprotein. A pairwise comparison of the sequence data determined that the maximum nucleotide diversity of the isolates was 2.75% (10 mismatches) for the NV gene, and 1.99% (6 mismatches) for a 301 base pair region of the G gene, indicating that the genetic diversity of IHNV within Alaska is notably lower than in the more southern portions of the IHNV North American range. Phylogenetic analysis of representative Alaskan sequences and sequences of 12 previously characterized IHNV strains from Washington, Oregon, Idaho, California (USA) and British Columbia (Canada) distinguished the isolates into clusters that correlated with geographic origin and indicated that the Alaskan and British Columbia isolates may have a common viral ancestral lineage. Comparisons of multiple isolates from the same site provided epidemiological insights into viral transmission patterns and indicated that viral evolution, viral introduction, and genetic stasis were the mechanisms involved with IHN virus population dynamics in Alaska. The examples of genetic stasis and the overall low sequence heterogeneity of the Alaskan isolates suggested that they are evolutionarily constrained. This study establishes a baseline of genetic fingerprint patterns and sequence groups representing the genetic diversity of Alaskan IHNV isolates. This information could be used to determine the source of an IHN outbreak and to facilitate decisions in fisheries management of Alaskan salmonid stocks.
Conservation of the structure and organization of lupin mitochondrial nad3 and rps12 genes.

PubMed

Rurek, M; Oczkowski, M; Augustyniak, H

1998-01-01

A high level of the nucleotide sequence conservation of mitochondrial nad3 and rps12 genes was found in four lupin species. The only differences concern three nucleotides in the Lupinus albus rps12 gene and three nucleotides insertion in the L. mutabilis spacer. Northern blot analysis as well as RT-PCR confirmed cotranscription of the L. luteus genes because the transcripts detected were long enough.
A biological inspired fuzzy adaptive window median filter (FAWMF) for enhancing DNA signal processing.

PubMed

Ahmad, Muneer; Jung, Low Tan; Bhuiyan, Al-Amin

2017-10-01

Digital signal processing techniques commonly employ fixed length window filters to process the signal contents. DNA signals differ in characteristics from common digital signals since they carry nucleotides as contents. The nucleotides own genetic code context and fuzzy behaviors due to their special structure and order in DNA strand. Employing conventional fixed length window filters for DNA signal processing produce spectral leakage and hence results in signal noise. A biological context aware adaptive window filter is required to process the DNA signals. This paper introduces a biological inspired fuzzy adaptive window median filter (FAWMF) which computes the fuzzy membership strength of nucleotides in each slide of window and filters nucleotides based on median filtering with a combination of s-shaped and z-shaped filters. Since coding regions cause 3-base periodicity by an unbalanced nucleotides' distribution producing a relatively high bias for nucleotides' usage, such fundamental characteristic of nucleotides has been exploited in FAWMF to suppress the signal noise. Along with adaptive response of FAWMF, a strong correlation between median nucleotides and the Π shaped filter was observed which produced enhanced discrimination between coding and non-coding regions contrary to fixed length conventional window filters. The proposed FAWMF attains a significant enhancement in coding regions identification i.e. 40% to 125% as compared to other conventional window filters tested over more than 250 benchmarked and randomly taken DNA datasets of different organisms. This study proves that conventional fixed length window filters applied to DNA signals do not achieve significant results since the nucleotides carry genetic code context. The proposed FAWMF algorithm is adaptive and outperforms significantly to process DNA signal contents. The algorithm applied to variety of DNA datasets produced noteworthy discrimination between coding and non-coding regions contrary to fixed window length conventional filters. Copyright © 2017 Elsevier B.V. All rights reserved.
Document Level Assessment of Document Retrieval Systems in a Pairwise System Evaluation

ERIC Educational Resources Information Center

Rajagopal, Prabha; Ravana, Sri Devi

2017-01-01

Introduction: The use of averaged topic-level scores can result in the loss of valuable data and can cause misinterpretation of the effectiveness of system performance. This study aims to use the scores of each document to evaluate document retrieval systems in a pairwise system evaluation. Method: The chosen evaluation metrics are document-level…
Pairwise Multiple Comparisons in Single Group Repeated Measures Analysis.

ERIC Educational Resources Information Center

Barcikowski, Robert S.; Elliott, Ronald S.

Research was conducted to provide educational researchers with a choice of pairwise multiple comparison procedures (P-MCPs) to use with single group repeated measures designs. The following were studied through two Monte Carlo (MC) simulations: (1) The T procedure of J. W. Tukey (1953); (2) a modification of Tukey's T (G. Keppel, 1973); (3) the…
Impaired Discrimination Learning in Mice Lacking the NMDA Receptor NR2A Subunit

ERIC Educational Resources Information Center

Brigman, Jonathan L.; Feyder, Michael; Saksida, Lisa M.; Bussey, Timothy J.; Mishina, Masayoshi; Holmes, Andrew

2008-01-01

N-Methyl-D-aspartate receptors (NMDARs) mediate certain forms of synaptic plasticity and learning. We used a touchscreen system to assess NR2A subunit knockout mice (KO) for (1) pairwise visual discrimination and reversal learning and (2) acquisition and extinction of an instrumental response requiring no pairwise discrimination. NR2A KO mice…
Three-dimensional analysis of the uniqueness of the anterior dentition in orthodontically treated patients and twins.

PubMed

Franco, A; Willems, G; Souza, P H C; Tanaka, O M; Coucke, W; Thevissen, P

2017-04-01

Dental uniqueness can be proven if no perfect match in pair-wise morphological comparisons of human dentitions is detected. Establishing these comparisons in a worldwide random population is practically unfeasible due to the need for a large and representative sample size. Sample stratification is an option to reduce sample size. The present study investigated the uniqueness of the human dentition in randomly selected subjects (Group 1), orthodontically treated patients (Group 2), twins (Group 3), and orthodontically treated twins (Group 4) in comparison with a threshold control sample of identical dentitions (Group 5). The samples consisted of digital cast files (DCF) obtained through extraoral 3D scanning. A total of 2.013 pair-wise morphological comparisons were performed (Group 1 n=110, Group 2 n=1.711, Group 3 n=172, Group 4 n=10, Group 5 n=10) with Geomagic Studio ® (3D Systems ® , Rock Hill, SC, USA) software package. Comparisons within groups were performed quantifying the morphological differences between DCF in Euclidean distances. Comparisons between groups were established applying One-way ANOVA. To ensure fair comparisons a post-hoc Power Analysis was performed. ROC analysis was applied to distinguish unique from non-unique dentures. Identical DCF were not detected within the experimental groups (from 1 to 4). The most similar DCF had Euclidian distance of 5.19mm in Group 1, 2.06mm in Group 2, 2.03mm in Group 3, and 1.88mm in Group 4. Groups 2 and 3 were statistically different from Group 5 (p<0.05). Statistically significant difference between Group 4 and 5 revealed to be possible including more pair-wise comparisons in both groups. The ROC analysis revealed sensitivity rate of 80% and specificity between 66.7% and 81.6%. Evidence to sustain the uniqueness of the human dentition in random and stratified populations was observed in the present study. Further studies testing the influence of the quantity of tooth material on morphological difference between dentitions and its impact on uniqueness remain necessary. Copyright © 2017 Elsevier B.V. All rights reserved.
Pairwise-additive hydrophobic effect for alkanes in water

PubMed Central

Wu, Jianzhong; Prausnitz, John M.

2008-01-01

Pairwise additivity of the hydrophobic effect is indicated by reliable experimental Henry's constants for a large number of linear and branched low-molecular-weight alkanes in water. Pairwise additivity suggests that the hydrophobic effect is primarily a local phenomenon and that the hydrophobic interaction may be represented by a semiempirical force field. By representing the hydrophobic potential between two methane molecules as a linear function of the overlap volume of the hydration layers, we find that the contact value of the hydrophobic potential (−0.72 kcal/mol) is smaller than that from quantum mechanics simulations (−2.8 kcal/mol) but is close to that from classical molecular dynamics (−0.5∼−0.9 kcal/mol). PMID:18599448

Molecular population genetics of inversion breakpoint regions in Drosophila pseudoobscura.

PubMed

Wallace, Andre G; Detweiler, Don; Schaeffer, Stephen W

2013-07-08

Paracentric inversions in populations can have a profound effect on the pattern and organization of nucleotide variability along a chromosome. Regions near inversion breakpoints are expected to have greater levels of differentiation because of reduced genetic exchange between different gene arrangements whereas central regions in the inverted segments are predicted to have lower levels of nucleotide differentiation due to greater levels of genetic flux among different karyotypes. We used the inversion polymorphism on the third chromosome of Drosophila pseudoobscura to test these predictions with an analysis of nucleotide diversity of 18 genetic markers near and away from inversion breakpoints. We tested hypotheses about how the presence of different chromosomal arrangements affects the pattern and organization of nucleotide variation. Overall, markers in the distal segment of the chromosome had greater levels of nucleotide heterozygosity than markers within the proximal segment of the chromosome. In addition, our results rejected the hypothesis that the breakpoints of derived inversions will have lower levels of nucleotide variability than breakpoints of ancestral inversions, even when strains with gene conversion events were removed. High levels of linkage disequilibrium were observed within all 11 breakpoint regions as well as between the ends of most proximal and distal breakpoints. The central region of the chromosome had the greatest levels of linkage disequilibrium compared with the proximal and distal regions because this is the region that experiences the highest level of recombination suppression. These data do not fully support the idea that genetic exchange is the sole force that influences genetic variation on inverted chromosomes.
Structural characterization of Helicobacter pylori dethiobiotin synthetase reveals differences between family members

DOE Office of Scientific and Technical Information (OSTI.GOV)

Porebski, Przemyslaw J.; Klimecka, Maria; Chruszcz, Maksymilian

2012-07-11

Dethiobiotin synthetase (DTBS) is involved in the biosynthesis of biotin in bacteria, fungi, and plants. As humans lack this pathway, DTBS is a promising antimicrobial drug target. We determined structures of DTBS from Helicobacter pylori (hpDTBS) bound with cofactors and a substrate analog, and described its unique characteristics relative to other DTBS proteins. Comparison with bacterial DTBS orthologs revealed considerable structural differences in nucleotide recognition. The C-terminal region of DTBS proteins, which contains two nucleotide-recognition motifs, differs greatly among DTBS proteins from different species. The structure of hpDTBS revealed that this protein is unique and does not contain a C-terminalmore » region containing one of the motifs. The single nucleotide-binding motif in hpDTBS is similar to its counterpart in GTPases; however, isothermal titration calorimetry binding studies showed that hpDTBS has a strong preference for ATP. The structural determinants of ATP specificity were assessed with X-ray crystallographic studies of hpDTBS-ATP and hpDTBS-GTP complexes. The unique mode of nucleotide recognition in hpDTBS makes this protein a good target for H. pylori-specific inhibitors of the biotin synthesis pathway.« less
Dependence of Halo Bias and Kinematics on Assembly Variables

NASA Astrophysics Data System (ADS)

Xu, Xiaoju; Zheng, Zheng

2018-06-01

Using dark matter haloes identified in a large N-body simulation, we study halo assembly bias, with halo formation time, peak maximum circular velocity, concentration, and spin as the assembly variables. Instead of grouping haloes at fixed mass into different percentiles of each assembly variable, we present the joint dependence of halo bias on the values of halo mass and each assembly variable. In the plane of halo mass and one assembly variable, the joint dependence can be largely described as halo bias increasing outward from a global minimum. We find it unlikely to have a combination of halo variables to absorb all assembly bias effects. We then present the joint dependence of halo bias on two assembly variables at fixed halo mass. The gradient of halo bias does not necessarily follow the correlation direction of the two assembly variables and it varies with halo mass. Therefore in general for two correlated assembly variables one cannot be used as a proxy for the other in predicting halo assembly bias trend. Finally, halo assembly is found to affect the kinematics of haloes. Low-mass haloes formed earlier can have much higher pairwise velocity dispersion than those of massive haloes. In general, halo assembly leads to a correlation between halo bias and halo pairwise velocity distribution, with more strongly clustered haloes having higher pairwise velocity and velocity dispersion. However, the correlation is not tight, and the kinematics of haloes at fixed halo bias still depends on halo mass and assembly variables.
The construct-behavior gap in behavioral decision research: A challenge beyond replicability.

PubMed

Regenwetter, Michel; Robinson, Maria M

2017-10-01

Behavioral decision research compares theoretical constructs like preferences to behavior such as observed choices. Three fairly common links from constructs to behavior are (1) to tally, across participants and decision problems, the number of choices consistent with one predicted pattern of pairwise preferences; (2) to compare what most people choose in each decision problem against a predicted preference pattern; or (3) to enumerate the decision problems in which two experimental conditions generate a 1-sided significant difference in choice frequency 'consistent' with the theory. Although simple, these theoretical links are heuristics. They are subject to well-known reasoning fallacies, most notably the fallacy of sweeping generalization and the fallacy of composition. No amount of replication can alleviate these fallacies. On the contrary, reiterating logically inconsistent theoretical reasoning over and again across studies obfuscates science. As a case in point, we consider pairwise choices among simple lotteries and the hypotheses of overweighting or underweighting of small probabilities, as well as the description-experience gap. We discuss ways to avoid reasoning fallacies in bridging the conceptual gap between hypothetical constructs, such as, for example, "overweighting" to observable pairwise choice data. Although replication is invaluable, successful replication of hard-to-interpret results is not. Behavioral decision research stands to gain much theoretical and empirical clarity by spelling out precise and formally explicit theories of how hypothetical constructs translate into observable behavior. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Concerted evolution of life stage performances signals recent selection on yeast nitrogen use.

PubMed

Ibstedt, Sebastian; Stenberg, Simon; Bagés, Sara; Gjuvsland, Arne B; Salinas, Francisco; Kourtchenko, Olga; Samy, Jeevan K A; Blomberg, Anders; Omholt, Stig W; Liti, Gianni; Beltran, Gemma; Warringer, Jonas

2015-01-01

Exposing natural selection driving phenotypic and genotypic adaptive differentiation is an extraordinary challenge. Given that an organism's life stages are exposed to the same environmental variations, we reasoned that fitness components, such as the lag, rate, and efficiency of growth, directly reflecting performance in these life stages, should often be selected in concert. We therefore conjectured that correlations between fitness components over natural isolates, in a particular environmental context, would constitute a robust signal of recent selection. Critically, this test for selection requires fitness components to be determined by different genetic loci. To explore our conjecture, we exhaustively evaluated the lag, rate, and efficiency of asexual population growth of natural isolates of the model yeast Saccharomyces cerevisiae in a large variety of nitrogen-limited environments. Overall, fitness components were well correlated under nitrogen restriction. Yeast isolates were further crossed in all pairwise combinations and coinheritance of each fitness component and genetic markers were traced. Trait variations tended to map to quantitative trait loci (QTL) that were private to a single fitness component. We further traced QTLs down to single-nucleotide resolution and uncovered loss-of-function mutations in RIM15, PUT4, DAL1, and DAL4 as the genetic basis for nitrogen source use variations. Effects of SNPs were unique for a single fitness component, strongly arguing against pleiotropy between lag, rate, and efficiency of reproduction under nitrogen restriction. The strong correlations between life stage performances that cannot be explained by pleiotropy compellingly support adaptive differentiation of yeast nitrogen source use and suggest a generic approach for detecting selection. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Molecular Characterization of the Complete Genome of Three Basal-BR Isolates of Turnip mosaic virus Infecting Raphanus sativus in China.

PubMed

Zhu, Fuxiang; Sun, Ying; Wang, Yan; Pan, Hongyu; Wang, Fengting; Zhang, Xianghui; Zhang, Yanhua; Liu, Jinliang

2016-06-04

Turnip mosaic virus (TuMV) infects crops of plant species in the family Brassicaceae worldwide. TuMV isolates were clustered to five lineages corresponding to basal-B, basal-BR, Asian-BR, world-B and OMs. Here, we determined the complete genome sequences of three TuMV basal-BR isolates infecting radish from Shandong and Jilin Provinces in China. Their genomes were all composed of 9833 nucleotides, excluding the 3'-terminal poly(A) tail. They contained two open reading frames (ORFs), with the large one encoding a polyprotein of 3164 amino acids and the small overlapping ORF encoding a PIPO protein of 61 amino acids, which contained the typically conserved motifs found in members of the genus Potyvirus. In pairwise comparison with 30 other TuMV genome sequences, these three isolates shared their highest identities with isolates from Eurasian countries (Germany, Italy, Turkey and China). Recombination analysis showed that the three isolates in this study had no "clear" recombination. The analyses of conserved amino acids changed between groups showed that the codons in the TuMV out group (OGp) and OMs group were the same at three codon sites (852, 1006, 1548), and the other TuMV groups (basal-B, basal-BR, Asian-BR, world-B) were different. This pattern suggests that the codon in the OMs progenitor did not change but that in the other TuMV groups the progenitor sequence did change at divergence. Genetic diversity analyses indicate that the PIPO gene was under the highest selection pressure and the selection pressure on P3N-PIPO and P3 was almost the same. It suggests that most of the selection pressure on P3 was probably imposed through P3N-PIPO.
Filling Gaps in Biodiversity Knowledge for Macrofungi: Contributions and Assessment of an Herbarium Collection DNA Barcode Sequencing Project

PubMed Central

Osmundson, Todd W.; Robert, Vincent A.; Schoch, Conrad L.; Baker, Lydia J.; Smith, Amy; Robich, Giovanni; Mizzan, Luca; Garbelotto, Matteo M.

2013-01-01

Despite recent advances spearheaded by molecular approaches and novel technologies, species description and DNA sequence information are significantly lagging for fungi compared to many other groups of organisms. Large scale sequencing of vouchered herbarium material can aid in closing this gap. Here, we describe an effort to obtain broad ITS sequence coverage of the approximately 6000 macrofungal-species-rich herbarium of the Museum of Natural History in Venice, Italy. Our goals were to investigate issues related to large sequencing projects, develop heuristic methods for assessing the overall performance of such a project, and evaluate the prospects of such efforts to reduce the current gap in fungal biodiversity knowledge. The effort generated 1107 sequences submitted to GenBank, including 416 previously unrepresented taxa and 398 sequences exhibiting a best BLAST match to an unidentified environmental sequence. Specimen age and taxon affected sequencing success, and subsequent work on failed specimens showed that an ITS1 mini-barcode greatly increased sequencing success without greatly reducing the discriminating power of the barcode. Similarity comparisons and nonmetric multidimensional scaling ordinations based on pairwise distance matrices proved to be useful heuristic tools for validating the overall accuracy of specimen identifications, flagging potential misidentifications, and identifying taxa in need of additional species-level revision. Comparison of within- and among-species nucleotide variation showed a strong increase in species discriminating power at 1–2% dissimilarity, and identified potential barcoding issues (same sequence for different species and vice-versa). All sequences are linked to a vouchered specimen, and results from this study have already prompted revisions of species-sequence assignments in several taxa. PMID:23638077
Population Genetics of Lactobacillus sakei Reveals Three Lineages with Distinct Evolutionary Histories

PubMed Central

Chaillou, Stéphane; Lucquin, Isabelle; Najjari, Afef; Zagorec, Monique; Champomier-Vergès, Marie-Christine

2013-01-01

Lactobacillus sakei plays a major role in meat fermentation and in the preservation of fresh meat. The large diversity of L. sakei strains represents a valuable and exploitable asset in the development of a variety of industrial applications; however, an efficient method to identify and classify these strains has yet to be developed. In this study, we used multilocus sequence typing (MLST) to analyze the polymorphism and allelic distribution of eight loci within an L. sakei population of 232 strains collected worldwide. Within this population, we identified 116 unique sequence types with an average pairwise nucleotide diversity per site (π) of 0.13%. Results from Structure, goeBurst, and ClonalFrame software analyses demonstrated that the L. sakei population analyzed here is derived from three ancestral lineages, each of which shows evidence of a unique evolutionary history influenced by independent selection scenarios. However, the signature of selective events in the contemporary population of isolates was somewhat masked by the pervasive phenomenon of homologous recombination. Our results demonstrate that lineage 1 is a completely panmictic subpopulation in which alleles have been continually redistributed through the process of intra-lineage recombination. In contrast, lineage 2 was characterized by a high degree of clonality. Lineage 3, the earliest-diverging branch in the genealogy, showed evidence of both clonality and recombination. These evolutionary histories strongly indicate that the three lineages may correspond to distinct ecotypes, likely linked or specialized to different environmental reservoirs. The MLST scheme developed in this study represents an easy and straightforward tool that can be used to further analyze the population dynamics of L. sakei strains in food products. PMID:24069179
Population genetics of Lactobacillus sakei reveals three lineages with distinct evolutionary histories.

PubMed

Chaillou, Stéphane; Lucquin, Isabelle; Najjari, Afef; Zagorec, Monique; Champomier-Vergès, Marie-Christine

2013-01-01

Lactobacillus sakei plays a major role in meat fermentation and in the preservation of fresh meat. The large diversity of L. sakei strains represents a valuable and exploitable asset in the development of a variety of industrial applications; however, an efficient method to identify and classify these strains has yet to be developed. In this study, we used multilocus sequence typing (MLST) to analyze the polymorphism and allelic distribution of eight loci within an L. sakei population of 232 strains collected worldwide. Within this population, we identified 116 unique sequence types with an average pairwise nucleotide diversity per site (π) of 0.13%. Results from Structure, goeBurst, and ClonalFrame software analyses demonstrated that the L. sakei population analyzed here is derived from three ancestral lineages, each of which shows evidence of a unique evolutionary history influenced by independent selection scenarios. However, the signature of selective events in the contemporary population of isolates was somewhat masked by the pervasive phenomenon of homologous recombination. Our results demonstrate that lineage 1 is a completely panmictic subpopulation in which alleles have been continually redistributed through the process of intra-lineage recombination. In contrast, lineage 2 was characterized by a high degree of clonality. Lineage 3, the earliest-diverging branch in the genealogy, showed evidence of both clonality and recombination. These evolutionary histories strongly indicate that the three lineages may correspond to distinct ecotypes, likely linked or specialized to different environmental reservoirs. The MLST scheme developed in this study represents an easy and straightforward tool that can be used to further analyze the population dynamics of L. sakei strains in food products.
Filling gaps in biodiversity knowledge for macrofungi: contributions and assessment of an herbarium collection DNA barcode sequencing project.

PubMed

Osmundson, Todd W; Robert, Vincent A; Schoch, Conrad L; Baker, Lydia J; Smith, Amy; Robich, Giovanni; Mizzan, Luca; Garbelotto, Matteo M

2013-01-01

Despite recent advances spearheaded by molecular approaches and novel technologies, species description and DNA sequence information are significantly lagging for fungi compared to many other groups of organisms. Large scale sequencing of vouchered herbarium material can aid in closing this gap. Here, we describe an effort to obtain broad ITS sequence coverage of the approximately 6000 macrofungal-species-rich herbarium of the Museum of Natural History in Venice, Italy. Our goals were to investigate issues related to large sequencing projects, develop heuristic methods for assessing the overall performance of such a project, and evaluate the prospects of such efforts to reduce the current gap in fungal biodiversity knowledge. The effort generated 1107 sequences submitted to GenBank, including 416 previously unrepresented taxa and 398 sequences exhibiting a best BLAST match to an unidentified environmental sequence. Specimen age and taxon affected sequencing success, and subsequent work on failed specimens showed that an ITS1 mini-barcode greatly increased sequencing success without greatly reducing the discriminating power of the barcode. Similarity comparisons and nonmetric multidimensional scaling ordinations based on pairwise distance matrices proved to be useful heuristic tools for validating the overall accuracy of specimen identifications, flagging potential misidentifications, and identifying taxa in need of additional species-level revision. Comparison of within- and among-species nucleotide variation showed a strong increase in species discriminating power at 1-2% dissimilarity, and identified potential barcoding issues (same sequence for different species and vice-versa). All sequences are linked to a vouchered specimen, and results from this study have already prompted revisions of species-sequence assignments in several taxa.
A tensorial approach to access cognitive workload related to mental arithmetic from EEG functional connectivity estimates.

PubMed

Dimitriadis, S I; Sun, Yu; Kwok, K; Laskaris, N A; Bezerianos, A

2013-01-01

The association of functional connectivity patterns with particular cognitive tasks has long been a topic of interest in neuroscience, e.g., studies of functional connectivity have demonstrated its potential use for decoding various brain states. However, the high-dimensionality of the pairwise functional connectivity limits its usefulness in some real-time applications. In the present study, the methodology of tensor subspace analysis (TSA) is used to reduce the initial high-dimensionality of the pairwise coupling in the original functional connectivity network to a space of condensed descriptive power, which would significantly decrease the computational cost and facilitate the differentiation of brain states. We assess the feasibility of the proposed method on EEG recordings when the subject was performing mental arithmetic task which differ only in the difficulty level (easy: 1-digit addition v.s. 3-digit additions). Two different cortical connective networks were detected, and by comparing the functional connectivity networks in different work states, it was found that the task-difficulty is best reflected in the connectivity structure of sub-graphs extending over parietooccipital sites. Incorporating this data-driven information within original TSA methodology, we succeeded in predicting the difficulty level from connectivity patterns in an efficient way that can be implemented so as to work in real-time.
Modeling Spatial Dependence of Rainfall Extremes Across Multiple Durations

NASA Astrophysics Data System (ADS)

Le, Phuong Dong; Leonard, Michael; Westra, Seth

2018-03-01

Determining the probability of a flood event in a catchment given that another flood has occurred in a nearby catchment is useful in the design of infrastructure such as road networks that have multiple river crossings. These conditional flood probabilities can be estimated by calculating conditional probabilities of extreme rainfall and then transforming rainfall to runoff through a hydrologic model. Each catchment's hydrological response times are unlikely to be the same, so in order to estimate these conditional probabilities one must consider the dependence of extreme rainfall both across space and across critical storm durations. To represent these types of dependence, this study proposes a new approach for combining extreme rainfall across different durations within a spatial extreme value model using max-stable process theory. This is achieved in a stepwise manner. The first step defines a set of common parameters for the marginal distributions across multiple durations. The parameters are then spatially interpolated to develop a spatial field. Storm-level dependence is represented through the max-stable process for rainfall extremes across different durations. The dependence model shows a reasonable fit between the observed pairwise extremal coefficients and the theoretical pairwise extremal coefficient function across all durations. The study demonstrates how the approach can be applied to develop conditional maps of the return period and return level across different durations.
Dissecting random and systematic differences between noisy composite data sets.

PubMed

Diederichs, Kay

2017-04-01

Composite data sets measured on different objects are usually affected by random errors, but may also be influenced by systematic (genuine) differences in the objects themselves, or the experimental conditions. If the individual measurements forming each data set are quantitative and approximately normally distributed, a correlation coefficient is often used to compare data sets. However, the relations between data sets are not obvious from the matrix of pairwise correlations since the numerical value of the correlation coefficient is lowered by both random and systematic differences between the data sets. This work presents a multidimensional scaling analysis of the pairwise correlation coefficients which places data sets into a unit sphere within low-dimensional space, at a position given by their CC* values [as defined by Karplus & Diederichs (2012), Science, 336, 1030-1033] in the radial direction and by their systematic differences in one or more angular directions. This dimensionality reduction can not only be used for classification purposes, but also to derive data-set relations on a continuous scale. Projecting the arrangement of data sets onto the subspace spanned by systematic differences (the surface of a unit sphere) allows, irrespective of the random-error levels, the identification of clusters of closely related data sets. The method gains power with increasing numbers of data sets. It is illustrated with an example from low signal-to-noise ratio image processing, and an application in macromolecular crystallography is shown, but the approach is completely general and thus should be widely applicable.
“Gate-keeper” Residues and Active-Site Rearrangements in DNA Polymerase μ Help Discriminate Non-cognate Nucleotides

PubMed Central

Li, Yunlang; Schlick, Tamar

2013-01-01

Incorporating the cognate instead of non-cognate substrates is crucial for DNA polymerase function. Here we analyze molecular dynamics simulations of DNA polymerase μ (pol μ) bound to different non-cognate incoming nucleotides including A:dCTP, A:dGTP, A(syn):dGTP, A:dATP, A(syn):dATP, T:dCTP, and T:dGTP to study the structure-function relationships involved with aberrant base pairs in the conformational pathway; while a pol μ complex with the A:dTTP base pair is available, no solved non-cognate structures are available. We observe distinct differences of the non-cognate systems compared to the cognate system. Specifically, the motions of active-site residue His329 and Asp330 distort the active site, and Trp436, Gln440, Glu443 and Arg444 tend to tighten the nucleotide-binding pocket when non-cognate nucleotides are bound; the latter effect may further lead to an altered electrostatic potential within the active site. That most of these “gate-keeper” residues are located farther apart from the upstream primer in pol μ, compared to other X family members, also suggests an interesting relation to pol μ's ability to incorporate nucleotides when the upstream primer is not paired. By examining the correlated motions within pol μ complexes, we also observe different patterns of correlations between non-cognate systems and the cognate system, especially decreased interactions between the incoming nucleotides and the nucleotide-binding pocket. Altered correlated motions in non-cognate systems agree with our recently proposed hybrid conformational selection/induced-fit models. Taken together, our studies propose the following order for difficulty of non-cognate system insertions by pol μ: T:dGTP
SALAD database: a motif-based database of protein annotations for plant comparative genomics

PubMed Central

Mihara, Motohiro; Itoh, Takeshi; Izawa, Takeshi

2010-01-01

Proteins often have several motifs with distinct evolutionary histories. Proteins with similar motifs have similar biochemical properties and thus related biological functions. We constructed a unique comparative genomics database termed the SALAD database (http://salad.dna.affrc.go.jp/salad/) from plant-genome-based proteome data sets. We extracted evolutionarily conserved motifs by MEME software from 209 529 protein-sequence annotation groups selected by BLASTP from the proteome data sets of 10 species: rice, sorghum, Arabidopsis thaliana, grape, a lycophyte, a moss, 3 algae, and yeast. Similarity clustering of each protein group was performed by pairwise scoring of the motif patterns of the sequences. The SALAD database provides a user-friendly graphical viewer that displays a motif pattern diagram linked to the resulting bootstrapped dendrogram for each protein group. Amino-acid-sequence-based and nucleotide-sequence-based phylogenetic trees for motif combination alignment, a logo comparison diagram for each clade in the tree, and a Pfam-domain pattern diagram are also available. We also developed a viewer named ‘SALAD on ARRAYs’ to view arbitrary microarray data sets of paralogous genes linked to the same dendrogram in a window. The SALAD database is a powerful tool for comparing protein sequences and can provide valuable hints for biological analysis. PMID:19854933
Identifying novel sequence variants of RNA 3D motifs

PubMed Central

Zirbel, Craig L.; Roll, James; Sweeney, Blake A.; Petrov, Anton I.; Pirrung, Meg; Leontis, Neocles B.

2015-01-01

Predicting RNA 3D structure from sequence is a major challenge in biophysics. An important sub-goal is accurately identifying recurrent 3D motifs from RNA internal and hairpin loop sequences extracted from secondary structure (2D) diagrams. We have developed and validated new probabilistic models for 3D motif sequences based on hybrid Stochastic Context-Free Grammars and Markov Random Fields (SCFG/MRF). The SCFG/MRF models are constructed using atomic-resolution RNA 3D structures. To parameterize each model, we use all instances of each motif found in the RNA 3D Motif Atlas and annotations of pairwise nucleotide interactions generated by the FR3D software. Isostericity relations between non-Watson–Crick basepairs are used in scoring sequence variants. SCFG techniques model nested pairs and insertions, while MRF ideas handle crossing interactions and base triples. We use test sets of randomly-generated sequences to set acceptance and rejection thresholds for each motif group and thus control the false positive rate. Validation was carried out by comparing results for four motif groups to RMDetect. The software developed for sequence scoring (JAR3D) is structured to automatically incorporate new motifs as they accumulate in the RNA 3D Motif Atlas when new structures are solved and is available free for download. PMID:26130723
A kernel regression approach to gene-gene interaction detection for case-control studies.

PubMed

Larson, Nicholas B; Schaid, Daniel J

2013-11-01

Gene-gene interactions are increasingly being addressed as a potentially important contributor to the variability of complex traits. Consequently, attentions have moved beyond single locus analysis of association to more complex genetic models. Although several single-marker approaches toward interaction analysis have been developed, such methods suffer from very high testing dimensionality and do not take advantage of existing information, notably the definition of genes as functional units. Here, we propose a comprehensive family of gene-level score tests for identifying genetic elements of disease risk, in particular pairwise gene-gene interactions. Using kernel machine methods, we devise score-based variance component tests under a generalized linear mixed model framework. We conducted simulations based upon coalescent genetic models to evaluate the performance of our approach under a variety of disease models. These simulations indicate that our methods are generally higher powered than alternative gene-level approaches and at worst competitive with exhaustive SNP-level (where SNP is single-nucleotide polymorphism) analyses. Furthermore, we observe that simulated epistatic effects resulted in significant marginal testing results for the involved genes regardless of whether or not true main effects were present. We detail the benefits of our methods and discuss potential genome-wide analysis strategies for gene-gene interaction analysis in a case-control study design. © 2013 WILEY PERIODICALS, INC.
SALAD database: a motif-based database of protein annotations for plant comparative genomics.

PubMed

Mihara, Motohiro; Itoh, Takeshi; Izawa, Takeshi

2010-01-01

Proteins often have several motifs with distinct evolutionary histories. Proteins with similar motifs have similar biochemical properties and thus related biological functions. We constructed a unique comparative genomics database termed the SALAD database (http://salad.dna.affrc.go.jp/salad/) from plant-genome-based proteome data sets. We extracted evolutionarily conserved motifs by MEME software from 209,529 protein-sequence annotation groups selected by BLASTP from the proteome data sets of 10 species: rice, sorghum, Arabidopsis thaliana, grape, a lycophyte, a moss, 3 algae, and yeast. Similarity clustering of each protein group was performed by pairwise scoring of the motif patterns of the sequences. The SALAD database provides a user-friendly graphical viewer that displays a motif pattern diagram linked to the resulting bootstrapped dendrogram for each protein group. Amino-acid-sequence-based and nucleotide-sequence-based phylogenetic trees for motif combination alignment, a logo comparison diagram for each clade in the tree, and a Pfam-domain pattern diagram are also available. We also developed a viewer named 'SALAD on ARRAYs' to view arbitrary microarray data sets of paralogous genes linked to the same dendrogram in a window. The SALAD database is a powerful tool for comparing protein sequences and can provide valuable hints for biological analysis.
Targeted Approach to Identify Genetic Loci Associated with ...

EPA Pesticide Factsheets

Extreme tolerance to highly toxic dioxin-like contaminants (DLCs) has evolved independently and contemporaneously in (at least) four populations of Atlantic killifish (Fundulus heteroclitus). Surprisingly, the magnitude and phenotype of DLC tolerance is similar among these killifish populations that have adapted to varied, but highly contaminated urban/industrialized estuaries of the US Atlantic coast. We hypothesized that comparisons among tolerant populations and in contrast to their sensitive neighboring killifish might reveal genetic loci associated with DLC tolerance. Since the aryl hydrocarbon receptor (AHR) pathway partly or fully mediates DLC toxicity in vertebrates, we identified single nucleotide polymorphisms (SNPs) from 43 genes associated with the AHR to serve as targeted markers. Wild fish from the four highly tolerant killifish populations and four nearby sensitive populations were genotyped using 59 SNP markers. Consistent with other killifish population genetic analyses, our results revealed strong genetic differentiation among populations, consistent with isolation by distance models. Pairwise comparisons of nearby tolerant and sensitive populations revealed differentiation among these loci: AHR 1 and 2, cathepsin Z, the cytochrome P450s (CYP) 1A and 3A30, and the NADH ubiquinone oxidoreductase MLRQ subunit. By grouping tolerant versus sensitive populations, we also identified cytochrome P450 1A and the AHR2 loci as under selection, lend
The genome sequence of Agrotis segetum granulovirus, isolate AgseGV-DA, reveals a new Betabaculovirus species of a slow killing granulovirus.

PubMed

Gueli Alletti, Gianpiero; Eigenbrod, Marina; Carstens, Eric B; Kleespies, Regina G; Jehle, Johannes A

2017-06-01

The European isolate Agrotis segetum granulovirus DA (AgseGV-DA) is a slow killing, type I granulovirus due to low dose-mortality responses within seven days post infection and a tissue tropism of infection restricted solely to the fat body of infected Agrotis segetum host larvae. The genome of AgseGV-DA was completely sequenced and compared to the whole genome sequences of the Chinese isolates AgseGV-XJ and AgseGV-L1. All three isolates share highly conserved genomes. The AgseGV-DA genome is 131,557bp in length and encodes for 149 putative open reading frames, including 37 baculovirus core genes and the per os infectivity factor ac110. Comprehensive investigations of repeat regions identified one putative non-hr like origin of replication in AgseGV-DA. Phylogenetic analysis based on concatenated amino acid alignments of 37 baculovirus core genes as well as pairwise distances based on the nucleotide alignments of partial granulin, lef-8 and lef-9 sequences with deposited betabaculoviruses confirmed AgseGV-DA, AgseGV-XJ and AgseGV-L1 as representative isolates of the same Betabaculovirus species. AgseGV encodes for a distinct putative enhancin, distantly related to enhancins from other granuloviruses. Copyright © 2017. Published by Elsevier Inc.

Development of Genetic Markers in Eucalyptus Species by Target Enrichment and Exome Sequencing

PubMed Central

Dasgupta, Modhumita Ghosh; Dharanishanthi, Veeramuthu; Agarwal, Ishangi; Krutovsky, Konstantin V.

2015-01-01

The advent of next-generation sequencing has facilitated large-scale discovery, validation and assessment of genetic markers for high density genotyping. The present study was undertaken to identify markers in genes supposedly related to wood property traits in three Eucalyptus species. Ninety four genes involved in xylogenesis were selected for hybridization probe based nuclear genomic DNA target enrichment and exome sequencing. Genomic DNA was isolated from the leaf tissues and used for on-array probe hybridization followed by Illumina sequencing. The raw sequence reads were trimmed and high-quality reads were mapped to the E. grandis reference sequence and the presence of single nucleotide variants (SNVs) and insertions/ deletions (InDels) were identified across the three species. The average read coverage was 216X and a total of 2294 SNVs and 479 InDels were discovered in E. camaldulensis, 2383 SNVs and 518 InDels in E. tereticornis, and 1228 SNVs and 409 InDels in E. grandis. Additionally, SNV calling and InDel detection were conducted in pair-wise comparisons of E. tereticornis vs. E. grandis, E. camaldulensis vs. E. tereticornis and E. camaldulensis vs. E. grandis. This study presents an efficient and high throughput method on development of genetic markers for family– based QTL and association analysis in Eucalyptus. PMID:25602379
Nucleobase and nucleoside transport and integration into plant metabolism

PubMed Central

Girke, Christopher; Daumann, Manuel; Niopek-Witz, Sandra; Möhlmann, Torsten

2014-01-01

Nucleotide metabolism is an essential process in all living organisms. Besides newly synthesized nucleotides, the recycling (salvage) of partially degraded nucleotides, i.e., nucleosides and nucleobases serves to keep the homeostasis of the nucleotide pool. Both types of metabolites are substrates of at least six families of transport proteins in Arabidopsis thaliana (Arabidopsis) with a total of 49 members. In the last years several members of such transport proteins have been analyzed allowing to present a more detailed picture of nucleoside and nucleobase transport and the physiological function of these processes. Besides functioning in nucleotide metabolism it turned out that individual members of the before named transporters exhibit the capacity to transport a wide range of different substrates including vitamins and phytohormones. The aim of this review is to summarize the current knowledge on nucleobase and nucleoside transport processes in plants and integrate this into nucleotide metabolism in general. Thereby, we will focus on those proteins which have been characterized at the biochemical level. PMID:25250038
A configuration space of homologous proteins conserving mutual information and allowing a phylogeny inference based on pair-wise Z-score probabilities.

PubMed

Bastien, Olivier; Ortet, Philippe; Roy, Sylvaine; Maréchal, Eric

2005-03-10

Popular methods to reconstruct molecular phylogenies are based on multiple sequence alignments, in which addition or removal of data may change the resulting tree topology. We have sought a representation of homologous proteins that would conserve the information of pair-wise sequence alignments, respect probabilistic properties of Z-scores (Monte Carlo methods applied to pair-wise comparisons) and be the basis for a novel method of consistent and stable phylogenetic reconstruction. We have built up a spatial representation of protein sequences using concepts from particle physics (configuration space) and respecting a frame of constraints deduced from pair-wise alignment score properties in information theory. The obtained configuration space of homologous proteins (CSHP) allows the representation of real and shuffled sequences, and thereupon an expression of the TULIP theorem for Z-score probabilities. Based on the CSHP, we propose a phylogeny reconstruction using Z-scores. Deduced trees, called TULIP trees, are consistent with multiple-alignment based trees. Furthermore, the TULIP tree reconstruction method provides a solution for some previously reported incongruent results, such as the apicomplexan enolase phylogeny. The CSHP is a unified model that conserves mutual information between proteins in the way physical models conserve energy. Applications include the reconstruction of evolutionary consistent and robust trees, the topology of which is based on a spatial representation that is not reordered after addition or removal of sequences. The CSHP and its assigned phylogenetic topology, provide a powerful and easily updated representation for massive pair-wise genome comparisons based on Z-score computations.
Simulations of the pairwise kinematic Sunyaev-Zel'dovich signal

DOE PAGES

Flender, Samuel; Bleem, Lindsey; Finkel, Hal; ...

2016-05-26

The pairwise kinematic Sunyaev–Zel'dovich (kSZ) signal from galaxy clusters is a probe of their line of sight momenta, and thus a potentially valuable source of cosmological information. In addition to the momenta, the amplitude of the measured signal depends on the properties of the intracluster gas and observational limitations such as errors in determining cluster centers and redshifts. In this work, we simulate the pairwise kSZ signal of clusters atmore » $$z\\lt 1$$, using the output from a cosmological N-body simulation and including the properties of the intracluster gas via a model that can be varied in post-processing. We find that modifications to the gas profile due to star formation and feedback reduce the pairwise kSZ amplitude of clusters by $$\\sim 50\\%$$, relative to the naive "gas traces mass" assumption. We demonstrate that miscentering can reduce the overall amplitude of the pairwise kSZ signal by up to 10%, while redshift errors can lead to an almost complete suppression of the signal at small separations. We confirm that a high-significance detection is expected from the combination of data from current generation, high-resolution cosmic microwave background experiments, such as the South Pole Telescope, and cluster samples from optical photometric surveys, such as the Dark Energy Survey. As a result, we forecast that future experiments such as Advanced ACTPol in conjunction with data from the Dark Energy Spectroscopic Instrument will yield detection significances of at least $$20\\sigma $$, and up to $$57\\sigma $$ in an optimistic scenario.« less
Demonstrating microbial co-occurrence pattern analyses within and between ecosystems

PubMed Central

Williams, Ryan J.; Howe, Adina; Hofmockel, Kirsten S.

2014-01-01

Co-occurrence patterns are used in ecology to explore interactions between organisms and environmental effects on coexistence within biological communities. Analysis of co-occurrence patterns among microbial communities has ranged from simple pairwise comparisons between all community members to direct hypothesis testing between focal species. However, co-occurrence patterns are rarely studied across multiple ecosystems or multiple scales of biological organization within the same study. Here we outline an approach to produce co-occurrence analyses that are focused at three different scales: co-occurrence patterns between ecosystems at the community scale, modules of co-occurring microorganisms within communities, and co-occurring pairs within modules that are nested within microbial communities. To demonstrate our co-occurrence analysis approach, we gathered publicly available 16S rRNA amplicon datasets to compare and contrast microbial co-occurrence at different taxonomic levels across different ecosystems. We found differences in community composition and co-occurrence that reflect environmental filtering at the community scale and consistent pairwise occurrences that may be used to infer ecological traits about poorly understood microbial taxa. However, we also found that conclusions derived from applying network statistics to microbial relationships can vary depending on the taxonomic level chosen and criteria used to build co-occurrence networks. We present our statistical analysis and code for public use in analysis of co-occurrence patterns across microbial communities. PMID:25101065
The immediate upstream region of the 5′-UTR from the AUG start codon has a pronounced effect on the translational efficiency in Arabidopsis thaliana

PubMed Central

Kim, Younghyun; Lee, Goeun; Jeon, Eunhyun; Sohn, Eun ju; Lee, Yongjik; Kang, Hyangju; Lee, Dong wook; Kim, Dae Heon; Hwang, Inhwan

2014-01-01

The nucleotide sequence around the translational initiation site is an important cis-acting element for post-transcriptional regulation. However, it has not been fully understood how the sequence context at the 5′-untranslated region (5′-UTR) affects the translational efficiency of individual mRNAs. In this study, we provide evidence that the 5′-UTRs of Arabidopsis genes showing a great difference in the nucleotide sequence vary greatly in translational efficiency with more than a 200-fold difference. Of the four types of nucleotides, the A residue was the most favourable nucleotide from positions −1 to −21 of the 5′-UTRs in Arabidopsis genes. In particular, the A residue in the 5′-UTR from positions −1 to −5 was required for a high-level translational efficiency. In contrast, the T residue in the 5′-UTR from positions −1 to −5 was the least favourable nucleotide in translational efficiency. Furthermore, the effect of the sequence context in the −1 to −21 region of the 5′-UTR was conserved in different plant species. Based on these observations, we propose that the sequence context immediately upstream of the AUG initiation codon plays a crucial role in determining the translational efficiency of plant genes. PMID:24084084
Classification of forest-based ecotourism areas in Pocahontas County of West Virginia using GIS and pairwise comparison method

Treesearch

Ishwar Dhami; Jinyang. Deng

2012-01-01

Many previous studies have examined ecotourism primarily from the perspective of tourists while largely ignoring ecotourism destinations. This study used geographical information system (GIS) and pairwise comparison to identify forest-based ecotourism areas in Pocahontas County, West Virginia. The study adopted the criteria and scores developed by Boyd and Butler (1994...
Learning Factors Transfer Analysis: Using Learning Curve Analysis to Automatically Generate Domain Models

ERIC Educational Resources Information Center

Pavlik, Philip I. Jr.; Cen, Hao; Koedinger, Kenneth R.

2009-01-01

This paper describes a novel method to create a quantitative model of an educational content domain of related practice item-types using learning curves. By using a pairwise test to search for the relationships between learning curves for these item-types, we show how the test results in a set of pairwise transfer relationships that can be…
Detection of the kinematic Sunyaev–Zel'dovich effect with DES Year 1 and SPT

DOE Office of Scientific and Technical Information (OSTI.GOV)

Soergel, B.; Flender, S.; Story, K. T.

Here, we detect the kinematic Sunyaev-Zel'dovich (kSZ) effect with a statistical significance ofmore » $$4.2 \\sigma$$ by combining a cluster catalogue derived from the first year data of the Dark Energy Survey (DES) with CMB temperature maps from the South Pole Telescope Sunyaev-Zel'dovich (SPT-SZ) Survey. This measurement is performed with a differential statistic that isolates the pairwise kSZ signal, providing the first detection of the large-scale, pairwise motion of clusters using redshifts derived from photometric data. By fitting the pairwise kSZ signal to a theoretical template we measure the average central optical depth of the cluster sample, $$\\bar{\\tau}_e = (3.75 \\pm 0.89)\\cdot 10^{-3}$$. We compare the extracted signal to realistic simulations and find good agreement with respect to the signal-to-noise, the constraint on $$\\bar{\\tau}_e$$, and the corresponding gas fraction. High-precision measurements of the pairwise kSZ signal with future data will be able to place constraints on the baryonic physics of galaxy clusters, and could be used to probe gravity on scales $$ \\gtrsim 100$$ Mpc.« less
Detection of the kinematic Sunyaev–Zel'dovich effect with DES Year 1 and SPT

DOE PAGES

Soergel, B.; Flender, S.; Story, K. T.; ...

2016-06-17

Here, we detect the kinematic Sunyaev-Zel'dovich (kSZ) effect with a statistical significance ofmore » $$4.2 \\sigma$$ by combining a cluster catalogue derived from the first year data of the Dark Energy Survey (DES) with CMB temperature maps from the South Pole Telescope Sunyaev-Zel'dovich (SPT-SZ) Survey. This measurement is performed with a differential statistic that isolates the pairwise kSZ signal, providing the first detection of the large-scale, pairwise motion of clusters using redshifts derived from photometric data. By fitting the pairwise kSZ signal to a theoretical template we measure the average central optical depth of the cluster sample, $$\\bar{\\tau}_e = (3.75 \\pm 0.89)\\cdot 10^{-3}$$. We compare the extracted signal to realistic simulations and find good agreement with respect to the signal-to-noise, the constraint on $$\\bar{\\tau}_e$$, and the corresponding gas fraction. High-precision measurements of the pairwise kSZ signal with future data will be able to place constraints on the baryonic physics of galaxy clusters, and could be used to probe gravity on scales $$ \\gtrsim 100$$ Mpc.« less
Nucleotide sequence of a resistance breaking mutant of southern bean mosaic virus.

PubMed

Lee, L; Anderson, E J

1998-01-01

SBMV-S is a resistance-breaking mutant of an Arkansas isolate of the bean strain of southern bean mosaic virus (SBMV-BARK) that is able to move systemically in Phaseolus vulgaris cvs. Pinto and Great Northern, whereas the wild-type SBMV-BARK causes local necrotic lesions and is restricted to the inoculated leaves of these hosts. Sequence analysis of the 4136 nucleotide genomes of SBMV-BARK and SBMV-S revealed seven nucleotide differences, but only four deduced amino acid changes. A single amino acid change occurred in the C-terminal region of the putative RNA-dependent RNA polymerase and three differences were identified in the N-terminal portion of the virus coat protein. SBMV-BARK and SBMV-S were compared with other sobemoviruses and were found to contain a high level of nucleotide sequence identity (91.3%) to SBMV-B. Unlike SBMV-B however, SBMV-BARK and SBMV-S contained four putative overlapping open reading frames, making them more similar in genome organization to the cowpea strain, SBMV-C. The possibility exists that mutations or even errors, that resulted in mis-identification of open reading frames, occurred in previously published information on nucleotide sequence and genomic organization for SBMV-B.
Structural and metabolic characterization of RNAs from rats with experimental Guerin tumor - I. Nucleotide composition of RNAs from the liver and tumor tissues of rats.

PubMed

Ratkiewicz, A; Galasinski, W

1976-01-01

The characteristics of the ribonucleic acids of Guerin tumor was the subject of this work. The effect of tumor development on the structure of the ribonucleic acids in the liver of tumor bearing rats was studied. Some differences of nucleotide compositions in RNAs isolated from subcellular fractions of liver of control and tumor bearing rats and of cancer tissue were observed. The nucleotide compositions of cancer nuclear RNA is distinctly different from liver RNA. The changes in primary structure of liver RNAs due by development of tumor in rats may be result of metabolic peculiarities of these RNAs.
Unbiased nonorthogonal bases for tomographic reconstruction

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sainz, Isabel; Klimov, Andrei B.; Roa, Luis

2010-05-15

We have developed a general method for constructing a set of nonorthogonal bases with equal separations between all different basis states in prime dimensions. The results are that the corresponding biorthogonal counterparts are pairwise unbiased with the components of the original bases. Using these bases, we derive an explicit expression for the optimal tomography in nonorthogonal bases. A special two-dimensional case is analyzed separately.
Accelerated spike resampling for accurate multiple testing controls.

PubMed

Harrison, Matthew T

2013-02-01

Controlling for multiple hypothesis tests using standard spike resampling techniques often requires prohibitive amounts of computation. Importance sampling techniques can be used to accelerate the computation. The general theory is presented, along with specific examples for testing differences across conditions using permutation tests and for testing pairwise synchrony and precise lagged-correlation between many simultaneously recorded spike trains using interval jitter.
Obtaining Rubric Weights for Assessments by More than One Lecturer Using a Pairwise Learning Model

ERIC Educational Resources Information Center

Quevedo, J. R.; Montanes, E.

2009-01-01

Specifying the criteria of a rubric to assess an activity, establishing the different quality levels of proficiency of development and defining weights for every criterion is not as easy as one a priori might think. Besides, the complexity of these tasks increases when they involve more than one lecturer. Reaching an agreement about the criteria…
Effect of congenital blindness on the semantic representation of some everyday concepts

PubMed Central

Connolly, Andrew C.; Gleitman, Lila R.; Thompson-Schill, Sharon L.

2007-01-01

This study explores how the lack of first-hand experience with color, as a result of congenital blindness, affects implicit judgments about “higher-order” concepts, such as “fruits and vegetables” (FV), but not others, such as “household items” (HHI). We demonstrate how the differential diagnosticity of color across our test categories interacts with visual experience to produce, in effect, a category-specific difference in implicit similarity. Implicit pair-wise similarity judgments were collected by using an odd-man-out triad task. Pair-wise similarities for both FV and for HHI were derived from this task and were compared by using cluster analysis and regression analyses. Color was found to be a significant component in the structure of implicit similarity for FV for sighted participants but not for blind participants; and this pattern remained even when the analysis was restricted to blind participants who had good explicit color knowledge of the stimulus items. There was also no evidence that either subject group used color knowledge in making decisions about HHI, nor was there an indication of any qualitative differences between blind and sighted subjects' judgments on HHI. PMID:17483447
Efficient molecular dynamics simulations with many-body potentials on graphics processing units

NASA Astrophysics Data System (ADS)

Fan, Zheyong; Chen, Wei; Vierimaa, Ville; Harju, Ari

2017-09-01

Graphics processing units have been extensively used to accelerate classical molecular dynamics simulations. However, there is much less progress on the acceleration of force evaluations for many-body potentials compared to pairwise ones. In the conventional force evaluation algorithm for many-body potentials, the force, virial stress, and heat current for a given atom are accumulated within different loops, which could result in write conflict between different threads in a CUDA kernel. In this work, we provide a new force evaluation algorithm, which is based on an explicit pairwise force expression for many-body potentials derived recently (Fan et al., 2015). In our algorithm, the force, virial stress, and heat current for a given atom can be accumulated within a single thread and is free of write conflicts. We discuss the formulations and algorithms and evaluate their performance. A new open-source code, GPUMD, is developed based on the proposed formulations. For the Tersoff many-body potential, the double precision performance of GPUMD using a Tesla K40 card is equivalent to that of the LAMMPS (Large-scale Atomic/Molecular Massively Parallel Simulator) molecular dynamics code running with about 100 CPU cores (Intel Xeon CPU X5670 @ 2.93 GHz).
Hunger enhances consistent economic choices in non-human primates.

PubMed

Yamada, Hiroshi

2017-05-24

Hunger and thirst are fundamental biological processes that drive consumption behavior in humans and non-human animals. While the existing literature in neuroscience suggests that these satiety states change how consumable rewards are represented in the brain, it remains unclear as to how they change animal choice behavior and the underlying economic preferences. Here, I used combined techniques from experimental economics, psychology, and neuroscience to measure food preferences of marmoset monkeys (Callithrix jacchus), a recently developed primate model for neuroscience. Hunger states of animals were manipulated by scheduling feeding intervals, resulting in three different conditions: sated, non-sated, and hungry. During these hunger states, animals performed pairwise choices of food items, which included all possible pairwise combinations of five different food items except for same-food pairs. Results showed that hunger enhanced economic rationality, evident as a decrease of transitivity violations (item A was preferred to item B, and B to C, but C was preferred to A). Further analysis demonstrated that hungry monkeys chose more-preferred items over less-preferred items in a more deterministic manner, while the individual food preferences appeared to remain stable across hunger states. These results suggest that hunger enhances consistent choice behavior and shifts animals towards efficient outcome maximization.
Reexamination of the interaction of atoms with a LiF(001) surface

NASA Astrophysics Data System (ADS)

Miraglia, J. E.; Gravielle, M. S.

2017-02-01

Pairwise additive potentials for multielectronic atoms interacting with a LiF(001) surface are revisited by including an improved description of the electron density associated with the different lattice sites, as well as nonlocal electron density contributions. Within this model, the electron distribution around each ionic site of the crystal is described by means of a so-called "onion" approach that accounts for the influence of the Madelung potential. From such densities, binary interatomic potentials are then derived by using well-known nonlocal functionals. Rumpling and long-range contributions due to projectile polarization and van der Waals forces are also included. We apply this pairwise additive approximation to evaluate the interaction potential between closed-shell (He, Ne, Ar, Kr, and Xe) and open-shell (N, S, and Cl) atoms and the LiF surface, analyzing the relative importance of the different contributions. The performance of the proposed potentials is assessed by contrasting angular positions of rainbow and supernumerary rainbow maxima produced by fast grazing incidence with available experimental data. One important result of our model is that both van der Waals contributions and thermal lattice vibrations play a negligible role for normal energies in the eV range.
Phenotype-genotype correlations in X linked retinitis pigmentosa.

PubMed Central

Kaplan, J; Pelet, A; Martin, C; Delrieu, O; Aymé, S; Bonneau, D; Briard, M L; Hanauer, A; Larget-Piet, L; Lefrançois, P

1992-01-01

Retinitis pigmentosa (RP) represents a group of clinically heterogeneous retinal degenerations in which all modes of inheritance have been described. We have previously found two different clinical profiles in X linked RP as a function of age and mode of onset. The first clinical form has very early onset with severe myopia. The second form starts later with night blindness with mild myopia or none. At least two genes have been identified in X linked forms, namely RP2 (linked to DXS7, DXS255, and DXS14) and RP3 (linked to DXS84 and OTC) on the short arm of the X chromosome. In order to contribute to phenotype-genotype correlations in X linked RP, we tested the hypothesis that the two clinical profiles could be accounted for by the two different gene loci. The present study provides evidence for linkage of the clinical form with early myopia as the onset symptom with the RP2 gene (pairwise linkage to DXS255: Z = 3.13 at theta = 0), while the clinical form with later night blindness as the onset symptom is linked to the RP3 gene (pairwise linkage to OTC: Z = 4.16 at theta = 0). Images PMID:1357178

Dietary nucleotides and early growth in formula-fed infants: a randomized controlled trial.

PubMed

Singhal, Atul; Kennedy, Kathy; Lanigan, J; Clough, Helen; Jenkins, Wendy; Elias-Jones, Alun; Stephenson, Terrence; Dudek, Peter; Lucas, Alan

2010-10-01

Dietary nucleotides are nonprotein nitrogenous compounds that are found in high concentrations in breast milk and are thought to be conditionally essential nutrients in infancy. A high nucleotide intake has been suggested to explain some of the benefits of breastfeeding compared with formula feeding and to promote infant growth. However, relatively few large-scale randomized trials have tested this hypothesis in healthy infants. We tested the hypothesis that nucleotide supplementation of formula benefits early infant growth. Occipitofrontal head circumference, weight, and length were assessed in infants who were randomly assigned to groups fed nucleotide-supplemented (31 mg/L; n=100) or control formula without nucleotide supplementation (n=100) from birth to the age of 20 weeks, and in infants who were breastfed (reference group; n=101). Infants fed with nucleotide-supplemented formula had greater occipitofrontal head circumference at ages 8, 16, and 20 weeks than infants fed control formula (mean difference in z scores at 8 weeks: 0.4 [95% confidence interval: 0.1-0.7]; P=.006) even after adjustment for potential confounding factors (P=.002). Weight at 8 weeks and the increase in both occipitofrontal head circumference and weight from birth to 8 weeks were also greater in infants fed nucleotide-supplemented formula than in those fed control formula. Our data support the hypothesis that nucleotide supplementation leads to increased weight gain and head growth in formula-fed infants. Therefore, nucleotides could be conditionally essential for optimal infant growth in some formula-fed populations. Additional research is needed to test the hypothesis that the benefits of nucleotide supplementation for early head growth, a critical period for brain growth, have advantages for long-term cognitive development.
Polymerization of non-complementary RNA: systematic symmetric nucleotide exchanges mainly involving uracil produce mitochondrial RNA transcripts coding for cryptic overlapping genes.

PubMed

Seligmann, Hervé

2013-03-01

Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, self-hybridization increases with length, suggesting endoribonuclease-limited elongation. Blast detects stop codon depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA coding potential. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Systematic asymmetric nucleotide exchanges produce human mitochondrial RNAs cryptically encoding for overlapping protein coding genes.

PubMed

Seligmann, Hervé

2013-05-07

GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges. Copyright © 2013 Elsevier Ltd. All rights reserved.
Labeled nucleotide phosphate (NP) probes

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2009-02-03

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Quantum Point Contact Single-Nucleotide Conductance for DNA and RNA Sequence Identification.

PubMed

Afsari, Sepideh; Korshoj, Lee E; Abel, Gary R; Khan, Sajida; Chatterjee, Anushree; Nagpal, Prashant

2017-11-28

Several nanoscale electronic methods have been proposed for high-throughput single-molecule nucleic acid sequence identification. While many studies display a large ensemble of measurements as "electronic fingerprints" with some promise for distinguishing the DNA and RNA nucleobases (adenine, guanine, cytosine, thymine, and uracil), important metrics such as accuracy and confidence of base calling fall well below the current genomic methods. Issues such as unreliable metal-molecule junction formation, variation of nucleotide conformations, insufficient differences between the molecular orbitals responsible for single-nucleotide conduction, and lack of rigorous base calling algorithms lead to overlapping nanoelectronic measurements and poor nucleotide discrimination, especially at low coverage on single molecules. Here, we demonstrate a technique for reproducible conductance measurements on conformation-constrained single nucleotides and an advanced algorithmic approach for distinguishing the nucleobases. Our quantum point contact single-nucleotide conductance sequencing (QPICS) method uses combed and electrostatically bound single DNA and RNA nucleotides on a self-assembled monolayer of cysteamine molecules. We demonstrate that by varying the applied bias and pH conditions, molecular conductance can be switched ON and OFF, leading to reversible nucleotide perturbation for electronic recognition (NPER). We utilize NPER as a method to achieve >99.7% accuracy for DNA and RNA base calling at low molecular coverage (∼12×) using unbiased single measurements on DNA/RNA nucleotides, which represents a significant advance compared to existing sequencing methods. These results demonstrate the potential for utilizing simple surface modifications and existing biochemical moieties in individual nucleobases for a reliable, direct, single-molecule, nanoelectronic DNA and RNA nucleotide identification method for sequencing.
The CD8α gene in duck (Anatidae): cloning, characterization, and expression during viral infection.

PubMed

Xu, Qi; Chen, Yang; Zhao, Wen Ming; Huang, Zheng Yang; Duan, Xiu Jun; Tong, Yi Yu; Zhang, Yang; Li, Xiu; Chang, Guo Bin; Chen, Guo Hong

2015-02-01

Cluster of differentiation 8 alpha (CD8α) is critical for cell-mediated immune defense and T-cell development. Although CD8α sequences have been reported for several species, very little is known about CD8α in ducks. To elucidate the mechanisms involved in the innate and adaptive immune responses of ducks, we cloned CD8α coding sequences from domestic, Muscovy, Mallard, and Spotbill ducks using reverse transcription polymerase chain reaction (RT-PCR). Each sequence consisted of 714 nucleotides and encoded a signal peptide, an IgV-like domain, a stalk region, a transmembrane region, and a cytoplasmic tail. We identified 58 nucleotide differences and 37 amino acid differences among the four types of duck; of these, 53 nucleotide and 33 amino acid differences were between Muscovy ducks and the other duck species. The CD8α cDNA sequence from domestic duck consisted of a 61-nucleotide 5' untranslated region (UTR), a 714-nucleotide open reading frame, and an 849-nucleotide 3' UTR. Multiple sequence alignments showed that the amino acid sequence of CD8α is conserved in vertebrates. RT-PCR revealed that expression of CD8α mRNA of domestic ducks was highest in the thymus and very low in the kidney, cerebrum, cerebellum, and muscle. Immunohistochemical analyses detected CD8α on the splenic corpuscle and periarterial lymphatic sheath of the spleen. CD8α mRNA in domestic ducklings was initially up-regulated, and then down-regulated, in the thymus, spleen, and liver after treatment with duck hepatitis virus type I (DHV-1) or the immunostimulant polyriboinosinic polyribocytidylic acid (poly I:C).
A movie of the RNA polymerase nucleotide addition cycle.

PubMed

Brueckner, Florian; Ortiz, Julio; Cramer, Patrick

2009-06-01

During gene transcription, RNA polymerase (Pol) passes through repetitive cycles of adding a nucleotide to the growing mRNA chain. Here we obtained a movie of the nucleotide addition cycle by combining structural information on different functional states of the Pol II elongation complex (EC). The movie illustrates the two-step loading of the nucleoside triphosphate (NTP) substrate, closure of the active site for catalytic nucleotide incorporation, and the presumed two-step translocation of DNA and RNA, which is accompanied by coordinated conformational changes in the polymerase bridge helix and trigger loop. The movie facilitates teaching and a mechanistic analysis of transcription and can be downloaded from http://www.lmb.uni-muenchen.de/cramer/pr-materials.
Reaction mechanism and reaction coordinates from the viewpoint of energy flow

PubMed Central

2016-01-01

Reaction coordinates are of central importance for correct understanding of reaction dynamics in complex systems, but their counter-intuitive nature made it a daunting challenge to identify them. Starting from an energetic view of a reaction process as stochastic energy flows biased towards preferred channels, which we deemed the reaction coordinates, we developed a rigorous scheme for decomposing energy changes of a system, both potential and kinetic, into pairwise components. The pairwise energy flows between different coordinates provide a concrete statistical mechanical language for depicting reaction mechanisms. Application of this scheme to the C7eq → C7ax transition of the alanine dipeptide in vacuum revealed novel and intriguing mechanisms that eluded previous investigations of this well studied prototype system for biomolecular conformational dynamics. Using a cost function developed from the energy decomposition components by proper averaging over the transition path ensemble, we were able to identify signatures of the reaction coordinates of this system without requiring any input from human intuition. PMID:27004858
Communicating with sentences: A multi-word naming game model

NASA Astrophysics Data System (ADS)

Lou, Yang; Chen, Guanrong; Hu, Jianwei

2018-01-01

Naming game simulates the process of naming an object by a single word, in which a population of communicating agents can reach global consensus asymptotically through iteratively pair-wise conversations. We propose an extension of the single-word model to a multi-word naming game (MWNG), simulating the case of describing a complex object by a sentence (multiple words). Words are defined in categories, and then organized as sentences by combining them from different categories. We refer to a formatted combination of several words as a pattern. In such an MWNG, through a pair-wise conversation, it requires the hearer to achieve consensus with the speaker with respect to both every single word in the sentence as well as the sentence pattern, so as to guarantee the correct meaning of the saying; otherwise, they fail reaching consensus in the interaction. We validate the model in three typical topologies as the underlying communication network, and employ both conventional and man-designed patterns in performing the MWNG.
Reaction mechanism and reaction coordinates from the viewpoint of energy flow

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Wenjin; Ma, Ao, E-mail: aoma@uic.edu

Reaction coordinates are of central importance for correct understanding of reaction dynamics in complex systems, but their counter-intuitive nature made it a daunting challenge to identify them. Starting from an energetic view of a reaction process as stochastic energy flows biased towards preferred channels, which we deemed the reaction coordinates, we developed a rigorous scheme for decomposing energy changes of a system, both potential and kinetic, into pairwise components. The pairwise energy flows between different coordinates provide a concrete statistical mechanical language for depicting reaction mechanisms. Application of this scheme to the C{sub 7eq} → C{sub 7ax} transition of themore » alanine dipeptide in vacuum revealed novel and intriguing mechanisms that eluded previous investigations of this well studied prototype system for biomolecular conformational dynamics. Using a cost function developed from the energy decomposition components by proper averaging over the transition path ensemble, we were able to identify signatures of the reaction coordinates of this system without requiring any input from human intuition.« less
Multiclass Posterior Probability Twin SVM for Motor Imagery EEG Classification.

PubMed

She, Qingshan; Ma, Yuliang; Meng, Ming; Luo, Zhizeng

2015-01-01

Motor imagery electroencephalography is widely used in the brain-computer interface systems. Due to inherent characteristics of electroencephalography signals, accurate and real-time multiclass classification is always challenging. In order to solve this problem, a multiclass posterior probability solution for twin SVM is proposed by the ranking continuous output and pairwise coupling in this paper. First, two-class posterior probability model is constructed to approximate the posterior probability by the ranking continuous output techniques and Platt's estimating method. Secondly, a solution of multiclass probabilistic outputs for twin SVM is provided by combining every pair of class probabilities according to the method of pairwise coupling. Finally, the proposed method is compared with multiclass SVM and twin SVM via voting, and multiclass posterior probability SVM using different coupling approaches. The efficacy on the classification accuracy and time complexity of the proposed method has been demonstrated by both the UCI benchmark datasets and real world EEG data from BCI Competition IV Dataset 2a, respectively.
An efficient semi-supervised community detection framework in social networks.

PubMed

Li, Zhen; Gong, Yong; Pan, Zhisong; Hu, Guyu

2017-01-01

Community detection is an important tasks across a number of research fields including social science, biology, and physics. In the real world, topology information alone is often inadequate to accurately find out community structure due to its sparsity and noise. The potential useful prior information such as pairwise constraints which contain must-link and cannot-link constraints can be obtained from domain knowledge in many applications. Thus, combining network topology with prior information to improve the community detection accuracy is promising. Previous methods mainly utilize the must-link constraints while cannot make full use of cannot-link constraints. In this paper, we propose a semi-supervised community detection framework which can effectively incorporate two types of pairwise constraints into the detection process. Particularly, must-link and cannot-link constraints are represented as positive and negative links, and we encode them by adding different graph regularization terms to penalize closeness of the nodes. Experiments on multiple real-world datasets show that the proposed framework significantly improves the accuracy of community detection.
Pairwise Force SPH Model for Real-Time Multi-Interaction Applications.

PubMed

Yang, Tao; Martin, Ralph R; Lin, Ming C; Chang, Jian; Hu, Shi-Min

2017-10-01

In this paper, we present a novel pairwise-force smoothed particle hydrodynamics (PF-SPH) model to enable simulation of various interactions at interfaces in real time. Realistic capture of interactions at interfaces is a challenging problem for SPH-based simulations, especially for scenarios involving multiple interactions at different interfaces. Our PF-SPH model can readily handle multiple types of interactions simultaneously in a single simulation; its basis is to use a larger support radius than that used in standard SPH. We adopt a novel anisotropic filtering term to further improve the performance of interaction forces. The proposed model is stable; furthermore, it avoids the particle clustering problem which commonly occurs at the free surface. We show how our model can be used to capture various interactions. We also consider the close connection between droplets and bubbles, and show how to animate bubbles rising in liquid as well as bubbles in air. Our method is versatile, physically plausible and easy-to-implement. Examples are provided to demonstrate the capabilities and effectiveness of our approach.
On fuzzy semantic similarity measure for DNA coding.

PubMed

Ahmad, Muneer; Jung, Low Tang; Bhuiyan, Md Al-Amin

2016-02-01

A coding measure scheme numerically translates the DNA sequence to a time domain signal for protein coding regions identification. A number of coding measure schemes based on numerology, geometry, fixed mapping, statistical characteristics and chemical attributes of nucleotides have been proposed in recent decades. Such coding measure schemes lack the biologically meaningful aspects of nucleotide data and hence do not significantly discriminate coding regions from non-coding regions. This paper presents a novel fuzzy semantic similarity measure (FSSM) coding scheme centering on FSSM codons׳ clustering and genetic code context of nucleotides. Certain natural characteristics of nucleotides i.e. appearance as a unique combination of triplets, preserving special structure and occurrence, and ability to own and share density distributions in codons have been exploited in FSSM. The nucleotides׳ fuzzy behaviors, semantic similarities and defuzzification based on the center of gravity of nucleotides revealed a strong correlation between nucleotides in codons. The proposed FSSM coding scheme attains a significant enhancement in coding regions identification i.e. 36-133% as compared to other existing coding measure schemes tested over more than 250 benchmarked and randomly taken DNA datasets of different organisms. Copyright © 2015 Elsevier Ltd. All rights reserved.
Living network meta-analysis compared with pairwise meta-analysis in comparative effectiveness research: empirical study.

PubMed

Nikolakopoulou, Adriani; Mavridis, Dimitris; Furukawa, Toshi A; Cipriani, Andrea; Tricco, Andrea C; Straus, Sharon E; Siontis, George C M; Egger, Matthias; Salanti, Georgia

2018-02-28

To examine whether the continuous updating of networks of prospectively planned randomised controlled trials (RCTs) ("living" network meta-analysis) provides strong evidence against the null hypothesis in comparative effectiveness of medical interventions earlier than the updating of conventional, pairwise meta-analysis. Empirical study of the accumulating evidence about the comparative effectiveness of clinical interventions. Database of network meta-analyses of RCTs identified through searches of Medline, Embase, and the Cochrane Database of Systematic Reviews until 14 April 2015. Network meta-analyses published after January 2012 that compared at least five treatments and included at least 20 RCTs. Clinical experts were asked to identify in each network the treatment comparison of greatest clinical interest. Comparisons were excluded for which direct and indirect evidence disagreed, based on side, or node, splitting test (P<0.10). Cumulative pairwise and network meta-analyses were performed for each selected comparison. Monitoring boundaries of statistical significance were constructed and the evidence against the null hypothesis was considered to be strong when the monitoring boundaries were crossed. A significance level was defined as α=5%, power of 90% (β=10%), and an anticipated treatment effect to detect equal to the final estimate from the network meta-analysis. The frequency and time to strong evidence was compared against the null hypothesis between pairwise and network meta-analyses. 49 comparisons of interest from 44 networks were included; most (n=39, 80%) were between active drugs, mainly from the specialties of cardiology, endocrinology, psychiatry, and rheumatology. 29 comparisons were informed by both direct and indirect evidence (59%), 13 by indirect evidence (27%), and 7 by direct evidence (14%). Both network and pairwise meta-analysis provided strong evidence against the null hypothesis for seven comparisons, but for an additional 10 comparisons only network meta-analysis provided strong evidence against the null hypothesis (P=0.002). The median time to strong evidence against the null hypothesis was 19 years with living network meta-analysis and 23 years with living pairwise meta-analysis (hazard ratio 2.78, 95% confidence interval 1.00 to 7.72, P=0.05). Studies directly comparing the treatments of interest continued to be published for eight comparisons after strong evidence had become evident in network meta-analysis. In comparative effectiveness research, prospectively planned living network meta-analyses produced strong evidence against the null hypothesis more often and earlier than conventional, pairwise meta-analyses. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Living network meta-analysis compared with pairwise meta-analysis in comparative effectiveness research: empirical study

PubMed Central

Nikolakopoulou, Adriani; Mavridis, Dimitris; Furukawa, Toshi A; Cipriani, Andrea; Tricco, Andrea C; Straus, Sharon E; Siontis, George C M; Egger, Matthias

2018-01-01

Abstract Objective To examine whether the continuous updating of networks of prospectively planned randomised controlled trials (RCTs) (“living” network meta-analysis) provides strong evidence against the null hypothesis in comparative effectiveness of medical interventions earlier than the updating of conventional, pairwise meta-analysis. Design Empirical study of the accumulating evidence about the comparative effectiveness of clinical interventions. Data sources Database of network meta-analyses of RCTs identified through searches of Medline, Embase, and the Cochrane Database of Systematic Reviews until 14 April 2015. Eligibility criteria for study selection Network meta-analyses published after January 2012 that compared at least five treatments and included at least 20 RCTs. Clinical experts were asked to identify in each network the treatment comparison of greatest clinical interest. Comparisons were excluded for which direct and indirect evidence disagreed, based on side, or node, splitting test (P<0.10). Outcomes and analysis Cumulative pairwise and network meta-analyses were performed for each selected comparison. Monitoring boundaries of statistical significance were constructed and the evidence against the null hypothesis was considered to be strong when the monitoring boundaries were crossed. A significance level was defined as α=5%, power of 90% (β=10%), and an anticipated treatment effect to detect equal to the final estimate from the network meta-analysis. The frequency and time to strong evidence was compared against the null hypothesis between pairwise and network meta-analyses. Results 49 comparisons of interest from 44 networks were included; most (n=39, 80%) were between active drugs, mainly from the specialties of cardiology, endocrinology, psychiatry, and rheumatology. 29 comparisons were informed by both direct and indirect evidence (59%), 13 by indirect evidence (27%), and 7 by direct evidence (14%). Both network and pairwise meta-analysis provided strong evidence against the null hypothesis for seven comparisons, but for an additional 10 comparisons only network meta-analysis provided strong evidence against the null hypothesis (P=0.002). The median time to strong evidence against the null hypothesis was 19 years with living network meta-analysis and 23 years with living pairwise meta-analysis (hazard ratio 2.78, 95% confidence interval 1.00 to 7.72, P=0.05). Studies directly comparing the treatments of interest continued to be published for eight comparisons after strong evidence had become evident in network meta-analysis. Conclusions In comparative effectiveness research, prospectively planned living network meta-analyses produced strong evidence against the null hypothesis more often and earlier than conventional, pairwise meta-analyses. PMID:29490922
Dynamically heterogenous partitions and phylogenetic inference: an evaluation of analytical strategies with cytochrome b and ND6 gene sequences in cranes.

PubMed

Krajewski, C; Fain, M G; Buckley, L; King, D G

1999-11-01

ki ctes over whether molecular sequence data should be partitioned for phylogenetic analysis often confound two types of heterogeneity among partitions. We distinguish historical heterogeneity (i.e., different partitions have different evolutionary relationships) from dynamic heterogeneity (i.e., different partitions show different patterns of sequence evolution) and explore the impact of the latter on phylogenetic accuracy and precision with a two-gene, mitochondrial data set for cranes. The well-established phylogeny of cranes allows us to contrast tree-based estimates of relevant parameter values with estimates based on pairwise comparisons and to ascertain the effects of incorporating different amounts of process information into phylogenetic estimates. We show that codon positions in the cytochrome b and NADH dehydrogenase subunit 6 genes are dynamically heterogenous under both Poisson and invariable-sites + gamma-rates versions of the F84 model and that heterogeneity includes variation in base composition and transition bias as well as substitution rate. Estimates of transition-bias and relative-rate parameters from pairwise sequence comparisons were comparable to those obtained as tree-based maximum likelihood estimates. Neither rate-category nor mixed-model partitioning strategies resulted in a loss of phylogenetic precision relative to unpartitioned analyses. We suggest that weighted-average distances provide a computationally feasible alternative to direct maximum likelihood estimates of phylogeny for mixed-model analyses of large, dynamically heterogenous data sets. Copyright 1999 Academic Press.
Determination of the kinetics of guanine nucleotide exchange on EF-Tu and EF-Ts: continuing uncertainties.

PubMed

Manchester, Keith L

2004-01-30

An analysis is made of the rate constants for the reactions involving the interactions of EF-Tu, EF-Ts, GDP, and GTP recently derived by Gromadski et al. [Biochemistry 41 (2002) 162]. Though their measured values appear to allow a reasonable rate of nucleotide exchange sufficient to support rates of protein synthesis in vivo, their data underestimate the thermodynamic barrier involved in nucleotide exchange and therefore cannot be considered definitive. A kinetic scheme consistent with the thermodynamic barrier can be achieved by modification of various rate constants, particularly of those involving the release of EF-Ts from EF-Tu.GTP.EF-Ts, but such constants are markedly different from what are experimentally observed. It thus remains impossible at present satisfactorily to model guanine nucleotide exchange on EF-Tu, catalysed by EF-Ts by a double displacement mechanism, with experimentally derived rate constants. Metabolic control analysis has been applied to determine the degree of flux control of the different steps in the pathway.
Free amino acids and 5'-nucleotides in Finnish forest mushrooms.

PubMed

Manninen, Hanna; Rotola-Pukkila, Minna; Aisala, Heikki; Hopia, Anu; Laaksonen, Timo

2018-05-01

Edible mushrooms are valued because of their umami taste and good nutritional values. Free amino acids, 5'-nucleotides and nucleosides were analyzed from four Nordic forest mushroom species (Lactarius camphoratus, Boletus edulis, Cantharellus cibarius, Craterellus tubaeformis) using high precision liquid chromatography analysis. To our knowledge, these taste components were studied for the first time from Craterellus tubaeformis and Lactarius camphoratus. The focus was on the umami amino acids and 5'-nucleotides. The free amino acid and 5'-nucleotide/nucleoside contents of studied species differed from each other. In all studied samples, umami amino acids were among five major free amino acids. The highest concentration of umami amino acids was on L. camphoratus whereas B. edulis had the highest content of sweet amino acids and C. cibarius had the highest content of bitter amino acids. The content of umami enhancing 5'-nucleotides were low in all studied species. Copyright © 2017 Elsevier Ltd. All rights reserved.
Defining the Molecular Actions of Dietary Fatty Acids in Breast Cancer: Selective Modulation of Peroxisome Proliferator-Activated Receptor Gamma. Addendum

DTIC Science & Technology

2008-05-01

0.05 significance threshold. Fol- owing ANOVA, Fisher’s least significant difference, LSD , air-wise comparison was implemented post-hoc. Briefly, the SD...average absolute difference etween any two groups was greater than the LSD critical alue, then the pair-wise comparison for those two groups ere...Xu, Y., Hin- shaw , J.C., Zimmerman, G.A., Hama, K., Aoki, J., Arai, H., Prestwich, G.D., 2003. Identification of an intracellular receptor for

Portable Low Volume Therapy for Severe Blood Loss

DTIC Science & Technology

2016-08-01

loss . 8 24-hour survival showed no statistical differences (p>0.05) between BHB/M (mean survival 21.0 ± 2.74 hrs) and 4 M BHB, 4.3 mM melatonin...in Figure 6 were compared 24 hours and 10 days after 60% blood loss . Pairwise comparisons are summarized in Table 3. No treatment differences were...Award Number: W81XWH-11-1-0409 TITLE: Portable Low-Volume Therapy for Severe Blood Loss PRINCIPAL INVESTIGATOR: Matthew T. Andrews CONTRACTING
Genetic polymorphisms in ESR1 and ESR2 genes, and risk of hypospadias in a multiethnic study population.

PubMed

Choudhry, Shweta; Baskin, Laurence S; Lammer, Edward J; Witte, John S; Dasgupta, Sudeshna; Ma, Chen; Surampalli, Abhilasha; Shen, Joel; Shaw, Gary M; Carmichael, Suzan L

2015-05-01

Estrogenic endocrine disruptors acting via estrogen receptors α (ESR1) and β (ESR2) have been implicated in the etiology of hypospadias, a common congenital malformation of the male external genitalia. We determined the association of single nucleotide polymorphisms in ESR1 and ESR2 genes with hypospadias in a racially/ethnically diverse study population of California births. We investigated the relationship between hypospadias and 108 ESR1 and 36 ESR2 single nucleotide polymorphisms in 647 cases and 877 population based nonmalformed controls among infants born in selected California counties from 1990 to 2003. Subgroup analyses were performed by race/ethnicity (nonHispanic white and Hispanic subjects) and by hypospadias severity (mild to moderate and severe). Odds ratios for 33 of the 108 ESR1 single nucleotide polymorphisms had p values less than 0.05 (p = 0.05 to 0.007) for risk of hypospadias. However, none of the 36 ESR2 single nucleotide polymorphisms was significantly associated. In stratified analyses the association results were consistent by disease severity but different sets of single nucleotide polymorphisms were significantly associated with hypospadias in nonHispanic white and Hispanic subjects. Due to high linkage disequilibrium across the single nucleotide polymorphisms, haplotype analyses were conducted and identified 6 haplotype blocks in ESR1 gene that had haplotypes significantly associated with an increased risk of hypospadias (OR 1.3 to 1.8, p = 0.04 to 0.00001). Similar to single nucleotide polymorphism analysis, different ESR1 haplotypes were associated with risk of hypospadias in nonHispanic white and Hispanic subjects. No significant haplotype association was observed for ESR2. The data provide evidence that ESR1 single nucleotide polymorphisms and haplotypes influence the risk of hypospadias in white and Hispanic subjects, and warrant further examination in other study populations. Copyright © 2015 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
Variation in the Nucleotide Sequence of Cottontail Rabbit Papillomavirus a and b Subtypes Affects Wart Regression and Malignant Transformation and Level of Viral Replication in Domestic Rabbits

PubMed Central

Salmon, Jérôme; Nonnenmacher, Mathieu; Cazé, Sandrine; Flamant, Patricia; Croissant, Odile; Orth, Gérard; Breitburd, Françoise

2000-01-01

We previously reported the partial characterization of two cottontail rabbit papillomavirus (CRPV) subtypes with strikingly divergent E6 and E7 oncoproteins. We report now the complete nucleotide sequences of these subtypes, referred to as CRPVa4 (7,868 nucleotides) and CRPVb (7,867 nucleotides). The CRPVa4 and CRPVb genomes differed at 238 (3%) nucleotide positions, whereas CRPVa4 and the prototype CRPV differed by only 5 nucleotides. The most variable region (7% nucleotide divergence) included the long regulatory region (LRR) and the E6 and E7 genes. A mutation in the stop codon resulted in an 8-amino-acid-longer CRPVb E4 protein, and a nucleotide deletion reduced the coding capacity of the E5 gene from 101 to 25 amino acids. In domestic rabbits homozygous for a specific haplotype of the DRA and DQA genes of the major histocompatibility complex, warts induced by CRPVb DNA or a chimeric genome containing the CRPVb LRR/E6/E7 region showed an early regression, whereas warts induced by CRPVa4 or a chimeric genome containing the CRPVa4 LRR/E6/E7 region persisted and evolved into carcinomas. In contrast, most CRPVa, CRPVb, and chimeric CRPV DNA-induced warts showed no early regression in rabbits homozygous for another DRA-DQA haplotype. Little, if any, viral replication is usually observed in domestic rabbit warts. When warts induced by CRPVa and CRPVb virions and DNA were compared, the number of cells positive for viral DNA or capsid antigens was found to be greater by 1 order of magnitude for specimens induced by CRPVb. Thus, both sequence variation in the LRR/E6/E7 region and the genetic constitution of the host influence the expression of the oncogenic potential of CRPV. Furthermore, intratype variation may overcome to some extent the host restriction of CRPV replication in domestic rabbits. PMID:11044121
An Engineered Kinetic Amplification Mechanism for Single Nucleotide Variant Discrimination by DNA Hybridization Probes.

PubMed

Chen, Sherry Xi; Seelig, Georg

2016-04-20

Even a single-nucleotide difference between the sequences of two otherwise identical biological nucleic acids can have dramatic functional consequences. Here, we use model-guided reaction pathway engineering to quantitatively improve the performance of selective hybridization probes in recognizing single nucleotide variants (SNVs). Specifically, we build a detection system that combines discrimination by competition with DNA strand displacement-based catalytic amplification. We show, both mathematically and experimentally, that the single nucleotide selectivity of such a system in binding to single-stranded DNA and RNA is quadratically better than discrimination due to competitive hybridization alone. As an additional benefit the integrated circuit inherits the property of amplification and provides at least 10-fold better sensitivity than standard hybridization probes. Moreover, we demonstrate how the detection mechanism can be tuned such that the detection reaction is agnostic to the position of the SNV within the target sequence. in contrast, prior strand displacement-based probes designed for kinetic discrimination are highly sensitive to position effects. We apply our system to reliably discriminate between different members of the let-7 microRNA family that differ in only a single base position. Our results demonstrate the power of systematic reaction network design to quantitatively improve biotechnology.
One-carbon metabolism and nucleotide biosynthesis as attractive targets for anticancer therapy

PubMed Central

Shuvalov, Oleg; Petukhov, Alexey; Daks, Alexandra; Fedorova, Olga; Vasileva, Elena; Barlev, Nickolai A.

2017-01-01

Cancer-related metabolism has recently emerged as one of the “hallmarks of cancer”. It has several important features, including altered metabolism of glucose and glutamine. Importantly, altered cancer metabolism connects different biochemical pathways into the one fine-tuned metabolic network, which stimulates high proliferation rates and plasticity to malignant cells. Among the keystones of cancer metabolism are one-carbon metabolism and nucleotide biosynthesis, which provide building blocks to anabolic reactions. Accordingly, the importance of these metabolic pathways for anticancer therapy has well been documented by more than fifty years of clinical use of specific metabolic inhibitors – methotrexate and nucleotides analogs. In this review we discuss one-carbon metabolism and nucleotide biosynthesis as common and specific features of many, if not all, tumors. The key enzymes involved in these pathways also represent promising anti-cancer therapeutic targets. We review different aspects of these metabolic pathways including their biochemistry, compartmentalization and expression of the key enzymes and their regulation at different levels. We also discuss the effects of known inhibitors of these pathways as well as the recent data on other enzymes of the same pathways as perspective pharmacological targets. PMID:28177894
Computed Energetics of Nucleotides in Spatial Ribozyme Structures: An Accurate Identification of Functional Regions from Structure

PubMed Central

Torshin, Ivan Y.

2004-01-01

Ribozymes are functionally diverse RNA molecules with intrinsic catalytic activity. Multiple structural and biochemical studies are required to establish which nucleotide bases are involved in the catalysis. The relative energetic properties of the nucleotide bases have been analyzed in a set of the known ribozyme structures. It was found that many of the known catalytic nucleotides can be identified using only the structure without any additional biochemical data. The results of the calculations compare well with the available biochemical data on RNA stability. Extensive in silico mutagenesis suggests that most of the nucleotides in ribozymes stabilize the RNA. The calculations show that relative contribution of the catalytic bases to RNA stability observably differs from contributions of the noncatalytic bases. Distinction between the concepts of “relative stability” and “mutational stability” is suggested. As results of prediction for several models of ribozymes appear to be in agreement with the published data on the potential active site regions, the method can potentially be used for prediction of functional nucleotides from nucleic sequence. PMID:15105962
[Experiment study on ultrashort wave for treating vascular crisis after rat tail replantation].

PubMed

Tan, Long; Gao, Wenshan; Xi, Ali; Wang, Cong; Chen, Shouying; Zhao, Yanyan; Di, Keqian; Yang, Xincai; Weng, Shengbin

2012-10-01

To explore the effect and mechanism of ultrashort wave (USW) for prevention and treatment of vascular crisis after rat tail replantation. Eighty 3-month old female Sprague Dawley rats (weighing 232.8-289.6 g) were randomly divided into 5 groups. In each group, based on the caudal vein and the coccyx was retained, the tail was cut off. The tail artery was ligated in group A; the tail artery was anastomosed in groups B, C, D, and E to establish the tail replantation model. After surgery, the rats of group B were given normal management; the rats of group C were immediately given intraperitoneal injection (3.125 mL/kg) of diluted papaverine hydrochloride injection (1 mg/mL); the rats of groups D and E were immediately given the local USW treatment (once a day) at anastomotic site for 5 days at the dosage of 3 files and 50 mA for 20 minutes (group D) and 2 files and 28 mA for 20 minutes (group E). The survival rate of the rat tails was observed for 10 days after the tail replantation. The tail skin temperature difference between proximal and distal anastomosis was measured at pre- and post-operation; the change between postoperative and preoperative temperature difference was calculated. The blood plasma specimens were collected from the inner canthus before operation and from the tip of the tail at 8 hours after operation to measure the content of nitric oxide (NO). The survival rates of the rat tails were 0 (0/14), 36.4% (8/22), 57.1% (8/14), 22.2% (4/18), and 75.0% (9/12) in groups A, B, C, D, and E, respectively, showing significant overall differences among 5 groups (chi2 = 19.935, P = 0.001); the survival rate of group E was significantly higher than that of group B at 7 days (P < 0.05), but no significant difference was found between the other groups by pairwise comparison (P > 0.05). At preoperation, there was no significant difference in tail skin temperature difference among 5 groups (P > 0.05); at 8 hours, 5 days, 6 days, and 7 days after operation, significant overall difference was found in the change of the skin temperature difference among groups (P < 0.05); pairwise comparison showed significant differences after operation (P < 0.05): group B > group D at 8 hours, group C > group D at 5 days, groups A, B, and C > group D at 6 days, groups B and C > groups A and E, and group B > group D at 7 days; but no significant difference was found between the other groups at the other time points (P > 0.05). Preoperative plasma NO content between each group had no significant difference (P > 0.05). The overall differences had significance in the NO content at postopoerative 8 hours and in the change of the NO content at pre- and post-operation among groups (P < 0.05). Significant differences were found by pairwise comparison (P < 0.05): group D > groups A, B, and C in the plasma NO content, group D > groups A and B in the change of the NO content at pre- and post-operation; but no significant difference was found between the other groups by pairwise comparison (P > 0.05). Rat tail replantation model in this experiment is feasible. USW therapy can increase the survival rate of replanted rat tails, reduce skin temperature at 7 days, improve blood supply, increase the content of nitric oxide at the early period and prevent vascular crisis.
Relationship of host recurrence in fungi to rates of tropical leaf decomposition

Treesearch

Mirna E. Santanaa; JeanD. Lodgeb; Patricia Lebowc

2004-01-01

Here we explore the significance of fungal diversity on ecosystem processes by testing whether microfungal âpreferencesâ for (i.e., host recurrence) different tropical leaf species increases the rate of decomposition. We used pairwise combinations of girradiated litter of five tree species with cultures of two dominant microfungi derived from each plant in a microcosm...
TSP Symposium 2012 Proceedings

DTIC Science & Technology

2012-11-01

and Statistical Model 78 7.3 Analysis and Results 79 7.4 Threats to Validity and Limitations 85 7.5 Conclusions 86 7.6 Acknowledgments 87 7.7...Table 12: Overall Statistics of the Experiment 32 Table 13: Results of Pairwise ANOVA Analysis, Highlighting Statistically Significant Differences...we calculated the percentage of defects injected. The distribution statistics are shown in Table 2. Table 2: Mean Lower, Upper Confidence Interval
Relationship of host recurrence in fungi to rates of tropical leaf decomposition

Treesearch

Mirna E. Santana; D. Jean Lodge; Patricia Lebow

2005-01-01

Here we explore the significance of fungal diversity on ecosystem processes by testing whether microfungal âpreferencesâ for (i.e., host recurrence) different tropical leaf species increases the rate of decomposition. We used pairwise combinations of [gamma]-irradiated litter of five tree species with cultures of two dominant microfungi derived from each plant in a...
Electric Field Reconstruction in the Image Plane of a High-Contrast Coronagraph Using a Set of Pinholes Around the Lyot Plane

NASA Technical Reports Server (NTRS)

Giveon, Amir; Kern, Brian; Shaklan, Stuart; Wallace, Kent; Noecker, Charley

2012-01-01

The pair-wise estimation has been used now on various testbeds with different coronagraphs with the best contrast results to date. Pinholes estimate has been implemented and ready to be tested in closed loop correction. Pinholes estimate offers an independent method. We hope to improve the calibration process to gain better estimates.
Modularity, pollination systems, and interaction turnover in plant-pollinator networks across space.

PubMed

Carstensen, Daniel W; Sabatino, Malena; Morellato, Leonor Patricia C

2016-05-01

Mutualistic interaction networks have been shown to be structurally conserved over space and time while pairwise interactions show high variability. In such networks, modularity is the division of species into compartments, or modules, where species within modules share more interactions with each other than they do with species from other modules. Such a modular structure is common in mutualistic networks and several evolutionary and ecological mechanisms have been proposed as underlying drivers. One prominent explanation is the existence of pollination syndromes where flowers tend to attract certain pollinators as determined by a set of traits. We investigate the modularity of seven community level plant-pollinator networks sampled in rupestrian grasslands, or campos rupestres, in SE Brazil. Defining pollination systems as corresponding groups of flower syndromes and pollinator functional groups, we test the two hypotheses that (1) interacting species from the same pollination system are more often assigned to the same module than interacting species from different pollination systems and; that (2) interactions between species from the same pollination system are more consistent across space than interactions between species from different pollination systems. Specifically we ask (1) whether networks are consistently modular across space; (2) whether interactions among species of the same pollination system occur more often inside modules, compared to interactions among species of different pollination systems, and finally; (3) whether the spatial variation in interaction identity, i.e., spatial interaction rewiring, is affected by trait complementarity among species as indicated by pollination systems. We confirm that networks are consistently modular across space and that interactions within pollination systems principally occur inside modules. Despite a strong tendency, we did not find a significant effect of pollination systems on the spatial consistency of pairwise interactions. These results indicate that the spatial rewiring of interactions could be constrained by pollination systems, resulting in conserved network structures in spite of high variation in pairwise interactions. Our findings suggest a relevant role of pollination systems in structuring plant-pollinator networks and we argue that structural patterns at the sub-network level can help us to fully understand how and why interactions vary across space and time.
Nucleic acid analysis using terminal-phosphate-labeled nucleotides

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2008-04-22

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Leaf habit does not determine the investment in both physical and chemical defences and pair-wise correlations between these defensive traits.

PubMed

Moreira, X; Pearse, I S

2017-05-01

Plant life-history strategies associated with resource acquisition and economics (e.g. leaf habit) are thought to be fundamental determinants of the traits and mechanisms that drive herbivore pressure, resource allocation to plant defensive traits, and the simultaneous expression (positive correlations) or trade-offs (negative correlations) between these defensive traits. In particular, it is expected that evergreen species - which usually grow slower and support constant herbivore pressure in comparison with deciduous species - will exhibit higher levels of both physical and chemical defences and a higher predisposition to the simultaneous expression of physical and chemical defensive traits. Here, by using a dataset which included 56 oak species (Quercus genus), we investigated whether leaf habit of plant species governs the investment in both physical and chemical defences and pair-wise correlations between these defensive traits. Our results showed that leaf habit does not determine the production of most leaf physical and chemical defences. Although evergreen oak species had higher levels of leaf toughness and specific leaf mass (physical defences) than deciduous oak species, both traits are essentially prerequisites for evergreenness. Similarly, our results also showed that leaf habit does not determine pair-wise correlations between defensive traits because most physical and chemical defensive traits were simultaneously expressed in both evergreen and deciduous oak species. Our findings indicate that leaf habit does not substantially contribute to oak species differences in plant defence investment. © 2017 German Botanical Society and The Royal Botanical Society of the Netherlands.
An enhanced Petri-net model to predict synergistic effects of pairwise drug combinations from gene microarray data.

PubMed

Jin, Guangxu; Zhao, Hong; Zhou, Xiaobo; Wong, Stephen T C

2011-07-01

Prediction of synergistic effects of drug combinations has traditionally been relied on phenotypic response data. However, such methods cannot be used to identify molecular signaling mechanisms of synergistic drug combinations. In this article, we propose an enhanced Petri-Net (EPN) model to recognize the synergistic effects of drug combinations from the molecular response profiles, i.e. drug-treated microarray data. We addressed the downstream signaling network of the targets for the two individual drugs used in the pairwise combinations and applied EPN to the identified targeted signaling network. In EPN, drugs and signaling molecules are assigned to different types of places, while drug doses and molecular expressions are denoted by color tokens. The changes of molecular expressions caused by treatments of drugs are simulated by two actions of EPN: firing and blasting. Firing is to transit the drug and molecule tokens from one node or place to another, and blasting is to reduce the number of molecule tokens by drug tokens in a molecule node. The goal of EPN is to mediate the state characterized by control condition without any treatment to that of treatment and to depict the drug effects on molecules by the drug tokens. We applied EPN to our generated pairwise drug combination microarray data. The synergistic predictions using EPN are consistent with those predicted using phenotypic response data. The molecules responsible for the synergistic effects with their associated feedback loops display the mechanisms of synergism. The software implemented in Python 2.7 programming language is available from request. stwong@tmhs.org.
Template-based protein-protein docking exploiting pairwise interfacial residue restraints.

PubMed

Xue, Li C; Rodrigues, João P G L M; Dobbs, Drena; Honavar, Vasant; Bonvin, Alexandre M J J

2017-05-01

Although many advanced and sophisticated ab initio approaches for modeling protein-protein complexes have been proposed in past decades, template-based modeling (TBM) remains the most accurate and widely used approach, given a reliable template is available. However, there are many different ways to exploit template information in the modeling process. Here, we systematically evaluate and benchmark a TBM method that uses conserved interfacial residue pairs as docking distance restraints [referred to as alpha carbon-alpha carbon (CA-CA)-guided docking]. We compare it with two other template-based protein-protein modeling approaches, including a conserved non-pairwise interfacial residue restrained docking approach [referred to as the ambiguous interaction restraint (AIR)-guided docking] and a simple superposition-based modeling approach. Our results show that, for most cases, the CA-CA-guided docking method outperforms both superposition with refinement and the AIR-guided docking method. We emphasize the superiority of the CA-CA-guided docking on cases with medium to large conformational changes, and interactions mediated through loops, tails or disordered regions. Our results also underscore the importance of a proper refinement of superimposition models to reduce steric clashes. In summary, we provide a benchmarked TBM protocol that uses conserved pairwise interface distance as restraints in generating realistic 3D protein-protein interaction models, when reliable templates are available. The described CA-CA-guided docking protocol is based on the HADDOCK platform, which allows users to incorporate additional prior knowledge of the target system to further improve the quality of the resulting models. © The Author 2016. Published by Oxford University Press.
Delineating slowly and rapidly evolving fractions of the Drosophila genome.

PubMed

Keith, Jonathan M; Adams, Peter; Stephen, Stuart; Mattick, John S

2008-05-01

Evolutionary conservation is an important indicator of function and a major component of bioinformatic methods to identify non-protein-coding genes. We present a new Bayesian method for segmenting pairwise alignments of eukaryotic genomes while simultaneously classifying segments into slowly and rapidly evolving fractions. We also describe an information criterion similar to the Akaike Information Criterion (AIC) for determining the number of classes. Working with pairwise alignments enables detection of differences in conservation patterns among closely related species. We analyzed three whole-genome and three partial-genome pairwise alignments among eight Drosophila species. Three distinct classes of conservation level were detected. Sequences comprising the most slowly evolving component were consistent across a range of species pairs, and constituted approximately 62-66% of the D. melanogaster genome. Almost all (>90%) of the aligned protein-coding sequence is in this fraction, suggesting much of it (comprising the majority of the Drosophila genome, including approximately 56% of non-protein-coding sequences) is functional. The size and content of the most rapidly evolving component was species dependent, and varied from 1.6% to 4.8%. This fraction is also enriched for protein-coding sequence (while containing significant amounts of non-protein-coding sequence), suggesting it is under positive selection. We also classified segments according to conservation and GC content simultaneously. This analysis identified numerous sub-classes of those identified on the basis of conservation alone, but was nevertheless consistent with that classification. Software, data, and results available at www.maths.qut.edu.au/-keithj/. Genomic segments comprising the conservation classes available in BED format.
Estimating Seven Coefficients of Pairwise Relatedness Using Population-Genomic Data

PubMed Central

Ackerman, Matthew S.; Johri, Parul; Spitze, Ken; Xu, Sen; Doak, Thomas G.; Young, Kimberly; Lynch, Michael

2017-01-01

Population structure can be described by genotypic-correlation coefficients between groups of individuals, the most basic of which are the pairwise relatedness coefficients between any two individuals. There are nine pairwise relatedness coefficients in the most general model, and we show that these can be reduced to seven coefficients for biallelic loci. Although all nine coefficients can be estimated from pedigrees, six coefficients have been beyond empirical reach. We provide a numerical optimization procedure that estimates all seven reduced coefficients from population-genomic data. Simulations show that the procedure is nearly unbiased, even at 3× coverage, and errors in five of the seven coefficients are statistically uncorrelated. The remaining two coefficients have a negative correlation of errors, but their sum provides an unbiased assessment of the overall correlation of heterozygosity between two individuals. Application of these new methods to four populations of the freshwater crustacean Daphnia pulex reveal the occurrence of half siblings in our samples, as well as a number of identical individuals that are likely obligately asexual clone mates. Statistically significant negative estimates of these pairwise relatedness coefficients, including inbreeding coefficients that were typically negative, underscore the difficulties that arise when interpreting genotypic correlations as estimations of the probability that alleles are identical by descent. PMID:28341647
Ensemble survival tree models to reveal pairwise interactions of variables with time-to-events outcomes in low-dimensional setting

PubMed Central

Dazard, Jean-Eudes; Ishwaran, Hemant; Mehlotra, Rajeev; Weinberg, Aaron; Zimmerman, Peter

2018-01-01

Unraveling interactions among variables such as genetic, clinical, demographic and environmental factors is essential to understand the development of common and complex diseases. To increase the power to detect such variables interactions associated with clinical time-to-events outcomes, we borrowed established concepts from random survival forest (RSF) models. We introduce a novel RSF-based pairwise interaction estimator and derive a randomization method with bootstrap confidence intervals for inferring interaction significance. Using various linear and nonlinear time-to-events survival models in simulation studies, we first show the efficiency of our approach: true pairwise interaction-effects between variables are uncovered, while they may not be accompanied with their corresponding main-effects, and may not be detected by standard semi-parametric regression modeling and test statistics used in survival analysis. Moreover, using a RSF-based cross-validation scheme for generating prediction estimators, we show that informative predictors may be inferred. We applied our approach to an HIV cohort study recording key host gene polymorphisms and their association with HIV change of tropism or AIDS progression. Altogether, this shows how linear or nonlinear pairwise statistical interactions of variables may be efficiently detected with a predictive value in observational studies with time-to-event outcomes. PMID:29453930
Detection of the pairwise kinematic Sunyaev-Zel'dovich effect with BOSS DR11 and the Atacama Cosmology Telescope

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bernardis, F. De; Aiola, S.; Vavagiakis, E. M.

Here, we present a new measurement of the kinematic Sunyaev-Zel'dovich effect using data from the Atacama Cosmology Telescope (ACT) and the Baryon Oscillation Spectroscopic Survey (BOSS). Using 600 square degrees of overlapping sky area, we evaluate the mean pairwise baryon momentum associated with the positions of 50,000 bright galaxies in the BOSS DR11 Large Scale Structure catalog. A non-zero signal arises from the large-scale motions of halos containing the sample galaxies. The data fits an analytical signal model well, with the optical depth to microwave photon scattering as a free parameter determining the overall signal amplitude. We estimate the covariancemore » matrix of the mean pairwise momentum as a function of galaxy separation, using microwave sky simulations, jackknife evaluation, and bootstrap estimates. The most conservative simulation-based errors give signal-to-noise estimates between 3.6 and 4.1 for varying galaxy luminosity cuts. We discuss how the other error determinations can lead to higher signal-to-noise values, and consider the impact of several possible systematic errors. Estimates of the optical depth from the average thermal Sunyaev-Zel'dovich signal at the sample galaxy positions are broadly consistent with those obtained from the mean pairwise momentum signal.« less

Detection of the pairwise kinematic Sunyaev-Zel'dovich effect with BOSS DR11 and the Atacama Cosmology Telescope

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bernardis, F. De; Vavagiakis, E.M.; Niemack, M.D.

We present a new measurement of the kinematic Sunyaev-Zel'dovich effect using data from the Atacama Cosmology Telescope (ACT) and the Baryon Oscillation Spectroscopic Survey (BOSS). Using 600 square degrees of overlapping sky area, we evaluate the mean pairwise baryon momentum associated with the positions of 50,000 bright galaxies in the BOSS DR11 Large Scale Structure catalog. A non-zero signal arises from the large-scale motions of halos containing the sample galaxies. The data fits an analytical signal model well, with the optical depth to microwave photon scattering as a free parameter determining the overall signal amplitude. We estimate the covariance matrixmore » of the mean pairwise momentum as a function of galaxy separation, using microwave sky simulations, jackknife evaluation, and bootstrap estimates. The most conservative simulation-based errors give signal-to-noise estimates between 3.6 and 4.1 for varying galaxy luminosity cuts. We discuss how the other error determinations can lead to higher signal-to-noise values, and consider the impact of several possible systematic errors. Estimates of the optical depth from the average thermal Sunyaev-Zel'dovich signal at the sample galaxy positions are broadly consistent with those obtained from the mean pairwise momentum signal.« less
Detection of the Pairwise Kinematic Sunyaev-Zel'dovich Effect with BOSS DR11 and the Atacama Cosmology Telescope

NASA Technical Reports Server (NTRS)

De Bernardis, F.; Aiola, S.; Vavagiakis, E. M.; Battaglia, N.; Niemack, M. D.; Beall, J.; Becker, D. T.; Bond, J. R.; Calabrese, E.; Cho, H.;

2017-01-01

We present a new measurement of the kinematic Sunyaev-Zel'dovich effect using data from the Atacama Cosmology Telescope (ACT) and the Baryon Oscillation Spectroscopic Survey (BOSS). Using 600 square degrees of overlapping sky area, we evaluate the mean pairwise baryon momentum associated with the positions of 50,000 bright galaxies in the BOSS DR11 Large Scale Structure catalog. A non-zero signal arises from the large-scale motions of halos containing the sample galaxies. The data fits an analytical signal model well, with the optical depth to microwave photon scattering as a free parameter determining the overall signal amplitude. We estimate the covariance matrix of the mean pairwise momentum as a function of galaxy separation, using microwave sky simulations, jackknife evaluation, and bootstrap estimates. The most conservative simulation-based errors give signal-to-noise estimates between 3.6 and 4.1 for varying galaxy luminosity cuts. We discuss how the other error determinations can lead to higher signal-to-noise values, and consider the impact of several possible systematic errors. Estimates of the optical depth from the average thermal Sunyaev-Zel'dovich signal at the sample galaxy positions are broadly consistent with those obtained from the mean pairwise momentum signal.

Detection of the pairwise kinematic Sunyaev-Zel'dovich effect with BOSS DR11 and the Atacama Cosmology Telescope

NASA Astrophysics Data System (ADS)

De Bernardis, F.; Aiola, S.; Vavagiakis, E. M.; Battaglia, N.; Niemack, M. D.; Beall, J.; Becker, D. T.; Bond, J. R.; Calabrese, E.; Cho, H.; Coughlin, K.; Datta, R.; Devlin, M.; Dunkley, J.; Dunner, R.; Ferraro, S.; Fox, A.; Gallardo, P. A.; Halpern, M.; Hand, N.; Hasselfield, M.; Henderson, S. W.; Hill, J. C.; Hilton, G. C.; Hilton, M.; Hincks, A. D.; Hlozek, R.; Hubmayr, J.; Huffenberger, K.; Hughes, J. P.; Irwin, K. D.; Koopman, B. J.; Kosowsky, A.; Li, D.; Louis, T.; Lungu, M.; Madhavacheril, M. S.; Maurin, L.; McMahon, J.; Moodley, K.; Naess, S.; Nati, F.; Newburgh, L.; Nibarger, J. P.; Page, L. A.; Partridge, B.; Schaan, E.; Schmitt, B. L.; Sehgal, N.; Sievers, J.; Simon, S. M.; Spergel, D. N.; Staggs, S. T.; Stevens, J. R.; Thornton, R. J.; van Engelen, A.; Van Lanen, J.; Wollack, E. J.

2017-03-01

We present a new measurement of the kinematic Sunyaev-Zel'dovich effect using data from the Atacama Cosmology Telescope (ACT) and the Baryon Oscillation Spectroscopic Survey (BOSS). Using 600 square degrees of overlapping sky area, we evaluate the mean pairwise baryon momentum associated with the positions of 50,000 bright galaxies in the BOSS DR11 Large Scale Structure catalog. A non-zero signal arises from the large-scale motions of halos containing the sample galaxies. The data fits an analytical signal model well, with the optical depth to microwave photon scattering as a free parameter determining the overall signal amplitude. We estimate the covariance matrix of the mean pairwise momentum as a function of galaxy separation, using microwave sky simulations, jackknife evaluation, and bootstrap estimates. The most conservative simulation-based errors give signal-to-noise estimates between 3.6 and 4.1 for varying galaxy luminosity cuts. We discuss how the other error determinations can lead to higher signal-to-noise values, and consider the impact of several possible systematic errors. Estimates of the optical depth from the average thermal Sunyaev-Zel'dovich signal at the sample galaxy positions are broadly consistent with those obtained from the mean pairwise momentum signal.
Ensemble survival tree models to reveal pairwise interactions of variables with time-to-events outcomes in low-dimensional setting.

PubMed

Dazard, Jean-Eudes; Ishwaran, Hemant; Mehlotra, Rajeev; Weinberg, Aaron; Zimmerman, Peter

2018-02-17

Unraveling interactions among variables such as genetic, clinical, demographic and environmental factors is essential to understand the development of common and complex diseases. To increase the power to detect such variables interactions associated with clinical time-to-events outcomes, we borrowed established concepts from random survival forest (RSF) models. We introduce a novel RSF-based pairwise interaction estimator and derive a randomization method with bootstrap confidence intervals for inferring interaction significance. Using various linear and nonlinear time-to-events survival models in simulation studies, we first show the efficiency of our approach: true pairwise interaction-effects between variables are uncovered, while they may not be accompanied with their corresponding main-effects, and may not be detected by standard semi-parametric regression modeling and test statistics used in survival analysis. Moreover, using a RSF-based cross-validation scheme for generating prediction estimators, we show that informative predictors may be inferred. We applied our approach to an HIV cohort study recording key host gene polymorphisms and their association with HIV change of tropism or AIDS progression. Altogether, this shows how linear or nonlinear pairwise statistical interactions of variables may be efficiently detected with a predictive value in observational studies with time-to-event outcomes.
Detection of the pairwise kinematic Sunyaev-Zel'dovich effect with BOSS DR11 and the Atacama Cosmology Telescope

DOE PAGES

Bernardis, F. De; Aiola, S.; Vavagiakis, E. M.; ...

2017-03-07

Here, we present a new measurement of the kinematic Sunyaev-Zel'dovich effect using data from the Atacama Cosmology Telescope (ACT) and the Baryon Oscillation Spectroscopic Survey (BOSS). Using 600 square degrees of overlapping sky area, we evaluate the mean pairwise baryon momentum associated with the positions of 50,000 bright galaxies in the BOSS DR11 Large Scale Structure catalog. A non-zero signal arises from the large-scale motions of halos containing the sample galaxies. The data fits an analytical signal model well, with the optical depth to microwave photon scattering as a free parameter determining the overall signal amplitude. We estimate the covariancemore » matrix of the mean pairwise momentum as a function of galaxy separation, using microwave sky simulations, jackknife evaluation, and bootstrap estimates. The most conservative simulation-based errors give signal-to-noise estimates between 3.6 and 4.1 for varying galaxy luminosity cuts. We discuss how the other error determinations can lead to higher signal-to-noise values, and consider the impact of several possible systematic errors. Estimates of the optical depth from the average thermal Sunyaev-Zel'dovich signal at the sample galaxy positions are broadly consistent with those obtained from the mean pairwise momentum signal.« less
Uncovering drug-responsive regulatory elements

PubMed Central

Luizon, Marcelo R; Ahituv, Nadav

2015-01-01

Nucleotide changes in gene regulatory elements can have a major effect on interindividual differences in drug response. For example, by reviewing all published pharmacogenomic genome-wide association studies, we show here that 96.4% of the associated single nucleotide polymorphisms reside in noncoding regions. We discuss how sequencing technologies are improving our ability to identify drug response-associated regulatory elements genome-wide and to annotate nucleotide variants within them. We highlight specific examples of how nucleotide changes in these elements can affect drug response and illustrate the techniques used to find them and functionally characterize them. Finally, we also discuss challenges in the field of drug-responsive regulatory elements that need to be considered in order to translate these findings into the clinic. PMID:26555224
A novel representation of the conformational structure of transfer RNAs. Correlation of the folding patterns of the polynucleotide chain with the base sequence and the nucleotide backbone torsions.

PubMed Central

Srinivasan, A R; Yathindra, N

1977-01-01

A novel description of the conformational characteristics of all the individual nucleotides and the phosphodiesters in tRNAs is presented in the form of a circular plot. This representation furnishes information of the base sequence with the folding patterns of the polynucleotide chain as one traverses along the circumference and with the individual nucleotide and phosphodiester linkage torsions along the radii. The circular plot obtained for yeast tRNAPhe strikingly distinguishes the helical and the loop regions. The variation of the different nucleotide torsions along the entire chain length and their effect on the secondary helical and tertiary loop regions become readily apparent. PMID:339206
Comparing and explaining differences in the magnitude, content, and sensitivity of utilities predicted by the EQ-5D, SF-6D, HUI 3, 15D, QWB, and AQoL-8D multiattribute utility instruments.

PubMed

Richardson, Jeff; Khan, Munir A; Iezzi, Angelo; Maxwell, Aimee

2015-04-01

Cost utility analysis permits the comparison of disparate health services by measuring outcomes in comparable units, namely, quality-adjusted life-years, which equal life-years times the utility of the health state. However, comparability is compromised when different utility instruments predict different utilities for the same health state. The present paper measures the extent of, and reason for, differences between the utilities predicted by the EQ-5D-5L, SF-6D, HUI 3, 15D, QWB, and AQoL-8D. Data were obtained from patients in seven disease areas and members of the healthy public in six countries. Differences between public and patient utilities were estimated using each of the instruments. To explain discrepancies between the estimates, the measurement scales and content of the instruments were compared. The sensitivity of instruments to independently measured health dimensions was measured in pairwise comparisons of all combinations of the instruments. The difference between public and patient utilities varied with the choice of instrument by more than 50% for every disease group and in four of the seven groups by more than 100%. Discrepancies were associated with differences in both the instrument content and their measurement scales. Pairwise comparisons of instruments found that variation in the sensitivity to physical and psychosocial dimensions of health closely reflected the items in the instrument's descriptive systems. Results indicate that instruments measure related but different constructs. They imply that commonly used instruments systematically discriminate against some classes of services, most notably mental health services. Differences in the instrument scales imply the need for transformations between the instruments to increase the comparability of measurement. © The Author(s) 2014.
Substrate-specifying determinants of the nucleotide pyrophosphatases/phosphodiesterases NPP1 and NPP2

PubMed Central

2004-01-01

The nucleotide pyrophosphatases/phosphodiesterases NPP1 and NPP2/autotaxin are structurally related eukaryotic ecto-enzymes, but display a very different substrate specificity. NPP1 releases nucleoside 5′-monophosphates from various nucleotides, whereas NPP2 mainly functions as a lysophospholipase D. We have used a domain-swapping approach to map substrate-specifying determinants of NPP1 and NPP2. The catalytic domain of NPP1 fused to the N- and C-terminal domains of NPP2 was hyperactive as a nucleotide phosphodiesterase, but did not show any lysophospholipase D activity. In contrast, chimaeras of the catalytic domain of NPP2 and the N- and/or C-terminal domains of NPP1 were completely inactive. These data indicate that the catalytic domain as well as both extremities of NPP2 contain lysophospholipid-specifying sequences. Within the catalytic domain of NPP1 and NPP2, we have mapped residues close to the catalytic site that determine the activities towards nucleotides and lysophospholipids. We also show that the conserved Gly/Phe-Xaa-Gly-Xaa-Xaa-Gly (G/FXGXXG) motif near the catalytic site is required for metal binding, but is not involved in substrate-specification. Our data suggest that the distinct activities of NPP1 and NPP2 stem from multiple differences throughout the polypeptide chain. PMID:15096095
TATA Binding Protein Discriminates between Different Lesions on DNA, Resulting in a Transcription Decrease

PubMed Central

Coin, Frédéric; Frit, Philippe; Viollet, Benoit; Salles, Bernard; Egly, Jean-Marc

1998-01-01

DNA damage recognition by basal transcription factors follows different mechanisms. Using transcription-competition, nitrocellulose filter binding, and DNase I footprinting assays, we show that, although the general transcription factor TFIIH is able to target any kind of lesion which can be repaired by the nucleotide excision repair pathway, TATA binding protein (TBP)-TFIID is more selective in damage recognition. Only genotoxic agents which are able to induce kinked DNA structures similar to the one for the TATA box in its TBP complex are recognized. Indeed, DNase I footprinting patterns reveal that TBP protects equally 4 nucleotides upstream and 6 nucleotides downstream from the A-T (at position −29 of the noncoding strand) of the adenovirus major late promoter and from the G-G of a cisplatin-induced 1,2-d(GpG) cross-link. Together, our results may partially explain differences in transcription inhibition rates following DNA damage. PMID:9632775
Probing the stabilizing effects of modified nucleotides in the bacterial decoding region of 16S ribosomal RNA

PubMed Central

Mahto, Santosh K.

2013-01-01

The bacterial decoding region of 16S ribosomal RNA has multiple modified nucleotides. In order to study the role of N4,2′-O-dimethylcytidine (m4Cm), the corresponding phosphoramidite was synthesized utilizing 5′-silyl-2′-ACE chemistry. Using solid-phase synthesis, m4Cm, 5-methylcytidine (m5C), 3-methyluridine (m3U), and 2′-O-methylcytidine (Cm) were site-specifically incorporated into small RNAs representing the decoding regions of different bacterial species. Biophysical studies were then used to provide insight into the stabilizing roles of the modified nucleotides. These studies reveal that methylation of cytidine and uridine has different effects. The same modifications at different positions or sequence contexts within similar RNA constructs also have contrasting roles, such as stabilizing or destabilizing the RNA helix. PMID:23566761
Three new HLA-C alleles (HLA-C*14:02:13, HLA-C*15:72 and HLA-C*15:74) in Saudi bone marrow donors.

PubMed

Fakhoury, H A; Jawdat, D; Alaskar, A S; Al Jumah, M; Cereb, N; Hajeer, A H

2015-10-01

Three new HLA-C alleles were identified by sequence-based typing method (SBT) in donors for the Saudi Bone Marrow Donor Registry (SBMDR). HLA-C*14:02:13 differs from HLA-C*14:02:01 by a silent G to A substitution at nucleotide position 400 in exon 2, where lysine at position 66 remains unchanged. HLA-C*15:72 differs from HLA-C*15:22 by a nonsynonymous C to A substitution at nucleotide position 796 in exon 3, resulting in an amino acid change from phenylalanine to leucine at position 116. HLA-C*15:74 differs from HLA-C*15:08 by a nonsynonymous C to T substitution at nucleotide position 914 in exon 3, resulting in an amino acid change from arginine to tryptophan at position 156. © 2015 John Wiley & Sons Ltd.
Composition for nucleic acid sequencing

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2008-08-26

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Method for sequencing nucleic acid molecules

DOEpatents

Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

2006-06-06

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Method for sequencing nucleic acid molecules

DOEpatents

Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

2006-05-30

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
The complete mitochondrial genome of dhole Cuon alpinus: phylogenetic analysis and dating evolutionary divergence within Canidae.

PubMed

Zhang, Honghai; Chen, Lei

2011-03-01

The dhole (Cuon alpinus) is the only existent species in the genus Cuon (Carnivora: Canidae). In the present study, the complete mitochondrial genome of the dhole was sequenced. The total length is 16672 base pairs which is the shortest in Canidae. Sequence analysis revealed that most mitochondrial genomic functional regions were highly consistent among canid animals except the CSB domain of the control region. The difference in length among the Canidae mitochondrial genome sequences is mainly due to the number of short segments of tandem repeated in the CSB domain. Phylogenetic analysis was progressed based on the concatenated data set of 14 mitochondrial genes of 8 canid animals by using maximum parsimony (MP), maximum likelihood (ML) and Bayesian (BI) inference methods. The genera Vulpes and Nyctereutes formed a sister group and split first within Canidae, followed by that in the Cuon. The divergence in the genus Canis was the latest. The divarication of domestic dogs after that of the Canis lupus laniger is completely supported by all the three topologies. Pairwise sequence divergence data of different mitochondrial genes among canid animals were also determined. Except for the synonymous substitutions in protein-coding genes, the control region exhibits the highest sequence divergences. The synonymous rates are approximately two to six times higher than those of the non-synonymous sites except for a slightly higher rate in the non-synonymous substitution between Cuon alpinus and Vulpes vulpes. 16S rRNA genes have a slightly faster sequence divergence than 12S rRNA and tRNA genes. Based on nucleotide substitutions of tRNA genes and rRNA genes, the times since divergence between dhole and other canid animals, and between domestic dogs and three subspecies of wolves were evaluated. The result indicates that Vulpes and Nyctereutes have a close phylogenetic relationship and the divergence of Nyctereutes is a little earlier. The Tibetan wolf may be an archaic pedigree within wolf subspecies. The genetic distance between wolves and domestic dogs is less than that among different subspecies of wolves. The domestication of dogs was about 1.56-1.92 million years ago or even earlier.
Classification of complex information: inference of co-occurring affective states from their expressions in speech.

PubMed

Sobol-Shikler, Tal; Robinson, Peter

2010-07-01

We present a classification algorithm for inferring affective states (emotions, mental states, attitudes, and the like) from their nonverbal expressions in speech. It is based on the observations that affective states can occur simultaneously and different sets of vocal features, such as intonation and speech rate, distinguish between nonverbal expressions of different affective states. The input to the inference system was a large set of vocal features and metrics that were extracted from each utterance. The classification algorithm conducted independent pairwise comparisons between nine affective-state groups. The classifier used various subsets of metrics of the vocal features and various classification algorithms for different pairs of affective-state groups. Average classification accuracy of the 36 pairwise machines was 75 percent, using 10-fold cross validation. The comparison results were consolidated into a single ranked list of the nine affective-state groups. This list was the output of the system and represented the inferred combination of co-occurring affective states for the analyzed utterance. The inference accuracy of the combined machine was 83 percent. The system automatically characterized over 500 affective state concepts from the Mind Reading database. The inference of co-occurring affective states was validated by comparing the inferred combinations to the lexical definitions of the labels of the analyzed sentences. The distinguishing capabilities of the system were comparable to human performance.
Organization of Nucleotides in Different Environments and the Formation of Pre-Polymers

NASA Astrophysics Data System (ADS)

Himbert, Sebastian; Chapman, Mindy; Deamer, David W.; Rheinstädter, Maikel C.

2016-08-01

RNA is a linear polymer of nucleotides linked by a ribose-phosphate backbone. Polymerization of nucleotides occurs in a condensation reaction in which phosphodiester bonds are formed. However, in the absence of enzymes and metabolism there has been no obvious way for RNA-like molecules to be produced and then encapsulated in cellular compartments. We investigated 5‧-adenosine monophosphate (AMP) and 5‧-uridine monophosphate (UMP) molecules confined in multi-lamellar phospholipid bilayers, nanoscopic films, ammonium chloride salt crystals and Montmorillonite clay, previously proposed to promote polymerization. X-ray diffraction was used to determine whether such conditions imposed a degree of order on the nucleotides. Two nucleotide signals were observed in all matrices, one corresponding to a nearest neighbour distance of 4.6 Å attributed to nucleotides that form a disordered, glassy structure. A second, smaller distance of 3.4 Å agrees well with the distance between stacked base pairs in the RNA backbone, and was assigned to the formation of pre-polymers, i.e., the organization of nucleotides into stacks of about 10 monomers. Such ordering can provide conditions that promote the nonenzymatic polymerization of RNA strands under prebiotic conditions. Experiments were modeled by Monte-Carlo simulations, which provide details of the molecular structure of these pre-polymers.
Molecular identification of Trichuris vulpis and Trichuris suis isolated from different hosts.

PubMed

Cutillas, Cristina; de Rojas, Manuel; Ariza, Concepción; Ubeda, José Manuel; Guevara, Diego

2007-01-01

Trichuris suis was isolated from the cecum of two different hosts (Sus scrofa domestica -- swine and Sus scrofa scrofa -- wild boar) and Trichuris vulpis from dogs in Sevilla, Spain. Genomic DNA was isolated and internal transcribed spacers (ITS)1-5.8S-ITS2 segment from the ribosomal DNA (rDNA) was amplified and sequenced using polymerase chain reaction techniques. The sequence of T. suis from both hosts was 1,396 bp in length while that of T. vulpis was 1,044 bp. ITS1 of both populations isolated of T. suis was 661 nucleotides in length, while the ITS2 was 534 nucleotides in length. Furthermore, the ITS1 of T. vulpis was 410 nucleotides in length, while the ITS2 was 433 nucleotides in length. One hundred fifty-four nucleotides were observed along the 5.8S gene of T. suis and T. vulpis. Intraindividual and intraspecific variations were detected in the rDNA of both species. The presence of microsatellites was observed in all the individuals assayed. Sequence analysis of the ITSs and the 5.8S gene has demonstrated no sequence differences between T. suis isolated from both hosts (S. scrofa domestica -- swine and S. scrofa scrofa -- wild boar). Nevertheless, clear differences were detected between the ITS1 and ITS2 of T. suis and T. vulpis. Furthermore, a comparative molecular analysis between both species and the previously published ITS1-5.8S-ITS2 sequence data of Trichuris ovis, Trichuris leporis, Trichuris muris, Trichuris arvicolae, and Trichuris skrjabini was carried out. A common homology zone was detected in the ITS1 sequence of all species of trichurids.
High-throughput identification and rational design of synergistic small-molecule pairs for combating and bypassing antibiotic resistance.

PubMed

Wambaugh, Morgan A; Shakya, Viplendra P S; Lewis, Adam J; Mulvey, Matthew A; Brown, Jessica C S

2017-06-01

Antibiotic-resistant infections kill approximately 23,000 people and cost $20,000,000,000 each year in the United States alone despite the widespread use of small-molecule antimicrobial combination therapy. Antibiotic combinations typically have an additive effect: the efficacy of the combination matches the sum of the efficacies of each antibiotic when used alone. Small molecules can also act synergistically when the efficacy of the combination is greater than the additive efficacy. However, synergistic combinations are rare and have been historically difficult to identify. High-throughput identification of synergistic pairs is limited by the scale of potential combinations: a modest collection of 1,000 small molecules involves 1 million pairwise combinations. Here, we describe a high-throughput method for rapid identification of synergistic small-molecule pairs, the overlap2 method (O2M). O2M extracts patterns from chemical-genetic datasets, which are created when a collection of mutants is grown in the presence of hundreds of different small molecules, producing a precise set of phenotypes induced by each small molecule across the mutant set. The identification of mutants that show the same phenotype when treated with known synergistic molecules allows us to pinpoint additional molecule combinations that also act synergistically. As a proof of concept, we focus on combinations with the antibiotics trimethoprim and sulfamethizole, which had been standard treatment against urinary tract infections until widespread resistance decreased efficacy. Using O2M, we screened a library of 2,000 small molecules and identified several that synergize with the antibiotic trimethoprim and/or sulfamethizole. The most potent of these synergistic interactions is with the antiviral drug azidothymidine (AZT). We then demonstrate that understanding the molecular mechanism underlying small-molecule synergistic interactions allows the rational design of additional combinations that bypass drug resistance. Trimethoprim and sulfamethizole are both folate biosynthesis inhibitors. We find that this activity disrupts nucleotide homeostasis, which blocks DNA replication in the presence of AZT. Building on these data, we show that other small molecules that disrupt nucleotide homeostasis through other mechanisms (hydroxyurea and floxuridine) also act synergistically with AZT. These novel combinations inhibit the growth and virulence of trimethoprim-resistant clinical Escherichia coli and Klebsiella pneumoniae isolates, suggesting that they may be able to be rapidly advanced into clinical use. In sum, we present a generalizable method to screen for novel synergistic combinations, to identify particular mechanisms resulting in synergy, and to use the mechanistic knowledge to rationally design new combinations that bypass drug resistance.

High-throughput identification and rational design of synergistic small-molecule pairs for combating and bypassing antibiotic resistance

PubMed Central

Lewis, Adam J.; Mulvey, Matthew A.

2017-01-01

Antibiotic-resistant infections kill approximately 23,000 people and cost $20,000,000,000 each year in the United States alone despite the widespread use of small-molecule antimicrobial combination therapy. Antibiotic combinations typically have an additive effect: the efficacy of the combination matches the sum of the efficacies of each antibiotic when used alone. Small molecules can also act synergistically when the efficacy of the combination is greater than the additive efficacy. However, synergistic combinations are rare and have been historically difficult to identify. High-throughput identification of synergistic pairs is limited by the scale of potential combinations: a modest collection of 1,000 small molecules involves 1 million pairwise combinations. Here, we describe a high-throughput method for rapid identification of synergistic small-molecule pairs, the overlap2 method (O2M). O2M extracts patterns from chemical-genetic datasets, which are created when a collection of mutants is grown in the presence of hundreds of different small molecules, producing a precise set of phenotypes induced by each small molecule across the mutant set. The identification of mutants that show the same phenotype when treated with known synergistic molecules allows us to pinpoint additional molecule combinations that also act synergistically. As a proof of concept, we focus on combinations with the antibiotics trimethoprim and sulfamethizole, which had been standard treatment against urinary tract infections until widespread resistance decreased efficacy. Using O2M, we screened a library of 2,000 small molecules and identified several that synergize with the antibiotic trimethoprim and/or sulfamethizole. The most potent of these synergistic interactions is with the antiviral drug azidothymidine (AZT). We then demonstrate that understanding the molecular mechanism underlying small-molecule synergistic interactions allows the rational design of additional combinations that bypass drug resistance. Trimethoprim and sulfamethizole are both folate biosynthesis inhibitors. We find that this activity disrupts nucleotide homeostasis, which blocks DNA replication in the presence of AZT. Building on these data, we show that other small molecules that disrupt nucleotide homeostasis through other mechanisms (hydroxyurea and floxuridine) also act synergistically with AZT. These novel combinations inhibit the growth and virulence of trimethoprim-resistant clinical Escherichia coli and Klebsiella pneumoniae isolates, suggesting that they may be able to be rapidly advanced into clinical use. In sum, we present a generalizable method to screen for novel synergistic combinations, to identify particular mechanisms resulting in synergy, and to use the mechanistic knowledge to rationally design new combinations that bypass drug resistance. PMID:28632788
Pair-Wise and Many-Body Dispersive Interactions Coupled to an Optimally Tuned Range-Separated Hybrid Functional.

PubMed

Agrawal, Piyush; Tkatchenko, Alexandre; Kronik, Leeor

2013-08-13

We propose a nonempirical, pair-wise or many-body dispersion-corrected, optimally tuned range-separated hybrid functional. This functional retains the advantages of the optimal-tuning approach in the prediction of the electronic structure. At the same time, it gains accuracy in the prediction of binding energies for dispersively bound systems, as demonstrated on the S22 and S66 benchmark sets of weakly bound dimers.
Pairwise Classifier Ensemble with Adaptive Sub-Classifiers for fMRI Pattern Analysis.

PubMed

Kim, Eunwoo; Park, HyunWook

2017-02-01

The multi-voxel pattern analysis technique is applied to fMRI data for classification of high-level brain functions using pattern information distributed over multiple voxels. In this paper, we propose a classifier ensemble for multiclass classification in fMRI analysis, exploiting the fact that specific neighboring voxels can contain spatial pattern information. The proposed method converts the multiclass classification to a pairwise classifier ensemble, and each pairwise classifier consists of multiple sub-classifiers using an adaptive feature set for each class-pair. Simulated and real fMRI data were used to verify the proposed method. Intra- and inter-subject analyses were performed to compare the proposed method with several well-known classifiers, including single and ensemble classifiers. The comparison results showed that the proposed method can be generally applied to multiclass classification in both simulations and real fMRI analyses.
A composite likelihood approach for spatially correlated survival data

PubMed Central

Paik, Jane; Ying, Zhiliang

2013-01-01

The aim of this paper is to provide a composite likelihood approach to handle spatially correlated survival data using pairwise joint distributions. With e-commerce data, a recent question of interest in marketing research has been to describe spatially clustered purchasing behavior and to assess whether geographic distance is the appropriate metric to describe purchasing dependence. We present a model for the dependence structure of time-to-event data subject to spatial dependence to characterize purchasing behavior from the motivating example from e-commerce data. We assume the Farlie-Gumbel-Morgenstern (FGM) distribution and then model the dependence parameter as a function of geographic and demographic pairwise distances. For estimation of the dependence parameters, we present pairwise composite likelihood equations. We prove that the resulting estimators exhibit key properties of consistency and asymptotic normality under certain regularity conditions in the increasing-domain framework of spatial asymptotic theory. PMID:24223450
Pairwise adaptive thermostats for improved accuracy and stability in dissipative particle dynamics

DOE Office of Scientific and Technical Information (OSTI.GOV)

Leimkuhler, Benedict, E-mail: b.leimkuhler@ed.ac.uk; Shang, Xiaocheng, E-mail: x.shang@brown.edu

2016-11-01

We examine the formulation and numerical treatment of dissipative particle dynamics (DPD) and momentum-conserving molecular dynamics. We show that it is possible to improve both the accuracy and the stability of DPD by employing a pairwise adaptive Langevin thermostat that precisely matches the dynamical characteristics of DPD simulations (e.g., autocorrelation functions) while automatically correcting thermodynamic averages using a negative feedback loop. In the low friction regime, it is possible to replace DPD by a simpler momentum-conserving variant of the Nosé–Hoover–Langevin method based on thermostatting only pairwise interactions; we show that this method has an extra order of accuracy for anmore » important class of observables (a superconvergence result), while also allowing larger timesteps than alternatives. All the methods mentioned in the article are easily implemented. Numerical experiments are performed in both equilibrium and nonequilibrium settings; using Lees–Edwards boundary conditions to induce shear flow.« less
Pairwise Interaction Extended Point-Particle (PIEP) model for multiphase jets and sedimenting particles

NASA Astrophysics Data System (ADS)

Liu, Kai; Balachandar, S.

2017-11-01

We perform a series of Euler-Lagrange direct numerical simulations (DNS) for multiphase jets and sedimenting particles. The forces the flow exerts on the particles in these two-way coupled simulations are computed using the Basset-Bousinesq-Oseen (BBO) equations. These forces do not explicitly account for particle-particle interactions, even though such pairwise interactions induced by the perturbations from neighboring particles may be important especially when the particle volume fraction is high. Such effects have been largely unaddressed in the literature. Here, we implement the Pairwise Interaction Extended Point-Particle (PIEP) model to simulate the effect of neighboring particle pairs. A simple collision model is also applied to avoid unphysical overlapping of solid spherical particles. The simulation results indicate that the PIEP model provides a more elaborative and complicated movement of the dispersed phase (droplets and particles). Office of Naval Research (ONR) Multidisciplinary University Research Initiative (MURI) project N00014-16-1-2617.
Single-atom gold catalysis in the context of developments in parahydrogen-induced polarization.

PubMed

Corma, Avelino; Salnikov, Oleg G; Barskiy, Danila A; Kovtunov, Kirill V; Koptyug, Igor V

2015-05-04

A highly isolated monoatomic gold catalyst, with single gold atoms dispersed on multiwalled carbon nanotubes (MWCNTs), has been synthesized, characterized, and tested in heterogeneous hydrogenation of 1,3-butadiene and 1-butyne with parahydrogen to maximize the polarization level and the contribution of the pairwise hydrogen addition route. The Au/MWCNTs catalyst was found to be active and efficient in pairwise hydrogen addition and the estimated contributions from the pairwise hydrogen addition route are at least an order of magnitude higher than those for supported metal nanoparticle catalysts. Therefore, the use of the highly isolated monoatomic catalysts is very promising for production of hyperpolarized fluids that can be used for the significant enhancement of NMR signals. A mechanism of 1,3-butadiene hydrogenation with parahydrogen over the highly isolated monoatomic Au/MWCNTs catalyst is also proposed. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Genetic interactions contribute less than additive effects to quantitative trait variation in yeast

PubMed Central

Bloom, Joshua S.; Kotenko, Iulia; Sadhu, Meru J.; Treusch, Sebastian; Albert, Frank W.; Kruglyak, Leonid

2015-01-01

Genetic mapping studies of quantitative traits typically focus on detecting loci that contribute additively to trait variation. Genetic interactions are often proposed as a contributing factor to trait variation, but the relative contribution of interactions to trait variation is a subject of debate. Here we use a very large cross between two yeast strains to accurately estimate the fraction of phenotypic variance due to pairwise QTL–QTL interactions for 20 quantitative traits. We find that this fraction is 9% on average, substantially less than the contribution of additive QTL (43%). Statistically significant QTL–QTL pairs typically have small individual effect sizes, but collectively explain 40% of the pairwise interaction variance. We show that pairwise interaction variance is largely explained by pairs of loci at least one of which has a significant additive effect. These results refine our understanding of the genetic architecture of quantitative traits and help guide future mapping studies. PMID:26537231
GetReal in network meta-analysis: a review of the methodology.

PubMed

Efthimiou, Orestis; Debray, Thomas P A; van Valkenhoef, Gert; Trelle, Sven; Panayidou, Klea; Moons, Karel G M; Reitsma, Johannes B; Shang, Aijing; Salanti, Georgia

2016-09-01

Pairwise meta-analysis is an established statistical tool for synthesizing evidence from multiple trials, but it is informative only about the relative efficacy of two specific interventions. The usefulness of pairwise meta-analysis is thus limited in real-life medical practice, where many competing interventions may be available for a certain condition and studies informing some of the pairwise comparisons may be lacking. This commonly encountered scenario has led to the development of network meta-analysis (NMA). In the last decade, several applications, methodological developments, and empirical studies in NMA have been published, and the area is thriving as its relevance to public health is increasingly recognized. This article presents a review of the relevant literature on NMA methodology aiming to pinpoint the developments that have appeared in the field. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
A composite likelihood approach for spatially correlated survival data.

PubMed

Paik, Jane; Ying, Zhiliang

2013-01-01

The aim of this paper is to provide a composite likelihood approach to handle spatially correlated survival data using pairwise joint distributions. With e-commerce data, a recent question of interest in marketing research has been to describe spatially clustered purchasing behavior and to assess whether geographic distance is the appropriate metric to describe purchasing dependence. We present a model for the dependence structure of time-to-event data subject to spatial dependence to characterize purchasing behavior from the motivating example from e-commerce data. We assume the Farlie-Gumbel-Morgenstern (FGM) distribution and then model the dependence parameter as a function of geographic and demographic pairwise distances. For estimation of the dependence parameters, we present pairwise composite likelihood equations. We prove that the resulting estimators exhibit key properties of consistency and asymptotic normality under certain regularity conditions in the increasing-domain framework of spatial asymptotic theory.
Supplementing glycosylation: A review of applying nucleotide-sugar precursors to growth medium to affect therapeutic recombinant protein glycoform distributions.

PubMed

Blondeel, Eric J M; Aucoin, Marc G

2018-06-15

Glycosylation is a critical quality attribute (CQA) of many therapeutic proteins, particularly monoclonal antibodies (mAbs), and is a major consideration in the approval of biosimilar biologics due to its effects to therapeutic efficacy. Glycosylation generates a distribution of glycoforms, resulting in glycoproteins with inherent molecule-to-molecule heterogeneity, capable of activating (or failing to activate) different effector functions of the immune system. Glycoforms can be affected by the supplementation of nucleotide-sugar precursors, and related components, to culture growth medium, affecting the metabolism of glycosylation. These supplementations has been demonstrated to increase nucleotide-sugar intracellular pools, and impact glycoform distributions, but with varied results. These variations can be attributed to five key factors: Differences between cell platforms (enzyme/transporter expression levels); differences between recombinant proteins produced (glycan-site accessibility); the fermentation and sampling timeline (glucose availability and exoglycosidase accumulation); glutamine levels (affecting ammonia levels, which impact Golgi pH, as well as UDP-GlcNAc pools); and finally, a lack of standardized metrics for observing shifts in glycoform distributions (glycosylation indices) across different experiments. The purpose of this review is to provide detail and clarity on the state of the art of supplementation strategies for nucleotide-sugar precursors for affecting glycosylation in cell culture processes, and to apply glycosylation indices for standardized comparisons across the field. Copyright © 2018. Published by Elsevier Inc.
Imputation of single nucleotide polymorhpism genotypes of Hereford cattle: reference panel size, family relationship and population structure

USDA-ARS?s Scientific Manuscript database

The objective of this study is to investigate single nucleotide polymorphism (SNP) genotypes imputation of Hereford cattle. Purebred Herefords were from two sources, Line 1 Hereford (N=240) and representatives of Industry Herefords (N=311). Using different reference panels of 62 and 494 males with 1...
Molecular dynamics studies on the interaction and encapsulation processes of the nucleotide and peptide chains inside of a carbon nanotube matrix with inclusion of gold nanoparticles

NASA Astrophysics Data System (ADS)

Kholmurodov, Kholmirzo; Dushanov, Eric; Khusenov, Mirzoaziz; Rahmonov, Khaiyom; Zelenyak, Tatyana; Doroshkevich, Alexander; Majumder, Subrata

2017-05-01

Studying of molecular systems as single nucleotides, nucleotide and peptide chains, RNA and DNA interacting with metallic nanoparticles within a carbon nanotube matrix represents a great interest in modern research. In this respect it is worth mentioning the development of the electronics diagnostic apparatus, the biochemical and biotechnological application tools (nanorobotic design, facilities of drug delivery in a living cell), so on. In the present work using molecular dynamics (MD) simulation method the interaction process of small nucleotide chains (NCs) and elongated peptide chains with different sets of metallic nanoparticles (NPs) on a matrix from carbon nanotube (CNT) were simulated to study their mechanisms of encapsulation and folding processes. We have performed a series of the MD calculations with different NC,peptides-NP-CNT models that were aimed on the investigation of the peculiarities of NC,peptide-NP interactions, the formation of bonds and structures in the system, as well as the dynamical behavior in an environment confined by the CNT matrix.
Updating Our View of Organelle Genome Nucleotide Landscape

PubMed Central

Smith, David Roy

2012-01-01

Organelle genomes show remarkable variation in architecture and coding content, yet their nucleotide composition is relatively unvarying across the eukaryotic domain, with most having a high adenine and thymine (AT) content. Recent studies, however, have uncovered guanine and cytosine (GC)-rich mitochondrial and plastid genomes. These sequences come from a small but eclectic list of species, including certain green plants and animals. Here, I review GC-rich organelle DNAs and the insights they have provided into the evolution of nucleotide landscape. I emphasize that GC-biased mitochondrial and plastid DNAs are more widespread than once thought, sometimes occurring together in the same species, and suggest that the forces biasing their nucleotide content can differ both among and within lineages, and may be associated with specific genome architectural features and life history traits. PMID:22973299
Genetic and Antigenic Evidence Supports the Separation of Hepatozoon canis and Hepatozoon americanum at the Species Level

PubMed Central

Baneth, Gad; Barta, John R.; Shkap, Varda; Martin, Donald S.; Macintire, Douglass K.; Vincent-Johnson, Nancy

2000-01-01

Recognition of Hepatozoon canis and Hepatozoon americanum as distinct species was supported by the results of Western immunoblotting of canine anti-H. canis and anti-H. americanum sera against H. canis gamonts. Sequence analysis of 368 bases near the 3′ end of the 18S rRNA gene from each species revealed a pairwise difference of 13.59%. PMID:10699047
Neuro-ergonomic Research for Online Assessment of Cognitive Workload

DTIC Science & Technology

2011-10-01

computer interface (BCI) and medical diagnoses areas. In [65], Kullback - Leibler (KL) divergence was used in the classification 39 of raw EEG signals. It...the features for each EEG channel recorded, and then compared the effectiveness of each feature using a Kruskal-Wallis test . Table 1 lists the...and the KL-distance 5-NN classifier), using different sets of activities. The feature vector and distance measures were tested in pairwise
No significant brain volume decreases or increases in adults with high-functioning autism spectrum disorder and above average intelligence: a voxel-based morphometric study.

PubMed

Riedel, Andreas; Maier, Simon; Ulbrich, Melanie; Biscaldi, Monica; Ebert, Dieter; Fangmeier, Thomas; Perlov, Evgeniy; Tebartz van Elst, Ludger

2014-08-30

Autism spectrum disorder (ASD) is increasingly being recognized as an important issue in adult psychiatry and psychotherapy. High intelligence indicates overall good brain functioning and might thus present a particularly good opportunity to study possible cerebral correlates of core autistic features in terms of impaired social cognition, communication skills, the need for routines, and circumscribed interests. Anatomical MRI data sets for 30 highly intelligent patients with high-functioning autism and 30 pairwise-matched control subjects were acquired and analyzed with voxel-based morphometry. The gray matter volume of the pairwise-matched patients and the controls did not differ significantly. When correcting for total brain volume influences, the patients with ASD exhibited smaller left superior frontal volumes on a trend level. Heterogeneous volumetric findings in earlier studies might partly be explained by study samples biased by a high inclusion rate of secondary forms of ASD, which often go along with neuronal abnormalities. Including only patients with high IQ scores might have decreased the influence of secondary forms of ASD and might explain the absence of significant volumetric differences between the patients and the controls in this study. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Multidrug resistant pathogens respond differently to the presence of co-pathogen, commensal, probiotic and host cells.

PubMed

Chan, Agnes P; Choi, Yongwook; Brinkac, Lauren M; Krishnakumar, Radha; DePew, Jessica; Kim, Maria; Hinkle, Mary K; Lesho, Emil P; Fouts, Derrick E

2018-06-05

In light of the ongoing antimicrobial resistance crisis, there is a need to understand the role of co-pathogens, commensals, and the local microbiome in modulating virulence and antibiotic resistance. To identify possible interactions that influence the expression of virulence or survival mechanisms in both the multidrug-resistant organisms (MDROs) and human host cells, unique cohorts of clinical isolates were selected for whole genome sequencing with enhanced assembly and full annotation, pairwise co-culturing, and transcriptome profiling. The MDROs were co-cultured in pairwise combinations either with: (1) another MDRO, (2) skin commensals (Staphylococcus epidermidis and Corynebacterium jeikeium), (3) the common probiotic Lactobacillus reuteri, and (4) human fibroblasts. RNA-Seq analysis showed distinct regulation of virulence and antimicrobial resistance gene responses across different combinations of MDROs, commensals, and human cells. Co-culture assays demonstrated that microbial interactions can modulate gene responses of both the target and pathogen/commensal species, and that the responses are specific to the identity of the pathogen/commensal species. In summary, bacteria have mechanisms to distinguish between friends, foe and host cells. These results provide foundational data and insight into the possibility of manipulating the local microbiome when treating complicated polymicrobial wound, intra-abdominal, or respiratory infections.
Identification of landscape features influencing gene flow: How useful are habitat selection models?

USGS Publications Warehouse

Roffler, Gretchen H.; Schwartz, Michael K.; Pilgrim, Kristy L.; Talbot, Sandra L.; Sage, Kevin; Adams, Layne G.; Luikart, Gordon

2016-01-01

Understanding how dispersal patterns are influenced by landscape heterogeneity is critical for modeling species connectivity. Resource selection function (RSF) models are increasingly used in landscape genetics approaches. However, because the ecological factors that drive habitat selection may be different from those influencing dispersal and gene flow, it is important to consider explicit assumptions and spatial scales of measurement. We calculated pairwise genetic distance among 301 Dall's sheep (Ovis dalli dalli) in southcentral Alaska using an intensive noninvasive sampling effort and 15 microsatellite loci. We used multiple regression of distance matrices to assess the correlation of pairwise genetic distance and landscape resistance derived from an RSF, and combinations of landscape features hypothesized to influence dispersal. Dall's sheep gene flow was positively correlated with steep slopes, moderate peak normalized difference vegetation indices (NDVI), and open land cover. Whereas RSF covariates were significant in predicting genetic distance, the RSF model itself was not significantly correlated with Dall's sheep gene flow, suggesting that certain habitat features important during summer (rugged terrain, mid-range elevation) were not influential to effective dispersal. This work underscores that consideration of both habitat selection and landscape genetics models may be useful in developing management strategies to both meet the immediate survival of a species and allow for long-term genetic connectivity.
Functional connectivity in resting state as a phonemic fluency ability measure.

PubMed

Miró-Padilla, Anna; Bueichekú, Elisenda; Ventura-Campos, Noelia; Palomar-García, María-Ángeles; Ávila, César

2017-03-01

There is some evidence that functional connectivity (FC) measures obtained at rest may reflect individual differences in cognitive capabilities. We tested this possibility by using the FAS test as a measure of phonemic fluency. Seed regions of the main brain areas involved in this task were extracted from meta-analysis results (Wagner et al., 2014) and used for pairwise resting-state FC analysis. Ninety-three undergraduates completed the FAS test outside the scanner. A correlation analysis was conducted between the F-A-S scores (behavioral testing) and the pairwise FC pattern of verbal fluency regions of interest. Results showed that the higher FC between the thalamus and the cerebellum, and the lower FCs between the left inferior frontal gyrus and the right insula and between the supplementary motor area and the right insula were associated with better performance on the FAS test. Regression analyses revealed that the first two FCs contributed independently to this better phonemic fluency, reflecting a more general attentional factor (FC between thalamus and cerebellum) and a more specific fluency factor (FC between the left inferior frontal gyrus and the right insula). The results support the Spontaneous Trait Reactivation hypothesis, which explains how resting-state derived measures may reflect individual differences in cognitive abilities. Copyright © 2017 Elsevier Ltd. All rights reserved.

AlignMe—a membrane protein sequence alignment web server

PubMed Central

Stamm, Marcus; Staritzbichler, René; Khafizov, Kamil; Forrest, Lucy R.

2014-01-01

We present a web server for pair-wise alignment of membrane protein sequences, using the program AlignMe. The server makes available two operational modes of AlignMe: (i) sequence to sequence alignment, taking two sequences in fasta format as input, combining information about each sequence from multiple sources and producing a pair-wise alignment (PW mode); and (ii) alignment of two multiple sequence alignments to create family-averaged hydropathy profile alignments (HP mode). For the PW sequence alignment mode, four different optimized parameter sets are provided, each suited to pairs of sequences with a specific similarity level. These settings utilize different types of inputs: (position-specific) substitution matrices, secondary structure predictions and transmembrane propensities from transmembrane predictions or hydrophobicity scales. In the second (HP) mode, each input multiple sequence alignment is converted into a hydrophobicity profile averaged over the provided set of sequence homologs; the two profiles are then aligned. The HP mode enables qualitative comparison of transmembrane topologies (and therefore potentially of 3D folds) of two membrane proteins, which can be useful if the proteins have low sequence similarity. In summary, the AlignMe web server provides user-friendly access to a set of tools for analysis and comparison of membrane protein sequences. Access is available at http://www.bioinfo.mpg.de/AlignMe PMID:24753425
Sequence polymorphism data of the hypervariable regions of mitochondrial DNA in the Yadav population of Haryana.

PubMed

Verma, Kapil; Sharma, Sapna; Sharma, Arun; Dalal, Jyoti; Bhardwaj, Tapeshwar

2018-06-01

Genetic variations among humans occur both within and among populations and range from single nucleotide changes to multiple-nucleotide variants. These multiple-nucleotide variants are useful for studying the relationships among individuals or various population groups. The study of human genetic variations can help scientists understand how different population groups are biologically related to one another. Sequence analysis of hypervariable regions of human mitochondrial DNA (mtDNA) has been successfully used for the genetic characterization of different population groups for forensic purposes. It is well established that different ethnic or population groups differ significantly in their mtDNA distributions. In the last decade, very little research has been conducted on mtDNA variations in the Indian population, although such data would be useful for elucidating the history of human population expansion across the world. Moreover, forensic studies on mtDNA variations in the Indian subcontinent are also scarce, particularly in the northern part of India. In this report, variations in the hypervariable regions of mtDNA were analyzed in the Yadav population of Haryana. Different molecular diversity indices were computed. Further, the obtained haplotypes were classified into different haplogroups and the phylogenetic relationship between different haplogroups was inferred.
Insight into pattern of codon biasness and nucleotide base usage in serotonin receptor gene family from different mammalian species.

PubMed

Dass, J Febin Prabhu; Sudandiradoss, C

2012-07-15

5-HT (5-Hydroxy-tryptamine) or serotonin receptors are found both in central and peripheral nervous system as well as in non-neuronal tissues. In the animal and human nervous system, serotonin produces various functional effects through a variety of membrane bound receptors. In this study, we focus on 5-HT receptor family from different mammals and examined the factors that account for codon and nucleotide usage variation. A total of 110 homologous coding sequences from 11 different mammalian species were analyzed using relative synonymous codon usage (RSCU), correspondence analysis (COA) and hierarchical cluster analysis together with nucleotide base usage frequency of chemically similar amino acid codons. The mean effective number of codon (ENc) value of 37.06 for 5-HT(6) shows very high codon bias within the family and may be due to high selective translational efficiency. The COA and Spearman's rank correlation reveals that the nucleotide compositional mutation bias as the major factors influencing the codon usage in serotonin receptor genes. The hierarchical cluster analysis suggests that gene function is another dominant factor that affects the codon usage bias, while species is a minor factor. Nucleotide base usage was reported using Goldman, Engelman, Stietz (GES) scale reveals the presence of high uracil (>45%) content at functionally important hydrophobic regions. Our in silico approach will certainly help for further investigations on critical inference on evolution, structure, function and gene expression aspects of 5-HT receptors family which are potential antipsychotic drug targets. Copyright © 2012 Elsevier B.V. All rights reserved.
Health-Related Quality of Life in Children and Adolescents with Hereditary Bleeding Disorders and in Children and Adolescents with Stroke: Cross-Sectional Comparison to Siblings and Peers

PubMed Central

Neuner, Bruno; von Mackensen, Sylvia; Holzhauer, Susanne; Funk, Stephanie; Klamroth, Robert; Kurnik, Karin; Krümpel, Anne; Halimeh, Susan; Reinke, Sarah; Frühwald, Michael; Nowak-Göttl, Ulrike

2016-01-01

Objectives. To investigate self-reported health-related quality of life (HrQoL) in children and adolescents with chronic medical conditions compared with siblings/peers. Methods. Group 1 (6 treatment centers) consisted of 74 children/adolescents aged 8–16 years with hereditary bleeding disorders (HBD), 12 siblings, and 34 peers. Group 2 (one treatment center) consisted of 70 children/adolescents with stroke/transient ischemic attack, 14 siblings, and 72 peers. HrQoL was assessed with the “revised KINDer Lebensqualitätsfragebogen” (KINDL-R) questionnaire. Multivariate analyses within groups were done by one-way ANOVA and post hoc pairwise single comparisons by Student's t-tests. Adjusted pairwise comparisons were done by hierarchical linear regressions with individuals nested within treatment centers (group 1) and by linear regressions (group 2), respectively. Results. No differences were found in multivariate analyses of self-reported HrQoL in group 1, while in group 2 differences occurred in overall wellbeing and all subdimensions. These differences were due to differences between patients and peers. After adjusting for age, gender, number of siblings, and treatment center these differences persisted regarding self-worth (p = .0040) and friend-related wellbeing (p < .001). Conclusions. In children with HBD, HrQoL was comparable to siblings and peers. In children with stroke/TIA HrQoL was comparable to siblings while peers, independently of relevant confounder, showed better self-worth and friend-related wellbeing. PMID:27294108
A Comparison of Online, Video Synchronous, and Traditional Learning Modes for an Introductory Undergraduate Physics Course

NASA Astrophysics Data System (ADS)

Faulconer, E. K.; Griffith, J.; Wood, B.; Acharyya, S.; Roberts, D.

2018-05-01

While the equivalence between online and traditional classrooms has been well-researched, very little of this includes college-level introductory Physics. Only one study explored Physics at the whole-class level rather than specific course components such as a single lab or a homework platform. In this work, we compared the failure rate, grade distribution, and withdrawal rates in an introductory undergraduate Physics course across several learning modes including traditional face-to-face instruction, synchronous video instruction, and online classes. Statistically significant differences were found for student failure rates, grade distribution, and withdrawal rates but yielded small effect sizes. Post-hoc pair-wise test was run to determine differences between learning modes. Online students had a significantly lower failure rate than students who took the class via synchronous video classroom. While statistically significant differences were found for grade distributions, the pair-wise comparison yielded no statistically significance differences between learning modes when using the more conservative Bonferroni correction in post-hoc testing. Finally, in this study, student withdrawal rates were lowest for students who took the class in person (in-person classroom and synchronous video classroom) than online. Students that persist in an online introductory Physics class are more likely to achieve an A than in other modes. However, the withdrawal rate is higher from online Physics courses. Further research is warranted to better understand the reasons for higher withdrawal rates in online courses. Finding the root cause to help eliminate differences in student performance across learning modes should remain a high priority for education researchers and the education community as a whole.
Stochasticity of bacterial attachment and its predictability by the extended derjaguin-landau-verwey-overbeek theory.

PubMed

Chia, Teck Wah R; Nguyen, Vu Tuan; McMeekin, Thomas; Fegan, Narelle; Dykes, Gary A

2011-06-01

Bacterial attachment onto materials has been suggested to be stochastic by some authors but nonstochastic and based on surface properties by others. We investigated this by attaching pairwise combinations of two Salmonella enterica serovar Sofia (S. Sofia) strains (with different physicochemical and attachment properties) with one strain each of S. enterica serovar Typhimurium, S. enterica serovar Infantis, or S. enterica serovar Virchow (all with similar physicochemical and attachment abilities) in ratios of 0.428, 1, and 2.333 onto glass, stainless steel, Teflon, and polysulfone. Attached bacterial cells were recovered and counted. If the ratio of attached cells of each Salmonella serovar pair recovered was the same as the initial inoculum ratio, the attachment process was deemed stochastic. Experimental outcomes from the study were compared to those predicted by the extended Derjaguin-Landau-Verwey-Overbeek (XDLVO) theory. Significant differences (P < 0.05) between the initial and the attached ratios for serovar pairs containing S. Sofia S1296a for all different ratios were apparent for all materials. For S. Sofia S1635-containing pairs, 7 out of 12 combinations of serovar pairs and materials had attachment ratios not significantly different (P > 0.05) from the initial ratio of 0.428. Five out of 12 and 10 out of 12 samples had attachment ratios not significantly different (P > 0.05) from the initial ratios of 1 and 2.333, respectively. These results demonstrate that bacterial attachment to different materials is likely to be nonstochastic only when the key physicochemical properties of the bacteria were significantly different (P < 0.05) from each other. XDLVO theory could successfully predict the attachment of some individual isolates to particular materials but could not be used to predict the likelihood of stochasticity in pairwise attachment experiments.
Species detection and identification in sexual organisms using population genetic theory and DNA sequences.

PubMed

Birky, C William

2013-01-01

Phylogenetic trees of DNA sequences of a group of specimens may include clades of two kinds: those produced by stochastic processes (random genetic drift) within a species, and clades that represent different species. The ratio of the mean pairwise sequence difference between a pair of clades (K) to the mean pairwise sequence difference within a clade (θ) can be used to determine whether the clades are samples from different species (K/θ ≥ 4) or the same species (K/θ<4) with probability ≥ 0.95. Previously I applied this criterion to delimit species of asexual organisms. Here I use data from the literature to show how it can also be applied to delimit sexual species using four groups of sexual organisms as examples: ravens, spotted leopards, sea butterflies, and liverworts. Mitochondrial or chloroplast genes are used because these segregate earlier during speciation than most nuclear genes and hence detect earlier stages of speciation. In several cases the K/θ ratio was greater than 4, confirming the original authors' intuition that the clades were sufficiently different to be assigned to different species. But the K/θ ratio split each of two liverwort species into two evolutionary species, and showed that support for the distinction between the common and Chihuahuan raven species is weak. I also discuss some possible sources of error in using the K/θ ratio; the most significant one would be cases where males migrate between different populations but females do not, making the use of maternally inherited organelle genes problematic. The K/θ ratio must be used with some caution, like all other methods for species delimitation. Nevertheless, it is a simple theory-based quantitative method for using DNA sequences to make rigorous decisions about species delimitation in sexual as well as asexual eukaryotes.
Complete nucleotide sequences of the coat protein messenger RNAs of brome mosaic virus and cowpea chlorotic mottle virus.

PubMed Central

Dasgupta, R; Kaesberg, P

1982-01-01

The nucleotide sequences of the subgenomic coat protein messengers (RNA4's) of two related bromoviruses, brome mosaic virus (BMV) and cowpea chlorotic mottle virus (CCMV), have been determined by direct RNA and CDNA sequencing without cloning. BMV RNA4 is 876 b long including a 5' noncoding region of nine nucleotides and a 3' noncoding region of 300 nucleotides. CCMV RNA 4 is 824 b long, including a 5' noncoding region of 10 nucleotides and a 3' noncoding region of 244 nucleotides. The encoded coat proteins are similar in length (188 amino acids for BMV and 189 amino acids for CCMV) and display about 70% homology in their amino acid sequences. Length difference between the two RNAs is due mostly to a single deletion, in CCMV with respect to BMV, of about 57 b immediately following the coding region. Allowing for this deletion the RNAs are indicate that mutations leading to divergence were constrained in the coding region primarily by the requirement of maintaining a favorable coat protein structure and in the 3' noncoding region primarily by the requirement of maintaining a favorable RNA spatial configuration. PMID:6895941
Genomic diversity of the human intestinal parasite Entamoeba histolytica

PubMed Central

2012-01-01

Background Entamoeba histolytica is a significant cause of disease worldwide. However, little is known about the genetic diversity of the parasite. We re-sequenced the genomes of ten laboratory cultured lines of the eukaryotic pathogen Entamoeba histolytica in order to develop a picture of genetic diversity across the genome. Results The extreme nucleotide composition bias and repetitiveness of the E. histolytica genome provide a challenge for short-read mapping, yet we were able to define putative single nucleotide polymorphisms in a large portion of the genome. The results suggest a rather low level of single nucleotide diversity, although genes and gene families with putative roles in virulence are among the more polymorphic genes. We did observe large differences in coverage depth among genes, indicating differences in gene copy number between genomes. We found evidence indicating that recombination has occurred in the history of the sequenced genomes, suggesting that E. histolytica may reproduce sexually. Conclusions E. histolytica displays a relatively low level of nucleotide diversity across its genome. However, large differences in gene family content and gene copy number are seen among the sequenced genomes. The pattern of polymorphism indicates that E. histolytica reproduces sexually, or has done so in the past, which has previously been suggested but not proven. PMID:22630046
Implementation of anion-receptor macrocycles in supramolecular tandem assays for enzymes involving nucleotides as substrates, products, and cofactors.

PubMed

Florea, Mara; Nau, Werner M

2010-03-07

A supramolecular tandem assay for direct continuous monitoring of nucleotide triphosphate-dependent enzymes such as potato apyrase is described. The underlying principle of the assay relies on the use of anion-receptor macrocycles in combination with fluorescent dyes as reporter pairs. A combinatorial approach was used to identify two complementary reporter pairs, i.e. an amino-gamma-cyclodextrin with 2-anilinonaphtalene-6-sulfonate (ANS) as dye (fluorescence enhancement factor of 17 upon complexation) and a polycationic cyclophane with 8-hydroxy-1,3,6-pyrene trisulfonate (HPTS) as dye (fluorescence decrease by a factor of more than 2000), which allow the kinetic monitoring of potato apyrase activity at different ATP concentration ranges (microM and mM) with different types of photophysical responses (switch-ON and switch-OFF). Competitive fluorescence titrations revealed a differential binding of ATP (strongest competitor) versus ADP and AMP, which constitutes the prerequisite for monitoring enzymatic conversions (dephosphorylation or phosphorylation) involving nucleotides. The assay was tested for different enzyme and substrate concentrations and exploited for the screening of activating additives, namely divalent transition metal ions (Ni(2+), Mg(2+), Mn(2+), and Ca(2+)). The transferability of the assay could be demonstrated by monitoring the dephosphorylation of other nucleotide triphosphates (GTP, TTP, and CTP).
Clay catalysis of oligonucleotide formation: kinetics of the reaction of the 5'-phosphorimidazolides of nucleotides with the non-basic heterocycles uracil and hypoxanthine

NASA Technical Reports Server (NTRS)

Kawamura, K.; Ferris, J. P.

1999-01-01

The montmorillonite clay catalyzed condensation of activated monocleotides to oligomers of RNA is a possible first step in the formation of the proposed RNA world. The rate constants for the condensation of the phosphorimidazolide of adenosine were measured previously and these studies have been extended to the phosphorimidazolides of inosine and uridine in the present work to determine of substitution of neutral heterocycles for the basic adenine ring changes the reaction rate or regioselectivity. The oligomerization reactions of the 5'-phosphoromidazolides of uridine (ImpU) and inosine (ImpI) on montmorillonite yield oligo(U)s and oligo(I)s as long as heptamers. The rate constants for oligonucleotide formation were determined by measuring the rates of formation of the oligomers by HPLC. Both the apparent rate constants in the reaction mixture and the rate constants on the clay surface were calculated using the partition coefficients of the oligomers between the aqueous and clay phases. The rate constants for trimer formation are much greater than those dimer synthesis but there was little difference in the rate constants for the formation of trimers and higher oligomers. The overall rates of oligomerization of the phosphorimidazolides of purine and pyrimidine nucleosides in the presence of montmorillonite clay are the same suggesting that RNA formed on the primitive Earth could have contained a variety of heterocyclic bases. The rate constants for oligomerization of pyrimidine nucleotides on the clay surface are significantly higher than those of purine nucleotides since the pyrimidine nucleotides bind less strongly to the clay than do the purine nucleotides. The differences in the binding is probably due to Van der Waals interactions between the purine bases and the clay surface. Differences in the basicity of the heterocyclic ring in the nucleotide have little effect on the oligomerization process.
Interaction centres of pyrimidine nucleotides: cytidine-5'-diphosphate (CDP) and cytidine-5'-triphosphate (CTP) in their reactions with tetramines and Cu(II) ions.

PubMed

Gasowska, A

2005-08-01

The interactions between pyrimidine nucleotides: cytidine-5'-diphosphate (CDP) and cytidine-5'-triphosphate (CTP) and Cu(II) ions, spermine (Spm) and 1,11-diamino-4,8-diazaundecane (3,3,3-tet) have been studied. The composition and stability constants of the complexes formed have been determined by means of the potentiometric method, while the centres of interactions in the ligands have been identified by the spectral methods (UV-Vis, Ultraviolet and Visible spectroscopy; EPR, electron spin resonance; NMR). In the systems without metal, formation of the molecular complexes nucleotide-polyamine with the interaction centres at the endocyclic nitrogen atom of purine ring N3, the oxygen atoms of the phosphate group from the nucleotide and protonated nitrogen atoms of the polyamine have been detected. Significant differences have been found in the metallation between the systems with Spm and with 3,3,3-tet. In the systems with spermine, mainly protonated species are formed with the phosphate group of the nucleotide and deprotonated nitrogen atoms of the polyamine making the coordination centres, while the donor nitrogen atom of the nucleotide N3 is involved in the intramolecular interligand interactions, additionally stabilising the complex. In the systems with 3,3,3-tet, the MLL' type species are formed in which the oxygen atoms of the phosphate group and nitrogen atoms of the polyamine are involved in metallation, whereas the N3 atom from the pyrimidine ring of the nucleotide is located outside the inner coordination sphere of copper ion. The main centre of Cu(II) interaction in the nucleotide, both in the system with Spm and 3,3,3-tet is the phosphate group of the nucleotide.
ParallABEL: an R library for generalized parallelization of genome-wide association studies

PubMed Central

2010-01-01

Background Genome-Wide Association (GWA) analysis is a powerful method for identifying loci associated with complex traits and drug response. Parts of GWA analyses, especially those involving thousands of individuals and consuming hours to months, will benefit from parallel computation. It is arduous acquiring the necessary programming skills to correctly partition and distribute data, control and monitor tasks on clustered computers, and merge output files. Results Most components of GWA analysis can be divided into four groups based on the types of input data and statistical outputs. The first group contains statistics computed for a particular Single Nucleotide Polymorphism (SNP), or trait, such as SNP characterization statistics or association test statistics. The input data of this group includes the SNPs/traits. The second group concerns statistics characterizing an individual in a study, for example, the summary statistics of genotype quality for each sample. The input data of this group includes individuals. The third group consists of pair-wise statistics derived from analyses between each pair of individuals in the study, for example genome-wide identity-by-state or genomic kinship analyses. The input data of this group includes pairs of SNPs/traits. The final group concerns pair-wise statistics derived for pairs of SNPs, such as the linkage disequilibrium characterisation. The input data of this group includes pairs of individuals. We developed the ParallABEL library, which utilizes the Rmpi library, to parallelize these four types of computations. ParallABEL library is not only aimed at GenABEL, but may also be employed to parallelize various GWA packages in R. The data set from the North American Rheumatoid Arthritis Consortium (NARAC) includes 2,062 individuals with 545,080, SNPs' genotyping, was used to measure ParallABEL performance. Almost perfect speed-up was achieved for many types of analyses. For example, the computing time for the identity-by-state matrix was linearly reduced from approximately eight hours to one hour when ParallABEL employed eight processors. Conclusions Executing genome-wide association analysis using the ParallABEL library on a computer cluster is an effective way to boost performance, and simplify the parallelization of GWA studies. ParallABEL is a user-friendly parallelization of GenABEL. PMID:20429914
Identification of a novel bovine enterovirus possessing highly divergent amino acid sequences in capsid protein.

PubMed

Tsuchiaka, Shinobu; Rahpaya, Sayed Samim; Otomaru, Konosuke; Aoki, Hiroshi; Kishimoto, Mai; Naoi, Yuki; Omatsu, Tsutomu; Sano, Kaori; Okazaki-Terashima, Sachiko; Katayama, Yukie; Oba, Mami; Nagai, Makoto; Mizutani, Tetsuya

2017-01-17

Bovine enterovirus (BEV) belongs to the species Enterovirus E or F, genus Enterovirus and family Picornaviridae. Although numerous studies have identified BEVs in the feces of cattle with diarrhea, the pathogenicity of BEVs remains unclear. Previously, we reported the detection of novel kobu-like virus in calf feces, by metagenomics analysis. In the present study, we identified a novel BEV in diarrheal feces collected for that survey. Complete genome sequences were determined by deep sequencing in feces. Secondary RNA structure analysis of the 5' untranslated region (UTR), phylogenetic tree construction and pairwise identity analysis were conducted. The complete genome sequences of BEV were genetically distant from other EVs and the VP1 coding region contained novel and unique amino acid sequences. We named this strain as BEV AN12/Bos taurus/JPN/2014 (referred to as BEV-AN12). According to genome analysis, the genome length of this virus is 7414 nucleotides excluding the poly (A) tail and its genome consists of a 5'UTR, open reading frame encoding a single polyprotein, and 3'UTR. The results of secondary RNA structure analysis showed that in the 5'UTR, BEV-AN12 had an additional clover leaf structure and small stem loop structure, similarly to other BEVs. In pairwise identity analysis, BEV-AN12 showed high amino acid (aa) identities to Enterovirus F in the polyprotein, P2 and P3 regions (aa identity ≥82.4%). Therefore, BEV-AN12 is closely related to Enterovirus F. However, aa sequences in the capsid protein regions, particularly the VP1 encoding region, showed significantly low aa identity to other viruses in genus Enterovirus (VP1 aa identity ≤58.6%). In addition, BEV-AN12 branched separately from Enterovirus E and F in phylogenetic trees based on the aa sequences of P1 and VP1, although it clustered with Enterovirus F in trees based on sequences in the P2 and P3 genome region. We identified novel BEV possessing highly divergent aa sequences in the VP1 coding region in Japan. According to species definition, we proposed naming this strain as "Enterovirus K", which is a novel species within genus Enterovirus. Further genomic studies are needed to understand the pathogenicity of BEVs.
Ancestral sequence reconstruction in primate mitochondrial DNA: compositional bias and effect on functional inference.

PubMed

Krishnan, Neeraja M; Seligmann, Hervé; Stewart, Caro-Beth; De Koning, A P Jason; Pollock, David D

2004-10-01

Reconstruction of ancestral DNA and amino acid sequences is an important means of inferring information about past evolutionary events. Such reconstructions suggest changes in molecular function and evolutionary processes over the course of evolution and are used to infer adaptation and convergence. Maximum likelihood (ML) is generally thought to provide relatively accurate reconstructed sequences compared to parsimony, but both methods lead to the inference of multiple directional changes in nucleotide frequencies in primate mitochondrial DNA (mtDNA). To better understand this surprising result, as well as to better understand how parsimony and ML differ, we constructed a series of computationally simple "conditional pathway" methods that differed in the number of substitutions allowed per site along each branch, and we also evaluated the entire Bayesian posterior frequency distribution of reconstructed ancestral states. We analyzed primate mitochondrial cytochrome b (Cyt-b) and cytochrome oxidase subunit I (COI) genes and found that ML reconstructs ancestral frequencies that are often more different from tip sequences than are parsimony reconstructions. In contrast, frequency reconstructions based on the posterior ensemble more closely resemble extant nucleotide frequencies. Simulations indicate that these differences in ancestral sequence inference are probably due to deterministic bias caused by high uncertainty in the optimization-based ancestral reconstruction methods (parsimony, ML, Bayesian maximum a posteriori). In contrast, ancestral nucleotide frequencies based on an average of the Bayesian set of credible ancestral sequences are much less biased. The methods involving simpler conditional pathway calculations have slightly reduced likelihood values compared to full likelihood calculations, but they can provide fairly unbiased nucleotide reconstructions and may be useful in more complex phylogenetic analyses than considered here due to their speed and flexibility. To determine whether biased reconstructions using optimization methods might affect inferences of functional properties, ancestral primate mitochondrial tRNA sequences were inferred and helix-forming propensities for conserved pairs were evaluated in silico. For ambiguously reconstructed nucleotides at sites with high base composition variability, ancestral tRNA sequences from Bayesian analyses were more compatible with canonical base pairing than were those inferred by other methods. Thus, nucleotide bias in reconstructed sequences apparently can lead to serious bias and inaccuracies in functional predictions.
Lack of Association Between Toll-like Receptor 2 Polymorphisms (R753Q and A-16934T) and Atopic Dermatitis in Children from Thrace Region of Turkey

PubMed Central

Can, Ceren; Yazıcıoğlu, Mehtap; Gürkan, Hakan; Tozkır, Hilmi; Görgülü, Adnan; Süt, Necdet Hilmi

2017-01-01

Background: Atopic dermatitis is the most common chronic inflammatory skin disease. A complex interaction of both genetic and environmental factors is thought to contribute to the disease. Aims: To evaluate whether single nucleotide polymorphisms in the TLR2 gene c.2258C>T (R753Q) (rs5743708) and TLR2 c.-148+1614T>A (A-16934T) (rs4696480) (NM_0032643) are associated with atopic dermatitis in Turkish children. Study Design: Case-control study. Methods: The study was conducted on 70 Turkish children with atopic dermatitis aged 0.5-18 years. The clinical severity of atopic dermatitis was evaluated by the severity scoring of atopic dermatitis index. Serum total IgE levels, specific IgE antibodies to inhalant and food allergens were measured in both atopic dermatitis patients and controls, skin prick tests were done on 70 children with atopic dermatitis. Genotyping for TLR2 (R753Q and A-16934T) single nucleotide polymorphisms was performed in both atopic dermatitis patients and controls. Results: Cytosine-cytosine and cytosin-thymine genotype frequencies of the TLR2 R753Q single nucleotide polymorphism in the atopic dermatitis group were determined as being 98.6% and 1.4%, cytosine allele frequency for TLR2 R753Q single nucleotide polymorphism was determined as 99.29% and the thymine allele frequency was 0.71%, thymine-thymine, thymine-adenine, and adenine-adenine genotype frequencies of the TLR2 A-16934T single nucleotide polymorphism were 24.3%, 44.3%, and 31.4%. The thymine allele frequency for the TLR2 A-16934T single nucleotide polymorphism in the atopic dermatitis group was 46.43%, and the adenine allele frequency was 53.57%, respectively. There was not statistically significant difference between the groups for all investigated polymorphisms (p>0.05). For all single nucleotide polymorphisms studied, allelic distribution was analogous among atopic dermatitis patients and controls, and no significant statistical difference was observed. No homozygous carriers of the TLR2 R753Q single nucleotide polymorphism were found in the atopic dermatitis and control groups. Conclusion: The TLR2 (R753Q and A-16934T) single nucleotide polymorphisms are not associated with atopic dermatitis in a group of Turkish patients. PMID:28443596
Lack of Association Between Toll-like Receptor 2 Polymorphisms (R753Q and A-16934T) and Atopic Dermatitis in Children from Thrace Region of Turkey.

PubMed

Can, Ceren; Yazıcıoğlu, Mehtap; Gürkan, Hakan; Tozkır, Hilmi; Görgülü, Adnan; Süt, Necdet Hilmi

2017-05-05

Atopic dermatitis is the most common chronic inflammatory skin disease. A complex interaction of both genetic and environmental factors is thought to contribute to the disease. To evaluate whether single nucleotide polymorphisms in the TLR2 gene c.2258C>T (R753Q) (rs5743708) and TLR2 c.-148+1614T>A (A-16934T) (rs4696480) (NM_0032643) are associated with atopic dermatitis in Turkish children. Case-control study. The study was conducted on 70 Turkish children with atopic dermatitis aged 0.5-18 years. The clinical severity of atopic dermatitis was evaluated by the severity scoring of atopic dermatitis index. Serum total IgE levels, specific IgE antibodies to inhalant and food allergens were measured in both atopic dermatitis patients and controls, skin prick tests were done on 70 children with atopic dermatitis. Genotyping for TLR2 (R753Q and A-16934T) single nucleotide polymorphisms was performed in both atopic dermatitis patients and controls. Cytosine-cytosine and cytosin-thymine genotype frequencies of the TLR2 R753Q single nucleotide polymorphism in the atopic dermatitis group were determined as being 98.6% and 1.4%, cytosine allele frequency for TLR2 R753Q single nucleotide polymorphism was determined as 99.29% and the thymine allele frequency was 0.71%, thymine-thymine, thymine-adenine, and adenine-adenine genotype frequencies of the TLR2 A-16934T single nucleotide polymorphism were 24.3%, 44.3%, and 31.4%. The thymine allele frequency for the TLR2 A-16934T single nucleotide polymorphism in the atopic dermatitis group was 46.43%, and the adenine allele frequency was 53.57%, respectively. There was not statistically significant difference between the groups for all investigated polymorphisms (p>0.05). For all single nucleotide polymorphisms studied, allelic distribution was analogous among atopic dermatitis patients and controls, and no significant statistical difference was observed. No homozygous carriers of the TLR2 R753Q single nucleotide polymorphism were found in the atopic dermatitis and control groups. The TLR2 (R753Q and A-16934T) single nucleotide polymorphisms are not associated with atopic dermatitis in a group of Turkish patients.
Quantum Spin Dynamics with Pairwise-Tunable, Long-Range Interactions

DTIC Science & Technology

2016-08-05

rection of the arrows. Dashed (dotted) lines mark the NNN hopping terms (coefficients ±t2). NNNN long -range hopping along curved lines are included to...Quantum spin dynamics with pairwise-tunable, long -range interactions C.-L. Hunga,b,1,2, Alejandro González-Tudelac,1,2, J. Ignacio Ciracc, and H. J...atoms) that interact by way of a variety of processes, such as atomic collisions. Such pro- cesses typically lead to short -range, nearest-neighbor
Pairwise alignment of chromatograms using an extended Fisher-Rao metric.

PubMed

Wallace, W E; Srivastava, A; Telu, K H; Simón-Manso, Y

2014-09-02

A conceptually new approach for aligning chromatograms is introduced and applied to examples of metabolite identification in human blood plasma by liquid chromatography-mass spectrometry (LC-MS). A square-root representation of the chromatogram's derivative coupled with an extended Fisher-Rao metric enables the computation of relative differences between chromatograms. Minimization of these differences using a common dynamic programming algorithm brings the chromatograms into alignment. Application to a complex sample, National Institute of Standards and Technology (NIST) Standard Reference Material 1950, Metabolites in Human Plasma, analyzed by two different LC-MS methods having significantly different ranges of elution time is described. Published by Elsevier B.V.
Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model.

PubMed

Yang, Liang; Ge, Meng; Jin, Di; He, Dongxiao; Fu, Huazhu; Wang, Jing; Cao, Xiaochun

2017-01-01

Due to the demand for performance improvement and the existence of prior information, semi-supervised community detection with pairwise constraints becomes a hot topic. Most existing methods have been successfully encoding the must-link constraints, but neglect the opposite ones, i.e., the cannot-link constraints, which can force the exclusion between nodes. In this paper, we are interested in understanding the role of cannot-link constraints and effectively encoding pairwise constraints. Towards these goals, we define an integral generative process jointly considering the network topology, must-link and cannot-link constraints. We propose to characterize this process as a Multi-variance Mixed Gaussian Generative (MMGG) Model to address diverse degrees of confidences that exist in network topology and pairwise constraints and formulate it as a weighted nonnegative matrix factorization problem. The experiments on artificial and real-world networks not only illustrate the superiority of our proposed MMGG, but also, most importantly, reveal the roles of pairwise constraints. That is, though the must-link is more important than cannot-link when either of them is available, both must-link and cannot-link are equally important when both of them are available. To the best of our knowledge, this is the first work on discovering and exploring the importance of cannot-link constraints in semi-supervised community detection.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.