Risk Classification with an Adaptive Naive Bayes Kernel Machine Model.
Minnier, Jessica; Yuan, Ming; Liu, Jun S; Cai, Tianxi
2015-04-22
Genetic studies of complex traits have uncovered only a small number of risk markers explaining a small fraction of heritability and adding little improvement to disease risk prediction. Standard single marker methods may lack power in selecting informative markers or estimating effects. Most existing methods also typically do not account for non-linearity. Identifying markers with weak signals and estimating their joint effects among many non-informative markers remains challenging. One potential approach is to group markers based on biological knowledge such as gene structure. If markers in a group tend to have similar effects, proper usage of the group structure could improve power and efficiency in estimation. We propose a two-stage method relating markers to disease risk by taking advantage of known gene-set structures. Imposing a naive Bayes kernel machine (KM) model, we estimate gene-set specific risk models that relate each gene-set to the outcome in stage I. The KM framework efficiently models potentially non-linear effects of predictors without requiring explicit specification of functional forms. In stage II, we aggregate information across gene-sets via a regularization procedure. Estimation and computational efficiency are further improved with kernel principal component analysis. Asymptotic results for model estimation and gene set selection are derived and numerical studies suggest that the proposed procedure could outperform existing procedures for constructing genetic risk models.
Korn, Edward L; Freidlin, Boris
2017-06-01
For a fallback randomized clinical trial design with a marker, Choai and Matsui (2015, Biometrics 71, 25-32) estimate the bias of the estimator of the treatment effect in the marker-positive subgroup conditional on the treatment effect not being statistically significant in the overall population. This is used to construct and examine conditionally bias-corrected estimators of the treatment effect for the marker-positive subgroup. We argue that it may not be appropriate to correct for conditional bias in this setting. Instead, we consider the unconditional bias of estimators of the treatment effect for marker-positive patients. © 2016, The International Biometric Society.
Robust estimation of the proportion of treatment effect explained by surrogate marker information.
Parast, Layla; McDermott, Mary M; Tian, Lu
2016-05-10
In randomized treatment studies where the primary outcome requires long follow-up of patients and/or expensive or invasive obtainment procedures, the availability of a surrogate marker that could be used to estimate the treatment effect and could potentially be observed earlier than the primary outcome would allow researchers to make conclusions regarding the treatment effect with less required follow-up time and resources. The Prentice criterion for a valid surrogate marker requires that a test for treatment effect on the surrogate marker also be a valid test for treatment effect on the primary outcome of interest. Based on this criterion, methods have been developed to define and estimate the proportion of treatment effect on the primary outcome that is explained by the treatment effect on the surrogate marker. These methods aim to identify useful statistical surrogates that capture a large proportion of the treatment effect. However, current methods to estimate this proportion usually require restrictive model assumptions that may not hold in practice and thus may lead to biased estimates of this quantity. In this paper, we propose a nonparametric procedure to estimate the proportion of treatment effect on the primary outcome that is explained by the treatment effect on a potential surrogate marker and extend this procedure to a setting with multiple surrogate markers. We compare our approach with previously proposed model-based approaches and propose a variance estimation procedure based on a perturbation-resampling method. Simulation studies demonstrate that the procedure performs well in finite samples and outperforms model-based procedures when the specified models are not correct. We illustrate our proposed procedure using a data set from a randomized study investigating a group-mediated cognitive behavioral intervention for peripheral artery disease participants. Copyright © 2015 John Wiley & Sons, Ltd.
No shortcut solution to the problem of Y-STR match probability calculation.
Caliebe, Amke; Jochens, Arne; Willuweit, Sascha; Roewer, Lutz; Krawczak, Michael
2015-03-01
Match probability calculation is deemed much more intricate for lineage genetic markers, including Y-chromosomal short tandem repeats (Y-STRs), than for autosomal markers. This is because, owing to the lack of recombination, strong interdependence between markers is likely, which implies that haplotype frequency estimates cannot simply be obtained through the multiplication of allele frequency estimates. As yet, however, the practical relevance of this problem has not been studied in much detail using real data. In fact, such scrutiny appears well warranted because the high mutation rates of Y-STRs and the possibility of backward mutation should have worked against the statistical association of Y-STRs. We examined haplotype data of 21 markers included in the PowerPlex® Y23 set (PPY23, Promega Corporation, Madison, WI) originating from six different populations (four European and two Asian). Assessing the conditional entropies of the markers, given different subsets of markers from the same panel, we demonstrate that the PowerPlex® Y23 set cannot be decomposed into smaller marker subsets that would be (conditionally) independent. Nevertheless, in all six populations, >94% of the joint entropy of the 21 markers is explained by the seven most rapidly mutating markers. Although this result might render a reduction in marker number a sensible option for practical casework, the partial haplotypes would still be almost as diverse as the full haplotypes. Therefore, match probability calculation remains difficult and calls for the improvement of currently available methods of haplotype frequency estimation. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
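The conditional-entropy assessment mentioned in the abstract above can be illustrated with a minimal sketch. It assumes plug-in (empirical frequency) entropy estimates and uses the identity H(X | Y) = H(X, Y) - H(Y); the function names and the toy haplotypes are hypothetical and are not taken from the study.

```python
from collections import Counter
from math import log2


def entropy(rows):
    """Shannon entropy (in bits) of the empirical distribution of `rows`."""
    counts = Counter(rows)
    n = sum(counts.values())
    return -sum((c / n) * log2(c / n) for c in counts.values())


def conditional_entropy(haplotypes, target, given):
    """Estimate H(target | given) as H(target, given) - H(given).

    `haplotypes` is a list of tuples of allele calls, `target` is a marker
    index and `given` is a list of marker indices (all hypothetical names).
    """
    joint = [tuple(h[i] for i in (*given, target)) for h in haplotypes]
    cond = [tuple(h[i] for i in given) for h in haplotypes]
    return entropy(joint) - entropy(cond)


# Toy data (invented): marker 1 is fully determined by marker 0,
# while marker 2 varies independently of marker 0.
toy = [(14, 30, 11), (14, 30, 12), (15, 31, 11), (15, 31, 12)]
h_dependent = conditional_entropy(toy, target=1, given=[0])
h_independent = conditional_entropy(toy, target=2, given=[0])
```

A conditional entropy near zero indicates that the conditioning markers already determine the target marker, whereas a value close to the marker's own entropy indicates little dependence. Plug-in estimates are biased in small samples, so this is only an illustration of the quantity, not of the authors' estimation procedure.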
Correcting for deformation in skin-based marker systems.
Alexander, E J; Andriacchi, T P
2001-03-01
A new technique is described that reduces error due to skin movement artifact in the opto-electronic measurement of in vivo skeletal motion. This work builds on a previously described point cluster technique marker set and estimation algorithm by extending the transformation equations to the general deformation case using a set of activity-dependent deformation models. Skin deformation during activities of daily living is modeled as consisting of a functional form defined over the observation interval (the deformation model) plus additive noise (modeling error). The method is described as an interval deformation technique. The method was tested using simulation trials with systematic and random components of deformation error introduced into marker position vectors. The technique was found to substantially outperform methods that require rigid-body assumptions. The method was tested in vivo on a patient fitted with an external fixation device (Ilizarov). Simultaneous measurements from markers placed on the Ilizarov device (fixed to bone) were compared to measurements derived from skin-based markers. The interval deformation technique reduced the errors in limb segment pose estimate by 33 and 25% compared to the classic rigid-body technique for position and orientation, respectively. This newly developed method has demonstrated that by accounting for the changing shape of the limb segment, a substantial improvement in the estimates of in vivo skeletal movement can be achieved.
Williams, Larry J; O'Boyle, Ernest H
2015-09-01
A persistent concern in the management and applied psychology literature is the effect of common method variance on observed relations among variables. Recent work (i.e., Richardson, Simmering, & Sturman, 2009) evaluated 3 analytical approaches to controlling for common method variance, including the confirmatory factor analysis (CFA) marker technique. Their findings indicated significant problems with this technique, especially with nonideal marker variables (those with theoretical relations with substantive variables). Based on their simulation results, Richardson et al. concluded that not correcting for method variance provides more accurate estimates than using the CFA marker technique. We reexamined the effects of using marker variables in a simulation study and found the degree of error in estimates of a substantive factor correlation was relatively small in most cases, and much smaller than error associated with making no correction. Further, in instances in which the error was large, the correlations between the marker and substantive scales were higher than those found in organizational research with marker variables. We conclude that in most practical settings, the CFA marker technique yields parameter estimates close to their true values, and the criticisms made by Richardson et al. are overstated. (c) 2015 APA, all rights reserved.
Kishine, Masahiro; Tsutsumi, Katsuji; Kitta, Kazumi
2017-12-01
Simple sequence repeats (SSRs) are a popular tool for individual fingerprinting. Long-core-motif SSRs (e.g. tetra-, penta-, and hexa-nucleotide) are preferred because they make it easier to separate and distinguish neighboring alleles. In the present study, a new set of 8 tetra-nucleotide SSRs in potato (Solanum tuberosum) is reported. By using these 8 markers, 72 out of 76 cultivars obtained from Japan and the United States were clearly discriminated, while two pairs, both of which arose from natural variation, showed identical profiles. The combined probability of identity between two random cultivars for the set of 8 SSR markers was estimated to be 1.10 × 10⁻⁸, confirming the usefulness of the proposed SSR markers for fingerprinting analyses of potato.
Optimal selection of markers for validation or replication from genome-wide association studies.
Greenwood, Celia M T; Rangrej, Jagadish; Sun, Lei
2007-07-01
With reductions in genotyping costs and the fast pace of improvements in genotyping technology, it is not uncommon for the individuals in a single study to undergo genotyping using several different platforms, where each platform may contain different numbers of markers selected via different criteria. For example, a set of cases and controls may be genotyped at markers in a small set of carefully selected candidate genes, and shortly thereafter, the same cases and controls may be used for a genome-wide single nucleotide polymorphism (SNP) association study. After such initial investigations, often, a subset of "interesting" markers is selected for validation or replication. Specifically, by validation, we refer to the investigation of associations between the selected subset of markers and the disease in independent data. However, it is not obvious how to choose the best set of markers for this validation. There may be a prior expectation that some sets of genotyping data are more likely to contain real associations. For example, it may be more likely for markers in plausible candidate genes to show disease associations than markers in a genome-wide scan. Hence, it would be desirable to select proportionally more markers from the candidate gene set. When a fixed number of markers are selected for validation, we propose an approach for identifying an optimal marker-selection configuration by basing the approach on minimizing the stratified false discovery rate. We illustrate this approach using a case-control study of colorectal cancer from Ontario, Canada, and we show that this approach leads to substantial reductions in the estimated false discovery rates in the Ontario dataset for the selected markers, as well as reductions in the expected false discovery rates for the proposed validation dataset. Copyright 2007 Wiley-Liss, Inc.
Ranking Bias in Association Studies
Jeffries, Neal O.
2009-01-01
Background: It is widely appreciated that genomewide association studies often yield overestimates of the association of a marker with disease when attention focuses upon the marker showing the strongest relationship. For example, in a case-control setting the largest (in absolute value) estimated odds ratio has been found to typically overstate the association as measured in a second, independent set of data. The most common reason given for this observation is that the choice of the most extreme test statistic is often conditional upon first observing a significant p value associated with the marker. A second, less appreciated reason is described here. Under common circumstances it is the multiple testing of many markers and subsequent focus upon those with most extreme test statistics (i.e. highly ranked results) that leads to bias in the estimated effect sizes. Conclusions: This bias, termed ranking bias, is separate from that arising from conditioning on a significant p value and may often be a more important factor in generating bias. An analytic description of this bias, simulations demonstrating its extent, and identification of some factors leading to its exacerbation are presented. PMID:19172085
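The ranking effect described in the abstract above can be illustrated with a small simulation. This is a sketch under simplifying assumptions that are not from the paper: every marker shares the same true effect, estimates are independent and normally distributed with a common standard error, and no significance filter is applied. All names and parameter values are hypothetical.

```python
import random
import statistics


def simulate_top_estimate(n_markers=1000, true_effect=0.1, se=0.1,
                          n_reps=200, seed=0):
    """Average estimate of the top-ranked marker across simulated studies.

    In each replicate, every marker has the same true effect; the marker
    with the largest absolute estimate is picked, with no p-value threshold.
    """
    rng = random.Random(seed)
    tops = []
    for _ in range(n_reps):
        estimates = [rng.gauss(true_effect, se) for _ in range(n_markers)]
        tops.append(max(estimates, key=abs))  # keep the most extreme estimate
    return statistics.mean(tops)


mean_top = simulate_top_estimate()
print(f"true effect: 0.1, mean top-ranked estimate: {mean_top:.3f}")
```

Because the top-ranked marker is selected for having an extreme estimate, its average estimate exceeds the shared true effect even though no conditioning on statistical significance takes place, which is the distinction the abstract draws.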
Taylor, Jeremy M G; Conlon, Anna S C; Elliott, Michael R
2015-08-01
The validation of intermediate markers as surrogate markers (S) for the true outcome of interest (T) in clinical trials offers the possibility for trials to be run more quickly and cheaply by using the surrogate endpoint in place of the true endpoint. Working within a principal stratification framework, we propose causal quantities to evaluate surrogacy using a Gaussian copula model for an ordinal surrogate and time-to-event final outcome. The methods are applied to data from four colorectal cancer clinical trials, where S is tumor response and T is overall survival. For the Gaussian copula model, a Bayesian estimation strategy is used and, as some parameters are not identifiable from the data, we explore the use of informative priors that are consistent with reasonable assumptions in the surrogate marker setting to aid in estimation. While there is some bias in the estimation of the surrogacy quantities of interest, the estimation procedure does reasonably well at distinguishing between poor and good surrogate markers. Some of the parameters of the proposed model are not identifiable from the data, and therefore, assumptions must be made in order to aid in their estimation. The proposed quantities can be used in combination to provide evidence about the validity of S as a surrogate marker for T. © The Author(s) 2014.
Waldmann, P; García-Gil, M R; Sillanpää, M J
2005-06-01
Comparison of the level of differentiation at neutral molecular markers (estimated as F(ST) or G(ST)) with the level of differentiation at quantitative traits (estimated as Q(ST)) has become a standard tool for inferring that there is differential selection between populations. We estimated Q(ST) of timing of bud set from a latitudinal cline of Pinus sylvestris with a Bayesian hierarchical variance component method utilizing the information on the pre-estimated population structure from neutral molecular markers. Unfortunately, the between-family variances differed substantially between populations, which resulted in a bimodal posterior of Q(ST) that could not be compared in any sensible way with the unimodal posterior of the microsatellite F(ST). In order to avoid publishing studies with flawed Q(ST) estimates, we recommend that future studies present heritability estimates for each trait and population. Moreover, to detect variance heterogeneity in frequentist methods (ANOVA and REML), it is essential to also check that the residuals are normally distributed and do not follow any systematically deviating trends.
Da, Yang; Wang, Chunkao; Wang, Shengwen; Hu, Guo
2014-01-01
We established a genomic model of quantitative trait with genomic additive and dominance relationships that parallels the traditional quantitative genetics model, which partitions a genotypic value as breeding value plus dominance deviation and calculates additive and dominance relationships using pedigree information. Based on this genomic model, two sets of computationally complementary but mathematically identical mixed model methods were developed for genomic best linear unbiased prediction (GBLUP) and genomic restricted maximum likelihood estimation (GREML) of additive and dominance effects using SNP markers. These two sets are referred to as the CE and QM sets, where the CE set was designed for large numbers of markers and the QM set was designed for large numbers of individuals. GBLUP and associated accuracy formulations for individuals in training and validation data sets were derived for breeding values, dominance deviations and genotypic values. Simulation study showed that GREML and GBLUP generally were able to capture small additive and dominance effects that each accounted for 0.00005–0.0003 of the phenotypic variance and GREML was able to differentiate true additive and dominance heritability levels. GBLUP of the total genetic value as the summation of additive and dominance effects had higher prediction accuracy than either additive or dominance GBLUP, causal variants had the highest accuracy of GREML and GBLUP, and predicted accuracies were in agreement with observed accuracies. Genomic additive and dominance relationship matrices using SNP markers were consistent with theoretical expectations. The GREML and GBLUP methods can be an effective tool for assessing the type and magnitude of genetic effects affecting a phenotype and for predicting the total genetic value at the whole genome level. PMID:24498162
Kuhn, David N; Motamayor, Juan Carlos; Meerow, Alan W; Borrone, James W; Schnell, Raymond J
2008-10-01
For well-studied plant species with whole genome sequence or extensive EST data, SNP markers are the logical choice for both genotyping and whole genome association studies. However, SNP markers may not address the needs of researchers working on specialty crops with limited available genomic information. Microsatellite markers have been frequently employed due to their robustness, but marker development can be difficult and may result in few polymorphic markers. SSCP markers, like microsatellites, are PCR-based and scored by electrophoretic mobility but, because they are based on SNPs rather than length differences, occur more frequently and are easier to develop than microsatellites. We have examined how well correlated the estimates of genetic diversity and genetic distance are in a population or germplasm collection when measured by 13 highly polymorphic microsatellite markers or 20 SSCP markers. We observed a significant correlation in pairwise genetic distances of 82 individuals in an international cacao germplasm collection (Mantel test Rxy=0.59, p<0.0001 for 10 000 permutations). Both sets of markers could distinguish each individual in the population. These data provide strong support for the use of SSCP markers in the genotyping of plant species where development of microsatellites would be difficult or expensive.
Comparing biomarkers as principal surrogate endpoints.
Huang, Ying; Gilbert, Peter B
2011-12-01
Recently a new definition of surrogate endpoint, the "principal surrogate," was proposed based on causal associations between treatment effects on the biomarker and on the clinical endpoint. Despite its appealing interpretation, limited research has been conducted to evaluate principal surrogates, and existing methods focus on risk models that consider a single biomarker. How to compare principal surrogate value of biomarkers or general risk models that consider multiple biomarkers remains an open research question. We propose to characterize a marker or risk model's principal surrogate value based on the distribution of risk difference between interventions. In addition, we propose a novel summary measure (the standardized total gain) that can be used to compare markers and to assess the incremental value of a new marker. We develop a semiparametric estimated-likelihood method to estimate the joint surrogate value of multiple biomarkers. This method accommodates two-phase sampling of biomarkers and is more widely applicable than existing nonparametric methods by incorporating continuous baseline covariates to predict the biomarker(s), and is more robust than existing parametric methods by leaving the error distribution of markers unspecified. The methodology is illustrated using a simulated example set and a real data set in the context of HIV vaccine trials. © 2011, The International Biometric Society.
Alghanim, Hussain; Antunes, Joana; Silva, Deborah Soares Bispo Santos; Alho, Clarice Sampaio; Balamurugan, Kuppareddi; McCord, Bruce
2017-11-01
Recent developments in the analysis of epigenetic DNA methylation patterns have demonstrated that certain genetic loci show a linear correlation with chronological age. It is the goal of this study to identify a new set of epigenetic methylation markers for the forensic estimation of human age. A total of 27 CpG sites at three genetic loci, SCGN, DLX5 and KLF14, were examined to evaluate the correlation of their methylation status with age. These sites were evaluated using 72 blood samples and 91 saliva samples collected from volunteers with ages ranging from 5 to 73 years. DNA was bisulfite modified followed by PCR amplification and pyrosequencing to determine the level of DNA methylation at each CpG site. In this study, certain CpG sites in SCGN and KLF14 loci showed methylation levels that were correlated with chronological age; however, the tested CpG sites in DLX5 did not show a correlation with age. Using a 52-saliva sample training set, two age-predictor models were developed by means of a multivariate linear regression analysis for age prediction. The two models performed similarly with a single-locus model explaining 85% of the age variance at a mean absolute deviation of 5.8 years and a dual-locus model explaining 84% of the age variance with a mean absolute deviation of 6.2 years. In the validation set, the mean absolute deviation was measured to be 8.0 years and 7.1 years for the single- and dual-locus model, respectively. Another age predictor model was also developed using a 40-blood sample training set that accounted for 71% of the age variance. This model gave a mean absolute deviation of 6.6 years for the training set and 10.3 years for the validation set. The results indicate that specific CpGs in SCGN and KLF14 can be used as potential epigenetic markers to estimate age using saliva and blood specimens. These epigenetic markers could provide important information in cases where the determination of a suspect's age is critical in developing investigative leads. Copyright © 2017. Published by Elsevier B.V.
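The two performance summaries quoted in the abstract above, the share of age variance explained and the mean absolute deviation, can be computed as in the following sketch. It assumes that "variance explained" corresponds to the usual coefficient of determination; the function names and the ages are invented for illustration and are not data from the study.

```python
def mean_absolute_deviation(observed, predicted):
    """Mean absolute difference between observed and predicted ages (years)."""
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - residual SS / total SS."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1 - ss_res / ss_tot


# Invented example ages, purely illustrative.
observed_age = [20, 30, 40, 50]
predicted_age = [22, 27, 41, 52]
mad = mean_absolute_deviation(observed_age, predicted_age)
r2 = r_squared(observed_age, predicted_age)
```

Computing both quantities separately on the training and validation samples mirrors the comparison in the abstract, where the deviation grows when the models are applied to samples not used for fitting.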
Coelho, A V C; Moura, R R; Cavalcanti, C A J; Guimarães, R L; Sandrin-Garcia, P; Crovella, S; Brandão, L A C
2015-03-31
Genetic association studies determine how genes influence traits. However, non-detected population substructure may bias the analysis, resulting in spurious results. One method to detect substructure is to genotype ancestry informative markers (AIMs) besides the candidate variants, quantifying how much ancestral populations contribute to the samples' genetic background. The present study aimed to use a minimum quantity of markers, while retaining full potential to estimate ancestries. We tested the feasibility of a subset of the 12 most informative markers from a previously established study to estimate influence from three ancestral populations: European, African and Amerindian. The results showed that in a sample with a diverse ethnicity (N = 822) derived from 1000 Genomes database, the 12 AIMs had the same capacity to estimate ancestries when compared to the original set of 128 AIMs, since estimates from the two panels were closely correlated. Thus, these 12 SNPs were used to estimate ancestry in a new sample (N = 192) from an admixed population in Recife, Northeast Brazil. The ancestry estimates from Recife subjects were in accordance with previous studies, showing that Northeastern Brazilian populations show great influence from European ancestry (59.7%), followed by African (23.0%) and Amerindian (17.3%) ancestries. Ethnicity self-classification according to skin-color was confirmed to be a poor indicator of population substructure in Brazilians, since ancestry estimates overlapped between classifications. Thus, our streamlined panel of 12 markers may substitute panels with more markers, while retaining the capacity to control for population substructure and admixture, thereby reducing sample processing time.
THREaD Mapper Studio: a novel, visual web server for the estimation of genetic linkage maps
Cheema, Jitender; Ellis, T. H. Noel; Dicks, Jo
2010-01-01
The estimation of genetic linkage maps is a key component in plant and animal research, providing both an indication of the genetic structure of an organism and a mechanism for identifying candidate genes associated with traits of interest. Because of this importance, several computational solutions to genetic map estimation exist, mostly implemented as stand-alone software packages. However, the estimation process is often largely hidden from the user. Consequently, problems such as a program crashing may occur that leave a user baffled. THREaD Mapper Studio (http://cbr.jic.ac.uk/threadmapper) is a new web site that implements a novel, visual and interactive method for the estimation of genetic linkage maps from DNA markers. The rationale behind the web site is to make the estimation process as transparent and robust as possible, while also allowing users to use their expert knowledge during analysis. Indeed, the 3D visual nature of the tool allows users to spot features in a data set, such as outlying markers and potential structural rearrangements that could cause problems with the estimation procedure and to account for them in their analysis. Furthermore, THREaD Mapper Studio facilitates the visual comparison of genetic map solutions from third party software, aiding users in developing robust solutions for their data sets. PMID:20494977
Windhausen, Vanessa S; Atlin, Gary N; Hickey, John M; Crossa, Jose; Jannink, Jean-Luc; Sorrells, Mark E; Raman, Babu; Cairns, Jill E; Tarekegne, Amsal; Semagn, Kassa; Beyene, Yoseph; Grudloyma, Pichet; Technow, Frank; Riedelsheimer, Christian; Melchinger, Albrecht E
2012-11-01
Genomic prediction is expected to considerably increase genetic gains by increasing selection intensity and accelerating the breeding cycle. In this study, marker effects estimated in 255 diverse maize (Zea mays L.) hybrids were used to predict grain yield, anthesis date, and anthesis-silking interval within the diversity panel and testcross progenies of 30 F(2)-derived lines from each of five populations. Although up to 25% of the genetic variance could be explained by cross validation within the diversity panel, the prediction of testcross performance of F(2)-derived lines using marker effects estimated in the diversity panel was on average zero. Hybrids in the diversity panel could be grouped into eight breeding populations differing in mean performance. When performance was predicted separately for each breeding population on the basis of marker effects estimated in the other populations, predictive ability was low (i.e., 0.12 for grain yield). These results suggest that prediction resulted mostly from differences in mean performance of the breeding populations and less from the relationship between the training and validation sets or linkage disequilibrium with causal variants underlying the predicted traits. Potential uses for genomic prediction in maize hybrid breeding are discussed emphasizing the need of (1) a clear definition of the breeding scenario in which genomic prediction should be applied (i.e., prediction among or within populations), (2) a detailed analysis of the population structure before performing cross validation, and (3) larger training sets with strong genetic relationship to the validation set.
Weigel, K A; de los Campos, G; González-Recio, O; Naya, H; Wu, X L; Long, N; Rosa, G J M; Gianola, D
2009-10-01
The objective of the present study was to assess the predictive ability of subsets of single nucleotide polymorphism (SNP) markers for development of low-cost, low-density genotyping assays in dairy cattle. Dense SNP genotypes of 4,703 Holstein bulls were provided by the USDA Agricultural Research Service. A subset of 3,305 bulls born from 1952 to 1998 was used to fit various models (training set), and a subset of 1,398 bulls born from 1999 to 2002 was used to evaluate their predictive ability (testing set). After editing, data included genotypes for 32,518 SNP and August 2003 and April 2008 predicted transmitting abilities (PTA) for lifetime net merit (LNM$), the latter resulting from progeny testing. The Bayesian least absolute shrinkage and selection operator method was used to regress August 2003 PTA on marker covariates in the training set to arrive at estimates of marker effects and direct genomic PTA. The coefficient of determination (R²) from regressing the April 2008 progeny test PTA of bulls in the testing set on their August 2003 direct genomic PTA was 0.375. Subsets of 300, 500, 750, 1,000, 1,250, 1,500, and 2,000 SNP were created by choosing equally spaced and highly ranked SNP, with the latter based on the absolute value of their estimated effects obtained from the training set. The SNP effects were re-estimated from the training set for each subset of SNP, and the 2008 progeny test PTA of bulls in the testing set were regressed on corresponding direct genomic PTA. The R² values for subsets of 300, 500, 750, 1,000, 1,250, 1,500, and 2,000 SNP with largest effects (evenly spaced SNP) were 0.184 (0.064), 0.236 (0.111), 0.269 (0.190), 0.289 (0.179), 0.307 (0.228), 0.313 (0.268), and 0.322 (0.291), respectively. These results indicate that a low-density assay comprising selected SNP could be a cost-effective alternative for selection decisions and that significant gains in predictive ability may be achieved by increasing the number of SNP allocated to such an assay from 300 or fewer to 1,000 or more.
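The two subset strategies compared in this abstract (evenly spaced SNP versus SNP ranked by the absolute value of their estimated effects) can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' code: the function names, the midpoint spacing rule, and the toy effect values are assumptions, and marker order is used as a stand-in for genomic position.

```python
def evenly_spaced_subset(n_markers, k):
    """Pick k marker indices spread evenly over 0..n_markers-1.

    Assumption: markers are already sorted by genomic position, so
    index spacing approximates physical spacing.
    """
    if not 0 < k <= n_markers:
        raise ValueError("k must be between 1 and n_markers")
    step = n_markers / k
    # Take the midpoint of each of the k equal-width index bins.
    return [int(i * step + step / 2) for i in range(k)]


def top_ranked_subset(effects, k):
    """Pick the k markers with the largest absolute estimated effect."""
    if not 0 < k <= len(effects):
        raise ValueError("k must be between 1 and len(effects)")
    order = sorted(range(len(effects)),
                   key=lambda i: abs(effects[i]), reverse=True)
    # Return indices in map order so both strategies are comparable.
    return sorted(order[:k])


if __name__ == "__main__":
    toy_effects = [0.1, -0.9, 0.3, 0.05, -0.4]  # hypothetical values
    print(evenly_spaced_subset(10, 5))
    print(top_ranked_subset(toy_effects, 2))
```

In a workflow like the one described, marker effects would then be re-estimated on the training set using only the chosen subset before predicting the testing set.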
Surrogate markers for time-varying treatments and outcomes
Hsu, Jesse Y; Kennedy, Edward H; Roy, Jason A; Stephens-Shields, Alisa J; Small, Dylan S; Joffe, Marshall M
2015-01-01
Background A surrogate marker is a variable commonly used in clinical trials to guide treatment decisions when the outcome of ultimate interest is not available. A good surrogate marker is one where the treatment effect on the surrogate is a strong predictor of the effect of treatment on the outcome. We review the situation when there is one treatment delivered at baseline, one surrogate measured at one later time point and one ultimate outcome of interest, and discuss new issues arising when variables are time-varying. Methods Most of the literature on surrogate markers has only considered simple settings with one treatment, one surrogate, and one outcome of interest at a fixed time point. However, more complicated time-varying settings are common in practice. In this paper, we describe the unique challenges in two settings, time-varying treatments and time-varying surrogates, while relating the ideas back to the causal-effects and causal-association paradigms. Conclusions In addition to discussing and extending popular notions of surrogacy to time-varying settings, we give examples illustrating that one can be misled by not taking into account time-varying information about the surrogate or treatment. We hope this paper has provided some motivation for future work on estimation and inference in such settings. PMID:25948621
2013-01-01
Background Brachiaria ruziziensis is one of the most important forage species planted in the tropics. The application of genomic tools to aid the selection of superior genotypes can provide support to B. ruziziensis breeding programs. However, there is a complete lack of information about the B. ruziziensis genome. Also, the availability of genomic tools, such as molecular markers, to support B. ruziziensis breeding programs is rather limited. Recently, next-generation sequencing technologies have been applied to generate sequence data for the identification of microsatellite regions and primer design. In this study, we present a first validated set of SSR markers for Brachiaria ruziziensis, selected from a de novo partial genome assembly of single-end Illumina reads. Results A total of 85,567 perfect microsatellite loci were detected in contigs with a minimum 10X coverage. We selected a set of 500 microsatellite loci identified in contigs with minimum 100X coverage for primer design and synthesis, and tested a subset of 269 primer pairs, 198 of which were polymorphic on 11 representative B. ruziziensis accessions. Descriptive statistics for these primer pairs are presented, as well as estimates of marker transferability to other relevant brachiaria species. Finally, a set of 11 multiplex panels containing the 30 most informative markers was validated and proposed for B. ruziziensis genetic analysis. Conclusions We show that the detection and development of microsatellite markers from genome assembled Illumina single-end DNA sequences is highly efficient. The developed markers are readily suitable for genetic analysis and marker assisted selection of Brachiaria ruziziensis. The use of this approach for microsatellite marker development is promising for species with limited genomic information, whose breeding programs would benefit from the use of genomic tools. To our knowledge, this is the first set of microsatellite markers developed for this important species. 
PMID:23324172
A combined vision-inertial fusion approach for 6-DoF object pose estimation
NASA Astrophysics Data System (ADS)
Li, Juan; Bernardos, Ana M.; Tarrío, Paula; Casar, José R.
2015-02-01
The estimation of the 3D position and orientation of moving objects ('pose' estimation) is a critical process for many applications in robotics, computer vision or mobile services. Although major research efforts have been carried out to design accurate, fast and robust indoor pose estimation systems, it remains an open challenge to provide a low-cost, easy-to-deploy and reliable solution. Addressing this issue, this paper describes a hybrid approach for 6 degrees of freedom (6-DoF) pose estimation that fuses acceleration data and stereo vision to overcome the respective weaknesses of single technology approaches. The system relies on COTS technologies (standard webcams, accelerometers) and printable colored markers. It uses a set of infrastructure cameras, located to have the object to be tracked visible most of the operation time; the target object has to include an embedded accelerometer and be tagged with a fiducial marker. This simple marker has been designed for easy detection and segmentation and it may be adapted to different service scenarios (in shape and colors). Experimental results show that the proposed system provides high accuracy, while satisfactorily dealing with the real-time constraints.
Windhausen, Vanessa S.; Atlin, Gary N.; Hickey, John M.; Crossa, Jose; Jannink, Jean-Luc; Sorrells, Mark E.; Raman, Babu; Cairns, Jill E.; Tarekegne, Amsal; Semagn, Kassa; Beyene, Yoseph; Grudloyma, Pichet; Technow, Frank; Riedelsheimer, Christian; Melchinger, Albrecht E.
2012-01-01
Genomic prediction is expected to considerably increase genetic gains by increasing selection intensity and accelerating the breeding cycle. In this study, marker effects estimated in 255 diverse maize (Zea mays L.) hybrids were used to predict grain yield, anthesis date, and anthesis-silking interval within the diversity panel and testcross progenies of 30 F2-derived lines from each of five populations. Although up to 25% of the genetic variance could be explained by cross validation within the diversity panel, the prediction of testcross performance of F2-derived lines using marker effects estimated in the diversity panel was on average zero. Hybrids in the diversity panel could be grouped into eight breeding populations differing in mean performance. When performance was predicted separately for each breeding population on the basis of marker effects estimated in the other populations, predictive ability was low (i.e., 0.12 for grain yield). These results suggest that prediction resulted mostly from differences in mean performance of the breeding populations and less from the relationship between the training and validation sets or linkage disequilibrium with causal variants underlying the predicted traits. Potential uses for genomic prediction in maize hybrid breeding are discussed emphasizing the need of (1) a clear definition of the breeding scenario in which genomic prediction should be applied (i.e., prediction among or within populations), (2) a detailed analysis of the population structure before performing cross validation, and (3) larger training sets with strong genetic relationship to the validation set. PMID:23173094
Crossa, José; Campos, Gustavo de Los; Pérez, Paulino; Gianola, Daniel; Burgueño, Juan; Araus, José Luis; Makumbi, Dan; Singh, Ravi P; Dreisigacker, Susanne; Yan, Jianbing; Arief, Vivi; Banziger, Marianne; Braun, Hans-Joachim
2010-10-01
The availability of dense molecular markers has made possible the use of genomic selection (GS) for plant breeding. However, the evaluation of models for GS in real plant populations is very limited. This article evaluates the performance of parametric and semiparametric models for GS using wheat (Triticum aestivum L.) and maize (Zea mays) data in which different traits were measured in several environmental conditions. The findings, based on extensive cross-validations, indicate that models including marker information had higher predictive ability than pedigree-based models. In the wheat data set, and relative to a pedigree model, gains in predictive ability due to inclusion of markers ranged from 7.7 to 35.7%. Correlation between observed and predictive values in the maize data set achieved values up to 0.79. Estimates of marker effects were different across environmental conditions, indicating that genotype × environment interaction is an important component of genetic variability. These results indicate that GS in plant breeding can be an effective strategy for selecting among lines whose phenotypes have yet to be observed.
Kelbe, David; Oak Ridge National Lab.; van Aardt, Jan; ...
2016-10-18
Terrestrial laser scanning has demonstrated increasing potential for rapid comprehensive measurement of forest structure, especially when multiple scans are spatially registered in order to reduce the limitations of occlusion. Although marker-based registration techniques (based on retro-reflective spherical targets) are commonly used in practice, a blind marker-free approach is preferable, insofar as it supports rapid operational data acquisition. To support these efforts, we extend the pairwise registration approach of our earlier work, and develop a graph-theoretical framework to perform blind marker-free global registration of multiple point cloud data sets. Pairwise pose estimates are weighted based on their estimated error, in order to overcome pose conflict while exploiting redundant information and improving precision. The proposed approach was tested for eight diverse New England forest sites, with 25 scans collected at each site. Quantitative assessment was provided via a novel embedded confidence metric, with a mean estimated root-mean-square error of 7.2 cm and 89% of scans connected to the reference node. Lastly, this paper assesses the validity of the embedded multiview registration confidence metric and evaluates the performance of the proposed registration algorithm.
Improving estimates of genetic maps: a meta-analysis-based approach.
Stewart, William C L
2007-07-01
Inaccurate genetic (or linkage) maps can reduce the power to detect linkage, increase type I error, and distort haplotype and relationship inference. To improve the accuracy of existing maps, I propose a meta-analysis-based method that combines independent map estimates into a single estimate of the linkage map. The method uses the variance of each independent map estimate to combine them efficiently, whether the map estimates use the same set of markers or not. As compared with a joint analysis of the pooled genotype data, the proposed method is attractive for three reasons: (1) it has comparable efficiency to the maximum likelihood map estimate when the pooled data are homogeneous; (2) relative to existing map estimation methods, it can have increased efficiency when the pooled data are heterogeneous; and (3) it avoids the practical difficulties of pooling human subjects data. On the basis of simulated data modeled after two real data sets, the proposed method can reduce the sampling variation of linkage maps commonly used in whole-genome linkage scans. Furthermore, when the independent map estimates are also maximum likelihood estimates, the proposed method performs as well as or better than when they are estimated by the program CRIMAP. Since variance estimates of maps may not always be available, I demonstrate the feasibility of three different variance estimators. Overall, the method should prove useful to investigators who need map positions for markers not contained in publicly available maps, and to those who wish to minimize the negative effects of inaccurate maps. Copyright 2007 Wiley-Liss, Inc.
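One standard way to combine independent estimates using their variances is fixed-effect inverse-variance weighting. The sketch below applies it to a single inter-marker distance. It is not claimed to be the exact estimator of the paper (which also handles map estimates built from different marker sets); the function name and the numbers are hypothetical.

```python
def combine_estimates(estimates, variances):
    """Inverse-variance weighted mean of independent estimates.

    Returns (pooled_estimate, pooled_variance). Each estimate is
    weighted by 1/variance, so more precise studies count for more.
    """
    if not estimates or len(estimates) != len(variances):
        raise ValueError("need equally long, non-empty inputs")
    if any(v <= 0 for v in variances):
        raise ValueError("variances must be positive")
    weights = [1.0 / v for v in variances]
    total = sum(weights)
    pooled = sum(w * e for w, e in zip(weights, estimates)) / total
    return pooled, 1.0 / total


if __name__ == "__main__":
    # Hypothetical distances (cM) between the same two markers,
    # estimated in two independent studies.
    pooled, var = combine_estimates([10.0, 20.0], [1.0, 4.0])
    print(pooled, var)
```

The pooled variance is never larger than the smallest input variance, which is the sense in which combining maps can reduce sampling variation.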
Precision assessment of model-based RSA for a total knee prosthesis in a biplanar set-up.
Trozzi, C; Kaptein, B L; Garling, E H; Shelyakova, T; Russo, A; Bragonzoni, L; Martelli, S
2008-10-01
Model-based Roentgen Stereophotogrammetric Analysis (RSA) was recently developed for the measurement of prosthesis micromotion. Its main advantage is that markers do not need to be attached to the implants as traditional marker-based RSA requires. Model-based RSA has only been tested in uniplanar radiographic set-ups. A biplanar set-up would theoretically facilitate the pose estimation algorithm, since radiographic projections would show more different shape features of the implants than in uniplanar images. We tested the precision of model-based RSA and compared it with that of the traditional marker-based method in a biplanar set-up. Micromotions of both tibial and femoral components were measured with both the techniques from double examinations of patients participating in a clinical study. The results showed that in the biplanar set-up model-based RSA presents a homogeneous distribution of precision for all the translation directions, but an inhomogeneous error for rotations, especially internal-external rotation presented higher errors than rotations about the transverse and sagittal axes. Model-based RSA was less precise than the marker-based method, although the differences were not significant for the translations and rotations of the tibial component, with the exception of the internal-external rotations. For both prosthesis components the precisions of model-based RSA were below 0.2 mm for all the translations, and below 0.3 degrees for rotations about transverse and sagittal axes. These values are still acceptable for clinical studies aimed at evaluating total knee prosthesis micromotion. In a biplanar set-up model-based RSA is a valid alternative to traditional marker-based RSA where marking of the prosthesis is an enormous disadvantage.
On the validity of time-dependent AUC estimators.
Schmid, Matthias; Kestler, Hans A; Potapov, Sergej
2015-01-01
Recent developments in molecular biology have led to the massive discovery of new marker candidates for the prediction of patient survival. To evaluate the predictive value of these markers, statistical tools for measuring the performance of survival models are needed. We consider estimators of discrimination measures, which are a popular approach to evaluate survival predictions in biomarker studies. Estimators of discrimination measures are usually based on regularity assumptions such as the proportional hazards assumption. Based on two sets of molecular data and a simulation study, we show that violations of the regularity assumptions may lead to over-optimistic estimates of prediction accuracy and may therefore result in biased conclusions regarding the clinical utility of new biomarkers. In particular, we demonstrate that biased medical decision making is possible even if statistical checks indicate that all regularity assumptions are satisfied. © The Author 2013. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
NASA Astrophysics Data System (ADS)
Lu, Yao; Chan, Heang-Ping; Wei, Jun; Hadjiiski, Lubomir M.; Samala, Ravi K.
2017-10-01
In digital breast tomosynthesis (DBT), the high-attenuation metallic clips marking a previous biopsy site in the breast cause errors in the estimation of attenuation along the ray paths intersecting the markers during reconstruction, which result in interplane and inplane artifacts obscuring the visibility of subtle lesions. We proposed a new metal artifact reduction (MAR) method to improve image quality. Our method uses automatic detection and segmentation to generate a marker location map for each projection (PV). A voting technique based on the geometric correlation among different PVs is designed to reduce false positives (FPs) and to label the pixels on the PVs and the voxels in the imaged volume that represent the location and shape of the markers. An iterative diffusion method replaces the labeled pixels on the PVs with estimated tissue intensity from the neighboring regions while preserving the original pixel values in the neighboring regions. The inpainted PVs are then used for DBT reconstruction. The markers are repainted on the reconstructed DBT slices for radiologists’ information. The MAR method is independent of reconstruction techniques or acquisition geometry. For the training set, the method achieved 100% success rate with one FP in 19 views. For the test set, the success rate by view was 97.2% for core biopsy microclips and 66.7% for clusters of large post-lumpectomy markers with a total of 10 FPs in 58 views. All FPs were large dense benign calcifications that also generated artifacts if they were not corrected by MAR. For the views with successful detection, the metal artifacts were reduced to a level that was not visually apparent in the reconstructed slices. The visibility of breast lesions obscured by the reconstruction artifacts from the metallic markers was restored.
Marker Configuration Model-Based Roentgen Fluoroscopic Analysis.
Garling, Eric H; Kaptein, Bart L; Geleijns, Koos; Nelissen, Rob G H H; Valstar, Edward R
2005-04-01
It remains unknown if and how the polyethylene bearing in mobile bearing knees moves during dynamic activities with respect to the tibial base plate. Marker Configuration Model-Based Roentgen Fluoroscopic Analysis (MCM-based RFA) uses a marker configuration model of inserted tantalum markers in order to accurately estimate the pose of an implant or bone using single plane Roentgen images or fluoroscopic images. The goal of this study is to assess the accuracy of MCM-Based RFA in a standard fluoroscopic set-up using phantom experiments and to determine the error propagation with computer simulations. The experimental set-up of the phantom study was calibrated using a calibration box equipped with 600 tantalum markers, which corrected for image distortion and determined the focus position. In the computer simulation study the influence of image distortion, MC-model accuracy, focus position, the relative distance between MC-models and MC-model configuration on the accuracy of MCM-Based RFA were assessed. The phantom study established that the in-plane accuracy of MCM-Based RFA is 0.1 mm and the out-of-plane accuracy is 0.9 mm. The rotational accuracy is 0.1 degrees. A ninth-order polynomial model was used to correct for image distortion. Marker-Based RFA was estimated to have, in a worst case scenario, an in vivo translational accuracy of 0.14 mm (x-axis), 0.17 mm (y-axis), 1.9 mm (z-axis), respectively, and a rotational accuracy of 0.3 degrees. When using fluoroscopy to study kinematics, image distortion and the accuracy of models are important factors, which influence the accuracy of the measurements. MCM-Based RFA has the potential to be an accurate, clinically useful tool for studying kinematics after total joint replacement using standard equipment.
Schulthess, Albert W; Zhao, Yusheng; Longin, C Friedrich H; Reif, Jochen C
2018-03-01
Predictabilities for wheat hybrids less related to the estimation set were improved by shifting from single- to multiple-trait genomic prediction of Fusarium head blight severity. Breeding for improved Fusarium head blight resistance (FHBr) of wheat is a very laborious and expensive task. FHBr complexity is mainly due to its highly polygenic nature and because FHB severity (FHBs) is greatly influenced by the environment. Associated traits plant height and heading date may provide additional information related to FHBr, but this is ignored in single-trait genomic prediction (STGP). The aim of our study was to explore the benefits in predictabilities of multiple-trait genomic prediction (MTGP) over STGP of target trait FHBs in a population of 1604 wheat hybrids using information on 17,372 single nucleotide polymorphism markers along with indicator traits plant height and heading date. The additive inheritance of FHBs allowed accurate hybrid performance predictions using information on general combining abilities or average performance of both parents without the need of markers. Information on molecular markers and indicator trait(s) improved FHBs predictabilities for hybrids less related to the estimation set. Indicator traits must be observed on the predicted individuals to benefit from MTGP. Magnitudes of genetic and phenotypic correlations along with improvements in predictabilities made plant height a better indicator trait for FHBs than heading date. Thus, MTGP having only plant height as indicator trait already maximized FHBs predictabilities. Provided a good indicator trait was available, MTGP could reduce the impacts of genotype × environment interaction on STGP for hybrids less related to the estimation set.
Santos, Hadassa C; Horimoto, Andréa V R; Tarazona-Santos, Eduardo; Rodrigues-Soares, Fernanda; Barreto, Mauricio L; Horta, Bernardo L; Lima-Costa, Maria F; Gouveia, Mateus H; Machado, Moara; Silva, Thiago M; Sanches, José M; Esteban, Nubia; Magalhaes, Wagner CS; Rodrigues, Maíra R; Kehdy, Fernanda S G; Pereira, Alexandre C
2016-01-01
The Brazilian population is considered to be highly admixed. The main contributing ancestral populations were European and African, with Amerindians contributing to a lesser extent. The aims of this study were to provide a resource for determining and quantifying individual continental ancestry using the smallest number of SNPs possible, thus allowing for a cost- and time-efficient strategy for genomic ancestry determination. We identified and validated a minimum set of 192 ancestry informative markers (AIMs) for the genetic ancestry determination of Brazilian populations. These markers were selected on the basis of their distribution throughout the human genome, and their capacity of being genotyped on widely available commercial platforms. We analyzed genotyping data from 6487 individuals belonging to three Brazilian cohorts. Estimates of individual admixture using this 192 AIM panels were highly correlated with estimates using ~370 000 genome-wide SNPs: 91%, 92%, and 74% of, respectively, African, European, and Native American ancestry components. Besides that, 192 AIMs are well distributed among populations from these ancestral continents, allowing greater freedom in future studies with this panel regarding the choice of reference populations. We also observed that genetic ancestry inferred by AIMs provides similar association results to the one obtained using ancestry inferred by genomic data (370 K SNPs) in a simple regression model with rs1426654, related to skin pigmentation, genotypes as dependent variable. In conclusion, these markers can be used to identify and accurately quantify ancestry of Latin Americans or US Hispanics/Latino individuals, in particular in the context of fine-mapping strategies that require the quantification of continental ancestry in thousands of individuals. PMID:26395555
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kelbe, David; Oak Ridge National Lab.; van Aardt, Jan
Terrestrial laser scanning has demonstrated increasing potential for rapid comprehensive measurement of forest structure, especially when multiple scans are spatially registered in order to reduce the limitations of occlusion. Although marker-based registration techniques (based on retro-reflective spherical targets) are commonly used in practice, a blind marker-free approach is preferable, insofar as it supports rapid operational data acquisition. To support these efforts, we extend the pairwise registration approach of our earlier work, and develop a graph-theoretical framework to perform blind marker-free global registration of multiple point cloud data sets. Pairwise pose estimates are weighted based on their estimated error, in order to overcome pose conflict while exploiting redundant information and improving precision. The proposed approach was tested for eight diverse New England forest sites, with 25 scans collected at each site. Quantitative assessment was provided via a novel embedded confidence metric, with a mean estimated root-mean-square error of 7.2 cm and 89% of scans connected to the reference node. Lastly, this paper assesses the validity of the embedded multiview registration confidence metric and evaluates the performance of the proposed registration algorithm.
Assessing genomic selection prediction accuracy in a dynamic barley breeding
USDA-ARS's Scientific Manuscript database
Genomic selection is a method to improve quantitative traits in crops and livestock by estimating breeding values of selection candidates using phenotype and genome-wide marker data sets. Prediction accuracy has been evaluated through simulation and cross-validation, however validation based on prog...
Chronic Kidney Disease Awareness Among Individuals with Clinical Markers of Kidney Dysfunction
Plantinga, Laura C.; Hsu, Chi-yuan; Jordan, Regina; Burrows, Nilka Ríos; Hedgeman, Elizabeth; Yee, Jerry; Saran, Rajiv; Powe, Neil R.
2011-01-01
Summary Background and objectives Awareness of chronic kidney disease (CKD) among providers and patients is low. Whether clinical cues prompt recognition of CKD is unknown. We examined whether markers of kidney disease that should trigger CKD recognition among providers are associated with higher individual CKD awareness. Design, setting, participants, & measurements CKD awareness was assessed in 1852 adults with an estimated GFR <60 ml/min per 1.73 m2 using 1999 to 2008 National Health and Nutrition Examination Survey data. CKD awareness was a “yes” answer to “Have you ever been told you have weak or failing kidneys?” Participants were grouped by distribution of the following abnormal markers of CKD: hyperkalemia, acidosis, hyperphosphatemia, elevated blood urea nitrogen, anemia, albuminuria, and uncontrolled hypertension. Odds of CKD awareness associated with each abnormal marker and groupings of markers were estimated by multivariable logistic regression. Results Among individuals with kidney disease, only those with albuminuria had greater odds of CKD awareness (adjusted odds ratio, 4.0, P < 0.01) than those without. Odds of CKD awareness increased with each additional manifested clinical marker of CKD (adjusted odds ratio, 1.3, P = 0.05). Nonetheless, 90% of individuals with two to four markers of CKD and 84% of individuals with ≥5 markers of CKD were unaware of their disease. Conclusions Although individuals who manifest many markers of kidney dysfunction are more likely to be aware of their CKD, their CKD awareness remains low. A better understanding of mechanisms of awareness is required to facilitate earlier detection of CKD and implement therapy to minimize associated complications. PMID:21784832
A generalized model for multi-marker analysis of cell cycle progression in synchrony experiments.
Mayhew, Michael B; Robinson, Joshua W; Jung, Boyoun; Haase, Steven B; Hartemink, Alexander J
2011-07-01
To advance understanding of eukaryotic cell division, it is important to observe the process precisely. To this end, researchers monitor changes in dividing cells as they traverse the cell cycle, with the presence or absence of morphological or genetic markers indicating a cell's position in a particular interval of the cell cycle. A wide variety of marker data is available, including information-rich cellular imaging data. However, few formal statistical methods have been developed to use these valuable data sources in estimating how a population of cells progresses through the cell cycle. Furthermore, existing methods are designed to handle only a single binary marker of cell cycle progression at a time. Consequently, they cannot facilitate comparison of experiments involving different sets of markers. Here, we develop a new sampling model to accommodate an arbitrary number of different binary markers that characterize the progression of a population of dividing cells along a branching process. We engineer a strain of Saccharomyces cerevisiae with fluorescently labeled markers of cell cycle progression, and apply our new model to two image datasets we collected from the strain, as well as an independent dataset of different markers. We use our model to estimate the duration of post-cytokinetic attachment between a S. cerevisiae mother and daughter cell. The Java implementation is fast and extensible, and includes a graphical user interface. Our model provides a powerful and flexible cell cycle analysis tool, suitable to any type or combination of binary markers. The software is available from: http://www.cs.duke.edu/~amink/software/cloccs/. Contact: michael.mayhew@duke.edu; amink@cs.duke.edu.
Guillaume, François; Fritz, Sébastien; Boichard, Didier; Druet, Tom
2008-01-01
The efficiency of the French marker-assisted selection (MAS) was estimated by a simulation study. The data files of two different time periods were used: April 2004 and 2006. The simulation method used the structure of the existing French MAS: same pedigree, same marker genotypes and same animals with records. The program simulated breeding values and new records based on this existing structure and knowledge on the QTL used in MAS (variance and frequency). Reliabilities of genetic values of young animals (less than one year old) obtained with and without marker information were compared to assess the efficiency of MAS for evaluation of milk, fat and protein yields and fat and protein contents. Mean gains of reliability ranged from 0.015 to 0.094 and from 0.038 to 0.114 in 2004 and 2006, respectively. The larger number of animals genotyped and the use of a new set of genetic markers can explain the improvement of MAS reliability from 2004 to 2006. This improvement was also observed by analysis of information content for young candidates. The gain of MAS reliability with respect to classical selection was larger for sons of sires with genotyped progeny daughters with records. Finally, it was shown that when superiority of MAS over classical selection was estimated with daughter yield deviations obtained after progeny test instead of true breeding values, the gain was underestimated. PMID:18096117
Marker selection for the transmission/disequilibrium test, in recently admixed populations.
Kaplan, N L; Martin, E R; Morris, R W; Weir, B S
1998-01-01
Recent admixture between genetically differentiated populations can result in high levels of association between alleles at loci that are ≤10 cM apart. The transmission/disequilibrium test (TDT) proposed by Spielman et al. (1993) can be a powerful test of linkage between disease and marker loci in the presence of association and therefore could be a useful test of linkage in admixed populations. The degree of association between alleles at two loci depends on the differences in allele frequencies, at the two loci, in the founding populations; therefore, the choice of marker is important. For a multiallelic marker, one strategy that may improve the power of the TDT is to group marker alleles within a locus, on the basis of information about the founding populations and the admixed population, thereby collapsing the marker into one with fewer alleles. We have examined the consequences of collapsing a microsatellite into a two-allele marker, when two founding populations are assumed for the admixed population, and have found that if there is random mating in the admixed population, then typically there is a collapsing for which the power of the TDT is greater than that for the original microsatellite marker. A method is presented for finding the optimal collapsing that has minimal dependence on the disease and that uses estimates either of marker allele frequencies in the two founding populations or of marker allele frequencies in the current, admixed population and in one of the founding populations. Furthermore, this optimal collapsing is not always the collapsing with the largest difference in allele frequencies in the founding populations. To demonstrate this strategy, we considered a recent data set, published previously, that provides frequency estimates for 30 microsatellites in 13 populations. PMID:9497257
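To make the idea of collapsing concrete, the sketch below computes the biallelic TDT statistic of Spielman et al., (b - c)^2 / (b + c) with 1 degree of freedom, after grouping the alleles of a multiallelic marker into two classes. It is a minimal illustration under stated assumptions: the function name, the input layout (one pair per heterozygous parent) and the toy data are hypothetical, and the paper's procedure for choosing the optimal grouping is not reproduced here.

```python
import math


def collapsed_tdt(transmissions, group):
    """TDT on a marker collapsed into two allele classes.

    transmissions: (transmitted, untransmitted) allele pairs, one per
        heterozygous parent of an affected offspring.
    group: set of original alleles treated as the first collapsed allele;
        every other allele forms the second collapsed allele.
    Returns (b, c, chi2, p_value).
    """
    b = c = 0
    for transmitted, untransmitted in transmissions:
        t_in = transmitted in group
        u_in = untransmitted in group
        if t_in and not u_in:
            b += 1  # class-1 allele transmitted, class-2 not
        elif u_in and not t_in:
            c += 1  # class-2 allele transmitted, class-1 not
        # Parents whose two alleles fall in the same class become
        # homozygous after collapsing and are uninformative.
    if b + c == 0:
        raise ValueError("no informative transmissions after collapsing")
    chi2 = (b - c) ** 2 / (b + c)
    # Upper tail of a chi-square with 1 df equals erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return b, c, chi2, p_value


if __name__ == "__main__":
    toy = [(1, 2), (1, 3), (2, 1), (3, 2), (1, 2)]  # hypothetical data
    print(collapsed_tdt(toy, {1}))
```

Trying different `group` sets on the same transmissions shows how the choice of collapsing changes the number of informative parents and therefore the statistic.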
Pradhan, Sharat Kumar; Barik, Saumya Ranjan; Sahoo, Ambika; Mohapatra, Sudipti; Nayak, Deepak Kumar; Mahender, Anumalla; Meher, Jitandriya; Anandan, Annamalai; Pandit, Elssa
2016-01-01
Rice exhibits enormous genetic diversity, population structure and molecular marker-traits associated with abiotic stress tolerance to high temperature stress. A set of breeding lines and landraces representing 240 germplasm lines were studied. Based on spikelet fertility percent under high temperature, tolerant genotypes were broadly classified into four classes. Genetic diversity indicated a moderate level of genetic base of the population for the trait studied. Wright's F statistic estimates showed a deviation of Hardy-Weinberg expectation in the population. The analysis of molecular variance revealed 25 percent variation between population, 61 percent among individuals and 14 percent within individuals in the set. The STRUCTURE analysis categorized the entire population into three sub-populations and suggested that most of the landraces in each sub-population had a common primary ancestor with few admix individuals. The composition of materials in the panel showed the presence of many QTLs representing the entire genome for the expression of tolerance. The strongly associated marker RM547 tagged with spikelet fertility under stress and the markers like RM228, RM205, RM247, RM242, INDEL3 and RM314 indirectly controlling the high temperature stress tolerance were detected through both mixed linear model and general linear model TASSEL analysis. These markers can be deployed as a resource for marker-assisted breeding program of high temperature stress tolerance. PMID:27494320
An ABC estimate of pedigree error rate: application in dog, sheep and cattle breeds.
Leroy, G; Danchin-Burge, C; Palhiere, I; Baumung, R; Fritz, S; Mériaux, J C; Gautier, M
2012-06-01
On the basis of correlations between pairwise individual genealogical kinship coefficients and allele sharing distances computed from genotyping data, we propose an approximate Bayesian computation (ABC) approach to assess pedigree file reliability through gene-dropping simulations. We explore the features of the method using simulated data sets and show precision increases with the number of markers. An application is further made with five dog breeds, four sheep breeds and one cattle breed raised in France and displaying various characteristics and population sizes, using microsatellite or SNP markers. Depending on the breeds, pedigree error estimations range between 1% and 9% in dog breeds, 1% and 10% in sheep breeds and 4% in cattle breeds. © 2011 The Authors, Animal Genetics © 2011 Stichting International Foundation for Animal Genetics.
[A Method for Selecting Self-Adaptive Chromaticity of the Projected Markers].
Zhao, Shou-bo; Zhang, Fu-min; Qu, Xing-hua; Zheng, Shi-wei; Chen, Zhe
2015-04-01
The authors designed a self-adaptive projection system composed of a color camera, a projector and a PC. A digital micro-mirror device (DMD) was introduced into the optical path as the projector's spatial light modulator to modulate the illuminant spectrum produced by red, green and blue light-emitting diodes (LEDs). The active markers are a projected spot array, and their color visibility is affected by the screen, whose reflectance spectrum is unknown; the chromaticity of the markers is sometimes submerged when the screen has a similar spectrum. To enhance the color visibility of the active markers relative to the screen, a method for selecting self-adaptive chromaticity of the projected markers in 3D scanning metrology is described. A color camera with three channels limits the accuracy of device characterization. To convert between device-independent and device-dependent color spaces, a high-dimensional linear model of the reflectance spectrum was built. Prior training samples provide additional constraints that yield a high-dimensional linear model with more than three degrees of freedom. The spectral power distribution of the ambient light was also estimated. The markers' chromaticity in CIE color spaces was then selected by maximizing the Euclidean distance. The RGB setting values were estimated via the inverse transform. Finally, a typical experiment was carried out to show the performance of the proposed approach. A 24-patch Munsell Color Checker was used as the projection screen. The color difference in chromaticity coordinates between the active marker and the color patch was used to evaluate the color visibility of the active markers relative to the screen. The results of the self-adaptive projection system and a traditional diode-laser light projector are compared and discussed to highlight the advantage of the proposed method.
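The selection step in this abstract (choosing a marker chromaticity by maximizing Euclidean distance) might be sketched as follows. This is a minimal Python sketch that assumes the distance is measured from the estimated screen chromaticity in CIE xy coordinates and that a finite list of reproducible candidate chromaticities is available; both are assumptions of this example, and the spectral modelling and inverse RGB transform described in the abstract are not covered.

```python
import math


def pick_marker_chromaticity(screen_xy, candidates):
    """Return the candidate chromaticity farthest from the screen chromaticity.

    screen_xy: (x, y) chromaticity estimated for the screen (assumed given).
    candidates: iterable of (x, y) chromaticities the projector can produce
    (hypothetical list for illustration).
    """
    return max(candidates, key=lambda c: math.dist(c, screen_xy))


# Toy example: a near-white screen and three saturated candidate colors.
screen = (0.31, 0.33)
candidates = [(0.64, 0.33), (0.30, 0.60), (0.15, 0.06)]
best = pick_marker_chromaticity(screen, candidates)
```

In a perceptually uniform space the same function could be reused with different coordinates, since `math.dist` accepts points of any matching dimension.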
A generalized model for multi-marker analysis of cell cycle progression in synchrony experiments
Mayhew, Michael B.; Robinson, Joshua W.; Jung, Boyoun; Haase, Steven B.; Hartemink, Alexander J.
2011-01-01
Motivation: To advance understanding of eukaryotic cell division, it is important to observe the process precisely. To this end, researchers monitor changes in dividing cells as they traverse the cell cycle, with the presence or absence of morphological or genetic markers indicating a cell's position in a particular interval of the cell cycle. A wide variety of marker data is available, including information-rich cellular imaging data. However, few formal statistical methods have been developed to use these valuable data sources in estimating how a population of cells progresses through the cell cycle. Furthermore, existing methods are designed to handle only a single binary marker of cell cycle progression at a time. Consequently, they cannot facilitate comparison of experiments involving different sets of markers. Results: Here, we develop a new sampling model to accommodate an arbitrary number of different binary markers that characterize the progression of a population of dividing cells along a branching process. We engineer a strain of Saccharomyces cerevisiae with fluorescently labeled markers of cell cycle progression, and apply our new model to two image datasets we collected from the strain, as well as an independent dataset of different markers. We use our model to estimate the duration of post-cytokinetic attachment between a S.cerevisiae mother and daughter cell. The Java implementation is fast and extensible, and includes a graphical user interface. Our model provides a powerful and flexible cell cycle analysis tool, suitable to any type or combination of binary markers. Availability: The software is available from: http://www.cs.duke.edu/~amink/software/cloccs/. Contact: michael.mayhew@duke.edu; amink@cs.duke.edu PMID:21685084
Martínez-Camblor, Pablo; Pardo-Fernández, Juan C
2017-01-01
Diagnostic procedures are based on establishing certain conditions and then checking whether those conditions are satisfied by a given individual. When the diagnostic procedure is based on a continuous marker, this is equivalent to fixing a region, or classification subset, and then checking whether the observed value of the marker belongs to that region. The receiver operating characteristic (ROC) curve is a valuable and popular tool for studying and comparing the diagnostic ability of a given marker, and the area under the ROC curve is frequently used as an index of global discrimination ability. This paper revises and widens the scope of the ROC curve definition by placing the classification subsets on which the final decision is based in the spotlight of the analysis. We revise the definition of the ROC curve in terms of particular classes of classification subsets and then focus on a generalization of the ROC curve for situations in which both low and high values of the marker are associated with a higher probability of having the studied characteristic. Parametric and non-parametric estimators of this generalization are investigated. Monte Carlo studies and real data examples illustrate their practical performance.
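To make the idea of a two-sided classification subset concrete, the Python sketch below classifies a subject as positive when the marker falls at or below a lower cut-off or at or above an upper cut-off, and returns the empirical true-positive and false-positive rates for one such pair of cut-offs. The function name and the inclusive cut-off convention are assumptions of this example; it is not the estimator studied in the paper, only one point of the kind from which such a curve would be built.

```python
def two_sided_rates(diseased, healthy, lower, upper):
    """Empirical (TPR, FPR) for the classification subset (-inf, lower] U [upper, inf).

    diseased / healthy: marker values in the two groups.
    lower / upper: cut-offs with lower <= upper (hypothetical convention).
    """
    if lower > upper:
        raise ValueError("lower cut-off must not exceed upper cut-off")

    def positive(x):
        return x <= lower or x >= upper

    tpr = sum(positive(x) for x in diseased) / len(diseased)
    fpr = sum(positive(x) for x in healthy) / len(healthy)
    return tpr, fpr


# Toy data in which diseased subjects tend to have extreme marker values.
tpr, fpr = two_sided_rates([0, 1, 9, 10, 5], [4, 5, 6, 2, 8], lower=2, upper=8)
```

Sweeping over pairs of cut-offs and keeping, for each false-positive rate, the best attainable true-positive rate would trace out a curve over this class of subsets.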
Pardo, Luba M; Piras, Giovanna; Asproni, Rosanna; van der Gaag, Kristiaan J; Gabbas, Attilio; Ruiz-Linares, Andres; de Knijff, Peter; Monne, Maria; Rizzu, Patrizia; Heutink, Peter
2012-09-01
Sardinia has been used for genetic studies because of its historical isolation, genetic homogeneity and increased prevalence of certain rare diseases. Controversy remains concerning the genetic substructure and the extent of genetic homogeneity, which has implications for the design of genome-wide association studies (GWAS). We revisited this issue by examining the genetic make-up of a sample from North-East Sardinia using a dense set of autosomal, Y chromosome and mitochondrial markers to assess the potential of the sample for GWAS and fine mapping studies. We genotyped individuals for 500K single-nucleotide polymorphisms, Y chromosome markers and sequenced the mitochondrial hypervariable (HVI-HVII) regions. We identified major haplogroups and compared these with other populations. We estimated linkage disequilibrium (LD) and haplotype diversity across autosomal markers, and compared these with other populations. Our results show that within Sardinia there is no major population substructure and thus it can be considered a genetically homogenous population. We did not find substantial differences in the extent of LD in Sardinians compared with other populations. However, we showed that at least 9% of genomic regions in Sardinians differed in LD structure, which is helpful for identifying functional variants using fine mapping. We concluded that Sardinia is a powerful setting for genetic studies including GWAS and other mapping approaches.
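The abstract does not say which linkage disequilibrium measure was used. As a generic illustration only, the Python sketch below computes two common pairwise measures, D and r^2, from haplotype counts at two biallelic loci; the function name and the count-based input are assumptions of this example.

```python
def ld_d_and_r2(n_AB, n_Ab, n_aB, n_ab):
    """Return (D, r^2) for two biallelic loci from the four haplotype counts."""
    n = n_AB + n_Ab + n_aB + n_ab
    if n == 0:
        raise ValueError("no haplotypes supplied")
    p_AB = n_AB / n
    p_A = (n_AB + n_Ab) / n  # frequency of allele A at locus 1
    p_B = (n_AB + n_aB) / n  # frequency of allele B at locus 2
    d = p_AB - p_A * p_B
    denom = p_A * (1 - p_A) * p_B * (1 - p_B)
    if denom == 0:
        raise ValueError("r^2 is undefined when a locus is monomorphic")
    return d, d * d / denom


# Complete association versus independence.
d_full, r2_full = ld_d_and_r2(50, 0, 0, 50)
d_none, r2_none = ld_d_and_r2(25, 25, 25, 25)
```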
Links between causal effects and causal association for surrogacy evaluation in a Gaussian setting.
Conlon, Anna; Taylor, Jeremy; Li, Yun; Diaz-Ordaz, Karla; Elliott, Michael
2017-11-30
Two paradigms for the evaluation of surrogate markers in randomized clinical trials have been proposed: the causal effects paradigm and the causal association paradigm. Each of these paradigms relies on assumptions that must be made to proceed with estimation and to validate a candidate surrogate marker (S) for the true outcome of interest (T). We consider the setting in which S and T are Gaussian and are generated from structural models that include an unobserved confounder. Under the assumed structural models, we relate the quantities used to evaluate surrogacy within both the causal effects and causal association frameworks. We review some of the common assumptions made to aid in estimating these quantities and show that assumptions made within one framework can imply strong assumptions within the alternative framework. We demonstrate that there is a similarity, but not an exact correspondence, between the quantities used to evaluate surrogacy within each framework, and show that the conditions for identifiability of the surrogacy parameters are different from the conditions that lead to a correspondence of these quantities. Copyright © 2017 John Wiley & Sons, Ltd.
Conserved Nonexonic Elements: A Novel Class of Marker for Phylogenomics.
Edwards, Scott V; Cloutier, Alison; Baker, Allan J
2017-11-01
Noncoding markers have a particular appeal as tools for phylogenomic analysis because, at least in vertebrates, they appear less subject to strong variation in GC content among lineages. Thus far, ultraconserved elements (UCEs) and introns have been the most widely used noncoding markers. Here we analyze and study the evolutionary properties of a new type of noncoding marker, conserved nonexonic elements (CNEEs), which consists of noncoding elements that are estimated to evolve slower than the neutral rate across a set of species. Although they often include UCEs, CNEEs are distinct from UCEs because they are not ultraconserved, and, most importantly, the core region alone is analyzed, rather than both the core and its flanking regions. Using a data set of 16 birds plus an alligator outgroup, and ∼3600–∼3800 loci per marker type, we found that although CNEEs were less variable than bioinformatically derived UCEs or introns and in some cases exhibited a slower approach to branch resolution as determined by phylogenomic subsampling, the quality of CNEE alignments was superior to those of the other markers, with fewer gaps and missing species. Phylogenetic resolution using coalescent approaches was comparable among the three marker types, with most nodes being fully and congruently resolved. Comparison of phylogenetic results across the three marker types indicated that one branch, the sister group to the passerine + falcon clade, was resolved differently and with moderate (>70%) bootstrap support between CNEEs and UCEs or introns. Overall, CNEEs appear to be promising as phylogenomic markers, yielding phylogenetic resolution as high as for UCEs and introns but with fewer gaps, less ambiguity in alignments and with patterns of nucleotide substitution more consistent with the assumptions of commonly used methods of phylogenetic analysis. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society of Systematic Biologists. PMID:28637293
Kamalandua, Aubeline
2015-01-01
Age estimation from DNA methylation markers has seen an exponential growth of interest, not least from forensic scientists. The current published assays, however, can still be improved by lowering the number of markers in the assay and by providing more accurate models to predict chronological age. From the published literature we selected 4 age-associated genes (ASPA, PDE4C, ELOVL2, and EDARADD) and determined CpG methylation levels from 206 blood samples of both deceased and living individuals (age range: 0–91 years). These data were subsequently used to compare prediction accuracy with both linear and non-linear regression models. A quadratic regression model in which the methylation levels of ELOVL2 were squared showed the highest accuracy, with a Mean Absolute Deviation (MAD) between chronological age and predicted age of 3.75 years and an adjusted R2 of 0.95. No difference in accuracy was observed between samples obtained from living and deceased individuals or between the 2 genders. In addition, 29 teeth from different individuals (age range: 19–70 years) were analyzed using the same set of markers, resulting in a MAD of 4.86 years and an adjusted R2 of 0.74. Cross validation of the results obtained from blood samples demonstrated the robustness and reproducibility of the assay. In conclusion, the set of 4 CpG DNA methylation markers is capable of producing highly accurate age predictions for blood samples from deceased and living individuals. PMID:26280308
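The two ingredients named in this abstract, a regression on a squared methylation level and the MAD accuracy measure, can be illustrated with a deliberately simplified Python sketch. It assumes a single predictor (one squared methylation fraction) fitted by ordinary least squares, whereas the published model uses four genes; the function names and the synthetic numbers are hypothetical and are not study data.

```python
def fit_squared_predictor(methylation, age):
    """Least-squares fit of age = intercept + slope * methylation**2."""
    z = [m * m for m in methylation]
    n = len(z)
    mean_z = sum(z) / n
    mean_y = sum(age) / n
    s_zz = sum((zi - mean_z) ** 2 for zi in z)
    s_zy = sum((zi - mean_z) * (yi - mean_y) for zi, yi in zip(z, age))
    slope = s_zy / s_zz
    intercept = mean_y - slope * mean_z
    return intercept, slope


def mean_absolute_deviation(actual, predicted):
    """Mean absolute difference between chronological and predicted age."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


# Synthetic, noise-free illustration (not study data).
methylation = [0.2, 0.4, 0.6, 0.8]
age = [5 + 100 * m * m for m in methylation]
intercept, slope = fit_squared_predictor(methylation, age)
predicted = [intercept + slope * m * m for m in methylation]
mad = mean_absolute_deviation(age, predicted)
```

With real data the MAD would normally be reported on held-out samples, in line with the cross validation mentioned in the abstract.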
Gene–Environment Correlation: Difficulties and a Natural Experiment–Based Strategy
Li, Jiang; Liu, Hexuan; Guo, Guang
2013-01-01
Objectives. We explored how gene–environment correlations can result in endogenous models, how natural experiments can protect against this threat, and if unbiased estimates from natural experiments are generalizable to other contexts. Methods. We compared a natural experiment, the College Roommate Study, which measured genes and behaviors of college students and their randomly assigned roommates in a southern public university, with observational data from the National Longitudinal Study of Adolescent Health in 2008. We predicted exposure to exercising peers using genetic markers and estimated environmental effects on alcohol consumption. A mixed-linear model estimated an alcohol consumption variance that was attributable to genetic markers and across peer environments. Results. Peer exercise environment was associated with respondent genotype in observational data, but not in the natural experiment. The effects of peer drinking and presence of a general gene–environment interaction were similar between data sets. Conclusions. Natural experiments, like random roommate assignment, could protect against potential bias introduced by gene–environment correlations. When combined with representative observational data, unbiased and generalizable causal effects could be estimated. PMID:23927502
Torres, Sandra Regina Rachadel; Uehara, Clineu Julien Seki; Sutter-Latorre, Ana Frederica; de Almeida, Bibiana Sgorla; Sauerbier, Tania Streck; Muniz, Yara Costa Netto; Marrero, Andrea Rita; de Souza, Ilíada Rainha
2014-08-01
The application of DNA technology in forensic investigations has grown rapidly in the last 25 years, accompanied by an exponential increase in short tandem repeat (STR) data, usually presented as allele frequencies, that may later be used as databases for forensic and population genetics purposes. Other classes of molecular markers, such as single nucleotide polymorphisms and insertions/deletions (InDels), have been presented as further options for genetic marker sets. These markers can be used in paternity cases, when mutations in STR polymorphisms are present, as well as in the analysis of highly degraded DNA. In the present study, the allele frequencies and heterozygosity (H) of a set of 30 InDel markers were determined, and the forensic efficacy was evaluated through estimation of discrimination power (DP), match probability, typical paternity index and power of paternity exclusion in 108 unrelated volunteers from the State of Santa Catarina (South Brazil). The observed H per locus ranged between 0.370 and 0.574 (mean = 0.479). HLD128 was the locus with the highest DP (DP = 0.656). DP for all markers combined was greater than 99.9999999999646%, which provides satisfactory levels of information for forensic demands. Genetic comparisons (exact tests of population differentiation and pairwise genetic distances) revealed that the population of Santa Catarina State differs from the Korean and USA Afro-American populations but is similar to the Portuguese, German, Polish, Spanish and Basque populations.
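The forensic parameters mentioned in this abstract have simple textbook definitions: the match probability of a locus is the sum of squared genotype frequencies, the discrimination power is one minus that value, and the combined discrimination power over independent loci is one minus the product of the per-locus match probabilities. The Python sketch below applies these definitions to observed genotype lists; the function names and the toy genotype labels are assumptions of this example, and independence between loci is assumed.

```python
from collections import Counter
from math import prod


def match_probability(genotypes):
    """Sum of squared observed genotype frequencies at one locus."""
    counts = Counter(genotypes)
    n = len(genotypes)
    return sum((k / n) ** 2 for k in counts.values())


def discrimination_power(genotypes):
    """Probability that two random individuals differ at this locus."""
    return 1 - match_probability(genotypes)


def combined_discrimination_power(loci):
    """Combined DP over loci, assuming the loci are independent."""
    return 1 - prod(match_probability(g) for g in loci)


# Toy biallelic InDel locus: I = insertion allele, D = deletion allele.
locus = ["II", "ID", "ID", "DD"]
dp_single = discrimination_power(locus)
dp_two_loci = combined_discrimination_power([locus, locus])
```

Because per-locus match probabilities multiply, a panel of many moderately informative biallelic markers can reach a very high combined DP, which is the pattern the abstract reports.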
A Coalescent-Based Estimator of Admixture From DNA Sequences
Wang, Jinliang
2006-01-01
A variety of estimators have been developed to use genetic marker information in inferring the admixture proportions (parental contributions) of a hybrid population. The majority of these estimators used allele frequency data, ignored molecular information that is available in markers such as microsatellites and DNA sequences, and assumed that mutations are absent since the admixture event. As a result, these estimators may fail to deliver an estimate or give rather poor estimates when admixture is ancient and thus mutations are not negligible. A previous molecular estimator based its inference of admixture proportions on the average coalescent times between pairs of genes taken from within and between populations. In this article I propose an estimator that considers the entire genealogy of all of the sampled genes and infers admixture proportions from the numbers of segregating sites in DNA sequence samples. By considering the genealogy of all sequences rather than pairs of sequences, this new estimator also allows the joint estimation of other interesting parameters in the admixture model, such as admixture time, divergence time, population size, and mutation rate. Comparative analyses of simulated data indicate that the new coalescent estimator generally yields better estimates of admixture proportions than the previous molecular estimator, especially when the parental populations are not highly differentiated. It also gives reasonably accurate estimates of other admixture parameters. A human mtDNA sequence data set was analyzed to demonstrate the method, and the analysis results are discussed and compared with those from previous studies. PMID:16624918
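The summary statistic this estimator relies on, the number of segregating sites, is straightforward to compute from an alignment. The Python sketch below counts alignment columns in which more than one state is observed; it assumes equal-length, already aligned sequences and treats every character (including gaps or ambiguity codes, if present) as a distinct state, which is a simplification. The coalescent-based inference itself is not reproduced.

```python
def segregating_sites(sequences):
    """Count alignment columns at which the sampled sequences differ."""
    if not sequences:
        raise ValueError("no sequences supplied")
    if len({len(s) for s in sequences}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    # zip(*sequences) iterates over alignment columns.
    return sum(1 for column in zip(*sequences) if len(set(column)) > 1)


# Toy alignment: columns 1 and 4 vary.
s_total = segregating_sites(["ACGT", "ACGA", "TCGA"])
```

The same function can be applied separately to samples from each parental population and the hybrid population, or to pooled samples, to obtain within- and between-population counts.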
Can hip and knee kinematics be improved by eliminating thigh markers?
Schulz, Brian W.; Kimmel, Wendy L.
2017-01-01
Background: Marker sets developed for gait analysis are often applied to more dynamic tasks with little or no validation, despite known complications of soft tissue artifact. Methods: This study presents a comparison of hip and knee kinematics as calculated by five concurrently-worn tracking marker sets during eight different tasks. The first three marker sets were based on Helen Hayes but used 1) proximal thigh wands, 2) distal thigh wands, and 3) patellar markers instead of thigh wands. The remaining two marker sets used rigid clusters on the 4) thighs and shanks and 5) only shanks. Pelvis and foot segments were shared by all marker sets. The first three tasks were maximal femoral rotations using different knee and hip positions to quantify the ability of each marker set to capture this motion. The remaining five tasks were walking, walking a 1m radius circle, running, jumping, and lunging. Findings: In general, few and small differences in knee and hip flexion-extension were observed between marker sets, while many and large differences in adduction-abduction and external-internal rotations were observed. The shank-only tracking marker set was capable of detecting the greatest hip external-internal rotation, yet only did so during dynamic tasks where greater hip axial motions would be expected. All data are available as supplementary material. Interpretation: Marker set selection is critical to non-sagittal hip and knee motions. The shank-only tracking marker set presented here is a viable alternative that may improve knee and hip kinematics by eliminating errors from thigh soft tissue artifact. PMID:20493599
Ritschel, Patricia Silva; Lins, Tulio Cesar de Lima; Tristan, Rodrigo Lourenço; Buso, Gláucia Salles Cortopassi; Buso, José Amauri; Ferreira, Márcio Elias
2004-01-01
Background: Despite the great advances in genomic technology observed in several crop species, the availability of molecular tools such as microsatellite markers has been limited in melon (Cucumis melo L.) and cucurbit species. The development of microsatellite markers will have a major impact on genetic analysis and breeding of melon, especially on the generation of marker saturated genetic maps and implementation of marker assisted breeding programs. Genomic microsatellite enriched libraries can be an efficient alternative for marker development in such species. Results: Seven hundred clones containing microsatellite sequences from a Tsp-AG/TC microsatellite enriched library were identified and one hundred and forty-four primer pairs designed and synthesized. When 67 microsatellite markers were tested on a panel of melon and other cucurbit accessions, 65 revealed DNA polymorphisms among the melon accessions. For some cucurbit species, such as Cucumis sativus, up to 50% of the melon microsatellite markers could be readily used for DNA polymorphism assessment, representing a significant reduction of marker development costs. A random sample of 25 microsatellite markers was extracted from the new microsatellite marker set and characterized on 40 accessions of melon, generating an allelic frequency database for the species. The average expected heterozygosity was 0.52, varying from 0.45 to 0.70, indicating that a small set of selected markers should be sufficient to solve questions regarding genotype identity and variety protection. Genetic distances based on microsatellite polymorphism were congruent with data obtained from RAPD marker analysis. Mapping analysis was initiated with 55 newly developed markers and most primers showed segregation according to Mendelian expectations. Linkage analysis detected linkage between 56% of the markers, distributed in nine linkage groups. Conclusions: Genomic library microsatellite enrichment is an efficient procedure for marker development in melon. One hundred and forty-four new markers were developed from the Tsp-AG/TC genomic library. This is the first reported attempt at successfully using an enriched library for microsatellite marker development in the species. A sample of the microsatellite markers tested proved efficient for genetic analysis of melon, including genetic distance estimates and identity tests. Linkage analysis indicated that the markers developed are dispersed throughout the genome and should be very useful for genetic analysis of melon. PMID:15149552
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa.
Shahin, Arwa; van Kaauwen, Martijn; Esselink, Danny; Bargsten, Joachim W; van Tuyl, Jaap M; Visser, Richard G F; Arens, Paul
2012-11-20
Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. 
Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies.
Measurement of the translation and rotation of a sphere in fluid flow
NASA Astrophysics Data System (ADS)
Barros, Diogo; Hiltbrand, Ben; Longmire, Ellen K.
2018-06-01
The problem of determining the translation and rotation of a spherical particle moving in fluid flow is considered. Lagrangian tracking of markers printed over the surface of a sphere is employed to compute the center motion and the angular velocity of the solid body. The method initially calculates the sphere center from the 3D coordinates of the reconstructed markers, then finds the optimal rotation matrix that aligns a set of markers tracked at sequential time steps. The parameters involved in the experimental implementation of this procedure are discussed, and the associated uncertainty is estimated from numerical analysis. Finally, the proposed methodology is applied to characterize the motion of a large spherical particle released in a turbulent boundary layer developing in a water channel.
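The abstract does not state how the angular velocity is obtained from the optimal rotation matrix. One standard route, shown here purely as an assumption, is to convert the rotation between two time steps into axis-angle form and divide the angle by the time step. The Python sketch below does this for a 3x3 rotation matrix given as nested lists; the function name is hypothetical, and rotations close to 180 degrees between frames are not handled because the axis formula used here degenerates there.

```python
import math


def angular_velocity(rotation, dt):
    """Angular velocity vector (rad/s) from a 3x3 rotation matrix over time dt.

    Uses theta = acos((trace - 1) / 2) and the antisymmetric part of the
    matrix for the axis. Not valid for rotation angles close to pi.
    """
    trace = rotation[0][0] + rotation[1][1] + rotation[2][2]
    cos_theta = max(-1.0, min(1.0, (trace - 1.0) / 2.0))  # guard rounding
    theta = math.acos(cos_theta)
    if theta < 1e-12:
        return (0.0, 0.0, 0.0)  # no measurable rotation
    s = 2.0 * math.sin(theta)
    axis = (
        (rotation[2][1] - rotation[1][2]) / s,
        (rotation[0][2] - rotation[2][0]) / s,
        (rotation[1][0] - rotation[0][1]) / s,
    )
    return tuple(a * theta / dt for a in axis)


# Toy check: 0.1 rad about the z axis in 0.01 s gives about 10 rad/s about z.
c, sn = math.cos(0.1), math.sin(0.1)
omega = angular_velocity([[c, -sn, 0.0], [sn, c, 0.0], [0.0, 0.0, 1.0]], dt=0.01)
```

Choosing the frame interval so that the rotation per step stays small keeps the computation away from the degenerate case noted above.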
Carlsson, Sigrid V; Peltola, Mari T; Sjoberg, Daniel; Schröder, Fritz H; Hugosson, Jonas; Pettersson, Kim; Scardino, Peter T; Vickers, Andrew J; Lilja, Hans; Roobol, Monique J
2013-09-01
To explore whether a panel of kallikrein markers in blood: total, free and intact prostate-specific antigen (PSA) and kallikrein-related peptidase 2, could be used as a non-invasive alternative for predicting prostate cancer on biopsy in a screening setting. The study cohort comprised previously unscreened men who underwent sextant biopsy owing to elevated PSA (≥3 ng/mL) in two different centres of the European Randomized Study of Screening for Prostate Cancer, Rotterdam (n = 2914) and Göteborg (n = 740). A statistical model, based on kallikrein markers, was compared with one based on established clinical factors for the prediction of biopsy outcome. The clinical tests were found to be no better than blood markers, with an area under the curve in favour of the blood measurements of 0.766 vs. 0.763 in Rotterdam and 0.809 vs. 0.774 in Göteborg. Adding digital rectal examination (DRE) or DRE plus transrectal ultrasonography (TRUS) volume to the markers improved discrimination, although the increases were small. Results were similar for predicting high-grade cancer. There was a strong correlation between the blood measurements and TRUS-estimated prostate volume (Spearman's correlation 0.60 in Rotterdam and 0.57 in Göteborg). In previously unscreened men, each with indication for biopsy, a statistical model based on kallikrein levels was similar to a clinical model in predicting prostate cancer in a screening setting, outside the day-to-day clinical practice. Whether a clinical approach can be replaced by laboratory analyses or used in combination with decision models (nomograms) is a clinical judgment that may vary from clinician to clinician depending on how they weigh the different advantages and disadvantages (harms, costs, time, invasiveness) of both approaches. © 2013 BJU International.
Attard, Catherine R M; Beheregaray, Luciano B; Möller, Luciana M
2018-05-01
There has been remarkably little attention to using the high resolution provided by genotyping-by-sequencing (i.e., RADseq and similar methods) for assessing relatedness in wildlife populations. A major hurdle is the genotyping error, especially allelic dropout, often found in this type of data that could lead to downward-biased, yet precise, estimates of relatedness. Here, we assess the applicability of genotyping-by-sequencing for relatedness inferences given its relatively high genotyping error rate. Individuals of known relatedness were simulated under genotyping error, allelic dropout and missing data scenarios based on an empirical ddRAD data set, and their true relatedness was compared to that estimated by seven relatedness estimators. We found that an estimator chosen through such analyses can circumvent the influence of genotyping error, with the estimator of Ritland (Genetics Research, 67, 175) shown to be unaffected by allelic dropout and to be the most accurate when there is genotyping error. We also found that the choice of estimator should not rely solely on the strength of correlation between estimated and true relatedness as a strong correlation does not necessarily mean estimates are close to true relatedness. We also demonstrated how even a large SNP data set with genotyping error (allelic dropout or otherwise) or missing data still performs better than a perfectly genotyped microsatellite data set of tens of markers. The simulation-based approach used here can be easily implemented by others on their own genotyping-by-sequencing data sets to confirm the most appropriate and powerful estimator for their data. © 2017 John Wiley & Sons Ltd.
Bertholet, Jenny; Worm, Esben; Høyer, Morten; Poulsen, Per
2017-06-01
Accurate patient positioning is crucial in stereotactic body radiation therapy (SBRT) due to a high dose regimen. Cone-beam computed tomography (CBCT) is often used for patient positioning based on radio-opaque markers. We compared six CBCT-based set-up strategies with or without rotational correction. Twenty-nine patients with three implanted markers received 3-6 fraction liver SBRT. The markers were delineated on the mid-ventilation phase of a 4D-planning-CT. One pretreatment CBCT was acquired per fraction. Set-up strategy 1 used only translational correction based on manual marker match between the CBCT and planning CT. Set-up strategy 2 used automatic 6 degrees-of-freedom registration of the vertebrae closest to the target. The 3D marker trajectories were also extracted from the projections and the mean position of each marker was calculated and used for set-up strategies 3-6. Translational correction only was used for strategy 3. Translational and rotational corrections were used for strategies 4-6 with the rotation being either vertebrae based (strategy 4), or marker based and constrained to ±3° (strategy 5) or unconstrained (strategy 6). The resulting set-up error was calculated as the 3D root-mean-square set-up error of the three markers. The set-up error of the spinal cord was calculated for all strategies. The bony anatomy set-up (2) had the largest set-up error (5.8 mm). The marker-based set-up with unconstrained rotations (6) had the smallest set-up error (0.8 mm) but the largest spinal cord set-up error (12.1 mm). The marker-based set-up with translational correction only (3) or with bony anatomy rotational correction (4) had equivalent set-up error (1.3 mm) but rotational correction reduced the spinal cord set-up error from 4.1 mm to 3.5 mm. Marker-based set-up was substantially better than bony-anatomy set-up. Rotational correction may improve the set-up, but further investigations are required to determine the optimal correction strategy.
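The set-up error metric named in this abstract (3D root-mean-square error over the three markers) can be sketched in Python as follows. This is a minimal illustration, assuming the residual displacement of each marker after correction is available as an (x, y, z) vector in mm; the function name and the example numbers are hypothetical and are not taken from the study.

```python
import math

def rms_setup_error_3d(residuals_mm):
    """Return the 3D root-mean-square error over a list of markers.

    residuals_mm: list of (dx, dy, dz) residual displacements in mm,
    one tuple per marker (hypothetical input format).
    """
    if not residuals_mm:
        raise ValueError("at least one marker residual is required")
    # Squared 3D distance of each marker from its planned position.
    squared = [dx * dx + dy * dy + dz * dz for dx, dy, dz in residuals_mm]
    # Mean over markers, then square root.
    return math.sqrt(sum(squared) / len(squared))

# Illustrative residuals for three markers (made-up values).
example = [(3.0, 4.0, 0.0), (0.0, 0.0, 0.0), (0.0, 0.0, 5.0)]
print(round(rms_setup_error_3d(example), 2))
```

Because the squared distances are averaged before the square root is taken, a single badly placed marker weighs more heavily than it would in a plain mean of distances.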
Use of direct and iterative solvers for estimation of SNP effects in genome-wide selection
2010-01-01
The aim of this study was to compare iterative and direct solvers for estimation of marker effects in genomic selection. One iterative and two direct methods were used: Gauss-Seidel with Residual Update, Cholesky Decomposition and Gentleman-Givens rotations. For resembling different scenarios with respect to number of markers and of genotyped animals, a simulated data set divided into 25 subsets was used. Number of markers ranged from 1,200 to 5,925 and number of animals ranged from 1,200 to 5,865. Methods were also applied to real data comprising 3081 individuals genotyped for 45181 SNPs. Results from simulated data showed that the iterative solver was substantially faster than direct methods for larger numbers of markers. Use of a direct solver may allow for computing (co)variances of SNP effects. When applied to real data, performance of the iterative method varied substantially, depending on the level of ill-conditioning of the coefficient matrix. From results with real data, Gentleman-Givens rotations would be the method of choice in this particular application as it provided an exact solution within a fairly reasonable time frame (less than two hours). It would indeed be the preferred method whenever computer resources allow its use. PMID:21637627
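To make the iterative solver concrete, here is a minimal Python sketch of Gauss-Seidel with residual update for a ridge-type marker-effect model. It assumes the system being solved is (X'X + lambda*I) g = X'y with a single shrinkage parameter `lam`; that model form, the function name `gsru` and the toy data are assumptions for illustration and are not taken from the study.

```python
def gsru(X, y, lam, n_iter=200, tol=1e-10):
    """Gauss-Seidel with residual update (illustrative sketch).

    X: list of rows (animals) of marker covariates, y: phenotypes,
    lam: assumed ridge shrinkage parameter added to each diagonal.
    """
    n, p = len(X), len(X[0])
    g = [0.0] * p                      # marker effects, start at zero
    e = list(y)                        # residuals y - Xg for g = 0
    xtx = [sum(X[i][j] ** 2 for i in range(n)) for j in range(p)]
    for _ in range(n_iter):
        max_change = 0.0
        for j in range(p):
            # Right-hand side for marker j, built from current residuals.
            rhs = sum(X[i][j] * e[i] for i in range(n)) + xtx[j] * g[j]
            new = rhs / (xtx[j] + lam)
            delta = new - g[j]
            # Update residuals immediately so later markers see the change.
            for i in range(n):
                e[i] -= X[i][j] * delta
            g[j] = new
            max_change = max(max_change, abs(delta))
        if max_change < tol:           # stop when effects no longer move
            break
    return g

# Toy data: 3 animals, 2 markers (made-up values).
effects = gsru([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]], [1.0, 2.0, 3.0], lam=1.0)
print([round(v, 3) for v in effects])
```

The residual update avoids forming X'X explicitly, which is why this kind of solver scales well with the number of markers; a direct solver would instead factorize the full coefficient matrix.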
HYPOTHESIS SETTING AND ORDER STATISTIC FOR ROBUST GENOMIC META-ANALYSIS.
Song, Chi; Tseng, George C
2014-01-01
Meta-analysis techniques have been widely developed and applied in genomic applications, especially for combining multiple transcriptomic studies. In this paper, we propose an order statistic of p-values (rth ordered p-value, rOP) across combined studies as the test statistic. We illustrate different hypothesis settings that detect gene markers differentially expressed (DE) "in all studies", "in the majority of studies", or "in one or more studies", and specify rOP as a suitable method for detecting DE genes "in the majority of studies". We develop methods to estimate the parameter r in rOP for real applications. Statistical properties such as its asymptotic behavior and a one-sided testing correction for detecting markers of concordant expression changes are explored. Power calculation and simulation show better performance of rOP compared to classical Fisher's method, Stouffer's method, minimum p-value method and maximum p-value method under the focused hypothesis setting. Theoretically, rOP is found connected to the naïve vote counting method and can be viewed as a generalized form of vote counting with better statistical properties. The method is applied to three microarray meta-analysis examples including major depressive disorder, brain cancer and diabetes. The results demonstrate rOP as a more generalizable, robust and sensitive statistical framework to detect disease-related markers.
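The rOP statistic lends itself to a short Python sketch. It assumes K independent studies whose p-values are uniform on [0, 1] under the null, in which case the r-th smallest p-value follows a Beta(r, K - r + 1) distribution; the function name `rop_pvalue` and the example inputs are hypothetical, and the one-sided correction and the estimation of r mentioned in the abstract are not covered.

```python
from math import comb

def rop_pvalue(pvalues, r):
    """Combined p-value for the r-th ordered p-value across K studies.

    Uses the identity that the Beta(r, K - r + 1) CDF at x equals the
    probability that at least r of K independent uniforms fall below x.
    """
    k = len(pvalues)
    if not 1 <= r <= k:
        raise ValueError("r must be between 1 and the number of studies")
    x = sorted(pvalues)[r - 1]         # r-th smallest p-value
    return sum(comb(k, j) * x ** j * (1 - x) ** (k - j)
               for j in range(r, k + 1))

# Made-up p-values for one gene in five studies, tested at r = 3.
print(rop_pvalue([0.001, 0.02, 0.03, 0.40, 0.75], r=3))
```

Setting r = 1 gives the minimum p-value test and r = K gives the maximum p-value test, so intermediate values of r correspond to the "majority of studies" setting.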
Hayford, Kyla T; Shomik, Mohammed S; Al-Emran, Hassan M; Moss, William J; Bishai, David; Levine, Orin S
2013-12-20
Recent outbreaks of measles and polio in low-income countries illustrate that conventional methods for estimating vaccination coverage do not adequately identify susceptible children. Immune markers of protection against vaccine-preventable diseases in oral fluid (OF) or blood may generate more accurate measures of effective vaccination history, but questions remain about whether antibody surveys are feasible and informative tools for monitoring immunization program performance compared to conventional vaccination coverage indicators. This study compares six indicators of measles vaccination status, including immune markers in oral fluid and blood, from children in rural Bangladesh and evaluates the implications of using each indicator to estimate measles vaccination coverage. A cross-sectional population-based study of children ages 12-16 months in Mirzapur, Bangladesh, ascertained measles vaccination (MCV1) history from conventional indicators: maternal report, vaccination card records, 'card+history' and EPI clinic records. Oral fluid from all participants (n=1226) and blood from a subset (n=342) were tested for measles IgG antibodies as indicators of MCV1 history and compared to conventional MCV1 coverage indicators. Maternal report yielded the highest MCV1 coverage estimates (90.8%), followed by EPI records (88.6%), and card+history (84.2%). Seroprotection against measles by OF (57.3%) was significantly lower than other indicators, even after adjusting for incomplete seroconversion and assay performance (71.5%). Among children with blood results, 88.6% were seroprotected, which was significantly higher than coverage by card+history and OF serostatus but consistent with coverage by maternal report and EPI records. Children with vaccination cards or EPI records were more likely to have a history of receiving MCV1 than those without cards or records. 
Despite similar MCV1 coverage estimates across most indicators, within-child agreement was poor for all indicators. Measles IgG antibodies in OF were not a suitable immune marker for monitoring measles vaccination coverage in this setting. Because agreement between conventional MCV1 indicators was mediocre, immune marker surveillance with blood samples could be used to validate conventional MCV1 indicators and generate adjusted results that can be compared across indicators.
Spinks, Phillip Q; Thomson, Robert C; Zhang, YaPing; Che, Jing; Wu, Yonghua; Shaffer, H Bradley
2012-06-01
Turtles are currently the most endangered major clade of vertebrates on earth, and Asian box turtles (Cuora) are in catastrophic decline. Effective management of this diverse turtle clade has been hampered by human-mediated, and perhaps natural hybridization, resulting in discordance between mitochondrial and nuclear markers and confusion regarding species boundaries and phylogenetic relationships among hypothesized species of Cuora. Here, we present analyses of mitochondrial and nuclear DNA data for all 12 currently hypothesized species to resolve both species boundaries and phylogenetic relationships. Our 15-gene, 40-individual nuclear data set was frequently in conflict with our mitochondrial data set; based on its general concordance with published morphological analyses and the strength of 15 independent estimates of evolutionary history, we interpret the nuclear data as representing the most reliable estimate of species boundaries and phylogeny of Cuora. Our results strongly reiterate the necessity of using multiple nuclear markers for phylogeny and species delimitation in these animals, including any form of DNA "barcoding", and point to Cuora as an important case study where reliance on mitochondrial DNA can lead to incorrect species identification. Copyright © 2012 Elsevier Inc. All rights reserved.
You, Jianling; Liu, Wensheng; Zhao, Yao; Zhu, Yongqing; Zhang, Wenju; Wang, Yuguo; Lu, Fan; Song, Zhiping
2013-03-01
Microsatellite loci are described for Rhodiola, a medicinal herb genus widely used in traditional Chinese medicine. A total of 17 polymorphic microsatellite primer pairs were developed using the combined biotin capture method. The number of alleles per locus ranged from one to 12 across 192 individuals from R. bupleuroides, R. crenulata, R. fastigiata, and R. sacra, and the mean observed and expected heterozygosities ranged from 0.177 to 0.412 and from 0.363 to 0.578, respectively. The results demonstrate the potential use of this new set of microsatellite markers for genotyping individuals and estimating genetic diversity in Rhodiola.
How Different Marker Sets Affect Joint Angles in Inverse Kinematics Framework.
Mantovani, Giulia; Lamontagne, Mario
2017-04-01
The choice of marker set is a source of variability in motion analysis. Studies exist which assess the performance of marker sets when direct kinematics is used, but these results cannot be extrapolated to the inverse kinematic framework. Therefore, the purpose of this study was to examine the sensitivity of kinematic outcomes to inter-marker set variability in an inverse kinematic framework. The compared marker sets were plug-in-gait, University of Ottawa motion analysis model and a three-marker-cluster marker set. Walking trials of 12 participants were processed in OpenSim. The coefficient of multiple correlations was very good for sagittal (>0.99) and frontal (>0.92) plane angles, but worsened for the transverse plane (0.72). Absolute reliability indices are also provided for comparison among studies: minimum detectable change values ranged from 3 deg for the hip sagittal range of motion to 16.6 deg for the hip transverse range of motion. Ranges of motion of hip and knee abduction/adduction angles and hip and ankle rotations were significantly different among the three marker configurations (P < 0.001), with plug-in-gait producing larger ranges of motion. Although the same model was used for all the marker sets, the resulting minimum detectable changes were high and clinically relevant, which calls for caution when comparing studies that use different marker configurations, especially if they differ in the joint-defining markers.
Godinho, Raquel; López-Bao, José Vicente; Castro, Diana; Llaneza, Luís; Lopes, Susana; Silva, Pedro; Ferrand, Nuno
2015-03-01
Wolves and dogs provide a paradigmatic example of the ecological and conservation implications of hybridization events between wild and domesticated forms. However, our understanding of such implications has been traditionally hampered by both high genetic similarity and the difficulties in obtaining tissue samples (TS), which limit our ability to assess ongoing hybridization events. To assess the occurrence and extension of hybridization in a pack of wolf-dog hybrids in northwestern Iberia, we compared the power of 52 nuclear markers implemented on TS with a subset of 13 ancestry informative markers (AIMs) typed in noninvasive samples (NIS). We demonstrate that the 13 AIMs are as accurate as the 52 markers that were chosen without regard to the power to differentiate between wolves and dogs, also having the advantage of being rapidly screened on NIS. The efficiency of AIMs significantly outperformed ten random sets of similar size and an additional commercial set of 18 markers. Bayesian clustering analysis implemented on AIMs and NIS identified nine hybrids, two wolves and two dogs. Four hybrids were unambiguously assigned to F1xWolf backcrosses. Our approach (AIMs + NIS) overcomes previous difficulties related to sample availability and informative power of markers, allowing a quick identification of wolf-dog hybrids in the first phases of hybridization episodes. This provides managers with a reliable tool to evaluate hybridization and estimate the success of their actions. This approach may be easily adapted for other pairs of wild/domesticated species, thus improving our understanding of the introgression of domestication genes into natural populations. © 2014 John Wiley & Sons Ltd.
Toker, Lilah; Rocco, Brad; Sibille, Etienne
2017-01-01
Establishing the molecular diversity of cell types is crucial for the study of the nervous system. We compiled a cross-laboratory database of mouse brain cell type-specific transcriptomes from 36 major cell types from across the mammalian brain using rigorously curated published data from pooled cell type microarray and single-cell RNA-sequencing (RNA-seq) studies. We used these data to identify cell type-specific marker genes, discovering a substantial number of novel markers, many of which we validated using computational and experimental approaches. We further demonstrate that summarized expression of marker gene sets (MGSs) in bulk tissue data can be used to estimate the relative cell type abundance across samples. To facilitate use of this expanding resource, we provide a user-friendly web interface at www.neuroexpresso.org. PMID:29204516
Short-term velocity measurements at Columbia Glacier, Alaska; August-September 1984
Vaughn, B.H.; Raymond, C.F.; Rasmussen, Lowell A.; Miller, D.S.; Michaelson, C.A.; Meier, M.F.; Krimmel, R.M.; Fountain, A.G.; Dunlap, W.W.; Brown, C.S.
1985-01-01
Ice velocity data are presented for the lower reach of Columbia Glacier, Alaska. The data span a 29 day period and contain 1,072 angle sightings from two survey stations to 22 markers placed on the ice surface, and 1,621 laser measurements of the distance to one of those markers (number 11) from another station. These short-interval observations were made to investigate the dynamics of the glacier and to provide input to models for estimation of future retreat and iceberg discharge. The mean ice velocity (at marker number 11) was approximately 9 m/day and ranged from 8 to < 15 m/day. The data set includes a well defined 2-day, 50% velocity increase and a clear pattern of velocity fluctuations of about 5% with approximately diurnal and semidiurnal periods. (Author's abstract)
Optical Enhancement of Exoskeleton-Based Estimation of Glenohumeral Angles
Cortés, Camilo; Unzueta, Luis; de los Reyes-Guzmán, Ana; Ruiz, Oscar E.; Flórez, Julián
2016-01-01
In Robot-Assisted Rehabilitation (RAR) the accurate estimation of the patient limb joint angles is critical for assessing therapy efficacy. In RAR, the use of classic motion capture systems (MOCAPs) (e.g., optical and electromagnetic) to estimate the Glenohumeral (GH) joint angles is hindered by the exoskeleton body, which causes occlusions and magnetic disturbances. Moreover, the exoskeleton posture does not accurately reflect limb posture, as their kinematic models differ. To address these limitations in posture estimation, we propose installing the cameras of an optical marker-based MOCAP in the rehabilitation exoskeleton. Then, the GH joint angles are estimated by combining the estimated marker poses and exoskeleton Forward Kinematics. Such a hybrid system prevents problems related to marker occlusions, reduced camera detection volume, and imprecise joint angle estimation due to the kinematic mismatch of the patient and exoskeleton models. This paper presents the formulation, simulation, and accuracy quantification of the proposed method with simulated human movements. In addition, a sensitivity analysis of the method accuracy to marker position estimation errors, due to system calibration errors and marker drifts, has been carried out. The results show that, even with significant errors in the marker position estimation, method accuracy is adequate for RAR. PMID:27403044
Dependence of paracentric inversion rate on tract length.
York, Thomas L; Durrett, Rick; Nielsen, Rasmus
2007-04-03
We develop a Bayesian method based on MCMC for estimating the relative rates of pericentric and paracentric inversions from marker data from two species. The method also allows estimation of the distribution of inversion tract lengths. We apply the method to data from Drosophila melanogaster and D. yakuba. We find that pericentric inversions occur at a much lower rate compared to paracentric inversions. The average paracentric inversion tract length is approx. 4.8 Mb with small inversions being more frequent than large inversions. If the two breakpoints defining a paracentric inversion tract are uniformly and independently distributed over chromosome arms there will be more short tract-length inversions than long; we find an even greater preponderance of short tract lengths than this would predict. Thus there appears to be a correlation between the positions of breakpoints which favors shorter tract lengths. The method developed in this paper provides the first statistical estimator for estimating the distribution of inversion tract lengths from marker data. Application of this method for a number of data sets may help elucidate the relationship between the length of an inversion and the chance that it will get accepted.
Dependence of paracentric inversion rate on tract length
York, Thomas L; Durrett, Rick; Nielsen, Rasmus
2007-01-01
Background: We develop a Bayesian method based on MCMC for estimating the relative rates of pericentric and paracentric inversions from marker data from two species. The method also allows estimation of the distribution of inversion tract lengths. Results: We apply the method to data from Drosophila melanogaster and D. yakuba. We find that pericentric inversions occur at a much lower rate compared to paracentric inversions. The average paracentric inversion tract length is approx. 4.8 Mb with small inversions being more frequent than large inversions. If the two breakpoints defining a paracentric inversion tract are uniformly and independently distributed over chromosome arms there will be more short tract-length inversions than long; we find an even greater preponderance of short tract lengths than this would predict. Thus there appears to be a correlation between the positions of breakpoints which favors shorter tract lengths. Conclusion: The method developed in this paper provides the first statistical estimator for estimating the distribution of inversion tract lengths from marker data. Application of this method for a number of data sets may help elucidate the relationship between the length of an inversion and the chance that it will get accepted. PMID:17407601
NASA Astrophysics Data System (ADS)
Tahavori, Fatemeh
Respiratory motion induces uncertainty in External Beam Radiotherapy (EBRT), which can result in sub-optimal dose delivery to the target tissue and unwanted dose to normal tissue. The conventional approach to managing patient respiratory motion for EBRT within the area of abdominal-thoracic cancer is through the use of internal radiological imaging methods (e.g. Megavoltage imaging or Cone-Beam Computed Tomography) or via surrogate estimates of tumour position using external markers placed on the patient chest. This latter method uses tracking with video-based techniques, and relies on an assumed correlation, or mathematical model, between the external surrogate signal and the internal target position. The marker's trajectory can be used in both respiratory gating techniques and real-time tracking methods. Internal radiological imaging methods bring with them limited temporal resolution, and additional radiation burden, which can be addressed by external marker-based methods that carry no such issues. Moreover, by including multiple external markers and placing them closer to the internal target organs, the efficiency of correlation algorithms can be increased. However, the quality of such external monitoring methods is underpinned by the performance of the associated correlation model. Therefore, several new approaches to correlation modelling have been developed as part of this thesis and compared using publicly-available datasets. Highly competitive results have been obtained when compared against state-of-the-art methods. Marker-based methods also have the disadvantages of requiring manual set-up time for marker placement and patient positioning and potential issues with reproducibility of marker placement. This motivates the investigation of non-contact marker-free methods for use in EBRT, which is the main topic of this thesis. 
The Microsoft Kinect is used as an example of a low-cost consumer grade 3D depth camera for capturing and analysing external respiratory motion. This thesis makes the first presentation of detailed studies of external respiratory motion captured using such low-cost technology and demonstrates its potential in a healthcare environment. Firstly, the fundamental performance of a range of Microsoft Kinect sensors is assessed for use in radiotherapy (and potentially other healthcare applications), in terms of static and dynamic performance using both phantoms and volunteers. Then external respiratory motion is captured using the above technology from a group of 32 healthy volunteers and Principal Component Analysis (PCA) is applied to a region of interest encompassing the complete anterior surface to demonstrate breathing style. This work demonstrates that this surface motion can be compactly described by the first two PCA eigenvectors. The reproducibility of subject-specific EBRT set-up using conventional laser-based alignment and marker-based Deep Inspiration Breath Hold (DIBH) methods are also studied using the Microsoft Kinect sensor. A cohort of five healthy female volunteers is repeatedly set-up for left-sided breast cancer EBRT and multiple DIBH episodes captured over five separate sessions representing multiple fractionated radiotherapy treatment sessions, but without dose delivery. This provided an independent assessment that subjects were set-up and generally achieved variations within currently accepted margins of clinical practice. Moreover, this work demonstrated the potential role of consumer-grade 3D depth camera technology as a possible replacement for marker based set-up and DIBH management procedures. 
This brings with it the additional benefits of low cost, and potential through-put benefits, as patient set-up could ultimately be fully automated with this technology, and DIBH could be independently monitored without requiring preparatory manual intervention.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dasari, Paul K. R.; Shazeeb, Mohammed Salman; Könik, Arda
Purpose: Binning list-mode acquisitions as a function of a surrogate signal related to respiration has been employed to reduce the impact of respiratory motion on image quality in cardiac emission tomography (SPECT and PET). Inherent in amplitude binning is the assumption that there is a monotonic relationship between the amplitude of the surrogate signal and respiratory motion of the heart. This assumption is not valid in the presence of hysteresis when heart motion exhibits a different relationship with the surrogate during inspiration and expiration. The purpose of this study was to investigate the novel approach of using the Bouc–Wen (BW) model to provide a signal accounting for hysteresis when binning list-mode data with the goal of thereby improving motion correction. The study is based on the authors’ previous observations that hysteresis between chest and abdomen markers was indicative of hysteresis between abdomen markers and the internal motion of the heart. Methods: In 19 healthy volunteers, they determined the internal motion of the heart and diaphragm in the superior–inferior direction during free breathing using MRI navigators. A visual tracking system (VTS) synchronized with MRI acquisition tracked the anterior–posterior motions of external markers placed on the chest and abdomen. These data were employed to develop and test the Bouc–Wen model by inputting the VTS derived chest and abdomen motions into it and using the resulting output signals as surrogates for cardiac motion. The data of the volunteers were divided into training and testing sets. The training set was used to obtain initial values for the model parameters for all of the volunteers in the set, and for set members based on whether they were or were not classified as exhibiting hysteresis using a metric derived from the markers. These initial parameters were then employed with the testing set to estimate output signals. 
Pearson’s linear correlation coefficient between the abdomen, chest, average of chest and abdomen markers, and Bouc–Wen derived signals versus the true internal motion of the heart from MRI was used to judge how well the signals matched the heart motion. Results: The results show that the Bouc–Wen model generated signals demonstrated strong correlation with the heart motion. This correlation was slightly larger on average than that of the external surrogate signals derived from the abdomen marker, and average of the abdomen and chest markers, but was not statistically significantly different from them. Conclusions: The results suggest that the proposed model has the potential to be a unified framework for modeling hysteresis in respiratory motion in cardiac perfusion studies and beyond.
Barbaro, Josephine; Dissanayake, Cheryl
2013-01-01
The Social Attention and Communication Study involved the successful implementation of developmental surveillance of the early markers of autism spectrum disorders in a community-based setting. The objective in the current study was to determine the most discriminating and predictive markers of autism spectrum disorders used in the Social Attention and Communication Study at 12, 18 and 24 months of age, so that these could be used to identify children with autism spectrum disorders with greater accuracy. The percentage of 'yes/no' responses for each behavioural marker was compared between children with autistic disorder (n = 39), autism spectrum disorder (n = 50) and developmental and/or language delay (n = 20) from 12 to 24 months, with a logistic regression also conducted at 24 months. Across all ages, the recurring key markers of both autistic disorder and autism spectrum disorder were deficits in eye contact and pointing, and from 18 months, deficits in showing became an important marker. In combination, these behaviours, along with pretend play, were found to be the best group of predictors for a best estimate diagnostic classification of autistic disorder/autism spectrum disorder at 24 months. It is argued that the identified markers should be monitored repeatedly during the second year of life by community health-care professionals.
Lee, Yin Leng; Teitelbaum, Susan; Wolff, Mary S; Wetmur, James G; Chen, Jia
2010-12-01
Self-reported race/ethnicity is frequently used in epidemiological studies to assess an individual's background origin. However, in admixed populations such as Hispanics, self-reported race/ethnicity may not accurately represent individuals genetically because they are admixed with European, African and Native American ancestry. We estimated the proportions of genetic admixture in an ethnically diverse population of 396 mothers and 188 of their children with 35 ancestry informative markers (AIMs) using the STRUCTURE version 2.2 program. The majority of the markers showed significant deviation from Hardy-Weinberg equilibrium in our study population. In mothers self-identified as Black and White, the imputed ancestry proportions were 77.6% African and 75.1% European respectively, while the racial composition among self-identified Hispanics was 29.2% European, 26.0% African, and 44.8% Native American. We also investigated the utility of AIMs by showing the improved fitness of models in paraoxonase-1 genotype-phenotype associations after incorporating AIMs; however, the improvement was moderate at best. In summary, a minimal set of 35 AIMs is sufficient to detect population stratification and estimate the proportion of individual genetic admixture; however, the utility of these markers remains questionable.
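Deviation from Hardy-Weinberg equilibrium at a biallelic marker is often checked with a chi-square goodness-of-fit test on genotype counts. The Python sketch below is a generic textbook version and not necessarily the test used in the study; the function name and the genotype counts are hypothetical.

```python
def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square statistic for Hardy-Weinberg equilibrium (1 df).

    n_aa, n_ab, n_bb: observed counts of the three genotypes at one
    biallelic marker (hypothetical input format).
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")
    p = (2 * n_aa + n_ab) / (2 * n)    # frequency of allele A
    q = 1.0 - p                        # frequency of allele B
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    # Sum (O - E)^2 / E over genotype classes with non-zero expectation.
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)

# Made-up counts: a heterozygote deficit, as admixture can produce.
print(hwe_chi_square(30, 40, 30))
```

A statistic above about 3.84 corresponds to P < 0.05 on one degree of freedom. In an admixed sample, widespread deviation of this kind is an expected signature of population stratification rather than necessarily a sign of genotyping error.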
Optimal tumor sampling for immunostaining of biomarkers in breast carcinoma
2011-01-01
Introduction: Biomarkers, such as Estrogen Receptor, are used to determine therapy and prognosis in breast carcinoma. Immunostaining assays of biomarker expression have a high rate of inaccuracy; for example, estimates are as high as 20% for Estrogen Receptor. Biomarkers have been shown to be heterogeneously expressed in breast tumors and this heterogeneity may contribute to the inaccuracy of immunostaining assays. Currently, no evidence-based standards exist for the amount of tumor that must be sampled in order to correct for biomarker heterogeneity. The aim of this study was to determine the optimal number of 20X fields that are necessary to estimate a representative measurement of expression in a whole tissue section for selected biomarkers: ER, HER-2, AKT, ERK, S6K1, GAPDH, Cytokeratin, and MAP-Tau. Methods: Two collections of whole tissue sections of breast carcinoma were immunostained for biomarkers. Expression was quantified using the Automated Quantitative Analysis (AQUA) method of quantitative immunofluorescence. Simulated sampling of various numbers of fields (ranging from one to thirty-five) was performed for each marker. The optimal number was selected for each marker via resampling techniques and minimization of prediction error over an independent test set. Results: The optimal number of 20X fields varied by biomarker, ranging from three to fourteen fields. More heterogeneous markers, such as MAP-Tau protein, required a larger sample of 20X fields to produce a representative measurement. Conclusions: The optimal number of 20X fields that must be sampled to produce a representative measurement of biomarker expression varies by marker, with more heterogeneous markers requiring a larger number. The clinical implication of these findings is that breast biopsies consisting of a small number of fields may be inadequate to represent whole tumor biomarker expression for many markers. 
Additionally, for biomarkers newly introduced into clinical use, especially if therapeutic response is dictated by level of expression, the optimal size of tissue sample must be determined on a marker-by-marker basis. PMID:21592345
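The simulated field sampling described in this abstract can be illustrated with a minimal Python sketch. It repeatedly draws k fields from one section, averages them, and compares the result with the whole-section mean. The function name, the synthetic scores, and the use of mean absolute error are illustrative assumptions; this is not the study's AQUA pipeline, nor its exact resampling and prediction-error procedure.

```python
import random
import statistics

def sampling_error_by_field_count(field_scores, max_fields, n_resamples=500, seed=0):
    """Mean absolute error between the mean of k sampled fields and the
    whole-section mean, for k = 1..max_fields (hypothetical helper)."""
    rng = random.Random(seed)
    whole_mean = statistics.fmean(field_scores)
    errors = {}
    for k in range(1, max_fields + 1):
        abs_errs = []
        for _ in range(n_resamples):
            sample = rng.sample(field_scores, k)  # k distinct fields, no replacement
            abs_errs.append(abs(statistics.fmean(sample) - whole_mean))
        errors[k] = statistics.fmean(abs_errs)
    return errors

# Synthetic per-field expression scores for one heterogeneous section (assumed data).
rng = random.Random(1)
scores = [rng.gauss(100.0, 30.0) for _ in range(35)]
errors = sampling_error_by_field_count(scores, max_fields=35)
```

In a sketch like this, a more heterogeneous marker (larger spread across fields) needs more fields before the error falls below a chosen tolerance, which is the qualitative pattern the abstract reports.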
Wang, Xuefeng; Lee, Seunggeun; Zhu, Xiaofeng; Redline, Susan; Lin, Xihong
2013-12-01
Family-based genetic association studies of related individuals provide opportunities to detect genetic variants that complement studies of unrelated individuals. Most statistical methods for family association studies for common variants are single marker based, testing one SNP at a time. In this paper, we consider testing the effect of an SNP set, e.g., SNPs in a gene, in family studies, for both continuous and discrete traits. Specifically, we propose a generalized estimating equations (GEEs) based kernel association test, a variance component based testing method, to test for the association between a phenotype and multiple variants in an SNP set jointly using family samples. The proposed approach allows for both continuous and discrete traits, where the correlation among family members is taken into account through the use of an empirical covariance estimator. We derive the theoretical distribution of the proposed statistic under the null and develop analytical methods to calculate the P-values. We also propose an efficient resampling method for correcting for small sample size bias in family studies. The proposed method allows for easily incorporating covariates and SNP-SNP interactions. Simulation studies show that the proposed method properly controls type I error rates under both random and ascertained sampling schemes in family studies. We demonstrate through simulation studies that our approach has superior performance for association mapping compared to the single marker based minimum P-value GEE test for an SNP-set effect over a range of scenarios. We illustrate the application of the proposed method using data from the Cleveland Family GWAS Study. © 2013 WILEY PERIODICALS, INC.
Quebeda-Clerkin, Patricia B.; Dodge, Carter P.; Harris, Brent T.; Hillier, Simon C.; Duhaime, Ann-Christine
2012-01-01
Abstract A peripheral indicator of the presence and magnitude of brain injury has been a sought-after tool by clinicians. We measured neuron-specific enolase (NSE), myelin basic protein (MBP), and S100B, prior to and after scaled cortical impact in immature pigs, to determine if these purported markers increase after injury, correlate with the resulting lesion volume, and if these relationships vary with maturation. Scaled cortical impact resulted in increased lesion volume with increasing age. Concentrations of NSE, but not S100B or MBP, increased after injury in all age groups. The high variability of S100B concentrations prior to injury may have precluded detection of an increase due to injury. Total serum markers were estimated, accounting for the allometric growth of blood volume, and resulted in a positive correlation of both NSE and S100B with lesion volume. Even with allometric scaling of blood volume and a uniform mechanism of injury, NSE had only a fair to poor predictive value. In a clinical setting, where the types of injuries are varied, more investigation is required to yield a panel of serum markers that can reliably predict the extent of injury. Allometric scaling may improve estimation of serum marker release in pediatric populations. PMID:22867012
Ice Cores Dating With a New Inverse Method Taking Account of the Flow Modeling Errors
NASA Astrophysics Data System (ADS)
Lemieux-Dudon, B.; Parrenin, F.; Blayo, E.
2007-12-01
Deep ice cores extracted from Antarctica or Greenland have recorded a wide range of past climatic events. To contribute to the understanding of the Quaternary climate system, calculating an accurate depth-age relationship is crucial. Up to now, ice chronologies for deep ice cores estimated with inverse approaches have been based on quite simplified ice-flow models that fail to reproduce flow irregularities and consequently fail to respect the full set of available age markers. We describe in this paper a new inverse method that takes into account the model uncertainty in order to circumvent the restrictions linked to the use of simplified flow models. This method uses first guesses on two physical flow entities, the ice thinning function and the accumulation rate, and then identifies correction functions on both flow entities. We highlight two major benefits brought by this new method: first, the ability to respect a large set of observations and, as a consequence, the feasibility of estimating a synchronized common ice chronology for several cores at the same time. This inverse approach relies on a Bayesian framework. To respect the positivity constraint on the sought correction functions, we assume lognormal probability distributions for the background errors and also for one particular set of the observation errors. We test this new inversion method on three cores simultaneously (the two EPICA cores, DC and DML, and the Vostok core) and we assimilate more than 150 observations (e.g., age markers, stratigraphic links, ...). We analyze the sensitivity of the solution with respect to the background information, especially the prior error covariance matrix. The confidence intervals, based on the posterior covariance matrix calculation, are estimated on the correction functions and, for the first time, on the overall output chronologies.
Moving metal artifact reduction in cone-beam CT scans with implanted cylindrical gold markers
DOE Office of Scientific and Technical Information (OSTI.GOV)
Toftegaard, Jakob, E-mail: jaktofte@rm.dk; Fledelius, Walther; Worm, Esben S.
2014-12-15
Purpose: Implanted gold markers for image-guided radiotherapy lead to streaking artifacts in cone-beam CT (CBCT) scans. Several methods for metal artifact reduction (MAR) have been published, but they all fail in scans with large motion. Here the authors propose and investigate a method for automatic moving metal artifact reduction (MMAR) in CBCT scans with cylindrical gold markers. Methods: The MMAR CBCT reconstruction method has six steps. (1) Automatic segmentation of the cylindrical markers in the CBCT projections. (2) Removal of each marker in the projections by replacing the pixels within a masked area with interpolated values. (3) Reconstruction of a marker-free CBCT volume from the manipulated CBCT projections. (4) Reconstruction of a standard CBCT volume with metal artifacts from the original CBCT projections. (5) Estimation of the three-dimensional (3D) trajectory during CBCT acquisition for each marker based on the segmentation in Step 1, and identification of the smallest ellipsoidal volume that encompasses 95% of the visited 3D positions. (6) Generation of the final MMAR CBCT reconstruction from the marker-free CBCT volume of Step 3 by replacing the voxels in the 95% ellipsoid with the corresponding voxels of the standard CBCT volume of Step 4. The MMAR reconstruction was performed retrospectively using a half-fan CBCT scan for 29 consecutive stereotactic body radiation therapy patients with 2–3 gold markers implanted in the liver. The metal artifacts of the MMAR reconstructions were scored and compared with a standard MAR reconstruction by counting the streaks and by calculating the standard deviation of the Hounsfield units in a region around each marker. Results: The markers were found with the same autosegmentation settings in 27 CBCT scans, while two scans needed slightly changed settings to find all markers automatically in Step 1 of the MMAR method.
MMAR resulted in 15 scans with no streaking artifacts, 11 scans with 1–4 streaks, and 3 scans with severe streaking artifacts. The corresponding numbers for MAR were 8 (no streaks), 1 (1–4 streaks), and 20 (severe streaking artifacts). The MMAR method was superior to MAR in scans with more than 8 mm 3D marker motion and comparable to MAR for scans with less than 8 mm motion. In addition, the MMAR method was tested on a 4D CBCT reconstruction for which it worked equally well as for the 3D case. The markers in the 4D case had very low motion blur. Conclusions: An automatic method for MMAR in CBCT scans was proposed and shown to effectively remove almost all streaking artifacts in a large set of clinical CBCT scans with implanted gold markers in the liver. Residual streaking artifacts observed in three CBCT scans may be removed with better marker segmentation.
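Step 6 of the method above (copying voxels inside the marker ellipsoid from the standard volume into the marker-free volume) can be sketched in Python as follows. This is a toy illustration under stated assumptions: the ellipsoid is taken to be axis-aligned with a known center and semi-axes, volumes are nested lists indexed [z][y][x], and the function name is hypothetical. The abstract does not specify the orientation of the 95% ellipsoid or any implementation detail.

```python
def composite_marker_region(marker_free, standard, center, radii):
    """Return a copy of `marker_free` in which voxels inside an axis-aligned
    ellipsoid are taken from `standard` (hypothetical helper)."""
    cz, cy, cx = center
    rz, ry, rx = radii
    # Copy so that the marker-free input volume is left unchanged.
    out = [[row[:] for row in plane] for plane in marker_free]
    for z, plane in enumerate(standard):
        for y, row in enumerate(plane):
            for x, value in enumerate(row):
                inside = (((z - cz) / rz) ** 2
                          + ((y - cy) / ry) ** 2
                          + ((x - cx) / rx) ** 2) <= 1.0
                if inside:
                    out[z][y][x] = value
    return out

# Toy 5x5x5 volumes: the marker-free volume is all 0, the standard volume all 1.
n = 5
marker_free = [[[0] * n for _ in range(n)] for _ in range(n)]
standard = [[[1] * n for _ in range(n)] for _ in range(n)]
result = composite_marker_region(marker_free, standard, center=(2, 2, 2), radii=(1, 1, 1))
```

With unit semi-axes only the center voxel and its six face neighbors are replaced, so the marker itself is restored while the rest of the volume stays free of streaks from the standard reconstruction.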
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa
2012-01-01
Background Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Results Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. 
Conclusions Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies. PMID:23167289
Segmentation and Quantification for Angle-Closure Glaucoma Assessment in Anterior Segment OCT.
Fu, Huazhu; Xu, Yanwu; Lin, Stephen; Zhang, Xiaoqin; Wong, Damon Wing Kee; Liu, Jiang; Frangi, Alejandro F; Baskaran, Mani; Aung, Tin
2017-09-01
Angle-closure glaucoma is a major cause of irreversible visual impairment and can be identified by measuring the anterior chamber angle (ACA) of the eye. The ACA can be viewed clearly through anterior segment optical coherence tomography (AS-OCT), but the imaging characteristics and the shapes and locations of major ocular structures can vary significantly among different AS-OCT modalities, thus complicating image analysis. To address this problem, we propose a data-driven approach for automatic AS-OCT structure segmentation, measurement, and screening. Our technique first estimates initial markers in the eye through label transfer from a hand-labeled exemplar data set, whose images are collected over different patients and AS-OCT modalities. These initial markers are then refined by using a graph-based smoothing method that is guided by AS-OCT structural information. These markers facilitate segmentation of major clinical structures, which are used to recover standard clinical parameters. These parameters can be used not only to support clinicians in making anatomical assessments, but also to serve as features for detecting anterior angle closure in automatic glaucoma screening algorithms. Experiments on Visante AS-OCT and Cirrus high-definition-OCT data sets demonstrate the effectiveness of our approach.
Gaussian covariance graph models accounting for correlated marker effects in genome-wide prediction.
Martínez, C A; Khare, K; Rahman, S; Elzo, M A
2017-10-01
Several statistical models used in genome-wide prediction assume uncorrelated marker allele substitution effects, but it is known that these effects may be correlated. In statistics, graphical models have been identified as a useful tool for covariance estimation in high-dimensional problems, an area that has recently expanded greatly. In Gaussian covariance graph models (GCovGM), the joint distribution of a set of random variables is assumed to be Gaussian and the pattern of zeros of the covariance matrix is encoded in terms of an undirected graph G. In this study, methods adapting the theory of GCovGM to genome-wide prediction were developed (Bayes GCov, Bayes GCov-KR and Bayes GCov-H). In simulated data sets, improvements in correlation between phenotypes and predicted breeding values and accuracies of predicted breeding values were found. Our models account for correlation of marker effects and can accommodate general structures, as opposed to models proposed in previous studies, which consider spatial correlation only. In addition, they allow incorporation of biological information in the prediction process through its use when constructing graph G, and their extension to the multi-allelic loci case is straightforward. © 2017 Blackwell Verlag GmbH.
The practical evaluation of DNA barcode efficacy.
Spouge, John L; Mariño-Ramírez, Leonardo
2012-01-01
This chapter describes a workflow for measuring the efficacy of a barcode in identifying species. First, assemble individual sequence databases corresponding to each barcode marker. A controlled collection of taxonomic data is preferable to GenBank data, because GenBank data can be problematic, particularly when comparing barcodes based on more than one marker. To ensure proper controls when evaluating species identification, specimens not having a sequence in every marker database should be discarded. Second, select a computer algorithm for assigning species to barcode sequences. No algorithm has yet improved notably on assigning a specimen to the species of its nearest neighbor within a barcode database. Because global sequence alignments (e.g., with the Needleman-Wunsch algorithm, or some related algorithm) examine entire barcode sequences, they generally produce better species assignments than local sequence alignments (e.g., with BLAST). No neighboring method (e.g., global sequence similarity, global sequence distance, or evolutionary distance based on a global alignment) has yet shown a notable superiority in identifying species. Finally, "the probability of correct identification" (PCI) provides an appropriate measurement of barcode efficacy. The overall PCI for a data set is the average of the species PCIs, taken over all species in the data set. This chapter states explicitly how to calculate PCI, how to estimate its statistical sampling error, and how to use data on PCR failure to set limits on how much improvements in PCR technology can improve species identification.
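The nearest-neighbor assignment and the averaging of species PCIs described above can be sketched in Python. Several points here are assumptions rather than the chapter's stated procedure: sequences are taken to be pre-aligned and of equal length, distance is a simple mismatch count instead of a global alignment score, evaluation is leave-one-out, and a specimen counts as correctly identified only when all of its nearest neighbors belong to its own species. The function names are hypothetical.

```python
from collections import defaultdict

def hamming(a, b):
    """Number of mismatched positions between two equal-length, pre-aligned sequences."""
    return sum(ch1 != ch2 for ch1, ch2 in zip(a, b))

def overall_pci(specimens):
    """specimens: list of (species, aligned_sequence) with at least two entries.
    Returns (overall PCI, per-species PCI); overall PCI is the unweighted
    mean of the species PCIs."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for i, (species, seq) in enumerate(specimens):
        # Leave-one-out: compare against every other specimen in the database.
        others = [(hamming(seq, s), sp) for j, (sp, s) in enumerate(specimens) if j != i]
        best = min(d for d, _ in others)
        nearest_species = {sp for d, sp in others if d == best}
        totals[species] += 1
        if nearest_species == {species}:  # ties with another species count as failures
            hits[species] += 1
    per_species = {sp: hits[sp] / totals[sp] for sp in totals}
    overall = sum(per_species.values()) / len(per_species)
    return overall, per_species

# Toy barcode database (assumed data): species C has a single specimen.
specimens = [
    ("A", "ACGTACGT"),
    ("A", "ACGTACGA"),
    ("B", "TTGTACCC"),
    ("B", "TTGTACCA"),
    ("C", "ACGTACGG"),
]
overall, per_species = overall_pci(specimens)
```

Averaging over species rather than over specimens keeps heavily sampled species from dominating the overall figure, which matches the definition given in the abstract.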
Pereira, Rui; Phillips, Christopher; Pinto, Nádia; Santos, Carla; dos Santos, Sidney Emanuel Batista; Amorim, António; Carracedo, Ángel; Gusmão, Leonor
2012-01-01
Ancestry-informative markers (AIMs) show high allele frequency divergence between different ancestral or geographically distant populations. These genetic markers are especially useful in inferring the likely ancestral origin of an individual or estimating the apportionment of ancestry components in admixed individuals or populations. The study of AIMs is of great interest in clinical genetics research, particularly to detect and correct for population substructure effects in case-control association studies, but also in population and forensic genetics studies. This work presents a set of 46 ancestry-informative insertion deletion polymorphisms selected to efficiently measure population admixture proportions of four different origins (African, European, East Asian and Native American). All markers are analyzed in short fragments (under 230 basepairs) through a single PCR followed by capillary electrophoresis (CE) allowing a very simple one tube PCR-to-CE approach. HGDP-CEPH diversity panel samples from the four groups, together with Oceanians, were genotyped to evaluate the efficiency of the assay in clustering populations from different continental origins and to establish reference databases. In addition, other populations from diverse geographic origins were tested using the HGDP-CEPH samples as reference data. The results revealed that the AIM-INDEL set developed is highly efficient at inferring the ancestry of individuals and provides good estimates of ancestry proportions at the population level. In conclusion, we have optimized the multiplexed genotyping of 46 AIM-INDELs in a simple and informative assay, enabling a more straightforward alternative to the commonly available AIM-SNP typing methods dependent on complex, multi-step protocols or implementation of large-scale genotyping technologies. PMID:22272242
Effect of blood pressure lowering on markers of kidney disease progression.
Udani, Suneel M; Koyner, Jay L
2009-10-01
Hypertension remains a common comorbidity and cause of chronic kidney disease (CKD). As the number of patients with CKD grows, so does the need to identify modifiable risk factors for CKD progression. Data on slowing progression of CKD or preventing end-stage renal disease with aggressive blood pressure control have not yielded definitive conclusions regarding ideal blood pressure targets. Shifting the focus of antihypertensive therapy to alternative markers of end-organ damage, specifically proteinuria, has yielded some promise in preventing the progression of CKD. Nevertheless, proteinuria and decline in estimated GFR may represent an irreversible degree of injury to the kidney that limits the impact of any therapy. The identification and use of novel markers of kidney injury to assess the impact of antihypertensive therapy may yield clearer direction with regard to optimal management of hypertension in the setting of CKD.
Foster, Charles S P; Henwood, Murray J; Ho, Simon Y W
2018-05-25
Data sets comprising small numbers of genetic markers are not always able to resolve phylogenetic relationships. This has frequently been the case in molecular systematic studies of plants, with many analyses being based on sequence data from only two or three chloroplast genes. An example of this comes from the riceflowers Pimelea Banks & Sol. ex Gaertn. (Thymelaeaceae), a large genus of flowering plants predominantly distributed in Australia. Despite the considerable morphological variation in the genus, low sequence divergence in chloroplast markers has led to the phylogeny of Pimelea remaining largely uncertain. In this study, we resolve the backbone of the phylogeny of Pimelea in comprehensive Bayesian and maximum-likelihood analyses of plastome sequences from 41 taxa. However, some relationships received only moderate to poor support, and the Pimelea clade contained extremely short internal branches. By using topology-clustering analyses, we demonstrate that conflicting phylogenetic signals can be found across the trees estimated from individual chloroplast protein-coding genes. A relaxed-clock dating analysis reveals that Pimelea arose in the mid-Miocene, with most divergences within the genus occurring during a subsequent rapid diversification. Our new phylogenetic estimate offers better resolution and is more strongly supported than previous estimates, providing a platform for future taxonomic revisions of both Pimelea and the broader subfamily. Our study has demonstrated the substantial improvements in phylogenetic resolution that can be achieved using plastome-scale data sets in plant molecular systematics. Copyright © 2018 Elsevier Inc. All rights reserved.
Inference from single occasion capture experiments using genetic markers.
Hettiarachchige, Chathurika K H; Huggins, Richard M
2018-05-01
Accurate estimation of the size of animal populations is an important task in ecological science. Recent advances in molecular genetics research allow the use of genetic data to estimate the size of a population from a single capture occasion rather than repeated occasions as in the usual capture-recapture experiments. Estimating the population size using genetic data has also sometimes led to estimates that differ markedly from each other and from classical capture-recapture estimates. Here, we develop a closed form estimator that uses genetic information to estimate the size of a population consisting of mothers and daughters, focusing on estimating the number of mothers, using data from a single sample. We demonstrate the estimator is consistent and propose a parametric bootstrap to estimate the standard errors. The estimator is evaluated in a simulation study and applied to real data. We also consider maximum likelihood in this setting and discover problems that preclude its general use. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Van Aardt, Jan; Romanczyk, Paul; van Leeuwen, Martin; ...
2016-04-04
Terrestrial laser scanning (TLS) has emerged as an effective tool for rapid comprehensive measurement of object structure. Registration of TLS data is an important prerequisite to overcome the limitations of occlusion. However, due to the high dissimilarity of point cloud data collected from disparate viewpoints in the forest environment, adequate marker-free registration approaches have not been developed. The majority of studies instead rely on the utilization of artificial tie points (e.g., reflective tooling balls) placed within a scene to aid in coordinate transformation. We present a technique for generating view-invariant feature descriptors that are intrinsic to the point cloud data and, thus, enable blind marker-free registration in forest environments. To overcome the limitation of initial pose estimation, we employ a voting method to blindly determine the optimal pairwise transformation parameters, without an a priori estimate of the initial sensor pose. To provide embedded error metrics, we developed a set theory framework in which a circular transformation is traversed between disjoint tie point subsets. This provides an upper estimate of the Root Mean Square Error (RMSE) confidence associated with each pairwise transformation. Output RMSE errors are commensurate with the RMSE of input tie point locations. Thus, while the mean output RMSE = 16.3 cm, improved results could be achieved with a more precise laser scanning system. This study 1) quantifies the RMSE of the proposed marker-free registration approach, 2) assesses the validity of embedded confidence metrics using receiver operator characteristic (ROC) curves, and 3) informs optimal sample spacing considerations for TLS data collection in New England forests. Furthermore, while the implications for rapid, accurate, and precise forest inventory are obvious, the conceptual framework outlined here could potentially be extended to built environments.
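The idea of traversing a circular chain of transformations and reading off an RMSE can be illustrated with a small Python sketch. It makes a strong simplifying assumption: each pairwise transformation is a pure translation, whereas real scan registration uses full rigid transforms with rotation. The tie points, offsets, and function names are made up for illustration and do not come from the study.

```python
import math

def rmse(points_a, points_b):
    """Root mean square of Euclidean distances between corresponding 3D points."""
    squared = [sum((a - b) ** 2 for a, b in zip(p, q)) for p, q in zip(points_a, points_b)]
    return math.sqrt(sum(squared) / len(squared))

def translate(points, offset):
    """Apply a translation to every point (stand-in for a full rigid transform)."""
    return [tuple(c + o for c, o in zip(p, offset)) for p in points]

# Hypothetical tie points in the frame of scan 1 (metres).
tie_points = [(0.0, 0.0, 0.0), (1.0, 0.5, 2.0), (3.0, 1.5, 1.0)]

# Estimated pairwise offsets scan1->scan2, scan2->scan3, scan3->scan1.
# A perfect registration would sum to zero; here 0.1 m of error remains in z.
offsets = [(1.0, 0.0, 0.0), (0.0, 2.0, 0.0), (-1.0, -2.0, 0.1)]

moved = tie_points
for offset in offsets:
    moved = translate(moved, offset)

# Distance between the original points and the points carried around the loop.
closure_rmse = rmse(tie_points, moved)
```

Because the loop should return every point to where it started, the closure RMSE reflects the accumulated error of the pairwise transformations and needs no external reference data.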
Exploring the Use of Thermal Infrared Imaging in Human Stress Research
Engert, Veronika; Merla, Arcangelo; Grant, Joshua A.; Cardone, Daniela; Tusche, Anita; Singer, Tania
2014-01-01
High resolution thermal infrared imaging is a pioneering method giving indices of sympathetic activity via the contact-free recording of facial tissues (thermal imprints). Compared to established stress markers, the great advantage of this method is its non-invasiveness. The goal of our study was to pilot the use of thermal infrared imaging in the classical setting of human stress research. Thermal imprints were compared to established stress markers (heart rate, heart rate variability, finger temperature, alpha-amylase and cortisol) in 15 participants undergoing anticipation, stress and recovery phases of two laboratory stress tests, the Cold Pressor Test and the Trier Social Stress Test. The majority of the thermal imprints proved to be change-sensitive in both tests. While correlations between the thermal imprints and established stress markers were mostly non-significant, the thermal imprints (but not the established stress markers) did correlate with stress-induced mood changes. Multivariate pattern analysis revealed that, in contrast to the established stress markers, the thermal imprints could not disambiguate anticipation, stress and recovery phases of both tests. Overall, these results suggest that thermal infrared imaging is a valuable method for the estimation of sympathetic activity in the stress laboratory setting. The use of this non-invasive method may be particularly beneficial for covert recordings, in the study of special populations showing difficulties in complying with the standard instruments of data collection and in the domain of psychophysiological covariance research. Meanwhile, the established stress markers seem to be superior when it comes to the characterization of complex physiological states during the different phases of the stress cycle. PMID:24675709
USDA-ARS's Scientific Manuscript database
Genetic marker effects and type of inheritance are estimated with poor precision when minor marker allele frequencies are low. A stable composite population (MARC II) was subjected to marker assisted selection for two years to equalize CSN1S1 and TG genetic marker frequencies to evaluate the epista...
Genomic Prediction of Genotype × Environment Interaction Kernel Regression Models.
Cuevas, Jaime; Crossa, José; Soberanis, Víctor; Pérez-Elizalde, Sergio; Pérez-Rodríguez, Paulino; Campos, Gustavo de Los; Montesinos-López, O A; Burgueño, Juan
2016-11-01
In genomic selection (GS), genotype × environment interaction (G × E) can be modeled by a marker × environment interaction (M × E). The G × E may be modeled through a linear kernel or a nonlinear (Gaussian) kernel. In this study, we propose using two nonlinear Gaussian kernels: the reproducing kernel Hilbert space with kernel averaging (RKHS KA) and the Gaussian kernel with the bandwidth estimated through an empirical Bayesian method (RKHS EB). We performed single-environment analyses and extended to account for G × E interaction (GBLUP-G × E, RKHS KA-G × E and RKHS EB-G × E) in wheat (Triticum aestivum L.) and maize (Zea mays L.) data sets. For single-environment analyses of wheat and maize data sets, RKHS EB and RKHS KA had higher prediction accuracy than GBLUP for all environments. For the wheat data, the RKHS KA-G × E and RKHS EB-G × E models showed up to 60 to 68% superiority over the corresponding single environment for pairs of environments with positive correlations. For the wheat data set, the models with Gaussian kernels had accuracies up to 17% higher than that of GBLUP-G × E. For the maize data set, the prediction accuracy of RKHS EB-G × E and RKHS KA-G × E was, on average, 5 to 6% higher than that of GBLUP-G × E. The superiority of the Gaussian kernel models over the linear kernel is due to more flexible kernels that account for small, more complex marker main effects and marker-specific interaction effects. Copyright © 2016 Crop Science Society of America.
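The contrast between a linear kernel and a Gaussian kernel built from marker data can be sketched in Python. The details below are assumptions for illustration only: the bandwidth h is fixed at 1 rather than estimated by an empirical Bayesian method or handled by kernel averaging, squared distances are scaled by their median, and the toy marker matrix is neither centered nor standardized. The function names are hypothetical.

```python
import math
import statistics

def squared_distances(X):
    """Pairwise squared Euclidean distances between rows (lines) of a marker matrix."""
    n = len(X)
    return [[sum((a - b) ** 2 for a, b in zip(X[i], X[j])) for j in range(n)]
            for i in range(n)]

def gaussian_kernel(X, h=1.0):
    """K[i][j] = exp(-h * d2[i][j] / median(d2)); h is an assumed fixed bandwidth.
    Fails with a division by zero if the median off-diagonal distance is zero."""
    d2 = squared_distances(X)
    n = len(X)
    off_diagonal = [d2[i][j] for i in range(n) for j in range(i + 1, n)]
    scale = statistics.median(off_diagonal)
    return [[math.exp(-h * d2[i][j] / scale) for j in range(n)] for i in range(n)]

def linear_kernel(X):
    """G = X X' / p, the GBLUP-style linear kernel (markers left uncentered here)."""
    p = len(X[0])
    return [[sum(a * b for a, b in zip(xi, xj)) / p for xj in X] for xi in X]

# Toy marker matrix: 3 lines x 4 markers coded 0/1/2 (assumed data).
X = [[0, 1, 2, 1],
     [0, 1, 2, 0],
     [2, 1, 0, 1]]
K = gaussian_kernel(X)
G = linear_kernel(X)
```

The Gaussian kernel depends on marker data only through distances between lines, so similarity decays nonlinearly as genotypes diverge, whereas the linear kernel grows with the raw cross-product of marker codes.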
Physical Activity and Adiposity Markers at Older Ages: Accelerometer Vs Questionnaire Data
Sabia, Séverine; Cogranne, Pol; van Hees, Vincent T.; Bell, Joshua A.; Elbaz, Alexis; Kivimaki, Mika; Singh-Manoux, Archana
2015-01-01
Objective Physical activity is critically important for successful aging, but its effect on adiposity markers at older ages is unclear as much of the evidence comes from self-reported data on physical activity. We assessed the associations of questionnaire-assessed and accelerometer-assessed physical activity with adiposity markers in older adults. Design/Setting/Participants This was a cross-sectional study on 3940 participants (age range 60-83 years) of the Whitehall II study who completed a 20-item physical activity questionnaire and wore a wrist-mounted accelerometer for 9 days in 2012 and 2013. Measurements Total physical activity was estimated using metabolic equivalent hours/week for the questionnaire and mean acceleration for the accelerometer. Time spent in moderate-and-vigorous physical activity (MVPA) was also assessed by questionnaire and accelerometer. Adiposity assessment included body mass index, waist circumference, and fat mass index. Fat mass index was calculated as fat mass/height² (kg/m²), with fat mass estimated using bioimpedance. Results Greater total physical activity was associated with lower adiposity for all adiposity markers in a dose-response manner. In men, the strength of this association was 2.4 to 2.8 times stronger with the accelerometer than with questionnaire data. In women, it was 1.9 to 2.3 times stronger. For MVPA, questionnaire data in men suggested no further benefit for adiposity markers past 1 hour/week of activity. This was not the case for accelerometer-assessed MVPA where, for example, compared with men undertaking <1 hour/week of accelerometer-assessed MVPA, waist circumference was 3.06 (95% confidence interval 2.06–4.06) cm lower in those performing MVPA 1–2.5 hours/week, 4.69 (3.47–5.91) cm lower in those undertaking 2.5–4 hours/week, and 7.11 (5.93–8.29) cm lower in those performing ≥4 hours/week. 
Conclusions The association of physical activity with adiposity markers in older adults was stronger when physical activity was assessed by accelerometer compared with questionnaire, suggesting that physical activity might be more important for adiposity than previously estimated. PMID:25752539
Estimation of the diagnostic threshold accounting for decision costs and sampling uncertainty.
Skaltsa, Konstantina; Jover, Lluís; Carrasco, Josep Lluís
2010-10-01
Medical diagnostic tests are used to classify subjects as non-diseased or diseased. The classification rule usually consists of classifying subjects using the values of a continuous marker that is dichotomised by means of a threshold. Here, the optimum threshold estimate is found by minimising a cost function that accounts for both decision costs and sampling uncertainty. The cost function is optimised either analytically in a normal distribution setting or empirically in a distribution-free setting when the underlying probability distributions of diseased and non-diseased subjects are unknown. Inference on the threshold estimates is based on approximate analytical standard errors and bootstrap-based approaches. The performance of the proposed methodology is assessed by means of a simulation study, and the sample size required for a given confidence interval precision and sample size ratio is also calculated. Finally, a case example based on previously published data concerning the diagnosis of Alzheimer's patients is provided in order to illustrate the procedure.
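The idea of choosing a threshold by minimising expected decision cost can be sketched as follows. This is a simplified illustration, not the authors' estimator: it assumes normally distributed marker values with known parameters, higher values in diseased subjects, and a cost function that ignores sampling uncertainty. The names `expected_cost` and `optimal_threshold` and all default values are hypothetical.

```python
from statistics import NormalDist


def expected_cost(t, healthy, diseased, prevalence, cost_fp, cost_fn):
    """Expected misclassification cost when subjects above t are called diseased."""
    false_positive_rate = 1.0 - healthy.cdf(t)
    false_negative_rate = diseased.cdf(t)
    return (cost_fp * (1.0 - prevalence) * false_positive_rate
            + cost_fn * prevalence * false_negative_rate)


def optimal_threshold(healthy, diseased, prevalence=0.5,
                      cost_fp=1.0, cost_fn=1.0, steps=10000):
    """Grid search for the threshold with the lowest expected cost."""
    spread = 4.0 * max(healthy.stdev, diseased.stdev)
    low = min(healthy.mean, diseased.mean) - spread
    high = max(healthy.mean, diseased.mean) + spread
    grid = [low + (high - low) * k / steps for k in range(steps + 1)]
    return min(grid, key=lambda t: expected_cost(
        t, healthy, diseased, prevalence, cost_fp, cost_fn))
```

With equal costs, equal prevalence, and equal variances, the minimiser falls midway between the two means; raising the cost of a false negative pushes the threshold down.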
Taylor, Alexander J; Granwehr, Josef; Lesbats, Clémentine; Krupa, James L; Six, Joseph S; Pavlovskaya, Galina E; Thomas, Neil R; Auer, Dorothee P; Meersmann, Thomas; Faas, Henryk M
2016-01-01
Due to low fluorine background signal in vivo, 19F is a good marker to study the fate of exogenous molecules by magnetic resonance imaging (MRI) using equilibrium nuclear spin polarization schemes. Since 19F MRI applications require high sensitivity, it can be important to assess experimental feasibility during the design stage already by estimating the minimum detectable fluorine concentration. Here we propose a simple method for the calibration of MRI hardware, providing sensitivity estimates for a given scanner and coil configuration. An experimental "calibration factor" to account for variations in coil configuration and hardware set-up is specified. Once it has been determined in a calibration experiment, the sensitivity of an experiment or, alternatively, the minimum number of required spins or the minimum marker concentration can be estimated without the need for a pilot experiment. The definition of this calibration factor is derived based on standard equations for the sensitivity in magnetic resonance, yet the method is not restricted by the limited validity of these equations, since additional instrument-dependent factors are implicitly included during calibration. The method is demonstrated using MR spectroscopy and imaging experiments with different 19F samples, both paramagnetically and susceptibility broadened, to approximate a range of realistic environments.
Feng, Lei; Peng, Fuduan; Li, Shanfei; Jiang, Li; Sun, Hui; Ji, Anquan; Zeng, Changqing; Li, Caixia; Liu, Fan
2018-03-23
Estimating individual age from biomarkers may provide key information facilitating forensic investigations. Recent progress has shown DNA methylation at age-associated CpG sites as the most informative biomarkers for estimating the individual age of an unknown donor. Optimal feature selection plays a critical role in determining the performance of the final prediction model. In this study we investigate methylation levels at 153 age-associated CpG sites from 21 previously reported genomic regions using the EpiTYPER system for their predictive power on individual age in 390 Han Chinese males ranging from 15 to 75 years of age. We conducted a systematic feature selection using a stepwise backward multiple linear regression analysis as well as an exhaustive searching algorithm. Both approaches identified the same subset of 9 CpG sites, which in linear combination provided the optimal model fitting with mean absolute deviation (MAD) of 2.89 years of age and explainable variance (R²) of 0.92. The final model was validated in two independent Han Chinese male samples (validation set 1, N = 65, MAD = 2.49, R² = 0.95, and validation set 2, N = 62, MAD = 3.36, R² = 0.89). Other competing models such as support vector machine and artificial neural network did not outperform the linear model to any noticeable degree. Validation set 1 was additionally analyzed using pyrosequencing technology for cross-platform validation and was termed validation set 3. Directly applying our model, in which the methylation levels were detected by the EpiTYPER system, to the data from pyrosequencing technology showed, however, less accurate results in terms of MAD (validation set 3, N = 65 Han Chinese males, MAD = 4.20, R² = 0.93), suggesting the presence of a batch effect between different data generation platforms. This batch effect could be partially overcome by a z-score transformation (MAD = 2.76, R² = 0.93). 
Overall, our systematic feature selection identified 9 CpG sites as the optimal subset for forensic age estimation and the prediction model consisting of these 9 markers demonstrated high potential in forensic practice. An age estimator implementing our prediction model allowing missing markers is freely available at http://liufan.big.ac.cn/AgePrediction. Copyright © 2018 Elsevier B.V. All rights reserved.
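The two quantities this abstract relies on, the mean absolute deviation between true and predicted age and a z-score transformation, can be sketched in a few lines of Python. The function names are hypothetical, and applying the z-score to one vector of methylation values at a time (for example, per CpG site within a platform) is an assumption; the abstract does not describe how the transformation was applied.

```python
from statistics import mean, pstdev


def mean_absolute_deviation(true_ages, predicted_ages):
    """Average absolute difference between true and predicted ages, in years."""
    return mean(abs(t - p) for t, p in zip(true_ages, predicted_ages))


def z_score(values):
    """Centre values on their mean and scale to unit (population) standard deviation."""
    m = mean(values)
    s = pstdev(values)
    if s == 0:
        raise ValueError("values are constant; z-score is undefined")
    return [(v - m) / s for v in values]
```

Standardising each platform's measurements separately removes platform-specific shifts in location and scale, which is one plausible reading of how a z-score transformation could reduce a batch effect.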
Portable DNA markers tailored for systematic characterization of Gossypium germplasm
USDA-ARS's Scientific Manuscript database
Many small-scale ad-hoc studies on characterization of Gossypium germplasm have been conducted that use different sets of markers. Coordination with the cotton community is needed to reach a consensus on the appropriate initial set of DNA markers. In consultation with the cotton community, a set o...
Blood ammonia and lactate as markers of muscle metabolites during leg press exercise.
Gorostiaga, Esteban M; Navarro-Amézqueta, Ion; Calbet, Jose A L; Sánchez-Medina, Luis; Cusso, Roser; Guerrero, Mario; Granados, Cristina; González-Izal, Miriam; Ibáñez, Javier; Izquierdo, Mikel
2014-10-01
To examine whether blood lactate and ammonia concentrations can be used to estimate the functional state of the muscle contractile machinery with regard to muscle lactate and adenosine triphosphate (ATP) levels during leg press exercise. Thirteen men (age, 34 ± 5 years; 1 repetition maximum leg press strength 199 ± 33 kg) performed either 5 sets of 10 repetitions to failure (5×10RF), or 10 sets of 5 repetitions not to failure (10×5RNF) with the same initial load (10RM) and interset rests (2 minutes) on 2 separate sessions in random order. Capillary blood samples were obtained before and during exercise and recovery. Six subjects underwent vastus lateralis muscle biopsies at rest, before the first set and after the final exercise set. The 5×10RF resulted in a significant and marked decrease in power output (37%), muscle ATP content (24%), and high levels of muscle lactate (25.0 ± 8.1 mmol·kg⁻¹ wet weight), blood lactate (10.3 ± 2.6 mmol·L⁻¹), and blood ammonia (91.6 ± 40.5 μmol·L⁻¹). During 10×5RNF no or minimal changes were observed. Significant correlations were found between: (a) blood ammonia and muscle ATP (r = -0.75), (b) changes in peak power output and blood ammonia (r = -0.87) and blood lactate (r = -0.84), and (c) blood and muscle lactate (r = 0.90). Blood lactate and ammonia concentrations can be used as extracellular markers for muscle lactate and ATP contents, respectively. The decline in mechanical power output can be used to indirectly estimate blood ammonia and lactate during leg press exercise.
Ayesh, Basim M
2017-01-01
Molecular markers are credible for the discrimination of genotypes and estimation of the extent of genetic diversity and relatedness in a set of genotypes. Inter-simple sequence repeat (ISSR) markers rapidly reveal highly polymorphic fingerprints and have been used frequently to determine the genetic diversity among date palm cultivars. This chapter describes the application of ISSR markers for genotyping of date palm cultivars. The application involves extraction of genomic DNA of reliable quality and quantity from the target cultivars. Subsequently the extracted DNA serves as a template for amplification of genomic regions flanked by inverted simple sequence repeats using a single primer. The similarity of each pair of samples is measured by calculating the number of mono- and polymorphic bands revealed by gel electrophoresis. Matrices constructed for similarity and genetic distance are used to build a phylogenetic tree and perform cluster analysis, to determine the molecular relatedness of cultivars. In the protocol described, 3 out of 9 tested primers consistently amplified 31 loci in 6 date palm cultivars, 28 of which were polymorphic.
Tillmar, Andreas O; Phillips, Chris
2017-01-01
Advances in massively parallel sequencing technology have enabled the combination of a much-expanded number of DNA markers (notably STRs and SNPs in one or combined multiplexes), with the aim of increasing the weight of evidence in forensic casework. However, when data from multiple loci on the same chromosome are used, genetic linkage can affect the final likelihood calculation. In order to study the effect of linkage for different sets of markers we developed the biostatistical tool ILIR (Impact of Linkage on forensic markers for Identity and Relationship tests). The ILIR tool can be used to study the overall impact of genetic linkage for an arbitrary set of markers used in forensic testing. Application of ILIR can be useful during marker selection and design of new marker panels, as well as being highly relevant for existing marker sets as a way to properly evaluate the effects of linkage on a case-by-case basis. ILIR, implemented via the open source platform R, includes variation and genomic position reference data for over 40 STRs and 140 SNPs, combined with the ability to include additional forensic markers of interest. The use of the software is demonstrated with examples from several different established marker sets (such as the expanded CODIS core loci) including a review of the interpretation of linked genetic data. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Over 70% of women with ovarian/fallopian tube cancer (OC) are diagnosed with advanced stage disease which has a 5-year relative survival rate of 30%. Five-year survival is 90% when disease is confined to the ovaries, but overall survival is poor because only 25% of cases are found early. Screening for ovarian cancer using tools with high sensitivity is potentially cost-effective, but because OC is so rare, very high specificity is needed to achieve an acceptable PPV. We have conducted preliminary work both in clinical and in preclinical (CARET) samples. We have identified candidate markers, developed assays for novel markers including HE4 and MSLN, and evaluated their diagnostic performance. We evaluated the markers’ contribution to a diagnostic panel in a standard set in order to identify the best of the candidates and developed methods for combining markers to define a decision rule for a marker panel. We found that our PEB rule yields comparable performance to the Single Threshold (ST) rule 2 years earlier, using the same two markers. The PEB makes an even larger contribution with the 4-marker panel. The 4-marker panel with the PEB rule represents a substantial improvement over any of the other decision rules as a first-line screen to select women for imaging. Our goal in the proposed work is to estimate the improvement in performance possible in the PLCO serial samples.
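The statement that a rare disease requires very high specificity to reach an acceptable positive predictive value (PPV) follows directly from Bayes' rule. The short Python sketch below shows the arithmetic; the function name and any prevalence, sensitivity, or specificity values passed to it are illustrative assumptions, not figures from the abstract.

```python
def positive_predictive_value(sensitivity, specificity, prevalence):
    """PPV = true positives / all positives, from Bayes' rule."""
    true_positives = sensitivity * prevalence
    false_positives = (1.0 - specificity) * (1.0 - prevalence)
    return true_positives / (true_positives + false_positives)


# Hypothetical screening scenario: 1 case per 2,500 women screened,
# a test with 80% sensitivity, and two candidate specificities.
ppv_at_99_percent = positive_predictive_value(0.80, 0.990, 1 / 2500)
ppv_at_99_8_percent = positive_predictive_value(0.80, 0.998, 1 / 2500)
```

Because false positives are drawn from the very large disease-free group, a small gain in specificity changes the PPV far more than a comparable gain in sensitivity would.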
Changren Weng; Thomas L. Kubisiak; C. Dana Nelson; James P. Geaghan; Michael Stine
1999-01-01
Single marker regression and single marker maximum likelihood estimation were tried to detect quantitative trait loci (QTLs) controlling the early height growth of longleaf pine and slash pine using a ((longleaf pine x slash pine) x slash pine) BC1 population consisting of 83 progeny. Maximum likelihood estimation was found to be more powerful than regression and could...
Heritability of markers of bone metabolism
NASA Technical Reports Server (NTRS)
Smith, Scott M.; Zwart, S. R.; Hargens, A. R.
2005-01-01
Several classic twin studies show genetic effects on markers of bone health, including bone mineral density and parathyroid hormone (PTH). This study was performed to assess the relative contribution of genetics to biochemical markers of bone metabolism. Fifteen sets of identical twins (8 male, 7 female) were housed in a clinical research center where diet was controlled (15% protein, 55% carbohydrate, 30% fat) for 3 consecutive days. Each day, 24-h urine pools were collected and N-telopeptide (NTX), deoxypyridinoline (DPD), calcium, and serum PTH were measured. The broad-sense heritability factor (H2) is an estimation of the portion of the total variance of a given phenotype that is attributable to genetic variance. H2 was estimated from the correlation coefficient of the phenotype data. H2 for NTX was 94% for males and 80% for females, DPD was 88% for males and 97% for females, urinary calcium excretion was 97% for males and 90% for females, and PTH was 92% for males and 79% for females. Since environmental variability was minimized for the 3 days of data collection, these heritability factors are likely overestimated. Nonetheless, the data support the concept that PTH is a predominantly heritable trait, and suggest that NTX, DPD, and calcium excretion are as well. These biochemical data support the previously documented heritability of bone health.
NASA Astrophysics Data System (ADS)
Bravo, Jaime; Davis, Scott C.; Roberts, David W.; Paulsen, Keith D.; Kanick, Stephen C.
2015-03-01
Quantification of targeted fluorescence markers during neurosurgery has the potential to improve and standardize surgical distinction between normal and cancerous tissues. However, quantitative analysis of marker fluorescence is complicated by tissue background absorption and scattering properties. Correction algorithms that transform raw fluorescence intensity into quantitative units, independent of absorption and scattering, require a paired measurement of localized white light reflectance to provide estimates of the optical properties. This study focuses on the unique problem of developing a spectral analysis algorithm to extract tissue absorption and scattering properties from white light spectra that contain contributions from both elastically scattered photons and fluorescence emission from a strong fluorophore (i.e. fluorescein). A fiber-optic reflectance device was used to perform measurements in a small set of optical phantoms, constructed with Intralipid (1% lipid), whole blood (1% volume fraction) and fluorescein (0.16-10 μg/mL). Results show that the novel spectral analysis algorithm yields accurate estimates of tissue parameters independent of fluorescein concentration, with relative errors of blood volume fraction, blood oxygenation fraction (BOF), and the reduced scattering coefficient (at 521 nm) of <7%, <1%, and <22%, respectively. These data represent a first step towards quantification of fluorescein in tissue in vivo.
Kumar, Amit; Bhowmick, Prolay Kumar; Singh, Vikram Jeet; Malik, Manoj; Gupta, Ashish Kumar; Seth, R; Nagarajan, M; Krishnan, S Gopala; Singh, Ashok Kumar
2017-10-01
Iso-cytoplasmic restorers possess the same male sterile cytoplasm as the cytoplasmic male sterile (CMS) lines, thereby minimizing the potential cyto-nuclear conflict in the hybrids. Restoration of fertility of the wild abortive CMS is governed by two major genes, namely Rf3 and Rf4. Therefore, assessing the allelic status of these restorer genes in the iso-cytoplasmic restorers using molecular markers will not only help in estimating the efficiency of these genes, either alone or in combination, in fertility restoration in the hybrids in different environments, but will also be useful in determining the efficacy of these markers. In the present study, the efficiency of molecular markers in identifying genotypes carrying the restorer allele of the gene(s) Rf3 and Rf4, restoring male fertility of WA cytoplasm in rice, was assessed in a set of 100 iso-cytoplasmic rice restorers using gene-linked as well as candidate gene based markers. In order to validate the efficacy of markers in identifying the restorers, a subset of 25 selected iso-cytoplasmic rice restorers was crossed with four different cytoplasmic male sterile lines, namely IR 79156A, IR 58025A, Pusa 6A and RTN 12A, and the pollen and spikelet fertility of the F1s were evaluated at three different locations. Marker analysis showed that Rf4 was the predominant fertility restorer gene in the iso-cytoplasmic restorers and Rf3 had a synergistic effect on fertility restoration. The efficiency of gene based markers, DRCG-RF4-14 and DRRM-RF3-10 for Rf4 (87%) and Rf3 (84%) genes, was higher than that of the respective gene-linked SSR markers RM6100 (80%) and RM3873 (82%). It is concluded that the gene based markers can be effectively used in identifying fertility restorer lines, obviating the need for making crosses and evaluating the F1s. Though gene based markers are more efficient, there is a need to identify functional polymorphisms which can provide 100% efficiency. 
Three iso-cytoplasmic restorers namely, PRR 300, PRR 363 and PRR 396 possessing both Rf4 and Rf3 genes and good fertility restoration have been identified which could be used further in hybrid rice breeding.
Kapun, Martin; van Schalkwyk, Hester; McAllister, Bryant; Flatt, Thomas; Schlötterer, Christian
2014-04-01
Sequencing of pools of individuals (Pool-Seq) represents a reliable and cost-effective approach for estimating genome-wide SNP and transposable element insertion frequencies. However, Pool-Seq does not provide direct information on haplotypes so that, for example, obtaining inversion frequencies has not been possible until now. Here, we have developed a new set of diagnostic marker SNPs for seven cosmopolitan inversions in Drosophila melanogaster that can be used to infer inversion frequencies from Pool-Seq data. We applied our novel marker set to Pool-Seq data from an experimental evolution study and from North American and Australian latitudinal clines. In the experimental evolution data, we find evidence that positive selection has driven the frequencies of In(3R)C and In(3R)Mo to increase over time. In the clinal data, we confirm the existence of frequency clines for In(2L)t, In(3L)P and In(3R)Payne in both North America and Australia and detect a previously unknown latitudinal cline for In(3R)Mo in North America. The inversion markers developed here provide a versatile and robust tool for characterizing inversion frequencies and their dynamics in Pool-Seq data from diverse D. melanogaster populations. © 2013 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.
Kapun, Martin; van Schalkwyk, Hester; McAllister, Bryant; Flatt, Thomas; Schlötterer, Christian
2014-01-01
Sequencing of pools of individuals (Pool-Seq) represents a reliable and cost-effective approach for estimating genome-wide SNP and transposable element insertion frequencies. However, Pool-Seq does not provide direct information on haplotypes so that, for example, obtaining inversion frequencies has not been possible until now. Here, we have developed a new set of diagnostic marker SNPs for seven cosmopolitan inversions in Drosophila melanogaster that can be used to infer inversion frequencies from Pool-Seq data. We applied our novel marker set to Pool-Seq data from an experimental evolution study and from North American and Australian latitudinal clines. In the experimental evolution data, we find evidence that positive selection has driven the frequencies of In(3R)C and In(3R)Mo to increase over time. In the clinal data, we confirm the existence of frequency clines for In(2L)t, In(3L)P and In(3R)Payne in both North America and Australia and detect a previously unknown latitudinal cline for In(3R)Mo in North America. The inversion markers developed here provide a versatile and robust tool for characterizing inversion frequencies and their dynamics in Pool-Seq data from diverse D. melanogaster populations. PMID:24372777
A two step Bayesian approach for genomic prediction of breeding values.
Shariati, Mohammad M; Sørensen, Peter; Janss, Luc
2012-05-21
In genomic models that assign an individual variance to each marker, the contribution of one marker to the posterior distribution of the marker variance is only one degree of freedom (df), which introduces many variance parameters with only little information per variance parameter. A better alternative could be to form clusters of markers with similar effects where markers in a cluster have a common variance. Therefore, the influence of each marker group of size p on the posterior distribution of the marker variances will be p df. The simulated data from the 15th QTL-MAS workshop were analyzed such that SNP markers were ranked based on their effects and markers with similar estimated effects were grouped together. In step 1, all markers with minor allele frequency more than 0.01 were included in a SNP-BLUP prediction model. In step 2, markers were ranked based on their estimated variance on the trait in step 1 and each consecutive set of 150 markers was assigned to one group with a common variance. In further analyses, subsets of 1500 and 450 markers with the largest effects in step 2 were kept in the prediction model. Grouping markers outperformed the SNP-BLUP model in terms of accuracy of predicted breeding values. However, the accuracies of predicted breeding values were lower than those of Bayesian methods with marker-specific variances. Grouping markers is less flexible than allowing each marker to have a specific marker variance but, by grouping, the power to estimate marker variances increases. A priori knowledge of the genetic architecture of the trait is necessary for clustering markers and appropriate prior parameterization.
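The ranking-and-grouping step of this two-step approach can be sketched as follows. This is a minimal illustration only: the function name is hypothetical, and ranking by the absolute value of the step 1 effect estimate is an assumed stand-in for the "estimated variance on the trait" mentioned in the abstract.

```python
def group_markers_by_effect(effects, group_size=150):
    """Rank markers by |estimated effect| and split the ranking into blocks.

    effects: step 1 effect estimates, one per marker.
    Returns a list of groups; each group is a list of marker indices that
    would share a common variance in the step 2 model.
    """
    ranked = sorted(range(len(effects)),
                    key=lambda i: abs(effects[i]), reverse=True)
    return [ranked[start:start + group_size]
            for start in range(0, len(ranked), group_size)]
```

Keeping only the first few groups of the returned list corresponds to retaining the markers with the largest effects, as in the subsets of 1500 and 450 markers described above.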
USDA-ARS's Scientific Manuscript database
Genetic marker effects and type of inheritance are estimated with poor precision when minor marker allele frequencies are low. A stable composite population (MARC III) was subjected to marker assisted selection for multiple years to equalize specific marker frequencies to 1) estimate effect size an...
USDA-ARS's Scientific Manuscript database
Genetic marker effects and interactions are estimated with poor precision when minor marker allele frequencies are low. An Angus population was subjected to marker assisted selection for multiple years to increase divergent haplotype and minor marker allele frequencies to 1) estimate effect size an...
Appropriateness of tumor marker request: a case of study
Trevisiol, Chiara; Fabricio, Aline S. C.
2017-01-01
Appropriateness is crucial to provide efficient and high-quality health services at affordable costs. Laboratory medicine is a sector of special interest for the investigation of inappropriateness, due to the high rate of technological innovation and its pivotal role in many diseases and clinical settings. Some subjective aspects related to either the patient or physician seem to have a major role in inappropriateness rates. Given the psychological impact of cancer on both patients and physicians, tumor markers represent a case study for appropriateness. The assessment of inappropriateness of laboratory tests has been focused mainly on ordering patterns. Appropriateness can barely be appraised by matching the requested test with the clinical problem because clinical information on the test requisition form is usually inadequate. Monitoring inappropriateness through individual clinical information may be feasible for inpatients (for whom clinical data are available), while an indirect approach should be used for outpatients. To estimate inappropriateness in outpatients our group developed innovative models based on comparison between the actually ordered and expected tumor marker requests, calculated according to recommendations of clinical practice guidelines (CPGs) applied to figures of cancer prevalence. The implementation of the model at national scale in Italy revealed a very high rate of overordering of tumor markers. The model was further refined by a dedicated algorithm to be adapted to different clinical conditions or organizational settings by applying performance indicators to cohort-wide structured information in electronic health records (EHRs). With this novel approach, we showed that inappropriateness is multifaceted even within the specific category of tumor markers. The model was effective in identifying both over- and underordering. 
Implementing evidence-based information and monitoring its impact on clinical practice are parts of the same multistage process aimed at the progressive improvement of health care. PMID:28758100
The discrete Laplace exponential family and estimation of Y-STR haplotype frequencies.
Andersen, Mikkel Meyer; Eriksen, Poul Svante; Morling, Niels
2013-07-21
Estimating haplotype frequencies is important in e.g. forensic genetics, where the frequencies are needed to calculate the likelihood ratio for the evidential weight of a DNA profile found at a crime scene. Estimation is naturally based on a population model, motivating the investigation of the Fisher-Wright model of evolution for haploid lineage DNA markers. An exponential family (a class of probability distributions that is well understood in probability theory such that inference is easily made by using existing software) called the 'discrete Laplace distribution' is described. We illustrate how well the discrete Laplace distribution approximates a more complicated distribution that arises by investigating the well-known population genetic Fisher-Wright model of evolution by a single-step mutation process. It was shown how the discrete Laplace distribution can be used to estimate haplotype frequencies for haploid lineage DNA markers (such as Y-chromosomal short tandem repeats), which in turn can be used to assess the evidential weight of a DNA profile found at a crime scene. This was done by making inference in a mixture of multivariate, marginally independent, discrete Laplace distributions using the EM algorithm to estimate the probabilities of membership of a set of unobserved subpopulations. The discrete Laplace distribution can be used to estimate haplotype frequencies with lower prediction error than other existing estimators. Furthermore, the calculations could be performed on a normal computer. This method was implemented in the freely available open source software R that is supported on Linux, MacOS and MS Windows. Copyright © 2013 Elsevier Ltd. All rights reserved.
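For orientation, the discrete Laplace distribution with dispersion parameter p (0 < p < 1) and integer location μ assigns probability (1 − p)/(1 + p) · p^|x − μ| to each integer x. The Python sketch below evaluates this probability and, under the marginal independence across loci mentioned in the abstract, the probability of a haplotype around a single central haplotype. The function names are hypothetical, and the sketch covers one subpopulation only; it does not implement the mixture model or the EM fitting the authors describe.

```python
def discrete_laplace_pmf(x, mu, p):
    """P(X = x) = (1 - p) / (1 + p) * p ** |x - mu| for integer x."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    return (1.0 - p) / (1.0 + p) * p ** abs(x - mu)


def haplotype_probability(haplotype, central_haplotype, p_per_locus):
    """Product of per-locus probabilities, assuming independent loci."""
    probability = 1.0
    for allele, centre, p in zip(haplotype, central_haplotype, p_per_locus):
        probability *= discrete_laplace_pmf(allele, centre, p)
    return probability
```

The probability decays geometrically with the number of repeat-unit steps separating an allele from the central allele, which mirrors a single-step mutation process.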
Rolf, Megan M; Taylor, Jeremy F; Schnabel, Robert D; McKay, Stephanie D; McClure, Matthew C; Northcutt, Sally L; Kerley, Monty S; Weaber, Robert L
2010-04-19
Molecular estimates of breeding value are expected to increase selection response due to improvements in the accuracy of selection and a reduction in generation interval, particularly for traits that are difficult or expensive to record or are measured late in life. Several statistical methods for incorporating molecular data into breeding value estimation have been proposed; however, most studies have utilized simulated data in which the generated linkage disequilibrium may not represent the targeted livestock population. A genomic relationship matrix was developed for 698 Angus steers and 1,707 Angus sires using 41,028 single nucleotide polymorphisms and breeding values were estimated using feed efficiency phenotypes (average daily feed intake, residual feed intake, and average daily gain) recorded on the steers. The number of SNPs needed to accurately estimate a genomic relationship matrix was evaluated in this population. Results were compared to estimates produced from pedigree-based mixed model analysis of 862 Angus steers with 34,864 identified paternal relatives but no female ancestors. Estimates of additive genetic variance and breeding value accuracies were similar for AFI and RFI using the numerator and genomic relationship matrices despite fewer animals in the genomic analysis. Bootstrap analyses indicated that 2,500-10,000 markers are required for robust estimation of genomic relationship matrices in cattle. This research shows that breeding values and their accuracies may be estimated for commercially important sires for traits recorded in experimental populations without the need for pedigree data to establish identity by descent between members of the commercial and experimental populations when at least 2,500 SNPs are available for the generation of a genomic relationship matrix.
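A genomic relationship matrix of the kind discussed above can be computed from SNP genotypes in a few lines. The abstract does not say which construction was used, so the sketch below assumes one widely used form, G = ZZ' / (2 Σ p_j(1 − p_j)), where Z holds genotypes coded 0/1/2 and centred by twice the allele frequency. The function name and the use of allele frequencies estimated from the same sample are assumptions.

```python
def genomic_relationship_matrix(genotypes):
    """Build G = Z Z' / (2 * sum_j p_j (1 - p_j)).

    genotypes: list of per-animal lists with SNP genotypes coded 0, 1, or 2.
    """
    n = len(genotypes)
    m = len(genotypes[0])
    # Allele frequency of the counted allele at each SNP.
    freqs = [sum(row[j] for row in genotypes) / (2.0 * n) for j in range(m)]
    denominator = 2.0 * sum(p * (1.0 - p) for p in freqs)
    if denominator == 0:
        raise ValueError("all SNPs are monomorphic")
    # Centre each genotype by its expected value 2p.
    z = [[row[j] - 2.0 * freqs[j] for j in range(m)] for row in genotypes]
    return [[sum(a * b for a, b in zip(z[i], z[k])) / denominator
             for k in range(n)] for i in range(n)]
```

With few SNPs the off-diagonal elements are noisy estimates of relatedness, which is consistent with the abstract's finding that several thousand markers are needed for a robust matrix.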
Hidden marker position estimation during sit-to-stand with walker.
Yoon, Sang Ho; Jun, Hong Gul; Dan, Byung Ju; Jo, Byeong Rim; Min, Byung Hoon
2012-01-01
Motion capture analysis of the sit-to-stand task with an assistive device is hard to achieve because the device obstructs the reflective markers. A previously developed robotic system, the Smart Mobile Walker, is used as an assistive device to perform motion capture analysis of the sit-to-stand task. All lower limb markers except the hip markers are invisible throughout the whole session. The link-segment and regression method is applied to estimate the marker positions during sit-to-stand. With this new method, the lost marker positions are restored and the biomechanical evaluation of the sit-to-stand movement with a Smart Mobile Walker can be carried out. The accuracy of the marker position estimation is verified with normal sit-to-stand data from more than 30 clinical trials. Moreover, further research on improving the link-segment and regression method is addressed.
NASA Astrophysics Data System (ADS)
Ahn, Chul Kyun; Heo, Changyong; Jin, Heongmin; Kim, Jong Hyo
2017-03-01
Mammographic breast density is a well-established marker for breast cancer risk. However, accurate measurement of dense tissue is a difficult task due to faint contrast and significant variations in background fatty tissue. This study presents a novel method for automated mammographic density estimation based on a convolutional neural network (CNN). A total of 397 full-field digital mammograms were selected from Seoul National University Hospital. Among them, 297 mammograms were randomly selected as a training set and the remaining 100 mammograms were used as a test set. We designed a CNN architecture suitable for learning the imaging characteristics from a multitude of sub-images and classifying them into dense and fatty tissues. To train the CNN, not only local statistics but also global statistics extracted from an image set were used. The image set was composed of the original mammogram and an eigen-image, which was able to capture the X-ray characteristics, despite the fact that CNNs are well known to extract features effectively from the original image. The 100 test images, which were not used in training the CNN, were used to validate the performance. The correlation coefficient between the breast density estimates by the CNN and those by the expert's manual measurement was 0.96. Our study demonstrated the feasibility of incorporating deep learning technology into radiology practice, especially for breast density estimation. The proposed method has the potential to be used as an automated and quantitative assessment tool for mammographic breast density in routine practice.
Erbe, Malena; Gredler, Birgit; Seefried, Franz Reinhold; Bapst, Beat; Simianer, Henner
2013-01-01
Prediction of genomic breeding values is of major practical relevance in dairy cattle breeding. Deterministic equations have been suggested to predict the accuracy of genomic breeding values in a given design which are based on training set size, reliability of phenotypes, and the number of independent chromosome segments ([Formula: see text]). The aim of our study was to find a general deterministic equation for the average accuracy of genomic breeding values that also accounts for marker density and can be fitted empirically. Two data sets of 5'698 Holstein Friesian bulls genotyped with 50 K SNPs and 1'332 Brown Swiss bulls genotyped with 50 K SNPs and imputed to ∼600 K SNPs were available. Different k-fold (k = 2-10, 15, 20) cross-validation scenarios (50 replicates, random assignment) were performed using a genomic BLUP approach. A maximum likelihood approach was used to estimate the parameters of different prediction equations. The highest likelihood was obtained when using a modified form of the deterministic equation of Daetwyler et al. (2010), augmented by a weighting factor (w) based on the assumption that the maximum achievable accuracy is [Formula: see text]. The proportion of genetic variance captured by the complete SNP sets ([Formula: see text]) was 0.76 to 0.82 for Holstein Friesian and 0.72 to 0.75 for Brown Swiss. When modifying the number of SNPs, w was found to be proportional to the log of the marker density up to a limit which is population and trait specific and was found to be reached with ∼20'000 SNPs in the Brown Swiss population studied.
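To make the idea of a deterministic accuracy equation concrete, here is a minimal sketch. The abstract elides its formulas, so the form used below, the commonly cited expression sqrt(N·r² / (N·r² + Me)) multiplied by a weighting factor w, is an assumption rather than the authors' fitted equation, and `expected_accuracy` and its parameter names are hypothetical.

```python
import math


def expected_accuracy(n_train, reliability, n_segments, w=1.0):
    """Deterministic expected accuracy of genomic breeding values.

    n_train:     number of animals in the training set (N)
    reliability: reliability of the phenotypes, between 0 and 1
    n_segments:  number of independent chromosome segments (Me)
    w:           weighting factor; treated here as a plain multiplier,
                 which is an assumption
    """
    if n_train < 0 or n_segments <= 0 or not 0.0 <= reliability <= 1.0:
        raise ValueError("invalid design parameters")
    information = n_train * reliability
    return w * math.sqrt(information / (information + n_segments))
```

Under this form, accuracy rises with training set size and phenotype reliability and falls as the number of independent segments grows, while w caps the accuracy that can be reached.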
Meuwissen, Theo H E; Indahl, Ulf G; Ødegård, Jørgen
2017-12-27
Non-linear Bayesian genomic prediction models such as BayesA/B/C/R involve iteration and mostly Markov chain Monte Carlo (MCMC) algorithms, which are computationally expensive, especially when whole-genome sequence (WGS) data are analyzed. Singular value decomposition (SVD) of the genotype matrix can facilitate genomic prediction in large datasets, and can be used to estimate marker effects and their prediction error variances (PEV) in a computationally efficient manner. Here, we developed, implemented, and evaluated a direct, non-iterative method for the estimation of marker effects for the BayesC genomic prediction model. The BayesC model assumes a priori that markers have normally distributed effects with probability [Formula: see text] and no effect with probability (1 - [Formula: see text]). Marker effects and their PEV are estimated by using SVD and the posterior probability of the marker having a non-zero effect is calculated. These posterior probabilities are used to obtain marker-specific effect variances, which are subsequently used to approximate BayesC estimates of marker effects in a linear model. A computer simulation study was conducted to compare alternative genomic prediction methods, where a single reference generation was used to estimate marker effects, which were subsequently used for 10 generations of forward prediction, for which accuracies were evaluated. SVD-based posterior probabilities of markers having non-zero effects were generally lower than MCMC-based posterior probabilities, but for some regions the opposite occurred, resulting in clear signals for QTL-rich regions. The accuracies of breeding values estimated using SVD- and MCMC-based BayesC analyses were similar across the 10 generations of forward prediction. 
For an intermediate number of generations (2 to 5) of forward prediction, accuracies obtained with the BayesC model tended to be slightly higher than accuracies obtained using the best linear unbiased prediction of SNP effects (SNP-BLUP model). When reducing marker density from WGS data to 30 K, SNP-BLUP tended to yield the highest accuracies, at least in the short term. Based on SVD of the genotype matrix, we developed a direct method for the calculation of BayesC estimates of marker effects. Although SVD- and MCMC-based marker effects differed slightly, their prediction accuracies were similar. Assuming that the SVD of the marker genotype matrix is already performed for other reasons (e.g. for SNP-BLUP), computation times for the BayesC predictions were comparable to those of SNP-BLUP.
Sallam, Ahmed; Arbaoui, Mustapha; El-Esawi, Mohamed; Abshire, Nathan; Martsch, Regina
2016-01-01
Frost stress is one of the abiotic stresses that causes a significant reduction in winter faba bean yield in Europe. The main objective of this work is to genetically improve frost tolerance in winter faba bean by identifying and validating QTL associated with frost tolerance to be used in marker-assisted selection (MAS). Two different genetic backgrounds were used: a biparental population (BPP) consisting of 101 inbred lines, and 189 genotypes from single seed descent (SSD) from the Gottingen Winter bean Population (GWBP). All experiments were conducted in a frost growth chamber under controlled conditions. Both populations were genotyped using the same set of 189 SNP markers. Visual scoring for frost stress symptoms was used to define frost tolerance in both populations. In addition, leaf fatty acid composition (FAC) and proline content were analyzed in BPP as physiological traits. QTL mapping (for BPP) and genome wide association studies (for GWBP) were performed to detect QTL associated with frost tolerance. High genetic variation between genotypes, and repeatability estimates, were found for all traits. QTL mapping and GWAS identified new putative QTL associated with promising frost tolerance and related traits. A set of 54 SNP markers common in both genetic backgrounds showed a high genetic diversity with polymorphic information content (PIC) ranging from 0.31 to 0.37 and gene diversity ranging from 0.39 to 0.50. This indicates that these markers may be polymorphic for many faba bean populations. Five SNP markers showed a significant marker-trait association with frost tolerance and related traits in both populations. Moreover, synteny analysis between Medicago truncatula (a model legume) and faba bean genomes was performed to identify candidate genes for these markers. 
Collinearity was evaluated between the faba bean genetic map constructed in this study and the faba bean consensus map, resulting in identifying possible genomic regions in faba bean which may control frost tolerance genes. The two genetic backgrounds were useful in detecting new variation for improving frost tolerance in winter faba bean. Of the five validated SNP markers, one (VF_Mt3g086600) was found to be associated with frost tolerance and FAC in both populations. This marker was also associated with winter hardiness and high yield in earlier studies. This marker is located in a gene of unknown function. PMID:27540381
Detection of a dynamic topography signal in last interglacial sea-level records
Austermann, Jacqueline; Mitrovica, Jerry X.; Huybers, Peter; Rovere, Alessio
2017-01-01
Estimating minimum ice volume during the last interglacial based on local sea-level indicators requires that these indicators are corrected for processes that alter local sea level relative to the global average. Although glacial isostatic adjustment is generally accounted for, global scale dynamic changes in topography driven by convective mantle flow are generally not considered. We use numerical models of mantle flow to quantify vertical deflections caused by dynamic topography and compare predictions at passive margins to a globally distributed set of last interglacial sea-level markers. The deflections predicted as a result of dynamic topography are significantly correlated with marker elevations (>95% probability) and are consistent with construction and preservation attributes across marker types. We conclude that a dynamic topography signal is present in the elevation of last interglacial sea-level records and that the signal must be accounted for in any effort to determine peak global mean sea level during the last interglacial to within an accuracy of several meters. PMID:28695210
Kanda, Mitsuro; Murotani, Kenta; Tanaka, Haruyoshi; Miwa, Takashi; Umeda, Shinichi; Tanaka, Chie; Kobayashi, Daisuke; Hayashi, Masamichi; Hattori, Norifumi; Suenaga, Masaya; Yamada, Suguru; Nakayama, Goro; Fujiwara, Michitaka; Kodera, Yasuhiro
2018-05-07
Development of specific biomarkers is necessary for individualized management of patients with gastric cancer. The aim of this study was to design a simple expression panel comprising novel molecular markers for precise risk stratification. Patients (n = 200) who underwent gastrectomy for gastric cancer were randomly assigned into learning and validation sets. Tissue mRNA expression levels of 15 candidate molecular markers were determined using quantitative PCR analysis. A dual-marker expression panel was created according to concordance index (C-index) values of overall survival for all 105 combinations of two markers in the learning set. The reproducibility and clinical significance of the dual-marker expression panel were evaluated in the validation set. The patient characteristics of the learning and validation sets were well balanced. The C-index values of combinations were significantly higher compared with those of single markers. The panel with the highest C-index (0.718) of the learning set comprised SYT8 and MAGED2, which clearly stratified patients into low-, intermediate-, and high-risk groups. The reproducibility of the panel was demonstrated in the validation set. High expression scores were significantly associated with larger tumor size, vascular invasion, lymph node metastasis, peritoneal metastasis, and advanced disease. The dual-marker expression panel provides a simple tool that clearly stratifies patients with gastric cancer into low-, intermediate-, and high risk after gastrectomy. © 2018 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.
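The 105 combinations mentioned above are simply all unordered pairs of the 15 candidate markers. The sketch below enumerates them and shows one common way to compute a concordance index for right-censored survival data. How the authors combined two markers into a score is not stated in the abstract, so `concordance_index` is a generic illustration with hypothetical names, and it skips pairs with tied survival times for simplicity.

```python
from itertools import combinations


def concordance_index(times, events, scores):
    """Concordance between risk scores and survival times.

    times:  follow-up time per patient
    events: True/1 if the event was observed, False/0 if censored
    scores: risk score per patient (higher means higher risk)
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        if times[i] == times[j]:
            continue  # simplification: ignore tied times
        # 'first' is the patient with the shorter follow-up.
        first, second = (i, j) if times[i] < times[j] else (j, i)
        if not events[first]:
            continue  # shorter time is censored, pair is not comparable
        comparable += 1
        if scores[first] > scores[second]:
            concordant += 1.0
        elif scores[first] == scores[second]:
            concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# All unordered pairs of 15 candidate markers (indices 0..14).
marker_pairs = list(combinations(range(15), 2))
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which gives context for the reported 0.718.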
Bhavsar, Nrupen A.; Appel, Lawrence J.; Kusek, John W.; Contreras, Gabriel; Bakris, George; Coresh, Josef; Astor, Brad C.
2011-01-01
Background: Identification of persons with chronic kidney disease (CKD) who are at highest risk to progress to end stage renal disease (ESRD) is necessary to reduce the burden of kidney failure. The relative utility of traditional markers of kidney function, including estimated glomerular filtration rate (GFR) and serum creatinine, and emerging markers of kidney function, including cystatin C and beta-trace protein (BTP), to predict ESRD and mortality has yet to be established. Study Design: Randomized clinical trial followed by an observational cohort study. Setting & Participants: 865 African American individuals with hypertensive CKD enrolled in a clinical trial of two levels of blood pressure control and three different antihypertensive drugs as initial therapy and subsequently followed by an observational cohort study. Predictors: Quintile of measured GFR (mGFR) by iothalamate clearance, serum creatinine, serum creatinine-based estimated GFR (eGFRSCr), cystatin C, and BTP. Outcomes and Measurements: Incidence of ESRD and mortality. Results: A total of 246 participants reached ESRD over a median follow-up of 102 months. The incidence rate of ESRD was higher with higher quintiles of each marker. The association between higher BTP and ESRD was stronger than those for the other markers, including mGFR. All the markers remained significantly associated with ESRD after adjustment for mGFR and relevant covariates (all p<0.05), with BTP retaining the strongest association (HR for highest versus lowest quintile, 5.7; 95% CI, 2.2-14.9). Associations with the combined endpoint of ESRD or mortality (n=390) were weaker, but remained significant for cystatin C (p=0.05) and BTP (p=0.004). Limitations: The ability of these markers to predict ESRD and mortality in other racial and ethnic groups and among individuals with CKD due to other causes is unknown. 
Conclusions: Plasma BTP and cystatin C may be useful adjuncts to serum creatinine and mGFR in evaluating risk for progression of kidney disease. PMID:21944667
Power analysis of QTL detection in half-sib families using selective DNA pooling
Baro, Jesús Á; Carleos, Carlos; Corral, Norberto; López, Teresa; Cañón, Javier
2001-01-01
Individual loci of economic importance (QTL) can be detected by comparing the inheritance of a trait and the inheritance of loci with alleles readily identifiable by laboratory methods (genetic markers). Data on allele segregation at the individual level are costly and alternatives have been proposed that make use of allele frequencies among progeny, rather than individual genotypes. Among the factors that may affect the power of the set up, the most important are those intrinsic to the QTL: the additive effect of the QTL, and its dominance, and distance between markers and QTL. Other factors are relative to the choice of animals and markers, such as the frequency of the QTL and marker alleles among dams and sires. Data collection may affect the detection power through the size of half-sib families, selection rate within families, and the technical error incurred when estimating genetic frequencies. We present results for a sensitivity analysis for QTL detection using pools of DNA from selected half-sibs. Simulations showed that conclusive detection may be achieved with families of at least 500 half-sibs if sires are chosen on the criteria that most of their marker alleles are either both missing, or one is fixed, among dams. PMID:11403746
Jia, Wan-Zhong; Yan, Hong-Bin; Guo, Ai-Jiang; Zhu, Xing-Quan; Wang, Yu-Chao; Shi, Wan-Gui; Chen, Hao-Tai; Zhan, Fang; Zhang, Shao-Hua; Fu, Bao-Quan; Littlewood, D Timothy J; Cai, Xue-Peng
2010-07-22
Mitochondrial genomes provide a rich source of molecular variation of proven and widespread utility in molecular ecology, population genetics and evolutionary biology. The tapeworm genus Taenia includes a diversity of tapeworm parasites of significant human and veterinary importance. Here we add complete sequences of the mt genomes of T. multiceps, T. hydatigena and T. pisiformis, to a data set of 4 published mtDNAs in the same genus. Seven complete mt genomes of Taenia species are used to compare and contrast variation within and between genomes in the genus, to estimate a phylogeny for the genus, and to develop novel molecular markers as part of an extended mitochondrial toolkit. The complete circular mtDNAs of T. multiceps, T. hydatigena and T. pisiformis were 13,693, 13,492 and 13,387 bp in size respectively, comprising the usual complement of flatworm genes. Start and stop codons of protein coding genes included those found commonly amongst other platyhelminth mt genomes, but the much rarer initiation codon GTT was inferred for the gene atp6 in T. pisiformis. Phylogenetic analysis of mtDNAs offered novel estimates of the interrelationships of Taenia. Sliding window analyses showed nad6, nad5, atp6, nad3 and nad2 are amongst the most variable of genes per unit length, with the highest peaks in nucleotide diversity found in nad5. New primer pairs capable of amplifying fragments of variable DNA in nad1, rrnS and nad5 genes were designed in silico and tested as possible alternatives to existing mitochondrial markers for Taenia. With the availability of complete mtDNAs of 7 Taenia species, we have shown that analysis of amino acids provides a robust estimate of phylogeny for the genus that differs markedly from morphological estimates or those using partial genes; with implications for understanding the evolutionary radiation of important Taenia. 
Full alignment of the nucleotides of Taenia mtDNAs and sliding window analysis suggests numerous alternative gene regions are likely to capture greater nucleotide variation than those currently pursued as molecular markers. New PCR primers developed from a comparative mitogenomic analysis of Taenia species, extend the use of mitochondrial markers for molecular ecology, population genetics and diagnostics.
2010-01-01
Background: Mitochondrial genomes provide a rich source of molecular variation of proven and widespread utility in molecular ecology, population genetics and evolutionary biology. The tapeworm genus Taenia includes a diversity of tapeworm parasites of significant human and veterinary importance. Here we add complete sequences of the mt genomes of T. multiceps, T. hydatigena and T. pisiformis, to a data set of 4 published mtDNAs in the same genus. Seven complete mt genomes of Taenia species are used to compare and contrast variation within and between genomes in the genus, to estimate a phylogeny for the genus, and to develop novel molecular markers as part of an extended mitochondrial toolkit. Results: The complete circular mtDNAs of T. multiceps, T. hydatigena and T. pisiformis were 13,693, 13,492 and 13,387 bp in size respectively, comprising the usual complement of flatworm genes. Start and stop codons of protein coding genes included those found commonly amongst other platyhelminth mt genomes, but the much rarer initiation codon GTT was inferred for the gene atp6 in T. pisiformis. Phylogenetic analysis of mtDNAs offered novel estimates of the interrelationships of Taenia. Sliding window analyses showed nad6, nad5, atp6, nad3 and nad2 are amongst the most variable of genes per unit length, with the highest peaks in nucleotide diversity found in nad5. New primer pairs capable of amplifying fragments of variable DNA in nad1, rrnS and nad5 genes were designed in silico and tested as possible alternatives to existing mitochondrial markers for Taenia. Conclusions: With the availability of complete mtDNAs of 7 Taenia species, we have shown that analysis of amino acids provides a robust estimate of phylogeny for the genus that differs markedly from morphological estimates or those using partial genes; with implications for understanding the evolutionary radiation of important Taenia. 
Full alignment of the nucleotides of Taenia mtDNAs and sliding window analysis suggests numerous alternative gene regions are likely to capture greater nucleotide variation than those currently pursued as molecular markers. New PCR primers developed from a comparative mitogenomic analysis of Taenia species, extend the use of mitochondrial markers for molecular ecology, population genetics and diagnostics. PMID:20649981
Roche, Erin A.; Dovichin, Colin M.; Arnold, Todd W.
2014-01-01
Implicit assumptions for most mark-recapture studies are that individuals do not lose their markers and all observed markers are correctly recorded. If these assumptions are violated, e.g., due to loss or extreme wear of markers, estimates of population size and vital rates will be biased. Double-marking experiments have been widely used to estimate rates of marker loss and adjust for associated bias, and we extended this approach to estimate rates of recording errors. We double-marked 309 Piping Plovers (Charadrius melodus) with unique combinations of color bands and alphanumeric flags and used multi-state mark recapture models to estimate the frequency with which plovers were misidentified. Observers were twice as likely to read and report an invalid color-band combination (2.4% of the time) as an invalid alphanumeric code (1.0%). Observers failed to read matching band combinations or alphanumeric flag codes 4.5% of the time. Unlike previous band resighting studies, use of two resightable markers allowed us to identify when resighting errors resulted in reports of combinations or codes that were valid, but still incorrect; our results suggest this may be a largely unappreciated problem in mark-resight studies. Field-readable alphanumeric flags offer a promising auxiliary marker for identifying and potentially adjusting for false-positive resighting errors that may otherwise bias demographic estimates.
Improving a Lecture-Size Molecular Model Set by Repurposing Used Whiteboard Markers
ERIC Educational Resources Information Center
Dragojlovic, Veljko
2015-01-01
Preparation of an inexpensive model set from whiteboard markers and either an HGS molecular model set or atoms made of wood is described. The model set is relatively easy to prepare and is sufficiently large to be suitable as an instructor set for use in lectures.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hooper, Cornelia M.; Stevens, Tim J.; Saukkonen, Anna; ...
2017-10-12
Measuring changes in protein or organelle abundance in the cell is an essential, but challenging aspect of cell biology. Frequently-used methods for determining organelle abundance typically rely on detection of a very few marker proteins, so are unsatisfactory. In silico estimates of protein abundances from publicly available protein spectra can provide useful standard abundance values but contain only data from tissue proteomes, and are not coupled to organelle localization data. A new protein abundance score, the normalized protein abundance scale (NPAS), expands on the number of scored proteins and the scoring accuracy of lower-abundance proteins in Arabidopsis. NPAS was combined with subcellular protein localization data, facilitating quantitative estimations of organelle abundance during routine experimental procedures. A suite of targeted proteomics markers for subcellular compartment markers was developed, enabling independent verification of in silico estimates for relative organelle abundance. Estimation of relative organelle abundance was found to be reproducible and consistent over a range of tissues and growth conditions. In silico abundance estimations and localization data have been combined into an online tool, multiple marker abundance profiling, available in the SUBA4 toolbox (http://suba.live).
Markerless motion estimation for motion-compensated clinical brain imaging
NASA Astrophysics Data System (ADS)
Kyme, Andre Z.; Se, Stephen; Meikle, Steven R.; Fulton, Roger R.
2018-05-01
Motion-compensated brain imaging can dramatically reduce the artifacts and quantitative degradation associated with voluntary and involuntary subject head motion during positron emission tomography (PET), single photon emission computed tomography (SPECT) and computed tomography (CT). However, motion-compensated imaging protocols are not in widespread clinical use for these modalities. A key reason for this seems to be the lack of a practical motion tracking technology that allows for smooth and reliable integration of motion-compensated imaging protocols in the clinical setting. We seek to address this problem by investigating the feasibility of a highly versatile optical motion tracking method for PET, SPECT and CT geometries. The method requires no attached markers, relying exclusively on the detection and matching of distinctive facial features. We studied the accuracy of this method in 16 volunteers in a mock imaging scenario by comparing the estimated motion with an accurate marker-based method used in applications such as image guided surgery. A range of techniques to optimize performance of the method were also studied. Our results show that the markerless motion tracking method is highly accurate (<2 mm discrepancy against a benchmarking system) on an ethnically diverse range of subjects and, moreover, exhibits lower jitter and estimation of motion over a greater range than some marker-based methods. Our optimization tests indicate that the basic pose estimation algorithm is very robust but generally benefits from rudimentary background masking. Further marginal gains in accuracy can be achieved by accounting for non-rigid motion of features. Efficiency gains can be achieved by capping the number of features used for pose estimation provided that these features adequately sample the range of head motion encountered in the study. 
These proof-of-principle data suggest that markerless motion tracking is amenable to motion-compensated brain imaging and holds good promise for a practical implementation in clinical PET, SPECT and CT systems.
Vision based object pose estimation for mobile robots
NASA Technical Reports Server (NTRS)
Wu, Annie; Bidlack, Clint; Katkere, Arun; Feague, Roy; Weymouth, Terry
1994-01-01
Mobile robot navigation using visual sensors requires that a robot be able to detect landmarks and obtain pose information from a camera image. This paper presents a vision system for finding man-made markers of known size and calculating the pose of these markers. The algorithm detects and identifies the markers using a weighted pattern matching template. Geometric constraints are then used to calculate the position of the markers relative to the robot. The geometric constraints are chosen from the typical pose of most man-made signs, such as the sign standing vertical and having dimensions of known size. This system has been tested successfully on a wide range of real images. Marker detection is reliable, even in cluttered environments. Under certain marker orientations, orientation estimates have proven accurate to within 2 degrees and distance estimates to within 0.3 meters.
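The role of the known marker size can be illustrated with the simplest possible geometric constraint: under a pinhole camera model, a vertical marker of known height that faces the camera appears smaller in proportion to its distance. The sketch below is not the paper's algorithm; the function name `distance_from_marker` and the assumption of a fronto-parallel marker with a focal length expressed in pixels are illustrative only.

```python
def distance_from_marker(focal_length_px, marker_height_m, image_height_px):
    """Estimate camera-to-marker distance with a pinhole camera model.

    focal_length_px: camera focal length in pixels (assumed calibrated)
    marker_height_m: true height of the marker in meters
    image_height_px: height of the marker as it appears in the image
    """
    if focal_length_px <= 0 or marker_height_m <= 0 or image_height_px <= 0:
        raise ValueError("all inputs must be positive")
    # Similar triangles: image_height / focal_length = real_height / distance.
    return focal_length_px * marker_height_m / image_height_px
```

For example, `distance_from_marker(800, 0.5, 100)` describes a 0.5 m sign that spans 100 pixels in a camera with an 800-pixel focal length. Because distance varies inversely with apparent size, a one-pixel measurement error matters more for distant markers.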
Keene, Keith L; Mychaleckyj, Josyf C; Leak, Tennille S; Smith, Shelly G; Perlegas, Peter S; Divers, Jasmin; Langefeld, Carl D; Freedman, Barry I; Bowden, Donald W; Sale, Michèle M
2008-09-01
Admixture and population stratification are major concerns in genetic association studies. We wished to evaluate the impact of admixture using empirically derived data from genetic association studies of African Americans (AA) with type 2 diabetes (T2DM) and end-stage renal disease (ESRD). Seventy ancestry informative markers (AIMs) were genotyped in 577 AA with T2DM-ESRD, 596 AA controls, 44 Yoruba Nigerian (YRI) and 39 European American (EA) controls. Genotypic data and association results for eight T2DM candidate gene studies in our AA population were included. Ancestral estimates were calculated using FRAPPE, ADMIXMAP and STRUCTURE for all AA samples, using varying numbers of AIMs (25, 50, and 70). Ancestry estimates varied significantly across all three programs with the highest estimates obtained using STRUCTURE, followed by ADMIXMAP; while FRAPPE estimates were the lowest. FRAPPE estimates were similar using varying numbers of AIMs, while STRUCTURE estimates using 25 AIMs differed from estimates using 50 and 70 AIMs. Female T2DM-ESRD cases showed higher mean African proportions as compared to female controls, male cases, and male controls. Age showed a weak but significant correlation with individual ancestral estimates in AA cases (r² = 0.101; P = 0.019) and in the combined set (r² = 0.131; P = 3.57 × 10⁻⁵). The absolute difference between frequencies in parental populations, absolute delta, was correlated with admixture impact for dominant, additive, and recessive genotypic models of association. This study presents exploratory analyses of the impact of admixture on studies of AA with T2DM-ESRD and supports the use of ancestral proportions as a means of reducing confounding effects due to admixture.
Using dynamic programming to improve fiducial marker localization
NASA Astrophysics Data System (ADS)
Wan, Hanlin; Ge, Jiajia; Parikh, Parag
2014-04-01
Fiducial markers are used in a wide range of medical imaging applications. In radiation therapy, they are often implanted near tumors and used as motion surrogates that are tracked with fluoroscopy. We propose a novel and robust method based on dynamic programming (DP) for retrospectively localizing radiopaque fiducial markers in fluoroscopic images. Our method was compared to template matching (TM) algorithms on 407 data sets from 24 patients. We found that the performance of TM varied dramatically depending on the template used (ranging from 47% to 92% of data sets with a mean error <1 mm). DP by itself requires no template and performed as well as the best TM method, localizing the markers in 91% of the data sets with a mean error <1 mm. Finally, by combining DP and TM, we were able to localize the markers in 99% of the data sets with a mean error <1 mm, regardless of the template used. Our results show that DP can be a powerful tool for analyzing tumor motion, capable of accurately locating fiducial markers in fluoroscopic images regardless of marker type, shape, and size.
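The abstract does not give the dynamic programming formulation. As a generic sketch only, assuming each frame supplies a cost for every candidate marker position and that large jumps between consecutive frames are penalized, a minimum-cost track can be found with a Viterbi-style recursion. The cost table, the one-dimensional positions, and the `jump_penalty` parameter are all illustrative assumptions.

```python
def dp_track(costs, jump_penalty=1.0):
    """Return the minimum-cost sequence of positions, one per frame.

    costs[t][p] is the (assumed) cost of placing the marker at candidate
    position p in frame t; moving from q to p adds jump_penalty * |p - q|.
    """
    n_pos = len(costs[0])
    total = list(costs[0])      # best cost of any track ending at each position
    back = []                   # back-pointers, one list per frame after the first
    for frame_costs in costs[1:]:
        new_total, pointers = [], []
        for p in range(n_pos):
            best_q = min(range(n_pos),
                         key=lambda q: total[q] + jump_penalty * abs(p - q))
            new_total.append(total[best_q] + jump_penalty * abs(p - best_q)
                             + frame_costs[p])
            pointers.append(best_q)
        total = new_total
        back.append(pointers)
    # Trace the best track backwards from the cheapest final position
    p = min(range(n_pos), key=lambda q: total[q])
    path = [p]
    for pointers in reversed(back):
        p = pointers[p]
        path.append(p)
    path.reverse()
    return path

costs = [[5, 0, 5], [5, 0, 5], [0, 5, 5]]
smooth_track = dp_track(costs, jump_penalty=10.0)
flexible_track = dp_track(costs, jump_penalty=1.0)
```

Because the recursion scores whole trajectories rather than matching a fixed appearance, no template is needed, which is consistent with the abstract's statement that DP by itself requires none.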
Kärkkäinen, Hanni P; Sillanpää, Mikko J
2013-09-04
Because of the increased availability of genome-wide sets of molecular markers along with reduced cost of genotyping large samples of individuals, genomic estimated breeding values have become an essential resource in plant and animal breeding. Bayesian methods for breeding value estimation have proven to be accurate and efficient; however, the ever-increasing data sets are placing heavy demands on the parameter estimation algorithms. Although a commendable number of fast estimation algorithms are available for Bayesian models of continuous Gaussian traits, there is a shortage for corresponding models of discrete or censored phenotypes. In this work, we consider a threshold approach of binary, ordinal, and censored Gaussian observations for Bayesian multilocus association models and Bayesian genomic best linear unbiased prediction and present a high-speed generalized expectation maximization algorithm for parameter estimation under these models. We demonstrate our method with simulated and real data. Our example analyses suggest that the use of the extra information present in an ordered categorical or censored Gaussian data set, instead of dichotomizing the data into case-control observations, increases the accuracy of genomic breeding values predicted by Bayesian multilocus association models or by Bayesian genomic best linear unbiased prediction. Furthermore, the example analyses indicate that the correct threshold model is more accurate than the directly used Gaussian model with censored Gaussian data, while with binary or ordinal data the superiority of the threshold model could not be confirmed. PMID:23821618
Use of DNA markers in forest tree improvement research
D.B. Neale; M.E. Devey; K.D. Jermstad; M.R. Ahuja; M.C. Alosi; K.A. Marshall
1992-01-01
DNA markers are rapidly being developed for forest trees. The most important markers are restriction fragment length polymorphisms (RFLPs), polymerase chain reaction- (PCR) based markers such as random amplified polymorphic DNA (RAPD), and fingerprinting markers. DNA markers can supplement isozyme markers for monitoring tree improvement activities such as; estimating...
Regularized quantile regression for SNP marker estimation of pig growth curves.
Barroso, L M A; Nascimento, M; Nascimento, A C C; Silva, F F; Serão, N V L; Cruz, C D; Resende, M D V; Silva, F L; Azevedo, C F; Lopes, P S; Guimarães, S E F
2017-01-01
Genomic growth curves are generally defined only in terms of population mean; an alternative approach that has not yet been exploited in genomic analyses of growth curves is the Quantile Regression (QR). This methodology allows for the estimation of marker effects at different levels of the variable of interest. We aimed to propose and evaluate a regularized quantile regression for SNP marker effect estimation of pig growth curves, as well as to identify the chromosome regions of the most relevant markers and to estimate the genetic individual weight trajectory over time (genomic growth curve) under different quantiles (levels). The regularized quantile regression (RQR) enabled the discovery, at different levels of interest (quantiles), of the most relevant markers allowing for the identification of QTL regions. We found the same relevant markers simultaneously affecting different growth curve parameters (mature weight and maturity rate): two (ALGA0096701 and ALGA0029483) for RQR(0.2), one (ALGA0096701) for RQR(0.5), and one (ALGA0003761) for RQR(0.8). Three average genomic growth curves were obtained and the behavior was explained by the curve in quantile 0.2, which differed from the others. RQR allowed for the construction of genomic growth curves, which is the key to identifying and selecting the most desirable animals for breeding purposes. Furthermore, the proposed model enabled us to find, at different levels of interest (quantiles), the most relevant markers for each trait (growth curve parameter estimates) and their respective chromosomal positions (identification of new QTL regions for growth curves in pigs). These markers can be exploited under the context of marker assisted selection while aiming to change the shape of pig growth curves.
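The abstract does not specify the penalty used in the regularized quantile regression. As a minimal sketch, assuming an L1 (lasso-type) penalty on the marker effects, the objective minimized at quantile tau could look like the following. The function names and the toy data are illustrative.

```python
def pinball_loss(residual, tau):
    """Quantile (check) loss: tau * r for r >= 0, (tau - 1) * r otherwise."""
    return tau * residual if residual >= 0 else (tau - 1.0) * residual

def rqr_objective(y, X, beta, tau, lam):
    """Pinball loss of the linear fit plus an assumed L1 penalty on beta."""
    fit = 0.0
    for y_i, row in zip(y, X):
        prediction = sum(b * x for b, x in zip(beta, row))
        fit += pinball_loss(y_i - prediction, tau)
    return fit + lam * sum(abs(b) for b in beta)

# Toy example: two animals, one marker, median regression (tau = 0.5)
objective = rqr_objective([1.0, 3.0], [[1.0], [1.0]], [2.0], tau=0.5, lam=1.0)
```

With tau = 0.2 the loss penalizes over-prediction four times as heavily as under-prediction, which pulls the fit toward the lower tail of the trait distribution; this is what allows marker effects to differ between RQR(0.2), RQR(0.5), and RQR(0.8).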
Henning, John A; Coggins, Jamie; Peterson, Matthew
2015-10-06
Hop is an economically important crop for the Pacific Northwest USA as well as other regions of the world. It is a perennial crop with rhizomatous or clonal propagation system for varietal distribution. A big concern for growers as well as brewers is variety purity and questions are regularly posed to public agencies concerning the availability of genotype testing. Current means for genotyping are based upon 25 microsatellites that provide relatively accurate genotyping but cannot always differentiate sister-lines. In addition, numerous PCR runs (25) are required to complete this process and only a few laboratories exist that perform this service. A genotyping protocol based upon SNPs would enable rapid accurate genotyping that can be assayed at any laboratory facility set up for SNP-based genotyping. The results of this study arose from a larger project designed for whole genome association studies upon the USDA-ARS hop germplasm collection consisting of approximately 116 distinct hop varieties and germplasm (female lines) from around the world. The original dataset that arose from partial sequencing of 121 genotypes resulted in the identification of 374,829 SNPs using TASSEL-UNEAK pipeline. After filtering out genotypes with more than 50% missing data (5 genotypes) and SNP markers with more than 20% missing data, 32,206 highly filtered SNP markers across 116 genotypes were identified and considered for this study. Minor allele frequency (MAF) was calculated for each SNP and ranked according to the most informative to least informative. Only those markers without missing data across genotypes as well as 60% or less heterozygous gamete calls were considered for further analysis. Genetic distances among individuals in the study were calculated using the marker with the highest MAF value, then by using a combination of the two markers with highest MAF values and so on. 
This process was reiterated until a set of markers was identified that allowed for all genotypes in the study to be genetically differentiated from each other. Next, we compared genetic matrices calculated from the minimal marker sets (Table 2; 6-, 7-, 8-, 10- and 12-marker set matrices) and that of a matrix calculated from a set of markers with no missing data across all 116 samples (1006 SNP markers). The minimum number of markers required to meet both specifications was a set of 7 markers (Table 3). These seven SNPs were then aligned with a genome assembly, and DNA sequence both upstream and downstream were used to identify primer sequences that can be used to develop seven amplicons for high resolution melting curve PCR detection or other SNP-based PCR detection methods. This study identifies a set of 7 SNP markers that may prove useful for the identification and validation of hop varieties and accessions. Variety validation of unknown samples assumes that the variety under question has been included a priori in a discovery panel. These results are based upon in silico studies and markers need to be validated using different SNP marker technology upon a differential set of hop genotypes. The marker sequence data and suggested primer sets provide potential means to fingerprint hop varieties in most genetic laboratories utilizing SNP-marker technology.
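The iterative selection described above can be sketched as follows. This is a simplified illustration, not the authors' pipeline: genotypes are assumed to be coded as 0, 1, or 2 copies of the alternate allele, and "differentiated" is taken to mean that no two individuals share the same multi-marker profile, rather than being judged through a genetic distance matrix.

```python
def minor_allele_frequency(calls):
    """MAF for one marker; calls are 0/1/2 copies of the alternate allele."""
    alt_freq = sum(calls) / (2 * len(calls))
    return min(alt_freq, 1 - alt_freq)

def minimal_marker_set(genotypes):
    """Add markers in decreasing MAF order until every individual is unique.

    genotypes[m][i] is the call of marker m in individual i. Returns the list
    of chosen marker indices, or None if the markers cannot separate everyone.
    """
    n_individuals = len(genotypes[0])
    order = sorted(range(len(genotypes)),
                   key=lambda m: minor_allele_frequency(genotypes[m]),
                   reverse=True)
    chosen = []
    for m in order:
        chosen.append(m)
        profiles = {tuple(genotypes[k][i] for k in chosen)
                    for i in range(n_individuals)}
        if len(profiles) == n_individuals:
            return chosen
    return None

# Three hypothetical markers scored in four individuals
toy = [[0, 0, 0, 2], [0, 2, 0, 2], [0, 0, 2, 2]]
selected = minimal_marker_set(toy)
```

Ordering by MAF is a heuristic: high-MAF markers split the panel most evenly, so fewer of them tend to be needed, but the result is not guaranteed to be the smallest possible set.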
Lobach, Iryna; Fan, Ruzong; Manga, Prashiela
A central problem in genetic epidemiology is to identify and rank genetic markers involved in a disease. Complex diseases, such as cancer, hypertension, diabetes, are thought to be caused by an interaction of a panel of genetic factors, that can be identified by markers, which modulate environmental factors. Moreover, the effect of each genetic marker may be small. Hence, the association signal may be missed unless a large sample is considered, or a priori biomedical data are used. Recent advances generated a vast variety of a priori information, including linkage maps and information about gene regulatory dependence assembled into curated pathway databases. We propose a genotype-based approach that takes into account linkage disequilibrium (LD) information between genetic markers that are in moderate LD while modeling gene-gene and gene-environment interactions. A major advantage of our method is that the observed genetic information enters a model directly thus eliminating the need to estimate haplotype-phase. Our approach results in an algorithm that is inexpensive computationally and does not suffer from bias induced by haplotype-phase ambiguity. We investigated our model in a series of simulation experiments and demonstrated that the proposed approach results in estimates that are nearly unbiased and have small variability. We applied our method to the analysis of data from a melanoma case-control study and investigated interaction between a set of pigmentation genes and environmental factors defined by age and gender. Furthermore, an application of our method is demonstrated using a study of Alcohol Dependence.
Estimating time-dependent ROC curves using data under prevalent sampling.
Li, Shanshan
2017-04-15
Prevalent sampling is frequently a convenient and economical sampling technique for the collection of time-to-event data and thus is commonly used in studies of the natural history of a disease. However, it is biased by design because it tends to recruit individuals with longer survival times. This paper considers estimation of time-dependent receiver operating characteristic curves when data are collected under prevalent sampling. To correct the sampling bias, we develop both nonparametric and semiparametric estimators using extended risk sets and the inverse probability weighting techniques. The proposed estimators are consistent and converge to Gaussian processes, while substantial bias may arise if standard estimators for right-censored data are used. To illustrate our method, we analyze data from an ovarian cancer study and estimate receiver operating characteristic curves that assess the accuracy of the composite markers in distinguishing subjects who died within 3-5 years from subjects who remained alive. Copyright © 2016 John Wiley & Sons, Ltd.
Lee, Hyeonjeong; Shin, Miyoung
2017-01-01
The problem of discovering genetic markers as disease signatures is of great significance for the successful diagnosis, treatment, and prognosis of complex diseases. Although many earlier studies worked on identifying disease markers from a variety of biological resources, they mostly focused on the markers of genes or gene-sets (i.e., pathways). However, these markers may not be enough to explain biological interactions between genetic variables that are related to diseases. Thus, in this study, our aim is to investigate distinctive associations among active pathways (i.e., pathway-sets) in case and control samples, as observed from gene expression and/or methylation data. The pathway-sets are obtained by identifying a set of associated pathways that are often active together over a significant number of class samples. For this purpose, gene expression or methylation profiles are first analyzed to identify significant (active) pathways via gene-set enrichment analysis. Then, regarding these active pathways, an association rule mining approach is applied to examine interesting pathway-sets in each class of samples (case or control). By doing so, the sets of associated pathways often working together in activity profiles are finally chosen as our distinctive signature of each class. The identified pathway-sets are aggregated into a pathway activity network (PAN), which facilitates the visualization of differential pathway associations between case and control samples. From our experiments with two publicly available datasets, we could find interesting PAN structures as the distinctive signatures of breast cancer and uterine leiomyoma cancer, respectively. Our pathway-set markers were shown to be superior or very comparable to other genetic markers (such as genes or gene-sets) in disease classification. 
Furthermore, the PAN structure, which can be constructed from the identified markers of pathway-sets, could provide deeper insights into distinctive associations between pathway activities in case and control samples.
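The abstract names association rule mining but not the specific algorithm or thresholds. As a minimal sketch of the underlying idea, assuming each sample is represented by the set of pathways found active in it, pathway-sets that are active together in at least a given fraction of samples (the support) can be counted directly. The pathway labels and the threshold below are hypothetical.

```python
from itertools import combinations

def frequent_pathway_sets(samples, min_support, size=2):
    """Return pathway-sets of a given size with support >= min_support.

    samples is a list of sets of active pathway names for one class
    (case or control); support is the fraction of samples containing the set.
    """
    counts = {}
    for active in samples:
        for combo in combinations(sorted(active), size):
            counts[combo] = counts.get(combo, 0) + 1
    n_samples = len(samples)
    return {combo: count / n_samples
            for combo, count in counts.items()
            if count / n_samples >= min_support}

# Four hypothetical case samples with active pathways A, B, C
case_samples = [{"A", "B"}, {"A", "B", "C"}, {"A", "C"}, {"B"}]
case_sets = frequent_pathway_sets(case_samples, min_support=0.5)
```

Running the same function separately on case and control samples and comparing the resulting pathway-sets gives the kind of class-specific edges that a pathway activity network would display.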
Precise determination of time to reach viral load set point after acute HIV-1 infection.
Huang, Xiaojie; Chen, Hui; Li, Wei; Li, Haiying; Jin, Xia; Perelson, Alan S; Fox, Zoe; Zhang, Tong; Xu, Xiaoning; Wu, Hao
2012-12-01
The HIV viral load set point has long been used as a prognostic marker of disease progression and more recently as an end-point parameter in HIV vaccine clinical trials. The definition of set point, however, is variable. Moreover, the earliest time at which the set point is reached after the onset of infection has never been clearly defined. In this study, we obtained sequential plasma viral load data from 60 acutely HIV-infected Chinese patients among a cohort of men who have sex with men, mathematically determined viral load set point levels, and estimated time to attain set point after infection. We also compared the results derived from our models with those obtained from an empirical method. Using a novel, uncomplicated mathematical model, we found that the time to reach set point may vary from 21 to 119 days, depending on the patient's initial viral load trajectory. The viral load set points were 4.28 ± 0.86 and 4.25 ± 0.87 log10 copies per milliliter (P = 0.08), respectively, as determined by our model and an empirical method, suggesting an excellent agreement between the old and new methods. We provide a novel method to estimate viral load set point at the very early stage of HIV infection. Application of this model can accurately and reliably determine the set point, thus providing a new tool for physicians to better monitor early intervention strategies in acutely infected patients and scientists to rationally design preventative vaccine studies.
Semenova, S K; Illarionova, N A; Vasil'ev, V A; Shubkina, A V; Ryskov, A P
2002-06-01
The method of polymerase chain reaction with a set of arbitrary primers (RAPD-PCR) was used to describe genetic variation and to estimate genetic diversity in East-European swift hounds, Russian Psovyi and Hortyi Borzois. For comparison, swift hounds of two West-European breeds (Whippet and Greyhound) and single dogs of other breed groups (shepherd, terriers, mastiffs, and bird dogs) were examined. For all dog groups, their closest related species, the wolf Canis lupus, was used as an outgroup. Variation of RAPD markers was studied at several hierarchic levels: intra- and interfamily (for individual families of Russian Psovyi and Hortyi Borzois), intra- and interbreed (for ten dog breeds), and interspecific (C. familiaris-C. lupus). In total, 57 dogs and 4 wolves were studied. Using RAPD-PCR with three primers, 93 DNA fragments with a length of 150-1500 bp were detected in several Borzoi families with known filiation. These fragments were found to be inherited as dominant markers and to be applicable for estimation of genetic differences between parents and their offspring and for comparison of individuals and families with different level of inbreeding. A high level of intra- and interbreed variation was found in Russian Psovyi and Hortyi Borzois. In these dog groups, genetic similarity indices varied in a range of 72.2 to 93.4% (parents-offspring) and 68.0 to 94.5% (sibs). Based on the patterns of RAPD markers obtained using six primers, a dendrogram of genetic similarity between the wolf and different dog breeds was constructed, and indices of intragroup diversity were calculated. All studied breeds were found to fall into two clusters, swift hounds (Borzoi-like dogs) and other dogs. Russian Borzois represent a very heterogeneous group, in which the Russian Psovyi Borzoi is closer to Greyhound than the Russian Hortyi Borzoi. All studied wolves constituted a separate cluster. 
Significant differences were found between the wolf and dogs by the number of RAPD markers (92.8 and 86.1, respectively) and by the indices of genetic diversity (54.3 and 64.8%, respectively). The reasons for the high intraspecific variation of dogs (including Russian Borzois) and the prospects of using the studied group of markers for genetic analysis and differentiation in C. familiaris are discussed.
Duan, Chuanren; Cui, Yamin; Zhao, Yi; Zhai, Jun; Zhang, Baoyun; Zhang, Kun; Sun, Da; Chen, Hang
2016-10-01
A genetic marker within the 16S rRNA gene of Faecalibacterium was identified for use in a quantitative PCR (qPCR) assay to detect swine faecal contamination in water. A total of 146,038 bacterial sequences were obtained using 454 pyrosequencing. By comparative bioinformatics analysis of Faecalibacterium sequences with those of numerous swine and other animal species, swine-specific Faecalibacterium 16S rRNA gene sequences were identified and polymerase chain reaction (PCR) primer sets designed and tested against faecal DNA samples from swine and non-swine sources. Two PCR primer sets, PFB-1 and PFB-2, showed the highest specificity to swine faecal waste and had no cross-reaction with other animal samples. PFB-1 and PFB-2 amplified 16S rRNA gene sequences from 50 samples of swine with positive ratios of 86 and 90%, respectively. We compared swine-specific Faecalibacterium qPCR assays for the purpose of quantifying the newly identified markers. The quantification limits (LOQs) of PFB-1 and PFB-2 markers in environmental water were 6.5 and 2.9 copies per 100 ml, respectively. Of the swine-associated assays tested, PFB-2 was more sensitive in detecting the swine faecal waste and quantifying the microbial load. Furthermore, the microbial abundance and diversity of the microbiomes of swine and other animal faeces were estimated using operational taxonomic units (OTUs). The species specificity was demonstrated for the microbial populations present in various animal faeces. Copyright © 2016 Elsevier Ltd. All rights reserved.
Rohde, Palle Duun; Demontis, Ditte; Cuyabano, Beatriz Castro Dias; Børglum, Anders D; Sørensen, Peter
2016-08-01
Schizophrenia is a psychiatric disorder with large personal and social costs, and understanding the genetic etiology is important. Such knowledge can be obtained by testing the association between a disease phenotype and individual genetic markers; however, such single-marker methods have limited power to detect genetic markers with small effects. Instead, aggregating genetic markers based on biological information might increase the power to identify sets of genetic markers of etiological significance. Several set test methods have been proposed; here we propose a new set test derived from genomic best linear unbiased prediction (GBLUP), the covariance association test (CVAT). We compared the performance of CVAT to other commonly used set tests. The comparison was conducted using a simulated study population having the same genetic parameters as for schizophrenia. We found that CVAT was among the top performers. When extending CVAT to utilize a mixture of SNP effects, we found an increase in power to detect the causal sets. Applying the methods to a Danish schizophrenia case-control data set, we found genomic evidence for association of schizophrenia with vitamin A metabolism and immunological responses, which previously have been implicated with schizophrenia based on experimental and observational studies. Copyright © 2016 by the Genetics Society of America.
Smith, JohnEric W.; Krings, Ben M.; Peterson, Timothy J.; Rountree, Jaden A.; Zak, Roksana B.; McAllister, Matthew J.
2017-01-01
The purpose of this study was to investigate the impact of ingesting an amino acid-electrolyte (AAE) beverage during upper body resistance exercise on transient muscle hypertrophy, exercise performance, markers of muscle damage, and recovery. Participants (n = 15) performed three sets of six repetitions (bench press, lat pull down, incline press, and seated row) followed by three sets of eight repetitions at 75% of the estimated 1 repetition maximum (triceps kickback, hammer curl, triceps push down, and preacher curl) with 90 s of rest between sets. The final set of the push down/preacher curl was performed to failure. Prior to and immediately post-exercise, as well as 24, 48, and 72 h post-exercise, cross-sectional muscle thickness was measured. Blood samples were collected prior to exercise, as well as 24, 48, and 72 h post-exercise for serum creatine kinase (CK) analysis. No treatment effect was found for muscle cross-sectional area, repetitions to failure, or serum CK. A main effect (p < 0.001) was observed in the change in serum CK levels in the days following the resistance exercise session. The findings of this study suggest that the acute ingestion of an AAE beverage does not alter acute muscle thickness, performance, perceived soreness and weakness, or markers of muscle damage.
Chen, Hongda; Knebel, Phillip; Brenner, Hermann
2016-07-01
Search for biomarkers for early detection of cancer is a very active area of research, but most studies are done in clinical rather than screening settings. We aimed to empirically evaluate the role of study setting for early detection marker identification and validation. A panel of 92 candidate cancer protein markers was measured in 35 clinically identified colorectal cancer patients and 35 colorectal cancer patients identified at screening colonoscopy. For each case group, we selected 38 controls without colorectal neoplasms at screening colonoscopy. Single-, two- and three-marker combinations discriminating cases and controls were identified in each setting and subsequently validated in the alternative setting. In all scenarios, a higher number of predictive biomarkers were initially detected in the clinical setting, but a substantially lower proportion of identified biomarkers could subsequently be confirmed in the screening setting. Confirmation rates were 50.0%, 84.5%, and 74.2% for one-, two-, and three-marker algorithms identified in the screening setting and were 42.9%, 18.6%, and 25.7% for algorithms identified in the clinical setting. Validation of early detection markers of cancer in a true screening setting is important to limit the number of false-positive findings. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
O'Brien, Jake William; Banks, Andrew Phillip William; Novic, Andrew Joseph; Mueller, Jochen F; Jiang, Guangming; Ort, Christoph; Eaglesham, Geoff; Yuan, Zhiguo; Thai, Phong K
2017-04-04
A key uncertainty of wastewater-based epidemiology is the size of the population which contributed to a given wastewater sample. We previously developed and validated a Bayesian inference model to estimate population size based on 14 population markers which: (1) are easily measured and (2) have mass loads which correlate with population size. However, the potential uncertainty of the model prediction due to in-sewer degradation of these markers was not evaluated. In this study, we addressed this gap by testing their stability under sewer conditions and assessed whether degradation impacts the model estimates. Five markers, which formed the core of our model, were stable in the sewers while the others were not. Our evaluation showed that the presence of unstable population markers in the model did not decrease the precision of the population estimates provided that stable markers such as acesulfame remained in the model. However, to achieve the minimum uncertainty in population estimates, we propose that the core markers to be included in population models for other sites should meet two additional criteria: (3) negligible degradation in wastewater to ensure the stability of chemicals during collection; and (4) less than 10% in-sewer degradation during the mean residence time of the sewer network.
Williams, Robert C; Elston, Robert C; Kumar, Pankaj; Knowler, William C; Abboud, Hanna E; Adler, Sharon; Bowden, Donald W; Divers, Jasmin; Freedman, Barry I; Igo, Robert P; Ipp, Eli; Iyengar, Sudha K; Kimmel, Paul L; Klag, Michael J; Kohn, Orly; Langefeld, Carl D; Leehey, David J; Nelson, Robert G; Nicholas, Susanne B; Pahl, Madeleine V; Parekh, Rulan S; Rotter, Jerome I; Schelling, Jeffrey R; Sedor, John R; Shah, Vallabh O; Smith, Michael W; Taylor, Kent D; Thameem, Farook; Thornley-Brown, Denyse; Winkler, Cheryl A; Guo, Xiuqing; Zager, Phillip; Hanson, Robert L
2016-05-04
The presence of population structure in a sample may confound the search for important genetic loci associated with disease. Our four samples in the Family Investigation of Nephropathy and Diabetes (FIND), European Americans, Mexican Americans, African Americans, and American Indians are part of a genome-wide association study in which population structure might be particularly important. We therefore decided to study in detail one component of this, individual genetic ancestry (IGA). From SNPs present on the Affymetrix 6.0 Human SNP array, we identified 3 sets of ancestry informative markers (AIMs), each maximized for the information in one of the three contrasts among ancestral populations: Europeans (HAPMAP, CEU), Africans (HAPMAP, YRI and LWK), and Native Americans (full heritage Pima Indians). We estimate IGA and present an algorithm for their standard errors, compare IGA to principal components, emphasize the importance of balancing information in the ancestry informative markers (AIMs), and test the association of IGA with diabetic nephropathy in the combined sample. A fixed parental allele maximum likelihood algorithm was applied to the FIND to estimate IGA in four samples: 869 American Indians; 1385 African Americans; 1451 Mexican Americans; and 826 European Americans. When the information in the AIMs is unbalanced, the estimates are incorrect with large error. Individual genetic admixture is highly correlated with principal components for capturing population structure. It takes ~700 SNPs to reduce the average standard error of individual admixture below 0.01. When the samples are combined, the resulting population structure creates associations between IGA and diabetic nephropathy. The identified set of AIMs, which include American Indian parental allele frequencies, may be particularly useful for estimating genetic admixture in populations from the Americas. 
Failure to balance information in maximum likelihood, poly-ancestry models creates biased estimates of individual admixture with large error. This also occurs when estimating IGA using the Bayesian clustering method as implemented in the program STRUCTURE. Odds ratios for the associations of IGA with disease are consistent with what is known about the incidence and prevalence of diabetic nephropathy in these populations.
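A fixed parental allele maximum likelihood estimate of individual ancestry can be sketched as follows. This is a deliberately simplified illustration, assuming only two parental populations (the study uses three), independent SNPs, genotypes coded as 0, 1, or 2 copies of a reference allele, and a plain grid search in place of whatever optimizer the authors used.

```python
import math

def log_likelihood(ancestry, genotypes, freq_pop_a, freq_pop_b):
    """Binomial log-likelihood of the genotypes for a given ancestry fraction.

    The individual's allele frequency at each SNP is the ancestry-weighted
    mix of the two fixed parental frequencies; constant terms are dropped.
    """
    total = 0.0
    for g, p_a, p_b in zip(genotypes, freq_pop_a, freq_pop_b):
        p = ancestry * p_a + (1 - ancestry) * p_b
        p = min(max(p, 1e-9), 1 - 1e-9)   # guard against log(0)
        total += g * math.log(p) + (2 - g) * math.log(1 - p)
    return total

def estimate_ancestry(genotypes, freq_pop_a, freq_pop_b, steps=1000):
    """Grid-search the fraction of ancestry from population A on [0, 1]."""
    grid = [i / steps for i in range(steps + 1)]
    return max(grid, key=lambda a: log_likelihood(a, genotypes,
                                                  freq_pop_a, freq_pop_b))

# Hypothetical SNPs whose reference allele is common in A and rare in B
freq_a, freq_b = [0.9, 0.9], [0.1, 0.1]
admixed = estimate_ancestry([1, 1], freq_a, freq_b)
```

In this form it is easy to see why the estimate depends on the parental frequencies being informative: when `freq_pop_a` and `freq_pop_b` are close at every SNP, the likelihood is nearly flat in the ancestry fraction and the standard error is large.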
Rašić, Gordana; Filipović, Igor; Weeks, Andrew R; Hoffmann, Ary A
2014-04-11
Genetic markers are widely used to understand the biology and population dynamics of disease vectors, but often markers are limited in the resolution they provide. In particular, the delineation of population structure, fine scale movement and patterns of relatedness are often obscured unless numerous markers are available. To address this issue in the major arbovirus vector, the yellow fever mosquito (Aedes aegypti), we used double digest Restriction-site Associated DNA (ddRAD) sequencing for the discovery of genome-wide single nucleotide polymorphisms (SNPs). We aimed to characterize the new SNP set and to test the resolution against previously described microsatellite markers in detecting broad and fine-scale genetic patterns in Ae. aegypti. We developed bioinformatics tools that support the customization of restriction enzyme-based protocols for SNP discovery. We showed that our approach for RAD library construction achieves unbiased genome representation that reflects true evolutionary processes. In Ae. aegypti samples from three continents we identified more than 18,000 putative SNPs. They were widely distributed across the three Ae. aegypti chromosomes, with 47.9% found in intergenic regions and 17.8% in exons of over 2,300 genes. Patterns of their imputed effects in ORFs and UTRs were consistent with those found in a recent transcriptome study. We demonstrated that individual mosquitoes from Indonesia, Australia, Vietnam and Brazil can be assigned with a very high degree of confidence to their region of origin using a large SNP panel. We also showed that familial relatedness of samples from a 0.4 km² area could be confidently established with a subset of SNPs. Using a cost-effective customized RAD sequencing approach supported by our bioinformatics tools, we characterized over 18,000 SNPs in field samples of the dengue fever mosquito Ae. aegypti. The variants were annotated and positioned onto the three Ae. aegypti chromosomes.
The new SNP set provided much greater resolution in detecting population structure and estimating fine-scale relatedness than a set of polymorphic microsatellites. RAD-based markers demonstrate great potential to advance our understanding of mosquito population processes, critical for implementing new control measures against this major disease vector.
Linking the potato genome to the Conserved Ortholog Set (COS) markers
USDA-ARS's Scientific Manuscript database
Conserved ortholog set (COS) markers are an important functional genomics resource that has greatly improved orthology detection in Asterid species. A comprehensive list of these markers is available at Sol Genomics Network (http://www.sgn.cornell.edu) and many of these have been placed in the genet...
Wang, Jianren; Xu, Junkai; Shull, Peter B
2018-03-01
Vertical jump height is widely used for assessing motor development, functional ability, and motor capacity. Traditional methods for estimating vertical jump height rely on force plates or optical marker-based motion capture systems limiting assessment to people with access to specialized laboratories. Current wearable designs need to be attached to the skin or strapped to an appendage which can potentially be uncomfortable and inconvenient to use. This paper presents a novel algorithm for estimating vertical jump height based on foot-worn inertial sensors. Twenty healthy subjects performed countermovement jumping trials and maximum jump height was determined via inertial sensors located above the toe and under the heel and was compared with the gold standard maximum jump height estimation via optical marker-based motion capture. Average vertical jump height estimation errors from inertial sensing at the toe and heel were -2.2±2.1 cm and -0.4±3.8 cm, respectively. Vertical jump height estimation with the presented algorithm via inertial sensing showed excellent reliability at the toe (ICC(2,1)=0.98) and heel (ICC(2,1)=0.97). There was no significant bias in the inertial sensing at the toe, but proportional bias (b=1.22) and fixed bias (a=-10.23cm) were detected in inertial sensing at the heel. These results indicate that the presented algorithm could be applied to foot-worn inertial sensors to estimate maximum jump height enabling assessment outside of traditional laboratory settings, and to avoid bias errors, the toe may be a more suitable location for inertial sensor placement than the heel.
Cahoon, D.R.; Marin, P.E.; Black, B.K.; Lynch, J.C.
2000-01-01
High-resolution measures of vertical accretion, elevation, and compaction of shallow-water sediments are fundamental to understanding the processes that control elevation change and the mechanisms of progradation (e.g., development of mudflats and intertidal wetlands) in coastal systems. Yet, measurements of elevation by traditional survey methods often are of low accuracy because of the compressible nature of the substrates. Nor do they provide measures of vertical accretion or sediment compaction. This paper evaluates the use in shallow-water systems of an approach designed to measure these variables in vegetated wetlands. The approach employs simultaneous measures of elevation from temporary benchmarks using a sedimentation-erosion table (SET) and vertical accretion from marker horizons with sediment cores collected with a cryogenic coring apparatus. The measures are made with a level of resolution sufficient to distinguish between the influence of surface and subsurface processes on elevation, thus providing quantitative estimates of shallow subsidence. The SET-marker horizon approach was evaluated on a developing splay created by an artificial crevasse of a distributary in the Mississippi River delta. The approach provided high-resolution measures of vertical accretion (48.3 ± 2.0 cm) and elevation (36.7 ± 1.6 cm) over a 4-year period, with the difference between the two indicating the amount of shallow subsidence. In addition, by laying new marker horizons in later years, the approach provided rates not only of shallow subsidence (3.9 ± 0.5 cm y-1) but also compaction of newly deposited sediments (2.1 ± 0.6 cm y-1) and compaction of underlying sediments (1.8 ± 2.0 cm y-1) over a two-year period. Hence, the SET-marker horizon approach has widespread applicability in both emergent wetland and shallow water environments for providing high resolution measures of the processes controlling elevation change.
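The subtraction described in the abstract can be worked through with its own central values. The sketch below is illustrative only: the function name `shallow_subsidence` is hypothetical, and the reported uncertainties are ignored.

```python
def shallow_subsidence(accretion_cm, elevation_cm):
    """Shallow subsidence as vertical accretion minus elevation gain."""
    return accretion_cm - elevation_cm


# Central values reported for the 4-year period.
four_year_subsidence_cm = round(shallow_subsidence(48.3, 36.7), 1)

# Two-year rates (cm per year): compaction of newly deposited sediments
# plus compaction of underlying sediments.
summed_compaction_rate = round(2.1 + 1.8, 1)
```

The difference comes to 11.6 cm over four years, and the two compaction rates add up to the 3.9 cm per year reported as the shallow subsidence rate for the two-year period.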
Solav, Dana; Rubin, M B; Cereatti, Andrea; Camomilla, Valentina; Wolf, Alon
2016-04-01
Accurate estimation of the position and orientation (pose) of a bone from a cluster of skin markers is limited mostly by the relative motion between the bone and the markers, which is known as the soft tissue artifact (STA). This work presents a method, based on continuum mechanics, to describe the kinematics of a cluster affected by STA. The cluster is characterized by triangular Cosserat point elements (TCPEs) defined by all combinations of three markers. The effects of the STA on the TCPEs are quantified using three parameters describing the strain in each TCPE and the relative rotation and translation between TCPEs. The method was evaluated using previously collected ex vivo kinematic data. Femur pose was estimated from 12 skin markers on the thigh, while its reference pose was measured using bone pins. Analysis revealed that instantaneous subsets of TCPEs exist which estimate bone position and orientation more accurately than the Procrustes Superimposition applied to the cluster of all markers. It has been shown that some of these parameters correlate well with femur pose errors, which suggests that they can be used to select, at each instant, subsets of TCPEs leading to an improved estimation of the underlying bone pose.
Tsuruta, S; Lourenco, D A L; Misztal, I; Lawlor, T J
2015-08-01
The objective of this study was to investigate genotype by environment interactions for culling rates and milk production in large and small dairy herds in 3 US regions, using genotypes, pedigree, and phenotypes. Single nucleotide polymorphism (SNP) marker variances were also estimated in these different environments. Culling rates including cow mortality were based on 6 Dairy Herd Improvement termination codes reported by dairy producers. Separate data sets for culling rates and 305-d milk yield were created for large and small dairy herds in the US regions of the Southeast (SE), Southwest (SW), and Northeast (NE) for the first 3 lactation cows that calved between 1999 and 2008. Genomic information from 42,503 SNP markers on 34,506 bulls was included in the analysis to predict genomic estimated breeding value (GEBV) of culling rates and 305-d milk yield with a single-step genomic BLUP using a bivariate threshold-linear model. Cow replacement rates in large SE and NE herds were higher. Heritability estimates of culling rates ranged from 0.03 to 0.11, but the differences were small between large and small herds and among the 3 US regions. Genetic correlations between culling rates and 305-d milk yield were medium to high for cows sold for poor production and reproduction problems. Correlations of GEBV for culling rates among the 3 US regions ranged from 0.34 to 0.92 and were lower between the SW and the other regions, especially in small herds. Correlations of GEBV between large and small herds ranged from 0.44 to 0.90 and were lower in the SW. These results indicate genotype by environment interactions of cow culling rate between the US regions and between large and small herds. Correlations of top 30 SNP marker effects for culling rates between 2 US regions ranged from 0.64 to 0.98 and were higher than those of more SNP marker effects except for a culling reason "sold for dairy purpose." Those correlations between large and small herds ranged from 0.67 to 0.98. 
High correlations of top SNP marker effects on culling reasons between the US regions and between large and small herds suggest that major markers can be useful for selection in different environments. The SNP variance shown in a marker gene segment on chromosome 14 was strongly associated with milk production in large and small herds in the NE but not in the SE and SW. Marker genes on chromosome 14 also showed a strong association with cow culling rates due to poor production and mortality in large herds in the NE. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Mukherjee, S; Saha, A; Kumar P, Senthil; Chandak, G R; Majumder, P P; Ray, K
2006-01-01
Hemophilia B, an X-linked recessive bleeding disorder, is caused by heterogeneous mutations in the factor IX (F9) gene. Hence, carriers of the disease are usually detected by F9 gene linked RFLP analysis. We aimed to test a set of RFLP markers (DdeI, XmnI, MnlI, TaqI & HhaI), used worldwide for carrier detection, to estimate its heterozygosity in different population groups of India, and identify additional single nucleotide polymorphisms (SNPs) if necessary. A total of 8 population groups encompassing different regions of India, consisting of 107 unrelated normal females without any history of hemophilia B in the family and 13 unrelated obligate carriers were recruited in the study. Regions of F9 gene were amplified by PCR from genomic DNA of the donors followed by restriction enzyme digestion and/or sequencing as appropriate. Combined informativeness for the markers varied between 52-86% among normal females belonging to different geographical locations of India. Haplotype analysis revealed that the most prevalent haplotype lacked the restriction sites for all five RFLP markers. Screening regions of F9 gene that harbor 10 SNPs reported in dbSNP yielded only two SNPs, which increased the overall informativeness in each population group and heterozygosity in the obligate carriers for the disease from 38% to 69%. Our data show that heterozygosity of commonly used RFLP markers is remarkably variable across different regions of India. Thus prudent selection of the markers based on specific population groups including usage of additional markers is recommended for efficient carrier detection.
Georgi, Laura; Johnson-Cicalese, Jennifer; Honig, Josh; Das, Sushma Parankush; Rajah, Veeran D; Bhattacharya, Debashish; Bassil, Nahla; Rowland, Lisa J; Polashock, James; Vorsa, Nicholi
2013-03-01
The first genetic map of cranberry (Vaccinium macrocarpon) has been constructed, comprising 14 linkage groups totaling 879.9 cM with an estimated coverage of 82.2 %. This map, based on four mapping populations segregating for field fruit-rot resistance, contains 136 distinct loci. Mapped markers include blueberry-derived simple sequence repeat (SSR) and cranberry-derived sequence-characterized amplified region markers previously used for fingerprinting cranberry cultivars. In addition, SSR markers were developed near cranberry sequences resembling genes involved in flavonoid biosynthesis or defense against necrotrophic pathogens, or conserved orthologous set (COS) sequences. The cranberry SSRs were developed from next-generation cranberry genomic sequence assemblies; thus, the positions of these SSRs on the genomic map provide information about the genomic location of the sequence scaffold from which they were derived. The use of SSR markers near COS and other functional sequences, plus 33 SSR markers from blueberry, facilitates comparisons of this map with maps of other plant species. Regions of the cranberry map were identified that showed conservation of synteny with Vitis vinifera and Arabidopsis thaliana. Positioned on this map are quantitative trait loci (QTL) for field fruit-rot resistance (FFRR), fruit weight, titratable acidity, and sound fruit yield (SFY). The SFY QTL is adjacent to one of the fruit weight QTL and may reflect pleiotropy. Two of the FFRR QTL are in regions of conserved synteny with grape and span defense gene markers, and the third FFRR QTL spans a flavonoid biosynthetic gene.
DNA fingerprinting sets for four southern pines
Craig Echt; Sedley Josserand
2018-01-01
DNA markers can provide valuable genetic information for forest tree research, breeding, conservation, and restoration programs. When properly evaluated, selected sets of DNA markers can be used to efficiently get information about genetic diversity in regions, forests, or stands, or in seed lots and orchards. Selected markers also can be used to determine parentage or...
Automatic Nuclei Segmentation in H&E Stained Breast Cancer Histopathology Images
Veta, Mitko; van Diest, Paul J.; Kornegoor, Robert; Huisman, André; Viergever, Max A.; Pluim, Josien P. W.
2013-01-01
The introduction of fast digital slide scanners that provide whole slide images has led to a revival of interest in image analysis applications in pathology. Segmentation of cells and nuclei is an important first step towards automatic analysis of digitized microscopy images. We therefore developed an automated nuclei segmentation method that works with hematoxylin and eosin (H&E) stained breast cancer histopathology images, which represent regions of whole digital slides. The procedure can be divided into four main steps: 1) pre-processing with color unmixing and morphological operators, 2) marker-controlled watershed segmentation at multiple scales and with different markers, 3) post-processing for rejection of false regions and 4) merging of the results from multiple scales. The procedure was developed on a set of 21 breast cancer cases (subset A) and tested on a separate validation set of 18 cases (subset B). The evaluation was done in terms of both detection accuracy (sensitivity and positive predictive value) and segmentation accuracy (Dice coefficient). The mean estimated sensitivity for subset A was 0.875 (±0.092) and for subset B 0.853 (±0.077). The mean estimated positive predictive value was 0.904 (±0.075) and 0.886 (±0.069) for subsets A and B, respectively. For both subsets, the distribution of the Dice coefficients had a high peak around 0.9, with the vast majority of segmentations having values larger than 0.8. PMID:23922958
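The three evaluation measures named in the abstract have standard definitions, sketched below. This is a generic illustration, not the authors' evaluation code; the function names and the representation of a segmentation as a set of pixel coordinates are assumptions.

```python
def dice_coefficient(segmented, reference):
    """Dice overlap between two sets of (row, col) pixel coordinates."""
    if not segmented and not reference:
        return 1.0  # two empty masks agree trivially
    overlap = len(segmented & reference)
    return 2.0 * overlap / (len(segmented) + len(reference))


def detection_scores(true_pos, false_pos, false_neg):
    """Sensitivity and positive predictive value from detection counts."""
    sensitivity = true_pos / (true_pos + false_neg)
    ppv = true_pos / (true_pos + false_pos)
    return sensitivity, ppv


# Toy example: two 4-pixel masks sharing 2 pixels.
auto_mask = {(0, 0), (0, 1), (1, 0), (1, 1)}
manual_mask = {(0, 1), (1, 1), (1, 2), (2, 2)}
toy_dice = dice_coefficient(auto_mask, manual_mask)
```

Sensitivity and positive predictive value score whether nuclei were found at all, while the Dice coefficient scores how well the outline of each found nucleus matches the reference.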
Marques, Paul R; Tippetts, A Scott; Yegles, Michel
2014-01-01
This study focuses on the predictive and comparative significance of ethyl glucuronide measured in head hair (hEtG) for estimating risks associated with alcohol-impaired driving offenders. Earlier work compared different alcohol biomarkers for estimating rates of failed blood alcohol concentration (BAC) tests logged during 8 months of interlock participation. These analyses evaluate the comparative performance of several alcohol markers including hEtG and other markers, past driver records, and psychometric assessment predictors for the detection of 4 criteria: new driving under the influence (DUI) recidivism, alcohol dependence, and interlock record variables including fail rates and maximal interlock BACs logged. Drivers charged with alcohol impairment (DUI) in Alberta, Canada (n = 534; 64% first offenders, 36% multiple offenders) installed ignition interlock devices and consented to participate in research to evaluate blood-, hair-, and urine-derived alcohol biomarkers; sit for interviews; take psychometric assessments; and permit analyses of driving records and interlock log files. Subject variables included demographics, alcohol dependence at program entry, preprogram prior DUI convictions, postenrollment new DUI convictions, self-reported drinking assessments, morning and overall rates of failed interlock BAC tests, and maximal interlock BAC readings. Recidivism, dependence, high BAC, and combined fail rates were set as criteria; other variables were set as predictors. Area under the receiver operating characteristics (ROC) curve (A') estimates of sensitivity and specificity were calculated. Additional analyses were conducted on baseline hEtG levels. Driver performance and drinking indicators were evaluated against the standard hEtG cutoff for excessive drinking (30 pg/mg) and a higher criterion of 50 pg/mg. HEtG splits were evaluated with the Mann-Whitney rank statistic.
HEtG emerged as a top overall predictor for discriminating new recidivism events that occur after interlock installation, for entry alcohol dependence, and for the highest interlock BACs recorded. Together, hEtG and phosphatidylethanol (PEth) were the top predictors of all criterion measures. By contrast, the hair-derived alcohol biomarkers hEtG and hFAEE (fatty acid ethyl esters) were poorer than other alcohol biomarkers as detectors of interlock BAC test fail rates. This study showed that hEtG, an objective alternative to often unreliable self-reported past representation of drinking levels, yields crucial insight into driver alcohol-related risks early in an interlock program and is a top predictor of new recidivist events. Together with PEth, these markers would be excellent anchors in a panel for detecting alcohol consumption.
related: an R package for analysing pairwise relatedness from codominant molecular markers.
Pew, Jack; Muir, Paul H; Wang, Jinliang; Frasier, Timothy R
2015-05-01
Analyses of pairwise relatedness represent a key component to addressing many topics in biology. However, such analyses have been limited because most available programs provide a means to estimate relatedness based on only a single estimator, making comparison across estimators difficult. Second, all programs to date have been platform specific, working only on a specific operating system. This has the undesirable outcome of making choice of relatedness estimator limited by operating system preference, rather than being based on scientific rationale. Here, we present a new R package, called related, that can calculate relatedness based on seven estimators, can account for genotyping errors, missing data and inbreeding, and can estimate 95% confidence intervals. Moreover, simulation functions are provided that allow for easy comparison of the performance of different estimators and for analyses of how much resolution to expect from a given data set. Because this package works in R, it is platform independent. Combined, this functionality should allow for more appropriate analyses and interpretation of pairwise relatedness and will also allow for the integration of relatedness data into larger R workflows. © 2014 John Wiley & Sons Ltd.
Solav, Dana; Camomilla, Valentina; Cereatti, Andrea; Barré, Arnaud; Aminian, Kamiar; Wolf, Alon
2017-09-06
The aim of this study was to analyze the accuracy of bone pose estimation based on sub-clusters of three skin-markers characterized by triangular Cosserat point elements (TCPEs) and to evaluate the capability of four instantaneous physical parameters, which can be measured non-invasively in vivo, to identify the most accurate TCPEs. Moreover, TCPE pose estimations were compared with the estimations of two least squares minimization methods applied to the cluster of all markers, using rigid body (RBLS) and homogeneous deformation (HDLS) assumptions. Analysis was performed on previously collected in vivo treadmill gait data composed of simultaneous measurements of the gold-standard bone pose by bi-plane fluoroscopy tracking the subjects' knee prosthesis and a stereophotogrammetric system tracking skin-markers affected by soft tissue artifact. Femur orientation and position errors estimated from skin-marker clusters were computed for 18 subjects using clusters of up to 35 markers. Results based on gold-standard data revealed that instantaneous subsets of TCPEs exist which estimate the femur pose with reasonable accuracy (median root mean square error during stance/swing: 1.4/2.8deg for orientation, 1.5/4.2mm for position). A non-invasive and instantaneous criterion to select accurate TCPEs for pose estimation (4.8/7.3deg, 5.8/12.3mm) was compared with RBLS (4.3/6.6deg, 6.9/16.6mm) and HDLS (4.6/7.6deg, 6.7/12.5mm). Accounting for homogeneous deformation, using HDLS or selected TCPEs, yielded more accurate position estimations than the RBLS method, which, conversely, yielded more accurate orientation estimations. Further investigation is required to devise effective criteria for cluster selection that could represent a significant improvement in bone pose estimation accuracy. Copyright © 2017 Elsevier Ltd. All rights reserved.
Gingerich, W.H.; Pityer, R.A.; Rach, J.J.
1987-01-01
1. Total blood volume and relative blood volumes in selected tissues were determined in non-anesthetized, confined rainbow trout by using 51Cr-labelled trout erythrocytes as a vascular space marker. 2. Mean total blood volume was estimated to be 4.09 ± 0.55 ml/100 g, or about 75% of that estimated with the commonly used plasma space marker Evans blue dye. 3. Relative tissue blood volumes were greatest in highly perfused tissues such as kidney, gills, brain and liver and least in mosaic muscle. 4. Estimates of tissue vascular spaces, made using radiolabelled erythrocytes, were only 25–50% of those based on plasma space markers. 5. The consistently smaller vascular volumes obtained with labelled erythrocytes could be explained by assuming that commonly used plasma space markers diffuse from the vascular compartment.
Urrestarazu, Jorge; Royo, José B.; Santesteban, Luis G.; Miranda, Carlos
2015-01-01
Fingerprinting information can be used to elucidate in a robust manner the genetic structure of germplasm collections, allowing a more rational and fine assessment of genetic resources. Bayesian model-based approaches are now generally preferred for inferring genetic structure, but it is still largely unresolved how marker sets should be built in order to obtain a robust inference. The objective was to evaluate, in Pyrus germplasm collections, the influence of the SSR marker set size on the genetic structure inferred, also evaluating the influence of the criterion used to select those markers. Inferences were performed considering an increasing number of SSR markers that ranged from just two up to 25, incorporated one at a time into the analysis. The influence of the number of SSR markers used was evaluated by comparing the number of populations and the strength of the signal detected, and also the similarity of the genotype assignments to populations between analyses. In order to test whether those results were influenced by the criterion used to select the SSRs, several selection scenarios based on the discrimination power or the fixation index values of the SSRs were tested. Our results indicate that population structure could be inferred accurately once a certain SSR number threshold was reached, which depended on the underlying structure within the genotypes, but the method used to select the markers included in each set appeared not to be very relevant. The minimum number of SSRs required to provide robust structure inferences and adequate measurements of the differentiation, even when low differentiation levels exist within populations, proved similar to that of the complete list of recommended markers for fingerprinting. When an SSR set size similar to the minimum marker sets recommended for fingerprinting is used, only major divisions or moderate (F_ST > 0.05) differentiation of the germplasm are detected. PMID:26382618
Do biological measures mediate the relationship between education and health: a comparative study
Goldman, Noreen; Turra, Cassio; Rosero-Bixby, Luis; Weir, David; Crimmins, Eileen
2010-01-01
Despite a myriad of studies examining the relationship between socioeconomic status and health outcomes, few have assessed the extent to which biological markers of chronic disease account for social disparities in health. Studies that have examined this issue have generally been based on surveys in wealthy countries that include a small set of clinical markers of cardiovascular disease. The availability of recent data from nationally representative surveys of older adults in Costa Rica and Taiwan that collected a rich set of biomarkers comparable to those in a recent US survey permits us to explore these associations across diverse populations. Similar regression models were estimated on three data sets – the Social Environment and Biomarkers of Aging Study in Taiwan, the Costa Rican Study on Longevity and Healthy Aging, and the Health and Retirement Study in the USA – in order to assess (1) the strength of the associations between educational attainment and a broad range of biomarkers; and (2) the extent to which these biomarkers account for the relationships between education and two measures of health status (self-rated health, functional limitations) in older populations. The estimates suggest non-systematic and weak associations between education and high risk biomarker values in Taiwan and Costa Rica, in contrast to generally negative and significant associations in the US, especially among women. The results also reveal negligible or modest contributions of the biomarkers to educational disparities in the health outcomes. The findings are generally consistent with previous research suggesting stronger associations between socioeconomic status and health in wealthy countries than in middle income countries and may reflect higher levels of social stratification in the US. 
With access to an increasing number of longitudinal biosocial surveys, researchers may be better able to distinguish true variations in the relationship between socioeconomic status and health across different settings from methodological differences. PMID:21159415
Paz-García, David A; Munguía-Vega, Adrián; Plomozo-Lugo, Tomas; Weaver, Amy Hudson
2017-04-01
We developed a set of hypervariable microsatellite markers for the Pacific red snapper (Lutjanus peru), an economically important marine fish for small-scale fisheries on the west coast of Mexico. We performed shotgun genome sequencing with the 454 XL titanium chemistry and used bioinformatic tools to search for perfect microsatellite loci. We selected 66 primer pairs that were synthesized and genotyped in an ABI PRISM 3730XL DNA sequencer in 32 individuals from the Gulf of California. We estimated levels of genetic diversity, deviations from linkage and Hardy-Weinberg equilibrium, the frequency of null alleles and the probability of individual identity for the new markers. We reanalyzed 16 loci in 16 individuals to estimate genotyping error rates. Eighteen loci failed to amplify, 16 loci were discarded due to unspecific amplifications and 32 loci (14 tetranucleotide and 18 dinucleotide) were successfully scored. The average number of alleles per locus was 21 (±6.87 SD) and ranged from 8 to 34. The average observed and expected heterozygosities were 0.787 (±0.144 SD, range 0.250-0.935) and 0.909 (±0.122 SD, range 0.381-0.965), respectively. No significant linkage was detected. Eight loci showed deviations from Hardy-Weinberg equilibrium, and from these, four loci showed moderate null allele frequencies (0.104-0.220). The probability of individual identity for the new loci was 1.46 × 10^-62. Genotyping error rates averaged 9.58%. The new markers will be useful to investigate patterns of larval dispersal, metapopulation dynamics, fine-scale genetic structure and diversity aimed to inform the implementation of spatially explicit fisheries management strategies in the Gulf of California.
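Expected heterozygosity and the probability of identity are both simple functions of allele frequencies. The sketch below uses the textbook formulas under Hardy-Weinberg proportions and independent loci; it is not taken from the study, which may have used sample-size corrections, and all function names are hypothetical.

```python
from itertools import combinations


def expected_heterozygosity(freqs):
    """Chance that two randomly drawn alleles at a locus differ."""
    return 1.0 - sum(p * p for p in freqs)


def probability_of_identity(freqs):
    """Chance that two unrelated individuals share a genotype at a locus."""
    homozygotes = sum(p ** 4 for p in freqs)
    heterozygotes = sum((2 * p * q) ** 2 for p, q in combinations(freqs, 2))
    return homozygotes + heterozygotes


def multilocus_identity(loci):
    """Product of per-locus values, assuming the loci are unlinked."""
    result = 1.0
    for freqs in loci:
        result *= probability_of_identity(freqs)
    return result


# Toy locus with two equally common alleles.
toy_locus = [0.5, 0.5]
toy_he = expected_heterozygosity(toy_locus)
toy_pi = probability_of_identity(toy_locus)
```

Because the per-locus values multiply, a panel of many highly polymorphic loci drives the multilocus probability toward extremely small numbers.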
bigSCale: an analytical framework for big-scale single-cell data.
Iacono, Giovanni; Mereu, Elisabetta; Guillaumet-Adkins, Amy; Corominas, Roser; Cuscó, Ivon; Rodríguez-Esteban, Gustavo; Gut, Marta; Pérez-Jurado, Luis Alberto; Gut, Ivo; Heyn, Holger
2018-06-01
Single-cell RNA sequencing (scRNA-seq) has significantly deepened our insights into complex tissues, with the latest techniques capable of processing tens of thousands of cells simultaneously. Analyzing increasing numbers of cells, however, generates extremely large data sets, extending processing time and challenging computing resources. Current scRNA-seq analysis tools are not designed to interrogate large data sets and often lack sensitivity to identify marker genes. With bigSCale, we provide a scalable analytical framework to analyze millions of cells, which addresses the challenges associated with large data sets. To handle the noise and sparsity of scRNA-seq data, bigSCale uses large sample sizes to estimate an accurate numerical model of noise. The framework further includes modules for differential expression analysis, cell clustering, and marker identification. A directed convolution strategy allows processing of extremely large data sets, while preserving transcript information from individual cells. We evaluated the performance of bigSCale using both a biological model of aberrant gene expression in patient-derived neuronal progenitor cells and simulated data sets, which underlines the speed and accuracy in differential expression analysis. To test its applicability for large data sets, we applied bigSCale to assess 1.3 million cells from the mouse developing forebrain. Its directed down-sampling strategy accumulates information from single cells into index cell transcriptomes, thereby defining cellular clusters with improved resolution. Accordingly, index cell clusters identified rare populations, such as reelin (Reln)-positive Cajal-Retzius neurons, for which we report previously unrecognized heterogeneity associated with distinct differentiation stages, spatial organization, and cellular function. Together, bigSCale presents a solution to address future challenges of large single-cell data sets.
© 2018 Iacono et al.; Published by Cold Spring Harbor Laboratory Press.
Pedigrees or markers: Which are better in estimating relatedness and inbreeding coefficient?
Wang, Jinliang
2016-02-01
Individual inbreeding coefficient (F) and pairwise relatedness (r) are fundamental parameters in population genetics and have important applications in diverse fields such as human medicine, forensics, plant and animal breeding, conservation and evolutionary biology. Traditionally, both parameters are calculated from pedigrees, but are now increasingly estimated from genetic marker data. Conceptually, a pedigree gives the expected F and r values, FP and rP, with the expectations being taken (hypothetically) over an infinite number of individuals with the same pedigree. In contrast, markers give the realised (actual) F and r values at the particular marker loci of the particular individuals, FM and rM. Both pedigree (FP, rP) and marker (FM, rM) estimates can be used as inferences of genomic inbreeding coefficients FG and genomic relatedness rG, which are the underlying quantities relevant to most applications (such as estimating inbreeding depression and heritability) of F and r. In the pre-genomic era, it was widely accepted that pedigrees are much better than markers in delineating FG and rG, and markers should better be used to validate, amend and construct pedigrees rather than to replace them. Is this still true in the genomic era when genome-wide dense SNPs are available? In this simulation study, I showed that genomic markers can yield much better estimates of FG and rG than pedigrees when they are numerous (say, 10^4 SNPs) under realistic situations (e.g. genome and population sizes). Pedigree estimates are especially poor for species with a small genome, where FG and rG are determined to a large extent by Mendelian segregations and may thus deviate substantially from their expectations (FP and rP). Simulations also confirmed that FM, when estimated from many SNPs, can be much more powerful than FP for detecting inbreeding depression in viability.
However, I argue that pedigrees cannot be replaced completely by genomic SNPs, because the former allows for the calculation of more complicated IBD coefficients (involving more than 2 individuals, more than one locus, and more than 2 genes at a locus) for which the latter may have reduced capacity or limited power, and because the former has social and other significance for remote relationships which have little genetic significance and cannot be inferred reliably from markers. Copyright © 2015 Elsevier Inc. All rights reserved.
Wahl, Simone; Boulesteix, Anne-Laure; Zierer, Astrid; Thorand, Barbara; van de Wiel, Mark A
2016-10-26
Missing values are a frequent issue in human studies. In many situations, multiple imputation (MI) is an appropriate missing data handling strategy, whereby missing values are imputed multiple times, the analysis is performed in every imputed data set, and the obtained estimates are pooled. If the aim is to estimate (added) predictive performance measures, such as (change in) the area under the receiver-operating characteristic curve (AUC), internal validation strategies become desirable in order to correct for optimism. It is not fully understood how internal validation should be combined with multiple imputation. In a comprehensive simulation study and in a real data set based on blood markers as predictors for mortality, we compare three combination strategies: Val-MI (internal validation followed by MI on the training and test parts separately), MI-Val (MI on the full data set followed by internal validation), and MI(-y)-Val (MI on the full data set omitting the outcome, followed by internal validation). Different validation strategies, including bootstrap and cross-validation, different (added) performance measures, and various data characteristics are considered, and the strategies are evaluated with regard to bias and mean squared error of the obtained performance estimates. In addition, we elaborate on the number of resamples and imputations to be used, and adapt a strategy for confidence interval construction to incomplete data. Internal validation is essential in order to avoid optimism, with the bootstrap 0.632+ estimate representing a reliable method to correct for optimism. While estimates obtained by MI-Val are optimistically biased, those obtained by MI(-y)-Val tend to be pessimistic in the presence of a true underlying effect. Val-MI provides largely unbiased estimates, with a slight pessimistic bias with increasing true effect size, number of covariates and decreasing sample size.
In Val-MI, accuracy of the estimate is more strongly improved by increasing the number of bootstrap draws rather than the number of imputations. With a simple integrated approach, valid confidence intervals for performance estimates can be obtained. When prognostic models are developed on incomplete data, Val-MI represents a valid strategy to obtain estimates of predictive performance measures.
Qu, Conghui; Schuetz, Johanna M.; Min, Jeong Eun; Leach, Stephen; Daley, Denise; Spinelli, John J.; Brooks-Wilson, Angela; Graham, Jinko
2011-01-01
We describe a statistical approach to predict gender-labeling errors in candidate-gene association studies, when Y-chromosome markers have not been included in the genotyping set. The approach adds value to methods that consider only the heterozygosity of X-chromosome SNPs, by incorporating available information about the intensity of X-chromosome SNPs in candidate genes relative to autosomal SNPs from the same individual. To our knowledge, no published methods formalize a framework in which heterozygosity and relative intensity are simultaneously taken into account. Our method offers the advantage that, in the genotyping set, no additional space is required beyond that already assigned to X-chromosome SNPs in the candidate genes. We also show how the predictions can be used in a two-phase sampling design to estimate the gender-labeling error rates for an entire study, at a fraction of the cost of a conventional design. PMID:22303327
Erdei, Esther; Sheng, Huiping; Maestas, Erika; Mackey, Amanda; White, Kirsten A.; Li, Lin; Dong, Yan; Taylor, Justin; Berwick, Marianne; Morse, Douglas E.
2011-01-01
Background: Hispanics are known to be an extremely diverse and genetically admixed ethnic group. The lack of methodologies to control for ethnicity and the unknown admixture in complex study populations of Hispanics has left a gap in understanding certain cancer disparity issues. Incidence rates for oral and pharyngeal cancer (OPC) in Puerto Rico are among the highest in the Western Hemisphere. We conducted an epidemiological study to examine risk and protective factors, in addition to possible genetic susceptibility components, for oral cancer and precancer in Puerto Rico. Methodology/Principal Findings: We recruited 310 Puerto Rico residents who had been diagnosed with either an incident oral squamous cell carcinoma, oral precancer, or benign oral condition. Participants completed an in-person interview and contributed buccal cells for DNA extraction. ABI Biosystem Taqman™ primer sets were used for genotyping 12 ancestry informative markers (AIMs). Ancestral group estimates were generated using maximum likelihood estimation software (LEADMIX), and additional principal component analysis was carried out to detect population substructures. We used unconditional logistic regression to assess the contribution of ancestry to the risk of being diagnosed with either an oral cancer or precancer while controlling for other potential confounders. The maximum likelihood estimates showed that study participants had a group average ancestry contribution of 69.9% European, 24.5% African, and 5.7% detectable Native American. The African and Indigenous American group estimates were significantly higher than anticipated. Neither self-identified ethnicity nor ancestry markers showed any significant associations with oral cancer/precancer risk in our study.
Conclusions/Significance: The application of ancestry informative markers (AIMs), specifically designed for Hispanics, suggests no hidden population substructure is present based on our sampling and provides a viable approach for the evaluation and control of ancestry in future studies involving Hispanic populations. PMID:21897864
Gran, Jon Michael; Røysland, Kjetil; Wolbers, Marcel; Didelez, Vanessa; Sterne, Jonathan A C; Ledergerber, Bruno; Furrer, Hansjakob; von Wyl, Viktor; Aalen, Odd O
2010-11-20
When estimating the effect of treatment on HIV using data from observational studies, standard methods may produce biased estimates due to the presence of time-dependent confounders. Such confounding can be present when a covariate, affected by past exposure, is both a predictor of the future exposure and the outcome. One example is the CD4 cell count, which is a marker for disease progression in HIV patients, but also a marker for treatment initiation and is itself influenced by treatment. Fitting a marginal structural model (MSM) using inverse probability weights is one way to give appropriate adjustment for this type of confounding. In this paper we study a simple and intuitive approach to estimate similar treatment effects, using observational data to mimic several randomized controlled trials. Each 'trial' is constructed based on individuals starting treatment in a certain time interval. An overall effect estimate for all such trials is found using composite likelihood inference. The method offers an alternative to the use of inverse probability of treatment weights, which is unstable in certain situations. The estimated parameter is not identical to that of an MSM; it is conditioned on covariate values at the start of each mimicked trial. This allows the study of questions that are not easily addressed by fitting an MSM. The analysis can be performed as a stratified weighted Cox analysis on the joint data set of all the constructed trials, where each trial is one stratum. The model is applied to data from the Swiss HIV cohort study. Copyright © 2010 John Wiley & Sons, Ltd.
Investigator® HDplex (Qiagen) reference population database for forensic use in Argentina.
Martínez, Gustavo; Borosky, Alicia; Corach, Daniel; Llull, Cintia; Locarno, Laura; Lojo, Mercedes; Marino, Miguel; Miozzo, María Cecilia; Modesti, Nidia; Pacharoni, Carla; Pilili, Juan Pablo; Ramella, María Isabel; Sala, Andrea; Schaller, Cecilia; Vullo, Carlos; Toscanini, Ulises
2017-01-01
Currently, autosomal Short Tandem Repeat (STR) markers represent the method of choice in forensic human identification. The most commonly used commercial kits (e.g. PowerPlex® Fusion, Promega Corp.; AmpFlSTR GlobalFiler, Thermo Fisher Scientific; Investigator 24Plex QS, Qiagen) allow the co-amplification of 23 highly polymorphic STR loci, providing a high discrimination power in human identity testing. However, in complex kinship analysis and familial database searches involving distant relationships, additional DNA typing is often required in order to achieve well-founded conclusions. The recently developed kit Investigator® HDplex (Qiagen) co-amplifies twelve autosomal STR markers (D7S1517, D3S1744, D12S391, D2S1360, D6S474, D4S2366, D8S1132, D5S2500, D18S51, D21S2055, D10S2325, SE33), nine of which are not present in the above-mentioned kits, providing a set of efficient supplementary markers for human identification purposes. In this study we genotyped a sample of 980 individuals from urban areas of ten Argentinean provinces using the Investigator® HDplex kit, aiming to provide estimates for use in forensic casework and parentage testing in Argentina. We report reference allelic frequency databases for each of the provinces studied as well as for the combined samples. No deviation from Hardy-Weinberg equilibrium was observed. The estimated discrimination capacity and power of exclusion were reasonable, suggesting acceptable forensic performance of this kit, either as the main STR panel for simple cases or as an auxiliary tool in complex cases. Additionally, population comparison tests showed that the studied samples are relatively homogeneous across the country for this STR set. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Estimation of regional lung expansion via 3D image registration
NASA Astrophysics Data System (ADS)
Pan, Yan; Kumar, Dinesh; Hoffman, Eric A.; Christensen, Gary E.; McLennan, Geoffrey; Song, Joo Hyun; Ross, Alan; Simon, Brett A.; Reinhardt, Joseph M.
2005-04-01
A method is described to estimate regional lung expansion and related biomechanical parameters using multiple CT images of the lungs, acquired at different inflation levels. In this study, the lungs of two sheep were imaged utilizing a multi-detector row CT at different lung inflations in the prone and supine positions. Using the lung surfaces and the airway branch points for guidance, a 3D inverse consistent image registration procedure was used to match different lung volumes at each orientation. The registration was validated using a set of implanted metal markers. After registration, the Jacobian of the deformation field was computed to express regional expansion or contraction. The regional lung expansion at different pressures and different orientations is compared.
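As a hedged illustration of the final step described above, the sketch below approximates the Jacobian determinant of a 3D transformation at a single point by central finite differences. A value above 1 indicates local expansion and a value below 1 local contraction. The function names, the step size, and the toy uniform-inflation transformation are assumptions made for illustration; this is not the authors' registration code.

```python
from typing import Callable, List, Sequence

Point = Sequence[float]


def jacobian_matrix(h: Callable[[Point], Point], x: Point, eps: float = 1e-3) -> List[List[float]]:
    """Approximate J[i][j] = d h_i / d x_j at point x by central differences."""
    n = len(x)
    jac = [[0.0] * n for _ in range(n)]
    for j in range(n):
        x_plus, x_minus = list(x), list(x)
        x_plus[j] += eps
        x_minus[j] -= eps
        h_plus, h_minus = h(x_plus), h(x_minus)
        for i in range(n):
            jac[i][j] = (h_plus[i] - h_minus[i]) / (2.0 * eps)
    return jac


def det3(m: List[List[float]]) -> float:
    """Determinant of a 3x3 matrix by cofactor expansion along the first row."""
    return (
        m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
        - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
        + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0])
    )


def local_volume_ratio(h: Callable[[Point], Point], x: Point, eps: float = 1e-3) -> float:
    """Jacobian determinant of h at x: the local volume change ratio."""
    return det3(jacobian_matrix(h, x, eps))


def uniform_inflation(p: Point) -> List[float]:
    """Hypothetical transformation: every coordinate is stretched by 10%."""
    return [1.1 * c for c in p]


ratio = local_volume_ratio(uniform_inflation, [10.0, 20.0, 30.0])
print(round(ratio, 3))  # 1.1 ** 3, i.e. about 33% local volume expansion
```

In a real registration pipeline the transformation would be a dense deformation field sampled on the image grid, and the same determinant would be evaluated at every voxel.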
Olar, Adriana; Wani, Khalida; Mansouri, Alireza; Zadeh, Gelareh; Wilson, Charmaine; DeMonte, Franco; Fuller, Gregory; Jones, David; Pfister, Stefan; von Deimling, Andreas; Sulman, Erik; Aldape, Kenneth
2014-01-01
BACKGROUND: Methylation profiling of solid tumors has revealed biologic subtypes, often with clinical implications. Methylation profiles of meningioma and their clinical implications are not well understood. METHODS: Ninety-two meningioma samples (n = 44 test set and n = 48 validation set) were profiled using the Illumina HumanMethylation450 BeadChip. Unsupervised clustering and analyses for recurrence-free survival (RFS) were performed. RESULTS: Unsupervised clustering of the test set using approximately 900 highly variable markers identified two clearly defined methylation subgroups. One of the groups (n = 19) showed global hypermethylation of a set of markers, analogous to CpG island methylator phenotype (CIMP). These findings were reproducible in the validation set, with 18/48 samples showing the CIMP-positive phenotype. Importantly, of 347 highly variable markers common to both the test and validation set analyses, 107 defined CIMP in the test set and 94 defined CIMP in the validation set, with an overlap of 83 markers between the two datasets. This number is much greater than expected by chance, indicating reproducibility of the hypermethylated markers that define CIMP in meningioma. With respect to clinical correlation, the 37 CIMP-positive cases displayed significantly shorter RFS compared to the 55 non-CIMP cases (hazard ratio 2.9, p = 0.013). In an effort to develop a preliminary outcome predictor, a 155-marker subset correlated with RFS was identified in the test dataset. When interrogated in the validation dataset, this 155-marker subset showed a statistical trend (p < 0.1) towards distinguishing survival groups. CONCLUSIONS: This study defines the existence of a CIMP phenotype in meningioma, which involves a substantial proportion (37/92, 40%) of samples with clinical implications.
Ongoing work will expand this cohort and examine identification of additional biologic differences (mutational and DNA copy number analysis) to further characterize the aberrant methylation subtype in meningioma. CIMP-positivity with aberrant methylation in recurrent/malignant meningioma suggests a potential therapeutic target for clinically aggressive cases.
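The statement in this abstract that an overlap of 83 markers is much greater than expected by chance can be checked with a simple calculation. The sketch below assumes, purely for illustration, that the two CIMP-defining marker sets (107 and 94 markers) were independent random subsets of the 347 shared markers, so that the overlap follows a hypergeometric distribution. The abstract does not state which test the authors used, and the function names are hypothetical.

```python
from math import comb


def expected_overlap(total: int, size_a: int, size_b: int) -> float:
    """Mean overlap of two independent random subsets of a common pool."""
    return size_a * size_b / total


def overlap_tail_probability(total: int, size_a: int, size_b: int, observed: int) -> float:
    """P(overlap >= observed) under the hypergeometric null model."""
    upper = min(size_a, size_b)
    # Count the ways to choose subset B so that exactly i of its members lie in subset A.
    favourable = sum(
        comb(size_a, i) * comb(total - size_a, size_b - i)
        for i in range(observed, upper + 1)
    )
    return favourable / comb(total, size_b)


mean_overlap = expected_overlap(347, 107, 94)
p_value = overlap_tail_probability(347, 107, 94, 83)
print(round(mean_overlap, 1))  # about 29 markers expected by chance
print(p_value)                 # far smaller than any conventional threshold
```

Under this null model roughly 29 shared markers would be expected, so the observed 83 is consistent with the abstract's claim of reproducibility.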
Diegoli, Toni Marie; Rohde, Heinrich; Borowski, Stefan; Krawczak, Michael; Coble, Michael D; Nothnagel, Michael
2016-11-01
Typing of X chromosomal short tandem repeat (X STR) markers has become a standard element of human forensic genetic analysis. Joint consideration of many X STR markers at a time increases their discriminatory power but, owing to physical linkage, requires inter-marker recombination rates to be accurately known. We estimated the recombination rates between 15 well established X STR markers using genotype data from 158 families (1041 individuals) and following a previously proposed likelihood-based approach that allows for single-step mutations. To meet the computational requirements of this family-based type of analysis, we modified a previous implementation so as to allow multi-core parallelization on a high-performance computing system. While we obtained recombination rate estimates larger than zero for all but one pair of adjacent markers within the four previously proposed linkage groups, none of the three X STR pairs defining the junctions of these groups yielded a recombination rate estimate of 0.50. Corroborating previous studies, our results therefore argue against a simple model of independent X chromosomal linkage groups. Moreover, the refined recombination fraction estimates obtained in our study will facilitate the appropriate joint consideration of all 15 investigated markers in forensic analysis. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Localization Based on Magnetic Markers for an All-Wheel Steering Vehicle
Byun, Yeun Sub; Kim, Young Chol
2016-01-01
Real-time continuous localization is a key technology in the development of intelligent transportation systems. In these systems, it is very important to have accurate information about the position and heading angle of the vehicle at all times. The most widely implemented methods for positioning are the global positioning system (GPS), vision-based system, and magnetic marker system. Among these methods, the magnetic marker system is less vulnerable to indoor and outdoor environment conditions; moreover, it requires minimal maintenance expenses. In this paper, we present a position estimation scheme based on magnetic markers and odometry sensors for an all-wheel-steering vehicle. The heading angle of the vehicle is determined by using the position coordinates of the last two detected magnetic markers and odometer data. The instant position and heading angle of the vehicle are integrated with an extended Kalman filter to estimate the continuous position. GPS data with the real-time kinematics mode was obtained to evaluate the performance of the proposed position estimation system. The test results show that the performance of the proposed localization algorithm is accurate (mean error: 3 cm; max error: 9 cm) and reliable under unexpected missing markers or incorrect markers. PMID:27916827
DOE Office of Scientific and Technical Information (OSTI.GOV)
Worm, Esben S.; Department of Medical Physics, Aarhus University Hospital, Aarhus; Hoyer, Morten
2012-05-01
Purpose: To develop and evaluate accurate and objective on-line patient setup based on a novel semiautomatic technique in which three-dimensional marker trajectories were estimated from two-dimensional cone-beam computed tomography (CBCT) projections. Methods and Materials: Seven treatment courses of stereotactic body radiotherapy for liver tumors were delivered in 21 fractions in total to 6 patients by a linear accelerator. Each patient had two to three gold markers implanted close to the tumors. Before treatment, a CBCT scan with approximately 675 two-dimensional projections was acquired during a full gantry rotation. The marker positions were segmented in each projection. From this, the three-dimensional marker trajectories were estimated using a probability-based method. The required couch shifts for patient setup were calculated from the mean marker positions along the trajectories. A motion phantom moving with known tumor trajectories was used to examine the accuracy of the method. Trajectory-based setup was retrospectively used off-line for the first five treatment courses (15 fractions) and on-line for the last two treatment courses (6 fractions). Automatic marker segmentation was compared with manual segmentation. The trajectory-based setup was compared with setup based on conventional CBCT guidance on the markers (first 15 fractions). Results: Phantom measurements showed that trajectory-based estimation of the mean marker position was accurate within 0.3 mm. The on-line trajectory-based patient setup was performed within approximately 5 minutes. The automatic marker segmentation agreed with manual segmentation within 0.36 ± 0.50 pixels (mean ± SD; pixel size, 0.26 mm in isocenter). The accuracy of conventional volumetric CBCT guidance was compromised by motion smearing (≤21 mm) that induced an absolute three-dimensional setup error of 1.6 ± 0.9 mm (maximum, 3.2) relative to trajectory-based setup.
Conclusions: The first on-line clinical use of trajectory estimation from CBCT projections for precise setup in stereotactic body radiotherapy was demonstrated. Uncertainty in the conventional CBCT-based setup procedure was eliminated with the new method.
Ehler, Edvard; Vaněk, Daniel; Stenzl, Vlastimil; Vančata, Václav
2011-01-01
Aim: To evaluate Y-chromosomal diversity of the Moravian Valachs of the Czech Republic and compare them with a Czech population sample and other samples from Central and South-Eastern Europe, and to evaluate the effects of genetic isolation and sampling. Methods: The first sample set of the Valachs consisted of 94 unrelated male donors from the Valach region in the northeastern border area of the Czech Republic. The second sample set of the Valachs consisted of 79 men who originated from 7 paternal lineages defined by surname. No close relatives were sampled. The third sample set consisted of 273 unrelated men from the whole of the Czech Republic and was used for comparison, as were published data for 27 other populations. The total number of samples was 3244. Y-short tandem repeat (STR) markers were typed by standard methods using the PowerPlex® Y System (Promega) and Yfiler® Amplification Kit (Applied Biosystems). Y-chromosomal haplogroups were estimated from the haplotype information. Haplotype diversity and other intra- and inter-population statistics were computed. Results: The Moravian Valachs showed a lower genetic variability of Y-STR markers than other Central European populations, more closely resembling the isolated Balkan populations (Aromuns, Csango, Bulgarian, and Macedonian Roma) than the surrounding populations (Czechs, Slovaks, Poles, Saxons). We illustrated the effect of sampling on Valach paternal lineages, which includes reduction of discrimination capacity and variability inside Y-chromosomal haplogroups. The Valach modal haplotype belongs to the R1a haplogroup and was not detected in the Czech population. Conclusion: The Moravian Valachs display strong substructure and isolation in their Y-chromosomal markers. They represent a unique Central European population model for population genetics. PMID:21674832
The use of biomarkers to describe plasma-, red cell-, and blood volume from a simple blood test.
Lobigs, Louisa Margit; Sottas, Pierre-Edouard; Bourdon, Pitre Collier; Nikolovski, Zoran; El-Gingo, Mohamed; Varamenti, Evdokia; Peeling, Peter; Dawson, Brian; Schumacher, Yorck Olaf
2017-01-01
Plasma volume and red cell mass are key health markers used to monitor numerous disease states, such as heart failure, kidney disease, or sepsis. Nevertheless, there is currently no practically applicable method to easily measure absolute plasma or red cell volumes in a clinical setting. Here, a novel marker for plasma volume and red cell mass was developed through analysis of the observed variability caused by plasma volume shifts in common biochemical measures, selected based on their propensity to present with low variations over time. Once a month for 6 months, serum and whole blood samples were collected from 33 active males. Concurrently, the CO-rebreathing method was applied to determine target levels of hemoglobin mass (HbM) and blood volumes. The variability of 18 common chemistry markers and 27 Full Blood Count variables was investigated and matched to the observed plasma volume variation. After the removal of between-subject variations using a Bayesian model, multivariate analysis identified two sets of 8 and 15 biomarkers explaining 68% and 69% of plasma volume variance, respectively. The final multiparametric model contains a weighting function to allow for isolated abnormalities in single biomarkers. This proof-of-concept investigation describes a novel approach to estimate absolute vascular volumes with a simple blood test. Despite the physiological instability of critically ill patients, it is hypothesized that the model, with its multiparametric approach and weighting function, maintains the capacity to describe vascular volumes. This model has potential to transform volume management in clinical settings. Am. J. Hematol. 92:62-67, 2017. © 2016 Wiley Periodicals, Inc.
Van Inghelandt, Delphine; Melchinger, Albrecht E; Lebreton, Claude; Stich, Benjamin
2010-05-01
Information about the genetic diversity and population structure in elite breeding material is of fundamental importance for the improvement of crops. The objectives of our study were to (a) examine the population structure and the genetic diversity in elite maize germplasm based on simple sequence repeat (SSR) markers, (b) compare these results with those obtained from single nucleotide polymorphism (SNP) markers, and (c) compare the coancestry coefficient calculated from pedigree records with genetic distance estimates calculated from SSR and SNP markers. Our study was based on 1,537 elite maize inbred lines genotyped with 359 SSR and 8,244 SNP markers. The average number of alleles per locus, of group-specific alleles, and the gene diversity (D) were higher for SSRs than for SNPs. Modified Rogers' distance (MRD) estimates and membership probabilities of the STRUCTURE matrices were higher for SSR than for SNP markers, but the germplasm organization in four heterotic pools was consistent with STRUCTURE results based on SSRs and SNPs. MRD estimates calculated for the two marker systems were highly correlated (0.87). Our results suggested that the same conclusions regarding the structure and the diversity of heterotic pools could be drawn from both marker types. Furthermore, although our results suggested that the ratio of the number of SSRs and SNPs required to obtain MRD or D estimates with similar precision is not constant across the various precision levels, we propose that between 7 and 11 times more SNPs than SSRs should be used for analyzing population structure and genetic diversity.
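For readers unfamiliar with the distance measure used in this abstract, the sketch below computes the modified Rogers' distance between two lines in its commonly cited form, the square root of the summed squared allele-frequency differences divided by twice the number of loci. The abstract does not give the formula, so this definition, the function name, and the toy genotypes are assumptions for illustration only.

```python
from math import sqrt
from typing import List

# One inner list of allele frequencies per locus, e.g. [1.0, 0.0] for a
# homozygous inbred line at a biallelic SNP.
Frequencies = List[List[float]]


def modified_rogers_distance(p: Frequencies, q: Frequencies) -> float:
    """MRD = sqrt( sum over loci and alleles of (p - q)^2 / (2 * number of loci) )."""
    if len(p) != len(q) or not p:
        raise ValueError("both inputs need the same, non-zero number of loci")
    total = 0.0
    for locus_p, locus_q in zip(p, q):
        if len(locus_p) != len(locus_q):
            raise ValueError("allele lists must align within each locus")
        total += sum((a - b) ** 2 for a, b in zip(locus_p, locus_q))
    return sqrt(total / (2 * len(p)))


# Two hypothetical inbred lines scored at four biallelic SNPs;
# they carry different alleles at the second and fourth locus.
line_a = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]
line_b = [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0], [0.0, 1.0]]

distance = modified_rogers_distance(line_a, line_b)
print(round(distance, 4))  # 0.7071, the square root of the fraction of differing loci
```

For fully homozygous lines the squared distance reduces to the proportion of loci at which the two lines differ, which is why it ranges from 0 (identical) to 1 (different at every locus).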
Greenhouse, Bryan; Dokomajilar, Christian; Hubbard, Alan; Rosenthal, Philip J; Dorsey, Grant
2007-09-01
Antimalarial clinical trials use genotyping techniques to distinguish new infection from recrudescence. In areas of high transmission, the accuracy of genotyping may be compromised due to the high number of infecting parasite strains. We compared the accuracies of genotyping methods, using up to six genotyping markers, to assign outcomes for two large antimalarial trials performed in areas of Africa with different transmission intensities. We then estimated the probability of genotyping misclassification and its effect on trial results. At a moderate-transmission site, three genotyping markers were sufficient to generate accurate estimates of treatment failure. At a high-transmission site, even with six markers, estimates of treatment failure were 20% for amodiaquine plus artesunate and 17% for artemether-lumefantrine, regimens expected to be highly efficacious. Of the observed treatment failures for these two regimens, we estimated that at least 45% and 35%, respectively, were new infections misclassified as recrudescences. Increasing the number of genotyping markers improved the ability to distinguish new infection from recrudescence at a moderate-transmission site, but using six markers appeared inadequate at a high-transmission site. Genotyping-adjusted estimates of treatment failure from high-transmission sites may represent substantial overestimates of the true risk of treatment failure.
Curk, Franck; Ancillo, Gema; Ollitrault, Frédérique; Perrier, Xavier; Jacquemoud-Collet, Jean-Pierre; Garcia-Lor, Andres; Navarro, Luis; Ollitrault, Patrick
2015-01-01
Most cultivated Citrus species originated from interspecific hybridisation between four ancestral taxa (C. reticulata, C. maxima, C. medica, and C. micrantha) with limited further interspecific recombination due to vegetative propagation. This evolution resulted in admixture genomes with frequent interspecific heterozygosity. Moreover, a major part of the phenotypic diversity of edible citrus results from the initial differentiation between these taxa. Deciphering the phylogenomic structure of citrus germplasm is therefore essential for an efficient utilization of citrus biodiversity in breeding schemes. The objective of this work was to develop a set of species-diagnostic single nucleotide polymorphism (SNP) markers for the four Citrus ancestral taxa covering the nine chromosomes, and to use these markers to infer the phylogenomic structure of secondary species and modern cultivars. Species-diagnostic SNPs were mined from 454 amplicon sequencing of 57 gene fragments from 26 genotypes of the four basic taxa. Of the 1,053 SNPs mined from 28,507 kb sequence, 273 were found to be highly diagnostic for a single basic taxon. Species-diagnostic SNP markers (105) were used to analyse the admixture structure of varieties and rootstocks. This revealed C. maxima introgressions in most of the old and in all recent selections of mandarins, and suggested that C. reticulata × C. maxima reticulation and introgression processes were important in edible mandarin domestication. The large range of phylogenomic constitutions between C. reticulata and C. maxima revealed in mandarins, tangelos, tangors, sweet oranges, sour oranges, grapefruits, and orangelos is favourable for genetic association studies based on phylogenomic structures of the germplasm. Inferred admixture structures were in agreement with previous hypotheses regarding the origin of several secondary species and also revealed the probable origin of several acid citrus varieties. 
The developed species-diagnostic SNP marker set will be useful for systematic estimation of admixture structure of citrus germplasm and for diverse genetic studies. PMID:25973611
Ferrer, E S; García-Navas, V; Sanz, J J; Ortego, J
2014-11-01
Understanding the importance of host genetic diversity for coping with parasites and infectious diseases is a long-standing goal in evolutionary biology. Here, we study the association between probability of infection by avian malaria (Plasmodium relictum) and individual genetic diversity in three blue tit (Cyanistes caeruleus) populations that strongly differ in prevalence of this parasite. For this purpose, we screened avian malaria infections and genotyped 789 blue tits across 26 microsatellite markers. We used two different arrays of markers: 14 loci classified as neutral and 12 loci classified as putatively functional. We found a significant relationship between probability of infection and host genetic diversity estimated at the subset of neutral markers that was not explained by strong local effects and did not differ among the studied populations. This relationship was not linear, and probability of infection increased up to values of homozygosity by locus (HL) around 0.15, reached a plateau at values of HL from 0.15 to 0.40 and finally declined among a small proportion of highly homozygous individuals (HL > 0.4). We did not find evidence for significant identity disequilibrium, which may have resulted from a low variance of inbreeding in the study populations and/or the small power of our set of markers to detect it. A combination of subtle positive and negative local effects and/or a saturation threshold in the association between probability of infection and host genetic diversity in combination with increased resistance to parasites in highly homozygous individuals may explain the observed negative quadratic relationship. Overall, our study highlights that parasites play an important role in shaping host genetic variation and suggests that the use of large sets of neutral markers may be more appropriate for the study of heterozygosity-fitness correlations. © 2014 European Society For Evolutionary Biology. 
Journal of Evolutionary Biology © 2014 European Society For Evolutionary Biology.
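The homozygosity-by-locus (HL) index used in this abstract can be illustrated with a short sketch. It assumes the commonly cited definition HL = ΣE_h / (ΣE_h + ΣE_j), where E_h and E_j are the expected heterozygosities of the loci an individual carries in homozygosis and heterozygosis. The abstract does not spell out this formula, and the allele frequencies and genotype below are hypothetical.

```python
def expected_heterozygosity(allele_freqs):
    """Expected heterozygosity of one locus: 1 - sum of squared allele frequencies."""
    return 1.0 - sum(p * p for p in allele_freqs)


def homozygosity_by_locus(genotype, locus_allele_freqs):
    """HL for one individual.

    genotype: list of (allele1, allele2) pairs, one per locus.
    locus_allele_freqs: list of allele-frequency lists, one per locus.
    Assumed definition: sum(E at homozygous loci) / sum(E at all typed loci).
    """
    e_hom = 0.0
    e_het = 0.0
    for (a1, a2), freqs in zip(genotype, locus_allele_freqs):
        e = expected_heterozygosity(freqs)
        if a1 == a2:
            e_hom += e
        else:
            e_het += e
    return e_hom / (e_hom + e_het)


# Hypothetical population allele frequencies at three microsatellite loci
freqs = [[0.5, 0.5], [0.25, 0.25, 0.5], [0.9, 0.1]]
# Hypothetical individual: homozygous at the first locus only
individual = [(1, 1), (1, 2), (1, 2)]
hl = homozygosity_by_locus(individual, freqs)
```

Weighting by expected heterozygosity means that homozygosity at a highly variable locus counts for more than homozygosity at a nearly monomorphic one.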
Clark, Samuel A; Hickey, John M; Daetwyler, Hans D; van der Werf, Julius H J
2012-02-09
The theory of genomic selection is based on the prediction of the effects of genetic markers in linkage disequilibrium with quantitative trait loci. However, genomic selection also relies on relationships between individuals to accurately predict genetic value. This study aimed to examine the importance of information on relatives versus that of unrelated or more distantly related individuals on the estimation of genomic breeding values. Simulated and real data were used to examine the effects of various degrees of relationship on the accuracy of genomic selection. Genomic Best Linear Unbiased Prediction (gBLUP) was compared to two pedigree based BLUP methods, one with a shallow one generation pedigree and the other with a deep ten generation pedigree. The accuracy of estimated breeding values for different groups of selection candidates that had varying degrees of relationships to a reference data set of 1750 animals was investigated. The gBLUP method predicted breeding values more accurately than BLUP. The most accurate breeding values were estimated using gBLUP for closely related animals. Similarly, the pedigree based BLUP methods were also accurate for closely related animals, however when the pedigree based BLUP methods were used to predict unrelated animals, the accuracy was close to zero. In contrast, gBLUP breeding values, for animals that had no pedigree relationship with animals in the reference data set, allowed substantial accuracy. An animal's relationship to the reference data set is an important factor for the accuracy of genomic predictions. Animals that share a close relationship to the reference data set had the highest accuracy from genomic predictions. However a baseline accuracy that is driven by the reference data set size and the overall population effective population size enables gBLUP to estimate a breeding value for unrelated animals within a population (breed), using information previously ignored by pedigree based BLUP methods.
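The difference between gBLUP and pedigree BLUP comes down to which relationship matrix is used. As a minimal sketch, the following builds a genomic relationship matrix with the widely used VanRaden-style construction G = ZZ' / (2 Σ p(1 - p)). The abstract does not state which construction the authors used, so this is an assumption, and the genotypes are toy values.

```python
def genomic_relationship_matrix(genotypes):
    """VanRaden-style G matrix from genotypes coded as 0/1/2 allele counts.

    genotypes: list of per-animal lists, one allele count per marker.
    """
    n = len(genotypes)
    m = len(genotypes[0])
    # Frequency of the counted allele at each marker
    freqs = [sum(row[j] for row in genotypes) / (2 * n) for j in range(m)]
    # Centre each genotype by twice the allele frequency
    z = [[row[j] - 2 * freqs[j] for j in range(m)] for row in genotypes]
    # Scale so that G is comparable to a pedigree relationship matrix
    scale = 2 * sum(p * (1 - p) for p in freqs)
    return [
        [sum(z[a][j] * z[b][j] for j in range(m)) / scale for b in range(n)]
        for a in range(n)
    ]


# Toy data: animals 0 and 1 share identical genotypes, animal 2 differs
toy = [
    [0, 1, 2, 1],
    [0, 1, 2, 1],
    [2, 1, 0, 1],
]
G = genomic_relationship_matrix(toy)
```

Animals with no recorded pedigree link can still receive non-zero entries in G, which is consistent with the baseline accuracy for unrelated animals described above.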
Guo, Jing; Chen, Shangxiang; Li, Shun; Sun, Xiaowei; Li, Wei; Zhou, Zhiwei; Chen, Yingbo; Xu, Dazhi
2018-01-12
Several studies have highlighted the prognostic value of individual tumor markers and their various combinations for gastric cancer (GC). Our study was designed to establish a novel model incorporating carcino-embryonic antigen (CEA), carbohydrate antigen 19-9 (CA19-9), and carbohydrate antigen 72-4 (CA72-4). A total of 1,566 GC patients (Primary cohort) between Jan 2000 and July 2013 were analyzed. The Primary cohort was randomly divided into a Training set (n=783) and a Validation set (n=783). A three-tumor marker classifier was developed in the Training set and validated in the Validation set by multivariate regression and risk-score analysis. We identified a three-tumor marker classifier (including CEA, CA19-9 and CA72-4) for the cancer specific survival (CSS) of GC (p<0.001). Consistent results were obtained in both the Training set and the Validation set. Multivariate analysis showed that the classifier was an independent predictor of GC (all p values <0.001 in the Training set, Validation set and Primary cohort). Furthermore, when the leave-one-out approach was performed, the classifier showed superior predictive value to the individual markers or any two of them (with the highest AUC (Area Under Curve): 0.618 for the Training set and 0.625 for the Validation set), which ascertained its predictive value. Our three-tumor marker classifier is closely associated with the CSS of GC and may serve as a novel model for future decisions concerning treatments.
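To make the reported AUC values concrete, here is a minimal sketch of how an AUC can be computed for a risk score. It uses the rank-based (Mann-Whitney) formulation for a binary outcome. The scores and outcomes are hypothetical, and the abstract does not describe the authors' exact scoring or survival-based evaluation.

```python
def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison of cases and non-cases.

    scores: risk scores (higher means higher predicted risk).
    labels: 1 for an event, 0 for no event.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    # A case scoring above a non-case counts 1, a tie counts 0.5
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical risk score: number of elevated markers among CEA, CA19-9, CA72-4
risk_scores = [3, 2, 2, 1, 0, 1]
events = [1, 1, 0, 1, 0, 0]
example_auc = auc(risk_scores, events)
```

An AUC of 0.5 corresponds to chance-level discrimination, which gives a reference point for the reported values of 0.618 and 0.625.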
Gaunt, Tom R; Rodriguez, Santiago; Zapata, Carlos; Day, Ian NM
2006-01-01
Background: Various software tools are available for the display of pairwise linkage disequilibrium across multiple single nucleotide polymorphisms. The HapMap project also presents these graphics within their website. However, these approaches are limited in their use of data from multiallelic markers and provide limited information in a graphical form. Results: We have developed a software package (MIDAS – Multiallelic Interallelic Disequilibrium Analysis Software) for the estimation and graphical display of interallelic linkage disequilibrium. Linkage disequilibrium is analysed for each allelic combination (of one allele from each of two loci), between all pairwise combinations of any type of multiallelic loci in a contig (or any set) of many loci (including single nucleotide polymorphisms, microsatellites, minisatellites and haplotypes). Data are presented graphically in a novel and informative way, and can also be exported in tabular form for other analyses. This approach facilitates visualisation of patterns of linkage disequilibrium across genomic regions, analysis of the relationships between different alleles of multiallelic markers and inferences about patterns of evolution and selection. Conclusion: MIDAS is a linkage disequilibrium analysis program with a comprehensive graphical user interface providing novel views of patterns of linkage disequilibrium between all types of multiallelic and biallelic markers. Availability: Available from and PMID:16643648
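Interallelic linkage disequilibrium, as described above, is evaluated for one allele from each of two loci. The sketch below computes the standard D, D' and r² statistics for a single allele pair. It assumes phased haplotypes are already available and is not MIDAS's own estimation code. The allele labels are hypothetical.

```python
def interallelic_ld(haplotypes, allele_a, allele_b):
    """Return (D, D', r^2) for allele_a at locus 1 and allele_b at locus 2.

    haplotypes: list of (locus1_allele, locus2_allele) tuples, assumed phased.
    Each allele is treated as "this allele" versus "all others", so the
    loci may be multiallelic.
    """
    n = len(haplotypes)
    p_a = sum(1 for a, _ in haplotypes if a == allele_a) / n
    p_b = sum(1 for _, b in haplotypes if b == allele_b) / n
    p_ab = sum(1 for a, b in haplotypes if a == allele_a and b == allele_b) / n
    d = p_ab - p_a * p_b
    # Largest |D| attainable given the allele frequencies
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max if d_max > 0 else 0.0
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else 0.0
    return d, d_prime, r2


# Hypothetical data: a microsatellite (alleles 120/124/128) and a SNP (A/G)
haps = [(120, "A")] * 4 + [(124, "G")] * 3 + [(128, "G")] * 2 + [(128, "A")]
d, d_prime, r2 = interallelic_ld(haps, 120, "A")
```

Looping this function over every allele pair of every locus pair would give the kind of table that the package is described as exporting.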
David A. Tallmon; Dave Gregovich; Robin S. Waples; C. Scott Baker; Jennifer Jackson; Barbara L. Taylor; Eric Archer; Karen K. Martien; Fred W. Allendorf; Michael K. Schwartz
2010-01-01
The utility of microsatellite markers for inferring population size and trend has not been rigorously examined, even though these markers are commonly used to monitor the demography of natural populations. We assessed the ability of a linkage disequilibrium estimator of effective population size (Ne) and a simple capture-recapture estimator of abundance (N) to quantify...
Calder, Philip C; Boobis, Alan; Braun, Deborah; Champ, Claire L; Dye, Louise; Einöther, Suzanne; Greyling, Arno; Matthys, Christophe; Putz, Peter; Wopereis, Suzan; Woodside, Jayne V; Antoine, Jean-Michel
2017-06-01
The conduct of high-quality nutrition research requires the selection of appropriate markers as outcomes, for example as indicators of food or nutrient intake, nutritional status, health status or disease risk. Such selection requires detailed knowledge of the markers, and consideration of the factors that may influence their measurement, other than the effects of nutritional change. A framework to guide selection of markers within nutrition research studies would be a valuable tool for researchers. A multidisciplinary Expert Group set out to test criteria designed to aid the evaluation of candidate markers for their usefulness in nutrition research and subsequently to develop a scoring system for markers. The proposed criteria were tested using thirteen markers selected from a broad range of nutrition research fields. The result of this testing was a modified list of criteria and a template for evaluating a potential marker against the criteria. Subsequently, a semi-quantitative system for scoring a marker and an associated template were developed. This system will enable the evaluation and comparison of different candidate markers within the same field of nutrition research in order to identify their relative usefulness. The ranking criteria of proven, strong, medium or low are likely to vary according to research setting, research field and the type of tool used to assess the marker and therefore the considerations for scoring need to be determined in a setting-, field- and tool-specific manner. A database of such markers, their interpretation and range of possible values would be valuable to nutrition researchers.
Hinze, Lori L; Fang, David D; Gore, Michael A; Scheffler, Brian E; Yu, John Z; Frelichowski, James; Percy, Richard G
2015-02-01
A core marker set containing markers developed to be informative within a single commercial cotton species can elucidate diversity structure within a multi-species subset of the Gossypium germplasm collection. An understanding of the genetic diversity of cotton (Gossypium spp.) as represented in the US National Cotton Germplasm Collection is essential to develop strategies for collecting, conserving, and utilizing these germplasm resources. The US collection is one of the largest world collections and includes not only accessions with improved yield and fiber quality within cultivated species, but also accessions possessing sources of abiotic and biotic stress resistance often found in wild species. We evaluated the genetic diversity of a subset of 272 diploid and 1,984 tetraploid accessions in the collection (designated the Gossypium Diversity Reference Set) using a core set of 105 microsatellite markers. Utility of the core set of markers in differentiating intra-genome variation was much greater in commercial tetraploid genomes (99.7 % polymorphic bands) than in wild diploid genomes (72.7 % polymorphic bands), and may have been influenced by pre-selection of markers for effectiveness in the commercial species. Principal coordinate analyses revealed that the marker set differentiated interspecific variation among tetraploid species, but was only capable of partially differentiating among species and genomes of the wild diploids. Putative species-specific marker bands in G. hirsutum (73) and G. barbadense (81) were identified that could be used for qualitative identification of misclassifications, redundancies, and introgression within commercial tetraploid species. The results of this broad-scale molecular characterization are essential to the management and conservation of the collection and provide insight and guidance in the use of the collection by the cotton research community in their cotton improvement efforts.
Algorithms and Complexity Results for Genome Mapping Problems.
Rajaraman, Ashok; Zanetti, Joao Paulo Pereira; Manuch, Jan; Chauve, Cedric
2017-01-01
Genome mapping algorithms aim at computing an ordering of a set of genomic markers based on local ordering information such as adjacencies and intervals of markers. In most genome mapping models, markers are assumed to occur uniquely in the resulting map. We introduce algorithmic questions that consider repeats, i.e., markers that can have several occurrences in the resulting map. We show that, provided with an upper bound on the copy number of repeated markers and with intervals that span full repeat copies, called repeat spanning intervals, the problem of deciding if a set of adjacencies and repeat spanning intervals admits a genome representation is tractable if the target genome can contain linear and/or circular chromosomal fragments. We also show that extracting a maximum cardinality or weight subset of repeat spanning intervals given a set of adjacencies that admits a genome realization is NP-hard but fixed-parameter tractable in the maximum copy number and the number of adjacent repeats, and tractable if intervals contain a single repeated marker.
Thomas, E; Sexton, J; Helmreich, R
2004-01-01
Improving teamwork in healthcare may help reduce and manage errors. This paper takes a step toward that goal by (1) proposing a set of teamwork behaviours, or behavioural markers, for neonatal resuscitation; (2) presenting a data form for recording observations about these markers; and (3) comparing and contrasting different sets of teamwork behaviours that have been developed for healthcare. Data from focus groups of neonatal providers, surveys, and video recordings of neonatal resuscitations were used to identify some new teamwork behaviours, to translate existing aviation team behaviours to this setting, and to develop a data collection form. This behavioural marker audit form for neonatal resuscitation lists and defines 10 markers that describe specific, observable behaviours seen during the resuscitation of newborn infants. These markers are compared with those developed by other groups. Future research should determine the relations among these behaviours and errors, and test their usefulness in measuring the impact of team training interventions. PMID:15465957
Xu, Jason; Guttorp, Peter; Kato-Maeda, Midori; Minin, Vladimir N
2015-12-01
Continuous-time birth-death-shift (BDS) processes are frequently used in stochastic modeling, with many applications in ecology and epidemiology. In particular, such processes can model evolutionary dynamics of transposable elements, which are important genetic markers in molecular epidemiology. Estimation of the effects of individual covariates on the birth, death, and shift rates of the process can be accomplished by analyzing patient data, but inferring these rates in a discretely and unevenly observed setting presents computational challenges. We propose a multi-type branching process approximation to BDS processes and develop a corresponding expectation maximization algorithm, where we use spectral techniques to reduce calculation of expected sufficient statistics to low-dimensional integration. These techniques yield an efficient and robust optimization routine for inferring the rates of the BDS process, and apply broadly to multi-type branching processes whose rates can depend on many covariates. After rigorously testing our methodology in simulation studies, we apply our method to study intrapatient time evolution of the IS6110 transposable element, a genetic marker frequently used during estimation of epidemiological clusters of Mycobacterium tuberculosis infections. © 2015, The International Biometric Society.
High-resolution Doppler model of the human gait
NASA Astrophysics Data System (ADS)
Geisheimer, Jonathan L.; Greneker, Eugene F., III; Marshall, William S.
2002-07-01
A high resolution Doppler model of the walking human was developed for analyzing the continuous wave (CW) radar gait signature. Data for twenty subjects were collected simultaneously using an infrared motion capture system along with a two channel 10.525 GHz CW radar. The motion capture system recorded three-dimensional coordinates of infrared markers placed on the body. These body marker coordinates were used as inputs to create the theoretical Doppler output using a model constructed in MATLAB. The outputs of the model are the simulated Doppler signals due to each of the major limbs and the thorax. An estimated radar cross section for each part of the body was assigned using the Lund & Browder chart of estimated body surface area. The resultant Doppler model was then compared with the actual recorded Doppler gait signature in the frequency domain using the spectrogram. Comparison of the two sets of data has revealed several identifiable biomechanical features in the radar gait signature due to leg and body motion. The result of the research shows that a wealth of information can be unlocked from the radar gait signature, which may be useful in security and biometric applications.
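The link between marker motion and the simulated Doppler signal can be illustrated with the standard monostatic radar relation f_d = 2 v f_0 / c. This is a simplified sketch in Python rather than the authors' MATLAB model. The velocity values are hypothetical, and only the 10.525 GHz carrier comes from the abstract.

```python
SPEED_OF_LIGHT = 299_792_458.0  # m/s
CARRIER_HZ = 10.525e9           # CW radar frequency given in the abstract


def doppler_shift(radial_velocity, carrier_hz=CARRIER_HZ):
    """Doppler shift in Hz for a scatterer moving toward (+) or away (-) from the radar."""
    return 2.0 * radial_velocity * carrier_hz / SPEED_OF_LIGHT


def radial_velocity(pos_prev, pos_next, dt):
    """Finite-difference velocity along the radar line of sight (taken as the x axis)."""
    return (pos_prev[0] - pos_next[0]) / dt


# Hypothetical thorax marker closing on the radar by 1.5 cm per 10 ms frame
v = radial_velocity((5.000, 0.0, 1.2), (4.985, 0.0, 1.2), 0.01)
shift = doppler_shift(v)
```

A walking speed of about 1.5 m/s therefore maps to a shift of roughly 105 Hz at this carrier, while faster-moving limbs produce higher-frequency components in the spectrogram.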
Diurnal Variation of Hormonal and Lipid Biomarkers in a Molecular Epidemiology-Like Setting.
van Kerkhof, Linda W M; Van Dycke, Kirsten C G; Jansen, Eugene H J M; Beekhof, Piet K; van Oostrom, Conny T M; Ruskovska, Tatjana; Velickova, Nevenka; Kamcev, Nikola; Pennings, Jeroen L A; van Steeg, Harry; Rodenburg, Wendy
2015-01-01
Many molecular epidemiology studies focusing on high prevalent diseases, such as metabolic disorders and cancer, investigate metabolic and hormonal markers. In general, sampling for these markers can occur at any time-point during the day or after an overnight fast. However, environmental factors, such as light exposure and food intake might affect the levels of these markers, since they provide input for the internal time-keeping system. When diurnal variation is larger than the inter-individual variation, time of day should be taken into account. Importantly, heterogeneity in diurnal variation and disturbance of circadian rhythms among a study population might increasingly occur as a result of our increasing 24/7 economy and related variation in exposure to environmental factors (such as light and food). The aim of the present study was to determine whether a set of often used biomarkers shows diurnal variation in a setting resembling large molecular epidemiology studies, i.e., non-fasted and limited control possibilities for other environmental influences. We show that markers for which diurnal variation is not an issue are adrenocorticotropic hormone, follicle stimulating hormone, estradiol and high-density lipoprotein. For all other tested markers diurnal variation was observed in at least one gender (cholesterol, cortisol, dehydroepiandrosterone sulfate, free fatty acids, low-density lipoprotein, luteinizing hormone, prolactin, progesterone, testosterone, triglycerides, total triiodothyronine and thyroid-stimulating hormone) or could not reliably be detected (human growth hormone). Thus, studies investigating these markers should take diurnal variation into account, for which we provide some options. 
Furthermore, our study indicates the need for investigating diurnal variation (in literature or experimentally) before setting up studies measuring markers in routine and controlled settings, especially since time-of-day likely matters for many more markers than the ones investigated in the present study.
Biomagnetic techniques for evaluating gastric emptying, peristaltic contraction and transit time
la Roca-Chiapas, Jose María De; Cordova-Fraga, Teodoro
2011-01-01
Biomagnetic techniques were used to measure motility in various parts of the gastrointestinal (GI) tract, particularly a new technique for detecting magnetic markers and tracers. A coil was used to enhance the signal from a magnetic tracer in the GI tract and the signal was detected using a fluxgate magnetometer or a magnetoresistor in an unshielded room. Estimates of esophageal transit time were affected by the position of the subject. The reproducibility of estimates derived using the new biomagnetic technique was greater than 85% and it yielded estimates similar to those obtained using scintigraphy. This technique is suitable for studying the effect of emotional state on GI physiology and for measuring GI transit time. The biomagnetic technique can be used to evaluate digesta transit time in the esophagus, stomach and colon, peristaltic frequency and gastric emptying and is easy to use in the hospital setting. PMID:22025978
Biomagnetic techniques for evaluating gastric emptying, peristaltic contraction and transit time.
la Roca-Chiapas, Jose María De; Cordova-Fraga, Teodoro
2011-10-15
Biomagnetic techniques were used to measure motility in various parts of the gastrointestinal (GI) tract, particularly a new technique for detecting magnetic markers and tracers. A coil was used to enhance the signal from a magnetic tracer in the GI tract and the signal was detected using a fluxgate magnetometer or a magnetoresistor in an unshielded room. Estimates of esophageal transit time were affected by the position of the subject. The reproducibility of estimates derived using the new biomagnetic technique was greater than 85% and it yielded estimates similar to those obtained using scintigraphy. This technique is suitable for studying the effect of emotional state on GI physiology and for measuring GI transit time. The biomagnetic technique can be used to evaluate digesta transit time in the esophagus, stomach and colon, peristaltic frequency and gastric emptying and is easy to use in the hospital setting.
Belinchón, Rocío; Ellis, Christopher J; Yahr, Rebecca
2014-11-01
Microsatellite markers were characterized for two epiphytic cyanolichens, Nephroma laevigatum and N. parile (Nephromataceae), and will be used to investigate population structure and estimate gene flow among populations of these two closely related species with contrasting dispersal modes. • Twelve and 14 microsatellite loci were characterized for N. laevigatum and N. parile, respectively. Allele number in N. laevigatum ranged from three to 13 per locus, while in N. parile there were from two to six alleles per locus. As expected, the sexually reproducing N. laevigatum had higher genetic diversity than the predominantly asexual N. parile. • This new set of markers is suitable for studying population structure and providing insights into gene flow among populations and for understanding processes of diversification. Compared between the species, they will facilitate an understanding of the influence of contrasting reproductive strategies on population and community structure.
Sobel, E.; Lange, K.
1996-01-01
The introduction of stochastic methods in pedigree analysis has enabled geneticists to tackle computations intractable by standard deterministic methods. Until now these stochastic techniques have worked by running a Markov chain on the set of genetic descent states of a pedigree. Each descent state specifies the paths of gene flow in the pedigree and the founder alleles dropped down each path. The current paper follows up on a suggestion by Elizabeth Thompson that genetic descent graphs offer a more appropriate space for executing a Markov chain. A descent graph specifies the paths of gene flow but not the particular founder alleles traveling down the paths. This paper explores algorithms for implementing Thompson's suggestion for codominant markers in the context of automatic haplotyping, estimating location scores, and computing gene-clustering statistics for robust linkage analysis. Realistic numerical examples demonstrate the feasibility of the algorithms. PMID:8651310
Mukherjee, S.; Saha, A.; Kumar P., Senthil; Chandak, G. R.; Majumder, P. P.; Ray, K.
2006-01-01
Hemophilia B, an X-linked recessive bleeding disorder, is caused by heterogeneous mutations in the factor IX (F9) gene. Hence, carriers of the disease are usually detected by F9 gene linked RFLP analysis. We aimed to test a set of RFLP markers (DdeI, XmnI, MnlI, TaqI & HhaI), used worldwide for carrier detection, to estimate its heterozygosity in different population groups of India, and identify additional single nucleotide polymorphisms (SNPs) if necessary. A total of 8 population groups encompassing different regions of India, consisting of 107 unrelated normal females without any history of hemophilia B in the family and 13 unrelated obligate carriers were recruited in the study. Regions of F9 gene were amplified by PCR from genomic DNA of the donors followed by restriction enzyme digestion and/or sequencing as appropriate. Combined informativeness for the markers varied between 52–86% among normal females belonging to different geographical locations of India. Haplotype analysis revealed that the most prevalent haplotype lacked the restriction sites for all five RFLP markers. Screening regions of F9 gene that harbor 10 SNPs reported in dbSNP yielded only two SNPs, which increased the overall informativeness in each population group and heterozygosity in the obligate carriers for the disease from 38% to 69%. Our data show that heterozygosity of commonly used RFLP markers is remarkably variable across different regions of India. Thus prudent selection of the markers based on specific population groups including usage of additional markers is recommended for efficient carrier detection. PMID:17264403
Adjusting for founder relatedness in a linkage analysis using prior information.
Sheehan, N A; Egeland, T
2008-01-01
In genetic linkage studies, while the pedigrees are generally known, background relatedness between the founding individuals, assumed by definition to be unrelated, can seriously affect the results of the analysis. Likelihood approaches to relationship estimation from genetic marker data can all be expressed in terms of finding the most likely pedigree connecting the individuals of interest. When the true relationship is the main focus, the set of all possible alternative pedigrees can be too large to consider. However, prior information is often available which, when incorporated in a formal and structured way, can restrict this set to a manageable size thus enabling the calculation of a posterior distribution from which inferences can be drawn. Here, the unknown relationships are more of a nuisance factor than of interest in their own right, so the focus is on adjusting the results of the analysis rather than on direct estimation. In this paper, we show how prior information on founder relationships can be exploited in some applications to generate a set of candidate extended pedigrees. We then weight the relevant pedigree-specific likelihoods by their posterior probabilities to adjust the lod score statistics. (c) 2007 S. Karger AG, Basel
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bertholet, Jenny, E-mail: jennbe@rm.dk; Worm, Esben S.; Fledelius, Walther
Purpose: Image guided liver stereotactic body radiation therapy (SBRT) often relies on implanted fiducial markers. The target localization accuracy decreases with increased marker-target distance. This may occur partly because of liver rotations. The aim of this study was to examine time-resolved translations and rotations of liver marker constellations and investigate if time-resolved intrafraction rotational corrections can improve localization accuracy in liver SBRT. Methods and Materials: Twenty-nine patients with 3 implanted markers received SBRT in 3 to 6 fractions. The time-resolved trajectory of each marker was estimated from the projections of 1 to 3 daily cone beam computed tomography scans and used to calculate the translation and rotation of the marker constellation. In all cone beam computed tomography projections, the time-resolved position of each marker was predicted from the position of another surrogate marker by assuming that the marker underwent either (1) the same translation as the surrogate marker; or (2) the same translation as the surrogate marker corrected by the rotation of the marker constellation. The localization accuracy was quantified as the root-mean-square error (RMSE) between the estimated and the actual marker position. For comparison, the RMSE was also calculated when the marker's position was estimated as its mean position for all the projections. Results: The mean translational and rotational range (2nd-98th percentile) was 2.0 mm/3.9° (right-left), 9.2 mm/2.9° (superior-inferior), 4.0 mm/4.0° (anterior-posterior), and 10.5 mm (3-dimensional). Rotational corrections decreased the mean 3-dimensional RMSE from 0.86 mm to 0.54 mm (P<.001) and halved the RMSE increase per millimeter increase in marker distance. Conclusions: Intrafraction rotations during liver SBRT reduce the accuracy of marker-guided target localization. 
Rotational correction can improve the localization accuracy by a factor of approximately 2 for large marker-target distances.
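The RMSE comparison described in the methods can be sketched for the simplest case: predicting one marker from a surrogate under the translation-only assumption (1), against the mean-position baseline. The trajectories are hypothetical, the rotational correction (2) is not implemented here, and using each marker's mean position as the reference point is an assumption of this sketch.

```python
import math


def rmse_3d(predicted, actual):
    """Root-mean-square of the 3D distance between predicted and actual positions."""
    sq = [
        sum((p - a) ** 2 for p, a in zip(pv, av))
        for pv, av in zip(predicted, actual)
    ]
    return math.sqrt(sum(sq) / len(sq))


def mean_position(track):
    """Mean (x, y, z) over all time points of a trajectory."""
    n = len(track)
    return tuple(sum(pt[k] for pt in track) / n for k in range(3))


def predict_from_surrogate(target_track, surrogate_track):
    """Assume the target undergoes the same translation as the surrogate marker."""
    t_mean = mean_position(target_track)
    s_mean = mean_position(surrogate_track)
    return [
        tuple(t_mean[k] + (s[k] - s_mean[k]) for k in range(3))
        for s in surrogate_track
    ]


# Hypothetical trajectories in mm: mostly superior-inferior breathing motion
surrogate = [(0, 0, 0), (0, 4, 0), (0, 8, 0), (0, 4, 0)]
target = [(10, 0, 1), (10, 4, 0), (10, 8, -1), (10, 4, 0)]

rmse_surrogate = rmse_3d(predict_from_surrogate(target, surrogate), target)
rmse_baseline = rmse_3d([mean_position(target)] * len(target), target)
```

In this toy case the residual error of the translation-only prediction comes from the component of target motion that the surrogate does not share, which is the part a rotational correction would aim to capture.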
Combining markers with and without the limit of detection
Dong, Ting; Liu, Catherine Chunling; Petricoin, Emanuel F.; Tang, Liansheng Larry
2014-01-01
In this paper, we consider the combination of markers with and without the limit of detection (LOD). LOD is often encountered when measuring proteomic markers. Because of the limited detecting ability of an equipment or instrument, it is difficult to measure markers at a relatively low level. Suppose that after some monotonic transformation, the marker values approximately follow multivariate normal distributions. We propose to estimate distribution parameters while taking the LOD into account, and then combine markers using the results from the linear discriminant analysis. Our simulation results show that the ROC curve parameter estimates generated from the proposed method are much closer to the truth than simply using the linear discriminant analysis to combine markers without considering the LOD. In addition, we propose a procedure to select and combine a subset of markers when many candidate markers are available. The procedure based on the correlation among markers is different from a common understanding that a subset of the most accurate markers should be selected for the combination. The simulation studies show that the accuracy of a combined marker can be largely impacted by the correlation of marker measurements. Our methods are applied to a protein pathway dataset to combine proteomic biomarkers to distinguish cancer patients from non-cancer patients. PMID:24132938
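The combination step can be illustrated with a plain Fisher linear discriminant for two markers, whose weights are proportional to S⁻¹(μ₁ − μ₀) with S the pooled covariance. This sketch deliberately ignores the limit of detection, so it corresponds to the naive comparator rather than the proposed method, and the marker values are hypothetical.

```python
def mean_vec(rows):
    """Column means of a list of (x, y) pairs."""
    n = len(rows)
    return (sum(r[0] for r in rows) / n, sum(r[1] for r in rows) / n)


def scatter(rows, mu):
    """Sums of squares and cross-products around the mean: (sxx, sxy, syy)."""
    sxx = sum((r[0] - mu[0]) ** 2 for r in rows)
    syy = sum((r[1] - mu[1]) ** 2 for r in rows)
    sxy = sum((r[0] - mu[0]) * (r[1] - mu[1]) for r in rows)
    return sxx, sxy, syy


def lda_weights(cases, controls):
    """Weights w = S_pooled^{-1} (mu_cases - mu_controls) for two markers."""
    mu1, mu0 = mean_vec(cases), mean_vec(controls)
    a1, b1, c1 = scatter(cases, mu1)
    a0, b0, c0 = scatter(controls, mu0)
    dof = len(cases) + len(controls) - 2
    sxx, sxy, syy = (a1 + a0) / dof, (b1 + b0) / dof, (c1 + c0) / dof
    det = sxx * syy - sxy * sxy
    dx, dy = mu1[0] - mu0[0], mu1[1] - mu0[1]
    # Apply the closed-form inverse of the 2x2 pooled covariance matrix
    return ((syy * dx - sxy * dy) / det, (sxx * dy - sxy * dx) / det)


def combined_score(w, marker_values):
    """Linear combination of the two marker values."""
    return w[0] * marker_values[0] + w[1] * marker_values[1]


# Hypothetical (already transformed) values of two markers
cases = [(2.0, 1.0), (3.0, 2.0), (4.0, 1.5), (3.0, 1.5)]
controls = [(1.0, 1.0), (2.0, 2.0), (1.5, 1.0), (1.5, 2.0)]
w = lda_weights(cases, controls)
```

The second marker receives a negative weight even though its group means are equal, because it is correlated with the first. This is one way to see why correlation among markers, and not only individual accuracy, affects the combined marker.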
Report on the development of putative functional SSR and SNP markers in passion fruits.
da Costa, Zirlane Portugal; Munhoz, Carla de Freitas; Vieira, Maria Lucia Carneiro
2017-09-06
Passionflowers Passiflora edulis and Passiflora alata are diploid, outcrossing and understudied fruit bearing species. In Brazil, passion fruit cultivation began relatively recently and has earned the country an outstanding position as the world's top producer of passion fruit. The fruit's main economic value lies in the production of juice, an essential exotic ingredient in juice blends. Currently, crop improvement strategies, including those for underexploited tropical species, tend to incorporate molecular genetic approaches. In this study, we examined a set of P. edulis transcripts expressed in response to infection by Xanthomonas axonopodis, (the passion fruit's main bacterial pathogen that attacks the vines), aiming at the development of putative functional markers, i.e. SSRs (simple sequence repeats) and SNPs (single nucleotide polymorphisms). A total of 210 microsatellites were found in 998 sequences, and trinucleotide repeats were found to be the most frequent (31.4%). Of the sequences selected for designing primers, 80.9% could be used to develop SSR markers, and 60.6% SNP markers for P. alata. SNPs were all biallelic and found within 15 gene fragments of P. alata. Overall, gene fragments generated 10,003 bp. SNP frequency was estimated as one SNP every 294 bp. Polymorphism rates revealed by SSR and SNP loci were 29.4 and 53.6%, respectively. Passiflora edulis transcripts were useful for the development of putative functional markers for P. alata, suggesting a certain level of sequence conservation between these cultivated species. The markers developed herein could be used for genetic mapping purposes and also in diversity studies.
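The reported SNP density can be cross-checked with simple arithmetic. The implied SNP count below is derived from the two figures in the abstract (10,003 bp of sequence and one SNP every 294 bp). It is not a number stated by the authors.

```python
TOTAL_BP = 10_003   # total length of the gene fragments, from the abstract
BP_PER_SNP = 294    # reported spacing: one SNP every 294 bp

# Approximate number of SNPs implied by the two reported figures
implied_snps = round(TOTAL_BP / BP_PER_SNP)

# Equivalent density expressed per kilobase
snps_per_kb = 1000 / BP_PER_SNP
```

This works out to roughly 34 SNPs across the 15 gene fragments, or about 3.4 SNPs per kilobase.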
Tong, Yubing; Udupa, Jayaram K.; Torigian, Drew A.
2014-01-01
Purpose: The quantification of body fat plays an important role in the study of numerous diseases. It is common current practice to use the fat area at a single abdominal computed tomography (CT) slice as a marker of the body fat content in studying various disease processes. This paper sets out to answer three questions related to this issue which have not been addressed in the literature. At what single anatomic slice location do the areas of subcutaneous adipose tissue (SAT) and visceral adipose tissue (VAT) estimated from the slice correlate maximally with the corresponding fat volume measures? How does one ensure that the slices used for correlation calculation from different subjects are at the same anatomic location? Are there combinations of multiple slices (not necessarily contiguous) whose area sum correlates better with volume than does single slice area with volume? Methods: The authors propose a novel strategy for mapping slice locations to a standardized anatomic space so that same anatomic slice locations are identified in different subjects. The authors then study the volume-to-area correlations and determine where they become maximal. To address the third issue, the authors carry out similar correlation studies by utilizing two and three slices for calculating area sum. Results: Based on 50 abdominal CT data sets, the proposed mapping achieves significantly improved consistency of anatomic localization compared to current practice. Maximum correlations are achieved at different anatomic locations for SAT and VAT which are both different from the L4-L5 junction commonly utilized currently for single slice area estimation as a marker. Conclusions: The maximum area-to-volume correlation achieved is quite high, suggesting that it may be reasonable to estimate body fat by measuring the area of fat from a single anatomic slice at the site of maximum correlation and use this as a marker. 
The site of maximum correlation is not at L4-L5 as commonly assumed, but is more superiorly located at T12-L1 for SAT and at L3-L4 for VAT. Furthermore, the optimal anatomic locations for SAT and VAT estimation are not the same, contrary to common assumption. The proposed standardized space mapping achieves high consistency of anatomic localization by accurately managing nonlinearities in the relationships among landmarks. Multiple slices achieve greater improvement in correlation for VAT than for SAT. The optimal locations in the case of multiple slices are not contiguous. PMID:24877839
Van Inghelandt, Delphine; Melchinger, Albrecht E.; Lebreton, Claude
2010-01-01
Information about the genetic diversity and population structure in elite breeding material is of fundamental importance for the improvement of crops. The objectives of our study were to (a) examine the population structure and the genetic diversity in elite maize germplasm based on simple sequence repeat (SSR) markers, (b) compare these results with those obtained from single nucleotide polymorphism (SNP) markers, and (c) compare the coancestry coefficient calculated from pedigree records with genetic distance estimates calculated from SSR and SNP markers. Our study was based on 1,537 elite maize inbred lines genotyped with 359 SSR and 8,244 SNP markers. The average number of alleles per locus, the number of group-specific alleles, and the gene diversity (D) were higher for SSRs than for SNPs. Modified Rogers' distance (MRD) estimates and membership probabilities of the STRUCTURE matrices were higher for SSR than for SNP markers, but the germplasm organization in four heterotic pools was consistent with STRUCTURE results based on SSRs and SNPs. MRD estimates calculated for the two marker systems were highly correlated (0.87). Our results suggested that the same conclusions regarding the structure and the diversity of heterotic pools could be drawn from both marker types. Furthermore, although our results suggested that the ratio of the number of SSRs and SNPs required to obtain MRD or D estimates with similar precision is not constant across the various precision levels, we propose that between 7 and 11 times more SNPs than SSRs should be used for analyzing population structure and genetic diversity. Electronic supplementary material: The online version of this article (doi:10.1007/s00122-009-1256-2) contains supplementary material, which is available to authorized users. PMID:20063144
2013-01-01
Background Microsatellites are widely used for many genetic studies. In contrast to single nucleotide polymorphism (SNP) and genotyping-by-sequencing methods, they are readily typed in samples of low DNA quality/concentration (e.g. museum/non-invasive samples), and enable the quick, cheap identification of species, hybrids, clones and ploidy. Microsatellites also have the highest cross-species utility of all types of markers used for genotyping, but, despite this, when isolated from a single species, only a relatively small proportion will be of utility. Marker development of any type requires skill and time. The availability of sufficient “off-the-shelf” markers that are suitable for genotyping a wide range of species would not only save resources but also uniquely enable new comparisons of diversity among taxa at the same set of loci. No other marker types are capable of enabling this. We therefore developed a set of avian microsatellite markers with enhanced cross-species utility. Results We selected highly-conserved sequences with a high number of repeat units in both of two genetically distant species. Twenty-four primer sets were designed from homologous sequences that possessed at least eight repeat units in both the zebra finch (Taeniopygia guttata) and chicken (Gallus gallus). Each primer sequence was a complete match to zebra finch and, after accounting for degenerate bases, at least 86% similar to chicken. We assessed primer-set utility by genotyping individuals belonging to eight passerine and four non-passerine species. The majority of the new Conserved Avian Microsatellite (CAM) markers amplified in all 12 species tested (on average, 94% in passerines and 95% in non-passerines). This new marker set is of especially high utility in passerines, with a mean 68% of loci polymorphic per species, compared with 42% in non-passerine species. 
Conclusions When combined with previously described conserved loci, this new set of conserved markers will not only reduce the necessity and expense of microsatellite isolation for a wide range of genetic studies, including avian parentage and population analyses, but will also now enable comparisons of genetic diversity among different species (and populations) at the same set of loci, with no or reduced bias. Finally, the approach used here can be applied to other taxa in which appropriate genome sequences are available. PMID:23497230
Suchan, Tomasz; Espíndola, Anahí; Rutschmann, Sereina; Emerson, Brent C; Gori, Kevin; Dessimoz, Christophe; Arrigo, Nils; Ronikier, Michał; Alvarez, Nadir
2017-09-01
Determining phylogenetic relationships among recently diverged species has long been a challenge in evolutionary biology. Cytoplasmic DNA markers, which have been widely used, notably in the context of molecular barcoding, have not always proved successful in resolving such phylogenies. However, with the advent of next-generation-sequencing technologies and associated techniques of reduced genome representation, phylogenies of closely related species have been resolved in much greater detail in the last couple of years. Here we examine the potential and limitations of one such technique, Restriction-site Associated DNA (RAD) sequencing, a method that produces thousands of (mostly) anonymous nuclear markers, in disentangling the phylogeny of the fly genus Chiastocheta (Diptera: Anthomyiidae). In Europe, this genus encompasses seven species of seed predators, which have been widely studied in the context of their ecological and evolutionary interactions with the plant Trollius europaeus (Ranunculaceae). So far, phylogenetic analyses using mitochondrial markers have failed to resolve monophyly of most of the species from this recently diversified genus, suggesting that their taxonomy may need a revision. However, relying on a single, non-recombining marker and ignoring potential incongruences between mitochondrial and nuclear loci may provide an incomplete account of the lineage history. In this study, we applied both classical Sanger sequencing of three mtDNA regions and RAD-sequencing to reconstruct the phylogeny of the genus. In contrast to results based on mitochondrial markers, RAD-sequencing analyses retrieved the monophyly of all seven species, in agreement with the morphological species assignment. We found robust nuclear-based species assignment of individual samples, and low levels of estimated contemporary gene flow among them. 
However, despite recovering species' monophyly, interspecific relationships varied depending on the set of RAD loci considered, producing contradictory topologies. Moreover, coalescence-based phylogenetic analyses revealed low support for most of the interspecific relationships. Our results indicate that despite the higher performance of RAD-sequencing in terms of species tree resolution compared to cytoplasmic markers, reconstructing interspecific relationships among recently diverged lineages may lie beyond the possibilities offered by large sets of RAD-sequencing markers in cases of strong gene tree incongruence. Copyright © 2017 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Forbes, S.A.; Brennan, L.; Richardson, M.
1996-01-01
The gene for X-linked cleft palate (CPX) has previously been mapped in an Icelandic kindred between the unordered proximal markers DXS1002/DXS349/DXS95 and the distal marker DXYS1X, which maps to the proximal end of the X-Y homology region in Xq21.3. Using six sequence-tagged sites (STSs) within the region, a total of 91 yeast artificial chromosome (YAC) clones were isolated and overlapped in a single contig that spans approximately 3.1 Mb between DXS1002 and DXYS1X. The order of microsatellite and STS markers in this contig was established as DXS1002-DXS1168-DXS349-DXS95-DXS364-DXS1196-DXS472-DXS1217-DXYS1X. A long-range restriction map of this region was created using eight nonchimeric, overlapping YAC clones. Analysis of newly positioned polymorphic markers in recombinant individuals from the Icelandic family has enabled us to identify DXS1196 and DXS1217 as the flanking markers for CPX. The maximum physical distance containing the CPX gene has been estimated to be 2.0 Mb, which is spanned by a minimum set of five nonchimeric YAC clones. In addition, YAC end clone and STS analyses have pinpointed the location of the proximal boundary of the X-Y homology region within the map. 40 refs., 2 figs., 2 tabs.
AFLP-based genetic mapping of the “bud-flowering” trait in heather (Calluna vulgaris)
2013-01-01
Background Calluna vulgaris is one of the most important landscaping plants produced in Germany. Its enormous economic success is due to the prolonged flower attractiveness of mutants in flower morphology, the so-called bud-bloomers. In this study, we present the first genetic linkage map of C. vulgaris in which we mapped a locus of the economically highly desired trait “flower type”. Results The map was constructed in JoinMap 4.1 using 535 AFLP markers from a single mapping population. A large fraction (40%) of markers showed distorted segregation. To test the effect of segregation distortion on linkage estimation, these markers were sorted by their segregation ratio and added in groups to the data set. The plausibility of group formation was evaluated by comparison of the “two-way pseudo-testcross” and the “integrated” mapping approach. Furthermore, regression mapping was compared to the multipoint-likelihood algorithm. The majority of maps constructed by different combinations of these methods consisted of eight linkage groups corresponding to the chromosome number of C. vulgaris. Conclusions All maps confirmed the independent inheritance of the most important horticultural traits “flower type”, “flower colour”, and “leaf colour”. An AFLP marker for the most important breeding target “flower type” was identified. The presented genetic map of C. vulgaris can now serve as a basis for further molecular marker selection and map-based cloning of the candidate gene encoding the unique flower architecture of C. vulgaris bud-bloomers. PMID:23915059
Sun, Xiaochun; Ma, Ping; Mumm, Rita H.
2012-01-01
Genomic selection (GS) procedures have proven useful in estimating breeding value and predicting phenotype with genome-wide molecular marker information. However, issues of high dimensionality, multicollinearity, and the inability to deal effectively with epistasis can jeopardize accuracy and predictive ability. We, therefore, propose a new nonparametric method, pRKHS, which combines the features of supervised principal component analysis (SPCA) and reproducing kernel Hilbert spaces (RKHS) regression, with versions for traits with no/low epistasis, pRKHS-NE, to high epistasis, pRKHS-E. Instead of assigning a specific relationship to represent the underlying epistasis, the method maps genotype to phenotype in a nonparametric way, thus requiring fewer genetic assumptions. SPCA decreases the number of markers needed for prediction by filtering out low-signal markers with the optimal marker set determined by cross-validation. Principal components are computed from reduced marker matrix (called supervised principal components, SPC) and included in the smoothing spline ANOVA model as independent variables to fit the data. The new method was evaluated in comparison with current popular methods for practicing GS, specifically RR-BLUP, BayesA, BayesB, as well as a newer method by Crossa et al., RKHS-M, using both simulated and real data. Results demonstrate that pRKHS generally delivers greater predictive ability, particularly when epistasis impacts trait expression. Beyond prediction, the new method also facilitates inferences about the extent to which epistasis influences trait expression. PMID:23226325
SU-E-J-229: Magnetic Resonance Imaging of Small Fiducial Markers for Proton Beam Therapy
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hu, Y; James, J; Panda, A
2015-06-15
Purpose: For proton beam therapy, small fiducial markers are preferred for patient alignment because they interfere less with the proton beam. Visualizing small fiducial markers can be challenging in MRI. This study investigates MRI protocols for better visualization of small fiducial markers. Methods: Two carbon and two coil-shaped gold markers were placed into a gel phantom. Both carbon markers had a diameter of 1 mm and a length of 3 mm. Both gold markers had a length of 5 mm. One gold marker had a diameter of 0.5 mm and the other had a diameter of 0.75 mm. T1 VIBE, T2 SPACE, TrueFISP and susceptibility weighted (SW) images were acquired. To improve marker contrast, high spatial resolution was used to reduce the partial volume effect. Slice thickness was 1.5 mm for all four sequences and in-plane resolution was 0.6 mm for TrueFISP, 0.7 mm for T1 VIBE, and 0.8 mm for T2 SPACE and SW. For comparison, a 3D T1 VIBE image set at 3 mm slice thickness and 1.2 mm in-plane resolution was also acquired. Results: All markers were visible in all high-resolution image sets. In each image set, the marker-induced signal void was smallest (in diameter) for the carbon markers, followed by the 0.5 mm gold marker, and largest for the 0.75 mm gold marker. The SW images had the largest marker-induced signal void. However, these voids might be confused with susceptibility-gradient-induced signal voids. T1 VIBE had good visualization of markers with nicely defined edges. T2 SPACE had reasonable visualization of markers but edges were slightly blurred. TrueFISP had good visualization of markers only if they were not masked by banding artifacts. By comparison, the markers were hardly visible in the standard-resolution T1 VIBE images. Conclusion: 3D high-resolution T1 VIBE and SW have great potential in providing good visualization of small fiducial markers for proton beam therapy.
Velásquez, A V; da Silva, G G; Sousa, D O; Oliveira, C A; Martins, C M M R; Dos Santos, P P M; Balieiro, J C C; Rennó, F P; Fukushima, R S
2018-04-18
Feed intake assessment is a valuable tool for herd management decisions. The use of markers, either internal or external, is currently the most used technique for estimating feed intake in production animals. In an experiment using 10 multiparous Holstein cows fed a corn silage-based diet with a 55:45 forage-to-concentrate ratio, the average fecal recovery (FR) of TiO2 was higher than the FR of Cr2O3, and both FR were greater than unity. With internal markers, acetyl bromide lignin and cutin FR were lower than unity, and average FR for indigestible neutral detergent fiber (iNDF) and indigestible acid detergent fiber (iADF) was 1.5. The FR was unaffected by the fecal sampling procedure and appears to be an intrinsic property of each molecule and how it interacts with digesta. Of the 2 external markers, only Cr2O3 produced accurate fecal output (FO) estimates, and the same was true of dry matter digestibility (DMD) when iNDF and iADF were used. Estimates for DMD and FO were affected by sampling procedure; 72-h bulk [sub-sample from total feces collection (TFC)] sampling consistently produced accurate results. The grab (sub-samples taken at specific times during the day) sampling procedures were accurate when using either of the indigestible fibers (iNDF or iADF) to estimate DMD. However, grab sampling procedures can only be recommended when concomitant TFC is performed on at least one animal per treatment to determine FR. Under these conditions, Cr2O3 is a suitable marker for estimating FO, and iNDF and iADF are adequate for estimating DMD. Moreover, the Cr2O3+iADF marker pair produces accurate dry matter intake estimates and deserves further attention in ruminant nutrition studies. The method of dosing the external markers is extremely important and greatly affects the results. Whichever the method, it must allow the animals to display normal feeding behavior and not affect performance. 
The grab sampling procedures can replace TFC (once FR is established), which may open new possibilities for pasture-based or collectively housed animals. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Single nucleotide polymorphism markers for genetic mapping in Drosophila melanogaster
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hoskins, Roger A.; Phan, Alexander C.; Naeemuddin, Mohammed
2001-04-16
For nearly a century, genetic analysis in Drosophila melanogaster has been a powerful tool for analyzing gene function, yet Drosophila lacks the molecular genetic mapping tools that have recently revolutionized human, mouse and plant genetics. Here, we describe the systematic characterization of a dense set of molecular markers in Drosophila using an STS-based physical map of the genome. We identify 474 biallelic markers in standard laboratory strains of Drosophila that span the genome. The majority of these markers are single nucleotide polymorphisms (SNPs), and sequences for these variants are provided in an accessible format. The average density of the new markers is 1 marker per 225 kb on the autosomes and 1 marker per 1 Mb on the X chromosome. We include in this survey a set of P-element strains that provide additional utility for high-resolution mapping. We demonstrate one application of the new markers in a simple set of crosses to map a mutation in the hedgehog gene to an interval of <1 Mb. This new map resource significantly increases the efficiency and resolution of recombination mapping and will be of immediate value to the Drosophila research community.
NASA Astrophysics Data System (ADS)
Mazurek, Przemysław
2013-09-01
Matchmoving (Match Moving) is the process used to estimate camera movements so that acquired video images can be integrated with computer graphics. The movements can be estimated using pattern recognition and 2D and 3D tracking algorithms. The main problem for the workflow is the partial occlusion of markers by the actor, because manual rotoscoping is then necessary to fix the chroma-keyed footage. In the paper, the partial occlusion problem is solved using newly invented, selectively active electronic markers. A sensor network with multiple infrared links detects the occlusion state (no occlusion, partial, full) and switches the LED-based markers.
2009-01-01
Background Genomic selection (GS) uses molecular breeding values (MBV) derived from dense markers across the entire genome for selection of young animals. The accuracy of MBV prediction is important for a successful application of GS. Recently, several methods have been proposed to estimate MBV. Initial simulation studies have shown that these methods can accurately predict MBV. In this study we compared the accuracies and possible bias of five different regression methods in an empirical application in dairy cattle. Methods Genotypes of 7,372 SNP and highly accurate EBV of 1,945 dairy bulls were used to predict MBV for protein percentage (PPT) and a profit index (Australian Selection Index, ASI). Marker effects were estimated by least squares regression (FR-LS), Bayesian regression (Bayes-R), random regression best linear unbiased prediction (RR-BLUP), partial least squares regression (PLSR) and nonparametric support vector regression (SVR) in a training set of 1,239 bulls. Accuracy and bias of MBV prediction were calculated from cross-validation of the training set and tested against a test team of 706 young bulls. Results For both traits, FR-LS using a subset of SNP was significantly less accurate than all other methods which used all SNP. Accuracies obtained by Bayes-R, RR-BLUP, PLSR and SVR were very similar for ASI (0.39-0.45) and for PPT (0.55-0.61). Overall, SVR gave the highest accuracy. All methods resulted in biased MBV predictions for ASI, for PPT only RR-BLUP and SVR predictions were unbiased. A significant decrease in accuracy of prediction of ASI was seen in young test cohorts of bulls compared to the accuracy derived from cross-validation of the training set. This reduction was not apparent for PPT. Combining MBV predictions with pedigree based predictions gave 1.05 - 1.34 times higher accuracies compared to predictions based on pedigree alone. 
Some methods have largely different computational requirements, with PLSR and RR-BLUP requiring the least computing time. Conclusions The four methods which use information from all SNP namely RR-BLUP, Bayes-R, PLSR and SVR generate similar accuracies of MBV prediction for genomic selection, and their use in the selection of immediate future generations in dairy cattle will be comparable. The use of FR-LS in genomic selection is not recommended. PMID:20043835
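As an illustration of one of the compared methods, the following minimal sketch estimates marker effects by ridge regression, the computational core of RR-BLUP, and evaluates predictions in a held-out set. The fixed penalty, the simulated data, and the definitions used here (accuracy as the correlation between predicted and observed values, bias as the slope of observed on predicted) are assumptions of the sketch and are not taken from the study.

```python
import numpy as np


def rr_blup_marker_effects(genotypes, phenotypes, ridge_penalty):
    """Ridge-regression estimates of marker effects.

    genotypes: (n_animals, n_markers) allele counts coded 0/1/2.
    ridge_penalty: shrinkage parameter (in RR-BLUP, the ratio of
    residual variance to per-marker variance; fixed here by assumption).
    """
    X = np.asarray(genotypes, dtype=float)
    y = np.asarray(phenotypes, dtype=float)
    col_means = X.mean(axis=0)
    Xc = X - col_means                      # centre each marker
    y_mean = y.mean()
    lhs = Xc.T @ Xc + ridge_penalty * np.eye(Xc.shape[1])
    effects = np.linalg.solve(lhs, Xc.T @ (y - y_mean))
    return effects, col_means, y_mean


def predict_mbv(genotypes, effects, col_means, y_mean):
    """Predicted molecular breeding values for new genotypes."""
    return y_mean + (np.asarray(genotypes, dtype=float) - col_means) @ effects


def accuracy_and_bias(predicted, observed):
    """Correlation, and slope of observed on predicted (1 = unbiased)."""
    accuracy = np.corrcoef(predicted, observed)[0, 1]
    slope = np.cov(observed, predicted)[0, 1] / np.var(predicted, ddof=1)
    return accuracy, slope


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.integers(0, 3, size=(300, 50)).astype(float)   # simulated SNPs
    true_effects = rng.normal(size=50)
    y = X @ true_effects + rng.normal(size=300)
    effects, means, y_mean = rr_blup_marker_effects(X[:200], y[:200], 1.0)
    predicted = predict_mbv(X[200:], effects, means, y_mean)
    print(accuracy_and_bias(predicted, y[200:]))
```

Splitting animals into a training set and a later test set mirrors the design described in the abstract, although the real study used estimated breeding values of bulls rather than simulated phenotypes.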
Novel Filtration Markers for GFR Estimation
Inker, Lesley A.; Coresh, Josef; Levey, Andrew S.; Eckfeldt, John H.
2017-01-01
Creatinine-based glomerular filtration rate estimation (eGFRcr) has been improved and refined since the 1970s through both the Modification of Diet in Renal Disease (MDRD) Study equation in 1999 and the CKD Epidemiology Collaboration (CKD-EPI) equation in 2009, with current clinical practice dependent primarily on eGFR for accurate assessment of GFR. However, researchers and clinicians have recognized limitations of relying on creatinine as the only filtration marker, which can lead to inaccurate GFR estimates in certain populations due to the influence of non-GFR determinants of serum or plasma creatinine. Therefore, recent literature has proposed incorporation of multiple serum or plasma filtration markers into GFR estimation to improve precision and accuracy and decrease the impact of non-GFR determinants for any individual biomarker. To this end, the CKD-EPI combined creatinine-cystatin C equation (eGFRcr-cys) was developed in 2012 and demonstrated superior accuracy to equations relying on creatinine or cystatin C alone (eGFRcr or eGFRcys). Now, the focus has broadened to include additional novel filtration markers to further refine and improve GFR estimation. Beta-2-microglobulin (B2M) and beta-trace-protein (BTP) are two filtration markers with established assays that have been proposed as candidates for improving both GFR estimation and risk prediction. GFR estimating equations based on B2M and BTP have been developed and validated, with the CKD-EPI combined BTP-B2M equation (eGFRBTP-B2M) demonstrating similar performance to eGFRcr and eGFRcys. Additionally, several studies have demonstrated that both B2M and BTP are associated with outcomes in CKD patients, including cardiovascular events, ESRD and mortality. This review will primarily focus on these two biomarkers, and will highlight efforts to identify additional candidate biomarkers through metabolomics-based approaches. PMID:29333147
Criteria for selection and application of molecular markers for clinical studies of osteoarthritis.
Otterness, I G; Swindell, A C
2003-03-01
To develop criteria for the selection and application of molecular markers for the study of osteoarthritis (OA). Statistical criteria for marker selection for OA are developed. After studying more than 20 different molecular markers for monitoring OA, procedures for choosing markers for clinical studies have been developed. For a particular study, the process starts with the markers showing 'face-validity' for monitoring OA. They are next required to successfully distinguish OA patients from controls. This necessitates definition of the distribution of marker values in OA patients and controls. So far, they have been consistently log-normal. The difference (Delta) in marker values between OA and controls defines the opportunity for marker improvement. The between-visit standard deviation (S) in patients puts limits on the detection of marker changes. The two variables can be combined to estimate the practicality of a marker using a modified power analysis. The number of patients (N*) required to observe a 50% improvement with an alpha level of P=0.05 and with 80% certainty is estimated as N* = 50(S/Delta)^2. N*, S and Delta should be used to characterize and compare markers. Marker efficiency can be refined by regressing on secondary variables, such as age, sex, BMI, severity, etc. Finally, the use of two or more markers may be required to improve marker prediction of clinical outcome. Correlated markers can be used to reinforce conclusions by essentially adding replicative data. Independent, complementary markers can be used to develop associations with clinical parameters, and perhaps diagnose and monitor disease status, activities that so far have not been possible with single markers.
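The sample-size rule quoted in the abstract, N* = 50(S/Delta)^2, is easy to evaluate for candidate markers. The short sketch below does only that. The function name is hypothetical, rounding up to a whole patient is a choice made here, and S and Delta are assumed to be expressed on the same scale (for example, log-transformed marker values, given that the distributions are described as log-normal).

```python
import math


def patients_required(between_visit_sd: float, delta: float) -> int:
    """Evaluate N* = 50 * (S / Delta)^2, rounded up to a whole patient.

    between_visit_sd: S, the between-visit standard deviation in patients.
    delta: Delta, the OA-versus-control difference in marker values.
    The constant 50 is taken directly from the abstract.
    """
    if delta == 0:
        raise ValueError("Delta must be non-zero")
    return math.ceil(50.0 * (between_visit_sd / delta) ** 2)


if __name__ == "__main__":
    # Illustrative values only: a marker whose S is half of Delta.
    print(patients_required(between_visit_sd=0.5, delta=1.0))
```

Because N* scales with the square of S/Delta, halving the between-visit variability of a marker cuts the required number of patients by a factor of four.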
2014-01-01
Background Although the X chromosome is the second largest bovine chromosome, markers on the X chromosome are not used for genomic prediction in some countries and populations. In this study, we presented a method for computing genomic relationships using X chromosome markers, investigated the accuracy of imputation from a low density (7K) to the 54K SNP (single nucleotide polymorphism) panel, and compared the accuracy of genomic prediction with and without using X chromosome markers. Methods The impact of considering X chromosome markers on prediction accuracy was assessed using data from Nordic Holstein bulls and different sets of SNPs: (a) the 54K SNPs for reference and test animals, (b) SNPs imputed from the 7K to the 54K SNP panel for test animals, (c) SNPs imputed from the 7K to the 54K panel for half of the reference animals, and (d) the 7K SNP panel for all animals. Beagle and Findhap were used for imputation. GBLUP (genomic best linear unbiased prediction) models with or without X chromosome markers and with or without a residual polygenic effect were used to predict genomic breeding values for 15 traits. Results Averaged over the two imputation datasets, correlation coefficients between imputed and true genotypes for autosomal markers, pseudo-autosomal markers, and X-specific markers were 0.971, 0.831 and 0.935 when using Findhap, and 0.983, 0.856 and 0.937 when using Beagle. Estimated reliabilities of genomic predictions based on the imputed datasets using Findhap or Beagle were very close to those using the real 54K data. Genomic prediction using all markers gave slightly higher reliabilities than predictions without X chromosome markers. Based on our data which included only bulls, using a G matrix that accounted for sex-linked relationships did not improve prediction, compared with a G matrix that did not account for sex-linked relationships. 
A model that included a polygenic effect did not recover the loss of prediction accuracy from exclusion of X chromosome markers. Conclusions The results from this study suggest that markers on the X chromosome contribute to accuracy of genomic predictions and should be used for routine genomic evaluation. PMID:25080199
Effects of sampling close relatives on some elementary population genetics analyses.
Wang, Jinliang
2018-01-01
Many molecular ecology analyses assume the genotyped individuals are sampled at random from a population and thus are representative of the population. Realistically, however, a sample may contain excessive close relatives (ECR) because, for example, localized juveniles are drawn from fecund species. Our knowledge is limited about how ECR affect the routinely conducted elementary genetics analyses, and how ECR are best dealt with to yield unbiased and accurate parameter estimates. This study quantifies the effects of ECR on some popular population genetics analyses of marker data, including the estimation of allele frequencies, F-statistics, expected heterozygosity (H_e), effective and observed numbers of alleles, and the tests of Hardy-Weinberg equilibrium (HWE) and linkage equilibrium (LE). It also investigates several strategies for handling ECR to mitigate their impact and to yield accurate parameter estimates. My analytical work, assisted by simulations, shows that ECR have large and global effects on all of the above marker analyses. The naïve approach of simply ignoring ECR could yield low-precision and often biased parameter estimates, and could cause too many false rejections of HWE and LE. The bold approach, which simply identifies and removes ECR, and the cautious approach, which estimates target parameters (e.g., H_e) by accounting for ECR and using naïve allele frequency estimates, eliminate the bias and the false HWE and LE rejections, but could reduce estimation precision substantially. The likelihood approach, which accounts for ECR in estimating allele frequencies and thus target parameters relying on allele frequencies, usually yields unbiased and the most accurate parameter estimates. Which of the four approaches is the most effective and efficient may depend on the particular marker analysis to be conducted. The results are discussed in the context of using marker data for understanding population properties and marker properties. 
© 2017 John Wiley & Sons Ltd.
Improving cell mixture deconvolution by identifying optimal DNA methylation libraries (IDOL).
Koestler, Devin C; Jones, Meaghan J; Usset, Joseph; Christensen, Brock C; Butler, Rondi A; Kobor, Michael S; Wiencke, John K; Kelsey, Karl T
2016-03-08
Confounding due to cellular heterogeneity represents one of the foremost challenges currently facing Epigenome-Wide Association Studies (EWAS). Statistical methods leveraging the tissue-specificity of DNA methylation for deconvoluting the cellular mixture of heterogeneous biospecimens offer a promising solution; however, the performance of such methods depends entirely on the library of methylation markers being used for deconvolution. Here, we introduce a novel algorithm for Identifying Optimal Libraries (IDOL) that dynamically scans a candidate set of cell-specific methylation markers to find libraries that optimize the accuracy of cell fraction estimates obtained from cell mixture deconvolution. Application of IDOL to a training set consisting of samples with both whole-blood DNA methylation data (Illumina HumanMethylation450 BeadArray (HM450)) and flow cytometry measurements of cell composition revealed an optimized library comprised of 300 CpG sites. When compared with existing libraries, the library identified by IDOL demonstrated significantly better overall discrimination of the entire immune cell landscape (p = 0.038), and resulted in improved discrimination of 14 out of the 15 pairs of leukocyte subtypes. Estimates of cell composition across the samples in the training set using the IDOL library were highly correlated with their respective flow cytometry measurements, with all cell-specific R^2 > 0.99 and root mean square errors (RMSEs) ranging from 0.97% to 1.33% across leukocyte subtypes. Independent validation of the optimized IDOL library using two additional HM450 data sets showed similarly strong prediction performance, with all cell-specific R^2 > 0.90 and RMSE < 4.00%. 
In simulation studies, adjustments for cell composition using the IDOL library resulted in uniformly lower false positive rates compared to competing libraries, while also demonstrating an improved capacity to explain epigenome-wide variation in DNA methylation within two large publicly available HM450 data sets. Despite consisting of half as many CpGs compared to existing libraries for whole blood mixture deconvolution, the optimized IDOL library identified herein resulted in outstanding prediction performance across all considered data sets and demonstrated potential to improve the operating characteristics of EWAS involving adjustments for cell distribution. In addition to providing the EWAS community with an optimized library for whole blood mixture deconvolution, our work establishes a systematic and generalizable framework for the assembly of libraries that improve the accuracy of cell mixture deconvolution.
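The two accuracy measures reported in this abstract, R^2 and RMSE between estimated and measured cell fractions, can be sketched as follows. This is a minimal illustration and not the IDOL implementation: the function name is hypothetical, and R^2 is assumed here to be the squared Pearson correlation, which the abstract does not specify.

```python
import numpy as np

def r2_and_rmse(estimated, measured):
    """Compare estimated cell fractions with reference measurements.

    Returns (squared Pearson correlation, root mean square error).
    Both inputs are 1-D sequences in the same units (e.g. percent).
    """
    estimated = np.asarray(estimated, dtype=float)
    measured = np.asarray(measured, dtype=float)
    r = np.corrcoef(estimated, measured)[0, 1]
    rmse = np.sqrt(np.mean((estimated - measured) ** 2))
    return r ** 2, rmse

# Hypothetical fractions (percent) for one cell type across five samples.
est = [10.2, 15.1, 20.3, 24.8, 30.5]
ref = [10.0, 15.0, 20.0, 25.0, 30.0]
print(r2_and_rmse(est, ref))
```

In practice the two numbers answer different questions: a high R^2 shows that the estimates track the reference, while RMSE shows how far off they are in absolute terms.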
Identifying Markers of Dignity-Conserving Care in Long-Term Care: A Modified Delphi Study
Thompson, Genevieve N.; McArthur, Jennifer; Doupe, Malcolm
2016-01-01
Ensuring that people living in nursing homes (NHs) are afforded dignity in their daily lives is an essential and humane concern. Promoting dignity-conserving care is fundamentally important. By nature, however, this care is all-encompassing and holistic, and from current knowledge it is challenging to create explicit strategies for measuring dignity-conserving care. In practice the majority of current NH indicators of quality care are derived from information that is routinely collected on NH residents using the RAI-Minimum Data Set (MDS). In this regard, issues that are more tangible to resident dignity such as being treated with respect, compassion, and having opportunities to engage with others are not adequately captured in current NH quality of care indicators. An initial set of markers was created by conducting an integrative literature review of existing markers and indicators of dignity in the NH setting. A modified Delphi process was used to prioritize essential dignity-conserving care markers for use by NH providers, based on factors such as the importance to fostering a culture of dignity, the impact it may have on the residents, and how achievable it is in practice. Through this consensus building technique, we were able to develop a comprehensive set of markers that capture the range and diversity of important dignity-conserving care strategies for use in NHs. The final 10 markers were judged as having high face validity by experts in the field and have explicit implications for enhancing the provision of daily dignified care to NH residents. These markers make an important addition to the traditional quality indicators used in the NH setting and as such, bridge an important gap in addressing the psychosocial and the less easily quantified needs of NH residents. PMID:27304853
Mourad, Amira M I; Sallam, Ahmed; Belamkar, Vikas; Wegulo, Stephen; Bowden, Robert; Jin, Yue; Mahdy, Ezzat; Bakheit, Bahy; El-Wafaa, Atif A; Poland, Jesse; Baenziger, Peter S
2018-01-01
Stem rust (caused by Puccinia graminis f. sp. tritici Erikss. & E. Henn.) is a major disease in wheat (Triticum aestivum L.). However, in recent years it occurs rarely in Nebraska due to weather and the effective selection and gene pyramiding of resistance genes. To understand the genetic basis of stem rust resistance in Nebraska winter wheat, we applied a genome-wide association study (GWAS) to a set of 270 winter wheat genotypes (A-set). Genotyping was carried out using genotyping-by-sequencing and ∼35,000 high-quality SNPs were identified. The tested genotypes were evaluated for their resistance to the common stem rust race in Nebraska (QFCSC) in two replications. Marker-trait association identified 32 SNP markers, which were significantly (Bonferroni corrected P < 0.05) associated with the resistance on chromosome 2D. The chromosomal location of the significant SNPs (chromosome 2D) matched the location of the Sr6 gene, which was expected in these genotypes based on pedigree information. A highly significant linkage disequilibrium (LD, r^2) was found between the significant SNPs and the specific SSR marker for the Sr6 gene (Xcfd43). This suggests the significant SNP markers are tagging the Sr6 gene. Out of the 32 significant SNPs, eight SNPs were in six genes that are annotated as being linked to disease resistance in the IWGSC RefSeq v1.0. The 32 significant SNP markers were located in nine haplotype blocks. All 32 significant SNPs were validated in a set of 60 different genotypes (V-set) using single marker analysis. SNP markers identified in this study can be used in marker-assisted selection, genomic selection, and to develop KASP (Kompetitive Allele Specific PCR) markers for the Sr6 gene. Novel SNPs for the Sr6 gene, an important stem rust resistance gene, were identified and validated in this study. These SNPs can be used to improve stem rust resistance in wheat.
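To make the Bonferroni criterion in this abstract concrete, the sketch below computes the per-marker threshold implied by a family-wise level of 0.05 and roughly 35,000 tests, which is about 1.4e-6. The function names and the example p-values are hypothetical; the abstract does not say exactly how many SNPs entered the correction, so 35,000 is an assumption based on the reported SNP count.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test p-value threshold that keeps the family-wise error rate at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests

def significant_markers(p_values, alpha=0.05):
    """Return indices of markers whose raw p-value passes the Bonferroni threshold."""
    threshold = bonferroni_threshold(alpha, len(p_values))
    return [i for i, p in enumerate(p_values) if p < threshold]

# Threshold for roughly 35,000 SNPs (assumed number of tests).
print(bonferroni_threshold(0.05, 35_000))

# Hypothetical raw p-values for three markers.
print(significant_markers([0.001, 0.04, 0.5]))
```

Equivalently, each raw p-value can be multiplied by the number of tests and compared with 0.05, which is the "Bonferroni corrected P" form used in the abstract.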
Vitezica, Zulma G; Varona, Luis; Legarra, Andres
2013-12-01
Genomic evaluation models can fit additive and dominant SNP effects. Under quantitative genetics theory, additive or "breeding" values of individuals are generated by substitution effects, which involve both "biological" additive and dominant effects of the markers. Dominance deviations include only a portion of the biological dominant effects of the markers. Additive variance includes variation due to the additive and dominant effects of the markers. We describe a matrix of dominant genomic relationships across individuals, D, which is similar to the G matrix used in genomic best linear unbiased prediction. This matrix can be used in a mixed-model context for genomic evaluations or to estimate dominant and additive variances in the population. From the "genotypic" value of individuals, an alternative parameterization defines additive and dominance as the parts attributable to the additive and dominant effect of the markers. This approach underestimates the additive genetic variance and overestimates the dominance variance. Transforming the variances from one model into the other is trivial if the distribution of allelic frequencies is known. We illustrate these results with mouse data (four traits, 1884 mice, and 10,946 markers) and simulated data (2100 individuals and 10,000 markers). Variance components were estimated correctly in the model, considering breeding values and dominance deviations. For the model considering genotypic values, the inclusion of dominant effects biased the estimate of additive variance. Genomic models were more accurate for the estimation of variance components than their pedigree-based counterparts.
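The abstract does not give the formula for D. The sketch below uses a construction commonly described for dominance genomic relationships under the breeding-value/dominance-deviation parameterization, and it should be read as an assumption rather than the authors' exact definition. Genotypes are coded as counts (0, 1, 2) of a reference allele with frequency p, and q = 1 - p; the function name is hypothetical.

```python
import numpy as np

def dominance_relationship_matrix(genotypes):
    """Build a dominance relationship matrix D = W W' / sum((2pq)^2).

    genotypes -- (n_individuals, n_markers) array of reference-allele
                 counts (0, 1 or 2), with no missing values.
    W holds dominance-deviation codes: -2q^2 for count 2, 2pq for
    count 1 and -2p^2 for count 0 (assumed coding, see lead-in).
    """
    g = np.asarray(genotypes, dtype=float)
    p = g.mean(axis=0) / 2.0          # reference-allele frequency per marker
    q = 1.0 - p
    w = np.where(g == 2, -2.0 * q ** 2,
                 np.where(g == 1, 2.0 * p * q, -2.0 * p ** 2))
    scale = np.sum((2.0 * p * q) ** 2)
    if scale == 0:
        raise ValueError("all markers are monomorphic")
    return w @ w.T / scale

# Four individuals, one marker in Hardy-Weinberg proportions (p = 0.5).
d = dominance_relationship_matrix([[2], [1], [1], [0]])
print(d)
```

With this coding each column of W has mean zero when genotypes are in Hardy-Weinberg proportions, which is what lets D play a role analogous to G in a mixed model.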
USDA-ARS's Scientific Manuscript database
Selection on important genetic markers can improve estimates of additive and dominance association effects. A composite population of beef cattle was selected for intermediate frequencies of myostatin (GDF8) F94L and µ-calpain (CAPN1) polymorphisms. Important additive associations of the GDF8 locu...
Results for five sets of forensic genetic markers studied in a Greek population sample.
Tomas, C; Skitsa, I; Steinmeier, E; Poulsen, L; Ampati, A; Børsting, C; Morling, N
2015-05-01
A population sample of 223 Greek individuals was typed for five sets of forensic genetic markers with the kits NGM SElect™, SNPforID 49plex, DIPplex®, Argus X-12 and PowerPlex® Y23. No significant deviation from Hardy-Weinberg expectations was observed for any of the studied markers after Holm-Šidák correction. Statistically significant (P<0.05) levels of linkage disequilibrium were observed between markers within two of the studied X-chromosome linkage groups. AMOVA analyses of the five sets of markers did not show population structure when the individuals were grouped according to their geographic origin. The Greek population grouped closely to the other European populations measured by F_ST* distances. The match probability ranged from a value of 1 in 2×10^7 males by using haplotype frequencies of four X-chromosome haplogroups in males to 1 in 1.73×10^21 individuals for 16 autosomal STRs. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Prediction of industrial tomato hybrids from agronomic traits and ISSR molecular markers.
Figueiredo, A S T; Resende, J T V; Faria, M V; Da-Silva, P R; Fagundes, B S; Morales, R G F
2016-05-13
Heterosis is a highly relevant phenomenon in plant breeding. This condition is usually established in hybrids derived from crosses of highly divergent parents. The success of a breeder in obtaining heterosis is directly related to the correct identification of genetically contrasting parents. Currently, the diallel cross is the most commonly used methodology to detect contrasting parents; however, it is a time- and cost-consuming procedure. Therefore, new tools capable of performing this task quickly and accurately are required. Thus, the purpose of this study was to estimate the genetic divergence in industrial tomato lines, based on agronomic traits, and to compare with estimates obtained using inter-simple sequence repeat (ISSR) molecular markers. The genetic divergence among 10 industrial tomato lines, based on nine morphological characters and 12 ISSR primers was analyzed. For data analysis, Pearson and Spearman correlation coefficients were calculated between the genetic dissimilarity measures estimated by Mahalanobis distance and Jaccard's coefficient of genetic dissimilarity from the heterosis estimates, combining ability, and means of important traits of industrial tomato. The ISSR markers efficiently detected contrasting parents for hybrid production in tomato. Parent RVTD-08 was indicated as the most divergent, both by molecular and morphological markers, that positively contributed to increased heterosis and by the specific combining ability in the crosses in which it participated. The genetic dissimilarity estimated by ISSR molecular markers aided the identification of the best hybrids of the experiment in terms of total fruit yield, pulp yield, and soluble solids content.
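Jaccard's coefficient of dissimilarity, as used here for the ISSR data, can be sketched from a binary band matrix in which each row is a line and each column records presence (1) or absence (0) of a band. The function name and the example matrix are hypothetical; the study's own scoring of the 12 ISSR primers is not reproduced.

```python
import numpy as np

def jaccard_dissimilarity(bands):
    """Pairwise Jaccard dissimilarity between rows of a 0/1 band matrix.

    For two lines, dissimilarity = 1 - (shared bands) / (bands present
    in at least one line). Joint absences are ignored.
    """
    b = np.asarray(bands, dtype=bool)
    n = b.shape[0]
    d = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            shared = np.sum(b[i] & b[j])
            union = np.sum(b[i] | b[j])
            # Two lines with no bands at all are treated as identical.
            d[i, j] = d[j, i] = 0.0 if union == 0 else 1.0 - shared / union
    return d

# Hypothetical band profiles for three lines and four bands.
profiles = [[1, 1, 0, 0],
            [1, 0, 1, 0],
            [1, 1, 0, 0]]
print(jaccard_dissimilarity(profiles))
```

The resulting matrix can then be correlated with heterosis or combining-ability estimates for the corresponding crosses, as the abstract describes.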
Software engineering the mixed model for genome-wide association studies on large samples.
Zhang, Zhiwu; Buckler, Edward S; Casstevens, Terry M; Bradbury, Peter J
2009-11-01
Mixed models improve the ability to detect phenotype-genotype associations in the presence of population stratification and multiple levels of relatedness in genome-wide association studies (GWAS), but for large data sets the resource consumption becomes impractical. At the same time, the sample size and number of markers used for GWAS is increasing dramatically, resulting in greater statistical power to detect those associations. The use of mixed models with increasingly large data sets depends on the availability of software for analyzing those models. While multiple software packages implement the mixed model method, no single package provides the best combination of fast computation, ability to handle large samples, flexible modeling and ease of use. Key elements of association analysis with mixed models are reviewed, including modeling phenotype-genotype associations using mixed models, population stratification, kinship and its estimation, variance component estimation, use of best linear unbiased predictors or residuals in place of raw phenotype, improving efficiency and software-user interaction. The available software packages are evaluated, and suggestions made for future software development.
Using postural synergies to animate a low-dimensional hand avatar in haptic simulation.
Mulatto, Sara; Formaglio, Alessandro; Malvezzi, Monica; Prattichizzo, Domenico
2013-01-01
A technique to animate a realistic hand avatar with 20 DoFs based on the biomechanics of the human hand is presented. The animation does not use any sensor glove or advanced tracker with markers. The proposed approach is based on the knowledge of a set of kinematic constraints on the model of the hand, referred to as postural synergies, which allow the hand posture to be represented using fewer variables than the number of joints of the hand model. This low-dimensional set of parameters is estimated from direct measurement of the motion of the thumb and index finger, tracked using two haptic devices. A kinematic inversion algorithm has been developed, which takes synergies into account and estimates the kinematic configuration of the whole hand, i.e., also of the fingers whose end tips are not directly tracked by the two haptic devices. The hand skin is deformable and its deformation is computed using a linear vertex blending technique. The proposed synergy-based animation of the hand avatar involves only algebraic computations and is suitable for real-time implementation as required in haptics.
Advances in marker-assisted breeding of sugarcane
USDA-ARS's Scientific Manuscript database
Despite the challenges posed by sugarcane, geneticists and breeders have actively sought to use DNA marker technology to enhance breeding efforts. Markers have been used to explore taxonomy, estimate genetic diversity, and to develop unique molecular fingerprints. Numerous studies have been undertak...
PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses
Purcell, Shaun; Neale, Benjamin; Todd-Brown, Kathe; Thomas, Lori; Ferreira, Manuel A. R.; Bender, David; Maller, Julian; Sklar, Pamela; de Bakker, Paul I. W.; Daly, Mark J.; Sham, Pak C.
2007-01-01
Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis. PMID:17701901
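The idea of identity-by-state (IBS) sharing mentioned in this abstract can be illustrated with a small sketch. This is not PLINK's code or its exact statistic; it is a simplified average proportion of alleles shared IBS between two individuals, with genotypes assumed to be coded as allele counts (0, 1, 2) and missing calls as NaN. The function name is hypothetical.

```python
import numpy as np

def ibs_similarity(g1, g2):
    """Average proportion of alleles shared identical-by-state.

    g1, g2 -- genotype vectors coded as counts (0, 1, 2) of one allele;
              NaN marks a missing call and that marker is skipped.
    At each marker two individuals share 2 - |g1 - g2| alleles IBS.
    """
    a = np.asarray(g1, dtype=float)
    b = np.asarray(g2, dtype=float)
    ok = ~(np.isnan(a) | np.isnan(b))
    if not ok.any():
        raise ValueError("no markers genotyped in both individuals")
    shared = 2.0 - np.abs(a[ok] - b[ok])
    return shared.mean() / 2.0

# Hypothetical genotypes at five markers; the last call is missing in one individual.
print(ibs_similarity([0, 1, 2, 2, 1], [0, 2, 2, 0, float("nan")]))
```

IBS sharing of this kind is observed directly from the genotypes, whereas identity-by-descent has to be inferred from it together with allele frequencies, which is the estimation step the abstract emphasizes.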
Werner, Simone; Krause, Friedemann; Rolny, Vinzent; Strobl, Matthias; Morgenstern, David; Datz, Christian; Chen, Hongda; Brenner, Hermann
2016-04-01
In initial studies that included colorectal cancer patients undergoing diagnostic colonoscopy, we had identified a serum marker combination able to detect colorectal cancer with similar diagnostic performance as fecal immunochemical test (FIT). In this study, we aimed to validate the results in participants of a large colorectal cancer screening study conducted in the average-risk, asymptomatic screening population. We tested serum samples from 1,200 controls, 420 advanced adenoma patients, 4 carcinoma in situ patients, and 36 colorectal cancer patients with a 5-marker blood test [carcinoembryonic antigen (CEA)+anti-p53+osteopontin+seprase+ferritin]. The diagnostic performance of individual markers and marker combinations was assessed and compared with stool test results. AUCs for the detection of colorectal cancer and advanced adenomas with the 5-marker blood test were 0.78 [95% confidence interval (CI), 0.68-0.87] and 0.56 (95% CI, 0.53-0.59), respectively, which now is comparable with guaiac-based fecal occult blood test (gFOBT) but inferior to FIT. With cutoffs yielding specificities of 80%, 90%, and 95%, the sensitivities for the detection of colorectal cancer were 64%, 50%, and 42%, and early-stage cancers were detected as well as late-stage cancers. For osteopontin, seprase, and ferritin, the diagnostic performance in the screening setting was reduced compared with previous studies in diagnostic settings while CEA and anti-p53 showed similar diagnostic performance in both settings. Performance of the 5-marker blood test under screening conditions is inferior to FIT even though it is still comparable with the performance of gFOBT. CEA and anti-p53 could contribute to the development of a multiple marker blood-based test for early detection of colorectal cancer. ©2015 American Association for Cancer Research.
Pancreatic Reference Set Application: Brian Haab-Van Andel (2012) — EDRN Public Portal
New markers are greatly needed for the detection and diagnosis of pancreatic cancer. Patients at high risk for developing pancreatic cancer (for example, because of genetic predisposition or health status) can be screened by endoscopy or a related imaging procedure, but these methods are expensive and burdensome to the patient. Blood-based markers would facilitate regular screening. In addition, patients with known abnormalities of the pancreas (for example, as observed incidentally from an abdominal scan) need to determine whether they have cancer or not. The great majority of patients with pancreatic findings by CT do not have conditions that require treatment, yet nearly all patients undergo invasive and burdensome procedures as a consequence of the CT. Again, a blood-based marker could alleviate this situation and potentially add accuracy to the diagnosis. In preliminary work we showed the potential for highly accurate discrimination of pancreatic cancer from pancreatitis and healthy control subjects using a panel of protein and glycan markers in the serum. We used an antibody array platform in which we can obtain sensitive, reproducible measurements of protein abundance and glycosylation status in low sample volumes. The detection of the glycosylation status is important for the high accuracy of the test because the glycans attached to the marker proteins are altered in cancer patients. Based on the good performance in these early studies, we now want to validate the performance in rigorously controlled, blinded sample sets. The reference set developed by the EDRN will enable a definitive characterization of our marker performance. In addition, we can make an accurate comparison to other markers that will be applied to the same set and determine whether disparate markers could be used together for added benefit.
Cros, David; Sánchez, Leopoldo; Cochard, Benoit; Samper, Patrick; Denis, Marie; Bouvet, Jean-Marc; Fernández, Jesús
2014-04-01
Explicit pedigree reconstruction by simulated annealing gave reliable estimates of genealogical coancestry in plant species, especially when selfing rate was lower than 0.6, using a realistic number of markers. Genealogical coancestry information is crucial in plant breeding to estimate genetic parameters and breeding values. The approach of Fernández and Toro (Mol Ecol 15:1657-1667, 2006) to estimate genealogical coancestries from molecular data through pedigree reconstruction was limited to species with separate sexes. In this study it was extended to plants, allowing hermaphroditism and monoecy, with possible selfing. Moreover, some improvements were made to take previous knowledge on the population demographic history into account. The new method was validated using simulated and real datasets. Simulations showed that accuracy of estimates was high with 30 microsatellites, with the best results obtained for selfing rates below 0.6. In these conditions, the root mean square error (RMSE) between the true and estimated genealogical coancestry was small (<0.07), although the number of ancestors was overestimated and the selfing rate could be biased. Simulations also showed that linkage disequilibrium between markers and departure from the Hardy-Weinberg equilibrium in the founder population did not affect the efficiency of the method. Real oil palm data confirmed the simulation results, with a high correlation between the true and estimated genealogical coancestry (>0.9) and a low RMSE (<0.08) using 38 markers. The method was applied to the Deli oil palm population for which pedigree data were scarce. The estimated genealogical coancestries were highly correlated (>0.9) with the molecular coancestries using 100 markers. Reconstructed pedigrees were used to estimate effective population sizes. In conclusion, this method gave reliable genealogical coancestry estimates. The strategy was implemented in the software MOLCOANC 3.0.
Integrating Multiple Data Sources for Combinatorial Marker Discovery: A Study in Tumorigenesis.
Bandyopadhyay, Sanghamitra; Mallik, Saurav
2018-01-01
Identification of combinatorial markers from multiple data sources is a challenging task in bioinformatics. Here, we propose a novel computational framework for identifying significant combinatorial markers using both gene expression and methylation data. The gene expression and methylation data are integrated into a single continuous data set as well as a (post-discretized) boolean data set based on their intrinsic (i.e., inverse) relationship. A novel combined score of methylation and expression data is introduced, which is computed on the integrated continuous data for identifying an initial non-redundant set of genes. Thereafter, (maximal) frequent closed homogeneous genesets are identified using a well-known biclustering algorithm applied on the integrated boolean data of the determined non-redundant set of genes. A novel sample-based weighted support is then proposed that is consecutively calculated on the integrated boolean data of the determined non-redundant set of genes in order to identify the non-redundant significant genesets. The top few resulting genesets are identified as potential combinatorial markers. Since our proposed method generates a smaller number of significant non-redundant genesets than other popular methods, the method is much faster than the others. Application of the proposed technique to expression and methylation data for Uterine tumor or Prostate Carcinoma produces a set of significant combinations of markers. We expect that such a combination of markers will produce fewer false positives than individual markers.
USDA-ARS's Scientific Manuscript database
Genetic marker effects and type of inheritance are estimated with poor precision when minor marker allele frequencies are low. An Angus population was subjected to marker assisted selection for multiple years to equalize CAPN1 haplotypes, CAST, and GHR genetic marker frequencies. The objective was t...
USDA-ARS's Scientific Manuscript database
Genomic selection (GS) and marker-assisted selection (MAS) rely on marker-trait associations and are both routinely used for breeding purposes. Although similar, these two approaches differ in their applications and how markers are used to estimate breeding values. In this study, GS and MAS were com...
Watkins, W Scott; Xing, Jinchuan; Huff, Chad; Witherspoon, David J; Zhang, Yuhua; Perego, Ugo A; Woodward, Scott R; Jorde, Lynn B
2012-05-20
Populations of the Americas were founded by early migrants from Asia, and some have experienced recent genetic admixture. To better characterize the native and non-native ancestry components in populations from the Americas, we analyzed 815,377 autosomal SNPs, mitochondrial hypervariable segments I and II, and 36 Y-chromosome STRs from 24 Mesoamerican Totonacs and 23 South American Bolivians. We analyzed common genomic regions from native Bolivian and Totonac populations to identify 324 highly predictive Native American ancestry informative markers (AIMs). As few as 40-50 of these AIMs perform nearly as well as large panels of random genome-wide SNPs for predicting and estimating Native American ancestry and admixture levels. These AIMs have greater New World vs. Old World specificity than previous AIMs sets. We identify highly-divergent New World SNPs that coincide with high-frequency haplotypes found at similar frequencies in all populations examined, including the HGDP Pima, Maya, Colombian, Karitiana, and Surui American populations. Some of these regions are potential candidates for positive selection. European admixture in the Bolivian sample is approximately 12%, though individual estimates range from 0-48%. We estimate that the admixture occurred ~360-384 years ago. Little evidence of European or African admixture was found in Totonac individuals. Bolivians with pre-Columbian mtDNA and Y-chromosome haplogroups had 5-30% autosomal European ancestry, demonstrating the limitations of Y-chromosome and mtDNA haplogroups and the need for autosomal ancestry informative markers for assessing ancestry in admixed populations.
Etchemendy, Pablo E; Spiousas, Ignacio; Calcagno, Esteban R; Abregú, Ezequiel; Eguia, Manuel C; Vergara, Ramiro O
2018-06-01
In this study we evaluated whether a method of direct location is an appropriate response method for measuring auditory distance perception of far-field sound sources. We designed an experimental set-up that allows participants to indicate the distance at which they perceive the sound source by moving a visual marker. We termed this method Cross-Modal Direct Location (CMDL) since the response procedure involves the visual modality while the stimulus is presented through the auditory modality. Three experiments were conducted with sound sources located from 1 to 6 m. The first one compared the perceived distances obtained using either the CMDL device or verbal report (VR), which is the response method more frequently used for reporting auditory distance in the far field, and found differences in response compression and bias. In Experiment 2, participants reported visual distance estimates to the visual marker that were found to be highly accurate. Then, we asked the same group of participants to report VR estimates of auditory distance and found that the spatial visual information, obtained from the previous task, did not influence their reports. Finally, Experiment 3 compared the same responses as Experiment 1 but interleaved the methods, showing a weak, but complex, mutual influence. However, the estimates obtained with each method remained statistically different. Our results show that the auditory distance psychophysical functions obtained with the CMDL method are less susceptible to previously reported underestimation for distances over 2 m.
Kidney Function and Fracture Risk: The Atherosclerosis Risk in Communities (ARIC) Study
Schneider, Andrea L.C.; Ballew, Shoshana; McAdams DeMarco, Mara; Coresh, Josef; Appel, Lawrence J.; Selvin, Elizabeth; Grams, Morgan E.
2015-01-01
Background People with end-stage renal disease are at high risk of bone fracture. Less is known about fracture risk in milder chronic kidney disease (CKD), and whether CKD-associated fracture risk varies by sex or assessment with alternative kidney markers. Study Design Prospective cohort study. Setting & Participants 10,955 participants from the Atherosclerosis Risk in Communities (ARIC) Study followed up from 1996 to 2011. Predictor Kidney function as assessed by creatinine-based estimated glomerular filtration rate (eGFRcr), urine albumin-creatinine ratio (ACR), and alternative filtration markers. Outcomes Fracture-related hospitalizations determined by diagnostic code. Measurements Baseline kidney markers; hospitalizations identified by self-report during annual telephone contact and active surveillance of local hospital discharge lists. Results Mean age of participants was 63 years, 56% were female, and 22% were black. During a median follow-up of 13 years, there were 722 incident fracture-related hospitalizations. Older age, female sex, and white race were associated with higher risk of fracture (p<0.001). The relationship between eGFRcr and fracture risk was non-linear: below 60 ml/min/1.73 m², lower eGFRcr was associated with higher fracture risk (adjusted HR per 10 ml/min/1.73 m² lower, 1.24; 95% CI, 1.05–1.47); there was no statistically significant association above 60 ml/min/1.73 m² in the primary analysis. In contrast, there was a graded association between other markers of kidney function and subsequent fracture, including ACR (HR per doubling, 1.10; 95% CI, 1.06–1.14), cystatin C–based eGFR (HR per 1-SD decrease, 1.15; 95% CI, 1.06–1.25), and 1/β2-microglobulin (HR per 1-SD decrease, 1.26; 95% CI, 1.15–1.37). Limitations No bone mineral density assessment; one-time measure of kidney function. Conclusions Both low eGFR and higher albuminuria were significant risk factors for fracture in this community-based population. 
The shape of the association in the upper ranges of eGFR varied by the filtration marker used in estimation. PMID:26250781
Genomic selection in plant breeding
USDA-ARS's Scientific Manuscript database
Genomic selection (GS) is a method to predict the genetic value of selection candidates based on the genomic estimated breeding value (GEBV) predicted from high-density markers positioned throughout the genome. Unlike marker-assisted selection, the GEBV is based on all markers including both minor ...
Chen, Hongda; Werner, Simone; Butt, Julia; Zörnig, Inka; Knebel, Phillip; Michel, Angelika; Eichmüller, Stefan B; Jäger, Dirk; Waterboer, Tim; Pawlita, Michael; Brenner, Hermann
2016-03-29
Novel blood-based screening tests are strongly desirable for early detection of colorectal cancer (CRC). We aimed to identify and evaluate autoantibodies against tumor-associated antigens as biomarkers for early detection of CRC. 380 clinically identified CRC patients and samples of participants with selected findings from a cohort of screening colonoscopy participants in 2005-2013 (N=6826) were included in this analysis. Sixty-four serum autoantibody markers were measured by multiplex bead-based serological assays. A two-step approach with selection of biomarkers in a training set, and validation of findings in a validation set, the latter exclusively including participants from the screening setting, was applied. Anti-MAGEA4 exhibited the highest sensitivity for detecting early stage CRC and advanced adenoma. Multi-marker combinations substantially increased sensitivity at the price of a moderate loss of specificity. Anti-TP53, anti-IMPDH2, anti-MDM2 and anti-MAGEA4 were consistently included in the best-performing 4-, 5-, and 6-marker combinations. This four-marker panel yielded a sensitivity of 26% (95% CI, 13-45%) for early stage CRC at a specificity of 90% (95% CI, 83-94%) in the validation set. Notably, it also detected 20% (95% CI, 13-29%) of advanced adenomas. Taken together, the identified biomarkers could contribute to the development of a useful multi-marker blood-based test for CRC early detection.
Daya, Michelle; van der Merwe, Lize; Galal, Ushma; Möller, Marlo; Salie, Muneeb; Chimusa, Emile R.; Galanter, Joshua M.; van Helden, Paul D.; Henn, Brenna M.; Gignoux, Chris R.; Hoal, Eileen
2013-01-01
Admixture is a well-known confounder in genetic association studies. If genome-wide data is not available, as would be the case for candidate gene studies, ancestry informative markers (AIMs) are required in order to adjust for admixture. The predominant population group in the Western Cape, South Africa, is the admixed group known as the South African Coloured (SAC). A small set of AIMs that is optimized to distinguish between the five source populations of this population (African San, African non-San, European, South Asian, and East Asian) will enable researchers to cost-effectively reduce false-positive findings resulting from ignoring admixture in genetic association studies of the population. Using genome-wide data to find SNPs with large allele frequency differences between the source populations of the SAC, as quantified by Rosenberg et al.'s I_n statistic, we developed a panel of AIMs by experimenting with various selection strategies. Subsets of different sizes were evaluated by measuring the correlation between ancestry proportions estimated by each AIM subset with ancestry proportions estimated using genome-wide data. We show that a panel of 96 AIMs can be used to assess ancestry proportions and to adjust for the confounding effect of the complex five-way admixture that occurred in the South African Coloured population. PMID:24376522
Bonnet, Vincent; Richard, Vincent; Camomilla, Valentina; Venture, Gentiane; Cappozzo, Aurelio; Dumas, Raphaël
2017-09-06
To reduce the impact of the soft tissue artefact (STA) on the estimate of skeletal movement using stereophotogrammetric and skin-marker data, multi-body kinematics optimisation (MKO) and extended Kalman filters (EKF) have been proposed. This paper assessed the feasibility and efficiency of these methods when they embed a mathematical model of the STA and simultaneously estimate the ankle, knee and hip joint kinematics and the model parameters. A STA model was used that provides an estimate of the STA affecting the marker-cluster located on a body segment as a function of the kinematics of the adjacent joints. The MKO and the EKF were implemented with and without the STA model. To assess these methods, intra-cortical pin and skin markers located on the thigh, shank, and foot of three subjects and tracked during the stance phase of running were used. Embedding the STA model in MKO and EKF reduced the average RMS of marker tracking from 12.6 to 1.6 mm and from 4.3 to 1.9 mm, respectively, showing that trial-specific calibration of a STA model is feasible. Nevertheless, with the STA model embedded in MKO, the RMS difference between the estimated and the reference joint kinematics determined from the pin markers slightly increased (from 2.0 to 2.1 deg). In contrast, when the STA model was embedded in the EKF, this RMS difference was slightly reduced (from 2.0 to 1.7 deg), suggesting that this method has greater potential to attenuate STA effects and improve the accuracy of joint kinematics estimates. Copyright © 2017 Elsevier Ltd. All rights reserved.
Single Nucleotide Polymorphism Markers for Genetic Mapping in Drosophila melanogaster
Hoskins, Roger A.; Phan, Alexander C.; Naeemuddin, Mohammed; Mapa, Felipa A.; Ruddy, David A.; Ryan, Jessica J.; Young, Lynn M.; Wells, Trent; Kopczynski, Casey; Ellis, Michael C.
2001-01-01
For nearly a century, genetic analysis in Drosophila melanogaster has been a powerful tool for analyzing gene function, yet Drosophila lacks the molecular genetic mapping tools that recently have revolutionized human, mouse, and plant genetics. Here, we describe the systematic characterization of a dense set of molecular markers in Drosophila by using a sequence tagged site-based physical map of the genome. We identify 474 biallelic markers in standard laboratory strains of Drosophila that span the genome. Most of these markers are single nucleotide polymorphisms and sequences for these variants are provided in an accessible format. The average density of the new markers is one per 225 kb on the autosomes and one per megabase on the X chromosome. We include in this survey a set of P-element strains that provide additional use for high-resolution mapping. We show one application of the new markers in a simple set of crosses to map a mutation in the hedgehog gene to an interval of <1 Mb. This new map resource significantly increases the efficiency and resolution of recombination mapping and will be of immediate value to the Drosophila research community. PMID:11381036
Wu, Tsung-Meng; Lin, Ke-Chun; Liau, Wei-Shiang; Chao, Yun-Yang; Yang, Ling-Hung; Chen, Szu-Yun; Lu, Chung-An; Hong, Chwan-Yang
2016-01-01
In the post-genomic era, many useful tools have been developed to accelerate the investigation of gene functions. Fluorescent proteins have been widely used as protein tags for studying the subcellular localization of proteins in plants. Several fluorescent organelle marker lines have been generated in dicot plants; however, useful and reliable fluorescent organelle marker lines are lacking in the monocot model rice. Here, we developed eight different GFP-based organelle markers in transgenic rice and created a set of DsRed-based gateway vectors for combining with the marker lines. Two mitochondrial-localized rice ascorbate peroxidase genes, fused to DsRed, successfully co-localized with mitochondrial-targeted marker lines, verifying the practical use of this system. The co-localization of GFP-fusion marker lines and DsRed-fusion proteins provides a convenient platform for in vivo or in vitro analysis of subcellular localization of rice proteins.
Eckert, Andrew J; Bower, Andrew D; Wegrzyn, Jill L; Pande, Barnaly; Jermstad, Kathleen D; Krutovsky, Konstantin V; St Clair, J Bradley; Neale, David B
2009-08-01
Adaptation to cold is one of the greatest challenges to forest trees. This process is highly synchronized with environmental cues relating to photoperiod and temperature. Here, we use a candidate gene-based approach to search for genetic associations between 384 single-nucleotide polymorphism (SNP) markers from 117 candidate genes and 21 cold-hardiness related traits. A general linear model approach, including population structure estimates as covariates, was implemented for each marker-trait pair. We discovered 30 highly significant genetic associations [false discovery rate (FDR) Q < 0.10] across 12 candidate genes and 10 of the 21 traits. We also detected a set of 7 markers that had elevated levels of differentiation between sampling sites situated across the Cascade crest in northeastern Washington. Marker effects were small (r² < 0.05) and within the range of those published previously for forest trees. The derived SNP allele, as measured by a comparison to a recently diverged sister species, typically affected the phenotype in a way consistent with cold hardiness. The majority of markers were characterized as having largely nonadditive modes of gene action, especially underdominance in the case of cold-tolerance related phenotypes. We place these results in the context of trade-offs between the abilities to grow longer and to avoid fall cold damage, as well as putative epigenetic effects. These associations provide insight into the genetic components of complex traits in coastal Douglas fir, as well as highlight the need for landscape genetic approaches to the detection of adaptive genetic diversity.
A framework for automatic creation of gold-standard rigid 3D-2D registration datasets.
Madan, Hennadii; Pernuš, Franjo; Likar, Boštjan; Špiclin, Žiga
2017-02-01
Advanced image-guided medical procedures incorporate 2D intra-interventional information into the pre-interventional 3D image and plan of the procedure through 3D/2D image registration (32R). To enter clinical use, and even for publication purposes, novel and existing 32R methods have to be rigorously validated. The performance of a 32R method can be estimated by comparing it to an accurate reference or gold standard method (usually based on fiducial markers) on the same set of images (gold standard dataset). Objective validation and comparison of methods are possible only if the evaluation methodology is standardized and the gold standard dataset is made publicly available. Currently, very few such datasets exist and only one contains images of multiple patients acquired during a procedure. To encourage the creation of gold standard 32R datasets, we propose an automatic framework. The framework is based on rigid registration of fiducial markers. The main novelty is spatial grouping of fiducial markers on the carrier device, which enables automatic marker localization and identification across the 3D and 2D images. The proposed framework was demonstrated on clinical angiograms of 20 patients. Rigid 32R computed by the framework was more accurate than that obtained manually, with the respective target registration error below 0.027 mm compared to 0.040 mm. The framework is applicable for gold standard setup on any rigid anatomy, provided that the acquired images contain spatially grouped fiducial markers. The gold standard datasets and software will be made publicly available.
Marker-Assisted Introgression in Backcross Breeding Programs
Visscher, P. M.; Haley, C. S.; Thompson, R.
1996-01-01
The efficiency of marker-assisted introgression in backcross populations derived from inbred lines was investigated by simulation. Background genotypes were simulated assuming that a genetic model of many genes of small effects in coupling phase explains the observed breed difference and variance in backcross populations. Markers were efficient in introgression backcross programs for simultaneously introgressing an allele and selecting for the desired genomic background. Using a marker spacing of 10-20 cM gave an advantage of one to two backcross generations selection relative to random or phenotypic selection. When the position of the gene to be introgressed is uncertain, for example because its position was estimated from a trait gene mapping experiment, a chromosome segment should be introgressed that is likely to include the allele of interest. Even for relatively precisely mapped quantitative trait loci, flanking markers or marker haplotypes should cover ~10-20 cM around the estimated position of the gene, to ensure that the allele frequency does not decline in later backcross generations. PMID:8978075
Genomic selection in plant breeding.
Newell, Mark A; Jannink, Jean-Luc
2014-01-01
Genomic selection (GS) is a method to predict the genetic value of selection candidates based on the genomic estimated breeding value (GEBV) predicted from high-density markers positioned throughout the genome. Unlike marker-assisted selection, the GEBV is based on all markers including both minor and major marker effects. Thus, the GEBV may capture more of the genetic variation for the particular trait under selection.
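As a rough illustration of the idea above, the sketch below computes a GEBV as the sum of estimated marker effects over all markers, weighted by each candidate's genotype. The 0/1/2 allele-count coding, the function name `gebv`, and every number are assumptions made for illustration and are not taken from the source. Estimating the marker effects themselves (for example from a training population) is not shown.

```python
from typing import Dict, List


def gebv(genotypes: List[int], marker_effects: List[float]) -> float:
    """Sum marker effects over ALL markers, weighted by allele count (0, 1 or 2)."""
    if len(genotypes) != len(marker_effects):
        raise ValueError("one effect per marker is required")
    return sum(g * b for g, b in zip(genotypes, marker_effects))


# Hypothetical estimated effects: one major marker and three minor ones.
effects = [1.5, 0.25, -0.25, 0.5]

# Hypothetical selection candidates with allele counts at the four markers.
candidates: Dict[str, List[int]] = {
    "line_A": [2, 0, 1, 1],
    "line_B": [0, 2, 0, 2],
    "line_C": [1, 1, 2, 0],
}

# Score every candidate and rank from highest to lowest GEBV.
scores = {name: gebv(g, effects) for name, g in candidates.items()}
ranking = sorted(scores, key=scores.get, reverse=True)
```

Because minor-effect markers also contribute to the sum, a candidate without the major allele (here `line_B`) can still outrank one that carries it (`line_C`), which is the contrast with marker-assisted selection drawn in the abstract.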
Discourse Markers in EFL Setting: Perceptions of Turkish EFL Teachers
ERIC Educational Resources Information Center
Asik, Asuman
2015-01-01
Discourse markers are seen as one of the fundamental units in spoken discourse due to their frequent and multifunctional use by native speakers of English. Discourse markers also have significance in foreign language instruction. In this respect, this study explored the perceptions of Turkish EFL teachers towards the use of discourse markers in…
Interlinkage among cardio-metabolic disease markers in an urban poor setting in Nairobi, Kenya.
Haregu, Tilahun Nigatu; Oti, Samuel; Ngomi, Nicholas; Khayeka-Wandabwa, Christopher; Egondi, Thaddaeus; Kyobutungi, Catherine
2016-01-01
The main cardio-metabolic diseases - mostly cardiovascular diseases such as stroke and ischemic heart disease - share common clinical markers such as raised blood pressure and blood glucose. The pathways of development of many of these conditions are also interlinked. In this regard, a higher level of co-occurrence of the main cardio-metabolic disease markers is expected. Evidence about the patterns of occurrence of cardio-metabolic markers and their interlinkage in the sub-Saharan African setting is inadequate. The goal of the study was to describe the interlinkage among common cardio-metabolic disease markers in an African setting. We used data collected in a cross-sectional study from 5,190 study participants as part of cardiovascular disease risk assessment in the urban slums of Nairobi, Kenya. Five commonly used clinical markers of cardio-metabolic conditions were considered in this analysis. These markers were waist circumference, blood pressure, random blood glucose, total blood cholesterol, and triglyceride levels. Patterns of these markers were described using means, standard deviations, and proportions. The associations between the markers were determined using odds ratios. The weighted prevalence of central obesity, hypertension, hyperglycemia, hypercholesterolemia, and hypertriglyceridemia were 12.3%, 7.0%, 2.5%, 10.3%, and 17.3%, respectively. Women had a higher prevalence of central obesity and hypercholesterolemia as compared to men. Blood glucose was strongly associated with central obesity, blood pressure, and triglyceride levels, whereas the association between blood glucose and total blood cholesterol was not statistically significant. This study shows that most of the common cardio-metabolic markers are interlinked, suggesting a higher probability of comorbidity due to cardio-metabolic conditions and thus the need for integrated approaches.
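The abstract reports that associations between markers were determined using odds ratios. A minimal sketch of that calculation for one pair of binary markers follows. The 2x2 cell layout, the Woolf (log-scale) confidence interval, the function name `odds_ratio`, and the counts are assumptions for illustration; the study's actual (possibly weighted or adjusted) analysis may differ.

```python
import math
from typing import Tuple


def odds_ratio(a: int, b: int, c: int, d: int) -> Tuple[float, float, float]:
    """Odds ratio and Woolf 95% CI for a 2x2 table.

    a: both markers raised       b: first marker raised only
    c: second marker raised only d: neither marker raised
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for this sketch")
    or_ = (a * d) / (b * c)
    # Standard error of log(OR) under the Woolf approximation.
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(or_) - 1.96 * se_log)
    upper = math.exp(math.log(or_) + 1.96 * se_log)
    return or_, lower, upper


# Hypothetical counts, e.g. hyperglycemia (rows) by central obesity (columns).
estimate, ci_low, ci_high = odds_ratio(40, 60, 100, 800)
```

An interval that excludes 1 would correspond to a statistically significant association at the 5% level under this approximation.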
Herrera, Carlos M
2012-01-01
Methods for estimating quantitative trait heritability in wild populations have been developed in recent years which take advantage of the increased availability of genetic markers to reconstruct pedigrees or estimate relatedness between individuals, but their application to real-world data is not exempt from difficulties. This chapter describes a recent marker-based technique which, by adopting a genomic scan approach and focusing on the relationship between phenotypes and genotypes at the individual level, avoids the problems inherent to marker-based estimators of relatedness. This method allows the quantification of the genetic component of phenotypic variance ("degree of genetic determination" or "heritability in the broad sense") in wild populations and is applicable whenever phenotypic trait values and multilocus data for a large number of genetic markers (e.g., amplified fragment length polymorphisms, AFLPs) are simultaneously available for a sample of individuals from the same population. The method proceeds by first identifying those markers whose variation across individuals is significantly correlated with individual phenotypic differences ("adaptive loci"). The proportion of phenotypic variance in the sample that is statistically accounted for by individual differences in adaptive loci is then estimated by fitting a linear model to the data, with trait value as the dependent variable and scores of adaptive loci as independent ones. The method can be easily extended to accommodate quantitative or qualitative information on biologically relevant features of the environment experienced by each sampled individual, in which case estimates of the environmental and genotype × environment components of phenotypic variance can also be obtained.
Herbeck, Joshua T.; Müller, Viktor; Maust, Brandon S.; Ledergerber, Bruno; Torti, Carlo; Di Giambenedetto, Simona; Gras, Luuk; Günthard, Huldrych F.; Jacobson, Lisa P.; Mullins, James I.; Gottlieb, Geoffrey S.
2013-01-01
Objective: The potential for changing HIV-1 virulence has significant implications for the AIDS epidemic, including changing HIV transmission rates, rapidity of disease progression, and timing of ART. Published data to date have provided conflicting results. Design: We conducted a meta-analysis of changes in baseline CD4+ T-cell counts and set point plasma viral RNA load over time in order to establish whether summary trends are consistent with changing HIV-1 virulence. Methods: We searched PubMed for studies of trends in HIV-1 prognostic markers of disease progression and supplemented findings with publications referenced in epidemiological or virulence studies. We identified 12 studies of trends in baseline CD4+ T-cell counts (21 052 total individuals), and eight studies of trends in set point viral loads (10 785 total individuals), spanning the years 1984–2010. Using random-effects meta-analysis, we estimated summary effect sizes for trends in HIV-1 plasma viral loads and CD4+ T-cell counts. Results: Baseline CD4+ T-cell counts showed a summary trend of decreasing cell counts [effect=−4.93 cells/µl per year, 95% confidence interval (CI) −6.53 to −3.3]. Set point viral loads showed a summary trend of increasing plasma viral RNA loads (effect=0.013 log10 copies/ml per year, 95% CI −0.001 to 0.03). The trend rates decelerated in recent years for both prognostic markers. Conclusion: Our results are consistent with increased virulence of HIV-1 over the course of the epidemic. Extrapolating over the 30 years since the first description of AIDS, this represents a CD4+ T-cell loss of approximately 148 cells/µl and a gain of 0.39 log10 copies/ml of viral RNA measured during early infection. These effect sizes would predict increasing rates of disease progression and need for ART, as well as increasing transmission risk. PMID:22089381
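The 30-year extrapolation in the abstract is the per-year summary trend multiplied by 30, which the sketch below reproduces. It also shows one common way to pool per-study trends under a random-effects model. The abstract does not say which estimator was used, so the DerSimonian-Laird estimator here is an assumption, as is the function name `random_effects_pool`.

```python
import math
from typing import List, Tuple

# Extrapolation reported in the abstract: per-year trend times 30 years.
YEARS = 30
cd4_loss = 4.93 * YEARS      # cells/µl lost, about 148
viral_gain = 0.013 * YEARS   # log10 copies/ml gained, about 0.39


def random_effects_pool(
    effects: List[float], variances: List[float]
) -> Tuple[float, float, float]:
    """DerSimonian-Laird pooled effect, its standard error, and tau^2."""
    w = [1.0 / v for v in variances]
    # Fixed-effect (inverse-variance) estimate, needed for Cochran's Q.
    fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sum(w)
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, effects))
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    # Between-study variance, truncated at zero.
    tau2 = max(0.0, (q - (len(effects) - 1)) / c) if c > 0 else 0.0
    w_star = [1.0 / (v + tau2) for v in variances]
    pooled = sum(wi * yi for wi, yi in zip(w_star, effects)) / sum(w_star)
    return pooled, math.sqrt(1.0 / sum(w_star)), tau2
```

When the per-study trends agree closely, the between-study variance is estimated as zero and the result coincides with the fixed-effect estimate; heterogeneous trends widen the standard error of the pooled effect.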
Wolc, Anna; Stricker, Chris; Arango, Jesus; Settar, Petek; Fulton, Janet E; O'Sullivan, Neil P; Preisinger, Rudolf; Habier, David; Fernando, Rohan; Garrick, Dorian J; Lamont, Susan J; Dekkers, Jack C M
2011-01-21
Genomic selection involves breeding value estimation of selection candidates based on high-density SNP genotypes. To quantify the potential benefit of genomic selection, accuracies of estimated breeding values (EBV) obtained with different methods using pedigree or high-density SNP genotypes were evaluated and compared in a commercial layer chicken breeding line. The following traits were analyzed: egg production, egg weight, egg color, shell strength, age at sexual maturity, body weight, albumen height, and yolk weight. Predictions appropriate for early or late selection were compared. A total of 2,708 birds were genotyped for 23,356 segregating SNP, including 1,563 females with records. Phenotypes on relatives without genotypes were incorporated in the analysis (in total 13,049 production records). The data were analyzed with a Reduced Animal Model using a relationship matrix based on pedigree data or on marker genotypes and with a Bayesian method using model averaging. Using a validation set that consisted of individuals from the generation following training, these methods were compared by correlating EBV with phenotypes corrected for fixed effects, selecting the top 30 individuals based on EBV and evaluating their mean phenotype, and by regressing phenotypes on EBV. Using high-density SNP genotypes increased accuracies of EBV up to two-fold for selection at an early age and by up to 88% for selection at a later age. Accuracy increases at an early age can be mostly attributed to improved estimates of parental EBV for shell quality and egg production, while for other egg quality traits it is mostly due to improved estimates of Mendelian sampling effects. A relatively small number of markers was sufficient to explain most of the genetic variation for egg weight and body weight.
Belinchón, Rocío; Ellis, Christopher J.; Yahr, Rebecca
2014-01-01
• Premise of the study: Microsatellite markers were characterized for two epiphytic cyanolichens, Nephroma laevigatum and N. parile (Nephromataceae), and will be used to investigate population structure and estimate gene flow among populations of these two closely related species with contrasting dispersal modes. • Methods and Results: Twelve and 14 microsatellite loci were characterized for N. laevigatum and N. parile, respectively. Allele number in N. laevigatum ranged from three to 13 per locus, while in N. parile there were from two to six alleles per locus. As expected, the sexually reproducing N. laevigatum had higher genetic diversity than the predominantly asexual N. parile. • Conclusions: This new set of markers is suitable for studying population structure and providing insights into gene flow among populations and for understanding processes of diversification. Compared between the species, they will facilitate an understanding of the influence of contrasting reproductive strategies on population and community structure. PMID:25383271
Huang, Minxuan; Matsushita, Kunihiro; Sang, Yingying; Ballew, Shoshana H.; Astor, Brad C.; Coresh, Josef
2014-01-01
Background: Decreased kidney function and kidney damage may predate hypertension, but only a few studies have investigated both types of markers simultaneously, and these studies have obtained conflicting results. Study Design: Cross-sectional for prevalent and prospective observational study for incident hypertension. Setting & Participants: 9,593 participants from the Atherosclerosis Risk in Communities (ARIC) Study, aged 53-75 years during 1996-1998. Predictors: Several markers of kidney function (estimated glomerular filtration rate [eGFR] using serum creatinine and/or cystatin C and two novel markers [β-trace protein and β2-microglobulin]) and one marker of kidney damage (urinary albumin-creatinine ratio [ACR]). Every kidney marker was categorized by its quintiles (top quintile as a reference for eGFRs and bottom quintile for the rest). Outcomes: Prevalent and incident hypertension. Measurements: Prevalence and HRs of hypertension based on modified Poisson regression and Cox proportional hazards models, respectively. Results: There were 4,378 participants (45.6%) with prevalent hypertension at baseline and 2,175 incident hypertension cases during a median follow-up of 9.8 years. While all five kidney function markers were significantly associated with prevalent hypertension, prevalent hypertension was most notably associated with higher ACR (adjusted prevalence ratio, 1.60 [95% CI, 1.50-1.71] for the highest vs lowest ACR quintile). Similarly, ACR was consistently associated with incident hypertension in all models tested (adjusted HR, 1.28 [95% CI, 1.10-1.49] for top quintile), while kidney function markers demonstrated significant associations in some, but not all, models. Even mildly increased ACR (9.14-14.0 mg/g) was significantly associated with incident hypertension. Limitations: Self-reported use of antihypertensive medication for defining incident hypertension, single assessment of kidney markers, and relatively narrow age range. Conclusions: Although all kidney markers were associated with prevalent hypertension, only elevated albuminuria was consistently associated with incident hypertension, suggesting that kidney damage is more closely related to hypertension than moderate reduction in overall kidney function. PMID:25151408
Poulsen, L; Farzad, M Sharafi; Børsting, C; Tomas, C; Pereira, V; Morling, N
2015-07-01
A total of 255 individuals (Persians, Lurs, Kurds and Azeris) from Iran were typed for three sets of forensic genetic markers with the NGM SElect™, DIPplex® and Argus X-12 kits. Statistically significant deviations (P≤0.002) from Hardy-Weinberg expectations were observed for the insertion-deletion markers HLD97 and HLD93 after Holm-Šidák correction. Statistically significant (P<0.05) levels of linkage disequilibrium were observed between markers within two of the four studied X-chromosomal linkage groups. AMOVA analyses of the three sets of markers did not show population structure when the individuals were grouped according to their ethnic group. The Iranian population grouped closely to populations living geographically near to Iran based on pairwise FST distances. The matching probabilities ranged from 1 in 3.2×10^7 males by using haplotype frequencies of four X-chromosomal haplogroups to 1 in 3.4×10^21 individuals for the 16 autosomal STRs. Copyright © 2015. Published by Elsevier Ireland Ltd.
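The Holm-Šidák correction mentioned above adjusts a family of p-values (here, one Hardy-Weinberg test per marker) in a step-down fashion. The sketch below shows the standard adjusted-p-value form of the procedure. The abstract does not describe how the correction was applied in the study, so the function name `holm_sidak` and the example p-values are purely illustrative assumptions.

```python
from typing import List


def holm_sidak(p_values: List[float]) -> List[float]:
    """Step-down Holm-Sidak adjusted p-values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        # The smallest p-value is corrected for m tests, the next for m - 1, ...
        adj = 1.0 - (1.0 - p_values[idx]) ** (m - rank)
        # Adjusted values may never decrease as the raw p-values increase.
        running_max = max(running_max, adj)
        adjusted[idx] = min(1.0, running_max)
    return adjusted


# Hypothetical raw p-values for three markers.
adjusted = holm_sidak([0.01, 0.04, 0.03])
```

A marker would be flagged as deviating from Hardy-Weinberg expectations when its adjusted p-value falls below the chosen significance level.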
Clark, Leigh Anne; Famula, Thomas R; Murphy, Keith E
2004-10-01
To develop a set of microsatellite markers, composed of a minimal number of these markers, suitable for use in forensic genetic investigations in dogs. Blood, tissue, or buccal epithelial cells from 364 dogs of 85 breeds and mixed breeds and 19 animals from related species in the family Canidae. 61 tetranucleotide microsatellite markers were characterized on the basis of number and size of alleles, ease of genotyping, chromosomal location, and ability to be coamplified. The range in allele size, number of alleles, total heterozygosity, and fixation index for each marker were determined by use of genotype data from 383 dogs and related species. Polymorphism information content was calculated for several breeds of dogs. 7 microsatellite markers could be coamplified. These markers were labeled with fluorescent dyes, multiplexed into a single reaction, and optimized for resolution in a commercial genetic analyzer. The multiplex set was used to identify sires for 2 mixed litters. The test was not species specific; genotype information collected for wolves, coyotes, jackals, New Guinea singing dogs, and an African wild dog could not distinguish between these species. This set of 7 microsatellite markers is useful in forensic applications (ie, identification of dogs and determination of parentage) in closely related animals and is applicable to a wide range of species belonging to the family Canidae.
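Two of the per-marker summaries named above, heterozygosity and polymorphism information content (PIC), can be computed from allele frequencies alone. The sketch below uses the usual expected-heterozygosity formula and the PIC definition commonly attributed to Botstein et al. (1980). Whether the study used exactly these estimators is an assumption, and the function names and allele frequencies are hypothetical.

```python
from itertools import combinations
from typing import List


def expected_heterozygosity(freqs: List[float]) -> float:
    """Probability that two randomly drawn alleles differ: 1 - sum(p_i^2)."""
    return 1.0 - sum(p * p for p in freqs)


def pic(freqs: List[float]) -> float:
    """PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2."""
    homozygosity = sum(p * p for p in freqs)
    cross = sum(2.0 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross


# Hypothetical markers: two equally common alleles vs. four equally common alleles.
two_alleles = [0.5, 0.5]
four_alleles = [0.25, 0.25, 0.25, 0.25]
```

Markers with more, evenly distributed alleles score higher on both measures, which is why such markers are preferred when a panel must stay small.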
Laidò, Giovanni; Mangini, Giacomo; Taranto, Francesca; Gadaleta, Agata; Blanco, Antonio; Cattivelli, Luigi; Marone, Daniela; Mastrangelo, Anna M.; Papa, Roberto; De Vita, Pasquale
2013-01-01
Levels of genetic diversity and population genetic structure of a collection of 230 accessions of seven tetraploid Triticum turgidum L. subspecies were investigated using six morphological, nine seed storage protein loci, 26 SSRs and 970 DArT markers. The genetic diversity of the morphological traits and seed storage proteins was always lower in the durum wheat compared to the wild and domesticated emmer. Using Bayesian clustering (K = 2), both of the sets of molecular markers distinguished the durum wheat cultivars from the other tetraploid subspecies, and two distinct subgroups were detected within the durum wheat subspecies, which is in agreement with their origin and year of release. The genetic diversity of morphological traits and seed storage proteins was always lower in the improved durum cultivars registered after 1990, than in the intermediate and older ones. This marked effect on diversity was not observed for molecular markers, where there was only a weak reduction. At K >2, the SSR markers showed a greater degree of resolution than for DArT, with their identification of a greater number of groups within each subspecies. Analysis of DArT marker differentiation between the wheat subspecies indicated outlier loci that are potentially linked to genes controlling some important agronomic traits. Among the 211 loci identified under selection, 109 markers were recently mapped, and some of these markers were clustered into specific regions on chromosome arms 2BL, 3BS and 4AL, where several genes/quantitative trait loci (QTLs) are involved in the domestication of tetraploid wheats, such as the tenacious glumes (Tg) and brittle rachis (Br) characteristics. On the basis of these results, it can be assumed that the population structure of the tetraploid wheat collection partially reflects the evolutionary history of Triticum turgidum L. subspecies and the genetic potential of landraces and wild accessions for the detection of unexplored alleles. PMID:23826256
Enzyme markers in inbred rat strains: genetics of new markers and strain profiles.
Adams, M; Baverstock, P R; Watts, C H; Gutman, G A
1984-08-01
Twenty-six inbred strains of the laboratory rat (Rattus norvegicus) were examined for electrophoretic variation at an estimated 97 genetic loci. In addition to previously documented markers, variation was observed for the enzymes aconitase, aldehyde dehydrogenase, and alkaline phosphatase. The genetic basis of these markers (Acon-1, Ahd-2, and Akp-1) was confirmed. Linkage analysis between 35 pairwise comparisons revealed that the markers Fh-1 and Pep-3 are linked. The strain profiles of the 25 inbred strains at 11 electrophoretic markers are given.
Selection enhanced estimates of marker effects on means and variances of beef tenderness
USDA-ARS's Scientific Manuscript database
Genetic marker associations from surveys of industry cattle populations have low frequencies of rare homozygous animals. Selection for calpain (CAPN1) and calpastatin (CAST) genetic markers was replicated in two cattle populations (Angus and MARC III) at the U.S. Meat Animal Research Center. These...
Chen, Hongda; Werner, Simone; Butt, Julia; Zörnig, Inka; Knebel, Phillip; Michel, Angelika; Eichmüller, Stefan B.; Jäger, Dirk; Waterboer, Tim; Pawlita, Michael; Brenner, Hermann
2016-01-01
Novel blood-based screening tests are strongly desirable for early detection of colorectal cancer (CRC). We aimed to identify and evaluate autoantibodies against tumor-associated antigens as biomarkers for early detection of CRC. 380 clinically identified CRC patients and samples of participants with selected findings from a cohort of screening colonoscopy participants in 2005–2013 (N=6826) were included in this analysis. Sixty-four serum autoantibody markers were measured by multiplex bead-based serological assays. A two-step approach with selection of biomarkers in a training set, and validation of findings in a validation set, the latter exclusively including participants from the screening setting, was applied. Anti-MAGEA4 exhibited the highest sensitivity for detecting early stage CRC and advanced adenoma. Multi-marker combinations substantially increased sensitivity at the price of a moderate loss of specificity. Anti-TP53, anti-IMPDH2, anti-MDM2 and anti-MAGEA4 were consistently included in the best-performing 4-, 5-, and 6-marker combinations. This four-marker panel yielded a sensitivity of 26% (95% CI, 13–45%) for early stage CRC at a specificity of 90% (95% CI, 83–94%) in the validation set. Notably, it also detected 20% (95% CI, 13–29%) of advanced adenomas. Taken together, the identified biomarkers could contribute to the development of a useful multi-marker blood-based test for CRC early detection. PMID:26909861
Optimal Geometrical Set for Automated Marker Placement to Virtualized Real-Time Facial Emotions
Maruthapillai, Vasanthan; Murugappan, Murugappan
2016-01-01
In recent years, real-time face recognition has been a major topic of interest in developing intelligent human-machine interaction systems. Over the past several decades, researchers have proposed different algorithms for facial expression recognition, but there has been little focus on detection in real-time scenarios. The present work proposes a new algorithmic method of automated marker placement used to classify six facial expressions: happiness, sadness, anger, fear, disgust, and surprise. Emotional facial expressions were captured using a webcam, while the proposed algorithm placed a set of eight virtual markers on each subject’s face. Facial feature extraction methods, including marker distance (distance between each marker to the center of the face) and change in marker distance (change in distance between the original and new marker positions), were used to extract three statistical features (mean, variance, and root mean square) from the real-time video sequence. The initial position of each marker was subjected to the optical flow algorithm for marker tracking with each emotional facial expression. Finally, the extracted statistical features were mapped into corresponding emotional facial expressions using two simple non-linear classifiers, K-nearest neighbor and probabilistic neural network. The results indicate that the proposed automated marker placement algorithm effectively placed eight virtual markers on each subject’s face and gave a maximum mean emotion classification rate of 96.94% using the probabilistic neural network. PMID:26859884
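The marker-distance features described above can be sketched as follows. The coordinates are hypothetical, and the use of the population variance is an assumption, since the abstract does not specify the variance estimator.

```python
import math
from statistics import mean, pvariance

def marker_distance(marker, centre):
    """Euclidean distance from one marker to the face centre."""
    return math.dist(marker, centre)

def summarise(values):
    """Return the three statistical features named in the abstract."""
    rms = math.sqrt(sum(v * v for v in values) / len(values))
    return {"mean": mean(values), "variance": pvariance(values), "rms": rms}

# Hypothetical track of one marker over four video frames (pixel coordinates).
centre = (0.0, 0.0)
track = [(3.0, 4.0), (6.0, 8.0), (3.0, 4.0), (6.0, 8.0)]

distances = [marker_distance(p, centre) for p in track]
features = summarise(distances)
```

A feature vector of this kind, one per marker, is what a classifier such as K-nearest neighbor would then consume.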
A global analysis of Y-chromosomal haplotype diversity for 23 STR loci
Purps, Josephine; Siegert, Sabine; Willuweit, Sascha; Nagy, Marion; Alves, Cíntia; Salazar, Renato; Angustia, Sheila M.T.; Santos, Lorna H.; Anslinger, Katja; Bayer, Birgit; Ayub, Qasim; Wei, Wei; Xue, Yali; Tyler-Smith, Chris; Bafalluy, Miriam Baeta; Martínez-Jarreta, Begoña; Egyed, Balazs; Balitzki, Beate; Tschumi, Sibylle; Ballard, David; Court, Denise Syndercombe; Barrantes, Xinia; Bäßler, Gerhard; Wiest, Tina; Berger, Burkhard; Niederstätter, Harald; Parson, Walther; Davis, Carey; Budowle, Bruce; Burri, Helen; Borer, Urs; Koller, Christoph; Carvalho, Elizeu F.; Domingues, Patricia M.; Chamoun, Wafaa Takash; Coble, Michael D.; Hill, Carolyn R.; Corach, Daniel; Caputo, Mariela; D’Amato, Maria E.; Davison, Sean; Decorte, Ronny; Larmuseau, Maarten H.D.; Ottoni, Claudio; Rickards, Olga; Lu, Di; Jiang, Chengtao; Dobosz, Tadeusz; Jonkisz, Anna; Frank, William E.; Furac, Ivana; Gehrig, Christian; Castella, Vincent; Grskovic, Branka; Haas, Cordula; Wobst, Jana; Hadzic, Gavrilo; Drobnic, Katja; Honda, Katsuya; Hou, Yiping; Zhou, Di; Li, Yan; Hu, Shengping; Chen, Shenglan; Immel, Uta-Dorothee; Lessig, Rüdiger; Jakovski, Zlatko; Ilievska, Tanja; Klann, Anja E.; García, Cristina Cano; de Knijff, Peter; Kraaijenbrink, Thirsa; Kondili, Aikaterini; Miniati, Penelope; Vouropoulou, Maria; Kovacevic, Lejla; Marjanovic, Damir; Lindner, Iris; Mansour, Issam; Al-Azem, Mouayyad; Andari, Ansar El; Marino, Miguel; Furfuro, Sandra; Locarno, Laura; Martín, Pablo; Luque, Gracia M.; Alonso, Antonio; Miranda, Luís Souto; Moreira, Helena; Mizuno, Natsuko; Iwashima, Yasuki; Neto, Rodrigo S. 
Moura; Nogueira, Tatiana L.S.; Silva, Rosane; Nastainczyk-Wulf, Marina; Edelmann, Jeanett; Kohl, Michael; Nie, Shengjie; Wang, Xianping; Cheng, Baowen; Núñez, Carolina; Pancorbo, Marian Martínez de; Olofsson, Jill K.; Morling, Niels; Onofri, Valerio; Tagliabracci, Adriano; Pamjav, Horolma; Volgyi, Antonia; Barany, Gusztav; Pawlowski, Ryszard; Maciejewska, Agnieszka; Pelotti, Susi; Pepinski, Witold; Abreu-Glowacka, Monica; Phillips, Christopher; Cárdenas, Jorge; Rey-Gonzalez, Danel; Salas, Antonio; Brisighelli, Francesca; Capelli, Cristian; Toscanini, Ulises; Piccinini, Andrea; Piglionica, Marilidia; Baldassarra, Stefania L.; Ploski, Rafal; Konarzewska, Magdalena; Jastrzebska, Emila; Robino, Carlo; Sajantila, Antti; Palo, Jukka U.; Guevara, Evelyn; Salvador, Jazelyn; Ungria, Maria Corazon De; Rodriguez, Jae Joseph Russell; Schmidt, Ulrike; Schlauderer, Nicola; Saukko, Pekka; Schneider, Peter M.; Sirker, Miriam; Shin, Kyoung-Jin; Oh, Yu Na; Skitsa, Iulia; Ampati, Alexandra; Smith, Tobi-Gail; Calvit, Lina Solis de; Stenzl, Vlastimil; Capal, Thomas; Tillmar, Andreas; Nilsson, Helena; Turrina, Stefania; De Leo, Domenico; Verzeletti, Andrea; Cortellini, Venusia; Wetton, Jon H.; Gwynne, Gareth M.; Jobling, Mark A.; Whittle, Martin R.; Sumita, Denilce R.; Wolańska-Nowak, Paulina; Yong, Rita Y.Y.; Krawczak, Michael; Nothnagel, Michael; Roewer, Lutz
2014-01-01
In a worldwide collaborative effort, 19,630 Y-chromosomes were sampled from 129 different populations in 51 countries. These chromosomes were typed for 23 short-tandem repeat (STR) loci (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS385ab, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, GATAH4, DYS481, DYS533, DYS549, DYS570, DYS576, and DYS643) using the PowerPlex Y23 System (PPY23, Promega Corporation, Madison, WI). Locus-specific allelic spectra of these markers were determined and a consistently high level of allelic diversity was observed. A considerable number of null, duplicate and off-ladder alleles were revealed. Standard single-locus and haplotype-based parameters were calculated and compared between subsets of Y-STR markers established for forensic casework. The PPY23 marker set provides substantially stronger discriminatory power than other available kits but at the same time reveals the same general patterns of population structure as other marker sets. A strong correlation was observed between the number of Y-STRs included in a marker set and some of the forensic parameters under study. Interestingly, a weak but consistent trend toward smaller genetic distances resulting from larger numbers of markers became apparent. PMID:24854874
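Two haplotype-based parameters commonly reported in forensic Y-STR work are haplotype diversity and discrimination capacity. The abstract does not list which parameters were used, so the choice here is an assumption, and the haplotypes are hypothetical.

```python
from collections import Counter

def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity: n/(n-1) * (1 - sum(p_i^2))."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two haplotypes")
    counts = Counter(haplotypes)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1 - sum_sq)

def discrimination_capacity(haplotypes):
    """Share of distinct haplotypes among all sampled haplotypes."""
    return len(set(haplotypes)) / len(haplotypes)

# Hypothetical haplotypes: each tuple holds repeat counts at three loci.
sample = [(14, 13, 29), (14, 13, 29), (15, 13, 30), (16, 14, 31)]
hd = haplotype_diversity(sample)
dc = discrimination_capacity(sample)
```

Typing more loci can only split existing haplotype classes further, so both values are non-decreasing in the number of markers.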
Performance of genomic prediction within and across generations in maritime pine.
Bartholomé, Jérôme; Van Heerwaarden, Joost; Isik, Fikret; Boury, Christophe; Vidal, Marjorie; Plomion, Christophe; Bouffier, Laurent
2016-08-11
Genomic selection (GS) is a promising approach for decreasing breeding cycle length in forest trees. Assessment of progeny performance and of the prediction accuracy of GS models over generations is therefore a key issue. A reference population of maritime pine (Pinus pinaster) with an estimated effective inbreeding population size (status number) of 25 was first selected with simulated data. This reference population (n = 818) covered three generations (G0, G1 and G2) and was genotyped with 4436 single-nucleotide polymorphism (SNP) markers. We evaluated the effects on prediction accuracy of both the relatedness between the calibration and validation sets and validation on the basis of progeny performance. Pedigree-based (best linear unbiased prediction, ABLUP) and marker-based (genomic BLUP and Bayesian LASSO) models were used to predict breeding values for three different traits: circumference, height and stem straightness. On average, the ABLUP model outperformed genomic prediction models, with a maximum difference in prediction accuracies of 0.12, depending on the trait and the validation method. A mean difference in prediction accuracy of 0.17 was found between validation methods differing in terms of relatedness. Including the progenitors in the calibration set reduced this difference in prediction accuracy to 0.03. When only genotypes from the G0 and G1 generations were used in the calibration set and genotypes from G2 were used in the validation set (progeny validation), prediction accuracies ranged from 0.70 to 0.85. This study suggests that the training of prediction models on parental populations can predict the genetic merit of the progeny with high accuracy: an encouraging result for the implementation of GS in the maritime pine breeding program.
Physiology of Pseudomonas aeruginosa in biofilms as revealed by transcriptome analysis
2010-01-01
Background: Transcriptome analysis was applied to characterize the physiological activities of Pseudomonas aeruginosa grown for three days in drip-flow biofilm reactors. Conventional applications of transcriptional profiling often compare two paired data sets that differ in a single experimentally controlled variable. In contrast, this study obtained the transcriptome of a single biofilm state, ranked transcript signals to make the priorities of the population manifest, and compared rankings for a priori identified physiological marker genes between the biofilm and published data sets. Results: Biofilms tolerated exposure to antibiotics, harbored steep oxygen concentration gradients, and exhibited stratified and heterogeneous spatial patterns of protein synthetic activity. Transcriptional profiling was performed and the signal intensity of each transcript was ranked to gain insight into the physiological state of the biofilm population. Similar rankings were obtained from data sets published in the GEO database http://www.ncbi.nlm.nih.gov/geo. By comparing the rank of genes selected as markers for particular physiological activities between the biofilm and comparator data sets, it was possible to infer qualitative features of the physiological state of the biofilm bacteria. These biofilms appeared, from their transcriptome, to be glucose nourished, iron replete, oxygen limited, and growing slowly or exhibiting stationary phase character. Genes associated with elaboration of type IV pili were strongly expressed in the biofilm. The biofilm population did not indicate oxidative stress, homoserine lactone mediated quorum sensing, or activation of efflux pumps. Using correlations with transcript ranks, the average specific growth rate of biofilm cells was estimated to be 0.08 h⁻¹. Conclusions: Collectively these data underscore the oxygen-limited, slow-growing nature of the biofilm population and are consistent with antimicrobial tolerance due to low metabolic activity. PMID:21083928
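The rank-based comparison of marker genes between data sets can be sketched as below. Gene names and signal values are placeholders, and scaling the rank by data set size is an assumption made so that data sets with different numbers of transcripts are comparable.

```python
def rank_transcripts(signal_by_gene):
    """Map each gene to its rank, with rank 1 for the strongest signal."""
    ordered = sorted(signal_by_gene, key=signal_by_gene.get, reverse=True)
    return {gene: position for position, gene in enumerate(ordered, start=1)}

def relative_rank(ranks, gene):
    """Rank scaled to (0, 1], so data sets of different size are comparable."""
    return ranks[gene] / len(ranks)

# Hypothetical signal intensities; gene names are placeholders.
biofilm = {"geneA": 950.0, "geneB": 120.0, "geneC": 40.0, "geneD": 610.0}
comparator = {"geneA": 80.0, "geneB": 300.0, "geneC": 20.0, "geneD": 700.0}

biofilm_ranks = rank_transcripts(biofilm)
comparator_ranks = rank_transcripts(comparator)
shift = relative_rank(biofilm_ranks, "geneA") - relative_rank(comparator_ranks, "geneA")
```

A negative shift means the marker gene ranks higher (is relatively more strongly expressed) in the biofilm than in the comparator.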
Paterson, Kade L; Hinman, Rana S; Metcalf, Ben R; Bennell, Kim L; Wrigley, Tim V
2017-01-01
Understanding how kinematic multi-segment foot modelling influences the utility of Plug-in-Gait calculations of the knee adduction moment (KAM) during shod walking is relevant to knee osteoarthritis (OA). Multi-segment foot markers placed on the skin through windows cut into the shoe provide a more accurate representation of foot mechanics than the traditional marker set used by Plug-in-Gait, which uses fewer markers, placed on the shoe itself. We aimed to investigate whether Plug-in-Gait calculation of the KAM differed when using a kinematic multi-segment foot model compared to the traditional Plug-in-Gait marker set. Twenty people with medial knee OA underwent gait analysis in two test conditions: i) Plug-in-Gait model with its two standard foot markers placed on the shoes; and ii) Plug-in-Gait with the heel marker virtualised from a modified-Oxford Foot Model where 8 foot markers were placed on the skin through windows cut in shoe uppers. Outcomes were the peak KAM, KAM impulse and other knee kinetic and kinematic variables. There were no differences (P > 0.05) in any gait variables between conditions. Excellent agreement was found for all outcome variables, with high correlations (r > 0.88-0.99, P < 0.001), narrow limits of agreement and no proportional bias (R² = 0.03-0.14, P > 0.05). The mean difference and 95% confidence intervals for peak KAM were also within the minimal detectable change range, demonstrating equivalence. Plug-in-Gait calculations of the KAM are not altered when using a kinematic multi-segment foot marker model with skin markers placed through windows cut into the shoe, instead of the traditional marker set placed on top of shoes. Researchers may be confident that applying either foot model does not change the calculation of the KAM using Plug-in-Gait.
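Limits of agreement between two measurement conditions are commonly computed in the Bland-Altman manner, as the mean difference plus or minus 1.96 standard deviations of the differences. The abstract does not name the method, so this is an assumed approach, and the values below are hypothetical.

```python
from statistics import mean, stdev

def limits_of_agreement(method_a, method_b, z=1.96):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    if len(method_a) != len(method_b):
        raise ValueError("paired samples must have equal length")
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    spread = z * stdev(diffs)
    return bias, bias - spread, bias + spread

# Hypothetical peak KAM values for five participants under each condition.
shoe_markers = [3.10, 2.85, 3.40, 2.60, 3.05]
skin_markers = [3.05, 2.90, 3.35, 2.65, 3.00]
bias, lower, upper = limits_of_agreement(shoe_markers, skin_markers)
```

Narrow limits centred near zero indicate that the two marker sets give interchangeable values.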
NASA Astrophysics Data System (ADS)
Aghaei, Faranak; Ross, Stephen R.; Wang, Yunzhi; Wu, Dee H.; Cornwell, Benjamin O.; Ray, Bappaditya; Zheng, Bin
2017-03-01
Aneurysmal subarachnoid hemorrhage (aSAH) is a form of hemorrhagic stroke that affects middle-aged individuals and is associated with significant morbidity and/or mortality, especially in those presenting with higher clinical and radiologic grades at the time of admission. Previous studies suggested that blood extravasated after aneurysmal rupture was a potential clinical prognostic factor, but all such studies used qualitative scales to predict prognosis. The purpose of this study is to develop and test a new interactive computer-aided detection (CAD) tool to detect, segment and quantify brain hemorrhage and ventricular cerebrospinal fluid on non-contrasted brain CT images. First, CAD segments the brain skull using a multilayer region growing algorithm with adaptively adjusted thresholds. Second, CAD assigns pixels inside the segmented brain region to one of three classes, namely normal brain tissue, blood and fluid. Third, to avoid a "black-box" approach and increase accuracy in quantification of these two image markers using CT images with large noise variation in different cases, a graphical user interface (GUI) was implemented that allows users to visually examine segmentation results. If a user wishes to correct any errors (i.e., deleting clinically irrelevant blood or fluid regions, or filling in holes inside the relevant blood or fluid regions), he/she can manually define the region and select a corresponding correction function. CAD will automatically perform the correction and update the computed data. The new CAD tool is now being used in clinical and research settings to estimate various quantitative radiological parameters/markers to determine radiological severity of aSAH at presentation, correlate the estimations with various homeostatic/metabolic derangements, and predict clinical outcome.
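The basic idea of region growing can be illustrated with a single-layer, fixed-threshold version. This is a simplified sketch, not the multilayer algorithm with adaptively adjusted thresholds used by the tool; the grid values are placeholders rather than CT numbers.

```python
from collections import deque

def region_grow(image, seed, low, high):
    """Collect 4-connected pixels reachable from seed with low <= value <= high."""
    rows, cols = len(image), len(image[0])
    region = set()
    queue = deque([seed])
    while queue:
        r, c = queue.popleft()
        if (r, c) in region or not (0 <= r < rows and 0 <= c < cols):
            continue
        if not (low <= image[r][c] <= high):
            continue
        region.add((r, c))
        queue.extend([(r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)])
    return region

# Hypothetical 4x4 intensity grid; values are placeholders, not CT numbers.
image = [
    [10, 10, 90, 90],
    [10, 60, 60, 90],
    [10, 60, 10, 10],
    [90, 10, 10, 60],
]
blood_like = region_grow(image, seed=(1, 1), low=50, high=70)
```

The pixel at (3, 3) falls inside the intensity window but is not connected to the seed, so it is left out; connectivity is what distinguishes region growing from plain thresholding.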
USDA-ARS's Scientific Manuscript database
Selection of the composite MARC III population for markers allowed better estimates of effects and inheritance of markers for targeted carcass quality traits (n=254) and nontargeted traits and an evaluation of SNP specific residual variance models for tenderness. Genotypic effects of CAPN1 haplotyp...
Wells, D J M; Alderson, J A; Dunne, J; Elliott, B C; Donnelly, C J
2017-01-25
To appropriately use inverse kinematic (IK) modelling for the assessment of human motion, a musculoskeletal model must be prepared 1) to match participant segment lengths (scaling) and 2) to align the model's virtual marker positions with known, experimentally derived kinematic marker positions (marker registration). The purpose of this study was to investigate whether prescribing joint co-ordinates during the marker registration process (within the modelling framework OpenSim) will improve IK derived elbow kinematics during an overhead sporting task. To test this, the upper limb kinematics of eight cricket bowlers were recorded during two testing sessions, with a different tester each session. The bowling trials were IK modelled twice: once with an upper limb musculoskeletal model prepared with prescribed participant specific co-ordinates during marker registration - MR PC - and once with the same model prepared without prescribed co-ordinates - MR; and by an established direct kinematic (DK) upper limb model. Whilst both skeletal model preparations had strong inter-tester repeatability (MR: Statistical Parametric Mapping (SPM1D)=0% different; MR PC: SPM1D=0% different), when compared with DK model elbow FE waveform estimates, IK estimates using the MR PC model (RMSD=5.2±2.0°, SPM1D=68% different) were in closer agreement than the estimates from the MR model (RMSD=44.5±18.5°, SPM1D=100% different). Results show that prescribing participant specific joint co-ordinates during the marker registration phase of model preparation increases the accuracy and repeatability of IK solutions when modelling overhead sporting tasks in OpenSim. Copyright © 2016 Elsevier Ltd. All rights reserved.
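The RMSD used to compare waveform estimates is the root-mean-square difference between two equally sampled curves. A minimal sketch with hypothetical joint angles (in degrees):

```python
import math

def rmsd(waveform_a, waveform_b):
    """Root-mean-square difference between two equally sampled waveforms."""
    if len(waveform_a) != len(waveform_b):
        raise ValueError("waveforms must have the same number of samples")
    squared = [(a - b) ** 2 for a, b in zip(waveform_a, waveform_b)]
    return math.sqrt(sum(squared) / len(squared))

# Hypothetical elbow angle waveforms in degrees, four samples each.
dk_estimate = [10.0, 20.0, 35.0, 50.0]
ik_estimate = [13.0, 24.0, 35.0, 50.0]
error = rmsd(dk_estimate, ik_estimate)
```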
Platt, Roy N; Faircloth, Brant C; Sullivan, Kevin A M; Kieran, Troy J; Glenn, Travis C; Vandewege, Michael W; Lee, Thomas E; Baker, Robert J; Stevens, Richard D; Ray, David A
2018-03-01
The rapid diversification of Myotis bats into more than 100 species is one of the most extensive mammalian radiations available for study. Efforts to understand relationships within Myotis have primarily utilized mitochondrial markers and trees inferred from nuclear markers lacked resolution. Our current understanding of relationships within Myotis is therefore biased towards a set of phylogenetic markers that may not reflect the history of the nuclear genome. To resolve this, we sequenced the full mitochondrial genomes of 37 representative Myotis, primarily from the New World, in conjunction with targeted sequencing of 3648 ultraconserved elements (UCEs). We inferred the phylogeny and explored the effects of concatenation and summary phylogenetic methods, as well as combinations of markers based on informativeness or levels of missing data, on our results. Of the 294 phylogenies generated from the nuclear UCE data, all are significantly different from phylogenies inferred using mitochondrial genomes. Even within the nuclear data, quartet frequencies indicate that around half of all UCE loci conflict with the estimated species tree. Several factors can drive such conflict, including incomplete lineage sorting, introgressive hybridization, or even phylogenetic error. Despite the degree of discordance between nuclear UCE loci and the mitochondrial genome and among UCE loci themselves, the most common nuclear topology is recovered in one quarter of all analyses with strong nodal support. Based on these results, we re-examine the evolutionary history of Myotis to better understand the phenomena driving their unique nuclear, mitochondrial, and biogeographic histories.
Bertrand, Bénédicte; Alburaki, Mohamed; Legout, Hélène; Moulin, Sibyle; Mougel, Florence; Garnery, Lionel
2015-05-01
Honeybee subspecies have been affected by human activities in Europe over the past few decades. One such example is the importation of nonlocal subspecies of bees which has had an adverse impact on the geographical repartition and subsequently on the genetic diversity of the black honeybee Apis mellifera mellifera. To restore the original diversity of this local honeybee subspecies, different conservation centres were set up in Europe. In this study, we established a black honeybee conservation centre Conservatoire de l'Abeille Noire d'Ile de France (CANIF) in the region of Ile-de-France, France. CANIF's honeybee colonies were intensively studied over a 3-year period. This study included a drone congregation area (DCA) located in the conservation centre. MtDNA COI-COII marker was used to evaluate the genetic diversity of CANIF's honeybee populations and the drones found and collected from the DCA. The same marker (mtDNA) was used to estimate the interactions and the haplotype frequency between CANIF's honeybee populations and 10 surrounding honeybee apiaries located outside of the CANIF. Our results indicate that the colonies of the conservation centre and the drones of the DCA show similar stable profiles compared to the surrounding populations with lower level of introgression. The mtDNA marker used on both DCA and colonies of the conservation centre seems to be an efficient approach to monitor and maintain the genetic diversity of the protected honeybee populations. © 2014 John Wiley & Sons Ltd.
Bungartz, Annemarie; Klaus, Marius; Mathew, Boby; Léon, Jens; Naz, Ali Ahmad
2016-03-01
The aim of the present study was to develop a new cost-effective PCR-based CAPS marker set using the advantages of high-throughput SNP genotyping. Initially, an SNP survey was made using 20 diverse barley genotypes via 9k iSelect array genotyping, which resulted in 6334 polymorphic SNP markers. Principal component analysis using these marker data showed fine differentiation of the diverse barley gene pool. To this end, we developed 200 SNP-derived CAPS markers distributed across the genome, covering around 991 cM with an average marker density of 5.09 cM. Further, we genotyped 68 CAPS markers in an F2 population (Cheri×ICB181160) segregating for seed color variation in barley. Genetic mapping of seed color revealed putative linkage of a single nuclear gene on chromosome 1H. These findings provide proof of concept for the development and utility of a newer cost-effective genomic tool kit to analyze broader genetic resources of barley worldwide. Copyright © 2016 Elsevier Inc. All rights reserved.
An AFLP genetic linkage map of Pacific abalone (Haliotis discus hannai)
NASA Astrophysics Data System (ADS)
Qi, Li; Yanhong, Xu; Ruihai, Yu; Akihiro, Kijima
2007-07-01
A genetic linkage map of Pacific abalone (Haliotis discus hannai) was constructed using AFLP markers based on a two-way pseudo-testcross strategy in a full-sib family. With 33 primer combinations, a total of 455 markers (225 from the female parent and 230 from the male parent) segregated in a 1:1 ratio, corresponding to DNA polymorphism: heterozygous in one parent and null in the other. The female framework map consisted of 174 markers distributed in 18 linkage groups, equivalent to the H. discus hannai haploid chromosome number, and spanning a total length of 2031.4 cM, with an average interval of 13.0 cM between adjacent markers. The male framework map consisted of 195 markers mapped on 19 linkage groups, spanning a total length of 2273.4 cM, with an average spacing of 12.9 cM between adjacent markers. The estimated coverage for the framework linkage maps was 81.2% for the female and 82.1% for the male, on the basis of two estimates of genome length. Fifty-two markers (11.4%) remained unlinked. The level of segregation distortion observed in this cross was 20.4%. These linkage maps will serve as a starting point for linkage studies in the Pacific abalone with potential application for marker-assisted selection in breeding programs.
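The reported average intervals are consistent with dividing each map length by the number of marker intervals (markers minus linkage groups). That the authors computed them this way is an inference from the figures, not something the abstract states; the short check below uses only numbers given above.

```python
def average_interval(total_length_cm, n_markers, n_groups):
    """Mean spacing: map length divided by the number of marker intervals."""
    intervals = n_markers - n_groups  # a group of k markers has k - 1 intervals
    return total_length_cm / intervals

female = average_interval(2031.4, 174, 18)
male = average_interval(2273.4, 195, 19)
unlinked_share = 52 / 455  # unlinked markers as a fraction of all markers
```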
Gianola, Daniel; Fariello, Maria I.; Naya, Hugo; Schön, Chris-Carolin
2016-01-01
Standard genome-wide association studies (GWAS) scan for relationships between each of p molecular markers and a continuously distributed target trait. Typically, a marker-based matrix of genomic similarities among individuals (G) is constructed, to account more properly for the covariance structure in the linear regression model used. We show that the generalized least-squares estimator of the regression of phenotype on one or on m markers is invariant with respect to whether or not the marker(s) tested is(are) used for building G, provided variance components are unaffected by exclusion of such marker(s) from G. The result is arrived at by using a matrix expression such that one can find many inverses of genomic relationship, or of phenotypic covariance matrices, stemming from removing markers tested as fixed, but carrying out a single inversion. When eigenvectors of the genomic relationship matrix are used as regressors with fixed regression coefficients, e.g., to account for population stratification, their removal from G does matter. Removal of eigenvectors from G can have a noticeable effect on estimates of genomic and residual variances, so caution is needed. Concepts were illustrated using genomic data on 599 wheat inbred lines, with grain yield as target trait, and on close to 200 Arabidopsis thaliana accessions. PMID:27520956
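For a single tested marker, the invariance can be seen from a rank-one update argument. The following is a sketch under assumptions not stated in the abstract: x is the genotype vector of the tested marker and is the only regressor, V_0 is the phenotypic covariance matrix with that marker left out of G, the covariance with the marker included is V = V_0 + c x x^T for some scalar c > 0, and the variance components (hence V_0 and c) are treated as known and unchanged.

```latex
\begin{aligned}
V^{-1}x &= V_0^{-1}x - \frac{c\,V_0^{-1}x\,\bigl(x^{\top}V_0^{-1}x\bigr)}{1 + c\,x^{\top}V_0^{-1}x}
         = \frac{V_0^{-1}x}{1 + c\,x^{\top}V_0^{-1}x}, \\[4pt]
\hat{\beta} &= \frac{x^{\top}V^{-1}y}{x^{\top}V^{-1}x}
             = \frac{x^{\top}V_0^{-1}y}{x^{\top}V_0^{-1}x}.
\end{aligned}
```

The first line is the Sherman-Morrison identity applied to x; the common factor 1/(1 + c x^T V_0^{-1} x) then cancels between numerator and denominator of the generalized least-squares estimator, and only one inversion (of V_0) is needed.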
USDA-ARS's Scientific Manuscript database
Large sets of genomic data are becoming available for cucumber (Cucumis sativus), yet there is no tool for whole genome genotyping. Creation of saturated genetic maps depends on development of good markers. The present cucumber genetic maps are based on several hundreds of markers. However they are ...
USDA-ARS's Scientific Manuscript database
A standardized set of 12 microsatellite markers, previously agreed upon following an ECP/GR workshop in 2006, was used to screen accessions from the UK National Pear Collection at Brogdale and from the US National Pear Germplasm Repository (NCGR), Corvallis. Eight standard varieties were chosen from...
Kejia Pang; Keith Woeste; Charles Michler
2017-01-01
A set of eight microsatellite markers was used to genotype 25 black walnut (Juglans nigra L.) clones within the Purdue University germplasm repository. The identities of 212 ramets were verified using the same eight microsatellite markers. Some trees were mislabeled and corrected as to clone using analysis of microsatellite markers. A genetic...
Frank, James A.; Parsons, Polly E.; Matthay, Michael A.
2009-01-01
For patients with acute lung injury, positive pressure mechanical ventilation is life saving. However, considerable experimental and clinical data have demonstrated that how clinicians set the tidal volume, positive end-expiratory pressure, and plateau airway pressure influences lung injury severity and patient outcomes including mortality. In order to better identify ventilator-associated lung injury (VALI), clinical investigators have sought to measure blood-borne and airspace biological markers of VALI. At the same time, several laboratory-based studies have focused on biological markers of inflammation and organ injury in experimental models in order to clarify the mechanisms of ventilator-induced lung injury (VILI) and VALI. This review summarizes data on biological markers of VALI and VILI from both clinical and experimental studies with an emphasis on markers identified in patients and in the experimental setting. This analysis suggests that measurement of some of these biological markers may be of value in diagnosing VALI and in understanding its pathogenesis. PMID:17167015
Kang, Tianyu; Ding, Wei; Zhang, Luoyan; Ziemek, Daniel; Zarringhalam, Kourosh
2017-12-19
Stratification of patient subpopulations that respond favorably to treatment or experience an adverse reaction is an essential step toward development of new personalized therapies and diagnostics. It is currently feasible to generate omic-scale biological measurements for all patients in a study, providing an opportunity for machine learning models to identify molecular markers for disease diagnosis and progression. However, the high variability of genetic background in human populations hampers the reproducibility of omic-scale markers. In this paper, we develop a biological network-based regularized artificial neural network model for prediction of phenotype from transcriptomic measurements in clinical trials. To improve model sparsity and the overall reproducibility of the model, we incorporate regularization for simultaneous shrinkage of gene sets based on active upstream regulatory mechanisms into the model. We benchmark our method against various regression, support vector machine and artificial neural network models and demonstrate the ability of our method to predict clinical outcomes using clinical trial data on acute rejection in kidney transplantation and response to Infliximab in ulcerative colitis. We show that integration of prior biological knowledge into the classification, as developed in this paper, significantly improves the robustness and generalizability of predictions to independent datasets. We provide Java code for our algorithm along with a parsed version of the STRING DB database. In summary, we present a method for prediction of clinical phenotypes using baseline genome-wide expression data that makes use of prior biological knowledge on gene-regulatory interactions in order to increase robustness and reproducibility of omic-scale markers. The integrated group-wise regularization method increases the interpretability of biological signatures and gives stable performance estimates across independent test sets.
Estimation of mating system parameters in plant populations using marker loci with null alleles.
Ross, H A
1986-06-01
An Expectation-Maximization (EM)-algorithm procedure is presented that extends the method of Cheliak et al. (1983) for maximum-likelihood estimation of mating system parameters of mixed mating system models. The extension permits the estimation of the rate of self-fertilization (s) and allele frequencies (Pi) at loci in outcrossing pollen, at marker loci having recessive null alleles. The algorithm makes use of maternal and filial genotypic arrays obtained by the electrophoretic analysis of cohorts of progeny. The genotypes of maternal plants must be known. Explicit equations are given for cases when the genotype of the maternal gamete inherited by a seed can (gymnosperms) or cannot (angiosperms) be determined. The procedure can accommodate any number of codominant alleles, but only one recessive null allele at each locus. An example, using actual data from Pinus banksiana, is presented to illustrate the application of this EM algorithm to the estimation of mating system parameters using marker loci having both codominant and recessive alleles.
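How EM handles a recessive null allele can be illustrated on a much simpler problem than the one in the paper. The sketch below assumes random mating (Hardy-Weinberg proportions) and one visible allele A plus one null allele n; it does not implement the mixed mating model or estimate the selfing rate s. The counts are hypothetical.

```python
import math

def em_null_allele(n_band, n_blank, iterations=200):
    """EM estimate of the null-allele frequency q at a two-allele locus.

    n_band: individuals showing the band (genotype AA or An, indistinguishable)
    n_blank: individuals showing no band (genotype nn)
    """
    total = n_band + n_blank
    if n_band == 0:
        return 1.0  # every individual is nn, so the null allele is fixed
    q = 0.5  # starting value
    for _ in range(iterations):
        p = 1.0 - q
        # E-step: expected share of An heterozygotes among banded individuals.
        het_share = 2 * p * q / (p * p + 2 * p * q)
        expected_null_copies = 2 * n_blank + n_band * het_share
        # M-step: frequency = expected null copies / total allele copies.
        q = expected_null_copies / (2 * total)
    return q

q_hat = em_null_allele(n_band=91, n_blank=9)
```

In this two-class case the maximum-likelihood estimate has the closed form sqrt(n_blank / total), which gives a direct check on the iteration; with several codominant alleles no closed form exists and the iteration is needed.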
2012-01-01
Background Populations of the Americas were founded by early migrants from Asia, and some have experienced recent genetic admixture. To better characterize the native and non-native ancestry components in populations from the Americas, we analyzed 815,377 autosomal SNPs, mitochondrial hypervariable segments I and II, and 36 Y-chromosome STRs from 24 Mesoamerican Totonacs and 23 South American Bolivians. Results and Conclusions We analyzed common genomic regions from native Bolivian and Totonac populations to identify 324 highly predictive Native American ancestry informative markers (AIMs). As few as 40–50 of these AIMs perform nearly as well as large panels of random genome-wide SNPs for predicting and estimating Native American ancestry and admixture levels. These AIMs have greater New World vs. Old World specificity than previous AIMs sets. We identify highly-divergent New World SNPs that coincide with high-frequency haplotypes found at similar frequencies in all populations examined, including the HGDP Pima, Maya, Colombian, Karitiana, and Surui American populations. Some of these regions are potential candidates for positive selection. European admixture in the Bolivian sample is approximately 12%, though individual estimates range from 0–48%. We estimate that the admixture occurred ~360–384 years ago. Little evidence of European or African admixture was found in Totonac individuals. Bolivians with pre-Columbian mtDNA and Y-chromosome haplogroups had 5–30% autosomal European ancestry, demonstrating the limitations of Y-chromosome and mtDNA haplogroups and the need for autosomal ancestry informative markers for assessing ancestry in admixed populations. PMID:22606979
Genome-based prediction of test cross performance in two subsequent breeding cycles.
Hofheinz, Nina; Borchardt, Dietrich; Weissleder, Knuth; Frisch, Matthias
2012-12-01
Genome-based prediction of genetic values is expected to overcome shortcomings that limit the application of QTL mapping and marker-assisted selection in plant breeding. Our goal was to study the genome-based prediction of test cross performance with genetic effects that were estimated using genotypes from the preceding breeding cycle. In particular, our objectives were to employ a ridge regression approach that approximates best linear unbiased prediction of genetic effects, compare cross validation with validation using genetic material of the subsequent breeding cycle, and investigate the prospects of genome-based prediction in sugar beet breeding. We focused on the traits sugar content and standard molasses loss (ML) and used a set of 310 sugar beet lines to estimate genetic effects at 384 SNP markers. In cross validation, correlations >0.8 between observed and predicted test cross performance were observed for both traits. However, in validation with 56 lines from the next breeding cycle, a correlation of 0.8 could only be observed for sugar content; for standard ML the correlation dropped to 0.4. We found that ridge regression based on preliminary estimates of the heritability provided a very good approximation of best linear unbiased prediction and was not accompanied by a loss in prediction accuracy. We conclude that prediction accuracy assessed with cross validation within one cycle of a breeding program cannot be used as an indicator for the accuracy of predicting lines of the next cycle. Prediction of lines of the next cycle seems promising for traits with high heritabilities.
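The ridge regression approach described in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, and the penalty lambda = m(1 - h2)/h2 is an assumed textbook choice that treats all m markers as contributing equal variance, with h2 a preliminary heritability estimate.

```python
import numpy as np

def ridge_marker_effects(X, y, h2):
    """Estimate marker effects by ridge regression (hypothetical helper).

    X  : (n_lines, n_markers) genotype matrix
    y  : (n_lines,) phenotypes, e.g. test cross performance
    h2 : preliminary heritability estimate, strictly between 0 and 1
    """
    n, m = X.shape
    col_means = X.mean(axis=0)
    Xc = X - col_means            # centre markers
    yc = y - y.mean()             # centre phenotypes
    # Assumed penalty: ratio of residual to per-marker genetic variance.
    lam = m * (1.0 - h2) / h2
    # Solve (Xc'Xc + lam * I) b = Xc'yc for the marker effects b.
    b = np.linalg.solve(Xc.T @ Xc + lam * np.eye(m), Xc.T @ yc)
    return y.mean(), col_means, b

def predict(mu, col_means, b, X_new):
    """Predict performance of new lines from effects estimated in an earlier cycle."""
    return mu + (X_new - col_means) @ b
```

In the setting of the abstract, the effects would be estimated on lines of one breeding cycle and `predict` applied to genotypes of the next cycle; a lower h2 gives a larger penalty and therefore stronger shrinkage of the marker effects.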
de Jager, Deon; Swarts, Petrus; Harper, Cindy; Bloomer, Paulette
2017-11-01
The identification of related and unrelated individuals from molecular marker data is often difficult, particularly when no pedigree information is available and the data set is large. High levels of relatedness or inbreeding can influence genotype frequencies and thus genetic marker evaluation, as well as the accurate inference of hidden genetic structure. Identification of related and unrelated individuals is also important in breeding programmes, to inform decisions about breeding pairs and translocations. We present Friends and Family, a Windows executable program with a graphical user interface that identifies unrelated individuals from a pairwise relatedness matrix or table generated in programs such as coancestry and genalex. Friends and Family outputs a list of samples that are all unrelated to each other, based on a user-defined relatedness cut-off value. This unrelated data set can be used in downstream analyses, such as marker evaluation or inference of genetic structure. The results can be compared to those of the full data set to determine the effect related individuals have on the analyses. We demonstrate one of the applications of the program: how the removal of related individuals altered the Hardy-Weinberg equilibrium test outcome for microsatellite markers in an empirical data set. Friends and Family can be obtained from https://github.com/DeondeJager/Friends-and-Family. © 2017 John Wiley & Sons Ltd.
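The idea of extracting a mutually unrelated set from a pairwise relatedness matrix can be illustrated with a simple greedy sketch. The abstract does not describe the algorithm used by Friends and Family, so the procedure below (repeatedly dropping the individual with the most related partners) and the function name `unrelated_subset` are assumptions made purely for illustration, as is treating "related" as strictly above the cut-off.

```python
def unrelated_subset(names, relatedness, cutoff):
    """Return names of individuals that are pairwise unrelated (hypothetical helper).

    names       : list of sample identifiers
    relatedness : symmetric matrix (list of lists) of pairwise relatedness
    cutoff      : pairs with relatedness above this value count as related
    """
    keep = set(range(len(names)))
    while keep:
        # Count, for each remaining individual, its related partners still kept.
        counts = {
            i: sum(1 for j in keep if j != i and relatedness[i][j] > cutoff)
            for i in keep
        }
        # Ties are broken in favour of dropping the lowest index.
        worst = max(sorted(counts), key=counts.get)
        if counts[worst] == 0:
            break                 # everyone left is mutually unrelated
        keep.remove(worst)
    return [names[i] for i in sorted(keep)]


# Toy data: A is related to both B and C; D is unrelated to everyone.
names = ["A", "B", "C", "D"]
r = [
    [1.0, 0.5, 0.25, 0.0],
    [0.5, 1.0, 0.0, 0.0],
    [0.25, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
]
print(unrelated_subset(names, r, cutoff=0.2))  # ['B', 'C', 'D']
```

A greedy rule like this does not guarantee the largest possible unrelated set, but it keeps the example short and deterministic.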
Glazko, V I; Zelenaia, L B; Iasinetskaia, N A
1997-01-01
An investigation of the genetic interrelations among a number of Artiodactyla and Perissodactyla species was carried out using different types of molecular-genetic markers (proteins, RAPD-PCR). Marker-specific features of interspecific relations, and their similarities across the groups of markers of both types, were revealed. Differences were observed between the interspecies genetic relations and those estimated from the phylogeny for a particular group of markers of different types. It is suggested that these discrepancies may be related to common selection factors and to the involvement of this marker group in selection in some species.
Probability genotype imputation method and integrated weighted lasso for QTL identification.
Demetrashvili, Nino; Van den Heuvel, Edwin R; Wit, Ernst C
2013-12-30
Many QTL studies have two common features: (1) often there is missing marker information, (2) among many markers involved in the biological process only a few are causal. In statistics, the second issue falls under the headings "sparsity" and "causal inference". The goal of this work is to develop a two-step statistical methodology for QTL mapping for markers with binary genotypes. The first step introduces a novel imputation method for missing genotypes. Outcomes of the proposed imputation method are probabilities which serve as weights in the second step, namely the weighted lasso. The sparse phenotype inference is employed to select a set of predictive markers for the trait of interest. Simulation studies validate the proposed methodology under a wide range of realistic settings. Furthermore, the methodology outperforms alternative imputation and variable selection methods in such studies. The methodology was applied to an Arabidopsis experiment, containing 69 markers for 165 recombinant inbred lines of an F8 generation. The results confirm previously identified regions; however, several new markers are also found. On the basis of the inferred ROC behavior these markers show good potential for being real, especially for the germination trait Gmax. Our imputation method shows higher accuracy in terms of sensitivity and specificity compared to an alternative imputation method. Also, the proposed weighted lasso outperforms commonly practiced multiple regression as well as the traditional lasso and adaptive lasso with three weighting schemes. This means that under realistic missing data settings this methodology can be used for QTL identification.
Gene genealogies for genetic association mapping, with application to Crohn's disease
Burkett, Kelly M.; Greenwood, Celia M. T.; McNeney, Brad; Graham, Jinko
2013-01-01
A gene genealogy describes relationships among haplotypes sampled from a population. Knowledge of the gene genealogy for a set of haplotypes is useful for estimation of population genetic parameters and it also has potential application in finding disease-predisposing genetic variants. As the true gene genealogy is unknown, Markov chain Monte Carlo (MCMC) approaches have been used to sample genealogies conditional on data at multiple genetic markers. We previously implemented an MCMC algorithm to sample from an approximation to the distribution of the gene genealogy conditional on haplotype data. Our approach samples ancestral trees, recombination and mutation rates at a genomic focal point. In this work, we describe how our sampler can be used to find disease-predisposing genetic variants in samples of cases and controls. We use a tree-based association statistic that quantifies the degree to which case haplotypes are more closely related to each other around the focal point than control haplotypes, without relying on a disease model. As the ancestral tree is a latent variable, so is the tree-based association statistic. We show how the sampler can be used to estimate the posterior distribution of the latent test statistic and corresponding latent p-values, which together comprise a fuzzy p-value. We illustrate the approach on a publicly-available dataset from a study of Crohn's disease that consists of genotypes at multiple SNP markers in a small genomic region. We estimate the posterior distribution of the tree-based association statistic and the recombination rate at multiple focal points in the region. Reassuringly, the posterior mean recombination rates estimated at the different focal points are consistent with previously published estimates. The tree-based association approach finds multiple sub-regions where the case haplotypes are more genetically related than the control haplotypes, and that there may be one or multiple disease-predisposing loci. 
PMID:24348515
Linkage disequilibrium interval mapping of quantitative trait loci.
Boitard, Simon; Abdallah, Jihad; de Rochambeau, Hubert; Cierco-Ayrolles, Christine; Mangin, Brigitte
2006-03-16
For many years gene mapping studies have been performed through linkage analyses based on pedigree data. Recently, linkage disequilibrium methods based on unrelated individuals have been advocated as powerful tools to refine estimates of gene location. Many strategies have been proposed to deal with simply inherited disease traits. However, locating quantitative trait loci is statistically more challenging and considerable research is needed to provide robust and computationally efficient methods. Under a three-locus Wright-Fisher model, we derived approximate expressions for the expected haplotype frequencies in a population. We considered haplotypes comprising one trait locus and two flanking markers. Using these theoretical expressions, we built a likelihood-maximization method, called HAPim, for estimating the location of a quantitative trait locus. For each postulated position, the method only requires information from the two flanking markers. Over a wide range of simulation scenarios it was found to be more accurate than a two-marker composite likelihood method. It also performed as well as identity by descent methods, whilst being valuable in a wider range of populations. Our method makes efficient use of marker information, and can be valuable for fine mapping purposes. Its performance is increased if multiallelic markers are available. Several improvements can be developed to account for more complex evolution scenarios or provide robust confidence intervals for the location estimates.
Ajzenberg, Daniel; Collinet, Frédéric; Mercier, Aurélien; Vignoles, Philippe; Dardé, Marie-Laure
2010-12-01
We developed an easy-to-use method for genotyping Toxoplasma gondii isolates in a single multiplex PCR assay with 15 microsatellite markers. This method was validated by testing 26 reference isolates that had been characterized with other sets of markers.
Uniparental Markers of Contemporary Italian Population Reveals Details on Its Pre-Roman Heritage
Álvarez-Iglesias, Vanesa; Fondevila, Manuel; Blanco-Verea, Alejandro; Carracedo, Ángel; Pascali, Vincenzo L.; Capelli, Cristian
2012-01-01
Background According to archaeological records and historical documentation, Italy has been a melting point for populations of different geographical and ethnic matrices. Although Italy has been a favorite subject for numerous population genetic studies, genetic patterns have never been analyzed comprehensively, including uniparental and autosomal markers throughout the country. Methods/Principal Findings A total of 583 individuals were sampled from across the Italian Peninsula, from ten distant (if homogeneous by language) ethnic communities — and from two linguistic isolates (Ladins, Grecani Salentini). All samples were first typed for the mitochondrial DNA (mtDNA) control region and selected coding region SNPs (mtSNPs). This data was pooled for analysis with 3,778 mtDNA control-region profiles collected from the literature. Secondly, a set of Y-chromosome SNPs and STRs were also analyzed in 479 individuals together with a panel of autosomal ancestry informative markers (AIMs) from 441 samples. The resulting genetic record reveals clines of genetic frequencies laid according to the latitude slant along continental Italy – probably generated by demographical events dating back to the Neolithic. The Ladins showed distinctive, if more recent structure. The Neolithic contribution was estimated for the Y-chromosome as 14.5% and for mtDNA as 10.5%. Y-chromosome data showed larger differentiation between North, Center and South than mtDNA. AIMs detected a minor sub-Saharan component; this is however higher than for other European non-Mediterranean populations. The same signal of sub-Saharan heritage was also evident in uniparental markers. Conclusions/Significance Italy shows patterns of molecular variation mirroring other European countries, although some heterogeneity exists based on different analysis and molecular markers. From North to South, Italy shows clinal patterns that were most likely modulated during Neolithic times. PMID:23251386
Uniparental markers of contemporary Italian population reveals details on its pre-Roman heritage.
Brisighelli, Francesca; Álvarez-Iglesias, Vanesa; Fondevila, Manuel; Blanco-Verea, Alejandro; Carracedo, Angel; Pascali, Vincenzo L; Capelli, Cristian; Salas, Antonio
2012-01-01
According to archaeological records and historical documentation, Italy has been a melting point for populations of different geographical and ethnic matrices. Although Italy has been a favorite subject for numerous population genetic studies, genetic patterns have never been analyzed comprehensively, including uniparental and autosomal markers throughout the country. A total of 583 individuals were sampled from across the Italian Peninsula, from ten distant (if homogeneous by language) ethnic communities--and from two linguistic isolates (Ladins, Grecani Salentini). All samples were first typed for the mitochondrial DNA (mtDNA) control region and selected coding region SNPs (mtSNPs). This data was pooled for analysis with 3,778 mtDNA control-region profiles collected from the literature. Secondly, a set of Y-chromosome SNPs and STRs were also analyzed in 479 individuals together with a panel of autosomal ancestry informative markers (AIMs) from 441 samples. The resulting genetic record reveals clines of genetic frequencies laid according to the latitude slant along continental Italy--probably generated by demographical events dating back to the Neolithic. The Ladins showed distinctive, if more recent structure. The Neolithic contribution was estimated for the Y-chromosome as 14.5% and for mtDNA as 10.5%. Y-chromosome data showed larger differentiation between North, Center and South than mtDNA. AIMs detected a minor sub-Saharan component; this is however higher than for other European non-Mediterranean populations. The same signal of sub-Saharan heritage was also evident in uniparental markers. Italy shows patterns of molecular variation mirroring other European countries, although some heterogeneity exists based on different analysis and molecular markers. From North to South, Italy shows clinal patterns that were most likely modulated during Neolithic times.
Efficient Breeding by Genomic Mating.
Akdemir, Deniz; Sánchez, Julio I
2016-01-01
Selection in breeding programs can be done by using phenotypes (phenotypic selection), pedigree relationship (breeding value selection) or molecular markers (marker assisted selection or genomic selection). All these methods are based on truncation selection, focusing on the best performance of parents before mating. In this article we proposed an approach to breeding, named genomic mating, which focuses on mating instead of truncation selection. Genomic mating uses information in a similar fashion to genomic selection but includes information on complementation of parents to be mated. Following the efficiency frontier surface, genomic mating uses concepts of estimated breeding values, risk (usefulness) and coefficient of ancestry to optimize mating between parents. We used a genetic algorithm to find solutions to this optimization problem and the results from our simulations comparing genomic selection, phenotypic selection and the mating approach indicate that current approach for breeding complex traits is more favorable than phenotypic and genomic selection. Genomic mating is similar to genomic selection in terms of estimating marker effects, but in genomic mating the genetic information and the estimated marker effects are used to decide which genotypes should be crossed to obtain the next breeding population.
Derbyshire, Brian; Raut, Videshnandan V.
2013-01-01
Historically, wire markers were attached to cemented all-plastic acetabular cups to demarcate the periphery and to measure socket wear. The wire shape was either a semi-circle passing over the pole of the cup, or a circle around the cup equator. More recently, “double-D” shaped markers were introduced with a part-circular aspect passing over the pole and a semi-circular aspect parallel to the equatorial plane. This configuration enabled cup retroversion to be distinguished from anteversion. In this study, the accuracy of radiographic measurement of cup orientation and wear was assessed for cups with “double-D” and circular markers. Each cup was attached to a measurement jig which could vary the anteversion/retroversion and internal/external rotation of the cup. A metal femoral head was fixed within the socket and radiographic images were created for all combinations of cup orientation settings. The images were measured using software with automatic edge detection, and cup orientation and zero-wear accuracies were determined for each setting. The median error for cup version measurements was similar for both types of wire marker (0.2° double-D marker, −0.24° circular marker), but measurements of the circular marker were more repeatable. The median inclination errors were 2.05° (double-D marker) and 0.23° (circular marker). The median overall “zero wear” errors were 0.19 mm (double-D marker) and 0.03 mm (circular marker). Measurements of the circular wire marker were much more repeatable. PMID:23813165
D. V. Shaw; R. W. Allard
1981-01-01
Two methods of estimating the proportion of self-fertilization as opposed to outcrossing in plant populations are described. The first method makes use of marker loci one at a time; the second method makes use of multiple marker loci simultaneously. Comparisons of the estimates of proportions of selfing and outcrossing obtained using the two methods are shown to yield...
An alternative covariance estimator to investigate genetic heterogeneity in populations.
Heslot, Nicolas; Jannink, Jean-Luc
2015-11-26
For genomic prediction and genome-wide association studies (GWAS) using mixed models, covariance between individuals is estimated using molecular markers. Based on the properties of mixed models, using available molecular data for prediction is optimal if this covariance is known. Under this assumption, adding individuals to the analysis should never be detrimental. However, some empirical studies showed that increasing training population size decreased prediction accuracy. Recently, results from theoretical models indicated that even if marker density is high and the genetic architecture of traits is controlled by many loci with small additive effects, the covariance between individuals, which depends on relationships at causal loci, is not always well estimated by the whole-genome kinship. We propose an alternative covariance estimator named K-kernel, to account for potential genetic heterogeneity between populations that is characterized by a lack of genetic correlation, and to limit the information flow between a priori unknown populations in a trait-specific manner. This is similar to a multi-trait model and parameters are estimated by REML and, in extreme cases, it can allow for an independent genetic architecture between populations. As such, K-kernel is useful to study the problem of the design of training populations. K-kernel was compared to other covariance estimators or kernels to examine its fit to the data, cross-validated accuracy and suitability for GWAS on several datasets. It provides a significantly better fit to the data than the genomic best linear unbiased prediction model and, in some cases it performs better than other kernels such as the Gaussian kernel, as shown by an empirical null distribution. In GWAS simulations, alternative kernels control type I errors as well as or better than the classical whole-genome kinship and increase statistical power. No or small gains were observed in cross-validated prediction accuracy. 
This alternative covariance estimator can be used to gain insight into trait-specific genetic heterogeneity by identifying relevant sub-populations that lack genetic correlation between them. Genetic correlation can be 0 between identified sub-populations by performing automatic selection of relevant sets of individuals to be included in the training population. It may also increase statistical power in GWAS.
Population genetics of autopolyploids under a mixed mating model and the estimation of selfing rate.
Hardy, Olivier J
2016-01-01
Nowadays, the population genetics analysis of autopolyploid species faces many difficulties due to (i) limited development of population genetics tools under polysomic inheritance, (ii) difficulties to assess allelic dosage when genotyping individuals and (iii) a form of inbreeding resulting from the mechanism of 'double reduction'. Consequently, few data analysis computer programs are applicable to autopolyploids. To contribute bridging this gap, this article first derives theoretical expectations for the inbreeding and identity disequilibrium coefficients under polysomic inheritance in a mixed mating model. Moment estimators of these coefficients are proposed when exact genotypes or just markers phenotypes (i.e. allelic dosage unknown) are available. This led to the development of estimators of the selfing rate based on adult genotypes or phenotypes and applicable to any even-ploidy level. Their statistical performances and robustness were assessed by numerical simulations. Contrary to inbreeding-based estimators, the identity disequilibrium-based estimator using phenotypes is robust (absolute bias generally < 0.05), even in the presence of double reduction, null alleles or biparental inbreeding due to isolation by distance. A fairly good precision of the selfing rate estimates (root mean squared error < 0.1) is already achievable using a sample of 30-50 individuals phenotyped at 10 loci bearing 5-10 alleles each, conditions reachable using microsatellite markers. Diallelic markers (e.g. SNP) can also perform satisfactorily in diploids and tetraploids but more polymorphic markers are necessary for higher ploidy levels. The method is implemented in the software SPAGeDi and should contribute to reduce the lack of population genetics tools applicable to autopolyploids. © 2015 John Wiley & Sons Ltd.
Linkage map of the honey bee, Apis mellifera, based on RAPD markers
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hunt, G.J.; Page, R.E. Jr.
A linkage map was constructed for the honey bee based on the segregation of 365 random amplified polymorphic DNA (RAPD) markers in haploid male progeny of a single female bee. The X locus for sex determination and genes for black body color and malate dehydrogenase were mapped to separate linkage groups. RAPD markers were very efficient for mapping, with an average of about 2.8 loci mapped for each 10-nucleotide primer that was used in polymerase chain reactions. The mean interval size between markers on the map was 9.1 cM. The map covered 3110 cM of linked markers on 26 linkage groups. We estimate the total genome size to be approximately 3450 cM. The size of the map indicated a very high recombination rate for the honey bee. The relationship of physical to genetic distance was estimated at 52 kb/cM, suggesting that map-based cloning of genes will be feasible for this species. 71 refs., 6 figs., 1 tab.
The Effect of Teaching Structural Discourse Markers in an EFL Classroom Setting
ERIC Educational Resources Information Center
Alraddadi, Budoor Muslim
2016-01-01
This study aimed to explore the effects of explicit teaching on the acquisition of spoken discourse markers (DMs) on EFL learners' presentation production. It also aimed to measure the impact of two different treatments on the acquisition of a set of DMs. This study is an experimental study and focuses on the overall production of spoken…
The Use of Discourse Markers as an Interactive Feature in Science Lecture Discourse in L2 Setting
ERIC Educational Resources Information Center
Rido, Akhyar
2010-01-01
The objective of this research is to investigate the function of discourse markers as an interpersonal-interactive feature in a science lecture in second language (L2) setting in Malaysia. This research employs qualitative method while the data are gathered through non-participant observation and video recording. From the findings, there are…
Age- and Sex-Specific Causal Effects of Adiposity on Cardiovascular Risk Factors
Fall, Tove; Hägg, Sara; Ploner, Alexander; Mägi, Reedik; Fischer, Krista; Draisma, Harmen H.M.; Sarin, Antti-Pekka; Benyamin, Beben; Ladenvall, Claes; Åkerlund, Mikael; Kals, Mart; Esko, Tõnu; Nelson, Christopher P.; Kaakinen, Marika; Huikari, Ville; Mangino, Massimo; Meirhaeghe, Aline; Kristiansson, Kati; Nuotio, Marja-Liisa; Kobl, Michael; Grallert, Harald; Dehghan, Abbas; Kuningas, Maris; de Vries, Paul S.; de Bruijn, Renée F.A.G.; Willems, Sara M.; Heikkilä, Kauko; Silventoinen, Karri; Pietiläinen, Kirsi H.; Legry, Vanessa; Giedraitis, Vilmantas; Goumidi, Louisa; Syvänen, Ann-Christine; Strauch, Konstantin; Koenig, Wolfgang; Lichtner, Peter; Herder, Christian; Palotie, Aarno; Menni, Cristina; Uitterlinden, André G.; Kuulasmaa, Kari; Havulinna, Aki S.; Moreno, Luis A.; Gonzalez-Gross, Marcela; Evans, Alun; Tregouet, David-Alexandre; Yarnell, John W.G.; Virtamo, Jarmo; Ferrières, Jean; Veronesi, Giovanni; Perola, Markus; Arveiler, Dominique; Brambilla, Paolo; Lind, Lars; Kaprio, Jaakko; Hofman, Albert; Stricker, Bruno H.; van Duijn, Cornelia M.; Ikram, M. Arfan; Franco, Oscar H.; Cottel, Dominique; Dallongeville, Jean; Hall, Alistair S.; Jula, Antti; Tobin, Martin D.; Penninx, Brenda W.; Peters, Annette; Gieger, Christian; Samani, Nilesh J.; Montgomery, Grant W.; Whitfield, John B.; Martin, Nicholas G.; Groop, Leif; Spector, Tim D.; Magnusson, Patrik K.; Amouyel, Philippe; Boomsma, Dorret I.; Nilsson, Peter M.; Järvelin, Marjo-Riitta; Lyssenko, Valeriya; Metspalu, Andres; Strachan, David P.; Salomaa, Veikko; Ripatti, Samuli; Pedersen, Nancy L.; Prokopenko, Inga; McCarthy, Mark I.
2015-01-01
Observational studies have reported different effects of adiposity on cardiovascular risk factors across age and sex. Since cardiovascular risk factors are enriched in obese individuals, it has not been easy to dissect the effects of adiposity from those of other risk factors. We used a Mendelian randomization approach, applying a set of 32 genetic markers to estimate the causal effect of adiposity on blood pressure, glycemic indices, circulating lipid levels, and markers of inflammation and liver disease in up to 67,553 individuals. All analyses were stratified by age (cutoff 55 years of age) and sex. The genetic score was associated with BMI in both nonstratified analysis (P = 2.8 × 10^−107) and stratified analyses (all P < 3.3 × 10^−30). We found evidence of a causal effect of adiposity on blood pressure, fasting levels of insulin, C-reactive protein, interleukin-6, HDL cholesterol, and triglycerides in a nonstratified analysis and in the <55-year stratum. Further, we found evidence of a smaller causal effect on total cholesterol (P for difference = 0.015) in the ≥55-year stratum than in the <55-year stratum, a finding that could be explained by biology, survival bias, or differential medication. In conclusion, this study extends previous knowledge of the effects of adiposity by providing sex- and age-specific causal estimates on cardiovascular risk factors. PMID:25712996
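The logic of a Mendelian randomization estimate can be illustrated with the standard Wald ratio: the effect of the genetic score on the outcome divided by its effect on the exposure. The abstract does not state which estimator the authors used, so this is only a generic sketch; the function name `wald_ratio` and all numbers are hypothetical, and the standard error uses a first-order approximation that ignores uncertainty in the score-exposure effect.

```python
def wald_ratio(beta_gx, beta_gy, se_gy):
    """Wald ratio estimate of the causal effect of an exposure on an outcome.

    beta_gx : effect of the genetic score on the exposure (e.g. BMI)
    beta_gy : effect of the genetic score on the outcome (e.g. blood pressure)
    se_gy   : standard error of beta_gy
    """
    if beta_gx == 0:
        raise ValueError("the genetic score must be associated with the exposure")
    estimate = beta_gy / beta_gx
    # First-order approximation: treats beta_gx as known without error.
    se = abs(se_gy / beta_gx)
    return estimate, se


# Made-up inputs, chosen only to show the arithmetic.
estimate, se = wald_ratio(beta_gx=0.5, beta_gy=0.1, se_gy=0.02)
```

Stratified estimates such as those in the abstract would correspond to applying the same ratio within each age or sex stratum.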
Ajzenberg, Daniel; Collinet, Frédéric; Mercier, Aurélien; Vignoles, Philippe; Dardé, Marie-Laure
2010-01-01
We developed an easy-to-use method for genotyping Toxoplasma gondii isolates in a single multiplex PCR assay with 15 microsatellite markers. This method was validated by testing 26 reference isolates that had been characterized with other sets of markers. PMID:20881166
Chromosomal assignment of AFLP markers in upland cotton (Gossypium hirsutum L.)
USDA-ARS's Scientific Manuscript database
In this research, we used two sets of cotton aneuploid (G. hirsutum × G. tomentosum and G. hirsutum × G. barbadense) plants to assign AFLP markers to chromosomes using the deletion analysis method. Thirty-eight primer combinations were used to generate 608 polymorphic AFLP markers. Ninety-eight AFLP mark...
Development of core SSR markers for Gossypium germplasm characterization
USDA-ARS's Scientific Manuscript database
A set of 105 portable DNA markers were carefully developed to provide a common basis for systematic characterization of cotton germplasm collections in the U.S. and throughout the world. The 105 PCR-based SSR markers of different origins were evenly distributed on each of the 26 cotton chromosomes ...
NASA Astrophysics Data System (ADS)
Jin, Peng; van Wieringen, Niek; Hulshof, Maarten C. C. M.; Bel, Arjan; Alderliesten, Tanja
2016-03-01
The use of 4D cone-beam computed tomography (CBCT) and fiducial markers for guidance during radiation therapy of mobile tumors is challenging due to the trade-off between image quality, imaging dose, and scanning time. We aimed to investigate the visibility of markers and the feasibility of marker-based 4D registration and manual respiration-induced marker motion quantification for different CBCT acquisition settings. A dynamic thorax phantom and a patient with implanted gold markers were included. For both the phantom and patient, the peak-to-peak amplitude of marker motion in the cranial-caudal direction ranged from 5.3 to 14.0 mm, which did not affect the marker visibility and the associated marker-based registration feasibility. While using a medium field of view (FOV) and the same total imaging dose as is applied for 3D CBCT scanning in our clinic, it was feasible to attain an improved marker visibility by reducing the imaging dose per projection and increasing the number of projection images. For a small FOV with a shorter rotation arc but similar total imaging dose, streak artifacts were reduced due to using a smaller sampling angle. Additionally, the use of a small FOV allowed reducing total imaging dose and scanning time (~2.5 min) without losing the marker visibility. In conclusion, by using 4D CBCT with identical or lower imaging dose and a reduced gantry speed, it is feasible to attain sufficient marker visibility for marker-based 4D setup verification. Moreover, regardless of the settings, manual marker motion quantification can achieve a high accuracy with the error <1.2 mm.
USDA-ARS's Scientific Manuscript database
Dominant and co-dominant molecular markers are routinely used in plant genetic diversity research. In the present study we assessed the success-rate of three marker-systems for estimating genotypic diversity, clustering varieties into populations, and assigning a single variety into the expected pop...
Mitochondrial DNA Marker EST00083 Is Not Associated with High vs. Average IQ in a German Sample.
ERIC Educational Resources Information Center
Moises, Hans W.; Yang, Liu; Kohnke, Michael; Vetter, Peter; Neppert, Jurgen; Petrill, Stephen A.; Plomin, Robert
1998-01-01
Tested the association of a mitochondrial DNA marker (EST00083) with high IQ in a sample of 47 German adults with high IQ scores and 77 adults with IQs estimated at lower than 110. Results do not support the hypothesis that high IQ is associated with this marker. (SLD)
Chen, Honglin; Wang, Lixia; Liu, Xiaoyan; Hu, Liangliang; Wang, Suhua; Cheng, Xuzhen
2017-07-11
Cowpea [Vigna unguiculata (L.) Walp.] is one of the most important legumes in tropical and semi-arid regions. However, there is relatively little genomic information available for genetic research on and breeding of cowpea. The objectives of this study were to analyse the cowpea transcriptome and develop genic molecular markers for future genetic studies of this genus. Approximately 54 million high-quality cDNA sequence reads were obtained from cowpea based on Illumina paired-end sequencing technology and were de novo assembled to generate 47,899 unigenes with an N50 length of 1534 bp. Sequence similarity analysis revealed 36,289 unigenes (75.8%) with significant similarity to known proteins in the non-redundant (Nr) protein database, 23,471 unigenes (49.0%) with BLAST hits in the Swiss-Prot database, and 20,654 unigenes (43.1%) with high similarity in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Further analysis identified 5560 simple sequence repeats (SSRs) as potential genic molecular markers. Validating a random set of 500 SSR markers yielded 54 polymorphic markers among 32 cowpea accessions. This transcriptomic analysis of cowpea provided a valuable set of genomic data for characterizing genes with important agronomic traits in Vigna unguiculata and a new set of genic SSR markers for further genetic studies and breeding in cowpea and related Vigna species.
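The N50 length reported for the assembled unigenes is conventionally the length of the sequence at which the cumulative length, counted from the longest sequence down, first reaches half of the total assembly length. A minimal sketch of that conventional definition follows; the function name `n50` and the toy lengths are illustrative assumptions, not data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths."""
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence to the shortest.
    for length in sorted(lengths, reverse=True):
        running += length
        # Stop at the first sequence that brings us to half the total.
        if 2 * running >= total:
            return length
    raise ValueError("lengths must be non-empty")


# Toy example: total length 54, so half is 27; 10 + 9 + 8 reaches 27.
print(n50([2, 3, 4, 5, 6, 7, 8, 9, 10]))  # 8
```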
A global analysis of Y-chromosomal haplotype diversity for 23 STR loci.
Purps, Josephine; Siegert, Sabine; Willuweit, Sascha; Nagy, Marion; Alves, Cíntia; Salazar, Renato; Angustia, Sheila M T; Santos, Lorna H; Anslinger, Katja; Bayer, Birgit; Ayub, Qasim; Wei, Wei; Xue, Yali; Tyler-Smith, Chris; Bafalluy, Miriam Baeta; Martínez-Jarreta, Begoña; Egyed, Balazs; Balitzki, Beate; Tschumi, Sibylle; Ballard, David; Court, Denise Syndercombe; Barrantes, Xinia; Bäßler, Gerhard; Wiest, Tina; Berger, Burkhard; Niederstätter, Harald; Parson, Walther; Davis, Carey; Budowle, Bruce; Burri, Helen; Borer, Urs; Koller, Christoph; Carvalho, Elizeu F; Domingues, Patricia M; Chamoun, Wafaa Takash; Coble, Michael D; Hill, Carolyn R; Corach, Daniel; Caputo, Mariela; D'Amato, Maria E; Davison, Sean; Decorte, Ronny; Larmuseau, Maarten H D; Ottoni, Claudio; Rickards, Olga; Lu, Di; Jiang, Chengtao; Dobosz, Tadeusz; Jonkisz, Anna; Frank, William E; Furac, Ivana; Gehrig, Christian; Castella, Vincent; Grskovic, Branka; Haas, Cordula; Wobst, Jana; Hadzic, Gavrilo; Drobnic, Katja; Honda, Katsuya; Hou, Yiping; Zhou, Di; Li, Yan; Hu, Shengping; Chen, Shenglan; Immel, Uta-Dorothee; Lessig, Rüdiger; Jakovski, Zlatko; Ilievska, Tanja; Klann, Anja E; García, Cristina Cano; de Knijff, Peter; Kraaijenbrink, Thirsa; Kondili, Aikaterini; Miniati, Penelope; Vouropoulou, Maria; Kovacevic, Lejla; Marjanovic, Damir; Lindner, Iris; Mansour, Issam; Al-Azem, Mouayyad; Andari, Ansar El; Marino, Miguel; Furfuro, Sandra; Locarno, Laura; Martín, Pablo; Luque, Gracia M; Alonso, Antonio; Miranda, Luís Souto; Moreira, Helena; Mizuno, Natsuko; Iwashima, Yasuki; Neto, Rodrigo S Moura; Nogueira, Tatiana L S; Silva, Rosane; Nastainczyk-Wulf, Marina; Edelmann, Jeanett; Kohl, Michael; Nie, Shengjie; Wang, Xianping; Cheng, Baowen; Núñez, Carolina; Pancorbo, Marian Martínez de; Olofsson, Jill K; Morling, Niels; Onofri, Valerio; Tagliabracci, Adriano; Pamjav, Horolma; Volgyi, Antonia; Barany, Gusztav; Pawlowski, Ryszard; Maciejewska, Agnieszka; Pelotti, Susi; Pepinski, Witold; Abreu-Glowacka, 
Monica; Phillips, Christopher; Cárdenas, Jorge; Rey-Gonzalez, Danel; Salas, Antonio; Brisighelli, Francesca; Capelli, Cristian; Toscanini, Ulises; Piccinini, Andrea; Piglionica, Marilidia; Baldassarra, Stefania L; Ploski, Rafal; Konarzewska, Magdalena; Jastrzebska, Emila; Robino, Carlo; Sajantila, Antti; Palo, Jukka U; Guevara, Evelyn; Salvador, Jazelyn; Ungria, Maria Corazon De; Rodriguez, Jae Joseph Russell; Schmidt, Ulrike; Schlauderer, Nicola; Saukko, Pekka; Schneider, Peter M; Sirker, Miriam; Shin, Kyoung-Jin; Oh, Yu Na; Skitsa, Iulia; Ampati, Alexandra; Smith, Tobi-Gail; Calvit, Lina Solis de; Stenzl, Vlastimil; Capal, Thomas; Tillmar, Andreas; Nilsson, Helena; Turrina, Stefania; De Leo, Domenico; Verzeletti, Andrea; Cortellini, Venusia; Wetton, Jon H; Gwynne, Gareth M; Jobling, Mark A; Whittle, Martin R; Sumita, Denilce R; Wolańska-Nowak, Paulina; Yong, Rita Y Y; Krawczak, Michael; Nothnagel, Michael; Roewer, Lutz
2014-09-01
In a worldwide collaborative effort, 19,630 Y-chromosomes were sampled from 129 different populations in 51 countries. These chromosomes were typed for 23 short-tandem repeat (STR) loci (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS385ab, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, GATAH4, DYS481, DYS533, DYS549, DYS570, DYS576, and DYS643) using the PowerPlex Y23 System (PPY23, Promega Corporation, Madison, WI). Locus-specific allelic spectra of these markers were determined and a consistently high level of allelic diversity was observed. A considerable number of null, duplicate and off-ladder alleles were revealed. Standard single-locus and haplotype-based parameters were calculated and compared between subsets of Y-STR markers established for forensic casework. The PPY23 marker set provides substantially stronger discriminatory power than other available kits but at the same time reveals the same general patterns of population structure as other marker sets. A strong correlation was observed between the number of Y-STRs included in a marker set and some of the forensic parameters under study. Interestingly, a weak but consistent trend toward smaller genetic distances resulting from larger numbers of markers became apparent. Copyright © 2014 The Authors. Published by Elsevier Ireland Ltd. All rights reserved.
Olsen, Morten Tange; Bérubé, Martine; Robbins, Jooke; Palsbøll, Per J
2012-09-06
Telomeres, the protective caps of chromosomes, have emerged as powerful markers of biological age and life history in model and non-model species. The qPCR method is one of the most common methods for telomere length estimation, but has recently been criticized for being too error-prone and yielding unreliable results. This critique coincides with an increasing awareness of the potentials and limitations of the qPCR technique in general and the proposal of a general set of guidelines (MIQE) for standardization of experimental, analytical, and reporting steps of qPCR. In order to evaluate the utility of the qPCR method for telomere length estimation in non-model species, we carried out four different qPCR assays directed at humpback whale telomeres, and subsequently performed a rigorous quality control to evaluate the performance of each assay. Performance differed substantially among assays and only one assay was found useful for telomere length estimation in humpback whales. The most notable factors causing these inter-assay differences were primer design and the choice between singleplex and multiplex assays. Inferred amplification efficiencies differed by up to 40% depending on assay and quantification method; however, this variation only affected telomere length estimates in the worst performing assays. Our results suggest that seemingly well performing qPCR assays may contain biases that will only be detected by extensive quality control. Moreover, we show that the qPCR method for telomere length estimation can be highly precise and accurate, and thus suitable for telomere measurement in non-model species, if effort is devoted to optimization at all experimental and analytical steps. We conclude by highlighting a set of quality controls which may serve for further standardization of the qPCR method for telomere length estimation, and discuss some of the factors that may cause variation in qPCR experiments.
PMID:22954451
Bishop, Chris; Paul, Gunther; Thewlis, Dominic
2013-04-01
Kinematic models are commonly used to quantify foot and ankle kinematics, yet no marker sets or models have been proven reliable or accurate when wearing shoes. Further, the minimal detectable difference of a developed model is often not reported. We present a kinematic model that is reliable, accurate and sensitive to describe the kinematics of the foot-shoe complex and lower leg during walking gait. In order to achieve this, a new marker set was established, consisting of 25 markers applied on the shoe and skin surface, which informed a four segment kinematic model of the foot-shoe complex and lower leg. Three independent experiments were conducted to determine the reliability, accuracy and minimal detectable difference of the marker set and model. Inter-rater reliability of marker placement on the shoe was proven to be good to excellent (ICC=0.75-0.98) indicating that markers could be applied reliably between raters. Intra-rater reliability was better for the experienced rater (ICC=0.68-0.99) than the inexperienced rater (ICC=0.38-0.97). The accuracy of marker placement along each axis was <6.7 mm for all markers studied. Minimal detectable difference (MDD90) thresholds were defined for each joint; tibiocalcaneal joint--MDD90=2.17-9.36°, tarsometatarsal joint--MDD90=1.03-9.29° and the metatarsophalangeal joint--MDD90=1.75-9.12°. These thresholds proposed are specific for the description of shod motion, and can be used in future research designed at comparing between different footwear. Copyright © 2012 Elsevier B.V. All rights reserved.
Tjon-Kon-Fat, R I; Tajik, P; Zafarmand, M H; Bensdorp, A J; Bossuyt, P M M; Oosterhuis, G J E; van Golde, R; Repping, S; Lambers, M D A; Slappendel, E; Perquin, D; Pelinck, M J; Gianotten, J; Maas, J W M; Eijkemans, M J C; van der Veen, F; Mol, B W; van Wely, M
2017-05-01
Are there treatment selection markers that could aid in identifying couples, with unexplained or mild male subfertility, who would have better chances of a healthy child with IVF with single embryo transfer (IVF-SET) than with IUI with ovarian stimulation (IUI-OS)? We did not find any treatment selection markers that were associated with better chances of a healthy child with IVF-SET instead of IUI-OS in couples with unexplained or mild male subfertility. A recent trial, comparing IVF-SET to IUI-OS, found no evidence of a difference between live birth rates and multiple pregnancy rates. It was suggested that IUI-OS should remain the first-line treatment instead of IVF-SET in couples with unexplained or mild male subfertility and female age between 18 and 38 years. The question remains whether there are some couples that may have higher pregnancy chances if treated with IVF-SET instead of IUI. We performed our analyses on data from the INeS trial, where couples with unexplained or mild male subfertility and an unfavourable prognosis for natural conception were randomly allocated to IVF-SET, IVF in a modified natural cycle or IUI-OS. In view of the aim of this study, we only used data of the comparison between IVF-SET (201 couples) and IUI-OS (207 couples). We pre-defined the following baseline characteristics as potential treatment selection markers: female age, ethnicity, smoking status, type of subfertility (primary/secondary), duration of subfertility, BMI, pre-wash total motile count and Hunault prediction score. For each potential treatment selection marker, we explored the association with the chances of a healthy child after IVF-SET and IUI-OS and tested if there was an interaction with treatment. Given the exploratory nature of our analysis, we used a P-value of 0.1. None of the markers were associated with higher chances of a healthy child from IVF-SET compared to IUI-OS (P-value for interaction >0.10). 
Since this is the first large study that looked at potential treatment selection markers for IVF-SET compared to IUI-OS, we had no data on which to base a power calculation. The sample size was limited, making it difficult to detect any smaller associations. We could not identify couples with unexplained or mild male subfertility who would have had higher chances of a healthy child from immediate IVF-SET than from IUI-OS. As in the original trial IUI-OS had similar effectiveness and was less costly compared to IVF-SET, IUI-OS should remain the preferred first-line treatment in these couples. The study was supported by a grant from the Netherlands Organization for Health Research and Development, and a grant from the Netherlands' association of health care insurers. There are no conflicts of interest. The trial was registered at the Dutch trial registry (NTR939). © The Author 2017. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Biswas, Manosh Kumar; Liu, Yuxuan; Li, Chunyu; Sheng, Ou; Mayer, Christoph; Yi, Ganjun
2015-01-01
The development of organized, informative, robust, user-friendly, and freely accessible molecular markers is imperative to the Musa marker assisted breeding program. Although several hundred SSR markers have already been developed, the number of informative, robust, and freely accessible Musa markers remains inadequate for some breeding applications. In view of this issue, we surveyed SSRs in four different data sets, developed large-scale non-redundant highly informative therapeutic SSR markers, and classified them according to their attributes, as well as analyzed their cross-taxon transferability and utility for the genetic study of Musa and its relatives. A high SSR frequency (177 per Mbp) was found in the Musa genome. AT-rich dinucleotide repeats are predominant, and trinucleotide repeats are the most abundant in transcribed regions. A significant number of Musa SSRs are associated with pre-miRNAs, and 83% of these SSRs are promising candidates for the development of therapeutic SSR markers. Overall, 74% of the SSR markers were polymorphic, and 94% were transferable to at least one Musa spp. Two hundred forty-three markers generated a total of 1047 alleles, with 2-8 alleles each and an average of 4.38 alleles per locus. The PIC values ranged from 0.31 to 0.89 and averaged 0.71. We report the largest set of non-redundant, polymorphic, new SSR markers to be developed in Musa. These additional markers could be a valuable resource for marker-assisted breeding, genetic diversity and genomic studies of Musa and related species. PMID:26121637
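The PIC (polymorphism information content) values reported above can be made concrete with a small sketch. It assumes the widely used Botstein-style formula, PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2, where p_i are allele frequencies at one locus; the abstract does not state which formula the authors used, and the function name and frequencies below are hypothetical.

```python
from itertools import combinations


def pic(allele_freqs):
    """Polymorphism information content for one locus.

    Assumed formula: 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2.
    allele_freqs must be non-negative and sum to 1.
    """
    if any(p < 0 for p in allele_freqs):
        raise ValueError("allele frequencies must be non-negative")
    if abs(sum(allele_freqs) - 1.0) > 1e-8:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in allele_freqs)
    cross_term = sum(2 * (p * p) * (q * q)
                     for p, q in combinations(allele_freqs, 2))
    return 1.0 - homozygosity - cross_term


# Hypothetical locus with four equally frequent alleles.
print(pic([0.25, 0.25, 0.25, 0.25]))
```

Under this formula, PIC rises with both the number of alleles and the evenness of their frequencies, so a locus with several balanced alleles scores higher than one dominated by a single allele.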
Campos, G S; Reimann, F A; Cardoso, L L; Ferreira, C E R; Junqueira, V S; Schmidt, P I; Braccini Neto, J; Yokoo, M J I; Sollero, B P; Boligon, A A; Cardoso, F F
2018-05-07
The objective of the present study was to evaluate the accuracy and bias of direct and blended genomic predictions using different methods and cross-validation techniques for growth traits (weight and weight gains) and visual scores (conformation, precocity, muscling and size) obtained at weaning and at yearling in Hereford and Braford breeds. Phenotypic data contained 126,290 animals belonging to the Delta G Connection genetic improvement program, and a set of 3,545 animals genotyped with the 50K chip and 131 sires with the 777K. After quality control, 41,045 markers remained for all animals. An animal model was used to estimate (co)variance components and to predict breeding values, which were later used to calculate the deregressed estimated breeding values (DEBV). Animals with genotype and phenotype for the traits studied were divided into four or five groups by random and k-means clustering cross-validation strategies. The accuracies of the direct genomic values (DGV) were of moderate to high magnitude for traits measured at weaning and at yearling, ranging from 0.19 to 0.45 for the k-means and 0.23 to 0.78 for random clustering among all traits. The greatest gain in relation to the pedigree BLUP (PBLUP) was 9.5% with the BayesB method with both the k-means and the random clustering. Blended genomic value accuracies ranged from 0.19 to 0.56 for k-means and from 0.21 to 0.82 for random clustering. The analyses using the historical pedigree and phenotypes contributed additional information to calculate the GEBV and, in general, the largest gains were for the single-step (ssGBLUP) method in bivariate analyses, with a mean increase of 43.00% among all traits measured at weaning and of 46.27% for those evaluated at yearling. The accuracy values for the marker effects estimation methods were lower for k-means clustering, indicating that the training set relationship to the selection candidates is a major factor affecting accuracy of genomic predictions. 
The gains in accuracy obtained with genomic blending methods, mainly ssGBLUP in bivariate analyses, indicate that genomic predictions should be used as a tool to improve genetic gains in relation to the traditional PBLUP selection.
The effectiveness and cost-effectiveness of a rural employer-based wellness program.
Saleh, Shadi S; Alameddine, Mohamad S; Hill, Dan; Darney-Beuhler, Jessica; Morgan, Ann
2010-01-01
The cost-effectiveness of employer-based wellness programs has been previously investigated with favorable financial and nonfinancial outcomes being detected. However, these investigations have mainly focused on large employers in urban settings. Very few studies examined wellness programs offered in rural settings. This paper aims to explore the effectiveness and cost-effectiveness of a rural employer-based wellness program. Six rural employers were categorized into 3 groups: a control group and 2 intervention groups with varying degrees of wellness activities. Participants were asked to complete an annual health risk assessment (HRA) that addressed 16 wellness areas. At the conclusion of 4 years, HRA and effectiveness data were utilized to examine program effectiveness and combined with program costs to estimate cost-effectiveness. The "Coaching and Referral" group-the highest in intensity of participant engagement-exhibited superior improvement in several wellness areas and in percentage of employees with good health indicators compared to the control and the Trail Marker, lower-intensity intervention groups. However, the Trail Markers had more favorable cost-effectiveness ratios. Rural worksite wellness programs have shown great potential in their effectiveness and cost-effectiveness. Such programs need not be too aggressive, tedious, and costly to generate a favorable return for employers and funders. However, employers should be encouraged to experiment with different levels of wellness program intensities until a more favorable outcome can be realized.
Lech, Karolina; Liu, Fan; Ackermann, Katrin; Revell, Victoria L; Lao, Oscar; Skene, Debra J; Kayser, Manfred
2016-03-01
Determining the time a biological trace was left at a scene of crime reflects a crucial aspect of forensic investigations as - if possible - it would permit testing the sample donor's alibi directly from the trace evidence, helping to link (or not) the DNA-identified sample donor with the crime event. However, reliable and robust methodology is lacking thus far. In this study, we assessed the suitability of mRNA for the purpose of estimating blood deposition time, and its added value relative to melatonin and cortisol, two circadian hormones we previously introduced for this purpose. By analysing 21 candidate mRNA markers in blood samples from 12 individuals collected around the clock at 2h intervals for 36h under real-life, controlled conditions, we identified 11 mRNAs with statistically significant expression rhythms. We then used these 11 significantly rhythmic mRNA markers, with and without melatonin and cortisol also analysed in these samples, to establish statistical models for predicting day/night time categories. We found that although in general mRNA-based estimation of time categories was less accurate than hormone-based estimation, the use of three mRNA markers HSPA1B, MKNK2 and PER3 together with melatonin and cortisol generally enhanced the time prediction accuracy relative to the use of the two hormones alone. Our data best support a model that by using these five molecular biomarkers estimates three time categories, i.e. night/early morning, morning/noon, and afternoon/evening with prediction accuracies expressed as AUC values of 0.88, 0.88, and 0.95, respectively. For the first time, we demonstrate the value of mRNA for blood deposition timing and introduce a statistical model for estimating day/night time categories based on molecular biomarkers, which shall be further validated with additional samples in the future. 
Moreover, our work provides new leads for molecular approaches on time of death estimation using the significantly rhythmic mRNA markers established here. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Using plant wax markers to estimate the diet composition of grazing Holstein dairy cows.
Heublein, C; Südekum, K-H; Gill, F L; Dohme-Meier, F; Schori, F
2017-02-01
The objective of this study was to test whether diet selection of dairy cows under grazing conditions could be estimated using plant wax markers. Furthermore, differences between 2 cow strains and the effect of concentrate supplementation on plant species selection were investigated. The experiment was a study with a crossover design performed on an organic farm with 12 Swiss Holstein cows and 12 New Zealand Holstein cows. Both experimental periods consisted of a 21-d adaptation and a 7-d measurement period. All cows grazed full time in a rotational stocking system and received either no concentrate or 6 kg/d of a commercial cereal-grain mix. Representative herbage samples of each grazed paddock were taken and botanical composition of subsamples was manually determined. The average proportions of the plant species were 27.8% Lolium perenne, 6.1% Dactylis glomerata, 10.4% Trifolium repens, and 9.0% Taraxacum officinale. Other grass species were merged as "other grass" (38.2%) and other forb species as "other forbs" (8.5%). n-Alkanes, long-chain fatty acids, and long-chain alcohols (LCOH) were analyzed in the samples of plant species, concentrate, and feces from each cow. A linear discriminant analysis indicated that diet components were differentiated best with LCOH (96%) and worst with the combination of all marker groups together (12%). For each marker, the fecal marker recovery (FR) relative to dosed ytterbium was determined in 2 ways. Estimation of diet composition was performed with the software "EatWhat," and results were compared with botanical composition with the Aitchison distance. The results indicate that the diet composition of grazing dairy cows can be estimated using plant wax markers. Additionally, the calculation of FR led to mostly reliable results, yet this approach needs further validation. The most accurate estimation was achieved with the marker combination of n-alkanes and LCOH with a correction for FR. 
Less accurate estimations were achieved with long-chain fatty acids alone or in combination with n-alkanes. No difference relating to diet selection between the 2 cow strains was recorded, but supplemented cows apparently ingested higher proportions of T. repens than nonsupplemented cows. Awareness that supplementation influences selection behavior of grazing dairy cows may lead to adaptations in botanical composition of the pasture according to the demand of the animals. The Authors. Published by the Federation of Animal Science Societies and Elsevier Inc. on behalf of the American Dairy Science Association®. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/3.0/).
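The comparison of estimated and observed diet composition by Aitchison distance can be sketched as follows. This is a minimal illustration assuming the standard definition (Euclidean distance between centred log-ratio transforms of two compositions with strictly positive parts); it is not the "EatWhat" software, and the estimated composition below is hypothetical. Only the botanical proportions are taken from the abstract.

```python
import math


def clr(parts):
    """Centred log-ratio transform of a composition with strictly positive parts."""
    if any(p <= 0 for p in parts):
        raise ValueError("all parts must be strictly positive")
    logs = [math.log(p) for p in parts]
    mean_log = sum(logs) / len(logs)
    return [value - mean_log for value in logs]


def aitchison_distance(x, y):
    """Euclidean distance between the clr transforms of two compositions."""
    if len(x) != len(y):
        raise ValueError("compositions must have the same number of parts")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(clr(x), clr(y))))


# Botanical composition from the abstract (%): L. perenne, D. glomerata,
# T. repens, T. officinale, other grass, other forbs.
botanical = [27.8, 6.1, 10.4, 9.0, 38.2, 8.5]
# Hypothetical marker-based estimate, for illustration only.
estimated = [30.0, 5.0, 12.0, 8.0, 36.0, 9.0]

print(aitchison_distance(botanical, estimated))
```

Because the distance works on log-ratios, it is unchanged when a composition is rescaled (for example from percentages to fractions), which suits comparisons of diet proportions.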
Chan, Kwun Chuen Gary; Wang, Mei-Cheng
2017-01-01
Recurrent event processes with marker measurements are mostly studied with forward time models starting from an initial event. Interestingly, the processes could exhibit important terminal behavior during a time period before occurrence of the failure event. A natural and direct way to study recurrent events prior to a failure event is to align the processes using the failure event as the time origin and to examine the terminal behavior by a backward time model. This paper studies regression models for backward recurrent marker processes by counting time backward from the failure event. A three-level semiparametric regression model is proposed for jointly modeling the time to a failure event, the backward recurrent event process, and the marker observed at the time of each backward recurrent event. The first level is a proportional hazards model for the failure time, the second level is a proportional rate model for the recurrent events occurring before the failure event, and the third level is a proportional mean model for the marker given the occurrence of a recurrent event backward in time. By jointly modeling the three components, estimating equations can be constructed for marked counting processes to estimate the target parameters in the three-level regression models. Large sample properties of the proposed estimators are studied and established. The proposed models and methods are illustrated by a community-based AIDS clinical trial to examine the terminal behavior of frequencies and severities of opportunistic infections among HIV infected individuals in the last six months of life.
Immunohistochemical estimation of cell cycle phase in laryngeal neoplasia
Chatrath, P; Scott, I S; Morris, L S; Davies, R J; Bird, K; Vowler, S L; Coleman, N
2006-01-01
We previously developed an immunohistochemical method for estimating cell cycle state and phase in tissue samples, including biopsies that are too small for flow cytometry. We have used our technique to examine whether primary abnormalities of the cell cycle exist in laryngeal neoplasia. Antibodies against the markers of cell cycle entry, minichromosome maintenance protein-2 (Mcm-2) and Ki67, and putative markers of cell cycle phase, cyclin D1 (G1-phase), cyclin A (S-phase), cyclin B1 (G2-phase) and phosphohistone H3 (Mitosis) were applied to paraffin-embedded sections of normal larynx (n=8), laryngeal dysplasia (n=10) and laryngeal squamous cell carcinoma (n=10). Cells expressing each marker were determined as a percentage of total cells, termed the labelling index (LI), and as a percentage of Mcm-2-positive cells, termed the labelling fraction (LF). The frequency of coexpression of each putative phase marker was investigated by confocal microscopy. There was a correlation between Mcm-2 and Ki67 LIs (ρ=0.93) but Mcm-2 LIs were consistently higher. All cells expressing a phase marker coexpressed Mcm-2, whereas Ki67 was not expressed in a proportion of these cells. The putative phase markers showed little coexpression. Labelling index values increased on progression from normal larynx through laryngeal dysplasia to squamous cell carcinoma for Mcm-2 (P=0.001), Ki67 (P=0.0002), cyclin D1 (P=0.015), cyclin A (P=0.0001) and cyclin B1 (P=0.0004). There was no evidence of an increase in the LF for any phase marker. Immunohistochemistry can be used to estimate cell cycle state and phase in laryngeal biopsies. Our data argues against primary cell cycle phase abnormalities in laryngeal neoplasia. PMID:16832409
Gianola, Daniel; Fariello, Maria I; Naya, Hugo; Schön, Chris-Carolin
2016-10-13
Standard genome-wide association studies (GWAS) scan for relationships between each of p molecular markers and a continuously distributed target trait. Typically, a marker-based matrix of genomic similarities among individuals (G) is constructed, to account more properly for the covariance structure in the linear regression model used. We show that the generalized least-squares estimator of the regression of phenotype on one or on m markers is invariant with respect to whether or not the marker(s) tested is(are) used for building G, provided variance components are unaffected by exclusion of such marker(s) from G. The result is arrived at by using a matrix expression such that one can find many inverses of genomic relationship, or of phenotypic covariance matrices, stemming from removing markers tested as fixed, but carrying out a single inversion. When eigenvectors of the genomic relationship matrix are used as regressors with fixed regression coefficients, e.g., to account for population stratification, their removal from G does matter. Removal of eigenvectors from G can have a noticeable effect on estimates of genomic and residual variances, so caution is needed. Concepts were illustrated using genomic data on 599 wheat inbred lines, with grain yield as target trait, and on close to 200 Arabidopsis thaliana accessions. Copyright © 2016 Gianola et al.
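For orientation, the generalized least-squares estimator referred to above can be written in the usual mixed-model notation. The symbols below (y, X, β, g, e, V and the two variance components) are assumptions for illustration and are not taken from the abstract, which names only G.

\[
y = X\beta + g + e, \qquad
\operatorname{Var}(y) = V = \sigma_g^2 G + \sigma_e^2 I, \qquad
\hat{\beta} = \left(X^{\top} V^{-1} X\right)^{-1} X^{\top} V^{-1} y .
\]

Here X would hold the genotypes of the marker(s) tested as fixed effects, and the stated invariance concerns whether those same markers also contribute to G inside V.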
Huang, Jie; Li, Yu-Zhi; Du, Lian-Ming; Yang, Bo; Shen, Fu-Jun; Zhang, He-Min; Zhang, Zhi-He; Zhang, Xiu-Yue; Yue, Bi-Song
2015-02-07
The giant panda (Ailuropoda melanoleuca) is a critically endangered species endemic to China. Microsatellites are among the most popular molecular markers and have proven effective for estimating population size, paternity testing, and assessing genetic diversity in critically endangered species. The availability of the complete giant panda genome sequence provided the opportunity to carry out genome-wide scans for all types of microsatellite markers, which now opens the way for the analysis and development of microsatellites in the giant panda. By in silico mining of the whole genome sequence of the giant panda, we identified microsatellites and analyzed their frequency and distribution in different genomic regions. Based on our search criteria, a repertoire of 855,058 SSRs was detected, with mono-nucleotides being the most abundant. SSRs were found in all genomic regions and were more abundant in non-coding regions than coding regions. A total of 160 primer pairs were designed to screen for polymorphic microsatellites using the selected tetranucleotide microsatellite sequences. Fifty-one novel polymorphic tetranucleotide microsatellite loci were discovered by genotyping blood DNA from 22 captive giant pandas in this study. Finally, a total of 15 markers, which showed good polymorphism, stability, and repeatability in faecal samples, were used to establish the novel microsatellite marker system for the giant panda. Meanwhile, a genotyping database for Chengdu captive giant pandas (n = 57) was set up using this standardized system. Moreover, a universal individual identification method was established and genetic diversity was analysed in this study as applications of this marker system. The microsatellite abundance and diversity were characterized in the giant panda genome. A total of 154,677 tetranucleotide microsatellites were identified and 15 of them were identified as polymorphic and stable loci. 
The individual identification method and the genetic diversity analysis method in this study provided adequate material for the future study of giant panda.
Development of a set of SNP markers present in expressed genes of the apple.
Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S
2008-11-01
Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequence tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, amplification was performed on two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5 map, using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yan, Guanghua, E-mail: yan@ufl.edu; Li, Jonathan; Huang, Yin
Purpose: To propose a simple model to explain the origin of ghost markers in marker-based optical tracking systems (OTS) and to develop retrospective strategies to detect and eliminate ghost markers. Methods: In marker-based OTS, ghost markers are virtual markers created due to the cross-talk between the two camera sensors, which can lead to system execution failure or inaccuracy in patient tracking. As a result, the users have to limit the number of markers and avoid certain marker configurations to reduce the chances of ghost markers. In this work, the authors propose retrospective strategies to detect and eliminate ghost markers. The two camera sensors were treated as mathematical points in space. The authors identified the coplanar within limit (CWL) condition as the necessary condition for ghost marker occurrence. A simple ghost marker detection method was proposed based on the model. Ghost marker elimination was achieved through pattern matching: a ghost marker-free reference set was matched with the optical marker set observed by the OTS; unmatched optical markers were eliminated as either ghost markers or misplaced markers. The pattern matching problem was formulated as a constraint satisfaction problem (using pairwise distances as constraints) and solved with an iterative backtracking algorithm. Wildcard markers were introduced to address missing or misplaced markers. An experiment was designed to measure the sensor positions and the limit for the CWL condition. The ghost marker detection and elimination algorithms were verified with samples collected from a five-marker jig and a nine-marker anthropomorphic phantom, rotated with the treatment couch from −60° to +60°. The accuracy of the pattern matching algorithm was further validated with marker patterns from 40 patients who underwent stereotactic body radiotherapy (SBRT).
For this purpose, a synthetic optical marker pattern was created for each patient by introducing ghost markers, marker position uncertainties, and marker displacement. Results: The sensor positions and the limit for the CWL condition were measured with excellent reproducibility (standard deviation ≤ 0.39 mm). The ghost marker detection algorithm had perfect detection accuracy for both the jig (1544 samples) and the anthropomorphic phantom (2045 samples). Pattern matching was successful for all samples from both phantoms as well as the 40 patient marker patterns. Conclusions: The authors proposed a simple model to explain the origin of ghost markers and identified the CWL condition as the necessary condition for ghost marker occurrence. The retrospective ghost marker detection and elimination algorithms guarantee complete ghost marker elimination while providing the users with maximum flexibility in selecting the number of markers and their configuration to meet their clinical needs.
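The pattern-matching step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes markers are plain 3D coordinate tuples, uses a recursive rather than iterative backtracking search, omits wildcard markers, and the function name `match_pattern` and the tolerance `tol` are hypothetical.

```python
import math

def match_pattern(reference, observed, tol=1.0):
    """Assign each reference marker to a distinct observed marker so that all
    pairwise distances agree within `tol`. Returns a list of observed indices
    (one per reference marker) or None if no consistent assignment exists."""
    assignment = []  # assignment[i] = index of the observed marker for reference i

    def consistent(ref_idx, obs_idx):
        # Pairwise-distance constraint against every marker assigned so far.
        for prev_ref, prev_obs in enumerate(assignment):
            d_ref = math.dist(reference[ref_idx], reference[prev_ref])
            d_obs = math.dist(observed[obs_idx], observed[prev_obs])
            if abs(d_ref - d_obs) > tol:
                return False
        return True

    def backtrack(ref_idx):
        if ref_idx == len(reference):
            return True
        for obs_idx in range(len(observed)):
            if obs_idx in assignment:
                continue
            if consistent(ref_idx, obs_idx):
                assignment.append(obs_idx)
                if backtrack(ref_idx + 1):
                    return True
                assignment.pop()  # undo and try the next candidate
        return False

    return list(assignment) if backtrack(0) else None

# Toy data (illustrative only): the reference pattern shifted by (5, 5, 5),
# reordered, with one extra "ghost" point at index 1.
reference = [(0, 0, 0), (10, 0, 0), (0, 20, 0)]
observed = [(15, 5, 5), (100, 100, 100), (5, 5, 5), (5, 25, 5)]
match = match_pattern(reference, observed)
ghosts = set(range(len(observed))) - set(match)
```

Observed markers left unmatched (here `ghosts`) are the candidates for elimination. Because only pairwise distances are compared, the match is unaffected by rigid motion such as couch rotation.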
Algorithms for selecting informative marker panels for population assignment.
Rosenberg, Noah A
2005-11-01
Given a set of potential source populations, genotypes of an individual of unknown origin at a collection of markers can be used to predict the correct source population of the individual. For improved efficiency, informative markers can be chosen from a larger set of markers to maximize the accuracy of this prediction. However, selecting the loci that are individually most informative does not necessarily produce the optimal panel. Here, using genotypes from eight species--carp, cat, chicken, dog, fly, grayling, human, and maize--this univariate accumulation procedure is compared to new multivariate "greedy" and "maximin" algorithms for choosing marker panels. The procedures generally suggest similar panels, although the greedy method often recommends inclusion of loci that are not chosen by the other algorithms. In seven of the eight species, when applied to five or more markers, all methods achieve at least 94% assignment accuracy on simulated individuals, with one species--dog--producing this level of accuracy with only three markers, and the eighth species--human--requiring approximately 13-16 markers. The new algorithms produce substantial improvements over use of randomly selected markers; where differences among the methods are noticeable, the greedy algorithm leads to slightly higher probabilities of correct assignment. Although none of the approaches necessarily chooses the panel with optimal performance, the algorithms all likely select panels with performance near enough to the maximum that they all are suitable for practical use.
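The difference between picking individually informative loci and building a panel greedily can be sketched as follows. This is an illustration under stated assumptions, not the paper's algorithm: the scoring function is a hypothetical stand-in (number of population pairs a panel can separate), whereas the paper scores panels by assignment accuracy.

```python
def greedy_panel(markers, score, size):
    """Forward selection: at each step add the marker that most improves
    the score of the whole panel, not the marker that is best on its own."""
    panel = []
    remaining = list(markers)
    while remaining and len(panel) < size:
        best = max(remaining, key=lambda m: score(panel + [m]))
        panel.append(best)
        remaining.remove(best)
    return panel

# Hypothetical toy data: which population pairs each marker can distinguish.
separates = {"A": {1, 2, 3}, "B": {1, 2}, "C": {4}}

def coverage(panel):
    """Toy panel score: number of distinct population pairs separated."""
    covered = set()
    for marker in panel:
        covered |= separates[marker]
    return len(covered)

# Univariate ranking takes the two individually best markers.
univariate = sorted(separates, key=lambda m: len(separates[m]), reverse=True)[:2]
greedy = greedy_panel(list(separates), coverage, 2)
```

In this toy case the univariate panel is A and B, which are redundant, while the greedy panel is A and C, which together separate more pairs.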
Jin, Peng; van Wieringen, Niek; Hulshof, Maarten C C M; Bel, Arjan; Alderliesten, Tanja
2018-04-01
Use of four-dimensional cone-beam CT (4D-CBCT) and fiducial markers for image guidance during radiation therapy (RT) of mobile tumors is challenging due to the trade-off among image quality, imaging dose, and scanning time. This study aimed to investigate different 4D-CBCT acquisition settings for good visibility of fiducial markers in 4D-CBCT. Using these 4D-CBCTs, the feasibility of marker-based 4D registration for RT setup verification and manual respiration-induced motion quantification was investigated. For this, we applied a dynamic phantom with three different breathing motion amplitudes and included two patients with implanted markers. Irrespective of the motion amplitude, for a medium field of view (FOV), marker visibility was improved by reducing the imaging dose per projection and increasing the number of projection images; however, the scanning time was 4 to 8 min. For a small FOV, the total imaging dose and the scanning time were reduced (62.5% of the dose using a medium FOV, 2.5 min) without losing marker visibility. However, the body contour could be missing for a small FOV, which is not preferred in RT. The marker-based 4D setup verification was feasible for both the phantom and patient data. Moreover, manual marker motion quantification can achieve a high accuracy with a mean error of [Formula: see text].
Linking the potato genome to the conserved ortholog set (COS) markers
2013-01-01
Background Conserved ortholog set (COS) markers are an important functional genomics resource that has greatly improved orthology detection in Asterid species. A comprehensive list of these markers is available at Sol Genomics Network (http://solgenomics.net/) and many of these have been placed on the genetic maps of a number of solanaceous species. Results We amplified over 300 COS markers from eight potato accessions involving two diploid landraces of Solanum tuberosum Andigenum group (formerly classified as S. goniocalyx, S. phureja), and a dihaploid clone derived from a modern tetraploid cultivar of S. tuberosum and the wild species S. berthaultii, S. chomatophilum, and S. paucissectum. By BLASTn (Basic Local Alignment Search Tool of the NCBI, National Center for Biotechnology Information) algorithm we mapped the DNA sequences of these markers into the potato genome sequence. Additionally, we mapped a subset of these markers genetically in potato and present a comparison between the physical and genetic locations of these markers in potato and in comparison with the genetic location in tomato. We found that most of the COS markers are single-copy in the reference genome of potato and that the genetic location in tomato and physical location in potato sequence are mostly in agreement. However, we did find some COS markers that are present in multiple copies and those that map in unexpected locations. Sequence comparisons between species show that some of these markers may be paralogs. Conclusions The sequence-based physical map becomes helpful in identification of markers for traits of interest thereby reducing the number of markers to be tested for applications like marker assisted selection, diversity, and phylogenetic studies. PMID:23758607
Discourse Markers in Italian as L2 in Face to Face vs. Computer Mediated Settings
ERIC Educational Resources Information Center
De Marco, Anna; Leone, Paola
2013-01-01
This pilot study aims to highlight a) differences in pragmatic function and distribution of discourse markers (DMs) in computer mediated and face to face (FtF) settings and b) any correlation of DM uses and language competence. The data have been collected by video-recording and analysing three speakers of Italian L2 (language level competence:…
Detection and Prevention of Arrhythmias During Space Flight
NASA Technical Reports Server (NTRS)
Pillai, Dilip; Rosenbaum, David; Liszka, Kathy; York, David; Mackin, Michael; Lichter, Michael
2004-01-01
Objectives of this research include: determine if orthogonal lead sets can correct artifactual ECG changes caused by microgravity-induced alterations in cardiac position; determine if markers of susceptibility to SCD (TWA and QT restitution) can be reliably measured during space flight; determine the effects of continuous microgravity on markers of susceptibility to SCD.
Cohen, Alan A; Milot, Emmanuel; Li, Qing; Legault, Véronique; Fried, Linda P; Ferrucci, Luigi
2014-09-01
Measuring physiological dysregulation during aging could be a key tool both to understand underlying aging mechanisms and to predict clinical outcomes in patients. However, most existing indices are either circular or hard to interpret biologically. Recently, we showed that statistical distance of 14 common blood biomarkers (a measure of how strange an individual's biomarker profile is) was associated with age and mortality in the WHAS II data set, validating its use as a measure of physiological dysregulation. Here, we extend the analyses to other data sets (WHAS I and InCHIANTI) to assess the stability of the measure across populations. We found that the statistical criteria used to determine the original 14 biomarkers produced diverging results across populations; in other words, had we started with a different data set, we would have chosen a different set of markers. Nonetheless, the same 14 markers (or the subset of 12 available for InCHIANTI) produced highly similar predictions of age and mortality. We include analyses of all combinatorial subsets of the markers and show that results do not depend much on biomarker choice or data set, but that more markers produce a stronger signal. We conclude that statistical distance as a measure of physiological dysregulation is stable across populations in Europe and North America. Copyright © 2014 Elsevier Inc. All rights reserved.
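A statistical distance of this kind can be sketched in a few lines, assuming it is computed as a Mahalanobis distance from the population mean (the abstract does not name the formula). The function names are hypothetical and the code is pure Python, so that no numerical library is assumed.

```python
import math

def mean_and_cov(rows):
    """Sample mean vector and covariance matrix of a list of profiles."""
    n, p = len(rows), len(rows[0])
    mean = [sum(r[j] for r in rows) / n for j in range(p)]
    cov = [[sum((r[i] - mean[i]) * (r[j] - mean[j]) for r in rows) / (n - 1)
            for j in range(p)] for i in range(p)]
    return mean, cov

def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x

def statistical_distance(profile, mean, cov):
    """Mahalanobis distance sqrt(d^T cov^{-1} d) with d = profile - mean."""
    d = [v - mu for v, mu in zip(profile, mean)]
    return math.sqrt(sum(di * wi for di, wi in zip(d, solve(cov, d))))

# Toy two-biomarker example (values are illustrative, not study data).
mean, cov = mean_and_cov([(1, 2), (3, 6), (5, 4)])
at_mean = statistical_distance(mean, mean, cov)
unit_case = statistical_distance((3, 4), [0, 0], [[1, 0], [0, 1]])
```

A profile at the population mean has distance 0, and larger values indicate a more unusual combination of biomarker levels given their correlations.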
Toward an affordable and user-friendly visual motion capture system.
Bonnet, V; Sylla, N; Cherubini, A; Gonzáles, A; Azevedo Coste, C; Fraisse, P; Venture, G
2014-01-01
The present study aims at designing and evaluating a low-cost, simple and portable system for arm joint angle estimation during grasping-like motions. The system is based on a single RGB-D camera and three customized markers. The automatically detected and tracked marker positions were used as inputs to an offline inverse kinematic process based on bio-mechanical constraints to reduce noise effect and handle marker occlusion. The method was validated on 4 subjects with different motions. The joint angles were estimated both with the proposed low-cost system and a stereophotogrammetric system. Comparative analysis shows good accuracy with high correlation coefficient (r = 0.92) and low average RMS error (3.8 deg).
Joint genomic evaluation of French dairy cattle breeds using multiple-trait models.
Karoui, Sofiene; Carabaño, María Jesús; Díaz, Clara; Legarra, Andrés
2012-12-07
Using a multi-breed reference population might be a way of increasing the accuracy of genomic breeding values in small breeds. Models involving mixed-breed data do not take into account the fact that marker effects may differ among breeds. This study was aimed at investigating the impact on accuracy of increasing the number of genotyped candidates in the training set by using a multi-breed reference population, in contrast to single-breed genomic evaluations. Three traits (milk production, fat content and female fertility) were analyzed by genomic mixed linear models and Bayesian methodology. Three breeds of French dairy cattle were used: Holstein, Montbéliarde and Normande with 2976, 950 and 970 bulls in the training population, respectively and 964, 222 and 248 bulls in the validation population, respectively. All animals were genotyped with the Illumina Bovine SNP50 array. Accuracy of genomic breeding values was evaluated under three scenarios for the correlation of genomic breeding values between breeds (r(g)): uncorrelated (1), r(g) = 0; estimated r(g) (2); high, r(g) = 0.95 (3). Accuracy and bias of predictions obtained in the validation population with the multi-breed training set were assessed by the coefficient of determination (R(2)) and by the regression coefficient of daughter yield deviations of validation bulls on their predicted genomic breeding values, respectively. The genetic variation captured by the markers for each trait was similar to that estimated for routine pedigree-based genetic evaluation. Posterior means for rg ranged from -0.01 for fertility between Montbéliarde and Normande to 0.79 for milk yield between Montbéliarde and Holstein. Differences in R(2) between the three scenarios were notable only for fat content in the Montbéliarde breed: from 0.27 in scenario (1) to 0.33 in scenarios (2) and (3). Accuracies for fertility were lower than for other traits. 
Using a multi-breed reference population resulted in small or no increases in accuracy. Only the breed with a small data set and large genetic correlation with the breed with a large data set showed increased accuracy for the traits with moderate (milk) to high (fat content) heritability. No benefit was observed for fertility, a lowly heritable trait.
Joint genomic evaluation of French dairy cattle breeds using multiple-trait models
2012-01-01
Background Using a multi-breed reference population might be a way of increasing the accuracy of genomic breeding values in small breeds. Models involving mixed-breed data do not take into account the fact that marker effects may differ among breeds. This study was aimed at investigating the impact on accuracy of increasing the number of genotyped candidates in the training set by using a multi-breed reference population, in contrast to single-breed genomic evaluations. Methods Three traits (milk production, fat content and female fertility) were analyzed by genomic mixed linear models and Bayesian methodology. Three breeds of French dairy cattle were used: Holstein, Montbéliarde and Normande with 2976, 950 and 970 bulls in the training population, respectively and 964, 222 and 248 bulls in the validation population, respectively. All animals were genotyped with the Illumina Bovine SNP50 array. Accuracy of genomic breeding values was evaluated under three scenarios for the correlation of genomic breeding values between breeds (rg): uncorrelated (1), rg = 0; estimated rg (2); high, rg = 0.95 (3). Accuracy and bias of predictions obtained in the validation population with the multi-breed training set were assessed by the coefficient of determination (R2) and by the regression coefficient of daughter yield deviations of validation bulls on their predicted genomic breeding values, respectively. Results The genetic variation captured by the markers for each trait was similar to that estimated for routine pedigree-based genetic evaluation. Posterior means for rg ranged from −0.01 for fertility between Montbéliarde and Normande to 0.79 for milk yield between Montbéliarde and Holstein. Differences in R2 between the three scenarios were notable only for fat content in the Montbéliarde breed: from 0.27 in scenario (1) to 0.33 in scenarios (2) and (3). Accuracies for fertility were lower than for other traits. 
Conclusions Using a multi-breed reference population resulted in small or no increases in accuracy. Only the breed with a small data set and large genetic correlation with the breed with a large data set showed increased accuracy for the traits with moderate (milk) to high (fat content) heritability. No benefit was observed for fertility, a lowly heritable trait. PMID:23216664
St-Louis, Etienne; Deckelbaum, Dan Leon; Baird, Robert; Razek, Tarek
2017-06-01
Although a plethora of pediatric injury severity scoring systems is available, many of them present important challenges and limitations in the low resource setting. Our aim is to generate consensus among a group of experts regarding the optimal parameters, outcomes, and methods of estimating injury severity for pediatric trauma patients in low resource settings. A systematic review of the literature was conducted to identify and compare existing injury scores used in pediatric patients. Qualitative data was extracted from the systematic review, including scoring parameters, settings and outcomes. In order to establish consensus regarding which of these elements are most adapted to pediatric patients in low-resource settings, they were subjected to a modified Delphi survey for external validation. The Delphi process is a structured communication technique that relies on a panel of experts to develop a systematic, interactive consensus method. We invited a group of 38 experts, including adult and pediatric surgeons, emergency physicians and anesthesiologists trauma team leaders from a level 1 trauma center in Montreal, Canada, and a pediatric referral trauma hospital in Santiago, Chile to participate in two successive rounds of our survey. Consensus was reached regarding various features of an ideal pediatric trauma score. 
Specifically, our experts agreed that a pediatric trauma scoring tool should differ from its adult counterpart, that it can be derived from point of care data available at first assessment, that blood pressure is an important variable to include in a predictive model for pediatric trauma outcomes, that blood pressure is a late but specific marker of shock in pediatric patients, that pulse rate is a more sensitive marker of hemodynamic instability than blood pressure, that an assessment of airway status should be included as a predictive variable for pediatric trauma outcomes, and that the AVPU classification of neurologic status is simple and reliable in the acute setting, more so than GCS at all ages. Therefore, we conclude that an opportunity exists to develop a new pediatric trauma score, combining the above consensus-generating ideas, that would be best adapted for use in low-resource settings. Copyright © 2017 Elsevier Ltd. All rights reserved.
Graffelman, Jan; Sánchez, Milagros; Cook, Samantha; Moreno, Victor
2013-01-01
In genetic association studies, tests for Hardy-Weinberg proportions are often employed as a quality control checking procedure. Missing genotypes are typically discarded prior to testing. In this paper we show that inference for Hardy-Weinberg proportions can be biased when missing values are discarded. We propose to use multiple imputation of missing values in order to improve inference for Hardy-Weinberg proportions. For imputation we employ a multinomial logit model that uses information from allele intensities and/or neighbouring markers. Analysis of an empirical data set of single nucleotide polymorphisms possibly related to colon cancer reveals that missing genotypes are not missing completely at random. Deviation from Hardy-Weinberg proportions is mostly due to a lack of heterozygotes. Inbreeding coefficients estimated by multiple imputation of the missings are typically lowered with respect to inbreeding coefficients estimated by discarding the missings. Accounting for missings by multiple imputation qualitatively changed the results of 10 to 17% of the statistical tests performed. Estimates of inbreeding coefficients obtained by multiple imputation showed high correlation with estimates obtained by single imputation using an external reference panel. Our conclusion is that imputation of missing data leads to improved statistical inference for Hardy-Weinberg proportions.
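The link between a lack of heterozygotes and the inbreeding coefficient can be made concrete with a small sketch. It assumes a biallelic marker with both alleles present, uses only called genotypes (the "discard missing values" approach the abstract argues against), and does not implement the multiple-imputation procedure. The function name is hypothetical.

```python
def hwe_summary(n_aa, n_ab, n_bb):
    """Return (inbreeding coefficient f, chi-square statistic) for genotype
    counts AA, AB, BB of a biallelic marker. Assumes both alleles occur."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    q = 1.0 - p
    # Genotype counts expected under Hardy-Weinberg proportions.
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # f > 0 means fewer heterozygotes than expected; f < 0 means an excess.
    f = 1.0 - n_ab / expected[1]
    return f, chi2

# Illustrative counts only: 40 heterozygotes observed where 50 are expected.
f, chi2 = hwe_summary(30, 40, 30)
```

For these counts f is 0.2 and the chi-square statistic is 4.0, consistent with the identity chi-square = n f² for a biallelic marker without continuity correction.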
New DArT markers for oat provide enhanced map coverage and global germplasm characterization
USDA-ARS's Scientific Manuscript database
Genomic discovery in oat and its application to oat improvement have been hindered by a lack of common markers on different genetic maps, and by the difficulty of conducting whole-genome analysis using high throughput markers. In this study we developed, characterized, and applied a large set oat g...
Amplified fragment length polymorphism (AFLP) markers can be developed more quickly and at a lower cost than microsatellite and single nucleotide polymorphism markers, which makes them ideal markers for large-scale studies of understudied taxa — such as species at risk. However,...
Brown, Ian J.; Dyer, Alan R.; Chan, Queenie; Cogswell, Mary E.; Ueshima, Hirotsugu; Stamler, Jeremiah; Elliott, Paul
2013-01-01
High intakes of dietary sodium are associated with elevated blood pressure levels and an increased risk of cardiovascular disease. National and international guidelines recommend reduced sodium intake in the general population, which necessitates population-wide surveillance. We assessed the utility of casual (spot) urine specimens in estimating 24-hour urinary sodium excretion as a marker of sodium intake in the International Cooperative Study on Salt, Other Factors, and Blood Pressure. There were 5,693 participants recruited in 1984–1987 at the ages of 20–59 years from 29 North American and European samples. Participants were randomly assigned to test or validation data sets. Equations derived from casual urinary sodium concentration and other variables in the test data were applied to the validation data set. Correlations between observed and estimated 24-hour sodium excretion were 0.50 for individual men and 0.51 for individual women; the values were 0.79 and 0.71, respectively, for population samples. Bias in mean values (observed minus estimated) was small; for men and women, the values were −1.6 mmol per 24 hours and 2.3 mmol per 24 hours, respectively, at the individual level and −1.8 mmol per 24 hours and 2.2 mmol per 24 hours, respectively, at the population level. Proportions of individuals with urinary 24-hour sodium excretion above the recommended levels were slightly overestimated by the models. Casual urine specimens may be a useful, low-burden, low-cost alternative to 24-hour urine collections for estimation of population sodium intakes; ongoing calibration with study-specific 24-hour urinary collections is recommended to increase validity. PMID:23673246
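The two validation summaries reported here, the correlation between observed and estimated excretion and the bias defined as observed minus estimated, can be sketched as below. The numbers are illustrative placeholders, not study data, and the function names are hypothetical.

```python
import math

def mean_bias(observed, estimated):
    """Mean of (observed - estimated), matching the sign convention above."""
    return sum(o - e for o, e in zip(observed, estimated)) / len(observed)

def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Illustrative 24-hour sodium excretion values in mmol (not from the study).
observed = [120, 150, 180]
estimated = [125, 150, 190]
bias = mean_bias(observed, estimated)
r = pearson_r(observed, estimated)
```

A negative bias means the estimating equation overestimates on average, which is the direction reported for men in the abstract.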
Hottes, Travis S.; Skowronski, Danuta M.; Hiebert, Brett; Janjua, Naveed Z.; Roos, Leslie L.; Van Caeseele, Paul; Law, Barbara J.; De Serres, Gaston
2011-01-01
Background Administrative databases provide efficient methods to estimate influenza vaccine effectiveness (IVE) against severe outcomes in the elderly but are prone to intractable bias. This study returns to one of the linked population databases by which IVE against hospitalization and death in the elderly was first assessed. We explore IVE across six more recent influenza seasons, including periods before, during, and after peak activity to identify potential markers for bias. Methods and Findings Acute respiratory hospitalization and all-cause mortality were compared between immunized/non-immunized community-dwelling seniors ≥65 years through administrative databases in Manitoba, Canada between 2000-01 and 2005-06. IVE was compared during pre-season/influenza/post-season periods through logistic regression with multivariable adjustment (age/sex/income/residence/prior influenza or pneumococcal immunization/medical visits/comorbidity), stratification based on prior influenza immunization history, and propensity scores. Analysis during pre-season periods assessed baseline differences between immunized and unimmunized groups. The study population included ∼140,000 seniors, of whom 50–60% were immunized annually. Adjustment for key covariates and use of propensity scores consistently increased IVE. Estimates were paradoxically higher pre-season and for all-cause mortality vs. acute respiratory hospitalization. Stratified analysis showed that those twice consecutively and currently immunized were always at significantly lower hospitalization/mortality risk with odds ratios (OR) of 0.60 [95% CI 0.48–0.75] and 0.58 [0.53–0.64] pre-season and 0.77 [0.69–0.86] and 0.71 [0.66–0.77] during influenza circulation, relative to the consistently unimmunized.
Conversely, those forgoing immunization when twice previously immunized were always at significantly higher hospitalization/mortality risk with OR of 1.41 [1.14–1.73] and 2.45 [2.21–2.72] pre-season and 1.21 [1.03–1.43] and 1.78 [1.61–1.96] during influenza circulation. Conclusions The most pronounced IVE estimates were paradoxically observed pre-season, indicating bias tending to over-estimate vaccine protection. Change in immunization habit from that of the prior two years may be a marker for this bias in administrative data sets; however, no analytic technique explored could adjust for its influence. Improved methods to achieve valid interpretation of protection in the elderly are needed. PMID:21818350
Research notes : raised and recessed pavement markers.
DOT National Transportation Integrated Search
1995-06-01
In March, we surveyed all region traffic engineers, ODOT districts and several other state departments of transportation. The survey data collected included the estimated length of service for different types of markers, the most common mode of failur...
Surface smoothness: cartilage biomarkers for knee OA beyond the radiologist
NASA Astrophysics Data System (ADS)
Tummala, Sudhakar; Dam, Erik B.
2010-03-01
Fully automatic imaging biomarkers may allow quantification of patho-physiological processes that a radiologist would not be able to assess reliably. This can introduce new insight but is problematic to validate due to lack of meaningful ground truth expert measurements. Rather than quantification accuracy, such novel markers must therefore be validated against clinically meaningful end-goals such as the ability to allow correct diagnosis. We present a method for automatic cartilage surface smoothness quantification in the knee joint. The quantification is based on a curvature flow method used on tibial and femoral cartilage compartments resulting from an automatic segmentation scheme. These smoothness estimates are validated for their ability to diagnose osteoarthritis and compared to smoothness estimates based on manual expert segmentations and to conventional cartilage volume quantification. We demonstrate that the fully automatic markers eliminate the time required for radiologist annotations, and in addition provide a diagnostic marker superior to the evaluated semi-manual markers.
New microsatellite loci for Prosopis alba and P. chilensis (Fabaceae)
Bessega, Cecilia F.; Pometti, Carolina L.; Miller, Joe T.; Watts, Richard; Saidman, Beatriz O.; Vilardi, Juan C.
2013-01-01
• Premise of the study: As only six useful microsatellite loci that exhibit broad cross-amplification are so far available for Prosopis species, it is necessary to develop a larger number of codominant markers for population genetic studies. Simple sequence repeat (SSR) markers obtained for Prosopis species from a 454 pyrosequencing run were optimized and characterized for studies in P. alba and P. chilensis. • Methods and Results: Twelve markers that were successfully amplified showed polymorphism in P. alba and P. chilensis. The number of alleles per locus ranged between two and seven and heterozygosity estimates ranged from 0.2 to 0.8. Most of these loci cross-amplify in P. ruscifolia, P. flexuosa, P. kuntzei, P. glandulosa, and P. pallida. • Conclusions: These loci will enable genetic diversity studies of P. alba and P. chilensis and contribute to fine-scale population structure, indirect estimation of relatedness among individuals, and marker-assisted selection. PMID:25202541
Code of Federal Regulations, 2010 CFR
2010-01-01
... FACILITIES NON-FEDERAL NAVIGATION FACILITIES VHF Marker Beacons § 171.201 Scope. (a) This subpart sets forth minimum requirements for the approval and operation of non-Federal VHF marker beacon facilities that are...
Bouvet, J-M; Makouanzi, G; Cros, D; Vigneron, Ph
2016-01-01
Hybrids are broadly used in plant breeding and accurate estimation of variance components is crucial for optimizing genetic gain. Genome-wide information may be used to explore models designed to assess the extent of additive and non-additive variance and test their prediction accuracy for genomic selection. Ten linear mixed models, involving pedigree- and marker-based relationship matrices among parents, were developed to estimate additive (A), dominance (D) and epistatic (AA, AD and DD) effects. Five complementary models, involving the gametic phase to estimate marker-based relationships among hybrid progenies, were developed to assess the same effects. The models were compared using tree height and 3303 single-nucleotide polymorphism markers from 1130 cloned individuals obtained via controlled crosses of 13 Eucalyptus urophylla females with 9 Eucalyptus grandis males. Akaike information criterion (AIC), variance ratios, asymptotic correlation matrices of estimates, goodness-of-fit, prediction accuracy and mean square error (MSE) were used for the comparisons. The variance components and variance ratios differed according to the model. Models with a parent marker-based relationship matrix performed better than those that were pedigree-based, showing an absence of singularities, lower AIC, higher goodness-of-fit and accuracy, and smaller MSE. However, AD and DD variances were estimated with high standard errors. Using the same criteria, progeny gametic phase-based models performed better in fitting the observations and predicting genetic values. However, DD variance could not be separated from the dominance variance and null estimates were obtained for AA and AD effects. This study highlighted the advantages of progeny models using genome-wide information. PMID:26328760
Duchêne, Sebastián; Archer, Frederick I.; Vilstrup, Julia; Caballero, Susana; Morin, Phillip A.
2011-01-01
The availability of mitochondrial genome sequences is growing as a result of recent technological advances in molecular biology. In phylogenetic analyses, the complete mitogenome is increasingly becoming the marker of choice, usually providing better phylogenetic resolution and precision relative to traditional markers such as cytochrome b (CYTB) and the control region (CR). In some cases, the differences in phylogenetic estimates between mitogenomic and single-gene markers have yielded incongruent conclusions. By comparing phylogenetic estimates made from different genes, we identified the most informative mitochondrial regions and evaluated the minimum amount of data necessary to reproduce the same results as the mitogenome. We compared results among individual genes and the mitogenome for recently published complete mitogenome datasets of selected delphinids (Delphinidae) and killer whales (genus Orcinus). Using Bayesian phylogenetic methods, we investigated differences in estimation of topologies, divergence dates, and clock-like behavior among genes for both datasets. Although the most informative regions were not the same for each taxonomic group (COX1, CYTB, ND3 and ATP6 for Orcinus, and ND1, COX1 and ND4 for Delphinidae), in both cases they were equivalent to less than a quarter of the complete mitogenome. This suggests that gene information content can vary among groups, but can be adequately represented by a portion of the complete sequence. Although our results indicate that complete mitogenomes provide the highest phylogenetic resolution and most precise date estimates, a minimum amount of data can be selected using our approach when the complete sequence is unavailable. Studies based on single genes can benefit from the addition of a few more mitochondrial markers, producing topologies and date estimates similar to those obtained using the entire mitogenome. PMID:22073275
Field heritability of a plant adaptation to fire in heterogeneous landscapes.
Castellanos, M C; González-Martínez, S C; Pausas, J G
2015-11-01
The strong association observed between fire regimes and variation in plant adaptations to fire suggests a rapid response to fire as an agent of selection. It also suggests that fire-related traits are heritable, a precondition for evolutionary change. One example is serotiny, the accumulation of seeds in unopened fruits or cones until the next fire, an important strategy for plant population persistence in fire-prone ecosystems. Here, we evaluate the potential of this trait to respond to natural selection in its natural setting. For this, we use a SNP marker approach to estimate genetic variance and heritability of serotiny directly in the field for two Mediterranean pine species. Study populations were large and heterogeneous in climatic conditions and fire regime. We first estimated the realized relatedness among trees from genotypes, and then partitioned the phenotypic variance in serotiny using Bayesian animal models that incorporated environmental predictors. As expected, field heritability was smaller (around 0.10 for both species) than previous estimates under common garden conditions (0.20). An estimate on a subset of stands with more homogeneous environmental conditions was not different from that in the complete set of stands, suggesting that our models correctly captured the environmental variation at the spatial scale of the study. Our results highlight the importance of measuring quantitative genetic parameters in natural populations, where environmental heterogeneity is a critical aspect. The heritability of serotiny, although not high, combined with high phenotypic variance within populations, confirms the potential of this fire-related trait for evolutionary change in the wild. © 2015 John Wiley & Sons Ltd.
Copeptin Levels Remain Unchanged during the Menstrual Cycle
Blum, Claudine A.; Mirza, Uzma; Christ-Crain, Mirjam; Mueller, Beat; Schindler, Christian; Puder, Jardena J.
2014-01-01
Background: Copeptin, a surrogate marker for arginine vasopressin production, is evaluated as an osmo-dependent stress and inflammatory biomarker in different diseases. We investigated copeptin during the menstrual cycle and its relationship to sex hormones, markers of subclinical inflammation and estimates of body fluid. Methods: In 15 healthy women with regular menstrual cycles, blood was drawn on fifteen defined days of their menstrual cycle and was assayed for copeptin, progesterone, estradiol, luteinizing hormone, high-sensitivity C-reactive protein, tumor necrosis factor-alpha and procalcitonin. Symptoms of fluid retention were assessed on each visit, and bio impedance analysis was performed thrice to estimate body fluid changes. Mixed linear model analysis was performed to assess the changes of copeptin across the menstrual cycle and the relationship of sex hormones, markers of subclinical inflammation and estimates of body fluid with copeptin. Results: Copeptin levels did not significantly change during the menstrual cycle (p = 0.16). Throughout the menstrual cycle, changes in estradiol (p = 0.002) and in the physical premenstrual symptom score (p = 0.01) were positively related to copeptin, but changes in other sex hormones, in markers of subclinical inflammation or in bio impedance analysis-estimated body fluid were not (all p = ns). Conclusion: Although changes in estradiol and the physical premenstrual symptom score appear to be related to copeptin changes, copeptin does not significantly change during the menstrual cycle. PMID:24866705
Vocal exercise may attenuate acute vocal fold inflammation
Abbott, Katherine Verdolini; Li, Nicole Y.K.; Branski, Ryan C.; Rosen, Clark A.; Grillo, Elizabeth; Steinhauer, Kimberly; Hebda, Patricia A.
2012-01-01
Objectives/Hypotheses: The objective was to assess the utility of selected “resonant voice” exercises for the reduction of acute vocal fold inflammation. The hypothesis was that relatively large-amplitude, low-impact exercises associated with resonant voice would reduce inflammation more than spontaneous speech and possibly more than voice rest. Study Design: The study design was prospective, randomized, double-blind. Methods: Nine vocally healthy adults underwent a 1-hr vocal loading procedure, followed by randomization to (a) a spontaneous speech condition, (b) a vocal rest condition, or (c) a resonant voice exercise condition. Treatments were monitored in clinic for 4 hr, and continued extra-clinically until the next morning. At baseline, immediately following loading, after the 4-hr in-clinic treatment, and 24 hr post baseline, secretions were suctioned from the vocal folds bilaterally and submitted to enzyme-linked immunosorbent assay (ELISA) to estimate concentrations of key markers of tissue injury and inflammation: IL-1β, IL-6, IL-8, TNF-α, MMP-8, and IL-10. Results: Complete data sets were obtained for 3 markers (IL-1β, IL-6, and MMP-8) for one subject in each treatment condition. For those markers, results were poorest at 24-hr follow-up in the spontaneous speech condition, sharply improved in the voice rest condition, and best in the resonant voice condition. Average results for all markers, for all responsive subjects with normal baseline mediator concentrations, revealed an almost identical pattern. Conclusions: Some forms of tissue mobilization may be useful to attenuate acute vocal fold inflammation. PMID:23177745
Komínková, Eva; Dreiseitl, Antonín; Malečková, Eva; Doležel, Jaroslav
2016-01-01
Population surveys of Blumeria graminis f. sp. hordei (Bgh), a causal agent of more than 50% of barley fungal infections in the Czech Republic, have been traditionally based on virulence tests, at times supplemented with non-specific Restriction fragment length polymorphism or Random amplified polymorphic DNA markers. A genomic sequence of Bgh, which has become available recently, enables identification of potential markers suitable for population genetics studies. Two major strategies relying on transposable elements and microsatellites were employed in this work to develop a set of Repeat junction markers, Single sequence repeat and Single nucleotide polymorphism markers. The resolving power of the new panel of markers, comprising 33 polymorphisms, was demonstrated by a phylogenetic analysis of 158 Bgh isolates. A core set of 97 Czech isolates was compared to a set of 50 Australian isolates against the background of 11 diverse isolates collected throughout the world. 73.2% of Czech isolates were found to be genetically unique. The extreme diversity of this collection was in strong contrast with the uniformity of the Australian one. This work paves the way for studies of population structure and dynamics based on genetic variability among different Bgh isolates originating from geographically limited regions. PMID:27875588
Kosoy, Roman; Nassir, Rami; Tian, Chao; White, Phoebe A; Butler, Lesley M.; Silva, Gabriel; Kittles, Rick; Alarcon-Riquelme, Marta E.; Gregersen, Peter K.; Belmont, John W.; De La Vega, Francisco M.; Seldin, Michael F.
2011-01-01
To provide a resource for assessing continental ancestry in a wide variety of genetic studies we identified, validated and characterized a set of 128 ancestry informative markers (AIMs). The markers were chosen for informativeness, genome-wide distribution, and genotype reproducibility on two platforms (TaqMan® assays and Illumina arrays). We analyzed genotyping data from 825 subjects with diverse ancestry, including European, East Asian, Amerindian, African, South Asian, Mexican, and Puerto Rican. A comprehensive set of 128 AIMs and subsets as small as 24 AIMs are shown to be useful tools for ascertaining the origin of subjects from particular continents, and to correct for population stratification in admixed population sample sets. Our findings provide general guidelines for the application of specific AIM subsets as a resource for wide application. We conclude that investigators can use TaqMan assays for the selected AIMs as a simple and cost efficient tool to control for differences in continental ancestry when conducting association studies in ethnically diverse populations. PMID:18683858
A Genetic Linkage Map of the Male Goat Genome
Vaiman, D.; Schibler, L.; Bourgeois, F.; Oustry, A.; Amigues, Y.; Cribiu, E. P.
1996-01-01
This paper presents a first genetic linkage map of the goat genome. Primers derived from the flanking sequences of 612 bovine, ovine and goat microsatellite markers were gathered and tested for amplification with goat DNA under standardized PCR conditions. This screen made it possible to choose a set of 55 polymorphic markers that can be used in the three species and to define a panel of 223 microsatellites suitable for the goat. Twelve half-sib paternal goat families were then used to build a linkage map of the goat genome. The linkage analysis made it possible to construct a meiotic map covering 2300 cM, i.e., >80% of the total estimated length of the goat genome. Moreover, eight cosmids containing microsatellites were mapped by fluorescence in situ hybridization in goat and sheep. Together with 11 microsatellite-containing cosmids previously mapped in cattle (and supposing conservation of the banding pattern between this species and the goat) and data from the sheep map, these results made the orientation of 15 linkage groups possible. Furthermore, 12 coding sequences were mapped either genetically or physically, providing useful data for comparative mapping. PMID:8878693
Troise, Antonio Dario; Wiltafsky, Markus; Fogliano, Vincenzo; Vitaglione, Paola
2018-05-01
The quantification of protein bound Maillard reaction products (MRPs) is still a challenge in food chemistry. Protein hydrolysis is the bottleneck step: it is time consuming and the protein degradation is not always complete. In this study, the quantitation of free amino acids and Amadori products (APs) was compared to the percentage of blocked lysine by using chemometric tools. Eighty thermally treated soybean samples were analyzed by mass spectrometry to measure the concentration of free amino acids, free APs and the protein-bound markers of the Maillard reaction (furosine, Nε-(carboxymethyl)-l-lysine, Nε-(carboxyethyl)-l-lysine, total lysine). Results demonstrated that Discriminant Analysis (DA) and Correlated Component Regression (CCR) correctly estimated the percent of blocked lysine in a validation and prediction set. These findings indicate that the measure of free markers reflects the extent of protein damage in soybean samples and it suggests the possibility to obtain rapid information on the quality of the industrial processes. Copyright © 2017 Elsevier Ltd. All rights reserved.
EVALUATING RISK-PREDICTION MODELS USING DATA FROM ELECTRONIC HEALTH RECORDS.
Wang, L E; Shaw, Pamela A; Mathelier, Hansie M; Kimmel, Stephen E; French, Benjamin
2016-03-01
The availability of data from electronic health records facilitates the development and evaluation of risk-prediction models, but estimation of prediction accuracy could be limited by outcome misclassification, which can arise if events are not captured. We evaluate the robustness of prediction accuracy summaries, obtained from receiver operating characteristic curves and risk-reclassification methods, if events are not captured (i.e., "false negatives"). We derive estimators for sensitivity and specificity if misclassification is independent of marker values. In simulation studies, we quantify the potential for bias in prediction accuracy summaries if misclassification depends on marker values. We compare the accuracy of alternative prognostic models for 30-day all-cause hospital readmission among 4548 patients discharged from the University of Pennsylvania Health System with a primary diagnosis of heart failure. Simulation studies indicate that if misclassification depends on marker values, then the estimated accuracy improvement is also biased, but the direction of the bias depends on the direction of the association between markers and the probability of misclassification. In our application, 29% of the 1143 readmitted patients were readmitted to a hospital elsewhere in Pennsylvania, which reduced prediction accuracy. Outcome misclassification can result in erroneous conclusions regarding the accuracy of risk-prediction models.
NASA Technical Reports Server (NTRS)
Tarabalka, Y.; Tilton, J. C.; Benediktsson, J. A.; Chanussot, J.
2012-01-01
The Hierarchical SEGmentation (HSEG) algorithm, which combines region object finding with region object clustering, has given good performances for multi- and hyperspectral image analysis. This technique produces at its output a hierarchical set of image segmentations. The automated selection of a single segmentation level is often necessary. We propose and investigate the use of automatically selected markers for this purpose. In this paper, a novel Marker-based HSEG (M-HSEG) method for spectral-spatial classification of hyperspectral images is proposed. Two classification-based approaches for automatic marker selection are adapted and compared for this purpose. Then, a novel constrained marker-based HSEG algorithm is applied, resulting in a spectral-spatial classification map. Three different implementations of the M-HSEG method are proposed and their performances in terms of classification accuracies are compared. The experimental results, presented for three hyperspectral airborne images, demonstrate that the proposed approach yields accurate segmentation and classification maps, and thus is attractive for remote sensing image analysis.
Batista, Philip D; Janes, Jasmine K; Boone, Celia K; Murray, Brent W; Sperling, Felix A H
2016-09-01
Assessments of population genetic structure and demographic history have traditionally been based on neutral markers while explicitly excluding adaptive markers. In this study, we compared the utility of putatively adaptive and neutral single-nucleotide polymorphisms (SNPs) for inferring mountain pine beetle population structure across its geographic range. Both adaptive and neutral SNPs, and their combination, allowed range-wide structure to be distinguished and delimited a population that has recently undergone range expansion across northern British Columbia and Alberta. Using an equal number of both adaptive and neutral SNPs revealed that adaptive SNPs resulted in a stronger correlation between sampled populations and inferred clustering. Our results suggest that adaptive SNPs should not be excluded prior to analysis from neutral SNPs as a combination of both marker sets resulted in better resolution of genetic differentiation between populations than either marker set alone. These results demonstrate the utility of adaptive loci for resolving population genetic structure in a nonmodel organism.
King, Julie; Thomas, Ann; James, Caron; King, Ian; Armstead, Ian
2013-07-03
Ryegrasses and fescues (genera Lolium and Festuca) are species of forage and turf grasses which are used widely in agricultural and amenity situations. They are classified within the sub-family Pooideae and so are closely related to Brachypodium distachyon, wheat, barley, rye and oats. Recently, a DArT array has been developed which can be used in generating marker and mapping information for ryegrasses and fescues. This represents a potential common marker set for ryegrass and fescue researchers which can be linked through to comparative genomic information for the grasses. An F2 perennial ryegrass genetic map was developed consisting of 7 linkage groups defined by 1316 markers, with a total map length of 683 cM. The marker set included 866 DArT and 315 gene sequence-based markers. Comparison with previous DArT mapping studies in perennial and Italian ryegrass (L. multiflorum) identified 87 and 105 DArT markers in common, respectively, of which 94% and 87% mapped to homoeologous linkage groups. A similar comparison with meadow fescue (F. pratensis) identified only 28 DArT markers in common, of which c. 50% mapped to non-homoeologous linkage groups. In L. perenne, the genetic distance spanned by the DArT markers encompassed the majority of the regions that could be described in terms of comparative genomic relationships with rice, Brachypodium distachyon, and Sorghum bicolor. DArT markers are likely to be a useful common marker resource for ryegrasses and fescues, though the success in aligning different populations through the mapping of common markers will be influenced by degrees of population interrelatedness. The detailed mapping of DArT and gene-based markers in this study potentially allows comparative relationships to be derived in future mapping populations characterised using solely DArT markers.
Forest, J-C; Massé, J; Bujold, E; Rousseau, F; Charland, M; Thériault, S; Lafond, J; Giguère, Y
2012-07-01
The advent of early preventive measures, such as low-dose aspirin targeting women at high risk of preeclampsia (PE), emphasizes the need for better detection. Despite the emergence of promising biochemical markers linked to the pathophysiological processes, systematic reviews have shown that, until now, no single test fulfills the criteria set by WHO for biomarkers to screen for a disease. However, recent literature reveals that by combining various clinical, biophysical and biochemical markers into multivariate algorithms, one can envisage estimating the risk of PE with a performance that would reach clinical utility and cost-effectiveness, but this remains to be demonstrated in various environments and health care settings. The objective was to investigate, in a prospective study, the clinical utility of candidate biomarkers and clinical data to detect, early in pregnancy, women at risk of developing PE and to propose a multivariate prediction algorithm combining clinical parameters with biochemical markers. A total of 7929 pregnant women, prospectively recruited at the first prenatal visit, provided blood samples, clinical and sociodemographic information. 214 pregnant women developed hypertensive disorders of pregnancy (HDP), of which 88 had PE (1.2%), including 44 with severe PE (0.6%). A nested case-control study was performed including, for each case of HDP, two normal pregnancies matched for maternal age, gestational age at recruitment, ethnicity, parity, and smoking status. Based on the literature we selected the most promising markers in a multivariate logistic regression model: mean arterial pressure (MAP), BMI, placental growth factor (PlGF), soluble Flt-1, inhibin A and PAPP-A. Biomarker results measured between 10-18 weeks gestation were expressed as multiples of the median. Medians were determined for each gestational week.
When combined with MAP at the time of blood sampling and BMI at the beginning of pregnancy, the four biochemical markers discriminate normal pregnancies from those with HDP. At a 5% false positive rate, 37% of the affected pregnancies would have been detected. However, considering the prevalence of HDP in our population, the positive predictive value would have been only 15%. If all women predicted positive had been offered a preventive intervention, only one out of 6.7 women could potentially have benefited. In the case of severe PE, performance was not improved: sensitivity was the same, but the positive predictive value decreased to 3% (lower prevalence of severe PE). In our low-risk Caucasian population, neither individual candidate markers nor a multivariate risk algorithm using an a priori combination of selected markers reached a performance justifying implementation. This also emphasizes the necessity to take into consideration characteristics of the population and environment influencing prevalence before promoting wide implementation of such screening strategies. In a perspective of personalized medicine, it appears more than ever mandatory to tailor recommendations for HDP screening according not only to individual but also to population characteristics. Copyright © 2012. Published by Elsevier B.V.
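The dependence of the positive predictive value on prevalence described in the abstract above can be illustrated with Bayes' theorem. The following is a minimal sketch, not the authors' calculation: the sensitivity (37%) and false positive rate (5%) come from the abstract, while the prevalences are assumed here to be the crude cohort proportions (214/7929 for HDP, 44/7929 for severe PE). Under those assumptions the values come out close to, but not identical to, the reported 15% and 3%, so the authors presumably used somewhat different inputs.

```python
def positive_predictive_value(sensitivity, false_positive_rate, prevalence):
    """PPV via Bayes' theorem: P(affected | screen positive)."""
    true_pos = sensitivity * prevalence
    false_pos = false_positive_rate * (1.0 - prevalence)
    return true_pos / (true_pos + false_pos)


# Sensitivity and false positive rate as quoted in the abstract.
SENSITIVITY = 0.37
FALSE_POSITIVE_RATE = 0.05

# Assumption: prevalence taken as the crude proportion in the cohort.
hdp_prevalence = 214 / 7929
severe_pe_prevalence = 44 / 7929

ppv_hdp = positive_predictive_value(SENSITIVITY, FALSE_POSITIVE_RATE, hdp_prevalence)
ppv_severe = positive_predictive_value(SENSITIVITY, FALSE_POSITIVE_RATE, severe_pe_prevalence)

print(f"HDP:       prevalence {hdp_prevalence:.3f}, PPV {ppv_hdp:.3f}")
print(f"Severe PE: prevalence {severe_pe_prevalence:.4f}, PPV {ppv_severe:.3f}")
```

With sensitivity and false positive rate held fixed, the lower prevalence of severe PE alone is enough to pull the positive predictive value down, which is the point the abstract makes.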
Lee, S; Kim, C S; Shin, Y G; Kim, J H; Kim, Y S; Jheong, W H
2016-03-01
The Peach rosette mosaic virus (PRMV) is a plant pathogen of the genus Nepovirus, and has been designated as a controlled quarantine virus in Korea. In this study, a specific reverse transcription (RT)-PCR marker set, nested PCR marker set, and modified-plasmid positive control were developed to promptly and accurately diagnose PRMV at plant-quarantine sites. The final selected PRMV-specific RT-PCR marker was PRMV-N10/C70 (967 bp), and the nested PCR product of 419 bp was finally amplified. The modified-plasmid positive control, in which the SalI restriction-enzyme region (GTCGAC) was inserted, verified PRMV contamination in a comparison with the control, enabling a more accurate diagnosis. It is expected that the developed method will continuously contribute to the plant-quarantine process in Korea.
BRSCW Reference Set Application: Joe Buechler - Biosite Inc (2009) — EDRN Public Portal
Over 40 marker assays are available to run on the samples. These include markers such as Osteopontin, Mesothelin, Periostin, Endoglin, intestinal Fatty Acid Binding Protein, and FAS-Ligand, some of which have been previously described in the literature. Other proprietary markers are derived from internal discovery efforts and from collaborator programs.
Estimation of treatment effects in all-comers randomized clinical trials with a predictive marker.
Choai, Yuki; Matsui, Shigeyuki
2015-03-01
Recent advances in genomics and biotechnologies have accelerated the development of molecularly targeted treatments and accompanying markers to predict treatment responsiveness. However, it is common at the initiation of a definitive phase III clinical trial that there is no compelling biological basis or early trial data for a candidate marker regarding its capability in predicting treatment effects. In this case, it is reasonable to include all patients as eligible for randomization, but to plan for prospective subgroup analysis based on the marker. One analysis plan in such all-comers designs is the so-called fallback approach that first tests for overall treatment efficacy and then proceeds to testing in a biomarker-positive subgroup if the first test is not significant. In this approach, owing to the adaptive nature of the analysis and a correlation between the two tests, a bias will arise in estimating the treatment effect in the biomarker-positive subgroup after a non-significant first overall test. In this article, we formulate the bias function and show a difficulty in obtaining unbiased estimators for a whole range of an associated parameter. To address this issue, we propose bias-corrected estimation methods, including those based on an approximation of the bias function under a bounded range of the parameter using polynomials. We also provide an interval estimation method based on a bivariate doubly truncated normal distribution. Simulation experiments demonstrated a success in bias reduction. Application to a phase III trial for lung cancer is provided. © 2014, The International Biometric Society.
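The conditional bias described in the abstract above can be illustrated with a small Monte Carlo experiment. This is a toy sketch under simplifying assumptions, not the authors' bias function: the subgroup estimates are taken to be independent normal variables, the overall estimate is their prevalence-weighted average, the overall test is one-sided, and all numerical values (effects, standard errors, weight, critical value) are hypothetical.

```python
import math
import random


def conditional_mean_subgroup_estimate(delta_pos, delta_neg, se_pos, se_neg,
                                       weight_pos, z_crit, n_sims, seed=0):
    """Mean of the marker-positive estimate over simulated trials in which
    the overall (all-comers) test was NOT significant."""
    rng = random.Random(seed)
    # Standard error of the weighted overall estimate (independent subgroups).
    se_overall = math.sqrt((weight_pos * se_pos) ** 2
                           + ((1 - weight_pos) * se_neg) ** 2)
    kept = []
    for _ in range(n_sims):
        d_pos = rng.gauss(delta_pos, se_pos)   # marker-positive estimate
        d_neg = rng.gauss(delta_neg, se_neg)   # marker-negative estimate
        overall = weight_pos * d_pos + (1 - weight_pos) * d_neg
        # Fallback step: the subgroup is analysed only if the overall test fails.
        if overall / se_overall < z_crit:
            kept.append(d_pos)
    return sum(kept) / len(kept)


# Hypothetical settings: true effect 0.3 in marker-positive patients, none otherwise.
TRUE_DELTA_POS = 0.3
cond_mean = conditional_mean_subgroup_estimate(
    delta_pos=TRUE_DELTA_POS, delta_neg=0.0, se_pos=0.15, se_neg=0.15,
    weight_pos=0.5, z_crit=1.96, n_sims=100_000)

print(f"true effect {TRUE_DELTA_POS}, conditional mean estimate {cond_mean:.3f}")
```

Because the overall statistic is positively correlated with the marker-positive estimate, selecting trials with a non-significant overall test preferentially keeps those with small subgroup estimates, so the conditional mean falls below the true effect.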
Bonicelli, Andrea; Xhemali, Bledar; Kranioti, Elena F.
2017-01-01
Age estimation remains one of the most challenging tasks in forensic practice when establishing a biological profile of unknown skeletonised remains. Morphological methods based on developmental markers of bones can provide accurate age estimates at a young age, but become highly unreliable for ages over 35 when all developmental markers disappear. This study explores the changes in the biomechanical properties of bone tissue and matrix, which continue to change with age even after skeletal maturity, and their potential value for age estimation. As a proof of concept we investigated the relationship of 28 variables at the macroscopic and microscopic level in rib autopsy samples from 24 individuals. Stepwise regression analysis produced a number of equations, one of which, with seven variables, showed an R2 = 0.949, a mean residual error of 2.13 yrs ±0.4 (SD) and a maximum residual error of 2.88 yrs. For forensic purposes, by using only bench top machines in tests which can be carried out within 36 hrs, a set of just 3 variables produced an equation with an R2 = 0.902, a mean residual error of 3.38 yrs ±2.6 (SD) and a maximum observed residual error of 9.26 yrs. This method outstrips all existing age-at-death methods based on ribs, thus providing a novel, accurate, lab-based tool in the forensic investigation of human remains. The present application is optimised for fresh (uncompromised by taphonomic conditions) remains, but the potential of the principle and method is vast once the trends of the biomechanical variables are established for other environmental conditions and circumstances. PMID:28520764
Namroud, Marie-Claire; Beaulieu, Jean; Juge, Nicolas; Laroche, Jérôme; Bousquet, Jean
2008-01-01
Conifers are characterized by a large genome size and a rapid decay of linkage disequilibrium, most often within gene limits. Genome scans based on noncoding markers are less likely to detect molecular adaptation linked to genes in these species. In this study, we assessed the effectiveness of a genome-wide single nucleotide polymorphism (SNP) scan focused on expressed genes in detecting local adaptation in a conifer species. Samples were collected from six natural populations of white spruce (Picea glauca) moderately differentiated for several quantitative characters. A total of 534 SNPs representing 345 expressed genes were analysed. Genes potentially under natural selection were identified by estimating the differentiation in SNP frequencies among populations (FST) and identifying outliers, and by estimating local differentiation using a Bayesian approach. Both average expected heterozygosity and population differentiation estimates (HE = 0.270 and FST = 0.006) were comparable to those obtained with other genetic markers. Of all genes, 5.5% were identified as outliers with FST at the 95% confidence level, while 14% were identified as candidates for local adaptation with the Bayesian method. There was some overlap between the two gene sets. More than half of the candidate genes for local adaptation were specific to the warmest population, about 20% to the most arid population, and 15% to the coldest and most humid higher altitude population. These adaptive trends were consistent with the genes’ putative functions and the divergence in quantitative traits noted among the populations. The results suggest that an approach separating the locus and population effects is useful to identify genes potentially under selection. These candidates are worth exploring in more details at the physiological and ecological levels. PMID:18662225
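As a rough illustration of the differentiation statistic used in the abstract above, the sketch below computes Wright's F_ST for a single biallelic SNP from per-population allele frequencies. It is a textbook definition with equal population weights and no sample-size correction, which is an assumption made here; the abstract does not state which estimator the study used, and the example frequencies are hypothetical.

```python
def fst_biallelic(allele_freqs):
    """Wright's F_ST = (H_T - H_S) / H_T for one biallelic locus.

    allele_freqs: frequency of the same allele in each population.
    Populations are weighted equally; no sample-size correction is applied.
    """
    n_pops = len(allele_freqs)
    p_bar = sum(allele_freqs) / n_pops
    # Expected heterozygosity of the pooled population.
    h_total = 2 * p_bar * (1 - p_bar)
    # Mean expected heterozygosity within populations.
    h_within = sum(2 * p * (1 - p) for p in allele_freqs) / n_pops
    if h_total == 0:
        return 0.0  # locus is monomorphic everywhere
    return (h_total - h_within) / h_total


# Hypothetical frequencies in six populations (not data from the study).
weakly_differentiated = [0.48, 0.50, 0.52, 0.49, 0.51, 0.50]
strongly_differentiated = [0.10, 0.15, 0.85, 0.90, 0.20, 0.80]

fst_weak = fst_biallelic(weakly_differentiated)
fst_strong = fst_biallelic(strongly_differentiated)
print(f"weak: {fst_weak:.4f}  strong: {fst_strong:.4f}")
```

An outlier scan then asks which loci have F_ST values unusually high relative to the genome-wide distribution, as the second hypothetical SNP would against a background like the first.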
Beaulieu, J; Doerksen, T; Clément, S; MacKay, J; Bousquet, J
2014-01-01
Genomic selection (GS) is of interest in breeding because of its potential for predicting the genetic value of individuals and increasing genetic gains per unit of time. To date, very few studies have reported empirical results of GS potential in the context of large population sizes and long breeding cycles such as for boreal trees. In this study, we assessed the effectiveness of marker-aided selection in an undomesticated white spruce (Picea glauca (Moench) Voss) population of large effective size using a GS approach. A discovery population of 1694 trees representative of 214 open-pollinated families from 43 natural populations was phenotyped for 12 wood and growth traits and genotyped for 6385 single-nucleotide polymorphisms (SNPs) mined in 2660 gene sequences. GS models were built to predict estimated breeding values using all the available SNPs or SNP subsets of the largest absolute effects, and they were validated using various cross-validation schemes. The accuracy of genomic estimated breeding values (GEBVs) varied from 0.327 to 0.435 when the training and the validation data sets shared half-sibs, which was on average 90% of the accuracy achieved through traditionally estimated breeding values. The trend was the same for validation across sites. As expected, the accuracy of GEBVs obtained after cross-validation with individuals of unknown relatedness was lower, at about half of the accuracy achieved when half-sibs were present. We showed that with the marker densities used in the current study, predictions with low to moderate accuracy could be obtained within a large undomesticated population of related individuals, potentially resulting in larger gains per unit of time with GS than with the traditional approach. PMID:24781808
Nguyen, Kim-Huong; Jimenez-Soto, Eliana; Dayal, Prarthna; Hodge, Andrew
2013-06-27
The Millennium Development Goals prompted renewed international efforts to reduce under-five mortality and measure national progress. However, scant evidence exists about the distribution of child mortality at low sub-national levels, which in diverse and decentralized countries like India are required to inform policy-making. This study estimates changes in child mortality across a range of markers of inequalities in Orissa and Madhya Pradesh, two of India's largest, poorest, and most disadvantaged states. Estimates of under-five and neonatal mortality rates were computed using seven datasets from three available sources--sample registration system, summary birth histories in surveys, and complete birth histories. Inequalities were gauged by comparison of mortality rates within four sub-state populations defined by the following characteristics: rural-urban location, ethnicity, wealth, and district. Trend estimates suggest that progress has been made in mortality rates at the state levels. However, reduction rates have been modest, particularly for neonatal mortality. Different mortality rates are observed across all the equity markers, although there is a pattern of convergence between rural and urban areas, largely due to inadequate progress in urban settings. Inter-district disparities and differences between socioeconomic groups are also evident. Although child mortality rates continue to decline at the national level, our evidence shows that considerable disparities persist. While progress in reducing under-five and neonatal mortality rates in urban areas appears to be levelling off, policies targeting rural populations and scheduled caste and tribe groups appear to have achieved some success in reducing mortality differentials. The results of this study thus add weight to recent government initiatives targeting these groups. 
Equitable progress, particularly for neonatal mortality, requires continuing efforts to strengthen health systems and overcome barriers to identify and reach vulnerable groups.
Ancestry Analysis in the 11-M Madrid Bomb Attack Investigation
Phillips, Christopher; Prieto, Lourdes; Fondevila, Manuel; Salas, Antonio; Gómez-Tato, Antonio; Álvarez-Dios, José; Alonso, Antonio; Blanco-Verea, Alejandro; Brión, María; Montesino, Marta; Carracedo, Ángel; Lareu, María Victoria
2009-01-01
The 11-M Madrid commuter train bombings of 2004 constituted the second biggest terrorist attack to occur in Europe after Lockerbie, while the subsequent investigation became the most complex and wide-ranging forensic case in Spain. Standard short tandem repeat (STR) profiling of 600 exhibits left certain key incriminatory samples unmatched to any of the apprehended suspects. A judicial order to perform analyses of unmatched samples to differentiate European and North African ancestry became a critical part of the investigation and was instigated to help refine the search for further suspects. Although mitochondrial DNA (mtDNA) and Y-chromosome markers routinely demonstrate informative geographic differentiation, the populations compared in this analysis were known to show a proportion of shared mtDNA and Y haplotypes as a result of recent gene-flow across the western Mediterranean, while any two loci can be unrepresentative of the ancestry of an individual as a whole. We based our principal analysis on a validated 34plex autosomal ancestry-informative-marker single nucleotide polymorphism (AIM-SNP) assay to make an assignment of ancestry for DNA from seven unmatched case samples including a handprint from a bag containing undetonated explosives together with personal items recovered from various locations in Madrid associated with the suspects. To assess marker informativeness before genotyping, we predicted the probable classification success for the 34plex assay with standard error estimators for a naïve Bayesian classifier using Moroccan and Spanish training sets (each n = 48). Once misclassification error was found to be sufficiently low, genotyping yielded seven near-complete profiles (33 of 34 AIM-SNPs) that in four cases gave probabilities providing a clear assignment of ancestry. One of the suspects predicted to be North African by AIM-SNP analysis of DNA from a toothbrush was identified late in the investigation as Algerian in origin. 
The results achieved illustrate the benefit of adding specialized marker sets to provide enhanced scope and power to an already highly effective system of DNA analysis for forensic identification. PMID:19668368
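The naïve Bayesian assignment described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' 34plex pipeline: it assumes biallelic SNPs coded as 0, 1 or 2 copies of one allele, Hardy-Weinberg genotype probabilities, independence between SNPs, and equal priors unless others are given. The function names and the pseudocount are hypothetical.

```python
import math

def genotype_prob(alt_count, alt_freq):
    """Hardy-Weinberg probability of carrying alt_count (0, 1 or 2) copies of an allele."""
    p = alt_freq
    return [(1 - p) ** 2, 2 * p * (1 - p), p ** 2][alt_count]

def estimate_freqs(training_genotypes, pseudocount=0.5):
    """Per-SNP allele frequency from a training set, smoothed away from 0 and 1."""
    n_snps = len(training_genotypes[0])
    freqs = []
    for j in range(n_snps):
        calls = [g[j] for g in training_genotypes if g[j] is not None]
        alt = sum(calls)
        freqs.append((alt + pseudocount) / (2 * len(calls) + 2 * pseudocount))
    return freqs

def posterior(profile, freqs_by_pop, priors=None):
    """Posterior probability of each population given one genotype profile."""
    pops = list(freqs_by_pop)
    if priors is None:
        priors = {pop: 1.0 / len(pops) for pop in pops}
    log_post = {}
    for pop in pops:
        total = math.log(priors[pop])
        for g, f in zip(profile, freqs_by_pop[pop]):
            if g is None:  # a missing SNP contributes no evidence
                continue
            total += math.log(genotype_prob(g, f))
        log_post[pop] = total
    # Normalise in log space to avoid underflow with many SNPs.
    top = max(log_post.values())
    weights = {pop: math.exp(v - top) for pop, v in log_post.items()}
    norm = sum(weights.values())
    return {pop: w / norm for pop, w in weights.items()}
```

Skipping missing calls mirrors the near-complete profiles (33 of 34 AIM-SNPs) reported above: a failed SNP simply drops out of the product rather than invalidating the assignment.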
Nater, Alexander; Arora, Natasha; Greminger, Maja P; van Schaik, Carel P; Singleton, Ian; Wich, Serge A; Fredriksson, Gabriella; Perwitasari-Farajallah, Dyah; Pamungkas, Joko; Krützen, Michael
2013-01-01
A multitude of factors influence how natural populations are genetically structured, including dispersal barriers, inhomogeneous habitats, and social organization. Such population subdivision is of special concern in endangered species, as it may lead to reduced adaptive potential and inbreeding in local subpopulations, thus increasing the risk of future extinctions. With only 6600 animals left in the wild, Sumatran orangutans (Pongo abelii) are among the most endangered, but also most enigmatic, great ape species. In order to infer the fine-scale population structure and connectivity of Sumatran orangutans, we analyzed the most comprehensive set of samples to date, including mitochondrial hyper-variable region I haplotypes for 123 individuals and genotypes of 27 autosomal microsatellite markers for 109 individuals. For both mitochondrial and autosomal markers, we found a pronounced population structure, caused by major rivers, mountain ridges, and the Toba caldera. We found that genetic diversity and corresponding long-term effective population size estimates vary strongly among sampling regions for mitochondrial DNA, but show remarkable similarity for autosomal markers, hinting at male-driven long-distance gene flow. In support of this, we identified several individuals that were most likely sired by males originating from other genetic clusters. Our results highlight the effect of natural barriers in shaping the genetic structure of great ape populations, but also point toward important dispersal corridors on northern Sumatra that allow for genetic exchange.
Caffeic acid attenuates lipopolysaccharide-induced sickness behaviour and neuroinflammation in mice.
Basu Mallik, Sanchari; Mudgal, Jayesh; Nampoothiri, Madhavan; Hall, Susan; Dukie, Shailendra Anoopkumar-; Grant, Gary; Rao, C Mallikarjuna; Arora, Devinder
2016-10-06
Accumulating data links inflammation, oxidative stress and immune system in the pathophysiology of major depressive disorders. Sickness behaviour is a set of behavioural changes that develop during infection, eventually leading to decrease in mobility and depressed behaviour. Lipopolysaccharide (LPS) induces a depression-like state in animals that mimics sickness behaviour. Caffeic acid, a naturally occurring polyphenol, possesses antioxidant and anti-inflammatory properties. The present study was designed to explore the potential of caffeic acid against LPS-induced sickness behaviour in mice. Caffeic acid (30mg/kg) and imipramine (15mg/kg) were administered orally one hour prior to LPS (1.5mg/kg) challenge. Behavioural assessment was carried out between 1 and 2h and blood samples were collected at 3h post-LPS injection. Additionally, cytokines (brain and serum) and brain oxidative stress markers were estimated. LPS increased the systemic and brain cytokine levels, altered the anti-oxidant defence and produced key signs of sickness behaviour in animals. Caffeic acid treatment significantly reduced the LPS-induced changes, including reduced expression of inflammatory markers in serum and whole brain. Caffeic acid also exerted an anti-oxidant effect, which was evident from the decreased levels of oxidative stress markers in whole brain. Our data suggests that caffeic acid can prevent the neuroinflammation-induced acute and probably the long term neurodegenerative changes. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Segmentation of prostate biopsy needles in transrectal ultrasound images
NASA Astrophysics Data System (ADS)
Krefting, Dagmar; Haupt, Barbara; Tolxdorff, Thomas; Kempkensteffen, Carsten; Miller, Kurt
2007-03-01
Prostate cancer is the most common cancer in men. Tissue extraction at different locations (biopsy) is the gold standard for diagnosis of prostate cancer. These biopsies are commonly guided by transrectal ultrasound imaging (TRUS). Exact location of the extracted tissue within the gland is desired for more specific diagnosis and provides better therapy planning. While the orientation and the position of the needle within a clinical TRUS image are limited, the apparent length and visibility of the needle vary strongly. Marker lines are present, and tissue inhomogeneities and deflection artefacts may appear. Simple intensity-, gradient- or edge-detection-based segmentation methods fail. Therefore a multivariate statistical classifier is implemented. The independent feature model is built by supervised learning using a set of manually segmented needles. The feature space is spanned by common binary object features such as size and eccentricity as well as imaging-system dependent features like distance and orientation relative to the marker line. The object extraction is done by multi-step binarization of the region of interest (ROI). The ROI is automatically determined at the beginning of the segmentation and marker lines are removed from the images. The segmentation itself is realized by scale-invariant classification using maximum likelihood estimation and Mahalanobis distance as discriminator. The technique presented here could be successfully applied in 94% of 1835 TRUS images from 30 tissue extractions. It provides a robust method for biopsy needle localization in clinical prostate biopsy TRUS images.
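A classifier of the kind this abstract describes can be sketched under one simplifying assumption: reading "independent feature model" as a diagonal covariance matrix, the Mahalanobis distance reduces to a variance-scaled Euclidean distance. The feature values, function names and acceptance threshold below are hypothetical and are not taken from the paper.

```python
import statistics

def fit_independent_model(feature_rows):
    """Maximum-likelihood style fit: per-feature mean and sample variance.

    feature_rows holds one tuple of features per manually segmented needle.
    Every feature must vary in the training set (non-zero variance).
    """
    columns = list(zip(*feature_rows))
    means = [statistics.fmean(c) for c in columns]
    variances = [statistics.variance(c) for c in columns]
    return means, variances

def mahalanobis_sq(x, means, variances):
    """Squared Mahalanobis distance for a diagonal covariance matrix."""
    return sum((xi - m) ** 2 / v for xi, m, v in zip(x, means, variances))

def is_needle(x, model, threshold_sq):
    """Accept a candidate object if it lies close enough to the needle class."""
    means, variances = model
    return mahalanobis_sq(x, means, variances) <= threshold_sq
```

Candidate objects from the binarization step would each be reduced to a feature tuple (for example size and eccentricity) and passed to `is_needle`; the object with the smallest distance could be kept when several pass the threshold.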
Jiang, Li; Wei, Yi-Liang; Zhao, Lei; Li, Na; Liu, Tao; Liu, Hai-Bo; Ren, Li-Jie; Li, Jiu-Ling; Hao, Hui-Fang; Li, Qing; Li, Cai-Xia
2018-07-01
Over the last decade, several panels of ancestry-informative markers have been proposed for the analysis of population genetic structure. The differentiation efficiency depends on the discriminatory ability of the included markers and the reference population coverage. We previously developed a small set of 27 autosomal single nucleotide polymorphisms (SNPs) for analyzing African, European, and East Asian ancestries. In the current study, we gathered a high-coverage reference database of 110 populations (10,350 individuals) from across the globe. The discrimination power of the panel was re-evaluated using four continental ancestry groups (as well as Indigenous Americans). We observed that all 27 SNPs demonstrated stratified population specificity, leading to a striking ancestral discrimination. Five markers (rs728404, rs7170869, rs2470102, rs1448485, and rs4789193) showed differences (δ > 0.3) in the frequency profiles between East Asian and Indigenous American populations. Ancestry components of all involved populations were accurately assessed compared with those from previous genome-wide analyses, thereby achieving broad population separation. Thus, our ancestral inference panel of a small number of highly informative SNPs in combination with a large-scale reference database provides high resolution in estimating ancestry compositions and distinguishing individual origins. We propose extensive usage in biomedical studies and forensics. Copyright © 2018 Elsevier B.V. All rights reserved.
MAGMA: Generalized Gene-Set Analysis of GWAS Data
de Leeuw, Christiaan A.; Mooij, Joris M.; Heskes, Tom; Posthuma, Danielle
2015-01-01
By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn’s Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn’s Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn’s Disease data was found to be considerably faster as well. PMID:25885710
Ferrarini, Luca; Frisoni, Giovanni B; Pievani, Michela; Reiber, Johan H C; Ganzola, Rossana; Milles, Julien
2009-01-01
In this study, we investigated the use of hippocampal shape-based markers for automatic detection of Alzheimer's disease (AD) and mild cognitive impairment converters (MCI-c). Three-dimensional T1-weighted magnetic resonance images of 50 AD subjects, 50 age-matched controls, 15 MCI-c, and 15 MCI-non-converters (MCI-nc) were taken. Manual delineations of both hippocampi were obtained from normalized images. Fully automatic shape modeling was used to generate comparable meshes for both structures. Repeated permutation tests, run over a randomly sub-sampled training set (25 controls and 25 ADs), highlighted shape-based markers, mostly located in the CA1 sector, which consistently discriminated ADs and controls. Support vector machines (SVMs) were trained, using markers from either one or both hippocampi, to automatically classify control and AD subjects. Leave-1-out cross-validations over the remaining 25 ADs and 25 controls resulted in an optimal accuracy of 90% (sensitivity 92%), for markers in the left hippocampus. The same morphological markers were used to train SVMs for MCI-c versus MCI-nc classification: markers in the right hippocampus reached an accuracy (and sensitivity) of 80%. Due to the pattern recognition framework, our results statistically represent the expected performances of clinical set-ups, and compare favorably to analyses based on hippocampal volumes.
Harmon, Monica; Lane, Thomas; Staton, Margaret; Coggeshall, Mark V; Best, Teodora; Chen, Chien-Chih; Liang, Haiying; Zembower, Nicole; Drautz-Moses, Daniela I; Hwee, Yap Zhei; Schuster, Stephan C; Schlarbaum, Scott E; Carlson, John E; Gailing, Oliver
2017-08-08
Sugar maple (Acer saccharum Marsh.) is a hardwood tree species native to northeastern North America and economically valued for its wood and sap. Yet, few molecular genetic resources have been developed for this species to date. Microsatellite markers have been a useful tool in population genetics, e.g., to monitor genetic variation and to analyze gene flow patterns. The objective of this study is to develop a reference transcriptome and microsatellite markers in sugar maple. A set of 117,861 putative unique transcripts were assembled using 29.2 Gb of RNA sequencing data derived from different tissues and stress treatments. From this set of sequences a total of 1068 microsatellite motifs were identified. Out of 58 genic microsatellite markers tested on a population of 47 sugar maple trees in upper Michigan, 22 amplified well, of which 16 were polymorphic and 6 were monomorphic. Values for expected heterozygosity varied from 0.224 to 0.726 for individual loci. Of the 16 polymorphic markers, 15 exhibited transferability to other Acer L. species. Genic microsatellite markers can be applied to analyze genetic variation in potentially adaptive genes relative to genomic reference markers as a basis for the management of sugar maple genetic resources in the face of climate change.
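Expected heterozygosity at a single locus, the statistic reported above, is one minus the sum of squared allele frequencies. The sketch below assumes diploid genotypes given as pairs of allele labels; the function name and the example allele sizes are hypothetical, and the optional small-sample correction follows Nei's standard estimator rather than anything stated in the abstract.

```python
from collections import Counter

def expected_heterozygosity(genotypes, unbiased=False):
    """Expected heterozygosity He = 1 - sum(p_i^2) at one locus.

    genotypes is a list of (allele, allele) pairs, one per diploid individual.
    With unbiased=True the estimate is scaled by n / (n - 1), where n is the
    number of sampled gene copies (twice the number of individuals).
    """
    alleles = [a for pair in genotypes for a in pair]
    n = len(alleles)
    counts = Counter(alleles)
    he = 1.0 - sum((c / n) ** 2 for c in counts.values())
    if unbiased:
        he *= n / (n - 1)
    return he

# Two alleles at equal frequency give He = 0.5; a monomorphic locus gives 0.
print(expected_heterozygosity([(150, 150), (150, 152), (152, 152), (150, 152)]))
```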
Diagnostic and Prognostic Utility of the Synaptic Marker Neurogranin in Alzheimer Disease
Tarawneh, Rawan; D’Angelo, Gina; Crimmins, Dan; Herries, Elizabeth; Griest, Terry; Fagan, Anne M.; Zipfel, Gregory J.; Ladenson, Jack H.; Morris, John C.; Holtzman, David M.
2016-01-01
IMPORTANCE Synaptic loss is an early pathologic substrate of Alzheimer disease (AD). Neurogranin is a postsynaptic neuronal protein that has demonstrated utility as a cerebrospinal fluid (CSF) marker of synaptic loss in AD. OBJECTIVE To investigate the diagnostic and prognostic utility of CSF neurogranin levels in a large, well-characterized cohort of individuals with symptomatic AD and cognitively normal controls. DESIGN, SETTING, AND PARTICIPANTS A cross-sectional and longitudinal observational study of cognitive decline in patients with symptomatic AD and cognitively normal controls was performed. Participants were individuals with a clinical diagnosis of early symptomatic AD and cognitively normal controls who were enrolled in longitudinal studies of aging and dementia at the Charles F. and Joanne Knight Alzheimer Disease Research Center, Washington University School of Medicine, from January 21, 2000, through March 21, 2011. Data analysis was performed from November 1, 2013, to March 31, 2015. MAIN OUTCOMES AND MEASURES Correlations between baseline CSF biomarker levels and future cognitive decline in patients with symptomatic AD and cognitively normal controls over time. RESULTS A total of 302 individuals (mean [SE] age, 73.1 [0.4] years) were included in this study (95 patients [52 women and 43 men] with AD and 207 controls [125 women and 82 men]). The CSF neurogranin levels differentiated patients with early symptomatic AD from controls with comparable diagnostic utility (mean [SE] area under the receiver operating characteristic curve, 0.71 [0.03]; 95% CI, 0.64–0.77) to the other CSF biomarkers. The CSF neurogranin levels correlated with brain atrophy (normalized whole-brain volumes: adjusted r = −0.38, P = .02; hippocampal volumes: adjusted r = −0.36, P = .03; entorhinal volumes: adjusted r = −0.46, P = .006; and parahippocampal volumes: adjusted r = −0.47, P = .005, n = 38) in AD and with amyloid load (r = 0.39, P = .02, n = 36) in preclinical AD. 
The CSF neurogranin levels predicted future cognitive impairment (adjusted hazard ratio, 1.89; 95% CI, 1.29–2.78; P = .001 as a continuous measure, and adjusted hazard ratio, 2.78; 95% CI, 1.13–5.99; P = .02 as a categorical measure using the 85th percentile cutoff value) in controls and rates of cognitive decline (Clinical Dementia Rating sum of boxes score: β estimate, 0.29; P = .001; global composite scores: β estimate, −0.11; P = .001; episodic memory scores: β estimate, −0.18; P < .001; and semantic memory scores: β estimate, −0.06; P = .04, n = 57) in patients with symptomatic AD over time, similarly to the CSF proteins VILIP-1, tau, and p-tau181. CONCLUSIONS AND RELEVANCE The CSF levels of the synaptic marker neurogranin offer diagnostic and prognostic utility for early symptomatic AD that is comparable to other CSF markers of AD. Importantly, CSF neurogranin complements the collective ability of these markers to predict future cognitive decline in cognitively normal individuals and, therefore, will be a useful addition to the current panel of AD biomarkers. PMID:27018940
Khatkar, Mehar S; Nicholas, Frank W; Collins, Andrew R; Zenger, Kyall R; Cavanagh, Julie A L; Barris, Wes; Schnabel, Robert D; Taylor, Jeremy F; Raadsma, Herman W
2008-04-24
The extent of linkage disequilibrium (LD) within a population determines the number of markers that will be required for successful association mapping and marker-assisted selection. Most studies on LD in cattle reported to date are based on microsatellite markers or small numbers of single nucleotide polymorphisms (SNPs) covering one or only a few chromosomes. This is the first comprehensive study on the extent of LD in cattle by analyzing data on 1,546 Holstein-Friesian bulls genotyped for 15,036 SNP markers covering all regions of all autosomes. Furthermore, most studies in cattle have used relatively small sample sizes and, consequently, may have had biased estimates of measures commonly used to describe LD. We examine minimum sample sizes required to estimate LD without bias and loss in accuracy. Finally, relatively little information is available on comparative LD structures including other mammalian species such as human and mouse, and we compare LD structure in cattle with public-domain data from both human and mouse. We computed three LD estimates, D', Dvol and r2, for 1,566,890 syntenic SNP pairs and a sample of 365,400 non-syntenic pairs. Mean D' is 0.189 among syntenic SNPs, and 0.105 among non-syntenic SNPs; mean r2 is 0.024 among syntenic SNPs and 0.0032 among non-syntenic SNPs. All three measures of LD for syntenic pairs decline with distance; the decline is much steeper for r2 than for D' and Dvol. The values of D' and Dvol are quite similar. Significant LD in cattle extends to 40 kb (when estimated as r2) and 8.2 Mb (when estimated as D'). The mean values for LD at large physical distances are close to those for non-syntenic SNPs. Minor allelic frequency threshold affects the distribution and extent of LD. For unbiased and accurate estimates of LD across marker intervals spanning < 1 kb to > 50 Mb, minimum sample sizes of 400 (for D') and 75 (for r2) are required. The bias due to small sample sizes increases with inter-marker interval. 
LD in cattle is much less extensive than in a mouse population created from crossing inbred lines, and more extensive than in humans. For association mapping in Holstein-Friesian cattle, for a given design, at least one SNP is required for each 40 kb, giving a total requirement of at least 75,000 SNPs for a low power whole-genome scan (median r2 > 0.19) and up to 300,000 markers at 10 kb intervals for a high power genome scan (median r2 > 0.62). For estimation of LD by D' and Dvol with sufficient precision, a sample size of at least 400 is required, whereas for r2 a minimum sample of 75 is adequate.
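Two of the LD measures named in this abstract, D' and r2, follow from the standard textbook definitions once haplotype frequencies are known. The sketch below assumes phased haplotype counts for a pair of biallelic SNPs (alleles A/a and B/b); the study itself worked from genotype data, so haplotype frequencies there would first have to be estimated. Dvol is not covered, and the function name is hypothetical.

```python
def ld_measures(n_AB, n_Ab, n_aB, n_ab):
    """Return (D, D', r^2) for two biallelic loci from haplotype counts."""
    n = n_AB + n_Ab + n_aB + n_ab
    p_AB = n_AB / n
    p_A = (n_AB + n_Ab) / n  # frequency of allele A at the first locus
    p_B = (n_AB + n_aB) / n  # frequency of allele B at the second locus

    d = p_AB - p_A * p_B     # raw disequilibrium coefficient
    if d == 0:
        return 0.0, 0.0, 0.0

    # D' rescales |D| by the largest value it could take given the allele frequencies.
    if d > 0:
        d_max = min(p_A * (1 - p_B), (1 - p_A) * p_B)
    else:
        d_max = min(p_A * p_B, (1 - p_A) * (1 - p_B))
    d_prime = abs(d) / d_max

    # r^2 is the squared correlation between allele indicators at the two loci.
    r_sq = d * d / (p_A * (1 - p_A) * p_B * (1 - p_B))
    return d, d_prime, r_sq
```

Because D' is rescaled to its maximum while r2 is not, D' stays high whenever one haplotype is rare or absent, which is consistent with the much longer range reported for D' (8.2 Mb) than for r2 (40 kb).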
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, J; Nguyen, D; O’Brien, R
Purpose: The kilovoltage intrafraction monitoring (KIM) scheme has been successfully used to simultaneously monitor 3D tumor motion during radiotherapy. Recently, an iterative closest point (ICP) algorithm was implemented in KIM to also measure rotations about three axes, enabling real-time tracking of tumor motion in six degrees-of-freedom (DoF). This study aims to evaluate the accuracy of the six DoF motion estimates of KIM by comparing them with the corresponding motion (i) measured by the Calypso; and (ii) derived from kV/MV triangulation. Methods: (i) Various motions (static and dynamic) were applied to a CIRS phantom with three embedded electromagnetic transponders (Calypso Medical) using a 5D motion platform (HexaMotion) and a rotating treatment couch while both KIM and Calypso were used to concurrently track the phantom motion in six DoF. (ii) KIM was also used to retrospectively estimate six DoF motion from continuous sets of kV projections of a prostate, implanted with three gold fiducial markers (2 patients with 80 fractions in total), acquired during the treatment. Corresponding motion was obtained from kV/MV triangulation using a closed form least squares method based on three markers’ positions. Only the frames where all three markers were present were used in the analysis. The mean differences between the corresponding motion estimates were calculated for each DoF. Results: Experimental results showed that the mean absolute differences in six DoF phantom motion measured by Calypso and KIM were within 1.1° and 0.7 mm. The six DoF prostate tumor motion derived from kV/MV triangulation agreed better with the KIM-estimated motion, with mean (s.d.) differences of up to 0.2° (1.36°) and 0.2 (0.25) mm for rotation and translation, respectively. Conclusion: These results suggest that KIM can provide accurate six DoF intrafraction tumor motion estimates during radiotherapy.
Yin, Xianyong; Wineinger, Nathan E.; Wang, Kai; Yue, Weihua; Norgren, Nina; Wang, Ling; Yao, Weiyi; Jiang, Xiaoyun; Wu, Bo; Cui, Yong; Shen, Changbing; Cheng, Hui; Zhou, Fusheng; Chen, Gang; Zuo, Xianbo; Zheng, Xiaodong; Fan, Xing; Wang, Hongyan; Wang, Lifang; Lee, Jimmy; Lam, Max; Tai, E. Shyong; Zhang, Zheng; Huang, Qiong; Sun, Liangdan; Xu, Jinhua; Yang, Sen; Wilhelmsen, Kirk C.; Liu, Jianjun; Schork, Nicholas J.; Zhang, Xuejun
2016-01-01
Background Previous studies have shown that individuals with schizophrenia have a greater risk for psoriasis than a typical person. This suggests that there might be a shared genetic etiology between the 2 conditions. We aimed to characterize the potential shared genetic susceptibility between schizophrenia and psoriasis using genome-wide marker genotype data. Methods We obtained genetic data on individuals with psoriasis, schizophrenia and control individuals. We applied a marker-based coheritability estimation procedure, polygenic score analysis, a gene set enrichment test and a least absolute shrinkage and selection operator regression model to estimate the potential shared genetic etiology between the 2 diseases. We validated the results in independent schizophrenia and psoriasis cohorts from Singapore. Results We included 1139 individuals with psoriasis, 744 with schizophrenia and 1678 controls in our analysis, and we validated the results in independent cohorts, including 441 individuals with psoriasis (and 2420 controls) and 1630 with schizophrenia (and 1860 controls). We estimated that a large fraction of schizophrenia and psoriasis risk could be attributed to common variants (h2SNP = 29% ± 5.0%, p = 2.00 × 10−8), with a coheritability estimate between the traits of 21%. We identified 5 variants within the human leukocyte antigen (HLA) gene region, which were most likely to be associated with both diseases and collectively conferred a significant risk effect (odds ratio of highest risk quartile = 6.03, p < 2.00 × 10−16). We discovered that variants contributing most to the shared heritable component between psoriasis and schizophrenia were enriched in antigen processing and cell endoplasmic reticulum. Limitations Our sample size was relatively small. The findings of 5 HLA gene variants were complicated by the complex structure in the HLA region. Conclusion We found evidence for a shared genetic etiology between schizophrenia and psoriasis. 
The mechanism for this shared genetic basis likely involves immune and calcium signalling pathways. PMID:27091718
Joint modelling of longitudinal CEA tumour marker progression and survival data on breast cancer
NASA Astrophysics Data System (ADS)
Borges, Ana; Sousa, Inês; Castro, Luis
2017-06-01
This work proposes the use of biostatistical methods to study breast cancer in patients of Braga's Hospital Senology Unit, located in Portugal. The primary motivation is to contribute to the understanding of the progression of breast cancer within the Portuguese population, using more complex statistical model assumptions than the traditional analyses, namely assumptions that take into account the possible existence of a serial correlation structure within the observations of the same subject. We aim to infer which risk factors affect the survival of Braga's Hospital patients diagnosed with breast tumour, whilst also analysing risk factors that affect a tumour marker used in the surveillance of disease progression, the carcinoembryonic antigen (CEA). As survival and longitudinal processes may be associated, it is important to model these two processes together. Hence, a joint modelling of these two processes was conducted to infer the association between them. A data set of 540 patients, along with 50 variables, was collected from medical records of the Hospital. A joint model approach was used to analyse these data. Two different joint models were applied to the same data set, with different parameterizations which give different interpretations to model parameters. These were chosen by convenience as the ones implemented in R software. Results from the two models were compared. Results from the joint models showed that the longitudinal CEA values were significantly associated with the survival probability of these patients. A comparison between parameter estimates obtained in this analysis and previous independent survival[4] and longitudinal analyses[5][6] leads us to conclude that independent analysis produces biased parameter estimates. Hence, an assumption of association between the two processes in a joint model of breast cancer data is necessary. Results indicate that the longitudinal progression of CEA is significantly associated with the probability of survival of these patients.
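For orientation, a joint model of this kind is commonly written as a linear mixed sub-model for the marker linked to a proportional hazards sub-model for survival. The form below is the standard shared-parameter formulation, given as an assumption: the abstract does not state which parameterizations were fitted, and all symbols are introduced here for illustration.

```latex
\begin{align}
y_i(t_{ij}) &= m_i(t_{ij}) + \varepsilon_{ij}, \qquad \varepsilon_{ij} \sim N(0, \sigma^2), \\
m_i(t) &= x_i(t)^\top \beta + z_i(t)^\top b_i, \qquad b_i \sim N(0, D), \\
h_i(t) &= h_0(t) \exp\{ w_i^\top \gamma + \alpha \, m_i(t) \}.
\end{align}
```

Here $y_i(t_{ij})$ is the observed CEA value of patient $i$ at visit $j$, $m_i(t)$ the underlying marker trajectory with fixed effects $\beta$ and subject-specific random effects $b_i$, $h_0(t)$ the baseline hazard, $w_i$ baseline risk factors, and $\alpha$ the association parameter. A non-zero $\alpha$ is what "association between the two processes" means in this notation; an alternative parameterization would link the hazard to $b_i$ directly instead of to $m_i(t)$.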
A novel fully automatic scheme for fiducial marker-based alignment in electron tomography.
Han, Renmin; Wang, Liansan; Liu, Zhiyong; Sun, Fei; Zhang, Fa
2015-12-01
Although the topic of fiducial marker-based alignment in electron tomography (ET) has been widely discussed for decades, alignment without human intervention remains a difficult problem. Specifically, the emergence of subtomogram averaging has increased the demand for batch processing during tomographic reconstruction; fully automatic fiducial marker-based alignment is the main technique in this process. However, the lack of an accurate method for detecting and tracking fiducial markers precludes fully automatic alignment. In this paper, we present a novel, fully automatic alignment scheme for ET. Our scheme has two main contributions: First, we present a series of algorithms to ensure a high recognition rate and precise localization during the detection of fiducial markers. Our proposed solution reduces fiducial marker detection to a sampling and classification problem and further introduces an algorithm to solve the parameter dependence of marker diameter and marker number. Second, we propose a novel algorithm to solve the tracking of fiducial markers by reducing the tracking problem to an incomplete point set registration problem. Because a global optimization of a point set registration occurs, the result of our tracking is independent of the initial image position in the tilt series, allowing for the robust tracking of fiducial markers without pre-alignment. The experimental results indicate that our method can achieve accurate tracking, almost identical to that of the current best semi-automatic scheme in IMOD. Furthermore, our scheme is fully automatic, depends on fewer parameters (it only requires a rough value of the marker diameter) and does not require any manual interaction, providing the possibility of automatic batch processing of electron tomographic reconstruction. Copyright © 2015 Elsevier Inc. All rights reserved.
Liver Full Reference Set Application: Timothy Block - Drexel Univ (2010) — EDRN Public Portal
The goal of this application is to determine if the levels of serum GP73 and fucosylated kininogen/acute phase proteins can be used to detect hepatocellular carcinoma (HCC) in the background of liver cirrhosis. The use of the validation set would allow us to directly compare GP73 and fucosylated markers against AFP, AFP-L3 and DCP as well as test them in combination with these markers
Liver Rapid Reference Set Application: Timothy Block - Drexel Univ (2008) — EDRN Public Portal
The goal of this application is to determine if the levels of serum GP73 and fucosylated kininogen/acute phase proteins can be used to detect hepatocellular carcinoma (HCC) in the background of liver cirrhosis. The use of the validation set would allow us to directly compare GP73 and fucosylated markers against AFP, AFP-L3 and DCP as well as test them in combination with these markers
CoCoa: a software tool for estimating the coefficient of coancestry from multilocus genotype data.
Maenhout, Steven; De Baets, Bernard; Haesaert, Geert
2009-10-15
Phenotypic data collected in breeding programs and marker-trait association studies are often analyzed by means of linear mixed models. In these models, the covariance between the genetic background effects of all genotypes under study is modeled by means of pairwise coefficients of coancestry. Several marker-based coancestry estimation procedures allow estimation of this covariance matrix, but generally introduce a certain amount of bias when the examined genotypes are part of a breeding program. CoCoa implements the most commonly used marker-based coancestry estimation procedures and, as such, allows selection of the best fitting covariance structure for the phenotypic data at hand. This better model fit translates into increased power and improved type I error control in association studies and improved accuracy in phenotypic prediction studies. The presented software package also provides an implementation of the new Weighted Alikeness in State (WAIS) estimator for use in hybrid breeding programs. Besides several matrix manipulation tools, CoCoa implements two different bending heuristics, in case the inverse of an ill-conditioned coancestry matrix estimate is needed. The software package CoCoa is freely available at http://webs.hogent.be/cocoa. Source code, manual, binaries for 32 and 64-bit Linux systems and an installer for Microsoft Windows are provided. The core components of CoCoa are written in C++, while the graphical user interface is written in Java.
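The simplest marker-based coancestry estimate is plain (unweighted) alikeness in state: the share of marker loci at which two lines carry the same allele. The sketch below illustrates that idea only. It assumes fully inbred lines scored with one allele per locus, it is not the WAIS estimator, and it does not use CoCoa's interface; all names are hypothetical.

```python
def alikeness_in_state(line_a, line_b):
    """Fraction of jointly scored loci at which two inbred lines share an allele.

    Each line is a sequence with one allele per locus; None marks a missing call.
    """
    shared = 0
    compared = 0
    for a, b in zip(line_a, line_b):
        if a is None or b is None:
            continue
        compared += 1
        shared += (a == b)
    if compared == 0:
        raise ValueError("no locus scored in both lines")
    return shared / compared

def coancestry_matrix(lines):
    """Pairwise alikeness-in-state estimates for a dict of named lines."""
    names = list(lines)
    return {x: {y: alikeness_in_state(lines[x], lines[y]) for y in names}
            for x in names}
```

A matrix built this way is symmetric with ones on the diagonal, but it is not guaranteed to be well-conditioned, which is the situation in which bending heuristics like those mentioned above become relevant.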
N'Diaye, Amidou; Haile, Jemanesh K; Fowler, D Brian; Ammar, Karim; Pozniak, Curtis J
2017-01-01
Advances in sequencing and genotyping methods have enabled cost-effective production of high throughput single nucleotide polymorphism (SNP) markers, making them the choice for linkage mapping. As a result, many laboratories have developed high-throughput SNP assays and built high-density genetic maps. However, the number of markers may, by orders of magnitude, exceed the resolution of recombination for a given population size so that only a minority of markers can accurately be ordered. Another issue attached to the so-called 'large p, small n' problem is that high-density genetic maps inevitably result in many markers clustering at the same position (co-segregating markers). While there are a number of related papers, none have addressed the impact of co-segregating markers on genetic maps. In the present study, we investigated the effects of co-segregating markers on high-density genetic map length and marker order using empirical data from two populations of wheat, Mohawk × Cocorit (durum wheat) and Norstar × Cappelle Desprez (bread wheat). The maps of both populations consisted of 85% co-segregating markers. Our study clearly showed that an excess of co-segregating markers can lead to map expansion, but has little effect on marker order. To estimate the inflation factor (IF), we generated a total of 24,473 linkage maps (8,203 maps for Mohawk × Cocorit and 16,270 maps for Norstar × Cappelle Desprez). Using seven machine learning algorithms, we were able to predict with an accuracy of 0.7 the map expansion due to the proportion of co-segregating markers. For example in Mohawk × Cocorit, with 10 and 80% co-segregating markers the length of the map inflated by 4.5 and 16.6%, respectively. Similarly, the map of Norstar × Cappelle Desprez expanded by 3.8 and 11.7% with 10 and 80% co-segregating markers. 
With the increasing number of markers on SNP-chips, the proportion of co-segregating markers in high-density maps will continue to increase making map expansion unavoidable. Therefore, we suggest developers improve linkage mapping algorithms for efficient analysis of high-throughput data. This study outlines a practical strategy to estimate the IF due to the proportion of co-segregating markers and outlines a method to scale the length of the map accordingly.
N’Diaye, Amidou; Haile, Jemanesh K.; Fowler, D. Brian; Ammar, Karim; Pozniak, Curtis J.
2017-01-01
Advances in sequencing and genotyping methods have enabled cost-effective production of high-throughput single nucleotide polymorphism (SNP) markers, making them the choice for linkage mapping. As a result, many laboratories have developed high-throughput SNP assays and built high-density genetic maps. However, the number of markers may, by orders of magnitude, exceed the resolution of recombination for a given population size so that only a minority of markers can accurately be ordered. Another issue attached to the so-called ‘large p, small n’ problem is that high-density genetic maps inevitably result in many markers clustering at the same position (co-segregating markers). While there are a number of related papers, none have addressed the impact of co-segregating markers on genetic maps. In the present study, we investigated the effects of co-segregating markers on high-density genetic map length and marker order using empirical data from two populations of wheat, Mohawk × Cocorit (durum wheat) and Norstar × Cappelle Desprez (bread wheat). The maps of both populations consisted of 85% co-segregating markers. Our study clearly showed that an excess of co-segregating markers can lead to map expansion but has little effect on marker order. To estimate the inflation factor (IF), we generated a total of 24,473 linkage maps (8,203 maps for Mohawk × Cocorit and 16,270 maps for Norstar × Cappelle Desprez). Using seven machine learning algorithms, we were able to predict with an accuracy of 0.7 the map expansion due to the proportion of co-segregating markers. For example, in Mohawk × Cocorit, with 10 and 80% co-segregating markers the length of the map inflated by 4.5 and 16.6%, respectively. Similarly, the map of Norstar × Cappelle Desprez expanded by 3.8 and 11.7% with 10 and 80% co-segregating markers.
With the increasing number of markers on SNP-chips, the proportion of co-segregating markers in high-density maps will continue to increase making map expansion unavoidable. Therefore, we suggest developers improve linkage mapping algorithms for efficient analysis of high-throughput data. This study outlines a practical strategy to estimate the IF due to the proportion of co-segregating markers and outlines a method to scale the length of the map accordingly. PMID:28878789
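As a rough illustration of the map-scaling idea in the abstract above, the sketch below deflates an observed map length by a given inflation percentage. The function name, the example map length, and the simple division rule are assumptions made for illustration; the abstract does not state the exact scaling formula the authors used.

```python
def deflate_map_length(observed_cm: float, inflation_pct: float) -> float:
    """Return a map length corrected for an assumed percentage inflation.

    Assumes observed = true * (1 + inflation_pct / 100). This rule is an
    illustrative assumption, not the formula from the study.
    """
    if inflation_pct <= -100:
        raise ValueError("inflation_pct must be greater than -100")
    return observed_cm / (1.0 + inflation_pct / 100.0)


# Hypothetical example: a 1,166 cM map believed to be inflated by 16.6%,
# the expansion reported for Mohawk x Cocorit at 80% co-segregating markers.
corrected = deflate_map_length(1166.0, 16.6)
print(round(corrected, 1))
```

In practice the inflation percentage would come from a model fitted to the proportion of co-segregating markers, which this sketch does not attempt to reproduce.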
2011-01-01
Background Pigeonpea [Cajanus cajan (L.) Millsp.] is an important legume crop of rainfed agriculture. Despite concerted research efforts directed to pigeonpea improvement, the stagnant productivity of pigeonpea during the last several decades may be attributed to the prevalence of various biotic and abiotic constraints, and the situation is exacerbated by the lack of adequate genomic resources to undertake any molecular breeding programme for accelerated crop improvement. With the objective of enhancing genomic resources for pigeonpea, this study reports, for the first time, large-scale development of SSR markers from BAC-end sequences and their subsequent use for genetic mapping and hybridity testing in pigeonpea. Results A set of 88,860 BAC (bacterial artificial chromosome)-end sequences (BESs) were generated after constructing two BAC libraries by using HindIII (34,560 clones) and BamHI (34,560 clones) restriction enzymes. Clustering based on sequence identity of BESs yielded a set of >52K non-redundant sequences, comprising 35 Mbp or >4% of the pigeonpea genome. These sequences were analyzed to develop annotation lists and subdivide the BESs into genome fractions (e.g., genes, retroelements, transposons and non-annotated sequences). Parallel analysis of BESs for microsatellites or simple sequence repeats (SSRs) identified 18,149 SSRs, from which a set of 6,212 SSRs were selected for further analysis. A total of 3,072 novel SSR primer pairs were synthesized and tested for length polymorphism on a set of 22 parental genotypes of 13 mapping populations segregating for traits of interest. In total, we identified 842 polymorphic SSR markers that will have utility in pigeonpea improvement. Based on these markers, the first SSR-based genetic map comprising 239 loci was developed for this previously uncharacterized genome.
Utility of developed SSR markers was also demonstrated by identifying a set of 42 markers each for two hybrids (ICPH 2671 and ICPH 2438) for genetic purity assessment in commercial hybrid breeding programme. Conclusion In summary, while BAC libraries and BESs should be useful for genomics studies, BES-SSR markers, and the genetic map should be very useful for linking the genetic map with a future physical map as well as for molecular breeding in pigeonpea. PMID:21447154
Dantan, Etienne; Foucher, Yohann; Lorent, Marine; Giral, Magali; Tessier, Philippe
2018-06-01
Defining thresholds of prognostic markers is essential for stratified medicine. Such thresholds are mostly estimated from purely statistical measures regardless of patient preferences, potentially leading to unacceptable medical decisions. Quality-Adjusted Life-Years are a widely used preference-based measure of health outcomes. We develop a time-dependent Quality-Adjusted Life-Years-based expected utility function for censored data that should be maximized to estimate an optimal threshold. We performed a simulation study to compare estimated thresholds when using the proposed expected utility approach and purely statistical estimators. Two applications illustrate the usefulness of the proposed methodology, which was implemented in the R package ROCt (www.divat.fr). First, by reanalysing data from a randomized clinical trial comparing the efficacy of prednisone vs. placebo in patients with chronic liver cirrhosis, we demonstrate the utility of treating patients with a prothrombin level higher than 89%. Second, we reanalyze the data of an observational cohort of kidney transplant recipients and conclude that the Kidney Transplant Failure Score is not useful for adapting the frequency of clinical visits. Applying such a patient-centered methodology may improve future transfer of novel prognostic scoring systems or markers into clinical practice.
Byun, Yeun-Sub; Jeong, Rag-Gyo; Kang, Seok-Won
2015-11-13
The real-time recognition of absolute (or relative) position and orientation on a network of roads is a core technology for fully automated or driving-assisted vehicles. This paper presents an empirical investigation of the design, implementation, and evaluation of a self-positioning system based on a magnetic marker reference sensing method for an autonomous vehicle. Specifically, the estimation accuracy of the magnetic sensing ruler (MSR) in the up-to-date estimation of the actual position was successfully enhanced by compensating for time delays in signal processing when detecting the vertical magnetic field (VMF) in an array of signals. In this study, the signal processing scheme was developed to minimize the effects of the distortion of measured signals when estimating the relative positional information based on magnetic signals obtained using the MSR. In other words, the center point in a 2D magnetic field contour plot corresponding to the actual position of magnetic markers was estimated by tracking the errors between pre-defined reference models and measured magnetic signals. The algorithm proposed in this study was validated by experimental measurements using a test vehicle on a pilot network of roads. From the results, the positioning error was found to be less than 0.04 m on average in an operational test. PMID:26580622
Estimation of genetic diversity using SSR markers in sunflower
USDA-ARS's Scientific Manuscript database
Sunflower is a major oilseed crop in central Asia, but little is known of the molecular diversity among collections of sunflower from Pakistan region. This paper described inherent genetic relationships among sunflower collections using Simple Sequence Repeat molecular markers. Results should help...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-04-08
... Customer Satisfaction Surveys (Headstone/Marker) Activity: Comment Request AGENCY: National Cemetery... estimates relating to customer satisfaction surveys involving the National Cemetery Administration (NCA.... Title: Generic Clearance for NCA, and IG Customer Satisfaction Surveys. OMB Control Number: 2900-0571...
ERIC Educational Resources Information Center
Barbaro, Josephine; Dissanayake, Cheryl
2013-01-01
The Social Attention and Communication Study involved the successful implementation of developmental surveillance of the early markers of autism spectrum disorders in a community-based setting. The objective in the current study was to determine the most discriminating and predictive markers of autism spectrum disorders used in the Social…
SPORE/EDRN/PRE-PLCO Ovarian Phase II Validation Study — EDRN Public Portal
A new set of phase II specimens will be created and used to evaluate markers identified in preliminary work. The set comprises 160 cases with pre-operative bloods representing major histologic types (including 80 early-staged and 80 late-staged cases), 160 controls with benign disease, 480 general population controls, and a small set of serial samples from 40 healthy controls, collected either at least 3 months but not more than 6 months apart, or at least 10 months but not more than 14 months apart. The top 5-10 markers, plus an expanded panel of Luminex markers, will comprise a “working consensus panel” for subsequent analysis in PLCO specimens.
A 48 SNP set for grapevine cultivar identification
2011-01-01
Background Rapid and consistent genotyping is an important requirement for cultivar identification in many crop species. Among them, grapevine cultivars have been the subject of multiple studies given the large number of synonyms and homonyms generated during many centuries of vegetative multiplication and exchange. Simple sequence repeat (SSR) markers have been preferred until now because of their high level of polymorphism, their codominant nature and their high profile repeatability. However, the rapid application of partial or complete genome sequencing approaches is identifying thousands of single nucleotide polymorphisms (SNP) that can be very useful for such purposes. Although SNP markers are bi-allelic, and therefore not as polymorphic as microsatellites, the high number of loci that can be multiplexed and the possibilities of automation as well as their highly repeatable results under any analytical procedure make them the future markers of choice for any type of genetic identification. Results We analyzed over 300 SNP in the genome of grapevine using a re-sequencing strategy in a selection of 11 genotypes. Among the identified polymorphisms, we selected 48 SNP spread across all grapevine chromosomes with allele frequencies balanced enough to provide sufficient information content for genetic identification in grapevine while allowing a good genotyping success rate. Marker stability was tested in repeated analyses of a selected group of cultivars obtained worldwide to demonstrate their usefulness in genetic identification. Conclusions We have selected a set of 48 stable SNP markers with a high discrimination power and a uniform genome distribution (2-3 markers/chromosome), which is proposed as a standard set for grapevine (Vitis vinifera L.) genotyping. Any previous problems derived from microsatellite allele confusion between labs or the need to run reference cultivars to identify allele sizes disappear using this type of marker.
Furthermore, because SNP markers are bi-allelic, allele identification and genotype naming are extremely simple, and genotypes obtained with different equipment and by different laboratories are always fully comparable. PMID:22060012
Altomare, Cristina; Guglielmann, Raffaella; Riboldi, Marco; Bellazzi, Riccardo; Baroni, Guido
2015-02-01
In high precision photon radiotherapy and in hadrontherapy, it is crucial to minimize the occurrence of geometrical deviations with respect to the treatment plan in each treatment session. To this end, point-based infrared (IR) optical tracking for patient set-up quality assessment is performed. Such tracking depends on external fiducial points placement. The main purpose of our work is to propose a new algorithm based on simulated annealing and augmented Lagrangian pattern search (SAPS), which is able to take into account prior knowledge, such as spatial constraints, during the optimization process. The SAPS algorithm was tested on data related to head and neck and pelvic cancer patients, and that were fitted with external surface markers for IR optical tracking applied for patient set-up preliminary correction. The integrated algorithm was tested considering optimality measures obtained with Computed Tomography (CT) images (i.e. the ratio between the so-called target registration error and fiducial registration error, TRE/FRE) and assessing the marker spatial distribution. Comparison has been performed with randomly selected marker configuration and with the GETS algorithm (Genetic Evolutionary Taboo Search), also taking into account the presence of organs at risk. The results obtained with SAPS highlight improvements with respect to the other approaches: (i) TRE/FRE ratio decreases; (ii) marker distribution satisfies both marker visibility and spatial constraints. We have also investigated how the TRE/FRE ratio is influenced by the number of markers, obtaining significant TRE/FRE reduction with respect to the random configurations, when a high number of markers is used. The SAPS algorithm is a valuable strategy for fiducial configuration optimization in IR optical tracking applied for patient set-up error detection and correction in radiation therapy, showing that taking into account prior knowledge is valuable in this optimization process. 
Further work will be focused on the computational optimization of the SAPS algorithm toward fast point-of-care applications. Copyright © 2014 Elsevier Inc. All rights reserved.
Panasevich, Sviatlana; Leander, Karin; Ljungman, Petter; Bellander, Tom; de Faire, Ulf; Pershagen, Göran; Nyberg, Fredrik
2013-01-01
Objectives Air pollution exposure induces cardiovascular effects, possibly via systemic inflammation and coagulation misbalance. Genetic variation may determine individual susceptibility. Our aim was to investigate effect modification by inflammation (Interleukin6 (IL6), tumour necrosis factor-α (TNF-α)) and coagulation (fibrinogen Bβ, plasminogen activator inhibitor-1 (PAI-1)) gene variants on the effect of long-term or short-term air pollution exposure on both blood marker levels and non-fatal myocardial infarction (MI) risk. Design Population-based case–control study with a nested case-crossover study. Gene-environment interactions for short-term and long-term air pollution on blood marker levels were studied in population controls, for long-term exposure on MI risk using case–control design, and for short-term exposure on MI onset using case-crossover design. Setting The Stockholm Heart Epidemiology Programme (SHEEP) conducted in 1992–1994 in Stockholm, Sweden. Spatial modelling was used to assess long-term (up to 30 years retrospectively) air pollution exposure to traffic-NO2 and heating-SO2 emissions at home addresses. Urban background NO2, SO2, PM10 and O3 measurements were used to estimate short-term (up to 5 days) air pollution exposure. Participants 1192 MI cases and 1506 population controls aged 45–70 years. Outcomes The levels of blood markers of inflammation (IL-6, TNF-α) and coagulation (fibrinogen, PAI-1) and MI risk. Results We observed gene–environment interaction for several IL6 and TNF SNPs in relation to inflammation blood marker levels. One-year traffic-NO2 exposure was associated with higher IL-6 levels with each additional IL6-174C allele, and 1-year heating-SO2 exposure with higher levels of TNF-α in TNF-308AA homozygotes versus −308G carriers. Short-term air pollution exposure also interacted with IL6 and TNF in relation to marker levels. The risk of MI followed the effect on blood markers in each genotype group. 
Conclusions Genetic variants in IL6 and TNF may modify effects of long-term and short-term air pollution exposure on inflammatory marker levels and MI risk. PMID:24056475
Oral sampling methods are associated with differences in immune marker concentrations.
Fakhry, Carole; Qeadan, Fares; Gilman, Robert H; Yori, Pablo; Kosek, Margaret; Patterson, Nicole; Eisele, David W; Gourin, Christine G; Chitguppi, Chandala; Marks, Morgan; Gravitt, Patti
2018-06-01
To determine whether the concentration and distribution of immune markers in paired oral samples were similar. Clinical research. Cross-sectional study. Paired saliva and oral secretions (OS) samples were collected. The concentration of immune markers was estimated using Luminex multiplex assay (Thermo Fisher Scientific, Waltham, MA). For each sample, the concentration of respective immune markers was normalized to total protein present and log-transformed. Median concentrations of immune markers were compared between both types of samples. Intermarker correlation in each sampling method and across sampling methods was evaluated. There were 90 study participants. Concentrations of immune markers in saliva samples were significantly different from concentrations in OS samples. Oral secretions samples showed higher concentrations of immunoregulatory markers, whereas the saliva samples contained proinflammatory markers in higher concentration. The immune marker profile in saliva samples is distinct from the immune marker profile in paired OS samples. 2b. Laryngoscope, 128:E214-E221, 2018. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.
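The normalization step described in the abstract above (marker concentration divided by total protein, then log-transformed, with medians compared between sample types) can be sketched as follows. This is a minimal illustration: the function name, the choice of the natural logarithm, and all numeric values are assumptions, since the abstract does not specify the log base or report raw data.

```python
import math
from statistics import median


def normalized_log_concentration(marker_conc: float, total_protein: float) -> float:
    """Normalize a marker concentration to total protein and log-transform it.

    The natural log is an assumption; the abstract does not state the base.
    """
    if marker_conc <= 0 or total_protein <= 0:
        raise ValueError("concentrations must be positive")
    return math.log(marker_conc / total_protein)


# Hypothetical paired measurements as (marker, total protein) tuples.
saliva = [(12.0, 3.0), (8.0, 4.0), (20.0, 5.0)]
oral_secretions = [(30.0, 3.0), (18.0, 2.0), (45.0, 5.0)]

saliva_median = median(normalized_log_concentration(m, p) for m, p in saliva)
os_median = median(normalized_log_concentration(m, p) for m, p in oral_secretions)
print(saliva_median, os_median)
```

A paired test on the per-participant values would be the natural next step, but which test the study used is not stated in the abstract.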
2010-01-01
Background Genetic markers and linkage mapping are basic prerequisites for marker-assisted selection and map-based cloning. In the case of the key grassland species Lolium spp., numerous mapping populations have been developed and characterised for various traits. Although some genetic linkage maps of these populations have been aligned with each other using publicly available DNA markers, the number of common markers among genetic maps is still low, limiting the ability to compare candidate gene and QTL locations across germplasm. Results A set of 204 expressed sequence tag (EST)-derived simple sequence repeat (SSR) markers has been assigned to map positions using eight different ryegrass mapping populations. Marker properties of a subset of 64 EST-SSRs were assessed in six to eight individuals of each mapping population and revealed 83% of the markers to be polymorphic in at least one population and an average number of alleles of 4.88. EST-SSR markers polymorphic in multiple populations served as anchor markers and allowed the construction of the first comprehensive consensus map for ryegrass. The integrated map was complemented with 97 SSRs from previously published linkage maps and finally contained 284 EST-derived and genomic SSR markers. The total map length was 742 centiMorgan (cM), ranging for individual chromosomes from 70 cM of linkage group (LG) 6 to 171 cM of LG 2. Conclusions The consensus linkage map for ryegrass based on eight mapping populations and constructed using a large set of publicly available Lolium EST-SSRs mapped for the first time together with previously mapped SSR markers will allow for consolidating existing mapping and QTL information in ryegrass. Map and markers presented here will prove to be an asset in the development for both molecular breeding of ryegrass as well as comparative genetics and genomics within grass species. PMID:20712870
Kling, Daniel; Egeland, Thore; Mostad, Petter
2012-01-01
In a number of applications there is a need to determine the most likely pedigree for a group of persons based on genetic markers. Adequate models are needed to reach this goal. The markers used to perform the statistical calculations can be linked and there may also be linkage disequilibrium (LD) in the population. The purpose of this paper is to present a graphical Bayesian Network framework to deal with such data. Potential LD is normally ignored and it is important to verify that the resulting calculations are not biased. Even if linkage does not influence results for regular paternity cases, it may have substantial impact on likelihood ratios involving other, more extended pedigrees. Models for LD influence likelihoods for all pedigrees to some degree and an initial estimate of the impact of ignoring LD and/or linkage is desirable, going beyond mere rules of thumb based on marker distance. Furthermore, we show how one can readily include a mutation model in the Bayesian Network; extending other programs or formulas to include such models may require considerable amounts of work and will in many cases not be practical. As an example, we consider the two STR markers vWa and D12S391. We estimate probabilities for population haplotypes to account for LD using a method based on data from trios, while an estimate for the degree of linkage is taken from the literature. The results show that accounting for haplotype frequencies is unnecessary in most cases for this specific pair of markers. When doing calculations on regular paternity cases, the markers can be considered statistically independent. In more complex cases of disputed relatedness, for instance cases involving siblings or so-called deficient cases, or when small differences in the LR matter, independence should not be assumed. (The networks are freely available at http://arken.umb.no/~dakl/BayesianNetworks.) PMID:22984448
Kling, Daniel; Tillmar, Andreas; Egeland, Thore; Mostad, Petter
2015-09-01
Several applications necessitate an unbiased determination of relatedness, be it in linkage or association studies or in a forensic setting. An appropriate model to compute the joint probability of some genetic data for a set of persons given some hypothesis about the pedigree structure is then required. The increasing number of markers available through high-density SNP microarray typing and NGS technologies intensifies the demand, where using a large number of markers may lead to biased results due to strong dependencies between closely located loci, both within pedigrees (linkage) and in the population (allelic association or linkage disequilibrium (LD)). We present a new general model, based on a Markov chain for inheritance patterns and another Markov chain for founder allele patterns, the latter allowing us to account for LD. We also demonstrate a specific implementation for X chromosomal markers that allows for computation of likelihoods based on hypotheses of alleged relationships and genetic marker data. The algorithm can simultaneously account for linkage, LD, and mutations. We demonstrate its feasibility using simulated examples. The algorithm is implemented in the software FamLinkX, providing a user-friendly GUI for Windows systems (FamLinkX, as well as further usage instructions, is freely available at www.famlink.se ). Our software provides the necessary means to solve cases where no previous implementation exists. In addition, the software has the possibility to perform simulations in order to further study the impact of linkage and LD on computed likelihoods for an arbitrary set of markers.
Integrative Analysis of Cancer Diagnosis Studies with Composite Penalization
Liu, Jin; Huang, Jian; Ma, Shuangge
2013-01-01
Summary In cancer diagnosis studies, high-throughput gene profiling has been extensively conducted, searching for genes whose expressions may serve as markers. Data generated from such studies have the “large d, small n” feature, with the number of genes profiled much larger than the sample size. Penalization has been extensively adopted for simultaneous estimation and marker selection. Because of small sample sizes, markers identified from the analysis of single datasets can be unsatisfactory. A cost-effective remedy is to conduct integrative analysis of multiple heterogeneous datasets. In this article, we investigate composite penalization methods for estimation and marker selection in integrative analysis. The proposed methods use the minimax concave penalty (MCP) as the outer penalty. Under the homogeneity model, the ridge penalty is adopted as the inner penalty. Under the heterogeneity model, the Lasso penalty and MCP are adopted as the inner penalty. Effective computational algorithms based on coordinate descent are developed. Numerical studies, including simulation and analysis of practical cancer datasets, show satisfactory performance of the proposed methods. PMID:24578589
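For readers unfamiliar with the minimax concave penalty named in the abstract above, the sketch below evaluates the standard MCP for a scalar argument. The `group_mcp` helper, which applies the MCP to the Euclidean norm of one gene's coefficients across datasets, is only an assumed illustration of an outer-MCP, inner-ridge composite; the exact form and scaling used in the article are not given in the abstract.

```python
import math


def mcp_penalty(t: float, lam: float, gamma: float) -> float:
    """Standard MCP: lam*|t| - t^2/(2*gamma) if |t| <= gamma*lam, else gamma*lam^2/2."""
    if lam < 0 or gamma <= 0:
        raise ValueError("lam must be non-negative and gamma positive")
    a = abs(t)
    if a <= gamma * lam:
        return lam * a - a * a / (2.0 * gamma)
    return 0.5 * gamma * lam * lam


def group_mcp(coefs, lam: float, gamma: float) -> float:
    """Hypothetical composite: outer MCP applied to the L2 norm of a coefficient group."""
    norm = math.sqrt(sum(c * c for c in coefs))
    return mcp_penalty(norm, lam, gamma)


# One gene's coefficients in three datasets (made-up values).
print(group_mcp([0.3, -0.4, 0.0], lam=1.0, gamma=3.0))
```

Unlike the Lasso, the MCP levels off once the argument exceeds gamma * lam, so large effects are not shrunk further, which is the usual motivation for using it in marker selection.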
De Vos, Stephanie; Bossier, Peter; Van Stappen, Gilbert; Vercauteren, Ilse; Sorgeloos, Patrick; Vuylsteke, Marnik
2013-01-01
We report on the construction of sex-specific linkage maps, the identification of sex-linked markers and the genome size estimation for the brine shrimp Artemia franciscana. Overall, from the analysis of 433 AFLP markers segregating in a 112 full-sib family we identified 21 male and 22 female linkage groups (2n = 42), covering 1,041 and 1,313 cM respectively. Fifteen putatively homologous linkage groups, including the sex linkage groups, were identified between the female and male linkage map. Eight sex-linked AFLP marker alleles were inherited from the female parent, supporting the hypothesis of a WZ–ZZ sex-determining system. The haploid Artemia genome size was estimated to 0.93 Gb by flow cytometry. The produced Artemia linkage maps provide the basis for further fine mapping and exploring of the sex-determining region and are a possible marker resource for mapping genomic loci underlying phenotypic differences among Artemia species. PMID:23469207
King, Timothy L.; Johnson, Robin L.
2011-01-01
We document the isolation and characterization of 19 tetra-nucleotide microsatellite DNA markers in northern snakehead (Channa argus) fish that recently colonized Meadow Lake, New York City, New York. These markers displayed moderate levels of allelic diversity (averaging 6.8 alleles/locus) and heterozygosity (averaging 74.2%). Demographic analyses suggested that the Meadow Lake collection has not achieved mutation-drift equilibrium. These results were consistent with instances of deviations from Hardy–Weinberg equilibrium and the presence of some linkage disequilibrium. A comparison of individual pair-wise distances suggested the presence of multiple differentiated groups of related individuals. Results of all analyses are consistent with a pattern of multiple, recent introductions. The microsatellite markers developed for C. argus yielded sufficient genetic diversity to potentially: (1) delineate kinship; (2) elucidate fine-scale population structure; (3) define management (eradication) units; (4) estimate dispersal rates; (5) estimate population sizes; and (6) provide unique demographic perspectives of control or eradication effectiveness.
Kumar, L S; Sawant, A S; Gupta, V S; Ranjekar, P K
2001-10-01
Genetic variation between 28 Indian populations of the rice pest, Scirpophaga incertulas, was evaluated using inter-simple sequence repeats (ISSR)-PCR assay. Nine SSR primers gave rise to 79 amplification products, of which 67 were polymorphic. A dendrogram constructed from these data indicates that there is no geographical bias to the clustering and that gene flow between populations appears to be relatively unrestricted, substantiating our earlier conclusion based on the RAPD (random amplified polymorphic DNA) data. The dendrograms obtained using each of these marker systems were poorly correlated with each other as determined by Mantel's test for matrix correlation. Estimates of expected heterozygosity and marker index for each of these marker systems suggest that both marker systems are equally efficient in determining polymorphisms. Matrix correlation analyses suggest that reliable estimates of genetic variation among the S. incertulas pest populations can be obtained by using RAPDs alone or in combination with ISSRs, but ISSRs alone cannot be used for this purpose.
3D kinematic measurement of human movement using low cost fish-eye cameras
NASA Astrophysics Data System (ADS)
Islam, Atiqul; Asikuzzaman, Md.; Garratt, Matthew A.; Pickering, Mark R.
2017-02-01
3D motion capture is difficult when the capturing is performed in an outdoor environment without controlled surroundings. In this paper, we propose a new approach of using two ordinary cameras arranged in a special stereoscopic configuration and passive markers on a subject's body to reconstruct the motion of the subject. Firstly for each frame of the video, an adaptive thresholding algorithm is applied for extracting the markers on the subject's body. Once the markers are extracted, an algorithm for matching corresponding markers in each frame is applied. Zhang's planar calibration method is used to calibrate the two cameras. As the cameras use the fisheye lens, they cannot be well estimated using a pinhole camera model which makes it difficult to estimate the depth information. In this work, to restore the 3D coordinates we use a unique calibration method for fisheye lenses. The accuracy of the 3D coordinate reconstruction is evaluated by comparing with results from a commercially available Vicon motion capture system.
1995 feels so close yet so far: the effect of event markers on subjective feelings of elapsed time.
Zauberman, Gal; Levav, Jonathan; Diehl, Kristin; Bhargave, Rajesh
2010-01-01
Why does an event feel more or less distant than another event that occurred around the same time? Prior research suggests that characteristics of an event itself can affect the estimated date of its occurrence. Our work differs in that we focused on how characteristics of the time interval following an event affect people's feelings of elapsed time (i.e., their feelings of how distant an event seems). We argue that a time interval that is punctuated by a greater number of accessible intervening events related to the target event (event markers) will make the target event feel more distant, but that unrelated intervening events will not have this effect. In three studies, we found support for the systematic effect of event markers. The effect of markers was independent of other characteristics of the event, such as its memorability, emotionality, importance, and estimated date, a result suggesting that this effect is distinct from established dating biases.
Peng, Bin; Zhu, Dianwen; Ander, Bradley P; Zhang, Xiaoshuai; Xue, Fuzhong; Sharp, Frank R; Yang, Xiaowei
2013-01-01
The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses have difficulty incorporating correlational, structural, or functional structures among the molecular measures. For microarray gene expression data, we first summarize solutions for dealing with 'large p, small n' problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point of view of systems biology, iBVS enables users to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilistic and biological interpretations. Both simulated data and a set of microarray data for predicting stroke status are used to validate the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed. PMID:23844055
Predicting genotype environmental range from genome-environment associations.
Manel, Stéphanie; Andrello, Marco; Henry, Karine; Verdelet, Daphné; Darracq, Aude; Guerin, Pierre-Edouard; Desprez, Bruno; Devaux, Pierre
2018-05-17
Genome-environment association methods aim to detect genetic markers associated with environmental variables. The detected associations are usually analysed separately to identify the genomic regions involved in local adaptation. However, a recent study suggests that single-locus associations can be combined and used in a predictive way to estimate environmental variables for new individuals on the basis of their genotypes. Here, we introduce an original approach to predict the environmental range (values and upper and lower limits) of species genotypes from the genetic markers significantly associated with those environmental variables in an independent set of individuals. We illustrate this approach by predicting aridity in a database consisting of 950 individuals of wild beets and 299 individuals of cultivated beets genotyped at 14,409 random single nucleotide polymorphisms (SNPs). We detected 66 alleles associated with aridity and used them to calculate the fraction (I) of aridity-associated alleles in each individual. The fraction I correctly predicted the values of aridity in an independent validation set of wild individuals and was then used to predict aridity in the 299 cultivated individuals. Wild individuals had higher median values and a wider range of values of aridity than the cultivated individuals, suggesting that wild individuals have a greater ability to withstand aridity stress and could be used to improve the resistance of cultivated varieties to aridity. This article is protected by copyright. All rights reserved.
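The fraction I described above can be illustrated with a minimal sketch. The abstract does not say whether an allele is counted by presence or by dosage, so the version below assumes presence in a diploid genotype and skips missing calls. The function name, SNP identifiers, and genotypes are hypothetical.

```python
import math


def associated_allele_fraction(genotype, associated_alleles):
    """Fraction of environment-associated alleles carried by one individual.

    genotype: dict mapping SNP id -> tuple with the two observed alleles.
    associated_alleles: dict mapping SNP id -> the allele associated with
        the environmental variable (assumed: one allele per SNP).
    SNPs with no genotype call are skipped (an assumption of this sketch).
    """
    carried = 0
    scored = 0
    for snp, allele in associated_alleles.items():
        calls = genotype.get(snp)
        if calls is None:
            continue  # missing genotype, not scored
        scored += 1
        if allele in calls:
            carried += 1
    return carried / scored if scored else math.nan


# Hypothetical toy data: four associated alleles, one missing call.
associated = {"snp1": "A", "snp2": "T", "snp3": "G", "snp4": "C"}
individual = {"snp1": ("A", "G"), "snp2": ("C", "C"), "snp3": ("G", "G")}
fraction_i = associated_allele_fraction(individual, associated)
```

A dosage-based variant would count 0, 1, or 2 copies per SNP and divide by twice the number of scored SNPs.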
Flores, Danielle; Miller, Amy L.; Showman, Angelique; Tobita, Caitlyn; Shimoda, Lori M.N.; Sung, Carl; Stokes, Alexander J.; Tomberlin, Jeffrey K.; Carter, David O.; Turner, Helen
2016-01-01
Entomological protocols for aging blow fly (Diptera: Calliphoridae) larvae to estimate the time of colonization (TOC) are commonly used to assist in death investigations. While the methodologies for analysing fly larvae differ, most rely on light microscopy, genetic analysis or, more rarely, electron microscopy. This pilot study sought to improve resolution of larval stage in the forensically-important blow fly Chrysomya rufifacies using high-content fluorescence microscopy and biochemical measures of developmental marker proteins. We established fixation and mounting protocols, defined a set of measurable morphometric criteria and captured developmental transitions of 2nd instar to 3rd instar using both fluorescence microscopy and anti-ecdysone receptor Western blot analysis. The data show that these instars can be distinguished on the basis of robust, non-bleaching, autofluorescence of larval posterior spiracles. High content imaging techniques using confocal microscopy, combined with morphometric and biochemical techniques, may therefore aid forensic entomologists in estimating TOC. PMID:27706817
Guo, Bing; Greenwood, Paul L; Cafe, Linda M; Zhou, Guanghong; Zhang, Wangang; Dalrymple, Brian P
2015-03-13
This study aimed to identify markers for muscle growth rate and the different cellular contributors to cattle muscle and to link the muscle growth rate markers to specific cell types. The expression of two groups of genes in the longissimus muscle (LM) of 48 Brahman steers of similar age, significantly enriched for "cell cycle" and "ECM (extracellular matrix) organization" Gene Ontology (GO) terms was correlated with average daily gain/kg liveweight (ADG/kg) of the animals. However, expression of the same genes was only partly related to growth rate across a time course of postnatal LM development in two cattle genotypes, Piedmontese x Hereford (high muscling) and Wagyu x Hereford (high marbling). The deposition of intramuscular fat (IMF) altered the relationship between the expression of these genes and growth rate. K-means clustering across the development time course with a large set of genes (5,596) with similar expression profiles to the ECM genes was undertaken. The locations in the clusters of published markers of different cell types in muscle were identified and used to link clusters of genes to the cell type most likely to be expressing them. Overall correspondence between published cell type expression of markers and predicted major cell types of expression in cattle LM was high. However, some exceptions were identified: expression of SOX8 previously attributed to muscle satellite cells was correlated with angiogenesis. Analysis of the clusters and cell types suggested that the "cell cycle" and "ECM" signals were from the fibro/adipogenic lineage. Significant contributions to these signals from the muscle satellite cells, angiogenic cells and adipocytes themselves were not as strongly supported. Based on the clusters and cell type markers, sets of five genes predicted to be representative of fibro/adipogenic precursors (FAPs) and endothelial cells, and/or ECM remodelling and angiogenesis were identified. 
Gene sets and gene markers for the analysis of many of the major processes/cell populations contributing to muscle composition and growth have been proposed, enabling a consistent interpretation of gene expression datasets from cattle LM. The same gene sets are likely to be applicable in other cattle muscles and in other species.
Boomerang: A method for recursive reclassification.
Devlin, Sean M; Ostrovnaya, Irina; Gönen, Mithat
2016-09-01
While there are many validated prognostic classifiers used in practice, often their accuracy is modest and heterogeneity in clinical outcomes exists in one or more risk subgroups. Newly available markers, such as genomic mutations, may be used to improve the accuracy of an existing classifier by reclassifying patients from a heterogeneous group into a higher or lower risk category. The statistical tools typically applied to develop the initial classifiers are not easily adapted toward this reclassification goal. In this article, we develop a new method designed to refine an existing prognostic classifier by incorporating new markers. The two-stage algorithm called Boomerang first searches for modifications of the existing classifier that increase the overall predictive accuracy and then merges to a prespecified number of risk groups. Resampling techniques are proposed to assess the improvement in predictive accuracy when an independent validation data set is not available. The performance of the algorithm is assessed under various simulation scenarios where the marker frequency, degree of censoring, and total sample size are varied. The results suggest that the method selects few false positive markers and is able to improve the predictive accuracy of the classifier in many settings. Lastly, the method is illustrated on an acute myeloid leukemia data set where a new refined classifier incorporates four new mutations into the existing three-category classifier and is validated on an independent data set. © 2016, The International Biometric Society.
Żuk, Magdalena; Pezowicz, Celina
2015-01-01
Objective. The purpose of the present work was to assess the validity of a six-degrees-of-freedom gait analysis model based on the ISB recommendation on definitions of joint coordinate systems (ISB 6DOF) through a quantitative comparison with the Helen Hayes model (HH) and repeatability assessment. Methods. Four healthy subjects were analysed with both marker sets: an HH marker set and four marker clusters in ISB 6DOF. A navigated pointer was used to indicate the anatomical landmark position in the cluster reference system according to the ISB recommendation. Three gait cycles were selected from the data collected simultaneously for the two marker sets. Results. Both protocols showed good intertrial repeatability, which apart from pelvic rotation did not exceed 2°. The greatest differences between protocols were observed in the transverse plane as well as for knee angles. Knee internal/external rotation revealed the lowest subject-to-subject and interprotocol repeatability and inconsistent patterns for both protocols. Knee range of movement in the transverse plane was overestimated for the HH set (the mean is 34°), which could indicate the cross-talk effect. Conclusions. The ISB 6DOF anatomically based protocol enabled full 3D kinematic description of joints according to the current standard with clinically acceptable intertrial repeatability and minimal equipment requirements.
Yang, Q; Khoury, M J; Atkinson, M; Sun, F; Cheng, R; Flanders, W D
1999-01-01
We used a case-control design to scan the genome for any associations between genetic markers and disease susceptibility loci using the first two replicates of the Mycenaean population from the GAW11 (Problem 2) data. Using a case-control approach, we constructed a series of 2-by-3 tables for each allele of every marker on all six chromosomes. Odds ratios (ORs) and 95% confidence intervals (95% CI) were estimated for all alleles of every marker. We selected the one allele for which the estimated OR had the minimum p-value to plot in the graph. Among these selected ORs, we calculated 95% CI for those that had a p-value ≤ the adjusted alpha level. Significantly high ORs were taken to indicate an association between a marker locus and a suspected disease-susceptibility gene. For the Mycenaean population, the case-control design identified allele number 1 of marker 24 on chromosome 1 to be associated with a disease susceptibility gene, OR = 2.10 (95% CI 1.66-2.62). Our approach failed to show any other significant association between case-control status and genetic markers. Stratified analysis on the environmental risk factor (E1) provided no further evidence of significant association other than allele 1 of marker 24 on chromosome 1. These data indicate the absence of linkage disequilibrium for markers flanking loci A, B, and C. Finally, we examined the effect of gene x environment (G x E) interaction for the identified allele. Our results provided no evidence of G x E interaction, but suggested that the environmental exposure alone was a risk factor for the disease.
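For readers unfamiliar with the quantities reported above, the following sketch computes an odds ratio and a Wald-type 95% confidence interval on the log scale (Woolf's method). It is an assumption that this is comparable to the study's calculation: the abstract describes 2-by-3 tables, whereas the sketch collapses each allele to a 2-by-2 table (allele carried or not). The function name and the counts are hypothetical.

```python
import math


def odds_ratio_with_ci(cases_with, cases_without,
                       controls_with, controls_without, z=1.96):
    """Odds ratio and Wald confidence interval from a 2-by-2 table.

    All four cell counts must be positive; z=1.96 gives a 95% interval.
    """
    cells = (cases_with, cases_without, controls_with, controls_without)
    if min(cells) <= 0:
        raise ValueError("all cell counts must be positive")
    odds_ratio = (cases_with * controls_without) / (cases_without * controls_with)
    # Standard error of log(OR) under Woolf's approximation.
    se_log_or = math.sqrt(sum(1.0 / c for c in cells))
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, not taken from the GAW11 data.
or_hat, ci_low, ci_high = odds_ratio_with_ci(20, 80, 10, 90)
```

An interval that excludes 1 corresponds to a significant association at the chosen level, before any multiple-testing adjustment.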
Localized strain measurements of the intervertebral disc annulus during biaxial tensile testing.
Karakolis, Thomas; Callaghan, Jack P
2015-01-01
Both inter-lamellar and intra-lamellar failures of the annulus have been described as potential modes of disc herniation. Attempts to characterize initial lamellar failure of the annulus have involved tensile testing of small tissue samples. The purpose of this study was to evaluate a method of measuring local surface strains through image analysis of a tensile test conducted on an isolated sample of annular tissue in order to enhance future studies of intervertebral disc failure. An annulus tissue sample was biaxially strained to 10%. High-resolution images captured the tissue surface throughout testing. Three test conditions were evaluated: submerged, non-submerged and marker. Surface strains were calculated for the two non-marker conditions based on motion of virtual tracking points. Tracking algorithm parameters (grid resolution and template size) were varied to determine the effect on estimated strains. Accuracy of point tracking was assessed through a comparison of the non-marker conditions to a condition involving markers placed on the tissue surface. Grid resolution had a larger effect on local strain than template size. Average local strain error ranged from 3% to 9.25% and 0.1% to 2.0%, for the non-submerged and submerged conditions, respectively. Local strain estimation has a relatively high potential for error. Submerging the tissue provided superior strain estimates.
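As a rough illustration of how a local strain can be derived from tracked points, the sketch below computes the engineering strain between two tracked points from their positions before and after loading. The abstract does not specify the strain measure or the tracking algorithm, so both the formula choice and the coordinates here are assumptions for illustration only.

```python
import math


def engineering_strain(p0_ref, p1_ref, p0_def, p1_def):
    """Engineering strain of the segment joining two tracked points.

    p0_ref, p1_ref: (x, y) positions in the reference (unloaded) image.
    p0_def, p1_def: (x, y) positions of the same points after deformation.
    """
    ref_length = math.dist(p0_ref, p1_ref)
    if ref_length == 0:
        raise ValueError("reference points must be distinct")
    deformed_length = math.dist(p0_def, p1_def)
    return (deformed_length - ref_length) / ref_length


# Hypothetical pixel coordinates: a 10-unit segment stretched to 11 units.
strain = engineering_strain((0.0, 0.0), (10.0, 0.0), (0.0, 0.0), (11.0, 0.0))
```

Because the strain is a ratio of small length changes, tracking errors of a fraction of a pixel can produce strain errors of the size reported in the abstract when points are closely spaced.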
GFR Estimation: From Physiology to Public Health
Levey, Andrew S.; Inker, Lesley A.; Coresh, Josef
2014-01-01
Estimating glomerular filtration rate (GFR) is essential for clinical practice, research, and public health. Appropriate interpretation of estimated GFR (eGFR) requires understanding the principles of physiology, laboratory medicine, epidemiology and biostatistics used in the development and validation of GFR estimating equations. Equations developed in diverse populations are less biased at higher GFR than equations developed in CKD populations and are more appropriate for general use. Equations that include multiple endogenous filtration markers are more precise than equations including a single filtration marker. The Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI) equations are the most accurate GFR estimating equations that have been evaluated in large, diverse populations and are applicable for general clinical use. The 2009 CKD-EPI creatinine equation is more accurate in estimating GFR and prognosis than the 2006 Modification of Diet in Renal Disease (MDRD) Study equation and provides lower estimates of prevalence of decreased eGFR. It is useful as a “first” test for decreased eGFR and should replace the MDRD Study equation for routine reporting of serum creatinine–based eGFR by clinical laboratories. The 2012 CKD-EPI cystatin C equation is as accurate as the 2009 CKD-EPI creatinine equation in estimating eGFR, does not require specification of race, and may be more accurate in patients with decreased muscle mass. The 2012 CKD-EPI creatinine–cystatin C equation is more accurate than the 2009 CKD-EPI creatinine and 2012 CKD-EPI cystatin C equations and is useful as a confirmatory test for decreased eGFR as determined by an equation based on serum creatinine. Further improvement in GFR estimating equations will require development in more broadly representative populations, including diverse racial and ethnic groups, use of multiple filtration markers, and evaluation using statistical techniques to compare eGFR to “true GFR”. PMID:24485147
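The abstract does not reproduce any of the equations it compares. As a hedged sketch, the function below implements the 2009 CKD-EPI creatinine equation with the coefficients as originally published, including the race coefficient that the abstract notes is required by that equation. Serum creatinine is assumed to be standardized and in mg/dL; the function and parameter names are hypothetical, and the sketch is for illustration, not clinical use.

```python
def ckd_epi_2009_creatinine(scr_mg_dl, age_years, female, black=False):
    """eGFR in mL/min/1.73 m^2 from the 2009 CKD-EPI creatinine equation.

    scr_mg_dl: standardized serum creatinine in mg/dL.
    female, black: booleans selecting the published sex and race coefficients.
    """
    kappa = 0.7 if female else 0.9
    alpha = -0.329 if female else -0.411
    ratio = scr_mg_dl / kappa
    egfr = (141.0
            * min(ratio, 1.0) ** alpha
            * max(ratio, 1.0) ** -1.209
            * 0.993 ** age_years)
    if female:
        egfr *= 1.018
    if black:
        egfr *= 1.159
    return egfr


# Hypothetical patient: 50-year-old man with serum creatinine 0.9 mg/dL.
example_egfr = ckd_epi_2009_creatinine(0.9, 50, female=False)
```

The two-slope form (separate exponents below and above kappa) is what lets the equation behave differently at low and high creatinine values.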
Kosoy, Roman; Nassir, Rami; Tian, Chao; White, Phoebe A; Butler, Lesley M; Silva, Gabriel; Kittles, Rick; Alarcon-Riquelme, Marta E; Gregersen, Peter K; Belmont, John W; De La Vega, Francisco M; Seldin, Michael F
2009-01-01
To provide a resource for assessing continental ancestry in a wide variety of genetic studies, we identified, validated, and characterized a set of 128 ancestry informative markers (AIMs). The markers were chosen for informativeness, genome-wide distribution, and genotype reproducibility on two platforms (TaqMan assays and Illumina arrays). We analyzed genotyping data from 825 subjects with diverse ancestry, including European, East Asian, Amerindian, African, South Asian, Mexican, and Puerto Rican. A comprehensive set of 128 AIMs and subsets as small as 24 AIMs are shown to be useful tools for ascertaining the origin of subjects from particular continents, and to correct for population stratification in admixed population sample sets. Our findings provide general guidelines for the application of specific AIM subsets as a resource for wide application. We conclude that investigators can use TaqMan assays for the selected AIMs as a simple and cost efficient tool to control for differences in continental ancestry when conducting association studies in ethnically diverse populations. Copyright 2008 Wiley-Liss, Inc.
Omar, Hesham R; Charnigo, Richard; Guglin, Maya
2017-04-01
Congestion is the main contributor to heart failure (HF) morbidity and mortality. We assessed the combined role of congestion and decreased forward flow in predicting morbidity and mortality in acute systolic HF. The Evaluation Study of Congestive Heart Failure and Pulmonary Artery Catheterization Effectiveness trial data set was used to determine if the ratio of simultaneously measured systolic blood pressure (SBP)/right atrial pressure (RAP) on admission predicted HF rehospitalization and 6-month mortality. One hundred ninety-five patients (mean age 56.5 years, 75% men) who received pulmonary artery catheterization were studied. The RAP, SBP, and SBP/RAP had an area under the curve (AUC) of 0.593 (p = 0.0205), 0.585 (p = 0.0359), and 0.621 (p = 0.0026), respectively, in predicting HF rehospitalization. The SBP/RAP was a superior marker of HF rehospitalization compared with RAP alone (difference in AUC 0.0289, p = 0.0385). The optimal criterion of SBP/RAP <11 provided the highest combined sensitivity (77.1%) and specificity (50.9%) in predicting HF rehospitalization. The SBP/RAP had an AUC 0.622, p = 0.0108, and a cut-off value of SBP/RAP <8 had a sensitivity of 61.9% and specificity 64.1% in predicting mortality. Multivariate analysis showed that an SBP/RAP <11 independently predicted rehospitalization for HF (estimated odds ratio 3.318, 95% confidence interval 1.692 to 6.506, p = 0.0005) and an SBP/RAP <8 independently predicted mortality (estimated hazard ratio 2.025, 95% confidence interval 1.069 to 3.833, p = 0.030). In conclusion, SBP/RAP ratio is a marker that identifies a spectrum of complications after hospitalization of patients with decompensated systolic HF, starting with increased incidence of HF rehospitalization at SBP/RAP <11 to increased mortality with SBP/RAP <8. Copyright © 2017 Elsevier Inc. All rights reserved.
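To make the cut-off analysis concrete, the sketch below computes the SBP/RAP ratio and then the sensitivity and specificity of a rule of the form "ratio below cut-off predicts the event". The cut-off of 11 comes from the abstract; the patient values are invented for illustration and the function names are hypothetical.

```python
def sbp_rap_ratio(sbp_mm_hg, rap_mm_hg):
    """Ratio of systolic blood pressure to right atrial pressure."""
    if rap_mm_hg <= 0:
        raise ValueError("right atrial pressure must be positive")
    return sbp_mm_hg / rap_mm_hg


def sensitivity_specificity(ratios, events, cutoff):
    """Sensitivity and specificity of the rule 'ratio < cutoff' for an event.

    ratios: SBP/RAP values; events: booleans (True if the event occurred).
    """
    tp = fp = tn = fn = 0
    for ratio, event in zip(ratios, events):
        flagged = ratio < cutoff
        if flagged and event:
            tp += 1
        elif flagged and not event:
            fp += 1
        elif not flagged and event:
            fn += 1
        else:
            tn += 1
    return tp / (tp + fn), tn / (tn + fp)


# Invented data: (SBP, RAP) pairs and whether rehospitalization occurred.
pressures = [(100, 20), (108, 12), (120, 10), (120, 8), (105, 15), (120, 6)]
events = [True, True, True, False, False, False]
ratios = [sbp_rap_ratio(sbp, rap) for sbp, rap in pressures]
sens, spec = sensitivity_specificity(ratios, events, cutoff=11)
```

Sweeping the cut-off over all observed ratios and plotting sensitivity against one minus specificity yields the ROC curve whose area is the AUC quoted in the abstract.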
‘Alzheimer’s Progression Score’: Development of a Biomarker Summary Outcome for AD Prevention Trials
Leoutsakos, J.-M.; Gross, A.L.; Jones, R.N.; Albert, M.S.; Breitner, J.C.S.
2017-01-01
BACKGROUND Alzheimer’s disease (AD) prevention research requires methods for measurement of disease progression not yet revealed by symptoms. Preferably, such measurement should encompass multiple disease markers. OBJECTIVES Evaluate an item response theory (IRT) model-based latent variable Alzheimer Progression Score (APS) that uses multi-modal disease markers to estimate pre-clinical disease progression. DESIGN Estimate APS scores in the BIOCARD observational study, and in the parallel PREVENT-AD Cohort and its sister INTREPAD placebo-controlled prevention trial. Use BIOCARD data to evaluate whether baseline and early APS trajectory predict later progression to MCI/dementia. Similarly, use longitudinal PREVENT-AD data to assess test measurement invariance over time. Further, assess portability of the PREVENT-AD IRT model to baseline INTREPAD data, and explore model changes when CSF markers are added or withdrawn. SETTING BIOCARD was established in 1995 and participants were followed up to 20 years in Baltimore, USA. The PREVENT-AD and INTREPAD trial cohorts were established between 2011–2015 in Montreal, Canada, using nearly identical entry criteria to enroll high-risk cognitively normal persons aged 60+ then followed for several years. PARTICIPANTS 349 cognitively normal, primarily middle-aged participants in BIOCARD, 125 high-risk participants aged 60+ in PREVENT-AD, and 217 similar subjects in INTREPAD. 106 INTREPAD participants donated up to four serial CSF samples. MEASUREMENTS Global cognitive assessment and multiple structural, functional, and diffusion MRI metrics, sensori-neural tests, and CSF concentrations of tau, Aβ42 and their ratio. RESULTS Both baseline values and early slope of APS scores in BIOCARD predicted later progression to MCI or AD. Presence of CSF variables strongly improved such prediction. A similarly derived APS in PREVENT-AD showed measurement invariance over time and portability to the parallel INTREPAD sample. 
CONCLUSIONS An IRT-based APS can summarize multimodal information to provide a longitudinal measure of pre-clinical AD progression, and holds promise as an outcome for AD prevention trials. PMID:29034223
Palha, Teresinha; Ribeiro-Rodrigues, Elzemar; Ribeiro-dos-Santos, Andrea; Santos, Sidney
2012-05-01
Fourteen Y-STR loci (DYS458, DYS439, Y-GATA H4, DYS576, DYS447, DYS460, DYS456, Y-GATA A10, DYS437, DYS449, DYS570, DYS635 or Y-GATA C4, DYS448 and DYS438) were analysed in 873 males from eight northern Brazil populations: Belém (N=400), Santarém (N=69), Manaus (N=75), Macapá (N=65), Palmas (N=30), Rio Branco (N=32), Porto Velho (N=135) and Boa Vista (N=67). A total of 871 different haplotypes were identified, of which 869 were unique. The panel's estimated total haplotype diversity (HD) is 0.9988, and its discrimination capacity (DC) is 0.9980. The lowest estimates of genetic diversity correspond to markers Y-GATA H4 (0.550) and DYS460 (0.581), and the greatest (above 0.700) to markers DYS458, DYS576, DYS447, DYS449, DYS570 and DYS635. The genetic parameters obtained were higher for the 14-Y-STR panel than those for the minimum haplotype set (HD=0.9969; DC=0.76) and were similar to those obtained with the panel of 17 Y-STRs of YHRD (HD=0.9987; DC=0.9870). The analysis of molecular variance (AMOVA) indicated that most of the genetic variance is found within populations and a smaller, but significant part, is found among populations (R(ST)=0.027, p value=0.009). The data, when compared with those from African, Amerindian and European populations, show no significant genetic distance between northern Brazil populations and Europeans, but a significant genetic distance from Africans and Amerindians. The discrimination capacity of the markers shows a high potential for forensic analysis. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
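The haplotype counts above are enough to work through a diversity calculation. With 873 men, 871 distinct haplotypes and 869 unique ones, the two remaining haplotypes must each be carried by two men. The sketch below builds a sample with that structure (the labels are placeholders) and computes diversity as one minus the sum of squared haplotype frequencies. The abstract does not state which estimator was used, so this uncorrected form is an assumption; it rounds to the reported 0.9988.

```python
from collections import Counter


def haplotype_summary(haplotypes):
    """Return (distinct, unique, diversity) for a list of haplotype labels.

    diversity = 1 - sum(p_i ** 2), where p_i is the sample frequency of
    haplotype i (no n / (n - 1) small-sample correction is applied).
    """
    counts = Counter(haplotypes)
    n = len(haplotypes)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    diversity = 1.0 - sum((c / n) ** 2 for c in counts.values())
    return distinct, unique, diversity


# Placeholder labels: 869 singletons plus two haplotypes seen twice each.
sample = [f"h{i}" for i in range(869)] + ["s1", "s1", "s2", "s2"]
distinct, unique, diversity = haplotype_summary(sample)
```

Applying the n / (n - 1) correction to the same counts gives a value much closer to 1, which is why the choice of estimator matters when comparing panels.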
Glomerular and Tubular Damage Markers in Individuals with Progressive Albuminuria
Nauta, Ferdau L.; Scheven, Lieneke; Meijer, Esther; van Oeveren, Wim; de Jong, Paul E.; Bakker, Stephan J.L.
2013-01-01
Summary Background and objectives Albuminuria is associated with risk for renal and cardiovascular disease. It is difficult to predict which persons will progress in albuminuria. This study investigated whether assessment of urinary markers associated with damage to different parts of the nephron may help identify individuals that will progress in albuminuria. Design, setting, participants, & measurements Individuals were selected from a prospective community-based cohort study with serial follow-up and defined as “progressors” if they belonged to the quintile of participants with the most rapid annual increase in albuminuria, and reached an albuminuria ≥150 mg/d during follow-up. Patients with known renal disease or macroalbuminuria at baseline were excluded. Each progressor was matched to two control participants, based on baseline albuminuria, age, and sex. Furthermore, damage markers were measured in a separate set of healthy individuals. Results After a median follow-up of 8.6 years, 183 of 8394 participants met the criteria for progressive albuminuria. Baseline clinical characteristics were comparable between progressors and matched controls (n=366). Both had higher baseline albuminuria than the overall population. Urinary excretion of the glomerular damage marker IgG was significantly higher in progressors, whereas urinary excretion of proximal tubular damage markers and inflammatory markers was lower in these individuals compared with controls. Healthy individuals (n=109) had the lowest values for all urinary damage markers measured. Conclusions These data suggest that albuminuria associated with markers of glomerular damage is more likely to progress, whereas albuminuria associated with markers of tubulointerstitial damage is more likely to remain stable. PMID:23539232
Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers
Jiang, Yong; Schmidt, Renate H.; Reif, Jochen C.
2018-01-01
Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency, the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are studied. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders because, owing to the tight linkage, a beneficial haplotype will be preserved for many generations. In this respect the inheritance of local epistatic effects very much resembles that of additive effects. PMID:29549092
Prenatal exclusion of Norrie disease with flanking DNA markers.
Gal, A; Uhlhaas, S; Glaser, D; Grimm, T
1988-10-01
Three polymorphic DNA markers linked to the locus of Norrie disease were used for indirect genotype analysis in a ten-week-old fetus at risk for the disease. When haplotypes of the family members and the estimated recombination frequency between the Norrie gene and each of the DNA marker loci DXS7, DXS84, and DXS146 were taken into account, the risk that the fetus had inherited the mutation was about 1%.
Mohanty, Sambit K; Smith, Steven C; Chang, Elena; Luthringer, Daniel J; Gown, Allen M; Aron, Manju; Amin, Mahul B
2014-08-01
New immunohistochemical (IHC) markers of urothelial carcinoma (UCa) and prostatic adenocarcinoma (PCa) have emerged in recent years, yet comparative studies to establish markers remain lacking. We aimed to identify an effective but parsimonious approach for poorly differentiated bladder neck lesions, to establish a best practice panel approach in a setting simulating prospective use. We tested the performance of a panel of IHC markers on whole sections of a consecutive cohort of transurethral resection specimens of poorly differentiated, challenging bladder neck resections (n=36). In the setting of poorly differentiated bladder neck carcinomas, biomarker sensitivities for UCa were as follows: GATA3, 100%; S100P, 88%; p63, 75%; and cytokeratin (CK) 5/6, 56%; specificities of each were 100%. CK7 and CK20 showed sensitivities of 75% and 63%, though these were only 85% and 80% specific. For PCa markers, NKX3.1, p501S, prostate-specific membrane antigen, and androgen receptor (AR) each showed 100% sensitivity, outperforming ERG (35%) and prostate-specific antigen (PSA; 25%). All the prostate histogenesis markers were 100% specific, except for AR, which was positive in 13% of the UCa cases. Novel IHC markers show improved diagnostic performance that enables positive and negative support for identifying histogenesis with the use of as few as two markers for this critical therapeutic distinction. PSA underperforms newer markers. Copyright© by the American Society for Clinical Pathology.
Iterative h-minima-based marker-controlled watershed for cell nucleus segmentation.
Koyuncu, Can Fahrettin; Akhan, Ece; Ersahin, Tulin; Cetin-Atalay, Rengul; Gunduz-Demir, Cigdem
2016-04-01
Automated microscopy imaging systems facilitate high-throughput screening in molecular cellular biology research. The first step of these systems is cell nucleus segmentation, which has a great impact on the success of the overall system. The marker-controlled watershed is a technique commonly used in previous studies for nucleus segmentation. These studies define their markers by finding regional minima on the intensity/gradient and/or distance transform maps. They typically use the h-minima transform beforehand to suppress noise on these maps. The selection of the h value is critical; unnecessarily small values do not sufficiently suppress the noise, resulting in false and oversegmented markers, and unnecessarily large ones suppress too many pixels, causing missing and undersegmented markers. Because cell nuclei show different characteristics within an image, the same h value may not work to define correct markers for all the nuclei. To address this issue, in this work, we propose a new watershed algorithm that iteratively identifies its markers, considering a set of different h values. In each iteration, the proposed algorithm defines a set of candidates using a particular h value and selects the markers from those candidates provided that they fulfill the size requirement. Working with widefield fluorescence microscopy images, our experiments reveal that the use of multiple h values in our iterative algorithm leads to better segmentation results, compared to its counterparts. © 2016 International Society for Advancement of Cytometry.
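To illustrate why the choice of h matters, the following is a minimal one-dimensional sketch, not the authors' algorithm. It assumes the h-minima transform is computed as a morphological reconstruction by erosion of the signal raised by h, and it simply counts the surviving regional minima (the would-be markers) for a few h values. The function names `h_minima` and `count_regional_minima` and the toy intensity profile are hypothetical; the paper's candidate selection and size requirement are not reproduced here.

```python
def h_minima(signal, h):
    """Suppress minima shallower than h (1-D reconstruction by erosion).

    Assumption: the marker is signal + h and the mask is the signal itself.
    """
    mask = list(signal)
    marker = [v + h for v in mask]
    while True:
        eroded = []
        for i in range(len(marker)):
            lo = max(0, i - 1)
            hi = min(len(marker), i + 2)
            # Geodesic erosion: local minimum of the marker, kept above the mask.
            eroded.append(max(mask[i], min(marker[lo:hi])))
        if eroded == marker:
            return marker
        marker = eroded


def count_regional_minima(values):
    """Count plateaus that are strictly lower than both neighbouring values."""
    count = 0
    i = 0
    n = len(values)
    while i < n:
        j = i
        while j + 1 < n and values[j + 1] == values[i]:
            j += 1
        left_higher = i == 0 or values[i - 1] > values[i]
        right_higher = j == n - 1 or values[j + 1] > values[i]
        if left_higher and right_higher:
            count += 1
        i = j + 1
    return count


# Hypothetical intensity profile with one shallow and two deep minima.
profile = [5, 1, 5, 4, 5, 0, 5]
for h in (0, 2, 10):
    print(h, count_regional_minima(h_minima(profile, h)))
```

On this toy profile, h = 0 keeps all three minima, h = 2 removes only the shallow one, and h = 10 flattens the profile into a single plateau, which mirrors the over- and under-segmentation trade-off described in the abstract.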
Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life.
Parks, Donovan H; Rinke, Christian; Chuvochina, Maria; Chaumeil, Pierre-Alain; Woodcroft, Ben J; Evans, Paul N; Hugenholtz, Philip; Tyson, Gene W
2017-11-01
Challenges in cultivating microorganisms have limited the phylogenetic diversity of currently available microbial genomes. This is being addressed by advances in sequencing throughput and computational techniques that allow for the cultivation-independent recovery of genomes from metagenomes. Here, we report the reconstruction of 7,903 bacterial and archaeal genomes from >1,500 public metagenomes. All genomes are estimated to be ≥50% complete and nearly half are ≥90% complete with ≤5% contamination. These genomes increase the phylogenetic diversity of bacterial and archaeal genome trees by >30% and provide the first representatives of 17 bacterial and three archaeal candidate phyla. We also recovered 245 genomes from the Patescibacteria superphylum (also known as the Candidate Phyla Radiation) and find that the relative diversity of this group varies substantially with different protein marker sets. The scale and quality of this data set demonstrate that recovering genomes from metagenomes provides an expedient path forward to exploring microbial dark matter.
Comparison of three techniques for estimating the forage intake of lactating dairy cows on pasture.
Macoon, B; Sollenberger, L E; Moore, J E; Staples, C R; Fike, J H; Portier, K M
2003-09-01
Quantifying DMI is necessary for estimation of nutrient consumption by ruminants, but it is inherently difficult on grazed pastures and even more so when supplements are fed. Our objectives were to compare three methods of estimating forage DMI (inference from animal performance, evaluation from fecal output using a pulse-dose marker, and estimation from herbage disappearance methods) and to identify the most useful approach or combination of approaches for estimating pasture intake by lactating dairy cows. During three continuous 28-d periods in the winter season, Holstein cows (Bos taurus; n = 32) grazed a cool-season grass or a cool-season grass-clover mixture at two stocking rates (SR; 5 vs. 2.5 cows/ha) and were fed two rates of concentrate supplementation (CS; 1 kg of concentrate [as-fed] per 2.5 or 3.5 kg of milk produced). Animal response data used in computations for the animal performance method were obtained from the latter 14 d of each period. For the pulse-dose marker method, chromium-mordanted fiber was used. Pasture sampling to determine herbage disappearance was done weekly throughout the study. Forage DMI estimated by the animal performance method was different among periods (P < 0.001; 6.5, 6.4, and 9.6 kg/d for Periods 1, 2, and 3, respectively), between SR (P < 0.001; 8.7 [low SR] vs. 6.3 kg/d [high SR]) and between CS (P < 0.01; 8.4 [low CS] vs. 6.6 kg/d [high CS]). The period and SR effect seemed to be related to forage mass. The pulse-dose marker method generally provided greater estimates of forage DMI (as much as 11.0 kg/d more than the animal performance method) and was not correlated with the other methods. Estimates of forage DMI by the herbage disappearance method were correlated with the animal performance method. The differences between estimates from these two methods, ranging from -4.7 to 5.4 kg/d, were much smaller than their differences from the pulse-dose marker estimates.
The results of this study suggest that, when appropriate for the research objectives, the animal performance or herbage disappearance methods may be useful and less costly alternatives to using the pulse-dose method.
Smooth time-dependent receiver operating characteristic curve estimators.
Martínez-Camblor, Pablo; Pardo-Fernández, Juan Carlos
2018-03-01
The receiver operating characteristic curve is a popular graphical method often used to study the diagnostic capacity of continuous (bio)markers. When the considered outcome is a time-dependent variable, two main extensions have been proposed: the cumulative/dynamic receiver operating characteristic curve and the incident/dynamic receiver operating characteristic curve. In both cases, the main problem for developing appropriate estimators is the estimation of the joint distribution of the variables time-to-event and marker. As usual, different approximations lead to different estimators. In this article, the authors explore the use of a bivariate kernel density estimator which accounts for censored observations in the sample and produces smooth estimators of the time-dependent receiver operating characteristic curves. The performance of the resulting cumulative/dynamic and incident/dynamic receiver operating characteristic curves is studied by means of Monte Carlo simulations. Additionally, the influence of the choice of the required smoothing parameters is explored. Finally, two real-applications are considered. An R package is also provided as a complement to this article.
Kumar, Satish; Molloy, Claire; Muñoz, Patricio; Daetwyler, Hans; Chagné, David; Volz, Richard
2015-01-01
The nonadditive genetic effects may have an important contribution to total genetic variation of phenotypes, so estimates of both the additive and nonadditive effects are desirable for breeding and selection purposes. Our main objectives were to: estimate additive, dominance and epistatic variances of apple (Malus × domestica Borkh.) phenotypes using relationship matrices constructed from genome-wide dense single nucleotide polymorphism (SNP) markers; and compare the accuracy of genomic predictions using genomic best linear unbiased prediction models with or without including nonadditive genetic effects. A set of 247 clonally replicated individuals was assessed for six fruit quality traits at two sites, and also genotyped using an Illumina 8K SNP array. Across several fruit quality traits, the additive, dominance, and epistatic effects contributed about 30%, 16%, and 19%, respectively, to the total phenotypic variance. Models ignoring nonadditive components yielded upwardly biased estimates of additive variance (heritability) for all traits in this study. The accuracy of genomic predicted genetic values (GEGV) varied from about 0.15 to 0.35 for various traits, and these were almost identical for models with or without including nonadditive effects. However, models including nonadditive genetic effects further reduced the bias of GEGV. Between-site genotypic correlations were high (>0.85) for all traits, and genotype-site interaction accounted for <10% of the phenotypic variability. The accuracy of prediction, when the validation set was present only at one site, was generally similar for both sites, and varied from about 0.50 to 0.85. The prediction accuracies were strongly influenced by trait heritability, and genetic relatedness between the training and validation families. PMID:26497141
Plasma Amyloid-β Levels, Cerebral Small Vessel Disease, and Cognition: The Rotterdam Study.
Hilal, Saima; Akoudad, Saloua; van Duijn, Cornelia M; Niessen, Wiro J; Verbeek, Marcel M; Vanderstichele, Hugo; Stoops, Erik; Ikram, M Arfan; Vernooij, Meike W
2017-01-01
Plasma amyloid-β (Aβ) levels are increasingly studied as a potential, accessible marker of cognitive impairment and dementia. The most common plasma Aβ isoforms, i.e., Aβ1-40 and Aβ1-42, have been linked with risk of Alzheimer's disease. However, it remains under-explored whether plasma Aβ levels, including the novel Aβ1-38, relate to vascular brain disease and cognition in a preclinical phase of dementia. Objective: To examine the association of plasma Aβ levels (i.e., Aβ1-38, Aβ1-40, and Aβ1-42) with markers of cerebral small vessel disease (SVD) and cognition in a large population-based setting. We analyzed plasma Aβ levels in 1201 subjects from two independent cohorts of the Rotterdam Study. Markers of SVD [lacunes, white matter hyperintensity (WMH) volume] were assessed on brain MRI (1.5T). Cognition was assessed by a detailed neuropsychological battery. In each cohort, the association of Aβ levels with SVD and cognition was assessed using regression models. Estimates were then pooled across cohorts using inverse variance meta-analysis with fixed effects. Higher levels of plasma Aβ1-38, Aβ1-40, Aβ1-42, and Aβ1-40/Aβ1-42 ratio were associated with increasing lacunar and microbleed counts. Moreover, higher levels of Aβ1-40 and Aβ1-40/Aβ1-42 were significantly associated with larger WMH volumes. With regard to cognition, higher levels of Aβ1-38, Aβ1-40, and Aβ1-40/Aβ1-42 were related to worse performance on cognitive tests, specifically in the memory domain. Higher plasma Aβ levels are associated with subclinical markers of vascular disease and poorer memory. Plasma Aβ levels thus mark the presence of vascular brain pathology.
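The pooling step mentioned in this abstract, inverse variance meta-analysis with fixed effects, can be sketched as follows. This is a generic textbook illustration rather than the study's code: each cohort estimate is weighted by the reciprocal of its squared standard error. The function name `pool_fixed_effect` and the example numbers are hypothetical and are not taken from the Rotterdam Study.

```python
import math


def pool_fixed_effect(estimates, std_errors):
    """Fixed-effect inverse-variance pooling of per-cohort estimates.

    Returns the pooled estimate and its standard error.
    """
    weights = [1.0 / se ** 2 for se in std_errors]
    total_weight = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    return pooled, pooled_se


# Hypothetical regression coefficients from two cohorts (not study data).
pooled, pooled_se = pool_fixed_effect([0.10, 0.30], [0.10, 0.10])
print(pooled, pooled_se)
```

With equal standard errors the pooled estimate is the plain average of the two cohorts; when one cohort is more precise, its estimate dominates the pooled value.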
Stoeckel, D.M.; Stelzer, E.A.; Stogner, R.W.; Mau, D.P.
2011-01-01
Protocols for microbial source tracking of fecal contamination generally are able to identify when a source of contamination is present, but thus far have been unable to evaluate what portion of fecal-indicator bacteria (FIB) came from various sources. A mathematical approach to estimate relative amounts of FIB, such as Escherichia coli, from various sources based on the concentration and distribution of microbial source tracking markers in feces was developed. The approach was tested using dilute fecal suspensions, then applied as part of an analytical suite to a contaminated headwater stream in the Rocky Mountains (Upper Fountain Creek, Colorado). In one single-source fecal suspension, a source that was not present could not be excluded because of incomplete marker specificity; however, human and ruminant sources were detected whenever they were present. In the mixed-feces suspension (pet and human), the minority contributor (human) was detected at a concentration low enough to preclude human contamination as the dominant source of E. coli to the sample. Without the semi-quantitative approach described, simple detects of human-associated marker in stream samples would have provided inaccurate evidence that human contamination was a major source of E. coli to the stream. In samples from Upper Fountain Creek, the pattern of E. coli, general and host-associated microbial source tracking markers, nutrients, and wastewater-associated chemical detections, augmented with local observations and land-use patterns, indicated that, contrary to expectations, birds rather than humans or ruminants were the predominant source of fecal contamination to Upper Fountain Creek. This new approach to E. coli allocation, validated by a controlled study and tested by application in a relatively simple setting, represents a widely applicable step forward in the field of microbial source tracking of fecal contamination. © 2011 Elsevier Ltd.
Spencer, Kevin; Cuckle, Howard S
2002-10-01
To assess the within person biological variability of first trimester maternal serum biochemical markers of trisomy 21 across the 10-14 week gestational period. To evaluate whether repeat sampling and testing of free beta-hCG and PAPP-A during this period would result in an improved detection rate. Women presenting at the first trimester OSCAR clinic have blood collected prior to ultrasound dating and nuchal translucency measurement. All samples are analysed for free beta-hCG and PAPP-A before an accurate estimate of gestation is available. In 10% of cases the gestation is prior to the minimum time for NT measurement (11 weeks) and these women are rebooked for a repeat visit to the clinic at the appropriate time. Our fetal database was interrogated to obtain cases in which two maternal blood samples had been collected and analysed in the 10-14 week period. Using data from the marker correlations and statistical modelling, the impact of repeat testing on detection rate for trisomy 21 at a fixed 5% false positive rate, was assessed. 261 pairs of data were available for analysis collected over a 3 year period. The correlation between free beta-hCG in sample 1 and sample 2 was 0.890 and that for PAPP-A was 0.827. The average within person biological variation for free beta-hCG was 21% and 32% for PAPP-A. The increase in detection rate when using both sets of marker data was 3.5% when using serum biochemistry and maternal age, and 1.3% when using nuchal translucency, serum biochemistry and maternal age. Repeat sampling and testing of maternal serum biochemical markers is unlikely to substantially improve first trimester screening performance. Copyright 2002 John Wiley & Sons, Ltd.
Gu, Junfei; Yin, Xinyou; Zhang, Chengwei; Wang, Huaqi; Struik, Paul C.
2014-01-01
Background and Aims Genetic markers can be used in combination with ecophysiological crop models to predict the performance of genotypes. Crop models can estimate the contribution of individual markers to crop performance in given environments. The objectives of this study were to explore the use of crop models to design markers and virtual ideotypes for improving yields of rice (Oryza sativa) under drought stress. Methods Using the model GECROS, crop yield was dissected into seven easily measured parameters. Loci for these parameters were identified for a rice population of 94 introgression lines (ILs) derived from two parents differing in drought tolerance. Marker-based values of ILs for each of these parameters were estimated from additive allele effects of the loci, and were fed to the model in order to simulate yields of the ILs grown under well-watered and drought conditions and in order to design virtual ideotypes for those conditions. Key Results To account for genotypic yield differences, it was necessary to parameterize the model for differences in an additional trait ‘total crop nitrogen uptake’ (Nmax) among the ILs. Genetic variation in Nmax had the most significant effect on yield; five other parameters also significantly influenced yield, but seed weight and leaf photosynthesis did not. Using the marker-based parameter values, GECROS also simulated yield variation among 251 recombinant inbred lines of the same parents. The model-based dissection approach detected more markers than the analysis using only yield per se. Model-based sensitivity analysis ranked all markers for their importance in determining yield differences among the ILs. Virtual ideotypes based on markers identified by modelling had 10–36 % more yield than those based on markers for yield per se. Conclusions This study outlines a genotype-to-phenotype approach that exploits the potential value of marker-based crop modelling in developing new plant types with high yields. 
The approach can provide more markers for selection programmes for specific environments whilst also allowing for prioritization. Crop modelling is thus a powerful tool for marker design for improved rice yields and for ideotyping under contrasting conditions. PMID:24984712
Wheat Landrace Genome Diversity
Wingen, Luzie U.; West, Claire; Leverington-Waite, Michelle; Collier, Sarah; Orford, Simon; Goram, Richard; Yang, Cai-Yun; King, Julie; Allen, Alexandra M.; Burridge, Amanda; Edwards, Keith J.; Griffiths, Simon
2017-01-01
Understanding the genomic complexity of bread wheat (Triticum aestivum L.) is a cornerstone in the quest to unravel the processes of domestication and the following adaptation of domesticated wheat to a wide variety of environments across the globe. Additionally, it is of importance for future improvement of the crop, particularly in the light of climate change. Focusing on the adaptation after domestication, a nested association mapping (NAM) panel of 60 segregating biparental populations was developed, mainly involving landrace accessions from the core set of the Watkins hexaploid wheat collection optimized for genetic diversity. A modern spring elite variety, “Paragon,” was used as common reference parent. Genetic maps were constructed following identical rules to make them comparable. In total, 1611 linkage groups were identified, based on recombination from an estimated 126,300 crossover events over the whole NAM panel. A consensus map, named landrace consensus map (LRC), was constructed and contained 2498 genetic loci. These newly developed genetics tools were used to investigate the rules underlying genome fluidity or rigidity, e.g., by comparing marker distances and marker orders. In general, marker order was highly correlated, which provides support for strong synteny between bread wheat accessions. However, many exceptional cases of incongruent linkage groups and increased marker distances were also found. Segregation distortion was detected for many markers, sometimes as hot spots present in different populations. Furthermore, evidence for translocations in at least 36 of the maps was found. These translocations fell, in general, into many different translocation classes, but a few translocation classes were found in several accessions, the most frequent one being the well-known T5B:7B translocation. 
Loci involved in recombination rate, which is an interesting trait for plant breeding, were identified by QTL analyses using the crossover counts as a trait. In total, 114 significant QTL were detected, nearly half of them with increasing effect from the nonreference parents. PMID:28213475
A gene-based SNP resource and linkage map for the copepod Tigriopus californicus
2011-01-01
Background As yet, few genomic resources have been developed in crustaceans. This lack is particularly evident in Copepoda, given the extraordinary numerical abundance, and taxonomic and ecological diversity of this group. Tigriopus californicus is ideally suited to serve as a genetic model copepod and has been the subject of extensive work in environmental stress and reproductive isolation. Accordingly, we set out to develop a broadly-useful panel of genetic markers and to construct a linkage map dense enough for quantitative trait locus detection in an interval mapping framework for T. californicus--a first for copepods. Results One hundred and ninety Single Nucleotide Polymorphisms (SNPs) were used to genotype our mapping population of 250 F2 larvae. We were able to construct a linkage map with an average intermarker distance of 1.8 cM, and a maximum intermarker distance of 10.3 cM. All markers were assembled into linkage groups, and the 12 linkage groups corresponded to the 12 known chromosomes of T. californicus. We estimate a total genome size of 401.0 cM, and a total coverage of 73.7%. Seventy five percent of the mapped markers were detected in 9 additional populations of T. californicus. Of available model arthropod genomes, we were able to show more colocalized pairs of homologues between T. californicus and the honeybee Apis mellifera, than expected by chance, suggesting preserved macrosynteny between Hymenoptera and Copepoda. Conclusions Our study provides an abundance of linked markers spanning all chromosomes. Many of these markers are also found in multiple populations of T. californicus, and in two other species in the genus. The genomic resource we have developed will enable mapping throughout the geographical range of this species and in closely related species. This linkage map will facilitate genome sequencing, mapping and assembly in an ecologically and taxonomically interesting group for which genomic resources are currently under development. 
PMID:22103327
Karger, S; Krause, K; Gutknecht, M; Schierle, K; Graf, D; Steinert, F; Dralle, H; Führer, D
2012-01-01
Background: Previously, we reported a six-marker gene set, which allowed a molecular discrimination of benign and malignant thyroid tumours. Now, we evaluated these markers in fine-needle aspiration biopsies (FNAB) in a prospective, independent series of thyroid tumours with proven histological outcome. Methods: Quantitative RT–PCR was performed (ADM3, HGD1, LGALS3, PLAB, TFF3, TG) in the needle wash-out of 156 FNAB of follicular adenoma (FA), adenomatous nodules, follicular and papillary thyroid cancers (TC) and normal thyroid tissues (NT). Results: Significant expression differences were found for TFF3, HGD1, ADM3 and LGALS3 in FNAB of TC compared with benign thyroid nodules and NT. Using two-marker gene sets, a specific FNAB distinction of benign and malignant tumours was achieved with negative predictive values (NPV) up to 0.78 and positive predictive values (PPV) up to 0.84. Two FNAB marker gene combinations (ADM3/TFF3; ADM3/ACTB) allowed the distinction of FA and malignant follicular neoplasia with NPV up to 0.94 and PPV up to 0.86. Conclusion: We demonstrate that molecular FNAB diagnosis of benign and malignant thyroid tumours including follicular neoplasia is possible with recently identified marker gene combinations. We propose multi-centre FNAB studies on these markers to bring this promising diagnostic tool closer to clinical practice. PMID:22223087
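For readers less familiar with the predictive values reported above, the sketch below shows how PPV and NPV follow from a 2x2 table of test result versus histology. The function name `predictive_values` and the counts are hypothetical, chosen only to make the arithmetic easy; they are not data from this study.

```python
def predictive_values(tp, fp, tn, fn):
    """Return (PPV, NPV) from counts of a 2x2 diagnostic table.

    tp/fp: test-positive cases that are truly malignant / truly benign.
    tn/fn: test-negative cases that are truly benign / truly malignant.
    """
    if tp + fp == 0 or tn + fn == 0:
        raise ValueError("need at least one positive and one negative test result")
    ppv = tp / (tp + fp)  # share of positive calls that are correct
    npv = tn / (tn + fn)  # share of negative calls that are correct
    return ppv, npv


# Hypothetical counts (not study data): 10 positive and 10 negative calls.
print(predictive_values(tp=8, fp=2, tn=9, fn=1))
```

Unlike sensitivity and specificity, both values depend on how common malignancy is in the tested series, so they do not transfer directly to populations with a different case mix.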
Individual participant data meta-analyses should not ignore clustering
Abo-Zaid, Ghada; Guo, Boliang; Deeks, Jonathan J.; Debray, Thomas P.A.; Steyerberg, Ewout W.; Moons, Karel G.M.; Riley, Richard David
2013-01-01
Objectives Individual participant data (IPD) meta-analyses often analyze their IPD as if coming from a single study. We compare this approach with analyses that rather account for clustering of patients within studies. Study Design and Setting Comparison of effect estimates from logistic regression models in real and simulated examples. Results The estimated prognostic effect of age in patients with traumatic brain injury is similar, regardless of whether clustering is accounted for. However, a family history of thrombophilia is found to be a diagnostic marker of deep vein thrombosis [odds ratio, 1.30; 95% confidence interval (CI): 1.00, 1.70; P = 0.05] when clustering is accounted for but not when it is ignored (odds ratio, 1.06; 95% CI: 0.83, 1.37; P = 0.64). Similarly, the treatment effect of nicotine gum on smoking cessation is severely attenuated when clustering is ignored (odds ratio, 1.40; 95% CI: 1.02, 1.92) rather than accounted for (odds ratio, 1.80; 95% CI: 1.29, 2.52). Simulations show models accounting for clustering perform consistently well, but downwardly biased effect estimates and low coverage can occur when ignoring clustering. Conclusion Researchers must routinely account for clustering in IPD meta-analyses; otherwise, misleading effect estimates and conclusions may arise. PMID:23651765
Identifying Potential Regions of Copy Number Variation for Bipolar Disorder
Chen, Yi-Hsuan; Lu, Ru-Band; Hung, Hung; Kuo, Po-Hsiu
2014-01-01
Bipolar disorder is a complex psychiatric disorder with high heritability, but its genetic determinants are still largely unknown. Copy number variation (CNV) is one of the sources to explain part of the heritability. However, it is a challenge to estimate discrete values of the copy numbers using continuous signals calling from a set of markers, and to simultaneously perform association testing between CNVs and phenotypic outcomes. The goal of the present study is to perform a series of data filtering and analysis procedures using a DNA pooling strategy to identify potential CNV regions that are related to bipolar disorder. A total of 200 normal controls and 200 clinically diagnosed bipolar patients were recruited in this study, and were randomly divided into eight control and eight case pools. Genome-wide genotyping was employed using Illumina Human Omni1-Quad array with approximately one million markers for CNV calling. We aimed at setting a series of criteria to filter out the signal noise of marker data and to reduce the chance of false-positive findings for CNV regions. We first defined CNV regions for each pool. Potential CNV regions were reported based on the different patterns of CNV status between cases and controls. Genes that were mapped into the potential CNV regions were examined with association testing, Gene Ontology enrichment analysis, and checked with existing literature for their associations with bipolar disorder. We reported several CNV regions that are related to bipolar disorder. Two CNV regions on chromosome 11 and 22 showed significant signal differences between cases and controls (p < 0.05). Another five CNV regions on chromosome 6, 9, and 19 were overlapped with results in previous CNV studies. Experimental validation of two CNV regions lent some support to our reported findings. Further experimental and replication studies could be designed for these selected regions. PMID:27605030
Estrella, Michelle M.; Astor, Brad C.; Köttgen, Anna; Selvin, Elizabeth; Coresh, Josef; Parekh, Rulan S.
2010-01-01
Background. Anaemia worsens as kidney function declines. Both conditions are associated with increased mortality. Serum cystatin C is purportedly a more sensitive marker of kidney disease and a better predictor of mortality than serum creatinine. However, studies suggest that extrarenal factors also influence cystatin C levels. Methods. We determined whether estimates of glomerular filtration rate [estimated glomerular filtration rate (eGFR)] based on serum cystatin C alone or in combination with serum creatinine were superior to those based on serum creatinine in recognizing impaired kidney function in the setting of anaemia in a sub-sample of the Third National Health and Nutrition Examination Survey of the USA consisting of 6734 participants, 20 years or older. Results. The prevalence of moderate to severe kidney disease (eGFR 15–59 mL/min/1.73 m2) among anaemic persons was 15–16% when based on serum creatinine alone (eGFRSCR) or combined with cystatin C (eGFRSCR + CYSC); this estimate increased to nearly 25% when kidney function was estimated by cystatin C (eGFRCYSC). The adjusted odds ratios of kidney disease in anaemic versus non-anaemic persons were slightly higher with eGFRCYSC than eGFRSCR and eGFRSCR + CYSC in younger adults [odds ratio (OR) = 5.22, 95% confidence interval (CI): 2.23, 12.17], women (OR = 5.34, 95% CI: 2.36, 12.06) and those with elevated C-reactive protein (CRP) (OR = 7.36, 95% CI: 1.98–27.36). Conclusions. Impaired kidney function was common in individuals with anaemia. Among anaemic individuals, the prevalence estimate for kidney disease was notably higher when kidney function was estimated by cystatin C alone compared with the estimations by serum creatinine alone or in combination with serum cystatin C. eGFRCYSC may be particularly helpful in identifying kidney disease in the setting of anaemia among younger persons, women and those with elevated CRP. 
Regardless of which renal biomarker is used, our study suggests that an evaluation for underlying kidney disease should be considered in the standard workup of anaemia. PMID:20176612
2012-01-01
Background Quantitative trait loci (QTL) detection on a huge amount of phenotypes, like eQTL detection on transcriptomic data, can be dramatically impaired by the statistical properties of interval mapping methods. One of these major outcomes is the high number of QTL detected at marker locations. The present study aims at identifying and specifying the sources of this bias, in particular in the case of analysis of data issued from outbred populations. Analytical developments were carried out in a backcross situation in order to specify the bias and to propose an algorithm to control it. The outbred population context was studied through simulated data sets in a wide range of situations. The likelihood ratio test was firstly analyzed under the "one QTL" hypothesis in a backcross population. Designs of sib families were then simulated and analyzed using the QTL Map software. On the basis of the theoretical results in backcross, parameters such as the population size, the density of the genetic map, the QTL effect and the true location of the QTL, were taken into account under the "no QTL" and the "one QTL" hypotheses. A combination of two non parametric tests - the Kolmogorov-Smirnov test and the Mann-Whitney-Wilcoxon test - was used in order to identify the parameters that affected the bias and to specify how much they influenced the estimation of QTL location. Results A theoretical expression of the bias of the estimated QTL location was obtained for a backcross type population. We demonstrated a common source of bias under the "no QTL" and the "one QTL" hypotheses and qualified the possible influence of several parameters. Simulation studies confirmed that the bias exists in outbred populations under both the hypotheses of "no QTL" and "one QTL" on a linkage group. The QTL location was systematically closer to marker locations than expected, particularly in the case of low QTL effect, small population size or low density of markers, i.e. designs with low power. 
Practical recommendations for experimental designs for QTL detection in outbred populations are given on the basis of this bias quantification. Furthermore, an original algorithm is proposed to adjust the location of a QTL, obtained with interval mapping, that co-locates with a marker. Conclusions Therefore, one should be cautious when a QTL is mapped at the location of a marker, especially under low-power conditions. PMID:22520935
Manni, Mosè; Gomulski, Ludvik M; Aketarawong, Nidchaya; Tait, Gabriella; Scolari, Francesca; Somboon, Pradya; Guglielmino, Carmela R; Malacrida, Anna R; Gasperi, Giuliano
2015-03-28
The dramatic worldwide expansion of Aedes albopictus (the Asian tiger mosquito) and its vector competence for numerous arboviruses represent a growing threat to public health security. Molecular markers are crucially needed for tracking the rapid spread of this mosquito and to obtain a deeper knowledge of population structure. This is a fundamental requirement for the development of strict monitoring protocols and for the improvement of sustainable control measures. Wild population samples from putative source areas and from newly colonised regions were analysed for variability at the ribosomal DNA internal transcribed spacer 2 (ITS2). Moreover, a new set of 23 microsatellite markers (SSR) was developed. Sixteen of these SSRs were tested in an ancestral (Thailand) and two adventive Italian populations. Seventy-six ITS2 sequences representing 52 unique haplotypes were identified, and AMOVA indicated that most of their variation occurred within individuals (74.36%), while only about 8% was detected among populations. Spatial analyses of molecular variance revealed that haplotype genetic similarity was not related to the geographic proximity of populations and the haplotype phylogeny clearly indicated that highly related sequences were distributed across populations from different geographical regions. The SSR markers displayed a high level of polymorphism both in the ancestral and in adventive populations, and FST estimates suggested the absence of great differentiation. The ancestral nature of the Thai population was corroborated by its higher level of variability. The two types of genetic markers here implemented revealed the distribution of genetic diversity within and between populations and provide clues on the dispersion dynamics of this species. It appears that the diffusion of this mosquito does not conform to a progressive expansion from the native Asian source area, but to a relatively recent and chaotic propagule distribution mediated by human activities.
Under this scenario, multiple introductions and admixture events probably play an important role in maintaining the genetic diversity and in avoiding bottleneck effects. The polymorphic SSR markers here implemented will provide an important tool for reconstructing the routes of invasion followed by this mosquito.
Kim, Hyun-Joong; Ryu, Ji-Oh; Song, Ji-Yeon; Kim, Hae-Yeong
2017-07-01
In the detection of Shigella species using molecular biological methods, previously known genetic markers for Shigella species were not sufficient to discriminate between Shigella species and diarrheagenic Escherichia coli. The purposes of this study were to screen for genetic markers of the Shigella genus and four Shigella species through comparative genomics and develop a multiplex polymerase chain reaction (PCR) for the detection of shigellae and Shigella species. A total of seven genomic DNA sequences from Shigella species were subjected to comparative genomics for the screening of genetic markers of shigellae and each Shigella species. The primer sets were designed from the screened genetic markers and evaluated using PCR with genomic DNAs from Shigella and other bacterial strains in Enterobacteriaceae. A novel Shigella quintuplex PCR, designed for the detection of Shigella genus, S. dysenteriae, S. boydii, S. flexneri, and S. sonnei, was developed from the evaluated primer sets, and its performance was demonstrated with specifically amplified results from each Shigella species. This Shigella multiplex PCR is the first to be reported with novel genetic markers developed through comparative genomics and may be a useful tool for the accurate detection of the Shigella genus and species from closely related bacteria in clinical microbiology and food safety.
Yu, Jihnhee; Yang, Luge; Vexler, Albert; Hutson, Alan D
2016-06-15
The receiver operating characteristic (ROC) curve is a popular technique with applications such as investigating the accuracy of a biomarker in discriminating between disease and non-disease groups. A common measure of accuracy of a given diagnostic marker is the area under the ROC curve (AUC). In contrast with the AUC, the partial area under the ROC curve (pAUC) looks into the area with certain specificities (i.e., true negative rate) only, and it can often be clinically more relevant than examining the entire ROC curve. The pAUC is commonly estimated based on a U-statistic with the plug-in sample quantile, making the estimator a non-traditional U-statistic. In this article, we propose an accurate and easy method to obtain the variance of the nonparametric pAUC estimator. The proposed method is easy to implement for both one biomarker test and the comparison of two correlated biomarkers because it simply adapts the existing variance estimator of U-statistics. In this article, we show accuracy and other advantages of the proposed variance estimation method by broadly comparing it with previously existing methods. Further, we develop an empirical likelihood inference method based on the proposed variance estimator through a simple implementation. In an application, we demonstrate that, depending on the inferences by either the AUC or pAUC, we can reach a different decision on the prognostic ability of the same set of biomarkers. Copyright © 2016 John Wiley & Sons, Ltd.
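The U-statistic form of the nonparametric pAUC estimator described in this abstract can be sketched as below. This is only an illustration under stated assumptions, not the authors' implementation: the function name `partial_auc`, the use of NumPy's default (linearly interpolated) sample quantile as the plug-in quantile, and the strict inequality that selects the high-specificity controls are all choices made here. The proposed variance estimator is not reproduced.

```python
import numpy as np

def partial_auc(diseased, healthy, max_fpr):
    """Nonparametric pAUC over false-positive rates in [0, max_fpr].

    Hypothetical helper: averages the kernel I(x > y) + 0.5 * I(x == y)
    over all (diseased, healthy) pairs whose healthy score exceeds the
    plug-in sample quantile of the healthy group.
    """
    x = np.asarray(diseased, dtype=float)
    y = np.asarray(healthy, dtype=float)
    # Plug-in quantile: healthy scores above q are the would-be false
    # positives at specificity 1 - max_fpr (assumed quantile convention).
    q = np.quantile(y, 1.0 - max_fpr)
    y_high = y[y > q]
    diff = x[:, None] - y_high[None, :]
    kernel = (diff > 0).astype(float) + 0.5 * (diff == 0)
    # Normalise by the full number of pairs, not only the retained ones.
    return kernel.sum() / (x.size * y.size)

healthy_scores = np.arange(1, 11)      # synthetic control values 1..10
perfect_marker = [20.0, 21.0, 22.0]    # synthetic cases above every control
print(partial_auc(perfect_marker, healthy_scores, max_fpr=0.2))
```

For a marker that separates the groups perfectly, the estimate equals the width of the false-positive range, which is the largest value the pAUC can take on that range.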
Guidelines for the use of raised pavement markers
DOT National Transportation Integrated Search
1998-10-01
The Manual on Uniform Traffic Control Devices (MUTCD, 1988) provides a general outline for how Raised Pavement Markers (RPMs) should be used, but more specific information is required in order to produce a set of guidelines that are usable by a field...
Iterative framework radiation hybrid mapping
USDA-ARS's Scientific Manuscript database
Building comprehensive radiation hybrid maps for large sets of markers is a computationally expensive process, since the basic mapping problem is equivalent to the traveling salesman problem. The mapping problem is also susceptible to noise, and as a result, it is often beneficial to remove markers ...
Li, Yong; Zhang, Weirui
2015-10-01
Microsatellite markers of Jasminum sambac (Oleaceae) were isolated to investigate wild germplasm resources and provide markers for breeding. Illumina sequencing was used to isolate microsatellite markers from the transcriptome of J. sambac. A total of 1322 microsatellites were identified from 49,772 assembled unigenes. One hundred primer pairs were randomly selected to verify primer amplification efficiency. Out of these tested primer pairs, 31 were successfully amplified: 18 primer pairs yielded a single allele, seven exhibited fixed heterozygosity with two alleles, and only six displayed polymorphisms. This study obtained the first set of microsatellite markers for J. sambac, which will be helpful for the assessment of wild germplasm resources and the development of molecular marker-assisted breeding.
Kourtchev, Ivan; Hellebust, Stig; Bell, Jennifer M; O'Connor, Ian P; Healy, Robert M; Allanic, Arnaud; Healy, David; Wenger, John C; Sodeau, John R
2011-05-01
PM(2.5) samples collected at Cork Harbour, Ireland during summer, autumn, late autumn and winter, 2008-2009 were analyzed for polar organic compounds that are useful markers for aerosol source characterization. The determined compounds include tracers for biomass burning primary particles, fungal spores, markers for secondary organic aerosol (SOA) from isoprene, α-/β-pinene, and d-limonene. Seasonal and temporal variations and other characteristic features of the detected tracers are discussed in terms of aerosol sources and processes. The biogenic species were detected only during the summer period where the contributions of isoprene SOA and fungal spores to the PM(2.5) organic carbon (OC) were estimated to be 1.6% and 1% respectively. The biomass burning markers, and in particular levoglucosan, were present in all samples and attributed to the combustion of cellulose-containing fuels including wood, peat, bituminous and smokeless coal. The contribution of domestic solid fuel (DSF) burning to the measured OC mass concentration was estimated at 10.8, 50, 66.4 and 74.9% for summer, autumn, late autumn and winter periods, respectively, based on factors derived from a series of burning experiments on locally available fuels. Application of an alternative approach, namely principal component analysis-multiple linear regression (PCA-MLR), to the measured concentrations of the polar organic marker compounds used in conjunction with real-time air quality data provided similar trends and estimates for DSF combustion during all seasons except summer. This study clearly demonstrates that, despite the ban on the sale of bituminous coal in Cork and other large urban areas in Ireland, DSF combustion is still the major source of OC during autumn and winter periods and also makes a significant contribution to PM(2.5) levels. 
The developed marker approach for estimating the contribution of DSF combustion to ambient OC concentrations can, in principle, also be applied to other locations. Copyright © 2011 Elsevier B.V. All rights reserved.
Joint confidence region estimation for area under ROC curve and Youden index.
Yin, Jingjing; Tian, Lili
2014-03-15
In the field of diagnostic studies, the area under the ROC curve (AUC) serves as an overall measure of a biomarker/diagnostic test's accuracy. Youden index, defined as the overall correct classification rate minus one at the optimal cut-off point, is another popular index. For continuous biomarkers of binary disease status, although researchers mainly evaluate the diagnostic accuracy using AUC, for the purpose of making diagnosis, Youden index provides an important and direct measure of the diagnostic accuracy at the optimal threshold and hence should be taken into consideration in addition to AUC. Furthermore, AUC and Youden index are generally correlated. In this paper, we initiate the idea of evaluating diagnostic accuracy based on AUC and Youden index simultaneously. As the first step toward this direction, this paper only focuses on the confidence region estimation of AUC and Youden index for a single marker. We present both parametric and non-parametric approaches for estimating joint confidence region of AUC and Youden index. We carry out extensive simulation study to evaluate the performance of the proposed methods. In the end, we apply the proposed methods to a real data set. Copyright © 2013 John Wiley & Sons, Ltd.
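The two accuracy measures discussed in this abstract can be computed empirically as in the sketch below. It assumes the usual definition of the Youden index, J = max over cut-offs c of sensitivity(c) + specificity(c) - 1, and that larger marker values indicate disease; the function names are hypothetical, and the joint confidence region construction proposed in the paper is not shown.

```python
import numpy as np

def empirical_auc(diseased, healthy):
    """Mann-Whitney estimate of the AUC; ties count one half."""
    x = np.asarray(diseased, dtype=float)
    y = np.asarray(healthy, dtype=float)
    diff = x[:, None] - y[None, :]
    return ((diff > 0).sum() + 0.5 * (diff == 0).sum()) / (x.size * y.size)

def youden_index(diseased, healthy):
    """Return (J, cut-off), classifying a subject as diseased when value >= cut-off."""
    x = np.asarray(diseased, dtype=float)
    y = np.asarray(healthy, dtype=float)
    best_j, best_cut = -1.0, None
    # Only observed values can change the empirical sensitivity/specificity.
    for c in np.unique(np.concatenate([x, y])):
        sensitivity = np.mean(x >= c)
        specificity = np.mean(y < c)
        j = sensitivity + specificity - 1.0
        if j > best_j:  # strict '>' keeps the smallest optimal cut-off
            best_j, best_cut = j, c
    return best_j, best_cut

cases = [3, 4, 5, 6]       # synthetic marker values
controls = [1, 2, 3, 4]
print(empirical_auc(cases, controls), youden_index(cases, controls))
```

Because both estimates are computed from the same two samples, they are generally correlated, which is the motivation the abstract gives for a joint rather than a marginal confidence statement.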
Cañas-Álvarez, J J; Mouresan, E F; Varona, L; Díaz, C; Molina, A; Baro, J A; Altarriba, J; Carabaño, M J; Casellas, J; Piedrafita, J
2016-07-01
Linkage disequilibrium (LD) and persistence of phase are fundamental approaches for exploring the genetic basis of economically important traits in cattle, including the identification of QTL for genomic selection and the estimation of effective population size (Ne) to determine the size of the training populations. In this study, we have used the Illumina BovineHD chip in 168 trios of 7 Spanish beef cattle breeds to obtain an overview of the magnitude of LD and the persistence of LD phase through the physical distance between markers. Also, we estimated the time of divergence based on the persistence of the LD phase and calculated past Ne from LD estimates using different alternatives to define the recombination rate. Estimates of average r² (as a measure of LD) for adjacent markers were close to 0.52 in the 7 breeds and decreased with the distance between markers, although at long distances, some LD still remained (0.07 and 0.05 for markers 200 kb and 1 Mb apart, respectively). A panel with a lower boundary of 38,000 SNP would be necessary to launch a successful within-breed genomic selection program. Persistence of phase, measured as the pairwise correlations between estimates of r in 2 breeds at short distances (10 kb), was in the 0.89 to 0.94 range and decreased from a range of 0.33 to 0.52 to a range of 0.01 to 0.08 when marker distance increased from 200 kb to 1 Mb, respectively. The magnitude of the persistence of phase between the Spanish beef breeds was similar to those found in dairy breeds. For across-breed genomic selection, the size of the SNP panels must be in the range of 50,000 to 83,000 SNP. Estimates of past Ne showed values ranging from 26 to 31 for 1 generation ago in all breeds. The divergence among breeds occurred between 129 and 207 generations ago. The results of this study are relevant for the future implementation of within- and across-breed genomic selection programs in the Spanish beef cattle populations.
Our results suggest that a reduced subset of the SNP panel would be enough to achieve an adequate precision of the genomic predictions.
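The abstract does not say which formula was used to convert LD into past effective population size. One widely used approximation is Sved's relation E[r²] ≈ 1 / (1 + 4·Ne·c), with c the recombination distance in Morgans, where markers at distance c are taken to reflect Ne about 1/(2c) generations ago. The sketch below assumes that relation and a uniform recombination rate of 1 cM per Mb; both are assumptions made here, and the function names are hypothetical.

```python
def kb_to_morgans(distance_kb, cm_per_mb=1.0):
    """Convert a physical distance to Morgans (assumed uniform recombination rate)."""
    return distance_kb / 1000.0 * cm_per_mb / 100.0

def ne_from_ld(r2, c_morgans):
    """Solve Sved's approximation r2 = 1 / (1 + 4 * Ne * c) for Ne."""
    return (1.0 / r2 - 1.0) / (4.0 * c_morgans)

def generations_ago(c_morgans):
    """Approximate age (in generations) of the Ne reflected by markers c Morgans apart."""
    return 1.0 / (2.0 * c_morgans)

c = kb_to_morgans(1000.0)   # markers 1 Mb apart, about 0.01 Morgans here
print(ne_from_ld(0.2, c), generations_ago(c))   # r2 = 0.2 is a made-up input
```

Changing `cm_per_mb` is one simple way to mimic the "different alternatives to define the recombination rate" that the abstract mentions.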
Accuracy of genomic predictions in Gyr (Bos indicus) dairy cattle.
Boison, S A; Utsunomiya, A T H; Santos, D J A; Neves, H H R; Carvalheiro, R; Mészáros, G; Utsunomiya, Y T; do Carmo, A S; Verneque, R S; Machado, M A; Panetto, J C C; Garcia, J F; Sölkner, J; da Silva, M V G B
2017-07-01
Genomic selection may accelerate genetic progress in breeding programs of indicine breeds when compared with traditional selection methods. We present results of genomic predictions in Gyr (Bos indicus) dairy cattle of Brazil for milk yield (MY), fat yield (FY), protein yield (PY), and age at first calving using information from bulls and cows. Four different single nucleotide polymorphism (SNP) chips were studied. Additionally, the effect of the use of imputed data on genomic prediction accuracy was studied. A total of 474 bulls and 1,688 cows were genotyped with the Illumina BovineHD (HD; San Diego, CA) and BovineSNP50 (50K) chip, respectively. Genotypes of cows were imputed to HD using FImpute v2.2. After quality check of data, 496,606 markers remained. The HD markers present on the GeneSeek SGGP-20Ki (15,727; Lincoln, NE), 50K (22,152), and GeneSeek GGP-75Ki (65,018) were subset and used to assess the effect of lower SNP density on accuracy of prediction. Deregressed breeding values were used as pseudophenotypes for model training. Data were split into reference and validation to mimic a forward prediction scheme. The reference population consisted of animals whose birth year was ≤2004 and consisted of either only bulls (TR1) or a combination of bulls and dams (TR2), whereas the validation set consisted of younger bulls (born after 2004). Genomic BLUP was used to estimate genomic breeding values (GEBV) and reliability of GEBV (R²_PEV) was based on the prediction error variance approach. Reliability of GEBV ranged from ∼0.46 (FY and PY) to 0.56 (MY) with TR1 and from 0.51 (PY) to 0.65 (MY) with TR2. When averaged across all traits, R²_PEV were substantially higher (R²_PEV of TR1 = 0.50 and TR2 = 0.57) compared with reliabilities of parent averages (0.35) computed from pedigree data and based on diagonals of the coefficient matrix (prediction error variance approach).
Reliability was similar for all the 4 marker panels using either TR1 or TR2, except that imputed HD cow data set led to an inflation of reliability. Reliability of GEBV could be increased by enlarging the limited bull reference population with cow information. A reduced panel of ∼15K markers resulted in reliabilities similar to using HD markers. Copyright © 2017 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Clarke, Charlotte H; Yip, Christine; Badgwell, Donna; Fung, Eric T; Coombes, Kevin R; Zhang, Zhen; Lu, Karen H; Bast, Robert C
2011-09-01
The low prevalence of ovarian cancer demands both high sensitivity (>75%) and specificity (99.6%) to achieve a positive predictive value of 10% for successful early detection. Utilizing a two stage strategy where serum marker(s) prompt the performance of transvaginal sonography (TVS) in a limited number (2%) of women could reduce the requisite specificity for serum markers to 98%. We have attempted to improve sensitivity by combining CA125 with proteomic markers. Sera from 41 patients with early stage (I/II) and 51 with late stage (III/IV) epithelial ovarian cancer, 40 with benign disease and 99 healthy individuals, were analyzed to measure 7 proteins [Apolipoprotein A1 (Apo-A1), truncated transthyretin (TT), transferrin, hepcidin, β-2-microglobulin (β2M), Connective Tissue Activating Protein III (CTAPIII), and Inter-alpha-trypsin inhibitor heavy chain 4 (ITIH4)]. Statistical models were fit by logistic regression, followed by optimization of factors retained in the models determined by optimizing the Akaike Information Criterion. A validation set included 136 stage I ovarian cancers, 140 benign pelvic masses and 174 healthy controls. In a training set analysis, the 3 most effective biomarkers (Apo-A1, TT and CTAPIII) exhibited 54% sensitivity at 98% specificity, CA125 alone produced 68% sensitivity and the combination increased sensitivity to 88%. In a validation set, the marker panel plus CA125 produced a sensitivity of 84% at 98% specificity (P=0.015, McNemar's test). Combining a panel of proteomic markers with CA125 could provide a first step in a sequential two-stage strategy with TVS for early detection of ovarian cancer. Copyright © 2011. Published by Elsevier Inc.
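The link this abstract draws between prevalence, sensitivity, specificity, and positive predictive value follows from Bayes' rule, sketched below. The abstract does not state the prevalence it assumes, so prevalence is left as an explicit input; the function names are hypothetical.

```python
def positive_predictive_value(sensitivity, specificity, prevalence):
    """PPV = true positives / (true positives + false positives)."""
    true_pos = sensitivity * prevalence
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    return true_pos / (true_pos + false_pos)

def required_specificity(sensitivity, prevalence, target_ppv):
    """Specificity needed to reach target_ppv at a given sensitivity and prevalence."""
    true_pos = sensitivity * prevalence
    # Largest false-positive mass that still yields the target PPV.
    false_pos_allowed = true_pos * (1.0 - target_ppv) / target_ppv
    return 1.0 - false_pos_allowed / (1.0 - prevalence)

# Toy inputs chosen only to make the arithmetic easy to follow.
print(positive_predictive_value(0.9, 0.9, 0.5))
print(required_specificity(0.9, 0.5, 0.9))
```

When prevalence is low, nearly all screened women are disease-free, so even a small false-positive rate swamps the true positives. That is why restricting the second-stage test to a small fraction of women relaxes the specificity demanded of the serum markers.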
Clarke, Charlotte H.; Yip, Christine; Badgwell, Donna; Fung, Eric T.; Coombes, Kevin R.; Zhang, Zhen; Lu, Karen H.; Bast, Robert C.
2011-01-01
Objective The low prevalence of ovarian cancer demands both high sensitivity (>75%) and specificity (99.6%) to achieve a positive predictive value of 10% for successful early detection. Utilizing a two stage strategy where serum marker(s) prompt the performance of transvaginal sonography (TVS) in a limited number (2%) of women could reduce the requisite specificity for serum markers to 98%. We have attempted to improve sensitivity by combining CA125 with proteomic markers. Methods Sera from 41 patients with early stage (I/II) and 51 with late stage (III/IV) epithelial ovarian cancer, 40 with benign disease and 99 healthy individuals, were analyzed to measure 7 proteins [Apolipoprotein A1 (Apo-A1), truncated transthyretin (TT), transferrin, hepcidin, β-2-microglobulin (β2M), Connective Tissue Activating Protein III (CTAPIII), and Inter-alpha-trypsin inhibitor heavy chain 4 (ITIH4)]. Statistical models were fit by logistic regression, followed by optimization of factors retained in the models determined by optimizing the Akaike Information Criterion. A validation set included 136 stage I ovarian cancers, 140 benign pelvic masses and 174 healthy controls. Results In a training set analysis, the 3 most effective biomarkers (Apo-A1, TT and CTAPIII) exhibited 54% sensitivity at 98% specificity, CA125 alone produced 68% sensitivity and the combination increased sensitivity to 88%. In a validation set, the marker panel plus CA125 produced a sensitivity of 84% at 98% specificity (P=0.015, McNemar's test). Conclusion Combining a panel of proteomic markers with CA125 could provide a first step in a sequential two-stage strategy with TVS for early detection of ovarian cancer. PMID:21708402
Abd Al Samid, May; Chaudhary, Belal; Khaled, Yazan S; Ammori, Basil J; Elkord, Eyad
2016-03-22
Regulatory T cells (Tregs) comprise numerous heterogeneous subsets with distinct phenotypic and functional features. Identifying Treg markers is critical to investigate the role and clinical impact of various Treg subsets in pathological settings, and also for developing more effective immunotherapies. We have recently shown that non-activated FoxP3-Helios+ and activated FoxP3+/-Helios+ CD4+ T cells express GARP/LAP immunosuppressive markers in healthy donors. In this study we report similar observations in the peripheral blood of patients with pancreatic cancer (PC) and liver metastases from colorectal cancer (LICRC). Comparing levels of different Treg subpopulations in cancer patients and controls, we report that in PC patients, and unlike LICRC patients, there was no increase in Treg levels as defined by FoxP3 and Helios. However, defining Tregs based on GARP/LAP expression showed that FoxP3-LAP+ Tregs in non-activated and activated settings, and FoxP3+Helios+GARP+LAP+ activated Tregs were significantly increased in both groups of patients, compared with controls. This work implies that a combination of Treg-specific markers could be used to more accurately determine expanded Treg subsets and to understand their contribution in cancer settings. Additionally, GARP-/+LAP+ CD4+ T cells made IL-10, and not IFN-γ, and levels of IL-10-secreting CD4+ T cells were elevated in LICRC patients, especially with higher tumor staging. Taken together, our results indicate that investigations of Treg levels in different cancers should consider diverse Treg-related markers such as GARP, LAP, Helios, and others and not only FoxP3 as a sole Treg-specific marker.
Validation of a novel biomarker panel for the detection of ovarian cancer
Leung, Felix; Bernardini, Marcus Q.; Brown, Marshall D.; Zheng, Yingye; Molina, Rafael; Bast, Robert C.; Davis, Gerard; Serra, Stefano; Diamandis, Eleftherios P.; Kulasingam, Vathany
2016-01-01
Background Ovarian cancer (OvCa) is the most lethal gynecological malignancy. Our integrated -omics approach to OvCa biomarker discovery has identified kallikrein 6 (KLK6) and folate-receptor 1 (FOLR1) as promising candidates but these markers require further validation. Methods KLK6, FOLR1, CA125 and HE4 were investigated in three independent serum cohorts with a total of 20 healthy controls, 150 benign controls and 216 OvCa patients. The serum biomarker levels were determined by ELISA or automated immunoassay. Results All biomarkers demonstrated elevations in the sera of OvCa patients compared to controls (p<0.01). Overall, CA125 and HE4 displayed the strongest ability (AUC 0.80 and 0.82, respectively) to identify OvCa patients and the addition of HE4 to CA125 improved the sensitivity from 36% to 67% at a set specificity of 95%. In addition, the combination of HE4 and FOLR1 was a strong predictor of OvCa diagnosis, displaying comparable sensitivity (65%) to the best performing CA125-based models (67%) at a set specificity of 95%. Conclusions The markers identified through our integrated -omics approach performed similarly to the clinically-approved markers CA125 and HE4. Furthermore, HE4 represents a powerful diagnostic marker for OvCa and should be used more routinely in a clinical setting. Impact The implications of our study are two-fold: (1) we have demonstrated the strengths of HE4 alone and in combination with CA125, lending credence to increasing its usage in the clinic; and (2) we have demonstrated the clinical utility of our integrated -omics approach to identifying novel serum markers with comparable performance to clinical markers. PMID:27448593
Makki-Rmida, Faten; Kammoun, Arwa; Mahfoudh, Nadia; Ayadi, Adnene; Gibriel, Abdullah Ahmed; Mallek, Bakhta; Maalej, Leila; Hammami, Zouheir; Maatoug, Samir; Makni, Hafedh; Masmoudi, Saber
2015-12-01
Y chromosome STRs (Y-STRs) are being used frequently in forensic laboratories. Previous studies of Y-STR polymorphisms in different groups of the Tunisian population identified low levels of diversity and discrimination capacity (DC) using various commercial marker sets. This limits the use of such systems for Y-STR genotyping in Tunisia. In our investigation of South Tunisia, 200 unrelated males were typed for the 12 conventional Y-STRs included in the PowerPlex® Y System. An additional set of nine noncore Y-STRs, comprising the DYS446, DYS456, DYS458, DYS388, DYS444, DYS445, DYS449, DYS710, and DYS464 markers, was genotyped and evaluated for its potential in improving DC. Allele frequency, gene diversity, haplotype diversity (HD), and DC calculation revealed that DYS464 was the most diverse marker followed by DYS710 and DYS449 markers. The standard panel of 12 Y-STRs (DC = 80.5%) and the nine markers were combined to obtain DC of 99%. Among the 198 different haplotypes observed, 196 haplotypes were unique (HD = 99.999). Out of the nine noncore set, six Y-STRs (DYS458, DYS456, DYS449, DYS710, DYS444, and DYS464) had the greatest impact on enhancing DC. Our data provided a putative Y-STR combination to be used for genetic and forensic applications. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
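The two summary statistics named in this abstract can be illustrated with the standard definitions: discrimination capacity as the number of distinct haplotypes divided by the number of individuals, and haplotype diversity as Nei's estimator n/(n-1) · (1 - Σ pᵢ²). The abstract does not spell out its formulas, so these definitions are assumptions, and the haplotype labels below are synthetic placeholders. The counts match the abstract: 200 males and 198 distinct haplotypes, 196 of them seen once, which leaves two haplotypes seen twice each.

```python
from collections import Counter

def discrimination_capacity(haplotypes):
    """Number of distinct haplotypes divided by number of individuals."""
    return len(set(haplotypes)) / len(haplotypes)

def haplotype_diversity(haplotypes):
    """Nei's estimator: n / (n - 1) * (1 - sum of squared haplotype frequencies)."""
    n = len(haplotypes)
    counts = Counter(haplotypes).values()
    sum_sq = sum((c / n) ** 2 for c in counts)
    return n / (n - 1) * (1.0 - sum_sq)

# Synthetic labels: 196 singletons plus two haplotypes carried by two men each.
sample = [f"h{i}" for i in range(196)] + ["shared_a"] * 2 + ["shared_b"] * 2
print(discrimination_capacity(sample), haplotype_diversity(sample))
```

On this reconstruction the discrimination capacity is 198/200, in line with the 99% reported for the combined marker set.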
Kumata, Hidetoshi; Mori, Mika; Takahashi, Sho; Takamiya, Shohei; Tsuzuki, Mikio; Uchida, Tatsuya; Fujiwara, Kitao
2011-12-01
To propose new molecular markers for tire-wear emissions, four dihydroresin acids, that is, 8-isopimaren-18-oic acid (I), 8-pimaren-18-oic acid (II), 13β(H)-abieten-18-oic acid (III), and 13α(H)-abiet-8-en-18-oic acid (IV), were identified and investigated for source specificities, distributions, and environmental stabilities. The absence of I-IV in natural sources and the linear correlations between dihydroresin acids with different skeletons in tires and in environmental samples demonstrated that I-IV are specific markers for synthetic rubbers. The ratio of III + IV to the sum of III + IV plus abietic acid showed the resin acids distribution between different environmental compartments receiving contributions from traffic and natural sources. The physicochemical properties and results of photolysis experiments suggested that I-IV can set lower limits for tire-wear contributions to environmental loads of particulate matter (PM) and polycyclic aromatic hydrocarbons with molecular weight ≥202. By comparing III + IV concentrations or (III+IV)/pyrene or (III+IV)/benzo[a]pyrene ratios in tires and those in environmental matrices, the contributions of tire-wear emissions to PM, pyrene, and benzo[a]pyrene were estimated to be 0.68 ± 0.54%, 6.9 ± 4.8%, and 0.37 ± 0.18% in roadside PM and 0.83 ± 0.21%, 0.88 ± 0.52%, and 0.08 ± 0.06% in rooftop PM.
LinkImpute: Fast and Accurate Genotype Imputation for Nonmodel Organisms
Money, Daniel; Gardner, Kyle; Migicovsky, Zoë; Schwaninger, Heidi; Zhong, Gan-Yuan; Myles, Sean
2015-01-01
Obtaining genome-wide genotype data from a set of individuals is the first step in many genomic studies, including genome-wide association and genomic selection. All genotyping methods suffer from some level of missing data, and genotype imputation can be used to fill in the missing data and improve the power of downstream analyses. Model organisms like human and cattle benefit from high-quality reference genomes and panels of reference genotypes that aid in imputation accuracy. In nonmodel organisms, however, genetic and physical maps often are either of poor quality or are completely absent, and there are no panels of reference genotypes available. There is therefore a need for imputation methods designed specifically for nonmodel organisms in which genomic resources are poorly developed and marker order is unreliable or unknown. Here we introduce LinkImpute, a software package based on a k-nearest neighbor genotype imputation method, LD-kNNi, which is designed for unordered markers. No physical or genetic maps are required, and it is designed to work on unphased genotype data from heterozygous species. It exploits the fact that markers useful for imputation often are not physically close to the missing genotype but rather distributed throughout the genome. Using genotyping-by-sequencing data from diverse and heterozygous accessions of apples, grapes, and maize, we compare LD-kNNi with several genotype imputation methods and show that LD-kNNi is fast, comparable in accuracy to the best-existing methods, and exhibits the least bias in allele frequency estimates. PMID:26377960
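The idea behind LD-based k-nearest-neighbour imputation can be sketched in a few lines. This is a simplified illustration, not the LinkImpute implementation: the function name, the use of squared Pearson correlation as the LD measure, the mean absolute genotype difference as the distance, and the 1/(1 + d) vote weights are all assumptions made here. Genotypes are coded 0/1/2 with NaN for missing calls, and no marker order or map is used.

```python
import numpy as np

def ld_knn_impute(genotypes, sample, marker, k=3, l=2):
    """Impute genotypes[sample, marker] from the k nearest samples,
    measuring distance only on the l markers most correlated with `marker`."""
    g = np.asarray(genotypes, dtype=float)
    target = g[:, marker]
    # Step 1: rank the other markers by squared correlation with the target marker.
    scores = []
    for m in range(g.shape[1]):
        if m == marker:
            continue
        ok = ~np.isnan(target) & ~np.isnan(g[:, m])
        if ok.sum() < 3 or np.std(g[ok, m]) == 0 or np.std(target[ok]) == 0:
            continue
        r = np.corrcoef(target[ok], g[ok, m])[0, 1]
        scores.append((r * r, m))
    top = [m for _, m in sorted(scores, reverse=True)[:l]]
    # Step 2: distance to every sample that has a called genotype at the target marker.
    candidates = []
    for s in range(g.shape[0]):
        if s == sample or np.isnan(target[s]):
            continue
        shared = [m for m in top if not np.isnan(g[s, m]) and not np.isnan(g[sample, m])]
        if shared:
            d = float(np.mean(np.abs(g[s, shared] - g[sample, shared])))
            candidates.append((d, s))
    # Step 3: distance-weighted vote among the k nearest neighbours.
    votes = {}
    for d, s in sorted(candidates)[:k]:
        call = int(target[s])
        votes[call] = votes.get(call, 0.0) + 1.0 / (1.0 + d)
    return max(votes, key=votes.get) if votes else None

toy = [[0, 0, 0], [0, 0, 0], [2, 2, 2], [2, 2, 2], [1, 1, 1], [np.nan, 2, 2]]
print(ld_knn_impute(toy, sample=5, marker=0))
```

Selecting informative markers by correlation rather than by position reflects the abstract's point that useful markers are often scattered across the genome rather than physically adjacent to the missing genotype.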
Novel efficient genome-wide SNP panels for the conservation of the highly endangered Iberian lynx.
Kleinman-Ruiz, Daniel; Martínez-Cruz, Begoña; Soriano, Laura; Lucena-Perez, Maria; Cruz, Fernando; Villanueva, Beatriz; Fernández, Jesús; Godoy, José A
2017-07-21
The Iberian lynx (Lynx pardinus) has been acknowledged as the most endangered felid species in the world. An intense contraction and fragmentation during the twentieth century left less than 100 individuals split in two isolated and genetically eroded populations by 2002. Genetic monitoring and management so far have been based on 36 STRs, but their limited variability and the more complex situation of current populations demand more efficient molecular markers. The recent characterization of the Iberian lynx genome identified more than 1.6 million SNPs, of which 1536 were selected and genotyped in an extended Iberian lynx sample. We validated 1492 SNPs and analysed their heterozygosity, Hardy-Weinberg equilibrium, and linkage disequilibrium. We then selected a panel of 343 minimally linked autosomal SNPs from which we extracted subsets optimized for four different typical tasks in conservation applications: individual identification, parentage assignment, relatedness estimation, and admixture classification, and compared their power to currently used STR panels. We ascribed 21 SNPs to chromosome X based on their segregation patterns, and identified one additional marker that showed significant differentiation between sexes. For all applications considered, panels of autosomal SNPs showed higher power than the currently used STR set with only a very modest increase in the number of markers. These novel panels of highly informative genome-wide SNPs provide more powerful, efficient, and flexible tools for the genetic management and non-invasive monitoring of Iberian lynx populations. This example highlights an important outcome of whole-genome studies in genetically threatened species.
Undermethylated DNA as a source of microsatellites from a conifer genome.
Zhou, Y; Bui, T; Auckland, L D; Williams, C G
2002-02-01
Developing microsatellites from the large, highly duplicated conifer genome requires special tools. To improve the efficiency of developing Pinus taeda L. microsatellites, undermethylated (UM) DNA fragments were used to construct a microsatellite-enriched copy library. A methylation-sensitive restriction enzyme, McrBC, was used to enrich for UM DNA before library construction. Digested DNA fragments larger than 9 kb were then excised and digested with RsaI and used to construct nine dinucleotide and trinucleotide libraries. A total of 1016 microsatellite-positive clones were detected among 11 904 clones and 620 of these were unique. Of 245 primer sets that produced a PCR product, 113 could be developed as UM microsatellite markers and 70 were polymorphic. Inheritance and marker informativeness were tested for a random sample of 36 polymorphic markers using a three-generation outbred pedigree. Thirty-one microsatellites (86%) had single-locus inheritance despite the highly duplicated nature of the P. taeda genome. Nineteen UM microsatellites had highly informative intercross mating type configurations. Allele number and frequency were estimated for eleven UM microsatellites using a population survey. Allele numbers for these UM microsatellites ranged from 3 to 12 with an average of 5.7 alleles/locus. Frequencies for the 63 alleles were mostly in the low-common range; only 14 of the 63 were in the rare allele (q < 0.05) class. Enriching for UM DNA was an efficient method for developing polymorphic microsatellites from a large plant genome.
West, Howard
2017-09-01
The current standard of care for molecular marker testing in patients with advanced non-small cell lung cancer (NSCLC) has been evolving over several years and is a product of the quality of the evidence supporting a targeted therapy for a specific molecular marker, the pre-test probability of that marker in the population, and the magnitude of benefit seen with that treatment. Among the markers that have one or more matched targeted therapies, only a few clearly merit prioritized testing in the first-line setting, so that the matched therapy can supplant other first-line alternatives, and only in a subset of patients, currently defined by NSCLC histology. Specifically, this currently includes testing for an activating epidermal growth factor receptor (EGFR) mutation or an anaplastic lymphoma kinase (ALK) or ROS1 rearrangement. This article reviews the history and data supporting the prioritization of these markers in patients with non-squamous NSCLC, a histologically selected population in whom the probability of these markers combined with the anticipated efficacy of targeted therapies against them is high enough to favor these treatments in the first-line setting. In reviewing the evidence supporting this very limited core subset of most valuable molecular markers to detect in the initial workup of such patients, we can also see the criteria that other actionable markers need to meet in order to be widely recognized as valuable enough to warrant prioritized detection in the initial workup of advanced NSCLC.
Liu, Guozheng; Zhao, Yusheng; Gowda, Manje; Longin, C. Friedrich H.; Reif, Jochen C.; Mette, Michael F.
2016-01-01
Bread-making quality traits are central targets for wheat breeding. The objectives of our study were to (1) examine the presence of major effect QTLs for quality traits in a Central European elite wheat population, (2) explore the optimal strategy for predicting the hybrid performance for wheat quality traits, and (3) investigate the effects of marker density and the composition and size of the training population on the accuracy of prediction of hybrid performance. In total 135 inbred lines of Central European bread wheat (Triticum aestivum L.) and 1,604 hybrids derived from them were evaluated for seven quality traits in up to six environments. The 135 parental lines were genotyped using a 90k single-nucleotide polymorphism array. Genome-wide association mapping initially suggested the presence of several quantitative trait loci (QTLs), but cross-validation rather indicated the absence of major effect QTLs for all quality traits except 1000-kernel weight. Genomic selection substantially outperformed marker-assisted selection in predicting hybrid performance. A resampling study revealed that increasing the effective population size in the estimation set of hybrids is relevant to boost the accuracy of prediction for an unrelated test population. PMID:27383841
Alzheimer Disease Biomarkers as Outcome Measures for Clinical Trials in MCI.
Caroli, Anna; Prestia, Annapaola; Wade, Sara; Chen, Kewei; Ayutyanont, Napatkamon; Landau, Susan M; Madison, Cindee M; Haense, Cathleen; Herholz, Karl; Reiman, Eric M; Jagust, William J; Frisoni, Giovanni B
2015-01-01
The aim of this study was to compare the performance and power of the best-established diagnostic biological markers as outcome measures for clinical trials in patients with mild cognitive impairment (MCI). Magnetic resonance imaging, F-18 fluorodeoxyglucose positron emission tomography markers, and Alzheimer's Disease Assessment Scale-cognitive subscale were compared in terms of effect size and statistical power over different follow-up periods in 2 MCI groups, selected from the Alzheimer's Disease Neuroimaging Initiative data set based on cerebrospinal fluid (abnormal cerebrospinal fluid Aβ1-42 concentration-ABETA+) or magnetic resonance imaging evidence of Alzheimer disease (positivity to hippocampal atrophy-HIPPO+). Biomarker progression was modeled through mixed effect models. Scaled slope was chosen as the measure of effect size. Biomarker power was estimated using simulation algorithms. Seventy-four ABETA+ and 51 HIPPO+ MCI patients were included in the study. Imaging biomarkers of neurodegeneration, especially MR measurements, showed the highest performance. For all biomarkers and both MCI groups, power increased with increasing follow-up time, irrespective of biomarker assessment frequency. These findings provide information about biomarker enrichment and outcome measurements that could be employed to reduce MCI patient samples and treatment duration in future clinical trials.
Song, H; Li, L; Ma, P; Zhang, S; Su, G; Lund, M S; Zhang, Q; Ding, X
2018-06-01
This study investigated the efficiency of genomic prediction with adding the markers identified by genome-wide association study (GWAS) using a data set of imputed high-density (HD) markers from 54K markers in Chinese Holsteins. Among 3,056 Chinese Holsteins with imputed HD data, 2,401 individuals born before October 1, 2009, were used for GWAS and a reference population for genomic prediction, and the 220 younger cows were used as a validation population. In total, 1,403, 1,536, and 1,383 significant single nucleotide polymorphisms (SNP; false discovery rate at 0.05) associated with conformation final score, mammary system, and feet and legs were identified, respectively. About 2 to 3% genetic variance of 3 traits was explained by these significant SNP. Only a very small proportion of significant SNP identified by GWAS was included in the 54K marker panel. Three new marker sets (54K+) were herein produced by adding significant SNP obtained by linear mixed model for each trait into the 54K marker panel. Genomic breeding values were predicted using a Bayesian variable selection (BVS) model. The accuracies of genomic breeding value by BVS based on the 54K+ data were 2.0 to 5.2% higher than those based on the 54K data. The imputed HD markers yielded 1.4% higher accuracy on average (BVS) than the 54K data. Both the 54K+ and HD data generated lower bias of genomic prediction, and the 54K+ data yielded the lowest bias in all situations. Our results show that the imputed HD data were not very useful for improving the accuracy of genomic prediction and that adding the significant markers derived from the imputed HD marker panel could improve the accuracy of genomic prediction and decrease the bias of genomic prediction. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
14 CFR 171.207 - Performance requirements.
Code of Federal Regulations, 2014 CFR
2014-01-01
... 14 Aeronautics and Space 3 2014-01-01 2014-01-01 false Performance requirements. 171.207 Section...) NAVIGATIONAL FACILITIES NON-FEDERAL NAVIGATION FACILITIES VHF Marker Beacons § 171.207 Performance requirements. (a) VHF Marker Beacons must meet the performance requirements set forth in the “International...
14 CFR 171.207 - Performance requirements.
Code of Federal Regulations, 2011 CFR
2011-01-01
... 14 Aeronautics and Space 3 2011-01-01 2011-01-01 false Performance requirements. 171.207 Section...) NAVIGATIONAL FACILITIES NON-FEDERAL NAVIGATION FACILITIES VHF Marker Beacons § 171.207 Performance requirements. (a) VHF Marker Beacons must meet the performance requirements set forth in the “International...
14 CFR 171.207 - Performance requirements.
Code of Federal Regulations, 2013 CFR
2013-01-01
... 14 Aeronautics and Space 3 2013-01-01 2013-01-01 false Performance requirements. 171.207 Section...) NAVIGATIONAL FACILITIES NON-FEDERAL NAVIGATION FACILITIES VHF Marker Beacons § 171.207 Performance requirements. (a) VHF Marker Beacons must meet the performance requirements set forth in the “International...
Accurate and ergonomic method of registration for image-guided neurosurgery
NASA Astrophysics Data System (ADS)
Henderson, Jaimie M.; Bucholz, Richard D.
1994-05-01
There has been considerable interest in the development of frameless stereotaxy based upon scalp mounted fiducials. In practice we have experienced difficulty in relating markers to the image data sets in our series of 25 frameless cases, as well as inaccuracy due to scalp movement and the size of the markers. We have developed an alternative system for accurately and conveniently achieving surgical registration for image-guided neurosurgery based on alignment and matching of patient forehead contours. The system consists of a laser contour digitizer which is used in the operating room to acquire forehead contours, editing software for extracting contours from patient image data sets, and a contour-match algorithm for aligning the two contours and performing data set registration. The contour digitizer is tracked by a camera array which relates its position with respect to light emitting diodes placed on the head clamp. Once registered, surgical instrument can be tracked throughout the procedure. Contours can be extracted from either CT or MRI image datasets. The system has proven to be robust in the laboratory setting. Overall error of registration is 1 - 2 millimeters in routine use. Image to patient registration can therefore be achieved quite easily and accurately, without the need for fixation of external markers to the skull, or manually finding markers on the scalp and image datasets. The system is unobtrusive and imposes little additional effort on the neurosurgeon, broadening the appeal of image-guided surgery.
Potential Role of Neuroimaging Markers for Early Diagnosis of Dementia in Primary Care.
Teipel, Stefan; Kilimann, Ingo; Thyrian, Jochen R; Kloppel, Stefan; Hoffmann, Wolfgang
2018-01-01
The use of imaging markers for the diagnosis of predementia and early dementia stages of Alzheimer's disease (AD) has widely been explored in research settings and specialized care. The use of these markers in primary care has yet to be established. Summarize current evidence for the usefulness of imaging markers for AD in primary compared to specialized care settings. Selective overview of the literature, and pilot data on the use of MRI-based hippocampus and basal forebrain volumetry for the discrimination of AD dementia and mild cognitive impairment (MCI) cases from healthy controls in 58 cases from a primary care cohort and 58 matched cases from a memory clinic's sample. Molecular imaging marker of amyloid pathology, and volumetric markers of regional and whole brain atrophy support the diagnosis of AD dementia and MCI due to AD, and contribute to confidence in the differential diagnosis of AD and non-AD related dementias in specialized care. Limited evidence from the literature and our primary care cohort suggests that the diagnostic accuracy of volumetric imaging markers may be similar in the dementia stage of AD, but may be inferior for cases with MCI in primary compared with specialized care. Evidence is still widely lacking on the use of imaging markers for early and differential diagnosis of AD dementia, and detection of prodromal AD in primary care. Further progress to fill this gap will depend on the availability of international multimodal data from well-defined primary care cohorts. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Neocleous, A C; Syngelaki, A; Nicolaides, K H; Schizas, C N
2018-04-01
To estimate the risk of fetal trisomy 21 (T21) and other chromosomal abnormalities (OCA) at 11-13 weeks' gestation using computational intelligence classification methods. As a first step, a training dataset consisting of 72 054 euploid pregnancies, 295 cases of T21 and 305 cases of OCA was used to train an artificial neural network. Then, a two-stage approach was used for stratification of risk and diagnosis of cases of aneuploidy in the blind set. In Stage 1, using four markers, pregnancies in the blind set were classified into no risk and risk. No-risk pregnancies were not examined further, whereas the risk pregnancies were forwarded to Stage 2 for further examination. In Stage 2, using seven markers, pregnancies were classified into three types of risk, namely no risk, moderate risk and high risk. Of 36 328 pregnancies unknown to the system (blind set), 17 512 euploid, two T21 and 18 OCA were classified as no risk in Stage 1. The remaining 18 796 cases were forwarded to Stage 2, of which 7895 euploid, two T21 and two OCA cases were classified as no risk, 10 464 euploid, 83 T21 and 61 OCA as moderate risk and 187 euploid, 50 T21 and 52 OCA as high risk. The sensitivity and the specificity for T21 in Stage 2 were 97.1% and 99.5%, respectively, and the false-positive rate from Stage 1 to Stage 2 was reduced from 51.4% to ∼1%, assuming that the cell-free DNA test could identify all euploid and aneuploid cases. We propose a method for early diagnosis of chromosomal abnormalities that ensures that most T21 cases are classified as high risk at any stage. At the same time, the number of euploid cases subjected to invasive or cell-free DNA examinations was minimized through a routine procedure offered in two stages. Our method is minimally invasive and of relatively low cost, highly effective at T21 identification and it performs better than do other existing statistical methods. Copyright © 2017 ISUOG. Published by John Wiley & Sons Ltd.
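The reported percentages can be cross-checked against the counts given in the abstract. The sketch below is only an illustrative recomputation, not the authors' code. It assumes that a T21 case classified as moderate or high risk counts as detected, that only high-risk euploid cases count as false positives after Stage 2, and that the Stage 1 false-positive rate is the share of euploid cases forwarded to Stage 2. These definitions are assumptions chosen because they reproduce the published figures; all variable names are hypothetical.

```python
# Counts copied from the abstract (blind set).
stage1_no_risk = {"euploid": 17512, "t21": 2, "oca": 18}
stage2 = {
    "no_risk": {"euploid": 7895, "t21": 2, "oca": 2},
    "moderate": {"euploid": 10464, "t21": 83, "oca": 61},
    "high": {"euploid": 187, "t21": 50, "oca": 52},
}

def total(group):
    """Sum all cases in one risk group."""
    return sum(group.values())

forwarded = sum(total(g) for g in stage2.values())
blind_set = total(stage1_no_risk) + forwarded

euploid_all = stage1_no_risk["euploid"] + sum(g["euploid"] for g in stage2.values())
t21_all = stage1_no_risk["t21"] + sum(g["t21"] for g in stage2.values())

# Assumption: moderate- or high-risk T21 cases count as detected.
t21_detected = stage2["moderate"]["t21"] + stage2["high"]["t21"]
sensitivity = t21_detected / t21_all

# Assumption: only high-risk euploid cases are false positives after Stage 2.
specificity = 1 - stage2["high"]["euploid"] / euploid_all

# Assumption: Stage 1 false positives are euploid cases forwarded to Stage 2.
fpr_stage1 = (euploid_all - stage1_no_risk["euploid"]) / euploid_all

print(f"blind set: {blind_set}, forwarded to Stage 2: {forwarded}")
print(f"sensitivity: {sensitivity:.1%}, specificity: {specificity:.1%}")
print(f"Stage 1 false-positive rate: {fpr_stage1:.1%}")
```

Under these assumptions the counts are internally consistent with the abstract: 36 328 cases in the blind set, 18 796 forwarded, and rates of 97.1%, 99.5% and 51.4%.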
Daugherty, Bethany L; Schap, TusaRebecca E; Ettienne-Gittens, Reynolette; Zhu, Fengqing M; Bosch, Marc; Delp, Edward J; Ebert, David S; Kerr, Deborah A; Boushey, Carol J
2012-04-13
The development of a mobile telephone food record has the potential to ameliorate much of the burden associated with current methods of dietary assessment. When using the mobile telephone food record, respondents capture an image of their foods and beverages before and after eating. Methods of image analysis and volume estimation allow for automatic identification and volume estimation of foods. To obtain a suitable image, all foods and beverages and a fiducial marker must be included in the image. To evaluate a defined set of skills among adolescents and adults when using the mobile telephone food record to capture images and to compare the perceptions and preferences between adults and adolescents regarding their use of the mobile telephone food record. We recruited 135 volunteers (78 adolescents, 57 adults) to use the mobile telephone food record for one or two meals under controlled conditions. Volunteers received instruction for using the mobile telephone food record prior to their first meal, captured images of foods and beverages before and after eating, and participated in a feedback session. We used chi-square for comparisons of the set of skills, preferences, and perceptions between the adults and adolescents, and McNemar test for comparisons within the adolescents and adults. Adults were more likely than adolescents to include all foods and beverages in the before and after images, but both age groups had difficulty including the entire fiducial marker. Compared with adolescents, significantly more adults had to capture more than one image before (38% vs 58%, P = .03) and after (25% vs 50%, P = .008) meal session 1 to obtain a suitable image. Despite being less efficient when using the mobile telephone food record, adults were more likely than adolescents to perceive remembering to capture images as easy (P < .001). 
A majority of both age groups were able to follow the defined set of skills; however, adults were less efficient when using the mobile telephone food record. Additional interactive training will likely be necessary for all users to provide extra practice in capturing images before entering a free-living situation. These results will inform age-specific development of the mobile telephone food record that may translate to a more accurate method of dietary assessment.
Influence of the number of elongated fiducial markers on the localization accuracy of the prostate
NASA Astrophysics Data System (ADS)
de Boer, Johan; de Bois, Josien; van Herk, Marcel; Sonke, Jan-Jakob
2012-10-01
Implanting fiducial markers for localization purposes has become an accepted practice in radiotherapy for prostate cancer. While many correction strategies correct for translations only, advanced correction protocols also require knowledge of the rotation of the prostate. For this purpose, typically, three or more markers are implanted. Elongated fiducial markers provide more information about their orientation than traditional round or cylindrical markers. Potentially, fewer markers are required. In this study, we evaluate the effect of the number of elongated markers on the localization accuracy of the prostate. To quantify the localization error, we developed a model that estimates, at arbitrary locations in the prostate, the registration error caused by translational and rotational uncertainties of the marker registration. Every combination of one, two and three markers was analysed for a group of 24 patients. The average registration errors at the prostate surface were 0.3-0.8 mm and 0.4-1 mm for registrations on, respectively, three markers and two markers located on different sides of the prostate. Substantial registration errors (2.0-2.2 mm) occurred at the prostate surface contralateral to the markers when two markers were implanted on the same side of the prostate or only one marker was used. In conclusion, there is no benefit in using three elongated markers: two markers accurately localize the prostate if they are implanted at some distance from each other.
Short interspersed CAN SINE elements as prognostic markers in canine mammary neoplasia.
Gelaleti, Gabriela B; Granzotto, Adriana; Leonel, Camila; Jardim, Bruna V; Moschetta, Marina G; Carareto, Claudia M A; Zuccari, Debora Ap P C
2014-01-01
The genome of mammals is characterized by a large number of non-LTR retrotransposons, and among them, the CAN SINEs are characteristic of the canine species. Small amounts of DNA freely circulate in normal blood serum and high amounts are found in human patients with cancer, characterizing it as a candidate tumor-biomarker. The aim of this study was to estimate, through its absolute expression, the number of copies of CAN SINE sequences present in free circulating DNA of female dogs with mammary cancer, in order to correlate with the clinical and pathological characteristics and the follow-up period. The copy number of CAN SINE sequences was estimated by qPCR in 28 female dogs with mammary neoplasia. The univariate analysis showed an increased number of copies in female dogs >10 years old (p=0.02) and in those with tumor time >18 months (p<0.05). The Kaplan-Meier test demonstrated a negative correlation between an increased number of copies and survival time (p=0.03). High amounts of CAN SINE fragments can be good markers for the detection of tumor DNA in blood and may characterize it as a marker of poor prognosis, being related to female dogs with shorter survival times. This estimate can be used as a prognostic marker in non-invasive breast cancer research and is useful in predicting tumor progression and patient monitoring.
Zeestraten, Eva Anna; Benjamin, Philip; Lambert, Christian; Lawrence, Andrew John; Williams, Owen Alan; Morris, Robin Guy; Barrick, Thomas Richard; Markus, Hugh Stephen
2016-01-01
Cerebral small vessel disease (SVD) is the major cause of vascular cognitive impairment, resulting in significant disability and reduced quality of life. Cognitive tests have been shown to be insensitive to change in longitudinal studies and, therefore, sensitive surrogate markers are needed to monitor disease progression and assess treatment effects in clinical trials. Diffusion tensor imaging (DTI) is thought to offer great potential in this regard. Sensitivity of the various parameters that can be derived from DTI is however unknown. We aimed to evaluate the differential sensitivity of DTI markers to detect SVD progression, and to estimate sample sizes required to assess therapeutic interventions aimed at halting decline based on DTI data. We investigated 99 patients with symptomatic SVD, defined as clinical lacunar syndrome with MRI confirmation of a corresponding infarct as well as confluent white matter hyperintensities over a 3 year follow-up period. We evaluated change in DTI histogram parameters using linear mixed effect models and calculated sample size estimates. Over a three-year follow-up period we observed a decline in fractional anisotropy and increase in diffusivity in white matter tissue and most parameters changed significantly. Mean diffusivity peak height was the most sensitive marker for SVD progression as it had the smallest sample size estimate. This suggests disease progression can be monitored sensitively using DTI histogram analysis and confirms DTI's potential as surrogate marker for SVD.
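The link between a marker's sensitivity to change and the sample size it implies can be illustrated with a generic two-arm calculation. The sketch below is not the authors' procedure: it uses the standard normal-approximation formula for comparing mean change between two equal arms, and the function name, the 30% treatment effect, and the example mean and standard deviation values are all hypothetical.

```python
import math
from statistics import NormalDist

def n_per_arm(mean_change, sd_change, effect_fraction, alpha=0.05, power=0.80):
    """Generic per-arm sample size for detecting a fractional slowing of change.

    mean_change     -- expected change in the untreated arm (hypothetical input)
    sd_change       -- between-subject SD of that change (hypothetical input)
    effect_fraction -- fraction of the change removed by treatment, e.g. 0.3
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_power = NormalDist().inv_cdf(power)
    delta = effect_fraction * mean_change  # absolute difference to detect
    return math.ceil(2 * (z_alpha + z_power) ** 2 * sd_change ** 2 / delta ** 2)

# Two made-up markers: the one with the larger change relative to its
# variability needs fewer patients per arm.
sensitive_marker = n_per_arm(mean_change=1.0, sd_change=1.0, effect_fraction=0.3)
noisy_marker = n_per_arm(mean_change=1.0, sd_change=2.0, effect_fraction=0.3)
print(sensitive_marker, noisy_marker)
```

In this framing, the marker with the largest mean change relative to its variability yields the smallest sample size estimate, which is the sense in which a marker is called most sensitive here.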
Simultaneous Estimation of Withaferin A and Z-Guggulsterone in Marketed Formulation by RP-HPLC.
Agrawal, Poonam; Vegda, Rashmi; Laddha, Kirti
2015-07-01
A simple, rapid, precise and accurate high-performance liquid chromatography (HPLC) method was developed for simultaneous estimation of withaferin A and Z-guggulsterone in a polyherbal formulation containing Withania somnifera and Commiphora wightii. The chromatographic separation was achieved on a Purosphere RP-18 column (particle size 5 µm) with a mobile phase consisting of Solvent A (acetonitrile) and Solvent B (water) with the following gradients: 0-7 min, 50% A in B; 7-9 min, 50-80% A in B; 9-20 min, 80% A in B at a flow rate of 1 mL/min and detection at 235 nm. The marker compounds were well separated on the chromatogram within 20 min. The results obtained indicate accuracy and reliability of the developed simultaneous HPLC method for the quantification of withaferin A and Z-guggulsterone. The proposed method was found to be reproducible, specific, precise and accurate for simultaneous estimation of these marker compounds in a combined dosage form. The HPLC method was appropriate and the two markers are well resolved, enabling efficient quantitative analysis of withaferin A and Z-guggulsterone. The method can be successfully used for quantitative analysis of these two marker constituents in marketed combination polyherbal formulations. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
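The gradient program described above can be written as a piecewise function of run time. This is a minimal sketch that assumes the 7-9 min step is a linear ramp from 50% to 80% acetonitrile, which the abstract implies but does not state explicitly; the function name is hypothetical.

```python
def percent_acetonitrile(t_min):
    """Return % Solvent A (acetonitrile) in water at run time t_min (minutes)."""
    if not 0 <= t_min <= 20:
        raise ValueError("gradient is defined for 0-20 min only")
    if t_min <= 7:
        return 50.0                      # 0-7 min: isocratic 50% A
    if t_min <= 9:
        # 7-9 min: assumed linear ramp from 50% to 80% A
        return 50.0 + (t_min - 7) * (80.0 - 50.0) / (9 - 7)
    return 80.0                          # 9-20 min: isocratic 80% A

for t in (0, 7, 8, 9, 20):
    print(t, percent_acetonitrile(t))
```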
Liao, Wan-Jin; Hu, Yi; Zhu, Bi-Ru; Zhao, Xia-Qing; Zeng, Yan-Fei; Zhang, Da-Yong
2009-01-01
Background and Aims Reduction in female fitness in large clones can occur as a result of increased geitonogamous self-fertilization and its influence through inbreeding depression. This possibility was investigated in the self-compatible, bee-pollinated perennial herb Aconitum kusnezoffii which varies in clone size. Methods Field investigations were conducted on pollinator behaviour, flowering phenology and variation in seed set. The effects of self-pollination following controlled self- and cross-pollination were also examined. Selfing rates of differently sized clones were assessed using allozyme markers. Key Results High rates of geitonogamous pollination were associated with large display size. Female fitness at the ramet level decreased with clone size. Fruit and seed set under cross-pollination were significantly higher than those under self-pollination. The pre-dispersal inbreeding depression was estimated as 0·502 based on the difference in seed set per flower between self- and cross-pollinated flowers. Selfing rates of differently sized clones did not differ. Conclusions It is concluded that in A. kusnezoffii the negative effects of self-pollination causing reduced female fertility with clone size arise primarily from a strong early-acting inbreeding depression leading to the abortion of selfed embryos prior to seed maturation. PMID:19767308
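Inbreeding depression is commonly expressed as one minus the ratio of selfed to outcrossed performance. The sketch below assumes that this conventional definition applies to seed set per flower; the abstract does not give the underlying seed-set values, so the inputs shown are hypothetical and were chosen only to illustrate a value of the reported magnitude.

```python
def inbreeding_depression(w_self, w_cross):
    """Conventional coefficient: delta = 1 - w_self / w_cross.

    w_self  -- mean seed set per flower after self-pollination
    w_cross -- mean seed set per flower after cross-pollination
    """
    if w_cross <= 0:
        raise ValueError("w_cross must be positive")
    return 1 - w_self / w_cross

# Hypothetical values: selfed flowers setting about half as many seeds
# as crossed flowers give a delta close to 0.5.
print(round(inbreeding_depression(w_self=0.498, w_cross=1.0), 3))
```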
Marker Registration Technique for Handwritten Text Marker in Augmented Reality Applications
NASA Astrophysics Data System (ADS)
Thanaborvornwiwat, N.; Patanukhom, K.
2018-04-01
Marker registration is a fundamental process to estimate camera poses in marker-based Augmented Reality (AR) systems. We developed an AR system that creates corresponding virtual objects on handwritten text markers. This paper presents a new method for registration that is robust to low-content text markers, variation in camera poses, and variation in handwriting styles. The proposed method uses Maximally Stable Extremal Regions (MSER) and polygon simplification for feature point extraction. The experiment shows that extracting only five feature points per image is sufficient to provide the best registration results. An exhaustive search is used to find the best matching pattern of the feature points in two images. We also compared the performance of the proposed method with some existing registration methods and found that the proposed method can provide better accuracy and time efficiency.
Microbial source tracking markers at three inland recreational lakes in Ohio, 2011
Francy, Donna S.; Stelzer, Erin A.
2012-01-01
During the 2011 recreational season, samples were collected for E. coli and microbial source tracking (MST) marker concentrations to begin to understand potential sources of fecal contamination at three inland recreational lakes in Ohio - Buckeye, Atwood, and Tappan Lakes. The results from 32 regular samples, 4 field blanks, and 7 field replicates collected at 5 sites are presented in this report. At the three lakes, the ruminant-associated marker was found most often (57-73 percent of samples) but at estimated quantities, followed by the dog-associated marker (30-43 percent of samples). The human-associated marker was found in 14 and 50 percent of samples from Atwood and Tappan Lakes, respectively, but was not found in any samples from the two Buckeye Lake sites. The gull-associated marker was detected in only two samples, both from Tappan Lake.
Tao, Yebin; Sánchez, Brisa N; Mukherjee, Bhramar
2015-03-30
Many existing cohort studies designed to investigate health effects of environmental exposures also collect data on genetic markers. The Early Life Exposures in Mexico to Environmental Toxicants project, for instance, has been genotyping single nucleotide polymorphisms on candidate genes involved in metal and nutrient metabolism and also in potentially shared metabolic pathways with the environmental exposures. Given the longitudinal nature of these cohort studies, rich exposure and outcome data are available to address novel questions regarding gene-environment interaction (G × E). Latent variable (LV) models have been effectively used for dimension reduction, helping with multiple testing and multicollinearity issues in the presence of correlated multivariate exposures and outcomes. In this paper, we first propose a modeling strategy, based on LV models, to examine the association between repeated outcome measures (e.g., child weight) and a set of correlated exposure biomarkers (e.g., prenatal lead exposure). We then construct novel tests for G × E effects within the LV framework to examine effect modification of outcome-exposure association by genetic factors (e.g., the hemochromatosis gene). We consider two scenarios: one allowing dependence of the LV models on genes and the other assuming independence between the LV models and genes. We combine the two sets of estimates by shrinkage estimation to trade off bias and efficiency in a data-adaptive way. Using simulations, we evaluate the properties of the shrinkage estimates, and in particular, we demonstrate the need for this data-adaptive shrinkage given repeated outcome measures, exposure measures possibly repeated and time-varying gene-environment association. Copyright © 2014 John Wiley & Sons, Ltd.
NASA Astrophysics Data System (ADS)
Stone, Elizabeth A.; Schauer, James J.; Pradhan, Bidya Banmali; Dangol, Pradeep Man; Habib, Gazala; Venkataraman, Chandra; Ramanathan, V.
2010-03-01
This study focuses on improving source apportionment of carbonaceous aerosol in South Asia and consists of three parts: (1) development of novel molecular marker-based profiles for real-world biofuel combustion, (2) application of these profiles to a year-long data set, and (3) evaluation of profiles by an in-depth sensitivity analysis. Emissions profiles for biomass fuels were developed through source testing of a residential stove commonly used in South Asia. Wood fuels were combusted at high and low rates, which corresponded to source profiles high in organic carbon (OC) or high in elemental carbon (EC), respectively. Crop wastes common to the region, including rice straw, mustard stalk, jute stalk, soybean stalk, and animal residue burnings, were also characterized. Biofuel profiles were used in a source apportionment study of OC and EC in Godavari, Nepal. This site is located in the foothills of the Himalayas and was selected for its well-mixed and regionally impacted air masses. At Godavari, daily samples of fine particulate matter (PM2.5) were collected throughout the year of 2006, and the annual trends in particulate mass, OC, and EC followed the occurrence of a regional haze in South Asia. Maximum concentrations occurred during the dry winter season and minimum concentrations occurred during the summer monsoon season. Specific organic compounds unique to aerosol sources, molecular markers, were measured in monthly composite samples. These markers implicated motor vehicles, coal combustion, biomass burning, cow dung burning, vegetative detritus, and secondary organic aerosol as sources of carbonaceous aerosol. A molecular marker-based chemical mass balance (CMB) model provided a quantitative assessment of primary source contributions to carbonaceous aerosol. The new profiles were compared to widely used biomass burning profiles from the literature in a sensitivity analysis. 
This analysis indicated a high degree of stability in estimates of source contributions to OC when different biomass profiles were used. The majority of OC was unapportioned to primary sources and was estimated to be of secondary origin, while biomass combustion was the next-largest source of OC. The CMB apportionment of EC to primary sources was unstable due to the diversity of biomass burning conditions in the region. The model results suggested that biomass burning and fossil fuel were important contributors to EC, but could not reconcile their relative contributions.
A genetic map and germplasm diversity estimation of Mangifera indica (mango) with SNPs
USDA-ARS's Scientific Manuscript database
Mango (Mangifera indica) is often referred to as the “King of Fruits”. As the first steps in developing a mango genomics project, we genotyped 582 individuals comprising six mapping populations with 1054 SNP markers. The resulting consensus map had 20 linkage groups defined by 726 SNP markers with...
USDA-ARS's Scientific Manuscript database
The development of resources for genomic studies in Mangifera indica (mango) will allow marker-assisted selection and identification of genetically diverse germplasm, greatly aiding mango breeding programs. We report here a first step in developing such resources, our identification of thousands una...
Financial feasibility of marker-aided selection in Douglas-fir.
G.R. Johnson; N.C. Wheeler; S.H. Strauss
2000-01-01
The land area required for a marker-aided selection (MAS) program to break-even (i.e., have equal costs and benefits) was estimated using computer simulation for coastal Douglas-fir (Pseudotsuga menziesii (Mirb.) Franco) in the Pacific Northwestern United States. We compared the selection efficiency obtained when using an index that included the...
Genetic architecture and biological basis for feed efficiency in dairy cattle
USDA-ARS's Scientific Manuscript database
The genetic architecture of residual feed intake (RFI) and related traits was evaluated using a dataset of 2,894 cows. A Bayesian analysis estimated that markers accounted for 14% of the variance in RFI, and that RFI had considerable genetic variation. Effects of marker windows were small, but QTL p...
A genetically anchored physical framework for Theobroma cacao cv. Matina 1-6
2011-01-01
Background The fermented dried seeds of Theobroma cacao (cacao tree) are the main ingredient in chocolate. World cocoa production was estimated to be 3 million tons in 2010 with an annual estimated average growth rate of 2.2%. The cacao bean production industry is currently under threat from a rise in fungal diseases including black pod, frosty pod, and witches' broom. In order to address these issues, genome-sequencing efforts have been initiated recently to facilitate identification of genetic markers and genes that could be utilized to accelerate the release of robust T. cacao cultivars. However, problems inherent with assembly and resolution of distal regions of complex eukaryotic genomes, such as gaps, chimeric joins, and unresolvable repeat-induced compressions, have been unavoidably encountered with the sequencing strategies selected. Results Here, we describe the construction of a BAC-based integrated genetic-physical map of the T. cacao cultivar Matina 1-6 which is designed to augment and enhance these sequencing efforts. Three BAC libraries, each comprised of 10× coverage, were constructed and fingerprinted. 230 genetic markers from a high-resolution genetic recombination map and 96 Arabidopsis-derived conserved ortholog set (COS) II markers were anchored using pooled overgo hybridization. A dense tile path consisting of 29,383 BACs was selected and end-sequenced. The physical map consists of 154 contigs and 4,268 singletons. Forty-nine contigs are genetically anchored and ordered to chromosomes for a total span of 307.2 Mbp. The unanchored contigs (105) span 67.4 Mbp and therefore the estimated genome size of T. cacao is 374.6 Mbp. A comparative analysis with A. thaliana, V. vinifera, and P. trichocarpa suggests that comparisons of the genome assemblies of these distantly related species could provide insights into genome structure, evolutionary history, conservation of functional sites, and improvements in physical map assembly. 
A comparison between the two T. cacao cultivars Matina 1-6 and Criollo indicates a high degree of collinearity in their genomes, yet rearrangements were also observed. Conclusions The results presented in this study are a stand-alone resource for functional exploitation and enhancement of Theobroma cacao but are also expected to complement and augment ongoing genome-sequencing efforts. This resource will serve as a template for refinement of the T. cacao genome through gap-filling, targeted re-sequencing, and resolution of repetitive DNA arrays. PMID:21846342
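The genome size estimate in this record follows directly from the contig spans it reports. The short check below only re-adds the published figures; the anchored fraction is a derived quantity that the abstract does not state, and the variable names are hypothetical.

```python
# Figures copied from the abstract.
anchored_contigs, anchored_span_mbp = 49, 307.2
unanchored_contigs, unanchored_span_mbp = 105, 67.4

total_contigs = anchored_contigs + unanchored_contigs
genome_size_mbp = anchored_span_mbp + unanchored_span_mbp

# Derived here, not reported in the abstract: share of the estimated
# genome covered by genetically anchored contigs.
anchored_fraction = anchored_span_mbp / genome_size_mbp

print(total_contigs)                 # contigs in the physical map
print(round(genome_size_mbp, 1))     # estimated genome size in Mbp
print(f"{anchored_fraction:.1%}")    # anchored share of the estimate
```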
A genetically anchored physical framework for Theobroma cacao cv. Matina 1-6.
Saski, Christopher A; Feltus, Frank A; Staton, Margaret E; Blackmon, Barbara P; Ficklin, Stephen P; Kuhn, David N; Schnell, Raymond J; Shapiro, Howard; Motamayor, Juan Carlos
2011-08-16
The fermented dried seeds of Theobroma cacao (cacao tree) are the main ingredient in chocolate. World cocoa production was estimated to be 3 million tons in 2010 with an annual estimated average growth rate of 2.2%. The cacao bean production industry is currently under threat from a rise in fungal diseases including black pod, frosty pod, and witches' broom. In order to address these issues, genome-sequencing efforts have been initiated recently to facilitate identification of genetic markers and genes that could be utilized to accelerate the release of robust T. cacao cultivars. However, problems inherent with assembly and resolution of distal regions of complex eukaryotic genomes, such as gaps, chimeric joins, and unresolvable repeat-induced compressions, have been unavoidably encountered with the sequencing strategies selected. Here, we describe the construction of a BAC-based integrated genetic-physical map of the T. cacao cultivar Matina 1-6 which is designed to augment and enhance these sequencing efforts. Three BAC libraries, each comprised of 10× coverage, were constructed and fingerprinted. 230 genetic markers from a high-resolution genetic recombination map and 96 Arabidopsis-derived conserved ortholog set (COS) II markers were anchored using pooled overgo hybridization. A dense tile path consisting of 29,383 BACs was selected and end-sequenced. The physical map consists of 154 contigs and 4,268 singletons. Forty-nine contigs are genetically anchored and ordered to chromosomes for a total span of 307.2 Mbp. The unanchored contigs (105) span 67.4 Mbp and therefore the estimated genome size of T. cacao is 374.6 Mbp. A comparative analysis with A. thaliana, V. vinifera, and P. trichocarpa suggests that comparisons of the genome assemblies of these distantly related species could provide insights into genome structure, evolutionary history, conservation of functional sites, and improvements in physical map assembly. A comparison between the two T. cacao cultivars Matina 1-6 and Criollo indicates a high degree of collinearity in their genomes, yet rearrangements were also observed. The results presented in this study are a stand-alone resource for functional exploitation and enhancement of Theobroma cacao but are also expected to complement and augment ongoing genome-sequencing efforts. This resource will serve as a template for refinement of the T. cacao genome through gap-filling, targeted re-sequencing, and resolution of repetitive DNA arrays.
Mester, David; Ronin, Yefim; Schnable, Patrick; Aluru, Srinivas; Korol, Abraham
2015-01-01
Our aim was to develop a fast and accurate algorithm for constructing consensus genetic maps for chip-based SNP genotyping data with a high proportion of shared markers between mapping populations. Chip-based genotyping of SNP markers allows the production of high-density genetic maps with a relatively standardized set of marker loci for different mapping populations. The availability of a standard high-throughput mapping platform simplifies consensus analysis: unique markers can be ignored at the consensus mapping stage, which reduces the mathematical complexity of the problem and in turn allows larger mapping data sets to be analyzed using global optimization criteria instead of local ones. Our three-phase analytical scheme includes automatic selection of ~100-300 of the most informative (resolvable by recombination) markers per linkage group, building a stable skeletal marker order for each data set and verifying it using jackknife re-sampling, and consensus mapping analysis based on a global optimization criterion. A novel Evolution Strategy optimization algorithm with a global optimization criterion presented in this paper is able to generate high-quality, ultra-dense consensus maps, with many thousands of markers per genome. This algorithm utilizes "potentially good orders" in the initial solution and in the new mutation procedures that generate trial solutions, making it possible to obtain a consensus order in reasonable time. The developed algorithm, tested on a wide range of simulated data and real-world data (Arabidopsis), outperformed two tested state-of-the-art algorithms in mapping accuracy and computation time. PMID:25867943
Kariminia, Amina; Holtan, Shernan G.; Ivison, Sabine; Rozmus, Jacob; Hebert, Marie-Josée; Martin, Paul J.; Lee, Stephanie J.; Wolff, Daniel; Subrt, Peter; Abdossamadi, Sayeh; Sung, Susanna; Storek, Jan; Levings, Megan; Aljurf, Mahmoud; Arora, Mukta; Cutler, Corey; Gallagher, Geneviève; Kuruvilla, John; Lipton, Jeff; Nevill, Thomas J.; Newell, Laura F.; Panzarella, Tony; Pidala, Joseph; Popradi, Gizelle; Szwajcer, David; Tay, Jason; Toze, Cynthia L.; Walker, Irwin; Couban, Stephen; Storer, Barry E.
2016-01-01
Chronic graft-versus-host disease (cGVHD) remains one of the most significant long-term complications after allogeneic blood and marrow transplantation. Diagnostic biomarkers for cGVHD are needed for early diagnosis and may guide identification of prognostic markers. No cGVHD biomarker has yet been validated for use in clinical practice. We both evaluated previously known markers and performed discovery-based analysis for cGVHD biomarkers in 2 independent test sets (total of 36 cases ≤1 month from diagnosis and 31 time-matched controls with no cGVHD). On the basis of these results, 11 markers were selected and evaluated in 2 independent replication cohorts (total of 134 cGVHD cases and 154 controls). cGVHD cases and controls were evaluated for several clinical covariates, and their impact on biomarkers was identified by univariate analysis. The 2 replication sets were relatively disparate in the biomarkers they replicated. Only sBAFF and, most consistently, CXCL10 were identified as significant in both replication sets. Other markers identified as significant in only 1 replication set included intercellular adhesion molecule 1 (ICAM-1), anti-LG3, aminopeptidase N, CXCL9, endothelin-1, and gelsolin. Multivariate analysis found that all covariates evaluated affected interpretation of the biomarkers. CXCL10 had an increased significance in combination with anti-LG3 and CXCL9, or inversely with CXCR3+CD56bright natural killer (NK) cells. In this large, comprehensive evaluation, cGVHD biomarkers showed significant heterogeneity and were affected by several covariates. Only CXCL10 strongly correlated in both replication sets. Future analyses for plasma cGVHD biomarkers will need to be performed on very large patient groups with consideration of multiple covariates. PMID:27020088
Analysis of facial motion patterns during speech using a matrix factorization algorithm
Lucero, Jorge C.; Munhall, Kevin G.
2008-01-01
This paper presents an analysis of facial motion during speech to identify linearly independent kinematic regions. The data consists of three-dimensional displacement records of a set of markers located on a subject’s face while producing speech. A QR factorization with column pivoting algorithm selects a subset of markers with independent motion patterns. The subset is used as a basis to fit the motion of the other facial markers, which determines facial regions of influence of each of the linearly independent markers. Those regions constitute kinematic “eigenregions” whose combined motion produces the total motion of the face. Facial animations may be generated by driving the independent markers with collected displacement records. PMID:19062866
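The marker-selection step described in this abstract can be illustrated with a minimal sketch. The abstract does not give the authors' implementation, so the following is only an assumption-laden illustration: it uses a greedy Gram-Schmidt procedure with column pivoting (which produces the same pivot order as a column-pivoted QR factorization in exact arithmetic), and the function name `select_independent_markers`, the tolerance, and the toy displacement records are all hypothetical.

```python
import math


def select_independent_markers(columns, max_markers, tol=1e-9):
    """Return marker indices in the order chosen by greedy column pivoting.

    columns[j] is the displacement record of marker j (a list of floats).
    Illustrative sketch only; not the implementation used in the paper.
    """
    residual = [list(col) for col in columns]
    # Scale the stopping threshold by the largest initial column norm.
    scale = max(math.sqrt(sum(x * x for x in col)) for col in residual)
    chosen = []
    for _ in range(min(max_markers, len(columns))):
        norms = [math.sqrt(sum(x * x for x in col)) for col in residual]
        pivot = max(range(len(residual)), key=lambda j: norms[j])
        if norms[pivot] <= tol * scale:
            break  # remaining markers are (numerically) linear combinations
        chosen.append(pivot)
        # Normalize the pivot column and remove its direction from every column.
        q = [x / norms[pivot] for x in residual[pivot]]
        for j, col in enumerate(residual):
            proj = sum(a * b for a, b in zip(q, col))
            residual[j] = [c - proj * a for c, a in zip(col, q)]
    return chosen


# Hypothetical toy data: four markers whose motion spans only two dimensions.
toy_records = [[1.0, 0.0, 0.0], [2.0, 0.0, 0.0], [0.0, 3.0, 0.0], [1.0, 3.0, 0.0]]
selected = select_independent_markers(toy_records, max_markers=3)
```

In this toy case only two markers are returned even though three were requested, because the remaining records are linear combinations of the selected ones. For real data, a library routine for pivoted QR would normally be preferable to this hand-written loop.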
Saatchi, Mahdi; McClure, Mathew C; McKay, Stephanie D; Rolf, Megan M; Kim, JaeWoo; Decker, Jared E; Taxis, Tasia M; Chapple, Richard H; Ramey, Holly R; Northcutt, Sally L; Bauck, Stewart; Woodward, Brent; Dekkers, Jack C M; Fernando, Rohan L; Schnabel, Robert D; Garrick, Dorian J; Taylor, Jeremy F
2011-11-28
Genomic selection is a recently developed technology that is beginning to revolutionize animal breeding. The objective of this study was to estimate marker effects to derive prediction equations for direct genomic values for 16 routinely recorded traits of American Angus beef cattle and quantify corresponding accuracies of prediction. Deregressed estimated breeding values were used as observations in a weighted analysis to derive direct genomic values for 3570 sires genotyped using the Illumina BovineSNP50 BeadChip. These bulls were clustered into five groups using K-means clustering on pedigree estimates of additive genetic relationships between animals, with the aim of increasing within-group and decreasing between-group relationships. All five combinations of four groups were used for model training, with cross-validation performed in the group not used in training. Bivariate animal models were used for each trait to estimate the genetic correlation between deregressed estimated breeding values and direct genomic values. Accuracies of direct genomic values ranged from 0.22 to 0.69 for the studied traits, with an average of 0.44. Predictions were more accurate when animals within the validation group were more closely related to animals in the training set. When training and validation sets were formed by random allocation, the accuracies of direct genomic values ranged from 0.38 to 0.85, with an average of 0.65, reflecting the greater relationship between animals in training and validation. The accuracies of direct genomic values obtained from training on older animals and validating in younger animals were intermediate to the accuracies obtained from K-means clustering and random clustering for most traits. The genetic correlation between deregressed estimated breeding values and direct genomic values ranged from 0.15 to 0.80 for the traits studied. 
These results suggest that genomic estimates of genetic merit can be produced in beef cattle at a young age but the recurrent inclusion of genotyped sires in retraining analyses will be necessary to routinely produce for the industry the direct genomic values with the highest accuracy.
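The validation design described in this abstract (train on four of the five clusters, validate in the held-out cluster) can be sketched as follows. This is a minimal illustration under stated assumptions: the cluster labels are taken as already computed (the K-means step on pedigree relationships is not reproduced), and the function name `leave_one_group_out` and the toy labels are hypothetical rather than taken from the study.

```python
def leave_one_group_out(group_labels):
    """Yield (held_out_group, training_indices, validation_indices) per group.

    group_labels[i] is the cluster assigned to animal i. Illustrative only.
    """
    for held_out in sorted(set(group_labels)):
        train = [i for i, g in enumerate(group_labels) if g != held_out]
        valid = [i for i, g in enumerate(group_labels) if g == held_out]
        yield held_out, train, valid


# Hypothetical labels for seven animals assigned to five clusters.
toy_labels = [0, 1, 2, 3, 4, 0, 1]
folds = list(leave_one_group_out(toy_labels))
```

Each fold keeps every animal from the held-out cluster out of training, which is what makes the resulting accuracies sensitive to the relationship between training and validation animals, as the abstract reports.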
2011-01-01
Background Genomic selection is a recently developed technology that is beginning to revolutionize animal breeding. The objective of this study was to estimate marker effects to derive prediction equations for direct genomic values for 16 routinely recorded traits of American Angus beef cattle and quantify corresponding accuracies of prediction. Methods Deregressed estimated breeding values were used as observations in a weighted analysis to derive direct genomic values for 3570 sires genotyped using the Illumina BovineSNP50 BeadChip. These bulls were clustered into five groups using K-means clustering on pedigree estimates of additive genetic relationships between animals, with the aim of increasing within-group and decreasing between-group relationships. All five combinations of four groups were used for model training, with cross-validation performed in the group not used in training. Bivariate animal models were used for each trait to estimate the genetic correlation between deregressed estimated breeding values and direct genomic values. Results Accuracies of direct genomic values ranged from 0.22 to 0.69 for the studied traits, with an average of 0.44. Predictions were more accurate when animals within the validation group were more closely related to animals in the training set. When training and validation sets were formed by random allocation, the accuracies of direct genomic values ranged from 0.38 to 0.85, with an average of 0.65, reflecting the greater relationship between animals in training and validation. The accuracies of direct genomic values obtained from training on older animals and validating in younger animals were intermediate to the accuracies obtained from K-means clustering and random clustering for most traits. The genetic correlation between deregressed estimated breeding values and direct genomic values ranged from 0.15 to 0.80 for the traits studied. 
Conclusions These results suggest that genomic estimates of genetic merit can be produced in beef cattle at a young age but the recurrent inclusion of genotyped sires in retraining analyses will be necessary to routinely produce for the industry the direct genomic values with the highest accuracy. PMID:22122853
DOE Office of Scientific and Technical Information (OSTI.GOV)
Weber, B.H.F.; Vogt, G.; Stoehr, H.
1994-12-01
Best vitelliform macular dystrophy (VMD2) has previously been linked to several microsatellite markers from chromosome 11. Subsequently, additional genetic studies have refined the Best disease region to a 3.7-cM interval flanked by markers at D11S903 and PYGM. To further narrow the interval containing the Best disease gene and to obtain an estimate of the physical size of the minimal candidate region, we used a combination of high-resolution PCR hybrid mapping and analysis of recombinant Best disease chromosomes. We identified six markers from within the D11S903-PYGM interval that show no recombination with the defective gene in three multigeneration Best disease pedigrees. Our hybrid panel localizes these markers on either side of the centromere on chromosome 11. The closest markers flanking the disease gene are at D11S986 in band p12-11.22 on the short arm and at D11S480 in band q13.2-13.3 on the proximal long arm. This study demonstrates that the physical size of the Best disease region is exceedingly larger than previously estimated from the genetic data, because of the proximity of the defective gene to the centromere of chromosome 11.