population-based sample methods: Topics by Science.gov

Sample records for population-based sample methods

A two-stage cluster sampling method using gridded population data, a GIS, and Google Earth(TM) imagery in a population-based mortality survey in Iraq.

PubMed

Galway, LP; Bell, Nathaniel; Sae, Al Shatari; Hagopian, Amy; Burnham, Gilbert; Flaxman, Abraham; Weiss, William M; Rajaratnam, Julie; Takaro, Tim K

2012-04-27

Mortality estimates can measure and monitor the impacts of conflict on a population, guide humanitarian efforts, and help to better understand the public health impacts of conflict. Vital statistics registration and surveillance systems are rarely functional in conflict settings, posing the challenge of estimating mortality using retrospective population-based surveys. We present a two-stage cluster sampling method for application in population-based mortality surveys. The sampling method utilizes gridded population data and a geographic information system (GIS) to select clusters in the first sampling stage and Google Earth(TM) imagery and sampling grids to select households in the second sampling stage. The sampling method is implemented in a household mortality study in Iraq in 2011. Factors affecting feasibility and methodological quality are described. Sampling is a challenge in retrospective population-based mortality studies and alternatives that improve on the conventional approaches are needed. The sampling strategy presented here was designed to generate a representative sample of the Iraqi population while reducing the potential for bias and considering the context-specific challenges of the study setting. This sampling strategy, or variations on it, is adaptable and should be considered and tested in other conflict settings.
A two-stage cluster sampling method using gridded population data, a GIS, and Google Earth(TM) imagery in a population-based mortality survey in Iraq

PubMed Central

2012-01-01

Background: Mortality estimates can measure and monitor the impacts of conflict on a population, guide humanitarian efforts, and help to better understand the public health impacts of conflict. Vital statistics registration and surveillance systems are rarely functional in conflict settings, posing the challenge of estimating mortality using retrospective population-based surveys. Results: We present a two-stage cluster sampling method for application in population-based mortality surveys. The sampling method utilizes gridded population data and a geographic information system (GIS) to select clusters in the first sampling stage and Google Earth(TM) imagery and sampling grids to select households in the second sampling stage. The sampling method is implemented in a household mortality study in Iraq in 2011. Factors affecting feasibility and methodological quality are described. Conclusion: Sampling is a challenge in retrospective population-based mortality studies and alternatives that improve on the conventional approaches are needed. The sampling strategy presented here was designed to generate a representative sample of the Iraqi population while reducing the potential for bias and considering the context-specific challenges of the study setting. This sampling strategy, or variations on it, is adaptable and should be considered and tested in other conflict settings. PMID:22540266
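The two-stage design described in this record can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes, for illustration only, that first-stage clusters are drawn with probability proportional to the gridded population count (with replacement) and that second-stage households are drawn by simple random sampling from a list of household points in each selected cell. The cell names, population counts, and household identifiers are hypothetical.

```python
import random


def select_clusters_pps(cell_populations, n_clusters, rng):
    """Stage 1 (assumed): draw grid cells with probability proportional
    to their population count, with replacement."""
    cell_ids = list(cell_populations)
    weights = [cell_populations[c] for c in cell_ids]
    return rng.choices(cell_ids, weights=weights, k=n_clusters)


def select_households(households_in_cell, n_households, rng):
    """Stage 2 (assumed): simple random sample of households in one cell."""
    k = min(n_households, len(households_in_cell))
    return rng.sample(households_in_cell, k)


def two_stage_sample(cell_populations, households_by_cell,
                     n_clusters, n_households, seed=0):
    """Return a list of (cell, sampled households) pairs."""
    rng = random.Random(seed)
    sample = []
    for cell in select_clusters_pps(cell_populations, n_clusters, rng):
        chosen = select_households(households_by_cell[cell], n_households, rng)
        sample.append((cell, chosen))
    return sample


# Hypothetical toy inputs: four grid cells, one of them unpopulated.
cell_populations = {"A": 5000, "B": 1200, "C": 300, "D": 0}
households_by_cell = {
    cell: [f"{cell}-{i}" for i in range(50)] for cell in cell_populations
}
clusters = two_stage_sample(cell_populations, households_by_cell,
                            n_clusters=3, n_households=10)
```

In this sketch an unpopulated cell has zero weight and is never selected, which is the main reason to weight the first stage by population rather than by area.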
A random spatial sampling method in a rural developing nation

Treesearch

Michelle C. Kondo; Kent D.W. Bream; Frances K. Barg; Charles C. Branas

2014-01-01

Nonrandom sampling of populations in developing nations has limitations and can inaccurately estimate health phenomena, especially among hard-to-reach populations such as rural residents. However, random sampling of rural populations in developing nations can be challenged by incomplete enumeration of the base population. We describe a stratified random sampling method...
Nonprobability and probability-based sampling strategies in sexual science.

PubMed

Catania, Joseph A; Dolcini, M Margaret; Orellana, Roberto; Narayanan, Vasudah

2015-01-01

With few exceptions, much of sexual science builds upon data from opportunistic nonprobability samples of limited generalizability. Although probability-based studies are considered the gold standard in terms of generalizability, they are costly to apply to many of the hard-to-reach populations of interest to sexologists. The present article discusses recent conclusions by sampling experts that have relevance to sexual science that advocates for nonprobability methods. In this regard, we provide an overview of Internet sampling as a useful, cost-efficient, nonprobability sampling method of value to sex researchers conducting modeling work or clinical trials. We also argue that probability-based sampling methods may be more readily applied in sex research with hard-to-reach populations than is typically thought. In this context, we provide three case studies that utilize qualitative and quantitative techniques directed at reducing limitations in applying probability-based sampling to hard-to-reach populations: indigenous Peruvians, African American youth, and urban men who have sex with men (MSM). Recommendations are made with regard to presampling studies, adaptive and disproportionate sampling methods, and strategies that may be utilized in evaluating nonprobability and probability-based sampling methods.
Evaluation of respondent-driven sampling.

PubMed

McCreesh, Nicky; Frost, Simon D W; Seeley, Janet; Katongole, Joseph; Tarsh, Matilda N; Ndunguse, Richard; Jichi, Fatima; Lunel, Natasha L; Maher, Dermot; Johnston, Lisa G; Sonnenberg, Pam; Copas, Andrew J; Hayes, Richard J; White, Richard G

2012-01-01

Respondent-driven sampling is a novel variant of link-tracing sampling for estimating the characteristics of hard-to-reach groups, such as HIV prevalence in sex workers. Despite its use by leading health organizations, the performance of this method in realistic situations is still largely unknown. We evaluated respondent-driven sampling by comparing estimates from a respondent-driven sampling survey with total population data. Total population data on age, tribe, religion, socioeconomic status, sexual activity, and HIV status were available on a population of 2402 male household heads from an open cohort in rural Uganda. A respondent-driven sampling (RDS) survey was carried out in this population, using current methods of sampling (RDS sample) and statistical inference (RDS estimates). Analyses were carried out for the full RDS sample and then repeated for the first 250 recruits (small sample). We recruited 927 household heads. Full and small RDS samples were largely representative of the total population, but both samples underrepresented men who were younger, of higher socioeconomic status, and with unknown sexual activity and HIV status. Respondent-driven sampling statistical inference methods failed to reduce these biases. Only 31%-37% (depending on method and sample size) of RDS estimates were closer to the true population proportions than the RDS sample proportions. Only 50%-74% of respondent-driven sampling bootstrap 95% confidence intervals included the population proportion. Respondent-driven sampling produced a generally representative sample of this well-connected nonhidden population. However, current respondent-driven sampling inference methods failed to reduce bias when it occurred. Whether the data required to remove bias and measure precision can be collected in a respondent-driven sampling survey is unresolved. Respondent-driven sampling should be regarded as a (potentially superior) form of convenience sampling method, and caution is required when interpreting findings based on the sampling method.
Estimating population size with correlated sampling unit estimates

Treesearch

David C. Bowden; Gary C. White; Alan B. Franklin; Joseph L. Ganey

2003-01-01

Finite population sampling theory is useful in estimating total population size (abundance) from abundance estimates of each sampled unit (quadrat). We develop estimators that allow correlated quadrat abundance estimates, even for quadrats in different sampling strata. Correlated quadrat abundance estimates based on mark-recapture or distance sampling methods occur...
[Respondent-Driven Sampling: a new sampling method to study visible and hidden populations].

PubMed

Mantecón, Alejandro; Juan, Montse; Calafat, Amador; Becoña, Elisardo; Román, Encarna

2008-01-01

The paper introduces a variant of chain-referral sampling: respondent-driven sampling (RDS). This sampling method shows that methods based on network analysis can be combined with the statistical validity of standard probability sampling methods. In this sense, RDS appears to be a mathematical improvement of snowball sampling oriented to the study of hidden populations. However, we try to prove its validity with populations that are not within a sampling frame but can nonetheless be contacted without difficulty. The basics of RDS are explained through our research on young people (aged 14 to 25) who go clubbing, consume alcohol and other drugs, and have sex. Fieldwork was carried out between May and July 2007 in three Spanish regions: Baleares, Galicia and Comunidad Valenciana. The presentation of the study shows the utility of this type of sampling when the population is accessible but there is a difficulty deriving from the lack of a sampling frame. However, the sample obtained is not a random representative one in statistical terms of the target population. It must be acknowledged that the final sample is representative of a 'pseudo-population' that approximates to the target population but is not identical to it.
Development of a novel cell sorting method that samples population diversity in flow cytometry.

PubMed

Osborne, Geoffrey W; Andersen, Stacey B; Battye, Francis L

2015-11-01

Flow cytometry-based electrostatic cell sorting is an important tool in the separation of cell populations. Existing instruments can sort single cells into multi-well collection plates, and keep track of cell of origin and sorted well location. However, single sorted cell results currently reflect the population distribution and fail to capture the population diversity. Software was designed that implements a novel sorting approach, "Slice and Dice Sorting," that links a graphical representation of a multi-well plate to logic that ensures that single cells are sampled and sorted from all areas defined by the sort region/s. Therefore the diversity of the total population is captured, and the more frequently occurring or rarer cell types are all sampled. The sorting approach was tested computationally, and using functional cell-based assays. Computationally we demonstrate that conventional single cell sorting can sample as little as 50% of the population diversity, dependent on the population distribution, and that Slice and Dice sorting samples much more of the variety present within a cell population. We then show by sorting single cells into wells using the Slice and Dice sorting method that there are cells sorted using this method that would be either rarely sorted, or not sorted at all, using conventional single cell sorting approaches. The present study demonstrates a novel single cell sorting method that samples much more of the population diversity than current methods. It has implications in clonal selection, stem cell sorting, single cell sequencing and any areas where population heterogeneity is of importance. © 2015 International Society for Advancement of Cytometry.
Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru.

PubMed

Anderson, Weston; Guikema, Seth; Zaitchik, Ben; Pan, William

2014-01-01

Obtaining accurate small area estimates of population is essential for policy and health planning but is often difficult in countries with limited data. In lieu of available population data, small area estimate models draw information from previous time periods or from similar areas. This study focuses on model-based methods for estimating population when no direct samples are available in the area of interest. To explore the efficacy of tree-based models for estimating population density, we compare six different model structures including Random Forest and Bayesian Additive Regression Trees. Results demonstrate that without information from prior time periods, non-parametric tree-based models produced more accurate predictions than did conventional regression methods. Improving estimates of population density in non-sampled areas is important for regions with incomplete census data and has implications for economic, health and development policies.
Methods for Estimating Population Density in Data-Limited Areas: Evaluating Regression and Tree-Based Models in Peru

PubMed Central

Anderson, Weston; Guikema, Seth; Zaitchik, Ben; Pan, William

2014-01-01

Obtaining accurate small area estimates of population is essential for policy and health planning but is often difficult in countries with limited data. In lieu of available population data, small area estimate models draw information from previous time periods or from similar areas. This study focuses on model-based methods for estimating population when no direct samples are available in the area of interest. To explore the efficacy of tree-based models for estimating population density, we compare six different model structures including Random Forest and Bayesian Additive Regression Trees. Results demonstrate that without information from prior time periods, non-parametric tree-based models produced more accurate predictions than did conventional regression methods. Improving estimates of population density in non-sampled areas is important for regions with incomplete census data and has implications for economic, health and development policies. PMID:24992657
Diagnostic test accuracy and prevalence inferences based on joint and sequential testing with finite population sampling.

PubMed

Su, Chun-Lung; Gardner, Ian A; Johnson, Wesley O

2004-07-30

The two-test two-population model, originally formulated by Hui and Walter, for estimation of test accuracy and prevalence assumes conditionally independent tests, constant accuracy across populations and binomial sampling. The binomial assumption is incorrect if all individuals in a population (e.g. a child-care centre, a village in Africa, or a cattle herd) are sampled or if the sample size is large relative to population size. In this paper, we develop statistical methods for evaluating diagnostic test accuracy and prevalence estimation based on finite sample data in the absence of a gold standard. Moreover, two tests are often applied simultaneously for the purpose of obtaining a 'joint' testing strategy that has either higher overall sensitivity or specificity than either of the two tests considered singly. Sequential versions of such strategies are often applied in order to reduce the cost of testing. We thus discuss joint (simultaneous and sequential) testing strategies and inference for them. Using the developed methods, we analyse two real and one simulated data sets, and we compare 'hypergeometric' and 'binomial-based' inferences. Our findings indicate that the posterior standard deviations for prevalence (but not sensitivity and specificity) based on finite population sampling tend to be smaller than their counterparts for infinite population sampling. Finally, we make recommendations about how small the sample size should be relative to the population size to warrant use of the binomial model for prevalence estimation. Copyright 2004 John Wiley & Sons, Ltd.
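The contrast between binomial and hypergeometric sampling in this record can be seen in a small frequentist analogue. The sketch below is not the Bayesian method of the paper: it only compares the standard deviation of the number of positives in a sample under the two sampling models, using the standard finite population correction. The function names and the numbers in the example are hypothetical.

```python
import math


def binomial_sd(n, p):
    """SD of the count of positives when sampling n units from an
    effectively infinite population with prevalence p."""
    return math.sqrt(n * p * (1 - p))


def hypergeometric_sd(population_size, positives, n):
    """SD of the count of positives when sampling n units without
    replacement from a finite population."""
    p = positives / population_size
    # Finite population correction: shrinks to 0 when the whole
    # population is sampled (n == population_size).
    fpc = (population_size - n) / (population_size - 1)
    return math.sqrt(n * p * (1 - p) * fpc)


# Hypothetical herd of 100 animals, 20 infected, 50 sampled.
sd_binomial = binomial_sd(50, 0.20)
sd_hypergeometric = hypergeometric_sd(100, 20, 50)
```

When the sample is a small fraction of the population the correction factor is close to 1 and the two models nearly agree, which is the situation in which the binomial approximation is usually considered acceptable.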
Reliability of confidence intervals calculated by bootstrap and classical methods using the FIA 1-ha plot design

Treesearch

H. T. Schreuder; M. S. Williams

2000-01-01

In simulation sampling from forest populations using sample sizes of 20, 40, and 60 plots respectively, confidence intervals based on the bootstrap (accelerated, percentile, and t-distribution based) were calculated and compared with those based on the classical t confidence intervals for mapped populations and subdomains within those populations. A 68.1 ha mapped...
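As a rough illustration of one of the interval types named in this record, the sketch below computes a percentile bootstrap confidence interval for a sample mean. It is a generic textbook version, not the procedure used in the study; the plot values are made up and the function name is hypothetical.

```python
import random
import statistics


def percentile_bootstrap_ci(values, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the mean of `values`."""
    rng = random.Random(seed)
    n = len(values)
    # Resample with replacement and record the mean of each resample.
    means = sorted(
        statistics.fmean(rng.choices(values, k=n)) for _ in range(n_resamples)
    )
    lower_index = round((alpha / 2) * n_resamples)
    upper_index = round((1 - alpha / 2) * n_resamples) - 1
    return means[lower_index], means[upper_index]


# Hypothetical per-plot measurements from a sample of 20 plots.
plot_values = [12.1, 9.8, 15.3, 7.4, 11.0, 13.6, 8.9, 10.2, 14.7, 9.1,
               16.0, 6.5, 12.8, 10.9, 11.7, 13.1, 8.2, 9.6, 14.0, 10.4]
ci_low, ci_high = percentile_bootstrap_ci(plot_values)
```

With only 20 plots, intervals of this kind can have coverage below the nominal level, which is the sort of behaviour a simulation study such as this one is designed to measure.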
Methodology Series Module 5: Sampling Strategies.

PubMed

Setia, Maninder Singh

2016-01-01

Once the research question and the research design have been finalised, it is important to select the appropriate sample for the study. The method by which the researcher selects the sample is the 'Sampling Method'. There are essentially two types of sampling methods: 1) probability sampling - based on chance events (such as random numbers, flipping a coin etc.); and 2) non-probability sampling - based on the researcher's choice and on the population that is accessible and available. Some of the non-probability sampling methods are: purposive sampling, convenience sampling, or quota sampling. Random sampling method (such as simple random sample or stratified random sample) is a form of probability sampling. It is important to understand the different sampling methods used in clinical studies and mention this method clearly in the manuscript. The researcher should not misrepresent the sampling method in the manuscript (such as using the term 'random sample' when the researcher has used convenience sample). The sampling method will depend on the research question. For instance, the researcher may want to understand an issue in greater detail for one particular population rather than worry about the 'generalizability' of these results. In such a scenario, the researcher may want to use 'purposive sampling' for the study.
Evaluation of Respondent-Driven Sampling

PubMed Central

McCreesh, Nicky; Frost, Simon; Seeley, Janet; Katongole, Joseph; Tarsh, Matilda Ndagire; Ndunguse, Richard; Jichi, Fatima; Lunel, Natasha L; Maher, Dermot; Johnston, Lisa G; Sonnenberg, Pam; Copas, Andrew J; Hayes, Richard J; White, Richard G

2012-01-01

Background: Respondent-driven sampling is a novel variant of link-tracing sampling for estimating the characteristics of hard-to-reach groups, such as HIV prevalence in sex-workers. Despite its use by leading health organizations, the performance of this method in realistic situations is still largely unknown. We evaluated respondent-driven sampling by comparing estimates from a respondent-driven sampling survey with total-population data. Methods: Total-population data on age, tribe, religion, socioeconomic status, sexual activity and HIV status were available on a population of 2402 male household-heads from an open cohort in rural Uganda. A respondent-driven sampling (RDS) survey was carried out in this population, employing current methods of sampling (RDS sample) and statistical inference (RDS estimates). Analyses were carried out for the full RDS sample and then repeated for the first 250 recruits (small sample). Results: We recruited 927 household-heads. Full and small RDS samples were largely representative of the total population, but both samples under-represented men who were younger, of higher socioeconomic status, and with unknown sexual activity and HIV status. Respondent-driven-sampling statistical-inference methods failed to reduce these biases. Only 31%-37% (depending on method and sample size) of RDS estimates were closer to the true population proportions than the RDS sample proportions. Only 50%-74% of respondent-driven-sampling bootstrap 95% confidence intervals included the population proportion. Conclusions: Respondent-driven sampling produced a generally representative sample of this well-connected non-hidden population. However, current respondent-driven-sampling inference methods failed to reduce bias when it occurred. Whether the data required to remove bias and measure precision can be collected in a respondent-driven sampling survey is unresolved. Respondent-driven sampling should be regarded as a (potentially superior) form of convenience-sampling method, and caution is required when interpreting findings based on the sampling method. PMID:22157309
Confidence intervals for the population mean tailored to small sample sizes, with applications to survey sampling.

PubMed

Rosenblum, Michael A; van der Laan, Mark J

2009-01-07

The validity of standard confidence intervals constructed in survey sampling is based on the central limit theorem. For small sample sizes, the central limit theorem may give a poor approximation, resulting in confidence intervals that are misleading. We discuss this issue and propose methods for constructing confidence intervals for the population mean tailored to small sample sizes. We present a simple approach for constructing confidence intervals for the population mean based on tail bounds for the sample mean that are correct for all sample sizes. Bernstein's inequality provides one such tail bound. The resulting confidence intervals have guaranteed coverage probability under much weaker assumptions than are required for standard methods. A drawback of this approach, as we show, is that these confidence intervals are often quite wide. In response to this, we present a method for constructing much narrower confidence intervals, which are better suited for practical applications, and that are still more robust than confidence intervals based on standard methods, when dealing with small sample sizes. We show how to extend our approaches to much more general estimation problems than estimating the sample mean. We describe how these methods can be used to obtain more reliable confidence intervals in survey sampling. As a concrete example, we construct confidence intervals using our methods for the number of violent deaths between March 2003 and July 2006 in Iraq, based on data from the study "Mortality after the 2003 invasion of Iraq: A cross sectional cluster sample survey," by Burnham et al. (2006).
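To make the trade-off in this record concrete, the sketch below inverts a two-sided Bernstein tail bound to get a confidence-interval half-width for the mean of bounded observations, and compares it with the usual normal-approximation half-width. It is a simplified illustration under stated assumptions, not the authors' construction: observations are taken to be independent and confined to a known range, and when no variance bound is supplied the worst case (range squared over four) is used. All names and numbers are hypothetical.

```python
import math


def bernstein_half_width(n, lower, upper, alpha=0.05, variance_bound=None):
    """Half-width t solving 2*exp(-n*t^2 / (2*var + 2*M*t/3)) = alpha,
    where M = upper - lower bounds the deviation of one observation."""
    m = upper - lower
    if variance_bound is None:
        variance_bound = m ** 2 / 4  # largest variance possible on [lower, upper]
    log_term = math.log(2 / alpha)
    b = 2 * m * log_term / 3
    # Positive root of n*t^2 - b*t - 2*var*log_term = 0.
    return (b + math.sqrt(b * b + 8 * n * variance_bound * log_term)) / (2 * n)


def normal_half_width(n, sd, z=1.96):
    """Standard central-limit-theorem half-width for comparison."""
    return z * sd / math.sqrt(n)


# Hypothetical survey: 30 observations known to lie in [0, 1].
t_bernstein = bernstein_half_width(n=30, lower=0.0, upper=1.0)
t_normal = normal_half_width(n=30, sd=0.5)
```

The Bernstein half-width is valid for every sample size under the stated assumptions, but it is noticeably wider than the normal-approximation half-width, which matches the drawback the abstract describes.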
Methodology Series Module 5: Sampling Strategies

PubMed Central

Setia, Maninder Singh

2016-01-01

Once the research question and the research design have been finalised, it is important to select the appropriate sample for the study. The method by which the researcher selects the sample is the ‘Sampling Method’. There are essentially two types of sampling methods: 1) probability sampling – based on chance events (such as random numbers, flipping a coin etc.); and 2) non-probability sampling – based on the researcher's choice and on the population that is accessible and available. Some of the non-probability sampling methods are: purposive sampling, convenience sampling, or quota sampling. Random sampling method (such as simple random sample or stratified random sample) is a form of probability sampling. It is important to understand the different sampling methods used in clinical studies and mention this method clearly in the manuscript. The researcher should not misrepresent the sampling method in the manuscript (such as using the term ‘random sample’ when the researcher has used convenience sample). The sampling method will depend on the research question. For instance, the researcher may want to understand an issue in greater detail for one particular population rather than worry about the ‘generalizability’ of these results. In such a scenario, the researcher may want to use ‘purposive sampling’ for the study. PMID:27688438
Confidence intervals for population allele frequencies: the general case of sampling from a finite diploid population of any size.

PubMed

Fung, Tak; Keenan, Kevin

2014-01-01

The estimation of population allele frequencies using sample data forms a central component of studies in population genetics. These estimates can be used to test hypotheses on the evolutionary processes governing changes in genetic variation among populations. However, existing studies frequently do not account for sampling uncertainty in these estimates, thus compromising their utility. Incorporation of this uncertainty has been hindered by the lack of a method for constructing confidence intervals containing the population allele frequencies, for the general case of sampling from a finite diploid population of any size. In this study, we address this important knowledge gap by presenting a rigorous mathematical method to construct such confidence intervals. For a range of scenarios, the method is used to demonstrate that for a particular allele, in order to obtain accurate estimates within 0.05 of the population allele frequency with high probability (≥ 95%), a sample size of > 30 is often required. This analysis is augmented by an application of the method to empirical sample allele frequency data for two populations of the checkerspot butterfly (Melitaea cinxia L.), occupying meadows in Finland. For each population, the method is used to derive ≥ 98.3% confidence intervals for the population frequencies of three alleles. These intervals are then used to construct two joint ≥ 95% confidence regions, one for the set of three frequencies for each population. These regions are then used to derive a ≥ 95% confidence interval for Jost's D, a measure of genetic differentiation between the two populations. Overall, the results demonstrate the practical utility of the method with respect to informing sampling design and accounting for sampling uncertainty in studies of population genetics, important for scientific hypothesis-testing and also for risk-based natural resource management.
Hard-to-reach populations of men who have sex with men and sex workers: a systematic review on sampling methods.

PubMed

Barros, Ana B; Dias, Sonia F; Martins, Maria Rosario O

2015-10-30

In public health, hard-to-reach populations are often recruited by non-probabilistic sampling methods that produce biased results. In order to overcome this, several sampling methods have been improved and developed in recent years. The aim of this systematic review was to identify all current methods used to survey most-at-risk populations of men who have sex with men and sex workers. The review also aimed to assess if there were any relations between the study populations and the sampling methods used to recruit them. Lastly, we wanted to assess if the number of publications originating in middle and low human development (MLHD) countries had been increasing in recent years. A systematic review was conducted using electronic databases and a total of 268 published studies were included in the analysis. In this review, 11 recruitment methods were identified. Semi-probabilistic methods were used most commonly to survey men who have sex with men, and the use of the Internet was the method that gathered the most respondents. We found that female sex workers were more frequently recruited through non-probabilistic methods than men who have sex with men (odds = 2.2; p < 0.05; confidence interval (CI) [1.1-4.2]). In the last 6 years, the number of studies based in middle and low human development countries increased more than the number of studies based in very high and high human development countries (odds = 2.5; p < 0.05; CI [1.3-4.9]). This systematic literature review identified 11 methods used to sample men who have sex with men and female sex workers. There is an association between the type of sampling method and the population being studied. The number of studies based in middle and low human development countries has increased in the last 6 years of this study.
The efficacy of respondent-driven sampling for the health assessment of minority populations.

PubMed

Badowski, Grazyna; Somera, Lilnabeth P; Simsiman, Brayan; Lee, Hye-Ryeon; Cassel, Kevin; Yamanaka, Alisha; Ren, JunHao

2017-10-01

Respondent-driven sampling (RDS) is a relatively new network sampling technique typically employed for hard-to-reach populations. Like snowball sampling, initial respondents or "seeds" recruit additional respondents from their network of friends. Under certain assumptions, the method promises to produce a sample independent of the biases that may have been introduced by the non-random choice of "seeds." We conducted a survey on health communication in Guam's general population using the RDS method, the first survey that has utilized this methodology in Guam. It was conducted in hopes of identifying a cost-efficient non-probability sampling strategy that could generate reasonable population estimates for both minority and general populations. RDS data were collected in Guam in 2013 (n=511) and population estimates were compared with 2012 BRFSS data (n=2031) and the 2010 census data. The estimates were calculated using the unweighted RDS sample and the weighted sample using RDS inference methods and compared with known population characteristics. The sample size was reached in 23 days, providing evidence that the RDS method is a viable, cost-effective data collection method, which can provide reasonable population estimates. However, the results also suggest that the RDS inference methods used to reduce bias, based on self-reported estimates of network sizes, may not always work. Caution is needed when interpreting RDS study findings. For a more diverse sample, data collection should not be conducted in just one location. Fewer questions about network estimates should be asked, and more careful consideration should be given to the kind of incentives offered to participants. Copyright © 2017. Published by Elsevier Ltd.
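The role of self-reported network sizes in RDS inference can be illustrated with one widely used estimator, often called RDS-II or the Volz-Heckathorn estimator, which weights each respondent by the inverse of their reported network size (degree). The record does not say which inference methods were used, so this choice is an assumption made for illustration, and the data below are invented.

```python
def rds_ii_estimate(outcomes, degrees):
    """Inverse-degree weighted proportion (RDS-II style).

    outcomes: 0/1 indicator per respondent.
    degrees:  self-reported network size per respondent (must be > 0).
    """
    if len(outcomes) != len(degrees):
        raise ValueError("outcomes and degrees must have the same length")
    if any(d <= 0 for d in degrees):
        raise ValueError("every reported degree must be positive")
    # Well-connected people are more likely to be recruited, so they
    # receive proportionally less weight.
    weights = [1.0 / d for d in degrees]
    weighted_sum = sum(w * y for w, y in zip(weights, outcomes))
    return weighted_sum / sum(weights)


# Hypothetical sample: the two respondents with the outcome report
# large networks, the two without it report small ones.
outcomes = [1, 1, 0, 0]
degrees = [10, 10, 2, 2]
unweighted = sum(outcomes) / len(outcomes)
weighted = rds_ii_estimate(outcomes, degrees)
```

Because the weights depend entirely on the reported degrees, misreported network sizes feed directly into the estimate, which is one plausible reading of the caution expressed in the abstract.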
A general method to determine sampling windows for nonlinear mixed effects models with an application to population pharmacokinetic studies.

PubMed

Foo, Lee Kien; McGree, James; Duffull, Stephen

2012-01-01

Optimal design methods have been proposed to determine the best sampling times when sparse blood sampling is required in clinical pharmacokinetic studies. However, the optimal blood sampling time points may not be feasible in clinical practice. Sampling windows, a time interval for blood sample collection, have been proposed to provide flexibility in blood sampling times while preserving efficient parameter estimation. Because of the complexity of the population pharmacokinetic models, which are generally nonlinear mixed effects models, there is no analytical solution available to determine sampling windows. We propose a method for determination of sampling windows based on MCMC sampling techniques. The proposed method attains a stationary distribution rapidly and provides time-sensitive windows around the optimal design points. The proposed method is applicable to determine sampling windows for any nonlinear mixed effects model although our work focuses on an application to population pharmacokinetic models. Copyright © 2012 John Wiley & Sons, Ltd.

Observational studies of patients in the emergency department: a comparison of 4 sampling methods.

PubMed

Valley, Morgan A; Heard, Kennon J; Ginde, Adit A; Lezotte, Dennis C; Lowenstein, Steven R

2012-08-01

We evaluate the ability of 4 sampling methods to generate representative samples of the emergency department (ED) population. We analyzed the electronic records of 21,662 consecutive patient visits at an urban, academic ED. From this population, we simulated different models of study recruitment in the ED by using 2 sample sizes (n=200 and n=400) and 4 sampling methods: true random, random 4-hour time blocks by exact sample size, random 4-hour time blocks by a predetermined number of blocks, and convenience or "business hours." For each method and sample size, we obtained 1,000 samples from the population. Using χ² tests, we measured the number of statistically significant differences between the sample and the population for 8 variables (age, sex, race/ethnicity, language, triage acuity, arrival mode, disposition, and payer source). Then, for each variable, method, and sample size, we compared the proportion of the 1,000 samples that differed from the overall ED population to the expected proportion (5%). Only the true random samples represented the population with respect to sex, race/ethnicity, triage acuity, mode of arrival, language, and payer source in at least 95% of the samples. Patient samples obtained using random 4-hour time blocks and business hours sampling systematically differed from the overall ED patient population for several important demographic and clinical variables. However, the magnitude of these differences was not large. Common sampling strategies selected for ED-based studies may affect parameter estimates for several representative population variables. However, the potential for bias for these variables appears small. Copyright © 2012. Published by Mosby, Inc.
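The simulation logic in this record (draw many samples under each recruitment model, test each against the population, and compare the rejection rate with the expected 5%) can be sketched on synthetic data. Everything below is hypothetical: the population is generated rather than taken from ED records, only one binary variable is tested, only two of the four sampling methods are shown, and the dependence of ambulance arrival on time of day is an invented assumption that is much stronger than the small differences the study reports.

```python
import random

CHI2_CRITICAL_DF1 = 3.841  # 5% critical value, chi-square with 1 d.f.


def chi2_gof_binary(sample, population_proportion):
    """Chi-square goodness-of-fit statistic for one yes/no variable."""
    n = len(sample)
    observed_yes = sum(sample)
    expected_yes = n * population_proportion
    expected_no = n - expected_yes
    return ((observed_yes - expected_yes) ** 2 / expected_yes
            + ((n - observed_yes) - expected_no) ** 2 / expected_no)


def simulate_rejection_rates(n=200, reps=300, seed=0):
    rng = random.Random(seed)
    # Synthetic visits: (arrival hour, arrived by ambulance?). Ambulance
    # arrival is assumed to be rarer during business hours (09:00-17:00).
    population = []
    for _ in range(20000):
        hour = rng.randrange(24)
        p_ambulance = 0.15 if 9 <= hour < 17 else 0.30
        population.append((hour, rng.random() < p_ambulance))
    pop_prop = sum(amb for _, amb in population) / len(population)
    business_hours = [visit for visit in population if 9 <= visit[0] < 17]

    rejections = {"random": 0, "business_hours": 0}
    frames = (("random", population), ("business_hours", business_hours))
    for _ in range(reps):
        for name, frame in frames:
            sample = [amb for _, amb in rng.sample(frame, n)]
            if chi2_gof_binary(sample, pop_prop) > CHI2_CRITICAL_DF1:
                rejections[name] += 1
    return {name: count / reps for name, count in rejections.items()}


rates = simulate_rejection_rates()
```

Under these assumptions the true random samples differ from the population at roughly the nominal rate, while the business-hours samples differ far more often, which mirrors the comparison the abstract describes.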
Genotyping faecal samples of Bengal tiger Panthera tigris tigris for population estimation: a pilot study.

PubMed

Bhagavatula, Jyotsna; Singh, Lalji

2006-10-17

Bengal tiger Panthera tigris tigris, the National Animal of India, is an endangered species. Estimating populations for such species is the main objective for designing conservation measures and for evaluating those that are already in place. Due to the tiger's cryptic and secretive behaviour, it is not possible to enumerate and monitor its populations through direct observations; instead indirect methods have always been used for studying tigers in the wild. DNA methods based on non-invasive sampling have not been attempted so far for tiger population studies in India. We describe here a pilot study using DNA extracted from faecal samples of tigers for the purpose of population estimation. In this study, PCR primers were developed based on tiger-specific variations in the mitochondrial cytochrome b for reliably identifying tiger faecal samples from those of sympatric carnivores. Microsatellite markers were developed for the identification of individual tigers with a sibling Probability of Identity of 0.005 that can distinguish even closely related individuals with 99.9% certainty. The effectiveness of using field-collected tiger faecal samples for DNA analysis was evaluated by sampling, identification and subsequently genotyping samples from two protected areas in southern India. Our results demonstrate the feasibility of using tiger faecal matter as a potential source of DNA for population estimation of tigers in protected areas in India in addition to the methods currently in use.
Network Model-Assisted Inference from Respondent-Driven Sampling Data

PubMed Central

Gile, Krista J.; Handcock, Mark S.

2015-01-01

Summary Respondent-Driven Sampling is a widely-used method for sampling hard-to-reach human populations by link-tracing over their social networks. Inference from such data requires specialized techniques because the sampling process is both partially beyond the control of the researcher, and partially implicitly defined. Therefore, it is not generally possible to directly compute the sampling weights for traditional design-based inference, and likelihood inference requires modeling the complex sampling process. As an alternative, we introduce a model-assisted approach, resulting in a design-based estimator leveraging a working network model. We derive a new class of estimators for population means and a corresponding bootstrap standard error estimator. We demonstrate improved performance compared to existing estimators, including adjustment for an initial convenience sample. We also apply the method and an extension to the estimation of HIV prevalence in a high-risk population. PMID:26640328
Network Model-Assisted Inference from Respondent-Driven Sampling Data.

PubMed

Gile, Krista J; Handcock, Mark S

2015-06-01

Respondent-Driven Sampling is a widely-used method for sampling hard-to-reach human populations by link-tracing over their social networks. Inference from such data requires specialized techniques because the sampling process is both partially beyond the control of the researcher, and partially implicitly defined. Therefore, it is not generally possible to directly compute the sampling weights for traditional design-based inference, and likelihood inference requires modeling the complex sampling process. As an alternative, we introduce a model-assisted approach, resulting in a design-based estimator leveraging a working network model. We derive a new class of estimators for population means and a corresponding bootstrap standard error estimator. We demonstrate improved performance compared to existing estimators, including adjustment for an initial convenience sample. We also apply the method and an extension to the estimation of HIV prevalence in a high-risk population.
The Analysis of Organizational Diagnosis on Based Six Box Model in Universities

ERIC Educational Resources Information Center

Hamid, Rahimi; Siadat, Sayyed Ali; Reza, Hoveida; Arash, Shahin; Ali, Nasrabadi Hasan; Azizollah, Arbabisarjou

2011-01-01

Purpose: To analyze organizational diagnosis at universities based on the six-box model. Research method: The research method was a descriptive survey. The statistical population consisted of 1544 university faculty members, of whom 218 were chosen as the sample by stratified random sampling. The research instruments were organizational…
Genealogy-based methods for inference of historical recombination and gene flow and their application in Saccharomyces cerevisiae.

PubMed

Jenkins, Paul A; Song, Yun S; Brem, Rachel B

2012-01-01

Genetic exchange between isolated populations, or introgression between species, serves as a key source of novel genetic material on which natural selection can act. While detecting historical gene flow from DNA sequence data is of much interest, many existing methods can be limited by requirements for deep population genomic sampling. In this paper, we develop a scalable genealogy-based method to detect candidate signatures of gene flow into a given population when the source of the alleles is unknown. Our method does not require sequenced samples from the source population, provided that the alleles have not reached fixation in the sampled recipient population. The method utilizes recent advances in algorithms for the efficient reconstruction of ancestral recombination graphs, which encode genealogical histories of DNA sequence data at each site, and is capable of detecting the signatures of gene flow whose footprints are of length up to single genes. Further, we employ a theoretical framework based on coalescent theory to test for statistical significance of certain recombination patterns consistent with gene flow from divergent sources. Implementing these methods for application to whole-genome sequences of environmental yeast isolates, we illustrate the power of our approach to highlight loci with unusual recombination histories. By developing innovative theory and methods to analyze signatures of gene flow from population sequence data, our work establishes a foundation for the continued study of introgression and its evolutionary relevance.
Genealogy-Based Methods for Inference of Historical Recombination and Gene Flow and Their Application in Saccharomyces cerevisiae

PubMed Central

Jenkins, Paul A.; Song, Yun S.; Brem, Rachel B.

2012-01-01

Genetic exchange between isolated populations, or introgression between species, serves as a key source of novel genetic material on which natural selection can act. While detecting historical gene flow from DNA sequence data is of much interest, many existing methods can be limited by requirements for deep population genomic sampling. In this paper, we develop a scalable genealogy-based method to detect candidate signatures of gene flow into a given population when the source of the alleles is unknown. Our method does not require sequenced samples from the source population, provided that the alleles have not reached fixation in the sampled recipient population. The method utilizes recent advances in algorithms for the efficient reconstruction of ancestral recombination graphs, which encode genealogical histories of DNA sequence data at each site, and is capable of detecting the signatures of gene flow whose footprints are of length up to single genes. Further, we employ a theoretical framework based on coalescent theory to test for statistical significance of certain recombination patterns consistent with gene flow from divergent sources. Implementing these methods for application to whole-genome sequences of environmental yeast isolates, we illustrate the power of our approach to highlight loci with unusual recombination histories. By developing innovative theory and methods to analyze signatures of gene flow from population sequence data, our work establishes a foundation for the continued study of introgression and its evolutionary relevance. PMID:23226196
Challenges to be overcome using population-based sampling methods to recruit veterans for a study of post-traumatic stress disorder and traumatic brain injury.

PubMed

Bayley, Peter J; Kong, Jennifer Y; Helmer, Drew A; Schneiderman, Aaron; Roselli, Lauren A; Rosse, Stephanie M; Jackson, Jordan A; Baldwin, Janet; Isaac, Linda; Nolasco, Michael; Blackman, Marc R; Reinhard, Matthew J; Ashford, John Wesson; Chapman, Julie C

2014-04-08

Many investigators are interested in recruiting veterans from recent conflicts in Afghanistan and Iraq with Traumatic Brain Injury (TBI) and/or Post Traumatic Stress Disorder (PTSD). Researchers pursuing such studies may experience problems in recruiting sufficient numbers unless effective strategies are used. Currently, there is very little information on recruitment strategies for individuals with TBI and/or PTSD. It is known that groups of patients with medical conditions may be less likely to volunteer for clinical research. This study investigated the feasibility of recruiting veterans returning from recent military conflicts--Operation Enduring Freedom (OEF) and Operation Iraqi Freedom (OIF)--using a population-based sampling method. Individuals were sampled from a previous epidemiological study. Three study sites focused on recruiting survey respondents (n = 445) who lived within a 60 mile radius of one of the sites. Overall, the successful recruitment of veterans using a population-based sampling method was dependent on the ability to contact potential participants following mass mailing. Study enrollment of participants with probable TBI and/or PTSD had a recruitment yield (enrolled/total identified) of 5.4%. We were able to contact 146 individuals, representing a contact rate of 33%. Sixty-six of the individuals contacted were screened. The major reasons for not screening included a stated lack of interest in the study (n = 37), a failure to answer screening calls after initial contact (n = 30), and an unwillingness or inability to travel to a study site (n = 10). Based on the phone screening, 36 veterans were eligible for the study. Twenty-four veterans were enrolled (recruitment yield = 5.4%), and twelve were not enrolled for a variety of reasons. Our experience with a population-based sampling method for recruitment of recent combat veterans illustrates the challenges encountered, particularly contacting and screening potential participants. The screening and enrollment data will help guide recruitment for future studies using population-based methods.
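The rates quoted in this abstract follow directly from the reported counts. The short sketch below recomputes them. The dictionary keys and the helper name `percent` are illustrative choices, and the denominator of 445 is taken from the abstract's definition of recruitment yield as enrolled/total identified.

```python
def percent(numerator, denominator):
    """Return numerator/denominator as a percentage."""
    return 100.0 * numerator / denominator

# Counts as reported in the abstract.
funnel = {
    "identified": 445,  # survey respondents within 60 miles of a site
    "contacted": 146,
    "screened": 66,
    "eligible": 36,
    "enrolled": 24,
}

contact_rate = percent(funnel["contacted"], funnel["identified"])
recruitment_yield = percent(funnel["enrolled"], funnel["identified"])

print(f"contact rate      : {contact_rate:.0f}%")
print(f"recruitment yield : {recruitment_yield:.1f}%")
```

The same helper can be applied stage by stage (for example, screened/contacted or enrolled/eligible) to see where most of the attrition in a recruitment funnel occurs.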
Joint Inference of Population Assignment and Demographic History

PubMed Central

Choi, Sang Chul; Hey, Jody

2011-01-01

A new approach to assigning individuals to populations using genetic data is described. Most existing methods work by maximizing Hardy–Weinberg and linkage equilibrium within populations, neither of which will apply for many demographic histories. By including a demographic model, within a likelihood framework based on coalescent theory, we can jointly study demographic history and population assignment. Genealogies and population assignments are sampled from a posterior distribution using a general isolation-with-migration model for multiple populations. A measure of partition distance between assignments facilitates not only the summary of a posterior sample of assignments, but also the estimation of the posterior density for the demographic history. It is shown that joint estimates of assignment and demographic history are possible, including estimation of population phylogeny for samples from three populations. The new method is compared to results of a widely used assignment method, using simulated and published empirical data sets. PMID:21775468
The prevalence of ADHD in a population-based sample

PubMed Central

Rowland, Andrew S.; Skipper, Betty J.; Umbach, David M.; Rabiner, David L.; Campbell, Richard A.; Naftel, A. Jack; Sandler, Dale P.

2014-01-01

Objective Few studies of ADHD prevalence have used population-based samples, multiple informants, and DSM-IV criteria. In addition, children who are asymptomatic while receiving ADHD medication often have been misclassified. Therefore, we conducted a population-based study to estimate the prevalence of ADHD in elementary school children using DSM-IV criteria. Methods We screened 7587 children for ADHD. Teachers of 81% of the children completed a DSM-IV checklist. We then interviewed parents using a structured interview (DISC). Of these, 72% participated. Parent and teacher ratings were combined to determine ADHD status. We also estimated the proportion of cases attributable to other conditions. Results Overall, 15.5% of our sample (95% confidence interval (C.I.) 14.6%-16.4%) met DSM-IV-TR criteria for ADHD. Over 40% of cases reported no previous diagnosis. With additional information, other conditions explained about 9% of cases. Conclusions The prevalence of ADHD in this population-based sample was higher than the 3-7% commonly reported. To compare study results, the methods used to implement the DSM criteria need to be standardized. PMID:24336124
Problems with sampling desert tortoises: A simulation analysis based on field data

USGS Publications Warehouse

Freilich, J.E.; Camp, R.J.; Duda, J.J.; Karl, A.E.

2005-01-01

The desert tortoise (Gopherus agassizii) was listed as a U.S. threatened species in 1990 based largely on population declines inferred from mark-recapture surveys of 2.59-km² (1-mi²) plots. Since then, several census methods have been proposed and tested, but all methods still pose logistical or statistical difficulties. We conducted computer simulations using actual tortoise location data from 2 1-mi² plot surveys in southern California, USA, to identify strengths and weaknesses of current sampling strategies. We considered tortoise population estimates based on these plots as "truth" and then tested various sampling methods based on sampling smaller plots or transect lines passing through the mile squares. Data were analyzed using Schnabel's mark-recapture estimate and program CAPTURE. Experimental subsampling with replacement of the 1-mi² data using 1-km² and 0.25-km² plot boundaries produced data sets of smaller plot sizes, which we compared to estimates from the 1-mi² plots. We also tested distance sampling by saturating a 1-mi² site with computer simulated transect lines, once again evaluating bias in density estimates. Subsampling estimates from 1-km² plots did not differ significantly from the estimates derived at 1-mi². The 0.25-km² subsamples significantly overestimated population sizes, chiefly because too few recaptures were made. Distance sampling simulations were biased 80% of the time and had high coefficient of variation to density ratios. Furthermore, a prospective power analysis suggested limited ability to detect population declines as high as 50%. We concluded that poor performance and bias of both sampling procedures was driven by insufficient sample size, suggesting that all efforts must be directed to increasing numbers found in order to produce reliable results. Our results suggest that present methods may not be capable of accurately estimating desert tortoise populations.
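The abstract names Schnabel's mark-recapture estimate without giving its form. A minimal sketch of the textbook multi-occasion Schnabel estimator, N = Σ(C_t · M_t) / Σ R_t, is shown below, where C_t is the number caught on occasion t, M_t the number of marked animals at large before occasion t, and R_t the number of recaptures. The function name and the example counts are hypothetical. The study's actual analysis, including its use of program CAPTURE, is not reproduced here.

```python
def schnabel_estimate(catches, recaptures):
    """Textbook Schnabel estimate of closed-population size.

    catches[t]    -- animals caught on occasion t (C_t)
    recaptures[t] -- of those, how many were already marked (R_t)
    Assumes every unmarked animal caught is marked and released.
    """
    if len(catches) != len(recaptures):
        raise ValueError("catches and recaptures must have equal length")
    marked_at_large = 0   # M_t: marked animals before occasion t
    numerator = 0         # running sum of C_t * M_t
    total_recaptures = 0  # running sum of R_t
    for caught, recaptured in zip(catches, recaptures):
        if recaptured > caught or recaptured > marked_at_large:
            raise ValueError("recaptures exceed catch or marked animals")
        numerator += caught * marked_at_large
        total_recaptures += recaptured
        marked_at_large += caught - recaptured  # newly marked animals
    if total_recaptures == 0:
        raise ValueError("no recaptures: the estimate is undefined")
    return numerator / total_recaptures

# Hypothetical three-occasion survey (counts invented for illustration).
estimate = schnabel_estimate(catches=[20, 25, 22], recaptures=[0, 5, 8])
print(f"estimated population size: {estimate:.1f}")
```

Because the total number of recaptures sits in the denominator, a survey with very few recaptures gives a large and unstable estimate, which is consistent with the overestimation the abstract reports for the smallest plots.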
Investigating population continuity with ancient DNA under a spatially explicit simulation framework.

PubMed

Silva, Nuno Miguel; Rio, Jeremy; Currat, Mathias

2017-12-15

Recent advances in sequencing technologies have allowed for the retrieval of ancient DNA data (aDNA) from skeletal remains, providing direct genetic snapshots from diverse periods of human prehistory. Comparing samples taken in the same region but at different times, hereafter called "serial samples", may indicate whether there is continuity in the peopling history of that area or whether an immigration of a genetically different population has occurred between the two sampling times. However, the exploration of genetic relationships between serial samples generally ignores their geographical locations and the spatiotemporal dynamics of populations. Here, we present a new coalescent-based, spatially explicit modelling approach to investigate population continuity using aDNA, which includes two fundamental elements neglected in previous methods: population structure and migration. The approach also considers the extensive temporal and geographical variance that is commonly found in aDNA population samples. We first showed that our spatially explicit approach is more conservative than the previous (panmictic) approach and should be preferred to test for population continuity, especially when small and isolated populations are considered. We then applied our method to two mitochondrial datasets from Germany and France, both including modern and ancient lineages dating from the early Neolithic. The results clearly reject population continuity for the maternal line over the last 7500 years for the German dataset but not for the French dataset, suggesting regional heterogeneity in post-Neolithic migratory processes. Here, we demonstrate the benefits of using a spatially explicit method when investigating population continuity with aDNA. It constitutes an improvement over panmictic methods by considering the spatiotemporal dynamics of genetic lineages and the precise location of ancient samples. 
The method can be used to investigate population continuity between any pair of serial samples (ancient-ancient or ancient-modern) and to investigate more complex evolutionary scenarios. Although we based our study on mitochondrial DNA sequences, diploid molecular markers of different types (DNA, SNP, STR) can also be simulated with our approach. It thus constitutes a promising tool for the analysis of the numerous aDNA datasets being produced, including genome wide data, in humans but also in many other species.
Single-Phase Mail Survey Design for Rare Population Subgroups

ERIC Educational Resources Information Center

Brick, J. Michael; Andrews, William R.; Mathiowetz, Nancy A.

2016-01-01

Although using random digit dialing (RDD) telephone samples was the preferred method for conducting surveys of households for many years, declining response and coverage rates have led researchers to explore alternative approaches. The use of address-based sampling (ABS) has been examined for sampling the general population and subgroups, most…
Field-based random sampling without a sampling frame: control selection for a case-control study in rural Africa.

PubMed

Crampin, A C; Mwinuka, V; Malema, S S; Glynn, J R; Fine, P E

2001-01-01

Selection bias, particularly of controls, is common in case-control studies and may materially affect the results. Methods of control selection should be tailored both for the risk factors and disease under investigation and for the population being studied. We present here a control selection method devised for a case-control study of tuberculosis in rural Africa (Karonga, northern Malawi) that selects an age/sex frequency-matched random sample of the population, with a geographical distribution in proportion to the population density. We also present an audit of the selection process, and discuss the potential of this method in other settings.
Mapping cell populations in flow cytometry data for cross‐sample comparison using the Friedman–Rafsky test statistic as a distance measure

PubMed Central

Hsiao, Chiaowen; Liu, Mengya; Stanton, Rick; McGee, Monnie; Qian, Yu

2015-01-01

Abstract Flow cytometry (FCM) is a fluorescence‐based single‐cell experimental technology that is routinely applied in biomedical research for identifying cellular biomarkers of normal physiological responses and abnormal disease states. While many computational methods have been developed that focus on identifying cell populations in individual FCM samples, very few have addressed how the identified cell populations can be matched across samples for comparative analysis. This article presents FlowMap‐FR, a novel method for cell population mapping across FCM samples. FlowMap‐FR is based on the Friedman–Rafsky nonparametric test statistic (FR statistic), which quantifies the equivalence of multivariate distributions. As applied to FCM data by FlowMap‐FR, the FR statistic objectively quantifies the similarity between cell populations based on the shapes, sizes, and positions of fluorescence data distributions in the multidimensional feature space. To test and evaluate the performance of FlowMap‐FR, we simulated the kinds of biological and technical sample variations that are commonly observed in FCM data. The results show that FlowMap‐FR is able to effectively identify equivalent cell populations between samples under scenarios of proportion differences and modest position shifts. As a statistical test, FlowMap‐FR can be used to determine whether the expression of a cellular marker is statistically different between two cell populations, suggesting candidates for new cellular phenotypes by providing an objective statistical measure. In addition, FlowMap‐FR can indicate situations in which inappropriate splitting or merging of cell populations has occurred during gating procedures. We compared the FR statistic with the symmetric version of Kullback–Leibler divergence measure used in a previous population matching method with both simulated and real data. 
The FR statistic outperforms the symmetric version of KL‐distance in distinguishing equivalent from nonequivalent cell populations. FlowMap‐FR was also employed as a distance metric to match cell populations delineated by manual gating across 30 FCM samples from a benchmark FlowCAP data set. An F‐measure of 0.88 was obtained, indicating high precision and recall of the FR‐based population matching results. FlowMap‐FR has been implemented as a standalone R/Bioconductor package so that it can be easily incorporated into current FCM data analytical workflows. © 2015 International Society for Advancement of Cytometry PMID:26274018
Mapping cell populations in flow cytometry data for cross-sample comparison using the Friedman-Rafsky test statistic as a distance measure.

PubMed

Hsiao, Chiaowen; Liu, Mengya; Stanton, Rick; McGee, Monnie; Qian, Yu; Scheuermann, Richard H

2016-01-01

Flow cytometry (FCM) is a fluorescence-based single-cell experimental technology that is routinely applied in biomedical research for identifying cellular biomarkers of normal physiological responses and abnormal disease states. While many computational methods have been developed that focus on identifying cell populations in individual FCM samples, very few have addressed how the identified cell populations can be matched across samples for comparative analysis. This article presents FlowMap-FR, a novel method for cell population mapping across FCM samples. FlowMap-FR is based on the Friedman-Rafsky nonparametric test statistic (FR statistic), which quantifies the equivalence of multivariate distributions. As applied to FCM data by FlowMap-FR, the FR statistic objectively quantifies the similarity between cell populations based on the shapes, sizes, and positions of fluorescence data distributions in the multidimensional feature space. To test and evaluate the performance of FlowMap-FR, we simulated the kinds of biological and technical sample variations that are commonly observed in FCM data. The results show that FlowMap-FR is able to effectively identify equivalent cell populations between samples under scenarios of proportion differences and modest position shifts. As a statistical test, FlowMap-FR can be used to determine whether the expression of a cellular marker is statistically different between two cell populations, suggesting candidates for new cellular phenotypes by providing an objective statistical measure. In addition, FlowMap-FR can indicate situations in which inappropriate splitting or merging of cell populations has occurred during gating procedures. We compared the FR statistic with the symmetric version of Kullback-Leibler divergence measure used in a previous population matching method with both simulated and real data. 
The FR statistic outperforms the symmetric version of KL-distance in distinguishing equivalent from nonequivalent cell populations. FlowMap-FR was also employed as a distance metric to match cell populations delineated by manual gating across 30 FCM samples from a benchmark FlowCAP data set. An F-measure of 0.88 was obtained, indicating high precision and recall of the FR-based population matching results. FlowMap-FR has been implemented as a standalone R/Bioconductor package so that it can be easily incorporated into current FCM data analytical workflows. © The Authors. Published by Wiley Periodicals, Inc. on behalf of ISAC.
Psychological Abuse between Parents: Associations with Child Maltreatment from a Population-Based Sample

ERIC Educational Resources Information Center

Chang, Jen Jen; Theodore, Adrea D.; Martin, Sandra L.; Runyan, Desmond K.

2008-01-01

Objective: This study examined the association between partner psychological abuse and child maltreatment perpetration. Methods: This cross-sectional study examined a population-based sample of mothers with children aged 0-17 years in North and South Carolina (n = 1,149). Mothers were asked about the occurrence of potentially neglectful or abusive…
Estimating the probability that the sample mean is within a desired fraction of the standard deviation of the true mean.

PubMed

Schillaci, Michael A; Schillaci, Mario E

2009-02-01

The use of small sample sizes in human and primate evolutionary research is commonplace. Estimating how well small samples represent the underlying population, however, is not commonplace. Because the accuracy of determinations of taxonomy, phylogeny, and evolutionary process is dependent upon how well the study sample represents the population of interest, characterizing the uncertainty, or potential error, associated with analyses of small sample sizes is essential. We present a method for estimating the probability that the sample mean is within a desired fraction of the standard deviation of the true mean using small (n < 10) or very small (n ≤ 5) sample sizes. This method can be used by researchers to determine post hoc the probability that their sample is a meaningful approximation of the population parameter. We tested the method using a large craniometric data set commonly used by researchers in the field. Given our results, we suggest that sample estimates of the population mean can be reasonable and meaningful even when based on small, and perhaps even very small, sample sizes.
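The abstract does not state the formula the authors use, so the following is only a simplified illustration of the quantity involved. It assumes a normally distributed trait with known standard deviation σ, in which case P(|x̄ − μ| ≤ kσ) = 2Φ(k√n) − 1 = erf(k√(n/2)). The function name is hypothetical, and the published method may differ, for example by working with the sample standard deviation and a t distribution.

```python
import math

def prob_mean_within(k, n):
    """P(|sample mean - true mean| <= k * sigma) for a normal trait.

    Simplifying assumptions (not taken from the abstract): observations are
    independent and normally distributed, and sigma is known.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    if k < 0:
        raise ValueError("k must be non-negative")
    # 2 * Phi(k * sqrt(n)) - 1, written with the error function.
    return math.erf(k * math.sqrt(n / 2.0))

for n in (3, 5, 10):
    print(f"n = {n:2d}: P(within 0.5 sigma) = {prob_mean_within(0.5, n):.3f}")
```

Under these assumptions the probability rises with n for any fixed fraction k, which is the sense in which even small samples can give a usable approximation of the population mean.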
Evaluating the performance of the Lee-Carter method and its variants in modelling and forecasting Malaysian mortality

NASA Astrophysics Data System (ADS)

Zakiyatussariroh, W. H. Wan; Said, Z. Mohammad; Norazan, M. R.

2014-12-01

This study investigated the performance of the Lee-Carter (LC) method and its variants in modeling and forecasting Malaysian mortality. These include the original LC, the Lee-Miller (LM) variant and the Booth-Maindonald-Smith (BMS) variant. These methods were evaluated using Malaysia's mortality data, measured as age-specific death rates (ASDR), for 1971 to 2009 for the overall population, while data for 1980-2009 were used in separate models for the male and female populations. The performance of the variants was examined in terms of the goodness of fit of the models and forecasting accuracy. Comparison was made based on several criteria, namely mean square error (MSE), root mean square error (RMSE), mean absolute deviation (MAD) and mean absolute percentage error (MAPE). The results indicate that the BMS method performed best in in-sample fitting, both for the overall population and when the models were fitted separately for the male and female populations. However, for out-of-sample forecast accuracy, the BMS method was best only when the data were fitted to the overall population. When the data were fitted separately for males and females, LCnone performed better for the male population and the LM method performed well for the female population.
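The four comparison criteria named in this abstract have standard definitions, sketched below for a pair of observed and fitted (or forecast) death-rate series. The function name and the example rates are hypothetical. MAD is taken here to mean the average absolute error, which is an assumption about the authors' usage.

```python
import math

def accuracy_metrics(observed, predicted):
    """Return MSE, RMSE, MAD and MAPE for two equal-length series."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("series must be non-empty and of equal length")
    errors = [o - p for o, p in zip(observed, predicted)]
    n = len(errors)
    mse = sum(e * e for e in errors) / n
    mad = sum(abs(e) for e in errors) / n
    # MAPE is undefined when an observed value is zero.
    mape = 100.0 * sum(abs(e / o) for e, o in zip(errors, observed)) / n
    return {"MSE": mse, "RMSE": math.sqrt(mse), "MAD": mad, "MAPE": mape}

# Hypothetical age-specific death rates, for illustration only.
observed_rates = [0.010, 0.020, 0.040]
fitted_rates = [0.012, 0.018, 0.044]
metrics = accuracy_metrics(observed_rates, fitted_rates)
for name, value in metrics.items():
    print(f"{name}: {value:.6g}")
```

Because death rates span orders of magnitude across ages, MSE and RMSE are dominated by the oldest age groups while MAPE weights all ages by relative error, so the criteria can rank model variants differently.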
Change-in-ratio estimators for populations with more than two subclasses

USGS Publications Warehouse

Udevitz, Mark S.; Pollock, Kenneth H.

1991-01-01

Change-in-ratio methods have been developed to estimate the size of populations with two or three population subclasses. Most of these methods require the often unreasonable assumption of equal sampling probabilities for individuals in all subclasses. This paper presents new models based on the weaker assumption that ratios of sampling probabilities are constant over time for populations with three or more subclasses. Estimation under these models requires that a value be assumed for one of these ratios when there are two samples. Explicit expressions are given for the maximum likelihood estimators under models for two samples with three or more subclasses and for three samples with two subclasses. A numerical method using readily available statistical software is described for obtaining the estimators and their standard errors under all of the models. Likelihood ratio tests that can be used in model selection are discussed. Emphasis is on the two-sample, three-subclass models for which Monte-Carlo simulation results and an illustrative example are presented.
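For context, the classic two-sample, two-subclass change-in-ratio estimator that this paper generalizes can be written as N₁ = (R_x − R·p₂) / (p₁ − p₂), where p₁ and p₂ are the proportions of subclass x before and after a known removal, R_x is the number of x-type animals removed, and R is the total removed. The sketch below implements only that classic form, which relies on the equal-sampling-probability assumption the abstract describes as often unreasonable. It does not implement the paper's new models, and the function name and example numbers are hypothetical.

```python
def change_in_ratio_estimate(p_before, p_after, removed_x, removed_total):
    """Classic two-subclass change-in-ratio estimate of initial population size.

    p_before, p_after -- estimated proportion of subclass x before/after removal
    removed_x         -- number of subclass-x animals removed
    removed_total     -- total number of animals removed
    Assumes equal sampling probabilities for both subclasses.
    """
    if p_before == p_after:
        raise ValueError("proportions must differ for the estimator to exist")
    return (removed_x - removed_total * p_after) / (p_before - p_after)

# Hypothetical example: 1000 animals, 400 of type x; 200 x and 50 y removed.
p1 = 400 / 1000
p2 = (400 - 200) / (1000 - 250)
n_hat = change_in_ratio_estimate(p1, p2, removed_x=200, removed_total=250)
print(f"estimated initial population: {n_hat:.1f}")
```

With exact proportions the estimator recovers the true size. In practice p₁ and p₂ come from samples, and the estimate becomes unstable when the removal changes the ratio only slightly.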

Men's and Women's Health Beliefs Differentially Predict Coronary Heart Disease Incidence in a Population-Based Sample

ERIC Educational Resources Information Center

Korin, Maya Rom; Chaplin, William F.; Shaffer, Jonathan A.; Butler, Mark J.; Ojie, Mary-Jane; Davidson, Karina W.

2013-01-01

Objective: To examine gender differences in the association between beliefs in heart disease preventability and 10-year incidence of coronary heart disease (CHD) in a population-based sample. Methods: A total of 2,688 noninstitutionalized Nova Scotians without prior CHD enrolled in the Nova Scotia Health Study (NSHS95) and were followed for 10…
What a drop can do: dried blood spots as a minimally invasive method for integrating biomarkers into population-based research.

PubMed

McDade, Thomas W; Williams, Sharon; Snodgrass, J Josh

2007-11-01

Logistical constraints associated with the collection and analysis of biological samples in community-based settings have been a significant impediment to integrative, multilevel bio-demographic and biobehavioral research. However, recent methodological developments have overcome many of these constraints and have also expanded the options for incorporating biomarkers into population-based health research in international as well as domestic contexts. In particular, using dried blood spot (DBS) samples (drops of whole blood collected on filter paper from a simple finger prick) provides a minimally invasive method for collecting blood samples in nonclinical settings. After a brief discussion of biomarkers more generally, we review procedures for collecting, handling, and analyzing DBS samples. Advantages of using DBS samples compared with venipuncture include the relative ease and low cost of sample collection, transport, and storage. Disadvantages include requirements for assay development and validation as well as the relatively small volumes of sample. We present the results of a comprehensive literature review of published protocols for analysis of DBS samples, and we provide more detailed analysis of protocols for 45 analytes likely to be of particular relevance to population-level health research. Our objective is to provide investigators with the information they need to make informed decisions regarding the appropriateness of blood spot methods for their research interests.
Multiple data sources improve DNA-based mark-recapture population estimates of grizzly bears.

PubMed

Boulanger, John; Kendall, Katherine C; Stetz, Jeffrey B; Roon, David A; Waits, Lisette P; Paetkau, David

2008-04-01

A fundamental challenge to estimating population size with mark-recapture methods is heterogeneous capture probabilities and subsequent bias of population estimates. Confronting this problem usually requires substantial sampling effort that can be difficult to achieve for some species, such as carnivores. We developed a methodology that uses two data sources to deal with heterogeneity and applied this to DNA mark-recapture data from grizzly bears (Ursus arctos). We improved population estimates by incorporating additional DNA "captures" of grizzly bears obtained by collecting hair from unbaited bear rub trees concurrently with baited, grid-based, hair snag sampling. We consider a Lincoln-Petersen estimator with hair snag captures as the initial session and rub tree captures as the recapture session and develop an estimator in program MARK that treats hair snag and rub tree samples as successive sessions. Using empirical data from a large-scale project in the greater Glacier National Park, Montana, USA, area and simulation modeling we evaluate these methods and compare the results to hair-snag-only estimates. Empirical results indicate that, compared with hair-snag-only data, the joint hair-snag-rub-tree methods produce similar but more precise estimates if capture and recapture rates are reasonably high for both methods. Simulation results suggest that estimators are potentially affected by correlation of capture probabilities between sample types in the presence of heterogeneity. Overall, closed population Huggins-Pledger estimators showed the highest precision and were most robust to sparse data, heterogeneity, and capture probability correlation among sampling types. Results also indicate that these estimators can be used when a segment of the population has zero capture probability for one of the methods. We propose that this general methodology may be useful for other species in which mark-recapture data are available from multiple sources.
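The Lincoln-Petersen estimator mentioned in this abstract treats one data source as the marking session and the other as the recapture session. A minimal sketch follows, together with Chapman's bias-corrected form, which is a standard variant added here for comparison and is not mentioned in the abstract. The function names and counts are hypothetical, and the program MARK and Huggins-Pledger models used in the study are not reproduced.

```python
def lincoln_petersen(n_first, n_second, n_both):
    """Lincoln-Petersen estimate: N = n1 * n2 / m2."""
    if n_both <= 0:
        raise ValueError("at least one individual must appear in both sessions")
    return n_first * n_second / n_both

def chapman(n_first, n_second, n_both):
    """Chapman's bias-corrected variant: (n1 + 1)(n2 + 1)/(m2 + 1) - 1."""
    return (n_first + 1) * (n_second + 1) / (n_both + 1) - 1

# Hypothetical counts of genotyped individuals, for illustration only.
hair_snag_bears = 100   # identified at baited hair snag sites (first session)
rub_tree_bears = 80     # identified at unbaited rub trees (second session)
in_both = 40            # individuals identified by both methods

lp = lincoln_petersen(hair_snag_bears, rub_tree_bears, in_both)
ch = chapman(hair_snag_bears, rub_tree_bears, in_both)
print(f"Lincoln-Petersen: {lp:.1f}, Chapman: {ch:.1f}")
```

Both forms assume that every animal has the same capture probability within a session and that capture by one method is independent of capture by the other. The abstract's simulation results on correlated capture probabilities describe what happens when that independence fails.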
Population clustering based on copy number variations detected from next generation sequencing data.

PubMed

Duan, Junbo; Zhang, Ji-Gang; Wan, Mingxi; Deng, Hong-Wen; Wang, Yu-Ping

2014-08-01

Copy number variations (CNVs) can be used as significant bio-markers, and next generation sequencing (NGS) provides high-resolution detection of these CNVs. However, extracting features from CNVs and applying them to genomic studies such as population clustering remains a major challenge. In this paper, we propose a novel method for population clustering based on CNVs from NGS. First, CNVs are extracted from each sample to form a feature matrix. Then, this feature matrix is decomposed into a source matrix and a weight matrix with non-negative matrix factorization (NMF). The source matrix consists of common CNVs that are shared by all the samples from the same group, and the weight matrix indicates the corresponding level of CNVs from each sample. Therefore, using NMF of CNVs one can differentiate samples from different ethnic groups, i.e. perform population clustering. To validate the approach, we applied it to the analysis of both simulation data and two real data sets from the 1000 Genomes Project. The results on simulation data demonstrate that the proposed method can recover the true common CNVs with high quality. The results on the first real data analysis show that the proposed method can cluster two family trios with different ancestries into two ethnic groups, and the results on the second real data analysis show that the proposed method can be applied to whole-genome data with a large sample size consisting of multiple groups. Both results demonstrate the potential of the proposed method for population clustering.
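The decomposition step described above can be sketched with the standard Lee-Seung multiplicative updates for NMF. This is a minimal pure-Python sketch, not the authors' implementation: the orientation (rows of `V` are samples, columns are CNV features), the toy matrix, and all function names are assumptions made for illustration.

```python
import random


def matmul(A, B):
    """Plain list-of-lists matrix product."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def transpose(A):
    return [list(col) for col in zip(*A)]


def frobenius_error(V, W, H):
    """Frobenius norm of V - W H."""
    WH = matmul(W, H)
    return sum((v - x) ** 2 for rv, rx in zip(V, WH) for v, x in zip(rv, rx)) ** 0.5


def nmf(V, rank, iters=200, seed=0, eps=1e-9):
    """Factor V (samples x features) into W (weights) and H (sources), all non-negative."""
    rng = random.Random(seed)
    n, m = len(V), len(V[0])
    W = [[rng.random() + 0.1 for _ in range(rank)] for _ in range(n)]
    H = [[rng.random() + 0.1 for _ in range(m)] for _ in range(rank)]
    for _ in range(iters):
        # H <- H * (W^T V) / (W^T W H), element-wise
        Wt = transpose(W)
        num, den = matmul(Wt, V), matmul(matmul(Wt, W), H)
        H = [[h * a / (b + eps) for h, a, b in zip(rh, ra, rb)]
             for rh, ra, rb in zip(H, num, den)]
        # W <- W * (V H^T) / (W H H^T), element-wise
        Ht = transpose(H)
        num, den = matmul(V, Ht), matmul(W, matmul(H, Ht))
        W = [[w * a / (b + eps) for w, a, b in zip(rw, ra, rb)]
             for rw, ra, rb in zip(W, num, den)]
    return W, H


# Toy feature matrix (hypothetical): two groups of samples with disjoint CNV patterns.
V = [[1, 1, 1, 0, 0, 0],
     [2, 2, 2, 0, 0, 0],
     [0, 0, 0, 1, 1, 1],
     [0, 0, 0, 3, 3, 3]]
W, H = nmf(V, rank=2)
# Assign each sample to the source on which it has the largest weight.
labels = [max(range(2), key=lambda k: row[k]) for row in W]
print(labels, round(frobenius_error(V, W, H), 4))
```

Assigning each sample to its dominant source is one simple way to turn the weight matrix into cluster labels; the paper may use a different rule.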
A robust and efficient statistical method for genetic association studies using case and control samples from multiple cohorts

PubMed Central

2013-01-01

Background The theoretical basis of genome-wide association studies (GWAS) is statistical inference of linkage disequilibrium (LD) between any polymorphic marker and a putative disease locus. Most methods widely implemented for such analyses are vulnerable to several key demographic factors, delivering poor statistical power for detecting genuine associations and a high false positive rate. Here, we present a likelihood-based statistical approach that properly accounts for the non-random nature of case–control samples with regard to the genotypic distribution at the loci in the populations under study and confers flexibility to test for genetic association in the presence of confounding factors such as population structure and non-randomness of samples. Results We implemented this novel method together with several popular methods from the GWAS literature to re-analyze recently published Parkinson’s disease (PD) case–control samples. The real data analysis and computer simulation show that the new method confers not only significantly improved statistical power for detecting associations but also robustness to the difficulties stemming from non-random sampling and genetic structure when compared to its rivals. In particular, the new method detected 44 significant SNPs within 25 chromosomal regions of size < 1 Mb, but only 6 SNPs in two of these regions were previously detected by the trend-test-based methods. It discovered two SNPs located 1.18 Mb and 0.18 Mb from the PD candidates, FGF20 and PARK8, without invoking false positive risk. Conclusions We developed a novel likelihood-based method that provides adequate estimation of LD and other population model parameters using case and control samples, eases the integration of such samples from multiple genetically divergent populations, and thus confers statistically robust and powerful analyses of GWAS. On the basis of simulation studies and analysis of real datasets, we demonstrated significant improvement of the new method over the non-parametric trend test, which is the most widely implemented method in the GWAS literature. PMID:23394771
General Constraints on Sampling Wildlife on FIA Plots

Treesearch

Larissa L. Bailey; John R. Sauer; James D. Nichols; Paul H. Geissler

2005-01-01

This paper reviews the constraints to sampling wildlife populations at FIA points. Wildlife sampling programs must have well-defined goals and provide information adequate to meet those goals. Investigators should choose a State variable based on information needs and the spatial sampling scale. We discuss estimation-based methods for three State variables: species...
Estimating Kinship in Admixed Populations

PubMed Central

Thornton, Timothy; Tang, Hua; Hoffmann, Thomas J.; Ochs-Balcom, Heather M.; Caan, Bette J.; Risch, Neil

2012-01-01

Genome-wide association studies (GWASs) are commonly used for the mapping of genetic loci that influence complex traits. A problem that is often encountered in both population-based and family-based GWASs is that of identifying cryptic relatedness and population stratification because it is well known that failure to appropriately account for both pedigree and population structure can lead to spurious association. A number of methods have been proposed for identifying relatives in samples from homogeneous populations. A strong assumption of population homogeneity, however, is often untenable, and many GWASs include samples from structured populations. Here, we consider the problem of estimating relatedness in structured populations with admixed ancestry. We propose a method, REAP (relatedness estimation in admixed populations), for robust estimation of identity by descent (IBD)-sharing probabilities and kinship coefficients in admixed populations. REAP appropriately accounts for population structure and ancestry-related assortative mating by using individual-specific allele frequencies at SNPs that are calculated on the basis of ancestry derived from whole-genome analysis. In simulation studies with related individuals and admixture from highly divergent populations, we demonstrate that REAP gives accurate IBD-sharing probabilities and kinship coefficients. We apply REAP to the Mexican Americans in Los Angeles, California (MXL) population sample of release 3 of phase III of the International Haplotype Map Project; in this sample, we identify third- and fourth-degree relatives who have not previously been reported. We also apply REAP to the African American and Hispanic samples from the Women's Health Initiative SNP Health Association Resource (WHI-SHARe) study, in which hundreds of pairs of cryptically related individuals have been identified. PMID:22748210
A comparison of respondent-driven and venue-based sampling of female sex workers in Liuzhou, China

PubMed Central

Weir, Sharon S; Merli, M Giovanna; Li, Jing; Gandhi, Anisha D; Neely, William W; Edwards, Jessie K; Suchindran, Chirayath M; Henderson, Gail E; Chen, Xiang-Sheng

2012-01-01

Objectives To compare two methods for sampling female sex workers (FSWs) for bio-behavioural surveillance. We compared the populations of sex workers recruited by the venue-based Priorities for Local AIDS Control Efforts (PLACE) method and a concurrently implemented network-based sampling method, respondent-driven sampling (RDS), in Liuzhou, China. Methods For the PLACE protocol, all female workers at a stratified random sample of venues identified as places where people meet new sexual partners were interviewed and tested for syphilis. Female workers who reported sex work in the past 4 weeks were categorised as FSWs. RDS used peer recruitment and chain referral to obtain a sample of FSWs. Data were collected between October 2009 and January 2010. We compared the socio-demographic characteristics and the percentage with a positive syphilis test of FSWs recruited by PLACE and RDS. Results The prevalence of a positive syphilis test was 24% among FSWs recruited by PLACE and 8.5% among those recruited by RDS and tested (prevalence ratio 3.3; 95% CI 1.5 to 7.2). Socio-demographic characteristics (age, residence and monthly income) also varied by sampling method. PLACE recruited fewer FSWs than RDS (161 vs 583), was more labour-intensive and had difficulty gaining access to some venues. RDS was more likely to recruit from areas near the RDS office and from large low prevalence entertainment venues. Conclusions Surveillance protocols using different sampling methods can obtain different estimates of prevalence and population characteristics. Venue-based and network-based methods each have strengths and limitations reflecting differences in design and assumptions. We recommend that more research be conducted on measuring bias in bio-behavioural surveillance. PMID:23172350
Robust inference of population structure for ancestry prediction and correction of stratification in the presence of relatedness.

PubMed

Conomos, Matthew P; Miller, Michael B; Thornton, Timothy A

2015-05-01

Population structure inference with genetic data has been motivated by a variety of applications in population genetics and genetic association studies. Several approaches have been proposed for the identification of genetic ancestry differences in samples where study participants are assumed to be unrelated, including principal components analysis (PCA), multidimensional scaling (MDS), and model-based methods for proportional ancestry estimation. Many genetic studies, however, include individuals with some degree of relatedness, and existing methods for inferring genetic ancestry fail in related samples. We present a method, PC-AiR, for robust population structure inference in the presence of known or cryptic relatedness. PC-AiR utilizes genome-screen data and an efficient algorithm to identify a diverse subset of unrelated individuals that is representative of all ancestries in the sample. The PC-AiR method directly performs PCA on the identified ancestry representative subset and then predicts components of variation for all remaining individuals based on genetic similarities. In simulation studies and in applications to real data from Phase III of the HapMap Project, we demonstrate that PC-AiR provides a substantial improvement over existing approaches for population structure inference in related samples. We also demonstrate significant efficiency gains, where a single axis of variation from PC-AiR provides better prediction of ancestry in a variety of structure settings than using 10 (or more) components of variation from widely used PCA and MDS approaches. Finally, we illustrate that PC-AiR can provide improved population stratification correction over existing methods in genetic association studies with population structure and relatedness. © 2015 WILEY PERIODICALS, INC.
Methods of Suicide among Cancer Patients: A Nationwide Population-Based Study

ERIC Educational Resources Information Center

Chung, Kuo-Hsuan; Lin, Herng-Ching

2010-01-01

A 3-year nationwide population-based data set was used to explore methods of suicide (violent vs. nonviolent) and possible contributing factors among cancer patients in Taiwan. A total of 1,065 cancer inpatients who committed suicide were included as our study sample. The regression shows that those who had genitourinary cancer were 0.55 times (p…
Monitoring larval populations of the Douglas-fir tussock moth and the western spruce budworm on permanent plots: sampling methods and statistical properties of data

Treesearch

A.R. Mason; H.G. Paul

1994-01-01

Procedures for monitoring larval populations of the Douglas-fir tussock moth and the western spruce budworm are recommended based on many years experience in sampling these species in eastern Oregon and Washington. It is shown that statistically reliable estimates of larval density can be made for a population by sampling host trees in a series of permanent plots in a...
Efficient computation of the joint sample frequency spectra for multiple populations.

PubMed

Kamm, John A; Terhorst, Jonathan; Song, Yun S

2017-01-01

A wide range of studies in population genetics have employed the sample frequency spectrum (SFS), a summary statistic which describes the distribution of mutant alleles at a polymorphic site in a sample of DNA sequences and provides a highly efficient dimensional reduction of large-scale population genomic variation data. Recently, there has been much interest in analyzing the joint SFS data from multiple populations to infer parameters of complex demographic histories, including variable population sizes, population split times, migration rates, admixture proportions, and so on. SFS-based inference methods require accurate computation of the expected SFS under a given demographic model. Although much methodological progress has been made, existing methods suffer from numerical instability and high computational complexity when multiple populations are involved and the sample size is large. In this paper, we present new analytic formulas and algorithms that enable accurate, efficient computation of the expected joint SFS for thousands of individuals sampled from hundreds of populations related by a complex demographic model with arbitrary population size histories (including piecewise-exponential growth). Our results are implemented in a new software package called momi (MOran Models for Inference). Through an empirical study we demonstrate our improvements to numerical stability and computational complexity.
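For readers unfamiliar with the statistic, the sketch below computes an observed single-population SFS from a small 0/1 haplotype matrix. It illustrates only the definition given in the abstract; it is not the momi algorithm, which computes the expected joint SFS under a demographic model. The function name and the toy data are hypothetical.

```python
from collections import Counter


def sample_frequency_spectrum(haplotypes):
    """Observed SFS of n sampled sequences.

    haplotypes: list of equal-length 0/1 lists, one per sequence,
                where 1 marks the mutant (derived) allele at a site.
    Returns a list whose entry k-1 is the number of sites at which
    exactly k of the n sequences carry the mutant allele (k = 1..n-1);
    monomorphic sites (k = 0 or k = n) are excluded.
    """
    n = len(haplotypes)
    counts = Counter(sum(site) for site in zip(*haplotypes))
    return [counts.get(k, 0) for k in range(1, n)]


# Toy data (hypothetical): 4 sequences, 5 sites.
haps = [[1, 0, 1, 1, 0],
        [0, 0, 1, 1, 0],
        [0, 0, 0, 1, 1],
        [0, 1, 0, 1, 1]]
print(sample_frequency_spectrum(haps))  # [2, 2, 0]
```

The joint SFS for several populations generalizes this to a multi-dimensional array indexed by the mutant count in each population, which is why its size and the cost of computing its expectation grow quickly with the number of populations.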
Efficient computation of the joint sample frequency spectra for multiple populations

PubMed Central

Kamm, John A.; Terhorst, Jonathan; Song, Yun S.

2016-01-01

A wide range of studies in population genetics have employed the sample frequency spectrum (SFS), a summary statistic which describes the distribution of mutant alleles at a polymorphic site in a sample of DNA sequences and provides a highly efficient dimensional reduction of large-scale population genomic variation data. Recently, there has been much interest in analyzing the joint SFS data from multiple populations to infer parameters of complex demographic histories, including variable population sizes, population split times, migration rates, admixture proportions, and so on. SFS-based inference methods require accurate computation of the expected SFS under a given demographic model. Although much methodological progress has been made, existing methods suffer from numerical instability and high computational complexity when multiple populations are involved and the sample size is large. In this paper, we present new analytic formulas and algorithms that enable accurate, efficient computation of the expected joint SFS for thousands of individuals sampled from hundreds of populations related by a complex demographic model with arbitrary population size histories (including piecewise-exponential growth). Our results are implemented in a new software package called momi (MOran Models for Inference). Through an empirical study we demonstrate our improvements to numerical stability and computational complexity. PMID:28239248
Gaps in Survey Data on Cancer in American Indian and Alaska Native Populations: Examination of US Population Surveys, 1960–2010

PubMed Central

Duran, Tinka; Stimpson, Jim P.; Smith, Corey

2013-01-01

Introduction Population-based data are essential for quantifying the problems and measuring the progress made by comprehensive cancer control programs. However, cancer information specific to the American Indian/Alaska Native (AI/AN) population is not readily available. We identified major population-based surveys conducted in the United States that contain questions related to cancer, documented the AI/AN sample size in these surveys, and identified gaps in the types of cancer-related information these surveys collect. Methods We conducted an Internet query of US Department of Health and Human Services agency websites and a Medline search to identify population-based surveys conducted in the United States from 1960 through 2010 that contained information about cancer. We used a data extraction form to collect information about the purpose, sample size, data collection methods, and type of information covered in the surveys. Results Seventeen survey sources met the inclusion criteria. Information on access to and use of cancer treatment, follow-up care, and barriers to receiving timely and quality care was not consistently collected. Estimates specific to the AI/AN population were often lacking because of inadequate AI/AN sample size. For example, 9 national surveys reviewed reported an AI/AN sample size smaller than 500, and 10 had an AI/AN sample percentage less than 1.5%. Conclusion Continued efforts are needed to increase the overall number of AI/AN participants in these surveys, improve the quality of information on racial/ethnic background, and collect more information on treatment and survivorship. PMID:23517582
Comparison of base composition analysis and Sanger sequencing of mitochondrial DNA for four U.S. population groups.

PubMed

Kiesler, Kevin M; Coble, Michael D; Hall, Thomas A; Vallone, Peter M

2014-01-01

A set of 711 samples from four U.S. population groups was analyzed using a novel mass spectrometry based method for mitochondrial DNA (mtDNA) base composition profiling. Comparison of the mass spectrometry results with Sanger sequencing derived data yielded a concordance rate of 99.97%. Length heteroplasmy was identified in 46% of samples and point heteroplasmy was observed in 6.6% of samples in the combined mass spectral and Sanger data set. Using discrimination capacity as a metric, Sanger sequencing of the full control region had the highest discriminatory power, followed by the mass spectrometry base composition method, which was more discriminating than Sanger sequencing of just the hypervariable regions. This trend is in agreement with the number of nucleotides covered by each of the three assays. Published by Elsevier Ireland Ltd.
Estimation of pyrethroid pesticide intake using regression ...

EPA Pesticide Factsheets

Population-based estimates of pesticide intake are needed to characterize exposure for particular demographic groups based on their dietary behaviors. Regression modeling performed on measurements of selected pesticides in composited duplicate diet samples allowed (1) estimation of pesticide intakes for a defined demographic community, and (2) comparison of dietary pesticide intakes between the composite and individual samples. Extant databases were useful for assigning individual samples to composites, but they could not provide the breadth of information needed to facilitate measurable levels in every composite. Composite sample measurements were found to be good predictors of pyrethroid pesticide levels in their individual sample constituents where sufficient measurements are available above the method detection limit. Statistical inference shows little evidence of differences between individual and composite measurements and suggests that regression modeling of food groups based on composite dietary samples may provide an effective tool for estimating dietary pesticide intake for a defined population. The research presented in the journal article will improve a community's ability to determine exposures through the dietary route with a less burdensome and costly method.
Caught Ya! A School-Based Practical Activity to Evaluate the Capture-Mark-Release-Recapture Method

ERIC Educational Resources Information Center

Kingsnorth, Crawford; Cruickshank, Chae; Paterson, David; Diston, Stephen

2017-01-01

The capture-mark-release-recapture method provides a simple way to estimate population size. However, when used as part of ecological sampling, this method does not easily allow an opportunity to evaluate the accuracy of the calculation because the actual population size is unknown. Here, we describe a method that can be used to measure the…
A Systematic Evaluation of ADHD and Comorbid Psychopathology in a Population-Based Twin Sample

ERIC Educational Resources Information Center

Volk, Heather E.; Neuman, Rosalind J.; Todd, Richard D.

2005-01-01

Objective: Clinical and population samples demonstrate that attention-deficit/hyperactivity disorder (ADHD) occurs with other disorders. Comorbid disorder clustering within ADHD subtypes is not well studied. Method: Latent class analysis (LCA) examined the co-occurrence of DSM-IV ADHD, oppositional defiant disorder (ODD), conduct disorder (CD),…
ADHD Medication Use in a Population-Based Sample of Twins

ERIC Educational Resources Information Center

Reich, Wendy; Huang, Hongyan; Todd, Richard D.

2006-01-01

Objective: To determine treatment patterns for youth attention-deficit/hyperactivity disorder (ADHD) symptoms in a general population sample of 1,610 twins. Method: Twin pairs ages 7 to 17 years and parents ascertained from birth records in the state of Missouri were interviewed using the Missouri Assessment of Genetics Interview for Children…
General constraints on sampling wildlife on FIA plots

USGS Publications Warehouse

Bailey, L.L.; Sauer, J.R.; Nichols, J.D.; Geissler, P.H.; McRoberts, Ronald E.; Reams, Gregory A.; Van Deusen, Paul C.; McWilliams, William H.; Cieszewski, Chris J.

2005-01-01

This paper reviews the constraints to sampling wildlife populations at FIA points. Wildlife sampling programs must have well-defined goals and provide information adequate to meet those goals. Investigators should choose a State variable based on information needs and the spatial sampling scale. We discuss estimation-based methods for three State variables: species richness, abundance, and patch occupancy. All methods incorporate two essential sources of variation: detectability estimation and spatial variation. FIA sampling imposes specific space and time criteria that may need to be adjusted to meet local wildlife objectives.

Probability Sampling Method for a Hidden Population Using Respondent-Driven Sampling: Simulation for Cancer Survivors.

PubMed

Jung, Minsoo

2015-01-01

When there is no sampling frame within a certain group or the group is concerned that making its population public would bring social stigma, we say the population is hidden. Such populations are difficult to survey because, under probability sampling, response rates are low and members may not answer honestly. The only alternative known to address the problems caused by previous methods such as snowball sampling is respondent-driven sampling (RDS), which was developed by Heckathorn and his colleagues. RDS is based on a Markov chain, and uses the social network information of the respondent. This characteristic allows for probability sampling when we survey a hidden population. We verified through computer simulation whether RDS can be used on a hidden population of cancer survivors. According to the simulation results, the bias from chain-referral sampling in RDS tends to diminish as the sample gets bigger, and the estimates stabilize as the waves progress. This shows that the final sample can become completely independent of the initial seeds if a sufficient sample size is secured, even if the initial seeds were selected through convenience sampling. Thus, RDS can be considered an alternative that improves upon both key informant sampling and ethnographic surveys, and it should be applied more widely in domestic studies as well.
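The claim that the final sample becomes independent of the initial seeds follows from the Markov chain view of recruitment. The sketch below is a minimal illustration under assumed conditions: two hypothetical groups and a made-up recruitment matrix `P`, where `P[i][j]` is the probability that a recruiter in group i recruits someone in group j. It is not the simulation used in the study.

```python
def next_wave(dist, P):
    """Group composition of the next recruitment wave."""
    return [sum(dist[i] * P[i][j] for i in range(len(dist))) for j in range(len(P[0]))]


def composition_by_wave(seed_dist, P, waves):
    """Expected group composition for wave 0 (the seeds) through the last wave."""
    history = [list(seed_dist)]
    for _ in range(waves):
        history.append(next_wave(history[-1], P))
    return history


# Hypothetical recruitment probabilities between two groups.
P = [[0.7, 0.3],
     [0.4, 0.6]]

# Two extreme choices of seeds: all from group 0, or all from group 1.
from_group_0 = composition_by_wave([1.0, 0.0], P, waves=10)
from_group_1 = composition_by_wave([0.0, 1.0], P, waves=10)

for wave in (0, 1, 3, 10):
    print(wave,
          [round(x, 4) for x in from_group_0[wave]],
          [round(x, 4) for x in from_group_1[wave]])
```

For this matrix both chains approach the same equilibrium composition (4/7, 3/7), so after enough waves the composition no longer reflects how the seeds were chosen. Real RDS estimators additionally weight respondents by network degree, which this sketch omits.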
Gradient-free MCMC methods for dynamic causal modelling

DOE PAGES

Sengupta, Biswa; Friston, Karl J.; Penny, Will D.

2015-03-14

Here, we compare the performance of four gradient-free MCMC samplers (random walk Metropolis sampling, slice-sampling, adaptive MCMC sampling and population-based MCMC sampling with tempering) in terms of the number of independent samples they can produce per unit computational time. For the Bayesian inversion of a single-node neural mass model, both adaptive and population-based samplers are more efficient compared with random walk Metropolis sampler or slice-sampling; yet adaptive MCMC sampling is more promising in terms of compute time. Slice-sampling yields the highest number of independent samples from the target density -- albeit at almost 1000% increase in computational time, in comparison to the most efficient algorithm (i.e., the adaptive MCMC sampler).
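As a point of reference for the baseline sampler named above, here is a minimal random walk Metropolis sketch in one dimension. The target (a standard normal log-density), the step size, and the function names are assumptions chosen for illustration; the paper's neural mass model and its adaptive and population-based samplers are not reproduced here.

```python
import math
import random


def random_walk_metropolis(log_density, x0, n_samples, step=1.0, seed=0):
    """Gradient-free sampler: propose x' = x + N(0, step^2), accept with min(1, p(x')/p(x))."""
    rng = random.Random(seed)
    x, logp = x0, log_density(x0)
    samples, accepted = [], 0
    for _ in range(n_samples):
        proposal = x + rng.gauss(0.0, step)
        logp_new = log_density(proposal)
        # Acceptance probability computed on the log scale for stability.
        if rng.random() < math.exp(min(0.0, logp_new - logp)):
            x, logp = proposal, logp_new
            accepted += 1
        samples.append(x)
    return samples, accepted / n_samples


def standard_normal_logpdf(x):
    """Unnormalized log-density of N(0, 1); the constant cancels in the ratio."""
    return -0.5 * x * x


samples, acceptance_rate = random_walk_metropolis(standard_normal_logpdf, x0=3.0, n_samples=20000)
mean = sum(samples) / len(samples)
variance = sum((s - mean) ** 2 for s in samples) / len(samples)
print(round(mean, 3), round(variance, 3), round(acceptance_rate, 3))
```

Because successive draws are correlated, the number of effectively independent samples is smaller than `n_samples`; that ratio, per unit of compute time, is the efficiency measure the abstract compares across samplers.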
A cautionary note on Bayesian estimation of population size by removal sampling with diffuse priors.

PubMed

Bord, Séverine; Bioche, Christèle; Druilhet, Pierre

2018-05-01

We consider the problem of estimating a population size by removal sampling when the sampling rate is unknown. Bayesian methods are now widespread and allow prior knowledge to be included in the analysis. However, we show that Bayes estimates based on default improper priors lead to improper posteriors or infinite estimates. Similarly, weakly informative priors give unstable estimators that are sensitive to the choice of hyperparameters. By examining the likelihood, we show that population size estimates can be stabilized by penalizing small values of the sampling rate or large values of the population size. Based on theoretical results and simulation studies, we propose some recommendations on the choice of the prior. We then apply our results to real datasets. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
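The instability discussed above can be seen directly in the likelihood. The sketch below assumes the standard removal model, in which each pass catches a Binomial(remaining, p) number of animals with a constant sampling rate p, and profiles the log-likelihood over the population size N. The model choice, the function names, and the catch data are assumptions for illustration, not the authors' code or data.

```python
import math


def removal_log_likelihood(N, p, catches):
    """Log-likelihood of successive removal catches: c_i ~ Binomial(N - removed so far, p)."""
    if N < sum(catches) or not 0.0 < p < 1.0:
        return float("-inf")
    remaining, ll = N, 0.0
    for c in catches:
        log_binom = (math.lgamma(remaining + 1) - math.lgamma(c + 1)
                     - math.lgamma(remaining - c + 1))
        ll += log_binom + c * math.log(p) + (remaining - c) * math.log(1.0 - p)
        remaining -= c
    return ll


def profile_log_likelihood(N, catches):
    """Log-likelihood at N with p replaced by its maximizer, total catch / total animals at risk."""
    if N < sum(catches):
        return float("-inf")
    remaining, at_risk = N, 0
    for c in catches:
        at_risk += remaining
        remaining -= c
    return removal_log_likelihood(N, sum(catches) / at_risk, catches)


def best_N(catches, N_max):
    """Integer N in [total catch, N_max] with the highest profile log-likelihood."""
    return max(range(sum(catches), N_max + 1),
               key=lambda N: profile_log_likelihood(N, catches))


declining = [30, 20, 13]   # hypothetical catches that fall from pass to pass
increasing = [8, 10, 12]   # hypothetical catches that do not decline

print(best_N(declining, 2000))
print(profile_log_likelihood(100, increasing), profile_log_likelihood(10000, increasing))
```

When catches decline, the profile has an interior maximum. When they do not, ever larger N paired with ever smaller p fits at least as well, which is the regime in which diffuse priors give unstable or infinite estimates and where the abstract recommends penalizing small p or large N.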
Systematic sampling for suspended sediment

Treesearch

Robert B. Thomas

1991-01-01

Abstract - Because of high costs or complex logistics, scientific populations cannot be measured entirely and must be sampled. Accepted scientific practice holds that sample selection be based on statistical principles to assure objectivity when estimating totals and variances. Probability sampling--obtaining samples with known probabilities--is the only method that...
Inferring the demographic history from DNA sequences: An importance sampling approach based on non-homogeneous processes.

PubMed

Ait Kaci Azzou, S; Larribe, F; Froda, S

2016-10-01

In Ait Kaci Azzou et al. (2015) we introduced an Importance Sampling (IS) approach for estimating the demographic history of a sample of DNA sequences, the skywis plot. More precisely, we proposed a new nonparametric estimate of a population size that changes over time. We showed on simulated data that the skywis plot can work well in typical situations where the effective population size does not undergo very steep changes. In this paper, we introduce an iterative procedure which extends the previous method and gives good estimates under such rapid variations. In the iterative calibrated skywis plot we approximate the effective population size by a piecewise constant function, whose values are re-estimated at each step. These piecewise constant functions are used to generate the waiting times of non-homogeneous Poisson processes related to a coalescent process with mutation under a variable population size model. Moreover, the present IS procedure is based on a modified version of the Stephens and Donnelly (2000) proposal distribution. Finally, we apply the iterative calibrated skywis plot method to a simulated data set from a rapidly expanding exponential model, and we show that the method based on this new IS strategy correctly reconstructs the demographic history. Copyright © 2016. Published by Elsevier Inc.
An empirical comparison of isolate-based and sample-based definitions of antimicrobial resistance and their effect on estimates of prevalence.

PubMed

Humphry, R W; Evans, J; Webster, C; Tongue, S C; Innocent, G T; Gunn, G J

2018-02-01

Antimicrobial resistance is primarily a problem in human medicine but there are unquantified links of transmission in both directions between animal and human populations. Quantitative assessment of the costs and benefits of reduced antimicrobial usage in livestock requires robust quantification of transmission of resistance between animals, the environment and the human population. This in turn requires appropriate measurement of resistance. To tackle this we selected two different methods for determining whether a sample is resistant - one based on screening a sample, the other on testing individual isolates. Our overall objective was to explore the differences arising from choice of measurement. A literature search demonstrated the widespread use of testing of individual isolates. The first aim of this study was to compare, quantitatively, sample level and isolate level screening. Cattle or sheep faecal samples (n=41) submitted for routine parasitology were tested for antimicrobial resistance in two ways: (1) "streak" direct culture onto plates containing the antimicrobial of interest; (2) determination of minimum inhibitory concentration (MIC) of 8-10 isolates per sample compared to published MIC thresholds. Two antibiotics (ampicillin and nalidixic acid) were tested. With ampicillin, direct culture resulted in more than double the number of resistant samples than the MIC method based on eight individual isolates. The second aim of this study was to demonstrate the utility of the observed relationship between these two measures of antimicrobial resistance to re-estimate the prevalence of antimicrobial resistance from a previous study, in which we had used "streak" cultures. Boot-strap methods were used to estimate the proportion of samples that would have tested resistant in the historic study, had we used the isolate-based MIC method instead. 
Our boot-strap results indicate that our estimates of prevalence of antimicrobial resistance would have been considerably lower in the historic study had the MIC method been used. Finally we conclude that there is no single way of defining a sample as resistant to an antimicrobial agent. The method used greatly affects the estimated prevalence of antimicrobial resistance in a sampled population of animals, thus potentially resulting in misleading results. Comparing methods on the same samples allows us to re-estimate the prevalence from other studies, had other methods for determining resistance been used. The results of this study highlight the importance of establishing what the most appropriate measure of antimicrobial resistance is, for the proposed purpose of the results. Copyright © 2017 Elsevier B.V. All rights reserved.
Influence of population versus convenience sampling on sample characteristics in studies of cognitive aging.

PubMed

Brodaty, Henry; Mothakunnel, Annu; de Vel-Palumbo, Melissa; Ames, David; Ellis, Kathryn A; Reppermund, Simone; Kochan, Nicole A; Savage, Greg; Trollor, Julian N; Crawford, John; Sachdev, Perminder S

2014-01-01

We examined whether differences in findings of studies examining mild cognitive impairment (MCI) were associated with recruitment methods by comparing sample characteristics in two contemporaneous Australian studies, using population-based and convenience sampling. The Sydney Memory and Aging Study invited participants randomly from the electoral roll in defined geographic areas in Sydney. The Australian Imaging, Biomarkers and Lifestyle Study of Ageing recruited cognitively normal (CN) individuals via media appeals and MCI participants via referrals from clinicians in Melbourne and Perth. Demographic and cognitive variables were harmonized, and similar diagnostic criteria were applied to both samples retrospectively. CN participants recruited via convenience sampling were younger, better educated, more likely to be married and have a family history of dementia, and performed better cognitively than those recruited via population-based sampling. MCI participants recruited via population-based sampling had better memory performance and were less likely to carry the apolipoprotein E ε4 allele than clinically referred participants but did not differ on other demographic variables. A convenience sample of normal controls is likely to be younger and better functioning and that of an MCI group likely to perform worse than a purportedly random sample. Sampling bias should be considered when interpreting findings. Copyright © 2014 Elsevier Inc. All rights reserved.
Sequential sampling of ribes populations in the control of white pine blister rust (Cronartium ribicola Fischer) in California

Treesearch

Harold R. Offord

1966-01-01

Sequential sampling based on a negative binomial distribution of ribes populations required less than half the time taken by regular systematic line transect sampling in a comparison test. It gave the same control decision as the regular method in 9 of 13 field trials. A computer program that permits sequential plans to be built readily for other white pine regions is...
Unsupervised discovery of microbial population structure within metagenomes using nucleotide base composition

PubMed Central

Saeed, Isaam; Tang, Sen-Lin; Halgamuge, Saman K.

2012-01-01

An approach to infer the unknown microbial population structure within a metagenome is to cluster nucleotide sequences based on common patterns in base composition, otherwise referred to as binning. When functional roles are assigned to the identified populations, a deeper understanding of microbial communities can be attained, more so than gene-centric approaches that explore overall functionality. In this study, we propose an unsupervised, model-based binning method with two clustering tiers, which uses a novel transformation of the oligonucleotide frequency-derived error gradient and GC content to generate coarse groups at the first tier of clustering; and tetranucleotide frequency to refine these groups at the secondary clustering tier. The proposed method has a demonstrated improvement over PhyloPythia, S-GSOM, TACOA and TaxSOM on all three benchmarks that were used for evaluation in this study. The proposed method is then applied to a pyrosequenced metagenomic library of mud volcano sediment sampled in southwestern Taiwan, with the inferred population structure validated against complementary sequencing of 16S ribosomal RNA marker genes. Finally, the proposed method was further validated against four publicly available metagenomes, including a highly complex Antarctic whale-fall bone sample, which was previously assumed to be too complex for binning prior to functional analysis. PMID:22180538
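The two base-composition features named in the abstract above (GC content and tetranucleotide frequency) can be illustrated with a minimal sketch. It does not reproduce the authors' binning method or their error-gradient transformation; the function names `gc_content` and `tetranucleotide_frequencies` are hypothetical, and the code only shows how such feature vectors might be computed before any clustering step.

```python
from collections import Counter
from itertools import product


def gc_content(seq: str) -> float:
    """Fraction of G and C among the unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        return 0.0
    return (seq.count("G") + seq.count("C")) / acgt


def tetranucleotide_frequencies(seq: str) -> dict:
    """Relative frequency of each of the 256 possible 4-mers.

    Overlapping windows are counted; windows containing ambiguous
    bases (anything outside A, C, G, T) are ignored.
    """
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=4)]
    counts = Counter(seq[i:i + 4] for i in range(len(seq) - 3))
    total = sum(counts[k] for k in kmers)
    return {k: (counts[k] / total if total else 0.0) for k in kmers}


# Hypothetical toy sequence, for illustration only.
freqs = tetranucleotide_frequencies("ACGTACGT")
print(gc_content("ACGTACGT"), freqs["ACGT"])
```

Each sequence then becomes a 256-dimensional vector (plus GC content) that a clustering algorithm of the reader's choice could group; which algorithm to use is outside the scope of this sketch.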
Sampling considerations for disease surveillance in wildlife populations

USGS Publications Warehouse

Nusser, S.M.; Clark, W.R.; Otis, D.L.; Huang, L.

2008-01-01

Disease surveillance in wildlife populations involves detecting the presence of a disease, characterizing its prevalence and spread, and subsequent monitoring. A probability sample of animals selected from the population and corresponding estimators of disease prevalence and detection provide estimates with quantifiable statistical properties, but this approach is rarely used. Although wildlife scientists often assume probability sampling and random disease distributions to calculate sample sizes, convenience samples (i.e., samples of readily available animals) are typically used, and disease distributions are rarely random. We demonstrate how landscape-based simulation can be used to explore properties of estimators from convenience samples in relation to probability samples. We used simulation methods to model what is known about the habitat preferences of the wildlife population, the disease distribution, and the potential biases of the convenience-sample approach. Using chronic wasting disease in free-ranging deer (Odocoileus virginianus) as a simple illustration, we show that using probability sample designs with appropriate estimators provides unbiased surveillance parameter estimates but that the selection bias and coverage errors associated with convenience samples can lead to biased and misleading results. We also suggest practical alternatives to convenience samples that mix probability and convenience sampling. For example, a sample of land areas can be selected using a probability design that oversamples areas with larger animal populations, followed by harvesting of individual animals within sampled areas using a convenience sampling method.
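The contrast between probability and convenience samples described above can be sketched with a toy simulation. This is not the authors' landscape-based model: the accessibility share (30%), the two prevalence values, and the function name `simulate_surveillance` are all assumptions chosen only to make the selection-bias mechanism visible.

```python
import random


def simulate_surveillance(seed=1, n_animals=10_000, sample_size=500):
    """Compare a simple random sample with a convenience sample.

    Assumed toy population: 30% of animals live in easily accessible
    habitat where prevalence is 2%; the rest live in remote habitat
    where prevalence is 10%. The convenience sample draws only from
    accessible animals.
    """
    rng = random.Random(seed)
    population = []
    for _ in range(n_animals):
        accessible = rng.random() < 0.3
        prevalence = 0.02 if accessible else 0.10
        infected = rng.random() < prevalence
        population.append((accessible, infected))

    true_prev = sum(inf for _, inf in population) / n_animals

    # Probability design: every animal has the same chance of selection.
    srs = rng.sample(population, sample_size)
    srs_est = sum(inf for _, inf in srs) / sample_size

    # Convenience design: only readily available animals can be sampled.
    reachable = [animal for animal in population if animal[0]]
    conv = rng.sample(reachable, sample_size)
    conv_est = sum(inf for _, inf in conv) / sample_size

    return true_prev, srs_est, conv_est


true_prev, srs_est, conv_est = simulate_surveillance()
print(f"true={true_prev:.3f} random={srs_est:.3f} convenience={conv_est:.3f}")
```

Under these assumptions the convenience estimate tracks prevalence in the accessible habitat rather than in the whole population, which is the coverage error the abstract warns about.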
An elusive paleodemography? A comparison of two methods for estimating the adult age distribution of deaths at late Classic Copan, Honduras.

PubMed

Storey, Rebecca

2007-01-01

Comparison of different adult age estimation methods on the same skeletal sample with unknown ages could forward paleodemographic inference, while researchers sort out various controversies. The original aging method for the auricular surface (Lovejoy et al., 1985a) assigned an age estimation based on several separate characteristics. Researchers have found this original method hard to apply. It is usually forgotten that before assigning an age, there was a seriation, an ordering of all available individuals from youngest to oldest. Thus, age estimation reflected the place of an individual within its sample. A recent article (Buckberry and Chamberlain, 2002) proposed a revised method that scores these various characteristics into age stages, which can then be used with a Bayesian method to estimate an adult age distribution for the sample. Both methods were applied to the adult auricular surfaces of a Pre-Columbian Maya skeletal population from Copan, Honduras and resulted in age distributions with significant numbers of older adults. However, contrary to the usual paleodemographic distribution, one Bayesian estimation based on uniform prior probabilities yielded a population with 57% of the ages at death over 65, while another based on a high mortality life table still had 12% of the individuals aged over 75 years. The seriation method yielded an age distribution more similar to that known from preindustrial historical situations, without excessive longevity of adults. Paleodemography must still wrestle with its elusive goal of accurate adult age estimation from skeletons, a necessary base for demographic study of past populations. (c) 2006 Wiley-Liss, Inc
Minimal-assumption inference from population-genomic data

NASA Astrophysics Data System (ADS)

Weissman, Daniel; Hallatschek, Oskar

Samples of multiple complete genome sequences contain vast amounts of information about the evolutionary history of populations, much of it in the associations among polymorphisms at different loci. Current methods that take advantage of this linkage information rely on models of recombination and coalescence, limiting the sample sizes and populations that they can analyze. We introduce a method, Minimal-Assumption Genomic Inference of Coalescence (MAGIC), that reconstructs key features of the evolutionary history, including the distribution of coalescence times, by integrating information across genomic length scales without using an explicit model of recombination, demography or selection. Using simulated data, we show that MAGIC's performance is comparable to PSMC' on single diploid samples generated with standard coalescent and recombination models. More importantly, MAGIC can also analyze arbitrarily large samples and is robust to changes in the coalescent and recombination processes. Using MAGIC, we show that the inferred coalescence time histories of samples of multiple human genomes exhibit inconsistencies with a description in terms of an effective population size based on single-genome data.
Design-based and model-based inference in surveys of freshwater mollusks

USGS Publications Warehouse

Dorazio, R.M.

1999-01-01

Well-known concepts in statistical inference and sampling theory are used to develop recommendations for planning and analyzing the results of quantitative surveys of freshwater mollusks. Two methods of inference commonly used in survey sampling (design-based and model-based) are described and illustrated using examples relevant in surveys of freshwater mollusks. The particular objectives of a survey and the type of information observed in each unit of sampling can be used to help select the sampling design and the method of inference. For example, the mean density of a sparsely distributed population of mollusks can be estimated with higher precision by using model-based inference or by using design-based inference with adaptive cluster sampling than by using design-based inference with conventional sampling. More experience with quantitative surveys of natural assemblages of freshwater mollusks is needed to determine the actual benefits of different sampling designs and inferential procedures.
Comparison of Address-based Sampling and Random-digit Dialing Methods for Recruiting Young Men as Controls in a Case-Control Study of Testicular Cancer Susceptibility

PubMed Central

Clagett, Bartholt; Nathanson, Katherine L.; Ciosek, Stephanie L.; McDermoth, Monique; Vaughn, David J.; Mitra, Nandita; Weiss, Andrew; Martonik, Rachel; Kanetsky, Peter A.

2013-01-01

Random-digit dialing (RDD) using landline telephone numbers is the historical gold standard for control recruitment in population-based epidemiologic research. However, increasing cell-phone usage and diminishing response rates suggest that the effectiveness of RDD in recruiting a random sample of the general population, particularly for younger target populations, is decreasing. In this study, we compared landline RDD with alternative methods of control recruitment, including RDD using cell-phone numbers and address-based sampling (ABS), to recruit primarily white men aged 18–55 years into a study of testicular cancer susceptibility conducted in the Philadelphia, Pennsylvania, metropolitan area between 2009 and 2012. With few exceptions, eligible and enrolled controls recruited by means of RDD and ABS were similar with regard to characteristics for which data were collected on the screening survey. While we find ABS to be a comparably effective method of recruiting young males compared with landline RDD, we acknowledge the potential impact that selection bias may have had on our results because of poor overall response rates, which ranged from 11.4% for landline RDD to 1.7% for ABS. PMID:24008901
Gradient-free MCMC methods for dynamic causal modelling.

PubMed

Sengupta, Biswa; Friston, Karl J; Penny, Will D

2015-05-15

In this technical note we compare the performance of four gradient-free MCMC samplers (random walk Metropolis sampling, slice-sampling, adaptive MCMC sampling and population-based MCMC sampling with tempering) in terms of the number of independent samples they can produce per unit computational time. For the Bayesian inversion of a single-node neural mass model, both adaptive and population-based samplers are more efficient compared with random walk Metropolis sampler or slice-sampling; yet adaptive MCMC sampling is more promising in terms of compute time. Slice-sampling yields the highest number of independent samples from the target density - albeit at almost 1000% increase in computational time, in comparison to the most efficient algorithm (i.e., the adaptive MCMC sampler). Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
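Of the four samplers compared above, random walk Metropolis is the simplest to write down. The sketch below is a generic textbook version applied to a standard normal target, not the authors' implementation and not a dynamic causal model; the function name `random_walk_metropolis`, the step size, and the target density are illustrative assumptions.

```python
import math
import random


def random_walk_metropolis(log_density, x0, n_samples, step=1.0, seed=0):
    """Gradient-free sampler: propose x + Normal(0, step), accept or reject.

    Only evaluations of the (unnormalised) log density are needed.
    Returns the chain and the fraction of accepted proposals.
    """
    rng = random.Random(seed)
    x = x0
    logp = log_density(x)
    samples = []
    accepted = 0
    for _ in range(n_samples):
        proposal = x + rng.gauss(0.0, step)
        logp_new = log_density(proposal)
        # Metropolis rule: accept with probability min(1, p_new / p_old).
        if rng.random() < math.exp(min(0.0, logp_new - logp)):
            x, logp = proposal, logp_new
            accepted += 1
        samples.append(x)
    return samples, accepted / n_samples


# Assumed toy target: standard normal, log density up to a constant.
chain, acceptance_rate = random_walk_metropolis(lambda x: -0.5 * x * x, 0.0, 20_000)
mean = sum(chain) / len(chain)
print(f"mean={mean:.3f} acceptance={acceptance_rate:.2f}")
```

Successive draws in such a chain are correlated, which is why the abstract compares samplers by independent samples per unit of compute time rather than by raw sample count.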
Estimating population trends with a linear model

USGS Publications Warehouse

Bart, Jonathan; Collins, Brian D.; Morrison, R.I.G.

2003-01-01

We describe a simple and robust method for estimating trends in population size. The method may be used with Breeding Bird Survey data, aerial surveys, point counts, or any other program of repeated surveys at permanent locations. Surveys need not be made at each location during each survey period. The method differs from most existing methods in being design based, rather than model based. The only assumptions are that the nominal sampling plan is followed and that sample size is large enough for use of the t-distribution. Simulations based on two bird data sets from natural populations showed that the point estimate produced by the linear model was essentially unbiased even when counts varied substantially and 25% of the complete data set was missing. The estimating-equation approach, often used to analyze Breeding Bird Survey data, performed similarly on one data set but had substantial bias on the second data set, in which counts were highly variable. The advantages of the linear model are its simplicity, flexibility, and that it is self-weighting. A user-friendly computer program to carry out the calculations is available from the senior author.
Efficient inference of population size histories and locus-specific mutation rates from large-sample genomic variation data.

PubMed

Bhaskar, Anand; Wang, Y X Rachel; Song, Yun S

2015-02-01

With the recent increase in study sample sizes in human genetics, there has been growing interest in inferring historical population demography from genomic variation data. Here, we present an efficient inference method that can scale up to very large samples, with tens or hundreds of thousands of individuals. Specifically, by utilizing analytic results on the expected frequency spectrum under the coalescent and by leveraging the technique of automatic differentiation, which allows us to compute gradients exactly, we develop a very efficient algorithm to infer piecewise-exponential models of the historical effective population size from the distribution of sample allele frequencies. Our method is orders of magnitude faster than previous demographic inference methods based on the frequency spectrum. In addition to inferring demography, our method can also accurately estimate locus-specific mutation rates. We perform extensive validation of our method on simulated data and show that it can accurately infer multiple recent epochs of rapid exponential growth, a signal that is difficult to pick up with small sample sizes. Lastly, we use our method to analyze data from recent sequencing studies, including a large-sample exome-sequencing data set of tens of thousands of individuals assayed at a few hundred genic regions. © 2015 Bhaskar et al.; Published by Cold Spring Harbor Laboratory Press.
[Use of blood lead data to evaluate and prevent childhood lead poisoning in Latin America].

PubMed

Romieu, Isabelle

2003-01-01

Exposure to lead is a widespread and serious threat to the health of children in Latin America. Health officials should monitor sources of exposure and health outcomes to design, implement, and evaluate prevention and control activities. To evaluate the magnitude of lead as a public health problem, three key elements must be defined: 1) the potential sources of exposure, 2) the indicators to evaluate health effects and environmental exposure, and 3) the sampling methods for the population at risk. Several strategies can be used to select the study population depending on the study objectives, the time limitations, and the available resources. If the objective is to evaluate the magnitude and sources of the problem, the following sampling methods can be used: 1) population-based random sampling; 2) facility-based random sampling within hospitals, daycare centers, or schools; 3) target sampling of high risk groups; 4) convenience sampling of volunteers; and 5) case reporting (which can lead to the identification of populations at risk and sources of exposures). For all sampling methods, information gathering should include the use of a questionnaire to collect general information on the participants and on potential local sources of exposure, as well as the collection of biological samples. In interpreting data, one should consider the type of sampling used and the non-response rates, as well as factors that might influence blood lead measurements, such as age and seasonal variability. Blood lead measurements should be integrated in an overall strategy to prevent lead toxicity in children. The English version of this paper is available at: http://www.insp.mx/salud/index.html.
Molecular diagnosis of strongyloidiasis in a population of an endemic area through nested-PCR.

PubMed

Sharifdini, Meysam; Keyhani, Amir; Eshraghian, Mohammad Reza; Beigom Kia, Eshrat

2018-01-01

This study aimed to diagnose and analyze strongyloidiasis in a population of an endemic area of Iran using nested-PCR, coupled with parasitological methods. Screening of strongyloidiasis-infected people using reliable diagnostic techniques is essential to decrease the mortality and morbidity associated with this infection. Molecular methods have proved to be highly sensitive and specific for detection of Strongyloides stercoralis in stool samples. A total of 155 fresh single stool samples were randomly collected from residents of the north and northwest of Khouzestan Province, Iran. All samples were examined by parasitological methods, including formalin-ether concentration and nutrient agar plate culture, and by the molecular method of nested-PCR. Infections with S. stercoralis were analyzed according to demographic criteria. Based on the results of the nested-PCR method, 15 cases (9.7%) were strongyloidiasis positive. Nested-PCR was more sensitive than parasitological techniques on single stool sampling. Old age was the demographic factor most strongly associated with S. stercoralis infection. In endemic areas of S. stercoralis, old age should be considered one of the most important risk factors of infection, especially among immunosuppressed individuals.
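The reported prevalence (15 positives among 155 samples) can be checked with a few lines of arithmetic. The abstract gives no confidence interval; the Wilson score interval below is an addition for illustration only, and the function name `wilson_interval` is hypothetical.

```python
import math


def wilson_interval(positives, n, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 for ~95%)."""
    p = positives / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half_width, centre + half_width


positives, n = 15, 155          # counts taken from the abstract
prevalence = positives / n
low, high = wilson_interval(positives, n)
print(f"prevalence={100 * prevalence:.1f}%  interval=({100 * low:.1f}%, {100 * high:.1f}%)")
```

With a sample of this size the interval is wide, which is worth keeping in mind when comparing prevalence across demographic subgroups.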
The Impact of Selection, Gene Conversion, and Biased Sampling on the Assessment of Microbial Demography.

PubMed

Lapierre, Marguerite; Blin, Camille; Lambert, Amaury; Achaz, Guillaume; Rocha, Eduardo P C

2016-07-01

Recent studies have linked demographic changes and epidemiological patterns in bacterial populations using coalescent-based approaches. We identified 26 studies using skyline plots and found that 21 inferred overall population expansion. This surprising result led us to analyze the impact of natural selection, recombination (gene conversion), and sampling biases on demographic inference using skyline plots and site frequency spectra (SFS). Forward simulations based on biologically relevant parameters from Escherichia coli populations showed that theoretical arguments on the detrimental impact of recombination and especially natural selection on the reconstructed genealogies cannot be ignored in practice. In fact, both processes systematically lead to spurious interpretations of population expansion in skyline plots (and in SFS for selection). Weak purifying selection, and especially positive selection, had important effects on skyline plots, showing patterns akin to those of population expansions. State-of-the-art techniques to remove recombination further amplified these biases. We simulated three common sampling biases in microbiological research: uniform, clustered, and mixed sampling. Alone, or together with recombination and selection, they further mislead demographic inferences producing almost any possible skyline shape or SFS. Interestingly, sampling sub-populations also affected skyline plots and SFS, because the coalescent rates of populations and their sub-populations had different distributions. This study suggests that extreme caution is needed to infer demographic changes solely based on reconstructed genealogies. We suggest that the development of novel sampling strategies and the joint analyses of diverse population genetic methods are strictly necessary to estimate demographic changes in populations where selection, recombination, and biased sampling are present. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
mRNA-Based Parallel Detection of Active Methanotroph Populations by Use of a Diagnostic Microarray

PubMed Central

Bodrossy, Levente; Stralis-Pavese, Nancy; Konrad-Köszler, Marianne; Weilharter, Alexandra; Reichenauer, Thomas G.; Schöfer, David; Sessitsch, Angela

2006-01-01

A method was developed for the mRNA-based application of microbial diagnostic microarrays to detect active microbial populations. DNA- and mRNA-based analyses of environmental samples were compared and confirmed via quantitative PCR. Results indicated that mRNA-based microarray analyses may provide additional information on the composition and functioning of microbial communities. PMID:16461725
Facebook advertisements recruit parents of children with cancer for an online survey of web-based research preferences.

PubMed

Akard, Terrah Foster; Wray, Sarah; Gilmer, Mary Jo

2015-01-01

Studies involving samples of children with life-threatening illnesses and their families face significant challenges, including inadequate sample sizes and limited diversity. Social media recruitment and Web-based research methods may help address such challenges yet have not been explored in pediatric cancer populations. This study examined the feasibility of using Facebook advertisements to recruit parent caregivers of children and teenagers with cancer. We also explored the feasibility of Web-based video recording in pediatric palliative care populations by surveying parents of children with cancer regarding (a) their preferences for research methods and (b) technological capabilities of their computers and phones. Facebook's paid advertising program was used to recruit parent caregivers of children currently living with cancer to complete an electronic survey about research preferences and technological capabilities. The advertising campaign generated 3 897 981 impressions, which resulted in 1050 clicks at a total cost of $1129.88. Of 284 screened individuals, 106 were eligible. Forty-five caregivers of children with cancer completed the entire electronic survey. Parents preferred and had technological capabilities for Web-based and electronic research methods. Participant survey responses are reported. Facebook was a useful, cost-effective method to recruit a diverse sample of parent caregivers of children with cancer. Web-based video recording and data collection may be feasible and desirable in samples of children with cancer and their families. Web-based methods (eg, Facebook, Skype) may enhance communication and access between nurses and pediatric oncology patients and their families.
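The recruitment funnel reported above lends itself to a short worked calculation. The input figures are taken from the abstract; the derived ratios (click-through rate, cost per click, cost per completed survey) are not reported by the authors and are computed here only as an illustration, with the hypothetical helper `campaign_metrics`.

```python
def campaign_metrics(impressions, clicks, cost, screened, eligible, completed):
    """Derive simple funnel ratios from raw campaign counts."""
    return {
        "click_through_rate": clicks / impressions,
        "cost_per_click": cost / clicks,
        "eligibility_rate": eligible / screened,
        "completion_rate_among_eligible": completed / eligible,
        "cost_per_completed_survey": cost / completed,
    }


# Counts and total cost (US dollars) as stated in the abstract.
metrics = campaign_metrics(
    impressions=3_897_981,
    clicks=1050,
    cost=1129.88,
    screened=284,
    eligible=106,
    completed=45,
)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

On these figures the campaign cost roughly a dollar per click and about twenty-five dollars per completed survey, which gives some context for the authors' description of the method as cost-effective.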
Guidelines for Measuring Disease Episodes: An Analysis of the Effects on the Components of Expenditure Growth.

PubMed

Dunn, Abe; Liebman, Eli; Rittmueller, Lindsey; Shapiro, Adam Hale

2017-04-01

To provide guidelines to researchers measuring health expenditures by disease and compare these methodologies' implied inflation estimates. A convenience sample of commercially insured individuals over the 2003 to 2007 period from Truven Health. Population weights are applied, based on age, sex, and region, to make the sample of over 4 million enrollees representative of the entire commercially insured population. Different methods are used to allocate medical-care expenditures to distinct condition categories. We compare the estimates of disease-price inflation by method. Across a variety of methods, the compound annual growth rate stays within the range 3.1 to 3.9 percentage points. Disease-specific inflation measures are more sensitive to the selected methodology. The selected allocation method impacts aggregate inflation rates, but considering the variety of methods applied, the differences appear small. Future research is necessary to better understand these differences in other population samples and to connect disease expenditures to measures of quality. © Health Research and Educational Trust.
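The abstract mentions applying population weights by age, sex, and region to make a convenience sample representative. One common way to do this is post-stratification, sketched below. Whether the authors used exactly this scheme is not stated; the cell labels, counts, and spending figures are invented for illustration, and the function names are hypothetical.

```python
def poststratification_weights(sample_counts, population_counts):
    """Weight per cell = population share of the cell / sample share of the cell."""
    n = sum(sample_counts.values())
    big_n = sum(population_counts.values())
    return {
        cell: (population_counts[cell] / big_n) / (sample_counts[cell] / n)
        for cell in sample_counts
    }


def weighted_mean(cell_means, sample_counts, weights):
    """Mean of an outcome after reweighting each cell's respondents."""
    numerator = sum(weights[c] * sample_counts[c] * cell_means[c] for c in cell_means)
    denominator = sum(weights[c] * sample_counts[c] for c in cell_means)
    return numerator / denominator


# Hypothetical cells: the sample over-represents cell "A".
sample_counts = {"A": 80, "B": 20}
population_counts = {"A": 500, "B": 500}
cell_means = {"A": 100.0, "B": 200.0}   # e.g. mean spending per enrollee

weights = poststratification_weights(sample_counts, population_counts)
print(weights, weighted_mean(cell_means, sample_counts, weights))
```

Reweighting corrects only for imbalance on the variables used to define the cells; differences on unmeasured characteristics remain.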
Differential expression analysis for RNAseq using Poisson mixed models

PubMed Central

Sun, Shiquan; Hood, Michelle; Scott, Laura; Peng, Qinke; Mukherjee, Sayan; Tung, Jenny

2017-01-01

Identifying differentially expressed (DE) genes from RNA sequencing (RNAseq) studies is among the most common analyses in genomics. However, RNAseq DE analysis presents several statistical and computational challenges, including over-dispersed read counts and, in some settings, sample non-independence. Previous count-based methods rely on simple hierarchical Poisson models (e.g. negative binomial) to model independent over-dispersion, but do not account for sample non-independence due to relatedness, population structure and/or hidden confounders. Here, we present a Poisson mixed model with two random effects terms that account for both independent over-dispersion and sample non-independence. We also develop a scalable sampling-based inference algorithm using a latent variable representation of the Poisson distribution. With simulations, we show that our method properly controls for type I error and is generally more powerful than other widely used approaches, except in small samples (n < 15) with other unfavorable properties (e.g. small effect sizes). We also apply our method to three real datasets that contain related individuals, population stratification or hidden confounders. Our results show that our method increases power in all three datasets compared to other approaches, though the power gain is smallest in the smallest sample (n = 6). Our method is implemented in MACAU, freely available at www.xzlab.org/software.html. PMID:28369632
Modeling abundance effects in distance sampling

USGS Publications Warehouse

Royle, J. Andrew; Dawson, D.K.; Bates, S.

2004-01-01

Distance-sampling methods are commonly used in studies of animal populations to estimate population density. A common objective of such studies is to evaluate the relationship between abundance or density and covariates that describe animal habitat or other environmental influences. However, little attention has been focused on methods of modeling abundance covariate effects in conventional distance-sampling models. In this paper we propose a distance-sampling model that accommodates covariate effects on abundance. The model is based on specification of the distance-sampling likelihood at the level of the sample unit in terms of local abundance (for each sampling unit). This model is augmented with a Poisson regression model for local abundance that is parameterized in terms of available covariates. Maximum-likelihood estimation of detection and density parameters is based on the integrated likelihood, wherein local abundance is removed from the likelihood by integration. We provide an example using avian point-transect data of Ovenbirds (Seiurus aurocapillus) collected using a distance-sampling protocol and two measures of habitat structure (understory cover and basal area of overstory trees). The model yields a sensible description (positive effect of understory cover, negative effect on basal area) of the relationship between habitat and Ovenbird density that can be used to evaluate the effects of habitat management on Ovenbird populations.
Adaptive web sampling.

PubMed

Thompson, Steven K

2006-12-01

A flexible class of adaptive sampling designs is introduced for sampling in network and spatial settings. In the designs, selections are made sequentially with a mixture distribution based on an active set that changes as the sampling progresses, using network or spatial relationships as well as sample values. The new designs have certain advantages compared with previously existing adaptive and link-tracing designs, including control over sample sizes and of the proportion of effort allocated to adaptive selections. Efficient inference involves averaging over sample paths consistent with the minimal sufficient statistic. A Markov chain resampling method makes the inference computationally feasible. The designs are evaluated in network and spatial settings using two empirical populations: a hidden human population at high risk for HIV/AIDS and an unevenly distributed bird population.
A hierarchical model for spatial capture-recapture data

USGS Publications Warehouse

Royle, J. Andrew; Young, K.V.

2008-01-01

Estimating density is a fundamental objective of many animal population studies. Application of methods for estimating population size from ostensibly closed populations is widespread, but ineffective for estimating absolute density because most populations are subject to short-term movements or so-called temporary emigration. This phenomenon invalidates the resulting estimates because the effective sample area is unknown. A number of methods involving the adjustment of estimates based on heuristic considerations are in widespread use. In this paper, a hierarchical model of spatially indexed capture recapture data is proposed for sampling based on area searches of spatial sample units subject to uniform sampling intensity. The hierarchical model contains explicit models for the distribution of individuals and their movements, in addition to an observation model that is conditional on the location of individuals during sampling. Bayesian analysis of the hierarchical model is achieved by the use of data augmentation, which allows for a straightforward implementation in the freely available software WinBUGS. We present results of a simulation study that was carried out to evaluate the operating characteristics of the Bayesian estimator under variable densities and movement patterns of individuals. An application of the model is presented for survey data on the flat-tailed horned lizard (Phrynosoma mcallii) in Arizona, USA.
Facebook Ads Recruit Parents of Children with Cancer for an Online Survey of Web-Based Research Preferences

PubMed Central

Akard, Terrah Foster; Wray, Sarah; Gilmer, Mary

2014-01-01

Background Studies involving samples of children with life-threatening illnesses and their families face significant challenges, including inadequate sample sizes and limited diversity. Social media recruitment and web-based research methods may help address such challenges yet have not been explored in pediatric cancer populations. Objective This study examined the feasibility of using Facebook ads to recruit parent caregivers of children and teens with cancer. We also explored the feasibility of web-based video recording in pediatric palliative care populations by surveying parents of children with cancer regarding (a) their preferences for research methods and (b) technological capabilities of their computers and phones. Methods Facebook's paid advertising program was used to recruit parent caregivers of children currently living with cancer to complete an electronic survey about research preferences and technological capabilities. Results The advertising campaign generated 3,897,981 impressions which resulted in 1050 clicks at a total cost of $1129.88. Of 284 screened individuals, 106 were eligible. Forty-five caregivers of children with cancer completed the entire electronic survey. Parents preferred and had technological capabilities for web-based and electronic research methods. Participant survey responses are reported. Conclusion Facebook was a useful, cost-effective method to recruit a diverse sample of parent caregivers of children with cancer. Web-based video recording and data collection may be feasible and desirable in samples of children with cancer and their families. Implications for Practice Web-based methods (e.g., Facebook, Skype) may enhance communication and access between nurses and pediatric oncology patients and their families. PMID:24945264
On sample size of the Kruskal-Wallis test with application to a mouse peritoneal cavity study.

PubMed

Fan, Chunpeng; Zhang, Donghui; Zhang, Cun-Hui

2011-03-01

As the nonparametric generalization of the one-way analysis of variance model, the Kruskal-Wallis test applies when the goal is to test the difference between multiple samples and the underlying population distributions are nonnormal or unknown. Although the Kruskal-Wallis test has been widely used for data analysis, power and sample size methods for this test have been investigated to a much lesser extent. This article proposes new power and sample size calculation methods for the Kruskal-Wallis test based on the pilot study in either a completely nonparametric model or a semiparametric location model. No assumption is made on the shape of the underlying population distributions. Simulation results show that, in terms of sample size calculation for the Kruskal-Wallis test, the proposed methods are more reliable and preferable to some more traditional methods. A mouse peritoneal cavity study is used to demonstrate the application of the methods. © 2010, The International Biometric Society.
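For readers unfamiliar with the test discussed above, the Kruskal-Wallis statistic itself is easy to compute from pooled ranks. The sketch below shows only the H statistic, using midranks for ties and omitting the tie correction factor; it does not implement the power or sample size methods the article proposes, and the function name `kruskal_wallis_h` and the toy data are assumptions.

```python
def kruskal_wallis_h(groups):
    """Kruskal-Wallis H statistic (midranks for ties, no tie correction).

    H = 12 / (N (N + 1)) * sum_i (R_i^2 / n_i) - 3 (N + 1),
    where R_i is the rank sum of group i in the pooled ordering.
    """
    pooled = sorted((value, gi) for gi, group in enumerate(groups) for value in group)
    n_total = len(pooled)
    rank_sums = [0.0] * len(groups)

    i = 0
    while i < n_total:
        # Find the run of tied values starting at position i.
        j = i
        while j < n_total and pooled[j][0] == pooled[i][0]:
            j += 1
        midrank = (i + 1 + j) / 2  # average of ranks i+1 .. j
        for k in range(i, j):
            rank_sums[pooled[k][1]] += midrank
        i = j

    spread = sum(r * r / len(g) for r, g in zip(rank_sums, groups))
    return 12.0 / (n_total * (n_total + 1)) * spread - 3 * (n_total + 1)


# Hypothetical data: three fully separated groups of three observations.
print(kruskal_wallis_h([[1, 2, 3], [4, 5, 6], [7, 8, 9]]))
```

Under the null hypothesis H is approximately chi-square distributed with one fewer degrees of freedom than the number of groups, an approximation that is rough for samples as small as this toy example.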
Error baseline rates of five sample preparation methods used to characterize RNA virus populations.

PubMed

Kugelman, Jeffrey R; Wiley, Michael R; Nagle, Elyse R; Reyes, Daniel; Pfeffer, Brad P; Kuhn, Jens H; Sanchez-Lockhart, Mariano; Palacios, Gustavo F

2017-01-01

Individual RNA viruses typically occur as populations of genomes that differ slightly from each other due to mutations introduced by the error-prone viral polymerase. Understanding the variability of RNA virus genome populations is critical for understanding virus evolution because individual mutant genomes may gain evolutionary selective advantages and give rise to dominant subpopulations, possibly even leading to the emergence of viruses resistant to medical countermeasures. Reverse transcription of virus genome populations followed by next-generation sequencing is the only available method to characterize variation for RNA viruses. However, both steps may lead to the introduction of artificial mutations, thereby skewing the data. To better understand how such errors are introduced during sample preparation, we determined and compared error baseline rates of five different sample preparation methods by analyzing in vitro transcribed Ebola virus RNA from an artificial plasmid-based system. These methods included: shotgun sequencing from plasmid DNA or in vitro transcribed RNA as a basic "no amplification" method, amplicon sequencing from the plasmid DNA or in vitro transcribed RNA as a "targeted" amplification method, sequence-independent single-primer amplification (SISPA) as a "random" amplification method, rolling circle reverse transcription sequencing (CirSeq) as an advanced "no amplification" method, and Illumina TruSeq RNA Access as a "targeted" enrichment method. The measured error frequencies indicate that RNA Access offers the best tradeoff between sensitivity and sample preparation error (1.4-5) of all compared methods.
Error baseline rates of five sample preparation methods used to characterize RNA virus populations

PubMed Central

Kugelman, Jeffrey R.; Wiley, Michael R.; Nagle, Elyse R.; Reyes, Daniel; Pfeffer, Brad P.; Kuhn, Jens H.; Sanchez-Lockhart, Mariano; Palacios, Gustavo F.

2017-01-01

Individual RNA viruses typically occur as populations of genomes that differ slightly from each other due to mutations introduced by the error-prone viral polymerase. Understanding the variability of RNA virus genome populations is critical for understanding virus evolution because individual mutant genomes may gain evolutionary selective advantages and give rise to dominant subpopulations, possibly even leading to the emergence of viruses resistant to medical countermeasures. Reverse transcription of virus genome populations followed by next-generation sequencing is the only available method to characterize variation for RNA viruses. However, both steps may lead to the introduction of artificial mutations, thereby skewing the data. To better understand how such errors are introduced during sample preparation, we determined and compared error baseline rates of five different sample preparation methods by analyzing in vitro transcribed Ebola virus RNA from an artificial plasmid-based system. These methods included: shotgun sequencing from plasmid DNA or in vitro transcribed RNA as a basic “no amplification” method, amplicon sequencing from the plasmid DNA or in vitro transcribed RNA as a “targeted” amplification method, sequence-independent single-primer amplification (SISPA) as a “random” amplification method, rolling circle reverse transcription sequencing (CirSeq) as an advanced “no amplification” method, and Illumina TruSeq RNA Access as a “targeted” enrichment method. The measured error frequencies indicate that RNA Access offers the best tradeoff between sensitivity and sample preparation error (1.4−5) of all compared methods. PMID:28182717
Adaptive sampling in research on risk-related behaviors.

PubMed

Thompson, Steven K; Collins, Linda M

2002-11-01

This article introduces adaptive sampling designs to substance use researchers. Adaptive sampling is particularly useful when the population of interest is rare, unevenly distributed, hidden, or hard to reach. Examples of such populations are injection drug users, individuals at high risk for HIV/AIDS, and young adolescents who are nicotine dependent. In conventional sampling, the sampling design is based entirely on a priori information, and is fixed before the study begins. By contrast, in adaptive sampling, the sampling design adapts based on observations made during the survey; for example, drug users may be asked to refer other drug users to the researcher. In the present article several adaptive sampling designs are discussed. Link-tracing designs such as snowball sampling, random walk methods, and network sampling are described, along with adaptive allocation and adaptive cluster sampling. It is stressed that special estimation procedures taking the sampling design into account are needed when adaptive sampling has been used. These procedures yield estimates that are considerably better than conventional estimates. For rare and clustered populations adaptive designs can give substantial gains in efficiency over conventional designs, and for hidden populations link-tracing and other adaptive procedures may provide the only practical way to obtain a sample large enough for the study objectives.
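The link-tracing idea described above, where sampled individuals refer further members of the population, can be sketched as a wave-by-wave snowball selection. This is an illustrative sketch only: the referral network, the cap on referrals per person, and the function name are assumptions, and the code covers selection, not the design-based estimation procedures the abstract says are required.

```python
import random


def snowball_sample(referrals, seeds, max_waves, max_referrals, seed=0):
    """Select people wave by wave, following referral links from initial seeds.

    referrals: dict mapping each person to a list of people they can refer
    seeds: initial sample (wave 0)
    max_waves: number of referral waves to follow after the seeds
    max_referrals: maximum number of new referrals accepted per person
    """
    rng = random.Random(seed)
    sampled = []
    seen = set()
    current_wave = []
    for person in seeds:
        if person not in seen:
            seen.add(person)
            sampled.append(person)
            current_wave.append(person)
    for _ in range(max_waves):
        next_wave = []
        for person in current_wave:
            # Deduplicate contacts and drop anyone already in the sample.
            contacts = [c for c in dict.fromkeys(referrals.get(person, [])) if c not in seen]
            rng.shuffle(contacts)
            for contact in contacts[:max_referrals]:
                seen.add(contact)
                sampled.append(contact)
                next_wave.append(contact)
        current_wave = next_wave
    return sampled


if __name__ == "__main__":
    # Hypothetical referral network.
    network = {"a": ["b", "c"], "b": ["d"], "c": ["d", "e"], "d": [], "e": ["f"]}
    print(snowball_sample(network, ["a"], max_waves=2, max_referrals=5))
```

Because inclusion depends on the network, naive means computed from such a sample are generally biased, which is why the article emphasizes estimators that account for the design.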
Stratification of American hearing aid users by age and audiometric characteristics: a method for representative sampling.

PubMed

Aronoff, Justin M; Yoon, Yang-soo; Soli, Sigfrid D

2010-06-01

Stratified sampling plans can increase the accuracy and facilitate the interpretation of a dataset characterizing a large population. However, such sampling plans have found minimal use in hearing aid (HA) research, in part because of a paucity of quantitative data on the characteristics of HA users. The goal of this study was to devise a quantitatively derived stratified sampling plan for HA research, so that such studies will be more representative and generalizable, and the results obtained using this method are more easily reinterpreted as the population changes. Pure-tone average (PTA) and age information were collected for 84,200 HAs acquired in 2006 and 2007. The distribution of PTA and age was quantified for each HA type and for a composite of all HA users. Based on their respective distributions, PTA and age were each divided into three groups, the combination of which defined the stratification plan. The most populous PTA and age group was also subdivided, allowing greater homogeneity within strata. Finally, the percentage of users in each stratum was calculated. This article provides a stratified sampling plan for HA research, based on a quantitative analysis of the distribution of PTA and age for HA users. Adopting such a sampling plan will make HA research results more representative and generalizable. In addition, data acquired using such plans can be reinterpreted as the HA population changes.
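A stratification plan of this kind (three PTA groups crossed with three age groups, then the share of users per stratum) can be sketched as follows. The cut points below are hypothetical placeholders, not the values derived in the study, and proportional allocation with largest-remainder rounding is one assumed way of turning stratum percentages into sample counts.

```python
from collections import Counter


def assign_stratum(pta_db, age_years, pta_cuts=(40, 60), age_cuts=(65, 80)):
    """Map a hearing aid user to a (PTA group, age group) stratum.

    The cut points are hypothetical; each group index is 0, 1, or 2.
    """
    pta_group = sum(pta_db >= cut for cut in pta_cuts)
    age_group = sum(age_years >= cut for cut in age_cuts)
    return (pta_group, age_group)


def proportional_allocation(stratum_counts, sample_size):
    """Allocate sample_size across strata in proportion to stratum counts."""
    total = sum(stratum_counts.values())
    exact = {s: sample_size * n / total for s, n in stratum_counts.items()}
    allocation = {s: int(x) for s, x in exact.items()}
    leftover = sample_size - sum(allocation.values())
    # Hand out the remaining units to the strata with the largest remainders.
    by_remainder = sorted(exact, key=lambda s: exact[s] - allocation[s], reverse=True)
    for s in by_remainder[:leftover]:
        allocation[s] += 1
    return allocation


if __name__ == "__main__":
    # Hypothetical (PTA in dB HL, age in years) records.
    users = [(35, 60), (45, 70), (55, 72), (65, 85), (50, 68), (42, 66)]
    counts = Counter(assign_stratum(pta, age) for pta, age in users)
    print(proportional_allocation(counts, sample_size=3))
```

Recomputing `stratum_counts` from newer population data and re-weighting is one way the results could be reinterpreted as the population changes.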
Excavating past population structures by surname-based sampling: the genetic legacy of the Vikings in northwest England

PubMed Central

Bowden, Georgina R.; Balaresque, Patricia; King, Turi E.; Hansen, Ziff; Lee, Andrew C.; Pergl-Wilson, Giles; Hurley, Emma; Roberts, Stephen J.; Waite, Patrick; Jesch, Judith; Jones, Abigail L.; Thomas, Mark G.; Harding, Stephen E.; Jobling, Mark A.

2009-01-01

The genetic structures of past human populations are obscured by recent migrations and expansions, and can be observed only indirectly by inference from modern samples. However, the unique link between a heritable cultural marker, the patrilineal surname, and a genetic marker, the Y chromosome, provides a means to target sets of modern individuals that might resemble populations at the time of surname establishment. As a test case, we studied samples from the Wirral peninsula and West Lancashire, in northwest England. Place names and archaeology show clear evidence of a past Viking presence, but heavy immigration and population growth since the Industrial Revolution are likely to have weakened the genetic signal of a thousand-year-old Scandinavian contribution. Samples ascertained on the basis of two generations of residence were compared with independent samples based on known ancestry in the region, plus the possession of a surname known from historical records to have been present there in medieval times. The Y-chromosomal haplotypes of these two sets of samples are significantly different, and in admixture analyses the surname-ascertained samples show markedly greater Scandinavian ancestry proportions, supporting the idea that northwest England was once heavily populated by Scandinavian settlers. The method of historical surname-based ascertainment promises to allow investigation of the influence of migration and drift over the last few centuries in changing the population structure of Britain, and will have general utility in other regions where surnames are patrilineal and suitable historical records survive. PMID:18032405
The 'number needed to sample' in primary care research. Comparison of two primary care sampling frames for chronic back pain.

PubMed

Smith, Blair H; Hannaford, Philip C; Elliott, Alison M; Smith, W Cairns; Chambers, W Alastair

2005-04-01

Sampling for primary care research must strike a balance between efficiency and external validity. For most conditions, even a large population sample will yield a small number of cases, yet other sampling techniques risk problems with extrapolation of findings. We compared the efficiency and external validity of two sampling methods for both an intervention study and epidemiological research in primary care--a convenience sample and a general population sample--by comparing the response and follow-up rates and the demographic and clinical characteristics of each sample, and by calculating the 'number needed to sample' (NNS) for a hypothetical randomized controlled trial. In 1996, we selected two random samples of adults from 29 general practices in Grampian, for an epidemiological study of chronic pain. One sample of 4175 was identified by an electronic questionnaire that listed patients receiving regular analgesic prescriptions--the 'repeat prescription sample'. The other sample of 5036 was identified from all patients on practice lists--the 'general population sample'. Questionnaires, including demographic, pain and general health measures, were sent to all. A similar follow-up questionnaire was sent in 2000 to all those agreeing to participate in further research. We identified a potential group of subjects for a hypothetical trial in primary care based on a recently published trial (those aged 25-64, with severe chronic back pain, willing to participate in further research). The repeat prescription sample produced better response rates than the general sample overall (86% compared with 82%, P < 0.001), from both genders and from the oldest and youngest age groups. The NNS using convenience sampling was 10 for each member of the final potential trial sample, compared with 55 using general population sampling. There were important differences between the samples in age, marital and employment status, social class and educational level. However, among the potential trial sample, there were no demographic differences. Those from the repeat prescription sample had poorer indices than the general population sample in all pain and health measures. The repeat prescription sampling method was approximately five times more efficient than the general population method. However, demographic and clinical differences in the repeat prescription sample might hamper extrapolation of findings to the general population, particularly in an epidemiological study, and demonstrate that simple comparison with age and gender of the target population is insufficient.
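The 'number needed to sample' can be read as the number of people who must be sampled to yield one potential trial participant. A minimal sketch of the arithmetic follows, using the NNS values of 10 and 55 reported above; the function names and the target trial size of 100 are hypothetical.

```python
import math


def number_needed_to_sample(n_sampled, n_potential_trial_subjects):
    """People sampled per potential trial participant identified."""
    return n_sampled / n_potential_trial_subjects


def required_sample(target_trial_size, nns):
    """People who would need to be sampled to reach a target trial size."""
    return math.ceil(target_trial_size * nns)


# NNS values as reported in the abstract.
nns_repeat_prescription = 10
nns_general_population = 55

# Ratio of the two NNS values (5.5), which the abstract summarizes as
# "approximately five times more efficient".
relative_efficiency = nns_general_population / nns_repeat_prescription

if __name__ == "__main__":
    print(relative_efficiency)
    # Hypothetical trial needing 100 participants.
    print(required_sample(100, nns_repeat_prescription))
    print(required_sample(100, nns_general_population))
```

The efficiency gain says nothing about representativeness, which is the trade-off the abstract highlights.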
Hierarchical modeling and inference in ecology: The analysis of data from populations, metapopulations and communities

USGS Publications Warehouse

Royle, J. Andrew; Dorazio, Robert M.

2008-01-01

A guide to data collection, modeling and inference strategies for biological survey data using Bayesian and classical statistical methods. This book describes a general and flexible framework for modeling and inference in ecological systems based on hierarchical models, with a strict focus on the use of probability models and parametric inference. Hierarchical models represent a paradigm shift in the application of statistics to ecological inference problems because they combine explicit models of ecological system structure or dynamics with models of how ecological systems are observed. The principles of hierarchical modeling are developed and applied to problems in population, metapopulation, community, and metacommunity systems. The book provides the first synthetic treatment of many recent methodological advances in ecological modeling and unifies disparate methods and procedures. The authors apply principles of hierarchical modeling to ecological problems, including * occurrence or occupancy models for estimating species distribution * abundance models based on many sampling protocols, including distance sampling * capture-recapture models with individual effects * spatial capture-recapture models based on camera trapping and related methods * population and metapopulation dynamic models * models of biodiversity, community structure and dynamics.
[A comparison of convenience sampling and purposive sampling].

PubMed

Suen, Lee-Jen Wu; Huang, Hui-Man; Lee, Hao-Hsien

2014-06-01

Convenience sampling and purposive sampling are two different sampling methods. This article first explains sampling terms such as target population, accessible population, simple random sampling, intended sample, actual sample, and statistical power analysis. These terms are then used to explain the difference between "convenience sampling" and "purposive sampling." Convenience sampling is a non-probabilistic sampling technique applicable to qualitative or quantitative studies, although it is most frequently used in quantitative studies. In convenience samples, subjects more readily accessible to the researcher are more likely to be included. Thus, in quantitative studies, opportunity to participate is not equal for all qualified individuals in the target population and study results are not necessarily generalizable to this population. As in all quantitative studies, increasing the sample size increases the statistical power of the convenience sample. In contrast, purposive sampling is typically used in qualitative studies. Researchers who use this technique carefully select subjects based on study purpose with the expectation that each participant will provide unique and rich information of value to the study. As a result, members of the accessible population are not interchangeable and sample size is determined by data saturation, not by statistical power analysis.
Hepameta--prevalence of hepatitis B/C and metabolic syndrome in population living in separated and segregated Roma settlements: a methodology for a cross-sectional population-based study using community-based approach.

PubMed

Gecková, Andrea Madarasová; Jarcuska, Peter; Mareková, Mária; Pella, Daniel; Siegfried, Leonard; Jarcuska, Pavol; Halánová, Monika

2014-03-01

Roma represent one of the largest and oldest minorities in Europe. Health of many of them, particularly those living in settlements, is heavily compromised by poor dwelling, low educational level, unemployment, and poverty rooted in generational poverty, segregation and discrimination. The cross-sectional population-based study using community based approach aimed to map the prevalence of viral hepatitis B/C and metabolic syndrome in the population living in separated and segregated Roma settlements and to compare it with the occurrence of the same health indicators in the majority population, considering selected risk and protective factors of these health indicators. The sample consisted of 452 Roma (mean age = 34.7; 35.2% men) and 403 non-Roma (mean age = 33.5; 45.9% men) respondents. Data were collected in 2011 via questionnaire, anthropometric measures and analysed blood and urine samples. A methodology used in the study as well as in the following scientific papers is described in the Methods section (i.e. study design, procedures, samples, methods including questionnaire, anthropometric measurements, physical measurements, blood and urine measurements). There are regions of declining prosperity due to high unemployment, long-term problems with poverty and depleted resources. Populations living in these areas, i.e. in Central and Eastern Europe in Roma settlements, are at risk of poverty, social exclusion and other factors affecting health. Therefore, we should look for successful long-term strategies and tools (e.g. Roma mediators, terrain work) in order to improve the future prospects of these minorities.
Training set optimization under population structure in genomic selection.

PubMed

Isidro, Julio; Jannink, Jean-Luc; Akdemir, Deniz; Poland, Jesse; Heslot, Nicolas; Sorrells, Mark E

2015-01-01

Population structure must be evaluated before optimization of the training set population. Maximizing the phenotypic variance captured by the training set is important for optimal performance. The optimization of the training set (TRS) in genomic selection has received much interest in both animal and plant breeding, because it is critical to the accuracy of the prediction models. In this study, five different TRS sampling algorithms, stratified sampling, mean of the coefficient of determination (CDmean), mean of predictor error variance (PEVmean), stratified CDmean (StratCDmean) and random sampling, were evaluated for prediction accuracy in the presence of different levels of population structure. In the presence of population structure, the most phenotypic variation captured by a sampling method in the TRS is desirable. The wheat dataset showed mild population structure, and CDmean and stratified CDmean methods showed the highest accuracies for all the traits except for test weight and heading date. The rice dataset had strong population structure and the approach based on stratified sampling showed the highest accuracies for all traits. In general, CDmean minimized the relationship between genotypes in the TRS, maximizing the relationship between TRS and the test set. This makes it suitable as an optimization criterion for long-term selection. Our results indicated that the best selection criterion used to optimize the TRS seems to depend on the interaction of trait architecture and population structure.
Are we using the appropriate reference samples to develop juvenile age estimation methods based on bone size? An exploration of growth differences between average children and those who become victims of homicide.

PubMed

Spake, Laure; Cardoso, Hugo F V

2018-01-01

The population on which forensic juvenile skeletal age estimation methods are applied has not been critically considered. Previous research suggests that child victims of homicide tend to be from socioeconomically disadvantaged contexts, and that these contexts impair linear growth. This study investigates whether juvenile skeletal remains examined by forensic anthropologists are short for age compared to their normal healthy peers. Cadaver lengths were obtained from records of autopsies of 1256 individuals, aged birth to eighteen years at death, conducted between 2000 and 2015 in Australia, New Zealand, and the U.S. Growth status of the forensic population, represented by homicide victims, and general population, represented by accident victims, were compared using height for age Z-scores and independent sample t-tests. Cadaver lengths of the accident victims were compared to growth references using one sample t-tests to evaluate whether accident victims reflect the general population. Homicide victims are shorter for age than accident victims in samples from the U.S., but not in Australia and New Zealand. Accident victims are more representative of the general population in Australia and New Zealand. Different results in Australia and New Zealand as opposed to the U.S. may be linked to socioeconomic inequality. These results suggest that physical anthropologists should critically select reference samples when devising forensic juvenile skeletal age estimation methods. Children examined in forensic investigations may be short for age, and thus methods developed on normal healthy children may yield inaccurate results. A healthy reference population may not necessarily constitute an appropriate growth comparison for the forensic anthropology population. Copyright © 2017 Elsevier B.V. All rights reserved.

Triceps and Subscapular Skinfold Thickness Percentiles and Cut-Offs for Overweight and Obesity in a Population-Based Sample of Schoolchildren and Adolescents in Bogota, Colombia.

PubMed

Ramírez-Vélez, Robinson; López-Cifuentes, Mario Ferney; Correa-Bautista, Jorge Enrique; González-Ruíz, Katherine; González-Jiménez, Emilio; Córdoba-Rodríguez, Diana Paola; Vivas, Andrés; Triana-Reina, Hector Reynaldo; Schmidt-RioValle, Jacqueline

2016-09-24

The assessment of skinfold thickness is an objective measure of adiposity. The aims of this study were to establish Colombian smoothed centile charts and LMS tables (L: Box-Cox transformation; M: median; S: coefficient of variation) for triceps, subscapular, and triceps + subscapular skinfolds, and to select appropriate cut-offs using receiver operating characteristic (ROC) analysis, based on a population-based sample of children and adolescents in Bogotá, Colombia. A cross-sectional study was conducted in 9618 children and adolescents (55.7% girls; age range of 9-17.9 years). Triceps and subscapular skinfold measurements were obtained using standardized methods. We calculated the triceps + subscapular skinfold (T + SS) sum. Smoothed percentile curves for triceps and subscapular skinfold thickness were derived using the LMS method. ROC curve analyses were used to evaluate the optimal cut-off point of skinfold thickness for overweight and obesity, based on the International Obesity Task Force definitions. Subscapular and triceps skinfolds and T + SS were significantly higher in girls than in boys (p < 0.001). The ROC analysis showed that subscapular and triceps skinfolds and T + SS have a high discriminatory power in the identification of overweight and obesity in the sample population in this study. Our results provide sex- and age-specific normative reference standards for skinfold thickness values from a population from Bogotá, Colombia.
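For readers unfamiliar with LMS tables, the standard LMS relationship converts a measurement into a z-score given the L, M, and S values for the child's age and sex, and converts a z-score back into a centile value. The sketch below implements that general formula; the L, M, and S numbers in the usage example are hypothetical and are not taken from the Colombian tables.

```python
import math


def lms_z_score(y, L, M, S):
    """Z-score of measurement y given LMS parameters for one age/sex group."""
    if y <= 0 or M <= 0 or S <= 0:
        raise ValueError("y, M, and S must be positive")
    if abs(L) < 1e-12:
        # Limiting case of the Box-Cox transformation as L approaches 0.
        return math.log(y / M) / S
    return ((y / M) ** L - 1.0) / (L * S)


def lms_centile_value(z, L, M, S):
    """Measurement corresponding to z-score z (inverse of lms_z_score)."""
    if abs(L) < 1e-12:
        return M * math.exp(S * z)
    return M * (1.0 + L * S * z) ** (1.0 / L)


if __name__ == "__main__":
    # Hypothetical LMS values for a single age/sex group (skinfold in mm).
    L, M, S = -0.5, 12.0, 0.3
    print(lms_z_score(18.0, L, M, S))
    print(lms_centile_value(1.645, L, M, S))  # roughly the 95th centile
```

A measurement equal to the median M always maps to a z-score of zero, which is a quick sanity check when applying published tables.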
Analysis of four recruitment methods for obtaining normative data through a Web-based questionnaire: a pilot study.

PubMed

Nolte, Michael T; Shauver, Melissa J; Chung, Kevin C

2015-09-01

Quality normative data requires a diverse sample of participants and plays an important role in the appropriate use of health outcomes. Using social media and other online resources for survey recruitment is a tempting prospect, but the effectiveness of these methods in collecting a diverse sample is unknown. The purpose of this study is to pilot test four methods of recruitment to determine their ability to produce a sample representative of the general US population. This project is part of a larger study to gather normative data for the Michigan Hand Outcomes Questionnaire (MHQ). We used flyers, e-mail, Facebook, and an institution-specific clinical research recruitment Web site to direct participants to complete an online version of the MHQ. Participants also provided comorbidity and demographic information. The institution-specific recruitment Web site yielded the greatest number of respondents in an age distribution that mirrored the US population. Facebook was effective for recruiting young adults, and e-mail was successful for recruiting the older adults. None of the methods was successful in reaching an ethnically diverse sample. Obtaining normative data that is truly representative of the US population is a difficult task. The use of any one recruitment method is unlikely to result in a representative sample, but a greater understanding of these methods will empower researchers to use them to target specific populations. This pilot analysis provides support for the use of Facebook and clinical research sites in addition to traditional methods of e-mail and paper flyers.
Differential expression analysis for RNAseq using Poisson mixed models.

PubMed

Sun, Shiquan; Hood, Michelle; Scott, Laura; Peng, Qinke; Mukherjee, Sayan; Tung, Jenny; Zhou, Xiang

2017-06-20

Identifying differentially expressed (DE) genes from RNA sequencing (RNAseq) studies is among the most common analyses in genomics. However, RNAseq DE analysis presents several statistical and computational challenges, including over-dispersed read counts and, in some settings, sample non-independence. Previous count-based methods rely on simple hierarchical Poisson models (e.g. negative binomial) to model independent over-dispersion, but do not account for sample non-independence due to relatedness, population structure and/or hidden confounders. Here, we present a Poisson mixed model with two random effects terms that account for both independent over-dispersion and sample non-independence. We also develop a scalable sampling-based inference algorithm using a latent variable representation of the Poisson distribution. With simulations, we show that our method properly controls for type I error and is generally more powerful than other widely used approaches, except in small samples (n < 15) with other unfavorable properties (e.g. small effect sizes). We also apply our method to three real datasets that contain related individuals, population stratification or hidden confounders. Our results show that our method increases power in all three datasets compared to other approaches, though the power gain is smallest in the smallest sample (n = 6). Our method is implemented in MACAU, freely available at www.xzlab.org/software.html. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
A high-throughput robotic sample preparation system and HPLC-MS/MS for measuring urinary anatabine, anabasine, nicotine and major nicotine metabolites.

PubMed

Wei, Binnian; Feng, June; Rehmani, Imran J; Miller, Sharyn; McGuffey, James E; Blount, Benjamin C; Wang, Lanqing

2014-09-25

Most sample preparation methods characteristically involve intensive and repetitive labor, which is inefficient when preparing large numbers of samples from population-scale studies. This study presents a robotic system designed to meet the sampling requirements for large population-scale studies. Using this robotic system, we developed and validated a method to simultaneously measure urinary anatabine, anabasine, nicotine and seven major nicotine metabolites: 4-Hydroxy-4-(3-pyridyl)butanoic acid, cotinine-N-oxide, nicotine-N-oxide, trans-3'-hydroxycotinine, norcotinine, cotinine and nornicotine. We analyzed robotically prepared samples using high-performance liquid chromatography (HPLC) coupled with triple quadrupole mass spectrometry in positive electrospray ionization mode using scheduled multiple reaction monitoring (sMRM) with a total runtime of 8.5 min. The optimized procedure was able to deliver linear analyte responses over a broad range of concentrations. Responses of urine-based calibrators delivered coefficients of determination (R²) of >0.995. Sample preparation recovery was generally higher than 80%. The robotic system was able to prepare four 96-well plates (384 urine samples) per day, and the overall method afforded an accuracy range of 92-115%, and an imprecision of <15.0% on average. The validation results demonstrate that the method is accurate, precise, sensitive, robust, and most significantly labor-saving for sample preparation, making it efficient and practical for routine measurements in large population-scale studies such as the National Health and Nutrition Examination Survey (NHANES) and the Population Assessment of Tobacco and Health (PATH) study. Published by Elsevier B.V.
DNA-based methods of geochemical prospecting

DOEpatents

Ashby, Matthew [Mill Valley, CA]

2011-12-06

The present invention relates to methods for performing surveys of the genetic diversity of a population. The invention also relates to methods for performing genetic analyses of a population. The invention further relates to methods for the creation of databases comprising the survey information and the databases created by these methods. The invention also relates to methods for analyzing the information to correlate the presence of nucleic acid markers with desired parameters in a sample. These methods have application in the fields of geochemical exploration, agriculture, bioremediation, environmental analysis, clinical microbiology, forensic science and medicine.
Evaluation of terrestrial and streamside salamander monitoring techniques at Shenandoah National Park

USGS Publications Warehouse

Jung, R.E.; Droege, S.; Sauer, J.R.; Landy, R.B.

2000-01-01

In response to concerns about amphibian declines, a study evaluating and validating amphibian monitoring techniques was initiated in Shenandoah and Big Bend National Parks in the spring of 1998. We evaluate precision, bias, and efficiency of several sampling methods for terrestrial and streamside salamanders in Shenandoah National Park and assess salamander abundance in relation to environmental variables, notably soil and water pH. Terrestrial salamanders, primarily redback salamanders (Plethodon cinereus), were sampled by searching under cover objects during the day in square plots (10 to 35 m²). We compared population indices (mean daily and total counts) with adjusted population estimates from capture-recapture. Analyses suggested that the proportion of salamanders detected (p) during sampling varied among plots, necessitating the use of adjusted population estimates. However, adjusted population estimates were less precise than population indices, and may not be efficient in relating salamander populations to environmental variables. In future sampling, strategic use of capture-recapture to verify consistency of p's among sites may be a reasonable compromise between the possibility of bias in estimation of population size and deficiencies due to inefficiency associated with the estimation of p. The streamside two-lined salamander (Eurycea bislineata) was surveyed using four methods: leaf litter refugia bags, 1 m² quadrats, 50 x 1 m visual encounter transects, and electric shocking. Comparison of survey methods at nine streams revealed congruent patterns of abundance among sites, suggesting that relative bias among the methods is similar, and that choice of survey method should be based on precision and logistical efficiency. Redback and two-lined salamander abundance were not significantly related to soil or water pH, respectively.
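The contrast between a raw count index and a detection-adjusted estimate can be illustrated with a simple two-occasion capture-recapture calculation. The sketch below uses the Chapman form of the Lincoln-Petersen estimator as an assumed stand-in; the abstract does not state which capture-recapture estimator the study used, and the counts in the example are hypothetical.

```python
def chapman_estimate(n_marked_first, n_caught_second, n_recaptured):
    """Chapman's bias-corrected Lincoln-Petersen abundance estimate.

    n_marked_first: animals caught and marked on the first occasion
    n_caught_second: animals caught on the second occasion
    n_recaptured: marked animals among those caught on the second occasion
    """
    return (n_marked_first + 1) * (n_caught_second + 1) / (n_recaptured + 1) - 1


def detection_adjusted_count(count, p_detect):
    """Scale a raw count by an estimated detection probability p."""
    if not 0 < p_detect <= 1:
        raise ValueError("p_detect must be in (0, 1]")
    return count / p_detect


if __name__ == "__main__":
    # Hypothetical plot: 20 marked, 15 caught later, 5 of them already marked.
    print(chapman_estimate(20, 15, 5))
    # Hypothetical index count of 12 with a detection probability of 0.4.
    print(detection_adjusted_count(12, 0.4))
```

If p differs among plots, equal raw counts can correspond to different abundances, which is why the abstract recommends checking the consistency of p across sites.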
Residential scene classification for gridded population sampling in developing countries using deep convolutional neural networks on satellite imagery.

PubMed

Chew, Robert F; Amer, Safaa; Jones, Kasey; Unangst, Jennifer; Cajka, James; Allpress, Justine; Bruhn, Mark

2018-05-09

Conducting surveys in low- and middle-income countries is often challenging because many areas lack a complete sampling frame, have outdated census information, or have limited data available for designing and selecting a representative sample. Geosampling is a probability-based, gridded population sampling method that addresses some of these issues by using geographic information system (GIS) tools to create logistically manageable area units for sampling. GIS grid cells are overlaid to partition a country's existing administrative boundaries into area units that vary in size from 50 m × 50 m to 150 m × 150 m. To avoid sending interviewers to unoccupied areas, researchers manually classify grid cells as "residential" or "nonresidential" through visual inspection of aerial images. "Nonresidential" units are then excluded from sampling and data collection. This process of manually classifying sampling units has drawbacks since it is labor intensive, prone to human error, and creates the need for simplifying assumptions during calculation of design-based sampling weights. In this paper, we discuss the development of a deep learning classification model to predict whether aerial images are residential or nonresidential, thus reducing manual labor and eliminating the need for simplifying assumptions. On our test sets, the model performs comparable to a human-level baseline in both Nigeria (94.5% accuracy) and Guatemala (96.4% accuracy), and outperforms baseline machine learning models trained on crowdsourced or remote-sensed geospatial features. Additionally, our findings suggest that this approach can work well in new areas with relatively modest amounts of training data. Gridded population sampling methods like geosampling are becoming increasingly popular in countries with outdated or inaccurate census data because of their timeliness, flexibility, and cost. 
Using deep learning models directly on satellite images, we provide a novel method for sample frame construction that identifies residential gridded aerial units. In cases where manual classification of satellite images is used to (1) correct for errors in gridded population data sets or (2) classify grids where population estimates are unavailable, this methodology can help reduce annotation burden with comparable quality to human analysts.
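The exclusion step described in this abstract can be sketched in a few lines of Python. This is a minimal illustration only, not the authors' geosampling procedure: it assumes a single-stage, equal-probability draw from the cells labeled "residential", and the function name `sample_residential_cells`, the cell identifiers, and the labels are all hypothetical.

```python
import random


def sample_residential_cells(cells, n, seed=0):
    """Draw n residential grid cells with equal probability.

    cells maps a (hypothetical) cell id to "residential" or "nonresidential".
    Returns a dict of selected cell id -> design weight, where the weight is
    the inverse of the selection probability n / N_residential.
    """
    # Nonresidential cells are excluded from the frame before sampling.
    residential = [cid for cid, label in cells.items() if label == "residential"]
    if n <= 0 or n > len(residential):
        raise ValueError("n must be between 1 and the number of residential cells")
    rng = random.Random(seed)
    chosen = rng.sample(residential, n)  # simple random sample without replacement
    weight = len(residential) / n
    return {cid: weight for cid in chosen}


# Toy frame (assumed data): 30 cells, every third one labeled nonresidential.
cells = {
    f"cell_{i}": ("residential" if i % 3 else "nonresidential") for i in range(30)
}
weights = sample_residential_cells(cells, n=5)
```

Under this simplified design the weights sum to the number of residential cells in the frame, which is a quick sanity check on the weight calculation.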
Age-structured mark-recapture analysis: A virtual-population-analysis-based model for analyzing age-structured capture-recapture data

USGS Publications Warehouse

Coggins, L.G.; Pine, William E.; Walters, C.J.; Martell, S.J.D.

2006-01-01

We present a new model to estimate capture probabilities, survival, abundance, and recruitment using traditional Jolly-Seber capture-recapture methods within a standard fisheries virtual population analysis framework. This approach compares the numbers of marked and unmarked fish at age captured in each year of sampling with predictions based on estimated vulnerabilities and abundance in a likelihood function. Recruitment to the earliest age at which fish can be tagged is estimated by using a virtual population analysis method to back-calculate the expected numbers of unmarked fish at risk of capture. By using information from both marked and unmarked animals in a standard fisheries age structure framework, this approach is well suited to the sparse data situations common in long-term capture-recapture programs with variable sampling effort. © Copyright by the American Fisheries Society 2006.
Exploitation of immunofluorescence for the quantification and characterization of small numbers of Pasteuria endospores.

PubMed

Costa, Sofia R; Kerry, Brian R; Bardgett, Richard D; Davies, Keith G

2006-12-01

The Pasteuria group of endospore-forming bacteria has been studied as a biocontrol agent of plant-parasitic nematodes. Techniques have been developed for its detection and quantification in soil samples, and these mainly focus on observations of endospore attachment to nematodes. Characterization of Pasteuria populations has recently been performed with DNA-based techniques, which usually require the extraction of large numbers of spores. We describe a simple immunological method for the quantification and characterization of Pasteuria populations. Bayesian statistics were used to determine an extraction efficiency of 43% and a threshold of detection of 210 endospores g⁻¹ sand. This provided a robust means of estimating numbers of endospores in small-volume samples from a natural system. Based on visual assessment of endospore fluorescence, a quantitative method was developed to characterize endospore populations, which were shown to vary according to their host.
Psychometric Validation of the Parental Bonding Instrument in a U.K. Population-Based Sample: Role of Gender and Association With Mental Health in Mid-Late Life.

PubMed

Xu, Man K; Morin, Alexandre J S; Marsh, Herbert W; Richards, Marcus; Jones, Peter B

2016-08-01

The factorial structure of the Parental Bonding Instrument (PBI) has been frequently studied in diverse samples but no study has examined its psychometric properties from large, population-based samples. In particular, important questions have not been addressed such as the measurement invariance properties across parental and offspring gender. We evaluated the PBI based on responses from a large, representative population-based sample, using an exploratory structural equation modeling method appropriate for categorical data. Analysis revealed a three-factor structure representing "care," "overprotection," and "autonomy" parenting styles. In terms of psychometric measurement validity, our results supported the complete invariance of the PBI ratings across sons and daughters for their mothers and fathers. The PBI ratings were also robust in relation to personality and mental health status. In terms of predictive value, paternal care showed a protective effect on mental health at age 43 in sons. The PBI is a sound instrument for capturing perceived parenting styles, and is predictive of mental health in middle adulthood. © The Author(s) 2016.
Advancing Methods for U.S. Transgender Health Research

PubMed Central

Reisner, Sari L.; Deutsch, Madeline B.; Bhasin, Shalender; Bockting, Walter; Brown, George R.; Feldman, Jamie; Garofalo, Rob; Kreukels, Baudewijntje; Radix, Asa; Safer, Joshua D.; Tangpricha, Vin; T’Sjoen, Guy; Goodman, Michael

2016-01-01

Purpose of Review To describe methodological challenges, gaps, and opportunities in U.S. transgender health research. Recent Findings Lack of large prospective observational studies and intervention trials, limited data on risks and benefits of gender affirmation (e.g., hormones and surgical interventions), and inconsistent use of definitions across studies hinder evidence-based care for transgender people. Systematic high-quality observational and intervention-testing studies may be carried out using several approaches, including general population-based, health systems-based, clinic-based, venue-based, and hybrid designs. Each of these approaches has its strength and limitations; however, harmonization of research efforts is needed. Ongoing development of evidence-based clinical recommendations will benefit from a series of observational and intervention studies aimed at identification, recruitment, and follow-up of transgender people of different ages, from different racial, ethnic, and socioeconomic backgrounds and with diverse gender identities. Summary Transgender health research faces challenges that include standardization of lexicon, agreed-upon population definitions, study design, sampling, measurement, outcome ascertainment, and sample size. Application of existing and new methods is needed to fill existing gaps, increase the scientific rigor and reach of transgender health research, and inform evidence-based prevention and care for this underserved population. PMID:26845331
Reliable Quantification of the Potential for Equations Based on Spot Urine Samples to Estimate Population Salt Intake: Protocol for a Systematic Review and Meta-Analysis.

PubMed

Huang, Liping; Crino, Michelle; Wu, Jason Hy; Woodward, Mark; Land, Mary-Anne; McLean, Rachael; Webster, Jacqui; Enkhtungalag, Batsaikhan; Nowson, Caryl A; Elliott, Paul; Cogswell, Mary; Toft, Ulla; Mill, Jose G; Furlanetto, Tania W; Ilich, Jasminka Z; Hong, Yet Hoi; Cohall, Damian; Luzardo, Leonella; Noboa, Oscar; Holm, Ellen; Gerbes, Alexander L; Senousy, Bahaa; Pinar Kara, Sonat; Brewster, Lizzy M; Ueshima, Hirotsugu; Subramanian, Srinivas; Teo, Boon Wee; Allen, Norrina; Choudhury, Sohel Reza; Polonia, Jorge; Yasuda, Yoshinari; Campbell, Norm Rc; Neal, Bruce; Petersen, Kristina S

2016-09-21

Methods based on spot urine samples (a single sample at one time-point) have been identified as a possible alternative approach to 24-hour urine samples for determining mean population salt intake. The aim of this study is to identify a reliable method for estimating mean population salt intake from spot urine samples. This will be done by comparing the performance of existing equations against one another and against estimates derived from 24-hour urine samples. The effects of factors such as ethnicity, sex, age, body mass index, antihypertensive drug use, health status, and timing of spot urine collection will be explored. The capacity of spot urine samples to measure change in salt intake over time will also be determined. Finally, we aim to develop a novel equation (or equations) that performs better than existing equations to estimate mean population salt intake. A systematic review and meta-analysis of individual participant data will be conducted. A search has been conducted to identify human studies that report salt (or sodium) excretion based upon 24-hour urine samples and spot urine samples. There were no restrictions on language, study sample size, or characteristics of the study population. MEDLINE via OvidSP (1946-present), Premedline via OvidSP, EMBASE, Global Health via OvidSP (1910-present), and the Cochrane Library were searched, and two reviewers identified eligible studies. The authors of these studies will be invited to contribute data according to a standard format. 
Individual participant records will be compiled and a series of analyses will be completed to: (1) compare existing equations for estimating 24-hour salt intake from spot urine samples with 24-hour urine samples, and assess the degree of bias according to key demographic and clinical characteristics; (2) assess the reliability of using spot urine samples to measure population changes in salt intake over time; and (3) develop a novel equation that performs better than existing equations to estimate mean population salt intake. The search strategy identified 538 records; 100 records were obtained for review in full text and 73 have been confirmed as eligible. In addition, 68 abstracts were identified, some of which may contain data eligible for inclusion. Individual participant data will be requested from the authors of eligible studies. Many equations for estimating salt intake from spot urine samples have been developed and validated, although most have been studied in very specific settings. This meta-analysis of individual participant data will enable a much broader understanding of the capacity for spot urine samples to estimate population salt intake.
Connecting micro dynamics and population distributions in system dynamics models

PubMed Central

Rahmandad, Hazhir; Chen, Hsin-Jen; Xue, Hong; Wang, Youfa

2014-01-01

Researchers use system dynamics models to capture the mean behavior of groups of indistinguishable population elements (e.g., people) aggregated in stock variables. Yet, many modeling problems require capturing the heterogeneity across elements with respect to some attribute(s) (e.g., body weight). This paper presents a new method to connect the micro-level dynamics associated with elements in a population with the macro-level population distribution along an attribute of interest without the need to explicitly model every element. We apply the proposed method to model the distribution of Body Mass Index and its changes over time in a sample population of American women obtained from the U.S. National Health and Nutrition Examination Survey. Comparing the results with those obtained from an individual-based model that captures the same phenomena shows that our proposed method delivers accurate results with less computation than the individual-based model. PMID:25620842
HIV Research with Men who Have Sex with Men (MSM): Advantages and Challenges of Different Methods for Most Appropriately Targeting a Key Population.

PubMed

Gama, Ana; Martins, Maria O; Dias, Sónia

2017-01-01

The difficulty in accessing hard-to-reach populations such as men who have sex with men presents a dilemma for HIV surveillance, as their omission from surveillance systems leaves significant gaps in our understanding of HIV/AIDS epidemics. Several methods for recruiting difficult-to-access populations and collecting data on trends of HIV prevalence and behavioural factors for surveillance and research purposes have emerged. This paper aims to critically review different sampling approaches, from chain-referral and venue-based to respondent-driven, time-location and internet sampling methods, focusing on their main advantages and challenges for conducting HIV research among key populations, such as men who have sex with men. The benefits of using these approaches to recruit participants must be weighed against privacy concerns inherent in any social situation or health condition. Nevertheless, the methods discussed in this paper represent some of the best efforts to effectively reach most-at-risk subgroups of men who have sex with men, helping to obtain unbiased trends of HIV prevalence and HIV-related risk behaviours among this population group.
Methodological Challenges in Collecting Social and Behavioural Data Regarding the HIV Epidemic among Gay and Other Men Who Have Sex with Men in Australia

PubMed Central

Holt, Martin; de Wit, John; Brown, Graham; Maycock, Bruce; Fairley, Christopher; Prestage, Garrett

2014-01-01

Background Behavioural surveillance and research among gay and other men who have sex with men (GMSM) commonly relies on non-random recruitment approaches. Methodological challenges limit their ability to accurately represent the population of adult GMSM. We compared the social and behavioural profiles of GMSM recruited via venue-based, online, and respondent-driven sampling (RDS) and discussed their utility for behavioural surveillance. Methods Data from four studies were selected to reflect each recruitment method. We compared demographic characteristics and the prevalence of key indicators including sexual and HIV testing practices obtained from samples recruited through different methods, and population estimates from respondent-driven sampling partition analysis. Results Overall, the socio-demographic profile of GMSM was similar across samples, with some differences observed in age and sexual identification. Men recruited through time-location sampling appeared more connected to the gay community, reported a greater number of sexual partners, but engaged in less unprotected anal intercourse with regular (UAIR) or casual partners (UAIC). The RDS sample overestimated the proportion of HIV-positive men and appeared to recruit men with an overall higher number of sexual partners. A single-website survey recruited a sample with characteristics which differed considerably from the population estimates with regards to age, ethnic diversity and behaviour. Data acquired through time-location sampling underestimated the rates of UAIR and UAIC, while RDS and online sampling both generated samples that underestimated UAIR. Simulated composite samples combining recruits from time-location and multi-website online sampling may produce characteristics more consistent with the population estimates, particularly with regards to sexual practices. 
Conclusion Respondent-driven sampling produced the sample that was most consistent with population estimates, but this methodology is complex and logistically demanding. Time-location and online recruitment are more cost-effective and easier to implement; using these approaches in combination may offer the potential to recruit a more representative sample of GMSM. PMID:25409440
Estimation of stream salamander (Plethodontidae, Desmognathinae and Plethodontinae) populations in Shenandoah National Park, Virginia, USA

USGS Publications Warehouse

Jung, R.E.; Royle, J. Andrew; Sauer, J.R.; Addison, C.; Rau, R.D.; Shirk, J.L.; Whissel, J.C.

2005-01-01

Stream salamanders in the family Plethodontidae constitute a large biomass in and near headwater streams in the eastern United States and are promising indicators of stream ecosystem health. Many studies of stream salamanders have relied on population indices based on counts rather than population estimates based on techniques such as capture-recapture and removal. Application of estimation procedures allows the calculation of detection probabilities (the proportion of total animals present that are detected during a survey) and their associated sampling error, and may be essential for determining salamander population sizes and trends. In 1999, we conducted capture-recapture and removal population estimation methods for Desmognathus salamanders at six streams in Shenandoah National Park, Virginia, USA. Removal sampling appeared more efficient and detection probabilities from removal data were higher than those from capture-recapture. During 2001-2004, we used removal estimation at eight streams in the park to assess the usefulness of this technique for long-term monitoring of stream salamanders. Removal detection probabilities ranged from 0.39 to 0.96 for Desmognathus, 0.27 to 0.89 for Eurycea and 0.27 to 0.75 for northern spring (Gyrinophilus porphyriticus) and northern red (Pseudotriton ruber) salamanders across stream transects. Detection probabilities did not differ across years for Desmognathus and Eurycea, but did differ among streams for Desmognathus. Population estimates of Desmognathus decreased between 2001-2002 and 2003-2004 which may be related to changes in stream flow conditions. Removal-based procedures may be a feasible approach for population estimation of salamanders, but field methods should be designed to meet the assumptions of the sampling procedures. New approaches to estimating stream salamander populations are discussed.
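For readers unfamiliar with removal estimation, the relationship between removal counts, detection probability, and population size can be illustrated with the classical two-pass removal estimator. This is only a sketch under stated assumptions: the abstract does not say how many removal passes were made or which estimator the authors fitted, the function name `two_pass_removal` is hypothetical, and the counts in the example are made up. The estimator assumes a closed population and equal capture probability on both passes.

```python
def two_pass_removal(c1: int, c2: int) -> tuple[float, float]:
    """Two-pass removal estimates of capture probability and population size.

    c1 and c2 are the numbers of animals removed on the first and second pass.
    With a constant capture probability p, E[c1] = N * p and
    E[c2] = N * (1 - p) * p, which gives
        p_hat = 1 - c2 / c1
        N_hat = c1**2 / (c1 - c2)
    The estimator is undefined unless the catch declines (c1 > c2).
    """
    if c1 <= 0 or c2 < 0:
        raise ValueError("c1 must be positive and c2 non-negative")
    if c2 >= c1:
        raise ValueError("estimator requires a declining catch (c1 > c2)")
    p_hat = 1.0 - c2 / c1
    n_hat = c1 ** 2 / (c1 - c2)
    return p_hat, n_hat


# Made-up counts: 60 animals removed on pass one, 24 on pass two.
p_hat, n_hat = two_pass_removal(60, 24)
```

With these illustrative counts the detection probability estimate is 0.6 and the population estimate is 100, so roughly 84% of the animals present were removed over the two passes.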
A log-linear model approach to estimation of population size using the line-transect sampling method

USGS Publications Warehouse

Anderson, D.R.; Burnham, K.P.; Crain, B.R.

1978-01-01

The technique of estimating wildlife population size and density using the belt or line-transect sampling method has been used in many past projects, such as the estimation of density of waterfowl nestling sites in marshes, and is being used currently in such areas as the assessment of Pacific porpoise stocks in regions of tuna fishing activity. A mathematical framework for line-transect methodology has only emerged in the last 5 yr. In the present article, we extend this mathematical framework to a line-transect estimator based upon a log-linear model approach.
Respondent-driven sampling for an adolescent health study in vulnerable urban settings: a multi-country study.

PubMed

Decker, Michele R; Marshall, Beth Dail; Emerson, Mark; Kalamar, Amanda; Covarrubias, Laura; Astone, Nan; Wang, Ziliang; Gao, Ersheng; Mashimbye, Lawrence; Delany-Moretlwe, Sinead; Acharya, Rajib; Olumide, Adesola; Ojengbede, Oladosu; Blum, Robert W; Sonenstein, Freya L

2014-12-01

The global adolescent population is larger than ever before and is rapidly urbanizing. Global surveillance systems to monitor youth health typically use household- and school-based recruitment methods. These systems risk not reaching the most marginalized youth made vulnerable by conditions of migration, civil conflict, and other forms of individual and structural vulnerability. We describe the methodology of the Well-Being of Adolescents in Vulnerable Environments survey, which used respondent-driven sampling (RDS) to recruit male and female youth aged 15-19 years and living in economically distressed urban settings in Baltimore, MD; Johannesburg, South Africa; Ibadan, Nigeria; New Delhi, India; and Shanghai, China (migrant youth only) for a cross-sectional study. We describe a shared recruitment and survey administration protocol across the five sites, present recruitment parameters, and illustrate challenges and necessary adaptations for use of RDS with youth in disadvantaged urban settings. We describe the reach of RDS into populations of youth who may be missed by traditional household- and school-based sampling. Across all sites, an estimated 9.6% were unstably housed; among those enrolled in school, absenteeism was pervasive with 29% having missed over 6 days of school in the past month. Overall findings confirm the feasibility, efficiency, and utility of RDS in quickly reaching diverse samples of youth, including those both in and out of school and those unstably housed, and provide direction for optimizing RDS methods with this population. In our rapidly urbanizing global landscape with an unprecedented youth population, RDS may serve as a valuable tool in complementing existing household- and school-based methods for health-related surveillance that can guide policy. Copyright © 2014 Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.
Respondent-driven sampling for an adolescent health study in vulnerable urban settings: a multi-country study

PubMed Central

Decker, Michele R.; Marshall, Beth; Emerson, Mark; Kalamar, Amanda; Covarrubias, Laura; Astone, Nan; Wang, Ziliang; Gao, Ersheng; Mashimbye, Lawrence; Delany-Moretlwe, Sinead; Acharya, Rajib; Olumide, Adesola; Ojengbede, Oladosu; Blum, Robert

2015-01-01

The global adolescent population is larger than ever before and is rapidly urbanizing. Global surveillance systems to monitor youth health typically use household- and school-based recruitment methods. These systems risk not reaching the most marginalized youth made vulnerable by conditions of migration, civil conflict and other forms of individual and structural vulnerability. We describe the methodology of the Well Being of Adolescents in Vulnerable Environments (WAVE) survey, which used respondent-driven sampling (RDS) to recruit male and female youth aged 15 to 19 years and living in economically distressed urban settings in Baltimore, USA, Johannesburg, South Africa, Ibadan, Nigeria, Delhi, India and Shanghai, China (migrant youth only) for a cross-sectional study. We describe a shared recruitment and survey administration protocol across the five sites, present recruitment parameters, and illustrate challenges and necessary adaptations for use of RDS with youth in disadvantaged urban settings. We describe the reach of RDS into populations of youth who may be missed by traditional household-based and school-based sampling. Across all sites, an estimated 9.6% were unstably housed; among those enrolled in school, absenteeism was pervasive with 29% having missed over 6 days of school in the past month. Overall findings confirm the feasibility, efficiency and utility of RDS in quickly reaching diverse samples of youth, including those both in and out of school and those unstably housed, and provide direction for optimizing RDS methods with this population. In our rapidly urbanizing global landscape with an unprecedented youth population, RDS may serve as a valuable tool in complementing existing household- and school-based methods for health-related surveillance that can guide policy. PMID:25454005
Population Pharmacokinetics of Metronidazole Evaluated Using Scavenged Samples from Preterm Infants

PubMed Central

Ouellet, Daniele; Smith, P. Brian; James, Laura P.; Ross, Ashley; Sullivan, Janice E.; Walsh, Michele C.; Zadell, Arlene; Newman, Nancy; White, Nicole R.; Kashuba, Angela D. M.; Benjamin, Daniel K.

2012-01-01

Pharmacokinetic (PK) studies in preterm infants are rarely conducted due to the research challenges posed by this population. To overcome these challenges, minimal-risk methods such as scavenged sampling can be used to evaluate the PK of commonly used drugs in this population. We evaluated the population PK of metronidazole using targeted sparse sampling and scavenged samples from infants that were ≤32 weeks of gestational age at birth and <120 postnatal days. A 5-center study was performed. A population PK model using nonlinear mixed-effect modeling (NONMEM) was developed. Covariate effects were evaluated based on estimated precision and clinical significance. Using the individual Bayesian PK estimates from the final population PK model and the dosing regimen used for each subject, the proportion of subjects achieving the therapeutic target of trough concentrations >8 mg/liter was calculated. Monte Carlo simulations were performed to evaluate the adequacy of different dosing recommendations per gestational age group. Thirty-two preterm infants were enrolled: the median (range) gestational age at birth was 27 (22 to 32) weeks, postnatal age was 41 (0 to 97) days, postmenstrual age (PMA) was 32 (24 to 43) weeks, and weight was 1,495 (678 to 3,850) g. The final PK data set contained 116 samples; 104/116 (90%) were scavenged from discarded clinical specimens. Metronidazole population PK was best described by a 1-compartment model. The population mean clearance (CL; liter/h) was determined as 0.0397 × (weight/1.5) × (PMA/32)^2.49 using a volume of distribution (V) (liter) of 1.07 × (weight/1.5). The relative standard errors around parameter estimates ranged between 11% and 30%. On average, metronidazole concentrations in scavenged samples were 30% lower than those measured in scheduled blood draws. The majority of infants (>70%) met predefined pharmacodynamic efficacy targets. A new, simplified, postmenstrual-age-based dosing regimen is recommended for this population. 
Minimal-risk methods such as scavenged PK sampling provided meaningful information related to development of metronidazole PK models and dosing recommendations. PMID:22252819
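The covariate model quoted in this abstract can be evaluated directly. The sketch below is illustrative only and makes several assumptions that the abstract does not state outright: weight is taken in kilograms (the abstract reports weights in grams but normalizes by 1.5), PMA is taken in weeks, and the exponent 2.49 applies to the (PMA/32) term. The function name `metronidazole_typical_pk` is hypothetical, and the half-life is the standard 1-compartment relation ln(2) × V / CL rather than a figure reported by the authors.

```python
import math


def metronidazole_typical_pk(weight_kg: float, pma_weeks: float) -> dict:
    """Typical-value clearance and volume from the abstract's covariate model.

    CL (liter/h) = 0.0397 * (weight / 1.5) * (PMA / 32) ** 2.49
    V  (liter)   = 1.07   * (weight / 1.5)
    Units for weight (kg) and PMA (weeks) are assumptions, see the lead-in.
    """
    if weight_kg <= 0 or pma_weeks <= 0:
        raise ValueError("weight and PMA must be positive")
    cl = 0.0397 * (weight_kg / 1.5) * (pma_weeks / 32.0) ** 2.49
    v = 1.07 * (weight_kg / 1.5)
    # Elimination half-life for a 1-compartment model with first-order elimination.
    half_life_h = math.log(2) * v / cl
    return {"cl_l_per_h": cl, "v_l": v, "half_life_h": half_life_h}


# Reference infant implied by the normalizing constants: 1.5 kg, PMA 32 weeks.
reference = metronidazole_typical_pk(1.5, 32.0)
# A younger infant of the same weight has a lower predicted clearance.
younger = metronidazole_typical_pk(1.5, 26.0)
```

At the reference values both scaling terms equal 1, so the function returns CL = 0.0397 liter/h and V = 1.07 liter. Because weight scales CL and V identically in this model, the predicted half-life depends only on PMA.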

Surveillance Among Men Who have Sex with Men in the United States: A Comparison of Web-Based and Venue-Based Samples.

PubMed

Chen, Yen-Tyng; Bowles, Kristina; An, Qian; DiNenno, Elizabeth; Finlayson, Teresa; Hoots, Brooke; Paz-Bailey, Gabriela; Wejnert, Cyprian

2018-07-01

Although men who have sex with men (MSM) recruited through web-based and venue-based sampling methods have been compared, no large web-based and venue-based samples using similar survey instruments have been examined in the U.S. This study describes the differences in sociodemographic characteristics and risk behaviors between the 2012 Web-based HIV Behavioral Survey (n = 3221) and 2011 National HIV Behavioral Surveillance (n = 9256). Compared with participants in the venue-based sample, participants in the web-based sample were older, less likely to be black or Hispanic, more likely to have higher socioeconomic status, and more likely to have anal sex without a condom with their last male sex partner. Web-based participants were less likely to have multiple male sex partners, ever injected drugs, been tested for HIV in the past 12 months, and received free condoms than venue-based participants. The method for sampling MSM into a behavioral survey should consider the sub-population of MSM to be reached.
The Petersen-Lincoln estimator and its extension to estimate the size of a shared population.

PubMed

Chao, Anne; Pan, H-Y; Chiang, Shu-Chuan

2008-12-01

The Petersen-Lincoln estimator has been used to estimate the size of a population in a single mark release experiment. However, the estimator is not valid when the capture sample and recapture sample are not independent. We provide an intuitive interpretation for "independence" between samples based on 2 x 2 categorical data formed by capture/non-capture in each of the two samples. From the interpretation, we review a general measure of "dependence" and quantify the correlation bias of the Petersen-Lincoln estimator when two types of dependences (local list dependence and heterogeneity of capture probability) exist. An important implication in the census undercount problem is that instead of using a post enumeration sample to assess the undercount of a census, one should conduct a prior enumeration sample to avoid correlation bias. We extend the Petersen-Lincoln method to the case of two populations. This new estimator of the size of the shared population is proposed and its variance is derived. We discuss a special case where the correlation bias of the proposed estimator due to dependence between samples vanishes. The proposed method is applied to a study of the relapse rate of illicit drug use in Taiwan. (© 2008 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim).
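As background for this record, the basic single mark-release Petersen-Lincoln estimator can be written in a few lines. The sketch covers only the classical estimator and Chapman's bias-corrected variant, which is a standard refinement not mentioned in the abstract; it does not implement the authors' shared-population extension or their correlation-bias adjustment. The function names and the example counts are hypothetical, and both estimators assume the two samples are independent.

```python
def petersen_lincoln(n1: int, n2: int, m2: int) -> float:
    """Classical estimate of population size from one mark-release experiment.

    n1: animals captured and marked in the first sample
    n2: animals captured in the second sample
    m2: animals in the second sample that carry a mark
    The marked fraction of the second sample estimates n1 / N, so N_hat = n1 * n2 / m2.
    """
    if m2 <= 0:
        raise ValueError("at least one recapture is required")
    if m2 > min(n1, n2):
        raise ValueError("recaptures cannot exceed either sample size")
    return n1 * n2 / m2


def chapman(n1: int, n2: int, m2: int) -> float:
    """Chapman's variant, which reduces small-sample bias and tolerates m2 = 0."""
    if m2 < 0 or m2 > min(n1, n2):
        raise ValueError("recaptures must be between 0 and the smaller sample size")
    return (n1 + 1) * (n2 + 1) / (m2 + 1) - 1


# Made-up counts: 200 marked, 150 caught in the second sample, 30 of them marked.
n_pl = petersen_lincoln(200, 150, 30)
n_ch = chapman(200, 150, 30)
```

With these illustrative counts the classical estimate is 1000 and the Chapman estimate is slightly lower, at about 978. Positive dependence between the two samples inflates m2 and pulls both estimates downward, which is the correlation bias the abstract discusses.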
A filter paper dry blood spot procedure for acute intermittent porphyria population screening by use of whole blood uroporphyrinogen-I-synthase assay.

PubMed

Johansson, L; Thunell, S; Wetterberg, L

1984-03-13

A filter paper dry blood spot procedure for the determination of whole blood uroporphyrinogen-I-synthase (UIS) activity is presented. The method is based on the concept of enzyme specific activity, the enzyme activity being related to the haemoglobin concentration of the assay sample. The diagnostic capacity with regard to the acute intermittent porphyria (AIP) gene carrier state is shown to be equivalent to that of a washed red cell reference method. On grounds of easy capillary blood sampling, uncomplicated and safe mail specimen transport and simple laboratory reception routines, the method is stated to be well adapted for use in AIP preadolescent population screening.
Sex estimation in a modern American osteological sample using a discriminant function analysis from the calcaneus.

PubMed

DiMichele, Daniel L; Spradley, M Katherine

2012-09-10

Reliable methods for sex estimation during the development of a biological profile are important to the forensic community in instances when the common skeletal elements used to assess sex are absent or damaged. Sex estimation from the calcaneus has potentially significant importance for the forensic community. Specifically, measurements of the calcaneus provide an additional reliable method for sex estimation via discriminant function analysis based on a North American forensic population. Research on a modern American sample was chosen in order to develop up-to-date population specific discriminant functions for sex estimation. The current study addresses this matter, building upon previous research, and introduces a new measurement, posterior circumference, that promises to advance the accuracy of use of this single, highly resistant bone in future instances of sex determination from partial skeletal remains. Data were collected from The William Bass Skeletal Collection, housed at The University of Tennessee. Sample size includes 320 adult individuals born between the years 1900 and 1985. The sample comprised 136 females and 184 males. Skeletons used for measurements were confined to those with fused diaphyses showing no signs of pathology or damage that may have altered measurements, and that also had accompanying records that included information on ancestry, age, and sex. Measurements collected and analyzed include maximum length, load-arm length, load-arm width, and posterior circumference. A discriminant function based on all four variables was computed from the sample in SAS 9.1.3. The discriminant function obtained an overall cross-validated classification rate of 86.69%. Females were classified correctly in 88.64% of the cases and males were correctly classified in 84.75% of the cases. 
Due to the increasing heterogeneity of current populations, further discussion on this topic will include the importance that the re-evaluation of past studies has on modern forensic populations. Due to secular and microevolutionary changes among populations, the near future must include additional methods being updated, and new methods being examined, both of which should cover a wide population spectrum. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Within-Plant Distribution of Adult Brown Stink Bug (Hemiptera: Pentatomidae) in Corn and Its Implications on Stink Bug Sampling and Management in Corn.

PubMed

Babu, Arun; Reisig, Dominic D

2018-05-29

Brown stink bug, Euschistus servus (Say) (Hemiptera: Pentatomidae), has emerged as a significant pest of corn, Zea mays L., in the southeastern United States. A 2-year study was conducted to quantify the within-plant vertical distribution of adult E. servus in field corn, to examine potential plant phenological characteristics associated with their observed distribution, and to select an efficient partial plant sampling method for adult E. servus population estimation. Within-plant distribution of adult E. servus was influenced by corn phenology. On V4- and V6-stage corn, most of the individuals were found at the base of the plant. Mean relative vertical position of adult E. servus population in corn plants trended upward between the V6 and V14 growth stages. During the reproductive corn growth stages (R1, R2, and R4), a majority of the adult E. servus were concentrated around developing ears. Based on the multiple selection criteria, during V4-V6 corn growth stages, either the corn stalk below the lowest green leaf or basal stratum method could be employed for efficient E. servus sampling. Similarly, during reproductive corn growth stages (R1-R4), the plant parts between two leaves above and three leaves below the primary ear leaf were found to provide the most precise and cost-efficient sampling method. The results from our study successfully demonstrate that in the early vegetative and reproductive stages of corn, scouts can replace the current labor-intensive whole-plant search method with a more efficient, specific partial plant sampling method for E. servus population estimation.
A Direct Latent Variable Modeling Based Method for Point and Interval Estimation of Coefficient Alpha

ERIC Educational Resources Information Center

Raykov, Tenko; Marcoulides, George A.

2015-01-01

A direct approach to point and interval estimation of Cronbach's coefficient alpha for multiple component measuring instruments is outlined. The procedure is based on a latent variable modeling application with widely circulated software. As a by-product, using sample data the method permits ascertaining whether the population discrepancy…
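As a point of reference for the quantity being estimated, the following is a minimal sketch of the classical sample formula for coefficient alpha. It is not the latent variable modeling procedure the abstract describes, and it gives no interval estimate; the function name `cronbach_alpha` and the data layout are assumptions made for illustration.

```python
from statistics import variance


def cronbach_alpha(items):
    """Classical coefficient alpha.

    items: list of k lists; each inner list holds one component's scores
    for the same n respondents (hypothetical layout for this sketch).
    """
    k = len(items)
    if k < 2:
        raise ValueError("alpha needs at least two components")
    n = len(items[0])
    # Total score of each respondent across the k components.
    totals = [sum(component[i] for component in items) for i in range(n)]
    # Sum of the component variances (sample variances, n - 1 denominator).
    component_variance_sum = sum(variance(component) for component in items)
    return k / (k - 1) * (1 - component_variance_sum / variance(totals))
```

With perfectly parallel components the function returns 1, and with two uncorrelated components of equal variance it returns 0, which matches the usual reading of alpha as a summary of shared variance among components.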
Kinetic quantitation of cerebral PET-FDG studies without concurrent blood sampling: statistical recovery of the arterial input function.

PubMed

O'Sullivan, F; Kirrane, J; Muzi, M; O'Sullivan, J N; Spence, A M; Mankoff, D A; Krohn, K A

2010-03-01

Kinetic quantitation of dynamic positron emission tomography (PET) studies via compartmental modeling usually requires the time-course of the radio-tracer concentration in the arterial blood as an arterial input function (AIF). For human and animal imaging applications, significant practical difficulties are associated with direct arterial sampling and as a result there is substantial interest in alternative methods that require no blood sampling at the time of the study. A fixed population template input function derived from prior experience with directly sampled arterial curves is one possibility. Image-based extraction, including requisite adjustment for spillover and recovery, is another approach. The present work considers a hybrid statistical approach based on a penalty formulation in which the information derived from a priori studies is combined in a Bayesian manner with information contained in the sampled image data in order to obtain an input function estimate. The absolute scaling of the input is achieved by an empirical calibration equation involving the injected dose together with the subject's weight, height and gender. The technique is illustrated in the context of 18F-fluorodeoxyglucose (FDG) PET studies in humans. A collection of 79 arterially sampled FDG blood curves are used as a basis for a priori characterization of input function variability, including scaling characteristics. Data from a series of 12 dynamic cerebral FDG PET studies in normal subjects are used to evaluate the performance of the penalty-based AIF estimation technique. The focus of evaluations is on quantitation of FDG kinetics over a set of 10 regional brain structures. As well as the new method, a fixed population template AIF and a direct AIF estimate based on segmentation are also considered. Kinetics analyses resulting from these three AIFs are compared with those resulting from radially sampled AIFs.
The proposed penalty-based AIF extraction method is found to achieve significant improvements over the fixed template and the segmentation methods. As well as achieving acceptable kinetic parameter accuracy, the quality of fit of the region of interest (ROI) time-course data based on the extracted AIF, matches results based on arterially sampled AIFs. In comparison, significant deviation in the estimation of FDG flux and degradation in ROI data fit are found with the template and segmentation methods. The proposed AIF extraction method is recommended for practical use.
Sampling Key Populations for HIV Surveillance: Results From Eight Cross-Sectional Studies Using Respondent-Driven Sampling and Venue-Based Snowball Sampling

PubMed Central

Stahlman, Shauna; Hargreaves, James; Weir, Sharon; Edwards, Jessie; Rice, Brian; Kochelani, Duncan; Mavimbela, Mpumelelo; Baral, Stefan

2017-01-01

Background In using regularly collected or existing surveillance data to characterize engagement in human immunodeficiency virus (HIV) services among marginalized populations, differences in sampling methods may produce different pictures of the target population and may therefore result in different priorities for response. Objective The objective of this study was to use existing data to evaluate the sample distribution of eight studies of female sex workers (FSW) and men who have sex with men (MSM), who were recruited using different sampling approaches in two locations within Sub-Saharan Africa: Manzini, Swaziland and Yaoundé, Cameroon. Methods MSM and FSW participants were recruited using either respondent-driven sampling (RDS) or venue-based snowball sampling. Recruitment took place between 2011 and 2016. Participants at each study site were administered a face-to-face survey to assess sociodemographics, along with the prevalence of self-reported HIV status, frequency of HIV testing, stigma, and other HIV-related characteristics. Crude and RDS-adjusted prevalence estimates were calculated. Crude prevalence estimates from the venue-based snowball samples were compared with the overlap of the RDS-adjusted prevalence estimates, between both FSW and MSM in Cameroon and Swaziland. Results RDS samples tended to be younger (MSM aged 18-21 years in Swaziland: 47.6% [139/310] in RDS vs 24.3% [42/173] in Snowball, in Cameroon: 47.9% [99/306] in RDS vs 20.1% [52/259] in Snowball; FSW aged 18-21 years in Swaziland 42.5% [82/325] in RDS vs 8.0% [20/249] in Snowball; in Cameroon 15.6% [75/576] in RDS vs 8.1% [25/306] in Snowball). 
They were less educated (MSM: primary school completed or less in Swaziland 42.6% [109/310] in RDS vs 4.0% [7/173] in Snowball, in Cameroon 46.2% [138/306] in RDS vs 14.3% [37/259] in Snowball; FSW: primary school completed or less in Swaziland 86.6% [281/325] in RDS vs 23.9% [59/247] in Snowball, in Cameroon 87.4% [520/576] in RDS vs 77.5% [238/307] in Snowball) than the snowball samples. In addition, RDS samples indicated lower exposure to HIV prevention information, less knowledge about HIV prevention, limited access to HIV prevention tools such as condoms, and less-reported frequency of sexually transmitted infections (STI) and HIV testing as compared with the venue-based samples. Findings pertaining to the level of disclosure of sexual practices and sexual practice–related stigma were mixed. Conclusions Samples generated by RDS and venue-based snowball sampling produced significantly different prevalence estimates of several important characteristics. These findings are tempered by limitations to the application of both approaches in practice. Ultimately, these findings provide further context for understanding existing surveillance data and how differences in methods of sampling can influence both the type of individuals captured and whether or not these individuals are representative of the larger target population. These data highlight the need to consider how program coverage estimates of marginalized populations are determined when characterizing the level of unmet need. PMID:29054832
Dietary Behaviors of a Racially and Ethnically Diverse Sample of Overweight and Obese Californians

ERIC Educational Resources Information Center

Sorkin, Dara H.; Billimek, John

2012-01-01

Objectives: To examine racial/ethnic differences in the dietary behaviors of overweight or obese adults using the 2007 California Health Interview Survey. Method: Data were obtained from the 2007 California Health Interview Survey, a population-based sample of noninstitutionalized adults in California. The sample included 26,721 adults aged 18…
A nonparametric method to generate synthetic populations to adjust for complex sampling design features.

PubMed

Dong, Qi; Elliott, Michael R; Raghunathan, Trivellore E

2014-06-01

Outside of the survey sampling literature, samples are often assumed to be generated by a simple random sampling process that produces independent and identically distributed (IID) samples. Many statistical methods are developed largely in this IID world. Application of these methods to data from complex sample surveys without making allowance for the survey design features can lead to erroneous inferences. Hence, much time and effort have been devoted to developing statistical methods to analyze complex survey data and account for the sample design. This issue is particularly important when generating synthetic populations using finite population Bayesian inference, as is often done in missing data or disclosure risk settings, or when combining data from multiple surveys. By extending previous work in the finite population Bayesian bootstrap literature, we propose a method to generate synthetic populations from a posterior predictive distribution in a fashion that inverts the complex sampling design features and generates simple random samples from a superpopulation point of view, making adjustments to the complex data so that they can be analyzed as simple random samples. We consider a simulation study with a stratified, clustered unequal-probability of selection sample design, and use the proposed nonparametric method to generate synthetic populations for the 2006 National Health Interview Survey (NHIS), and the Medical Expenditure Panel Survey (MEPS), which are stratified, clustered unequal-probability of selection sample designs.
A nonparametric method to generate synthetic populations to adjust for complex sampling design features

PubMed Central

Dong, Qi; Elliott, Michael R.; Raghunathan, Trivellore E.

2017-01-01

Outside of the survey sampling literature, samples are often assumed to be generated by a simple random sampling process that produces independent and identically distributed (IID) samples. Many statistical methods are developed largely in this IID world. Application of these methods to data from complex sample surveys without making allowance for the survey design features can lead to erroneous inferences. Hence, much time and effort have been devoted to developing statistical methods to analyze complex survey data and account for the sample design. This issue is particularly important when generating synthetic populations using finite population Bayesian inference, as is often done in missing data or disclosure risk settings, or when combining data from multiple surveys. By extending previous work in the finite population Bayesian bootstrap literature, we propose a method to generate synthetic populations from a posterior predictive distribution in a fashion that inverts the complex sampling design features and generates simple random samples from a superpopulation point of view, making adjustments to the complex data so that they can be analyzed as simple random samples. We consider a simulation study with a stratified, clustered unequal-probability of selection sample design, and use the proposed nonparametric method to generate synthetic populations for the 2006 National Health Interview Survey (NHIS), and the Medical Expenditure Panel Survey (MEPS), which are stratified, clustered unequal-probability of selection sample designs. PMID:29200608
Physical Activity among Rural Older Adults with Diabetes

ERIC Educational Resources Information Center

Arcury, Thomas A.; Snively, Beverly M.; Bell, Ronny A.; Smith, Shannon L.; Stafford, Jeanette M.; Wetmore-Arkader, Lindsay K.; Quandt, Sara A.

2006-01-01

Purpose: This analysis describes physical activity levels and factors associated with physical activity in an ethnically diverse (African American, Native American, white) sample of rural older adults with diabetes. Method: Data were collected using a population-based, cross-sectional stratified random sample survey of 701 community-dwelling…
High-Resolution Detection of Identity by Descent in Unrelated Individuals

PubMed Central

Browning, Sharon R.; Browning, Brian L.

2010-01-01

Detection of recent identity by descent (IBD) in population samples is important for population-based linkage mapping and for highly accurate genotype imputation and haplotype-phase inference. We present a method for detection of recent IBD in population samples. Our method accounts for linkage disequilibrium between SNPs to enable full use of high-density SNP data. We find that our method can detect segments of a length of 2 cM with moderate power and negligible false discovery rate in Illumina 550K data in Northwestern Europeans. We compare our method with GERMLINE and PLINK, and we show that our method has a level of resolution that is significantly better than these existing methods, thus extending the usefulness of recent IBD in analysis of high-density SNP data. We survey four genomic regions in a sample of UK individuals of European descent and find that on average, at a given location, our method detects IBD in 2.7 per 10,000 pairs of individuals in Illumina 550K data. We also present methodology and results for detection of homozygosity by descent (HBD) and survey the whole genome in a sample of 1373 UK individuals of European descent. We detect HBD in 4.7 individuals per 10,000 on average at a given location. Our methodology is implemented in the freely available BEAGLE software package. PMID:20303063
Triceps and Subscapular Skinfold Thickness Percentiles and Cut-Offs for Overweight and Obesity in a Population-Based Sample of Schoolchildren and Adolescents in Bogota, Colombia

PubMed Central

Ramírez-Vélez, Robinson; López-Cifuentes, Mario Ferney; Correa-Bautista, Jorge Enrique; González-Ruíz, Katherine; González-Jiménez, Emilio; Córdoba-Rodríguez, Diana Paola; Vivas, Andrés; Triana-Reina, Hector Reynaldo; Schmidt-RioValle, Jacqueline

2016-01-01

The assessment of skinfold thickness is an objective measure of adiposity. The aims of this study were to establish Colombian smoothed centile charts and LMS L (Box–Cox transformation), M (median), and S (coefficient of variation) tables for triceps, subscapular, and triceps + subscapular skinfolds; appropriate cut-offs were selected using receiver operating characteristic (ROC) analysis based on a population-based sample of children and adolescents in Bogotá, Colombia. A cross-sectional study was conducted in 9618 children and adolescents (55.7% girls; age range of 9–17.9 years). Triceps and subscapular skinfold measurements were obtained using standardized methods. We calculated the triceps + subscapular skinfold (T + SS) sum. Smoothed percentile curves for triceps and subscapular skinfold thickness were derived using the LMS method. ROC curve analyses were used to evaluate the optimal cut-off point of skinfold thickness for overweight and obesity, based on the International Obesity Task Force definitions. Subscapular and triceps skinfolds and T + SS were significantly higher in girls than in boys (p < 0.001). The ROC analysis showed that subscapular and triceps skinfolds and T + SS have a high discriminatory power in the identification of overweight and obesity in the sample population in this study. Our results provide sex- and age-specific normative reference standards for skinfold thickness values from a population from Bogotá, Colombia. PMID:27669294
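The LMS tables mentioned above are typically used through two standard conversions: from a measurement to a z-score, and from a z-score back to a centile value. The sketch below shows both under the usual LMS formulas; the function names and the L, M, S values in the usage note are hypothetical and are not taken from the Bogotá tables.

```python
import math


def lms_zscore(y, L, M, S):
    """z-score of measurement y given Box-Cox power L, median M, and CV S."""
    if L == 0:
        # Limiting case of the Box-Cox transform as L approaches 0.
        return math.log(y / M) / S
    return ((y / M) ** L - 1) / (L * S)


def lms_centile_value(z, L, M, S):
    """Measurement corresponding to z-score z (inverse of lms_zscore)."""
    if L == 0:
        return M * math.exp(S * z)
    return M * (1 + L * S * z) ** (1 / L)
```

For example, with illustrative values L = -0.5, M = 12.0 mm, and S = 0.3, a skinfold equal to the median gives `lms_zscore(12.0, -0.5, 12.0, 0.3) == 0`, and `lms_centile_value(1.645, -0.5, 12.0, 0.3)` gives the value at roughly the 95th centile.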
Evaluating manta ray mucus as an alternative DNA source for population genetics study: underwater-sampling, dry-storage and PCR success.

PubMed

Kashiwagi, Tom; Maxwell, Elisabeth A; Marshall, Andrea D; Christensen, Ana B

2015-01-01

Sharks and rays are increasingly being identified as high-risk species for extinction, prompting urgent assessments of their local or regional populations. Advanced genetic analyses can contribute relevant information on effective population size and connectivity among populations, although acquiring sufficient regional sample sizes can be challenging. DNA is typically amplified from tissue samples which are collected by hand spears with modified biopsy punch tips. This technique is not always popular due mainly to a perception that invasive sampling might harm the rays, change their behaviour, or have a negative impact on tourism. To explore alternative methods, we evaluated the yields and PCR success of DNA template prepared from manta ray mucus collected underwater and captured and stored on a Whatman FTA™ Elute card. The pilot study demonstrated that mucus can be effectively collected underwater using a toothbrush. DNA stored on cards was found to be reliable for PCR-based population genetics studies. We successfully amplified mtDNA ND5, nuclear DNA RAG1, and microsatellite loci for all samples and confirmed that the sequences and genotypes were those of the target species. As the yields of DNA with the tested method were low, further improvements are desirable for assays that may require larger amounts of DNA, such as population genomic studies using emerging next-gen sequencing.
Evaluating manta ray mucus as an alternative DNA source for population genetics study: underwater-sampling, dry-storage and PCR success

PubMed Central

Kashiwagi, Tom; Maxwell, Elisabeth A.; Marshall, Andrea D.; Christensen, Ana B.

2015-01-01

Sharks and rays are increasingly being identified as high-risk species for extinction, prompting urgent assessments of their local or regional populations. Advanced genetic analyses can contribute relevant information on effective population size and connectivity among populations, although acquiring sufficient regional sample sizes can be challenging. DNA is typically amplified from tissue samples which are collected by hand spears with modified biopsy punch tips. This technique is not always popular due mainly to a perception that invasive sampling might harm the rays, change their behaviour, or have a negative impact on tourism. To explore alternative methods, we evaluated the yields and PCR success of DNA template prepared from manta ray mucus collected underwater and captured and stored on a Whatman FTA™ Elute card. The pilot study demonstrated that mucus can be effectively collected underwater using a toothbrush. DNA stored on cards was found to be reliable for PCR-based population genetics studies. We successfully amplified mtDNA ND5, nuclear DNA RAG1, and microsatellite loci for all samples and confirmed that the sequences and genotypes were those of the target species. As the yields of DNA with the tested method were low, further improvements are desirable for assays that may require larger amounts of DNA, such as population genomic studies using emerging next-gen sequencing. PMID:26413431
Bounce Back Now! Protocol of a population-based randomized controlled trial to examine the efficacy of a Web-based intervention with disaster-affected families.

PubMed

Ruggiero, Kenneth J; Davidson, Tatiana M; McCauley, Jenna; Gros, Kirstin Stauffacher; Welsh, Kyleen; Price, Matthew; Resnick, Heidi S; Danielson, Carla Kmett; Soltis, Kathryn; Galea, Sandro; Kilpatrick, Dean G; Saunders, Benjamin E; Nissenboim, Josh; Muzzy, Wendy; Fleeman, Anna; Amstadter, Ananda B

2015-01-01

Disasters have far-reaching and potentially long-lasting effects on youth and families. Research has consistently shown a clear increase in the prevalence of several mental health disorders after disasters, including depression and posttraumatic stress disorder. Widely accessible evidence-based interventions are needed to address this unmet need for youth and families, who are underrepresented in disaster research. Rapid growth in Internet and Smartphone access, as well as several Web-based evaluation studies with various adult populations, have shown that Web-based interventions are likely to be feasible in this context and can improve clinical outcomes. Such interventions also are generally cost-effective, can be targeted or personalized, and can easily be integrated in a stepped care approach to screening and intervention delivery. This is a protocol paper that describes an innovative study design in which we evaluate a self-help Web-based resource, Bounce Back Now, with a population-based sample of disaster-affected adolescents and families. The paper includes description and justification for sampling selection and procedures, selection of assessment measures and methods, design of the intervention, and statistical evaluation of critical outcomes. Unique features of this study design include the use of address-based sampling to recruit a population-based sample of disaster-affected adolescents and parents, telephone and Web-based assessments, and development and evaluation of a highly individualized Web intervention for adolescents. Challenges related to large-scale evaluation of technology-delivered interventions with high-risk samples in time-sensitive research are discussed, as well as implications for future research and practice. Published by Elsevier Inc.
Bounce Back Now! Protocol of a Population-Based Randomized Controlled Trial to Examine the Efficacy of a Web-based Intervention with Disaster-Affected Families

PubMed Central

Ruggiero, Kenneth J.; Davidson, Tatiana M.; McCauley, Jenna; Gros, Kirstin Stauffacher; Welsh, Kyleen; Price, Matthew; Resnick, Heidi S.; Danielson, Carla Kmett; Soltis, Kathryn; Galea, Sandro; Kilpatrick, Dean G.; Saunders, Benjamin E.; Nissenboim, Josh; Muzzy, Wendy; Fleeman, Anna; Amstadter, Ananda B.

2014-01-01

Disasters have far-reaching and potentially long-lasting effects on youth and families. Research has consistently shown a clear increase in the prevalence of several mental health disorders after disasters, including depression and posttraumatic stress disorder. Widely accessible evidence-based interventions are needed to address this unmet need for youth and families, who are underrepresented in disaster research. Rapid growth in Internet and Smartphone access, as well as several web-based evaluation studies with various adult populations, have shown that web-based interventions are likely to be feasible in this context and can improve clinical outcomes. Such interventions also are generally cost-effective, can be targeted or personalized, and can easily be integrated in a stepped care approach to screening and intervention delivery. This is a protocol paper that describes an innovative study design in which we evaluate a self-help web-based resource, Bounce Back Now, with a population-based sample of disaster-affected adolescents and families. The paper includes description and justification for sampling selection and procedures, selection of assessment measures and methods, design of the intervention, and statistical evaluation of critical outcomes. Unique features of this study design include the use of address-based sampling to recruit a population-based sample of disaster-affected adolescents and parents, telephone and web-based assessments, and development and evaluation of a highly individualized web intervention for adolescents. Challenges related to large-scale evaluation of technology-delivered interventions with high-risk samples in time-sensitive research are discussed, as well as implications for future research and practice. PMID:25478956
Palladium-based Mass-Tag Cell Barcoding with a Doublet-Filtering Scheme and Single Cell Deconvolution Algorithm

PubMed Central

Zunder, Eli R.; Finck, Rachel; Behbehani, Gregory K.; Amir, El-ad D.; Krishnaswamy, Smita; Gonzalez, Veronica D.; Lorang, Cynthia G.; Bjornson, Zach; Spitzer, Matthew H.; Bodenmiller, Bernd; Fantl, Wendy J.; Pe’er, Dana; Nolan, Garry P.

2015-01-01

SUMMARY Mass-tag cell barcoding (MCB) labels individual cell samples with unique combinatorial barcodes, after which they are pooled for processing and measurement as a single multiplexed sample. The MCB method eliminates variability between samples in antibody staining and instrument sensitivity, reduces antibody consumption, and shortens instrument measurement time. Here, we present an optimized MCB protocol with several improvements over previously described methods. The use of palladium-based labeling reagents expands the number of measurement channels available for mass cytometry and reduces interference with lanthanide-based antibody measurement. An error-detecting combinatorial barcoding scheme allows cell doublets to be identified and removed from the analysis. A debarcoding algorithm that is single cell-based rather than population-based improves the accuracy and efficiency of sample deconvolution. This debarcoding algorithm has been packaged into software that allows rapid and unbiased sample deconvolution. The MCB procedure takes 3–4 h, not including sample acquisition time of ~1 h per million cells. PMID:25612231
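The abstract does not spell out the barcoding scheme or the debarcoding rule, so the following is only a sketch of how an error-detecting combinatorial code and a per-cell assignment could work. It assumes an n-choose-k scheme (6 channels, 3 positive per sample), intensities already normalized to a 0 to 1 scale, and a hypothetical separation cutoff; none of these specifics come from the abstract.

```python
from itertools import combinations


def make_codes(n_channels=6, k=3):
    """All barcodes with exactly k positive channels out of n_channels."""
    return [frozenset(c) for c in combinations(range(n_channels), k)]


def debarcode(intensities, k=3, min_separation=0.3):
    """Assign one cell to a barcode, or return None if it is ambiguous.

    The k brightest channels are treated as positive. The gap between the
    k-th and (k+1)-th brightest channels is the separation; a doublet of two
    differently barcoded cells lights up more than k channels, so its gap is
    small and the event is rejected.
    """
    order = sorted(range(len(intensities)),
                   key=lambda i: intensities[i], reverse=True)
    separation = intensities[order[k - 1]] - intensities[order[k]]
    if separation < min_separation:
        return None
    return frozenset(order[:k])
```

Because every valid code has exactly k positive channels, any event with a different number of clearly positive channels cannot be a single barcoded cell, which is what makes the scheme error-detecting. Each cell is judged on its own channel intensities rather than on gates drawn over the whole population.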
Predicting discovery rates of genomic features.

PubMed

Gravel, Simon

2014-06-01

Successful sequencing experiments require judicious sample selection. However, this selection must often be performed on the basis of limited preliminary data. Predicting the statistical properties of the final sample based on preliminary data can be challenging, because numerous uncertain model assumptions may be involved. Here, we ask whether we can predict "omics" variation across many samples by sequencing only a fraction of them. In the infinite-genome limit, we find that a pilot study sequencing 5% of a population is sufficient to predict the number of genetic variants in the entire population within 6% of the correct value, using an estimator agnostic to demography, selection, or population structure. To reach similar accuracy in a finite genome with millions of polymorphisms, the pilot study would require ∼15% of the population. We present computationally efficient jackknife and linear programming methods that exhibit substantially less bias than the state of the art when applied to simulated data and subsampled 1000 Genomes Project data. Extrapolating based on the National Heart, Lung, and Blood Institute Exome Sequencing Project data, we predict that 7.2% of sites in the capture region would be variable in a sample of 50,000 African Americans and 8.8% in a European sample of equal size. Finally, we show how the linear programming method can also predict discovery rates of various genomic features, such as the number of transcription factor binding sites across different cell types. Copyright © 2014 by the Genetics Society of America.
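To illustrate the general idea of jackknife extrapolation from a pilot sample, here is a sketch of the generic first-order jackknife estimator familiar from species-richness problems. It is not the estimator developed in the paper, whose details the abstract does not give; the function name and the input layout are assumptions.

```python
def jackknife1_variant_count(carriers_per_variant, n_samples):
    """First-order jackknife estimate of the total number of variants.

    carriers_per_variant: for each candidate site, how many of the
    n_samples sequenced individuals carry the variant (hypothetical layout).
    """
    observed = sum(1 for c in carriers_per_variant if c > 0)
    # Variants seen in exactly one individual drive the extrapolation.
    singletons = sum(1 for c in carriers_per_variant if c == 1)
    return observed + singletons * (n_samples - 1) / n_samples
```

For instance, `jackknife1_variant_count([1, 1, 2, 5, 0], 10)` counts 4 observed variants and 2 singletons, giving 4 + 2 × 9/10 = 5.8. The estimate exceeds the observed count whenever singletons are present, reflecting variants that a larger sample would be expected to reveal.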

Challenges of DNA-based mark-recapture studies of American black bears

USGS Publications Warehouse

Settlage, K.E.; Van Manen, F.T.; Clark, J.D.; King, T.L.

2008-01-01

We explored whether genetic sampling would be feasible to provide a region-wide population estimate for American black bears (Ursus americanus) in the southern Appalachians, USA. Specifically, we determined whether adequate capture probabilities (p >0.20) and population estimates with a low coefficient of variation (CV <20%) could be achieved given typical agency budget and personnel constraints. We extracted DNA from hair collected from baited barbed-wire enclosures sampled over a 10-week period on 2 study areas: a high-density black bear population in a portion of Great Smoky Mountains National Park and a lower density population on National Forest lands in North Carolina, South Carolina, and Georgia. We identified individual bears by their unique genotypes obtained from 9 microsatellite loci. We sampled 129 and 60 different bears in the National Park and National Forest study areas, respectively, and applied closed mark–recapture models to estimate population abundance. Capture probabilities and precision of the population estimates were acceptable only for sampling scenarios for which we pooled weekly sampling periods. We detected capture heterogeneity biases, probably because of inadequate spatial coverage by the hair-trapping grid. The logistical challenges of establishing and checking a sufficiently high density of hair traps make DNA-based estimates of black bears impractical for the southern Appalachian region. Alternatives are to estimate population size for smaller areas, estimate population growth rates or survival using mark–recapture methods, or use independent marking and recapturing techniques to reduce capture heterogeneity.
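The precision target in this abstract (CV < 20%) can be made concrete with a much simpler estimator than the closed mark–recapture models the authors fitted. The sketch below uses Chapman's bias-corrected two-occasion estimator and its usual variance approximation; treating the study as two pooled occasions, and the counts in the usage note, are assumptions for illustration only.

```python
import math


def chapman_estimate(n1, n2, m2):
    """Chapman's two-occasion abundance estimate and its CV.

    n1: animals identified on occasion 1
    n2: animals identified on occasion 2
    m2: animals identified on both occasions (recaptures)
    """
    n_hat = (n1 + 1) * (n2 + 1) / (m2 + 1) - 1
    var = ((n1 + 1) * (n2 + 1) * (n1 - m2) * (n2 - m2)
           / ((m2 + 1) ** 2 * (m2 + 2)))
    cv = math.sqrt(var) / n_hat
    return n_hat, cv
```

With hypothetical counts of 50 and 40 genotyped bears and 10 recaptures, `chapman_estimate(50, 40, 10)` gives an abundance near 189 with a CV of about 22%, slightly above the 20% target. More recaptures, for example from denser trap grids or pooled sampling periods, are what bring the CV down.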
Population genomics of C. melanopterus using target gene capture data: demographic inferences and conservation perspectives

PubMed Central

Maisano Delser, Pierpaolo; Corrigan, Shannon; Hale, Matthew; Li, Chenhong; Veuille, Michel; Planes, Serge; Naylor, Gavin; Mona, Stefano

2016-01-01

Population genetics studies on non-model organisms typically involve sampling few markers from multiple individuals. Next-generation sequencing approaches open up the possibility of sampling many more markers from fewer individuals to address the same questions. Here, we applied a target gene capture method to deep sequence ~1000 independent autosomal regions of a non-model organism, the blacktip reef shark (Carcharhinus melanopterus). We devised a sampling scheme based on the predictions of theoretical studies of metapopulations to show that sampling few individuals, but many loci, can be extremely informative to reconstruct the evolutionary history of species. We collected data from a single deme (SID) from Northern Australia and from a scattered sampling representing various locations throughout the Indian Ocean (SCD). We explored the genealogical signature of population dynamics detected from both sampling schemes using an ABC algorithm. We then contrasted these results with those obtained by fitting the data to a non-equilibrium finite island model. Both approaches supported an Nm value ~40, consistent with philopatry in this species. Finally, we demonstrate through simulation that metapopulations exhibit greater resilience to recent changes in effective size compared to unstructured populations. We propose an empirical approach to detect recent bottlenecks based on our sampling scheme. PMID:27651217
Population genomics of C. melanopterus using target gene capture data: demographic inferences and conservation perspectives.

PubMed

Maisano Delser, Pierpaolo; Corrigan, Shannon; Hale, Matthew; Li, Chenhong; Veuille, Michel; Planes, Serge; Naylor, Gavin; Mona, Stefano

2016-09-21

Population genetics studies on non-model organisms typically involve sampling few markers from multiple individuals. Next-generation sequencing approaches open up the possibility of sampling many more markers from fewer individuals to address the same questions. Here, we applied a target gene capture method to deep sequence ~1000 independent autosomal regions of a non-model organism, the blacktip reef shark (Carcharhinus melanopterus). We devised a sampling scheme based on the predictions of theoretical studies of metapopulations to show that sampling few individuals, but many loci, can be extremely informative to reconstruct the evolutionary history of species. We collected data from a single deme (SID) from Northern Australia and from a scattered sampling representing various locations throughout the Indian Ocean (SCD). We explored the genealogical signature of population dynamics detected from both sampling schemes using an ABC algorithm. We then contrasted these results with those obtained by fitting the data to a non-equilibrium finite island model. Both approaches supported an Nm value ~40, consistent with philopatry in this species. Finally, we demonstrate through simulation that metapopulations exhibit greater resilience to recent changes in effective size compared to unstructured populations. We propose an empirical approach to detect recent bottlenecks based on our sampling scheme.
The Vineyard Yeast Microbiome, a Mixed Model Microbial Map

PubMed Central

Setati, Mathabatha Evodia; Jacobson, Daniel; Andong, Ursula-Claire; Bauer, Florian

2012-01-01

Vineyards harbour a wide variety of microorganisms that play a pivotal role in pre- and post-harvest grape quality and will contribute significantly to the final aromatic properties of wine. The aim of the current study was to investigate the spatial distribution of microbial communities within and between individual vineyard management units. For the first time in such a study, we applied the Theory of Sampling (TOS) to sample grapes from adjacent and well-established commercial vineyards within the same terroir unit and from several sampling points within each individual vineyard. Cultivation-based and molecular data sets were generated to capture the spatial heterogeneity in microbial populations within and between vineyards and analysed with novel mixed-model networks, which combine sample correlations and microbial community distribution probabilities. The data demonstrate that farming systems have a significant impact on fungal diversity but more importantly that there is significant species heterogeneity between samples in the same vineyard. Cultivation-based methods confirmed that while the same oxidative yeast species dominated in all vineyards, the least treated vineyard displayed significantly higher species richness, including many yeasts with biocontrol potential. The cultivatable yeast population was not fully representative of the more complex populations seen with molecular methods, and only the molecular data allowed discrimination amongst farming practices with multivariate and network analysis methods. Importantly, yeast species distribution is subject to significant intra-vineyard spatial fluctuations and the frequently reported heterogeneity of tank samples of grapes harvested from single vineyards at the same stage of ripeness might therefore, at least in part, be due to the differing microbiota in different sections of the vineyard. PMID:23300721
Science deficiency in conservation practice: the monitoring of tiger populations in India

USGS Publications Warehouse

Karanth, K.U.; Nichols, J.D.; Seidensticker, J.; Dinerstein, Eric; Smith, J.L.D.; McDougal, C.; Johnsingh, A.J.T.; Chundawat, Raghunandan S.; Thapar, V.

2003-01-01

Conservation practices are supposed to get refined by advancing scientific knowledge. We study this phenomenon in the context of monitoring tiger populations in India, by evaluating the 'pugmark census method' employed by wildlife managers for three decades. We use an analytical framework of modern animal population sampling to test the efficacy of the pugmark censuses using scientific data on tigers and our field observations. We identify three critical goals for monitoring tiger populations, in order of increasing sophistication: (1) distribution mapping, (2) tracking relative abundance, (3) estimation of absolute abundance. We demonstrate that the present census-based paradigm does not work because it ignores the first two simpler goals, and targets, but fails to achieve, the most difficult third goal. We point out the utility and ready availability of alternative monitoring paradigms that deal with the central problems of spatial sampling and observability. We propose an alternative sampling-based approach that can be tailored to meet practical needs of tiger monitoring at different levels of refinement.
Efficient simulation and likelihood methods for non-neutral multi-allele models.

PubMed

Joyce, Paul; Genz, Alan; Buzbas, Erkan Ozge

2012-06-01

Throughout the 1980s, Simon Tavaré made numerous significant contributions to population genetics theory. As genetic data, in particular DNA sequence, became more readily available, a need to connect population-genetic models to data became the central issue. The seminal work of Griffiths and Tavaré (1994a, 1994b, 1994c) was among the first to develop a likelihood method to estimate the population-genetic parameters using full DNA sequences. Now, we are in the genomics era where methods need to scale up to handle massive data sets, and Tavaré has led the way to new approaches. However, performing statistical inference under non-neutral models has proved elusive. In tribute to Simon Tavaré, we present an article in the spirit of his work that provides a computationally tractable method for simulating and analyzing data under a class of non-neutral population-genetic models. Computational methods for approximating likelihood functions and generating samples under a class of allele-frequency based non-neutral parent-independent mutation models were proposed by Donnelly, Nordborg, and Joyce (DNJ) (Donnelly et al., 2001). DNJ (2001) simulated samples of allele frequencies from non-neutral models using neutral models as auxiliary distribution in a rejection algorithm. However, patterns of allele frequencies produced by neutral models are dissimilar to patterns of allele frequencies produced by non-neutral models, making the rejection method inefficient. For example, in some cases the methods in DNJ (2001) require 10^9 rejections before a sample from the non-neutral model is accepted. Our method simulates samples directly from the distribution of non-neutral models, making simulation methods a practical tool to study the behavior of the likelihood and to perform inference on the strength of selection.
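The inefficiency described in this abstract can be illustrated with a toy rejection sampler. The sketch below is not the DNJ algorithm and does not work on allele frequencies: it assumes a one-dimensional Uniform(0, 1) proposal standing in for the neutral model and a hypothetical weight exp(-strength * x) standing in for the selection term. It only shows that the acceptance rate drops as the target moves away from the proposal.

```python
import math
import random


def rejection_sample(weight, weight_max, n_samples, rng):
    """Draw n_samples from a density proportional to weight(x) on (0, 1).

    Proposals come from Uniform(0, 1); weight_max must bound weight(x).
    Returns the accepted samples and the number of proposals used.
    """
    samples = []
    proposals = 0
    while len(samples) < n_samples:
        x = rng.random()  # proposal draw (toy stand-in for the neutral model)
        proposals += 1
        if rng.random() * weight_max <= weight(x):
            samples.append(x)
    return samples, proposals


def acceptance_rate(strength, n_samples=500, seed=0):
    """Empirical acceptance rate for the hypothetical weight exp(-strength * x)."""
    rng = random.Random(seed)
    _, proposals = rejection_sample(
        lambda x: math.exp(-strength * x), 1.0, n_samples, rng
    )
    return n_samples / proposals


weak = acceptance_rate(1.0)     # target close to the proposal
strong = acceptance_rate(50.0)  # target far from the proposal
```

For this toy weight the expected acceptance rate is (1 - exp(-strength)) / strength, so it shrinks roughly in proportion to 1/strength.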
Statistical power calculations for mixed pharmacokinetic study designs using a population approach.

PubMed

Kloprogge, Frank; Simpson, Julie A; Day, Nicholas P J; White, Nicholas J; Tarning, Joel

2014-09-01

Simultaneous modelling of dense and sparse pharmacokinetic data is possible with a population approach. To determine the number of individuals required to detect the effect of a covariate, simulation-based power calculation methodologies can be employed. The Monte Carlo Mapped Power method (a simulation-based power calculation methodology using the likelihood ratio test) was extended in the current study to perform sample size calculations for mixed pharmacokinetic studies (i.e. both sparse and dense data collection). A workflow guiding an easy and straightforward pharmacokinetic study design, considering also the cost-effectiveness of alternative study designs, was used in this analysis. Initially, data were simulated for a hypothetical drug and then for the anti-malarial drug, dihydroartemisinin. Two datasets (sampling design A: dense; sampling design B: sparse) were simulated using a pharmacokinetic model that included a binary covariate effect and subsequently re-estimated using (1) the same model and (2) a model not including the covariate effect in NONMEM 7.2. Power calculations were performed for varying numbers of patients with sampling designs A and B. Study designs with statistical power >80% were selected and further evaluated for cost-effectiveness. The simulation studies of the hypothetical drug and the anti-malarial drug dihydroartemisinin demonstrated that the simulation-based power calculation methodology, based on the Monte Carlo Mapped Power method, can be utilised to evaluate and determine the sample size of mixed (part sparsely and part densely sampled) study designs. The developed method can contribute to the design of robust and efficient pharmacokinetic studies.
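A generic simulation-based power calculation with a likelihood ratio test can be sketched as follows. This is not the Monte Carlo Mapped Power method and does not use NONMEM or a pharmacokinetic model: it assumes a single normally distributed response per subject (for example a log-transformed parameter), a binary covariate split evenly across subjects, and hypothetical effect and standard deviation values chosen only for illustration.

```python
import math
import random

CHI2_1DF_95 = 3.841  # chi-square critical value, 1 degree of freedom, alpha = 0.05


def lrt_statistic(values, covariate):
    """-2 log likelihood ratio for a normal model with vs. without a group effect."""
    n = len(values)
    grand_mean = sum(values) / n
    rss_reduced = sum((v - grand_mean) ** 2 for v in values)
    rss_full = 0.0
    for level in (0, 1):
        group = [v for v, c in zip(values, covariate) if c == level]
        mean = sum(group) / len(group)
        rss_full += sum((v - mean) ** 2 for v in group)
    return n * math.log(rss_reduced / rss_full)


def simulated_power(n_subjects, effect, sd, n_sim=2000, seed=1):
    """Fraction of simulated studies in which the covariate effect is detected."""
    rng = random.Random(seed)
    covariate = [i % 2 for i in range(n_subjects)]  # alternate covariate levels
    hits = 0
    for _ in range(n_sim):
        values = [rng.gauss(effect * c, sd) for c in covariate]
        if lrt_statistic(values, covariate) > CHI2_1DF_95:
            hits += 1
    return hits / n_sim


power_small = simulated_power(n_subjects=20, effect=0.3, sd=0.3)
power_large = simulated_power(n_subjects=60, effect=0.3, sd=0.3)
```

Repeating the call over a range of n_subjects and keeping the designs whose estimated power exceeds 0.8 mirrors the selection step described in the abstract.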
Survey of predators and sampling method comparison in sweet corn.

PubMed

Musser, Fred R; Nyrop, Jan P; Shelton, Anthony M

2004-02-01

Natural predation is an important component of integrated pest management that is often overlooked because it is difficult to quantify and perceived to be unreliable. To begin incorporating natural predation into sweet corn, Zea mays L., pest management, a predator survey was conducted and then three sampling methods were compared for their ability to accurately monitor the most abundant predators. A predator survey on sweet corn foliage in New York between 1999 and 2001 identified 13 species. Orius insidiosus (Say), Coleomegilla maculata (De Geer), and Harmonia axyridis (Pallas) were the most numerous predators in all years. To determine the best method for sampling adult and immature stages of these predators, comparisons were made among nondestructive field counts, destructive counts, and yellow sticky cards. Field counts were correlated with destructive counts for all populations, but field counts of small insects were biased. Sticky cards underrepresented immature populations. Yellow sticky cards were more attractive to C. maculata adults than H. axyridis adults, especially before pollen shed, making coccinellid population estimates based on sticky cards unreliable. Field counts were the most precise method for monitoring adult and immature stages of the three major predators. Future research on predicting predation of pests in sweet corn should be based on field counts of predators because these counts are accurate, have no associated supply costs, and can be made quickly.
An integrated modeling approach to estimating Gunnison Sage-Grouse population dynamics: combining index and demographic data.

USGS Publications Warehouse

Davis, Amy J.; Hooten, Mevin B.; Phillips, Michael L.; Doherty, Paul F.

2014-01-01

Evaluation of population dynamics for rare and declining species is often limited to data that are sparse and/or of poor quality. Frequently, the best data available for rare bird species are based on large-scale, population count data. These data are commonly based on sampling methods that lack consistent sampling effort, do not account for detectability, and are complicated by observer bias. For some species, short-term studies of demographic rates have been conducted as well, but the data from such studies are typically analyzed separately. To utilize the strengths and minimize the weaknesses of these two data types, we developed a novel Bayesian integrated model that links population count data and population demographic data through population growth rate (λ) for Gunnison sage-grouse (Centrocercus minimus). The long-term population index data available for Gunnison sage-grouse are annual (years 1953–2012) male lek counts. An intensive demographic study was also conducted from years 2005 to 2010. We were able to reduce the variability in expected population growth rates across time, while correcting for potential small sample size bias in the demographic data. We found the population of Gunnison sage-grouse to be variable and slightly declining over the past 16 years.
Pilot Test of a Novel Method for Assessing Community Response to Low-Amplitude Sonic Booms

NASA Technical Reports Server (NTRS)

Fidell, Sanford; Horonjeff, Richard D.; Harris, Michael

2012-01-01

A pilot test of a novel method for assessing residents' annoyance to sonic booms was performed. During a two-week period, residents of the base housing area at Edwards Air Force Base provided data on their reactions to sonic booms using smartphone-based interviews. Noise measurements were conducted at the same time. The report presents information about data collection methods and about test participants' reactions to low-amplitude sonic booms. The latter information should not be viewed as definitive for several reasons. It may not be reliably generalized to the wider U.S. residential population (because it was not derived from a representative random sample) and the sample itself was not large.
Iris pigmentation as a quantitative trait: variation in populations of European, East Asian and South Asian ancestry and association with candidate gene polymorphisms.

PubMed

Edwards, Melissa; Cha, David; Krithika, S; Johnson, Monique; Cook, Gillian; Parra, Esteban J

2016-03-01

In this study, we present a new quantitative method to measure iris colour based on high-resolution photographs. We applied this method to analyse iris colour variation in a sample of individuals of East Asian, European and South Asian ancestry. We show that measuring iris colour using the coordinates of the CIELAB colour space uncovers a significant amount of variation that is not captured using conventional categorical classifications, such as 'brown', 'blue' or 'green'. We tested the association of a selected panel of polymorphisms with iris colour in each population group. Six markers showed significant associations with iris colour in the European sample, three in the South Asian sample and two in the East Asian sample. We also observed that the marker HERC2 rs12913832, which is the main determinant of 'blue' versus 'brown' iris colour in European populations, is also significantly associated with central heterochromia in the European sample. © 2015 The Authors. Pigment Cell & Melanoma Research Published by John Wiley & Sons Ltd.
Duration of Sleep and ADHD Tendency among Adolescents in China

ERIC Educational Resources Information Center

Lam, Lawrence T.; Yang, L.

2008-01-01

Objective: This study investigates the association between duration of sleep and ADHD tendency among adolescents. Method: This population-based health survey uses a two-stage random cluster sampling design. Participants ages 13 to 17 are recruited from the total population of adolescents attending high school in one city of China. Duration of…
Applications of Small Area Estimation to Generalization with Subclassification by Propensity Scores

ERIC Educational Resources Information Center

Chan, Wendy

2018-01-01

Policymakers have grown increasingly interested in how experimental results may generalize to a larger population. However, recently developed propensity score-based methods are limited by small sample sizes, where the experimental study is generalized to a population that is at least 20 times larger. This is particularly problematic for methods…
Oviposition traps to survey eggs of Lambdina fiscellaria (Lepidoptera: Geometridae).

PubMed

Hébert, Christian; Jobin, Luc; Auger, Michel; Dupont, Alain

2003-06-01

Outbreaks of the hemlock looper, Lambdina fiscellaria (Guenée), are characterized by rapid increase and patchy distribution over widespread areas, which make it difficult to detect impending outbreaks. This is a major problem with this insect. Population forecasting is based on tedious and expensive egg surveys in which eggs are extracted from 1-m branches; careful observation is needed to avoid counting old unhatched eggs of previous year populations. The efficacy of artificial substrates as oviposition traps to sample hemlock looper eggs was tested as a means of improving outbreak detection and population forecasting. A white polyurethane foam substrate (1,095 lb/ft3) used with the Luminoc insect trap, a portable light trap, was highly efficient in sampling eggs of the hemlock looper. Foam strips placed on tree trunks at breast height were less efficient but easier and less expensive to use for the establishment of extensive survey networks. Estimates based on oviposition traps were highly correlated with those obtained from the 1-m branch extraction method. The oviposition trap is a standard, inexpensive, easy, and robust method that can be used by nonspecialists. This technique makes it possible to sample higher numbers of plots in widespread monitoring networks, which is crucial for improving the management of hemlock looper populations.
A New Method to Separate Star-forming from AGN Galaxies at Intermediate Redshift: The Submillijansky Radio Population in the VLA-COSMOS Survey

NASA Astrophysics Data System (ADS)

Smolčić, V.; Schinnerer, E.; Scodeggio, M.; Franzetti, P.; Aussel, H.; Bondi, M.; Brusa, M.; Carilli, C. L.; Capak, P.; Charlot, S.; Ciliegi, P.; Ilbert, O.; Ivezić, Ž.; Jahnke, K.; McCracken, H. J.; Obrić, M.; Salvato, M.; Sanders, D. B.; Scoville, N.; Trump, J. R.; Tremonti, C.; Tasca, L.; Walcher, C. J.; Zamorani, G.

2008-07-01

We explore the properties of the submillijansky radio population at 20 cm by applying a newly developed optical color-based method to separate star-forming (SF) from active galactic nucleus (AGN) galaxies at intermediate redshifts (z ≲ 1.3). Although optical rest-frame colors are used, our separation method is shown to be efficient and not biased against dusty starburst galaxies. This classification method has been calibrated and tested on a local radio-selected optical sample. Given accurate multiband photometry and redshifts, it carries the potential to be generally applicable to any galaxy sample where SF and AGN galaxies are the two dominant populations. In order to quantify the properties of the submillijansky radio population, we have analyzed ~2,400 radio sources, detected at 20 cm in the VLA-COSMOS survey; 90% of these have submillijansky flux densities. We classify the objects into (1) star candidates, (2) quasi-stellar objects, (3) AGN, (4) SF, and (5) high-redshift (z > 1.3) galaxies. We find, for the composition of the submillijansky radio population, that SF galaxies are not the dominant population at submillijansky flux levels, as previously often assumed, but that they make up an approximately constant fraction of 30%-40% in the flux density range of ~50 μJy to 0.7 mJy. In summary, based on the entire VLA-COSMOS radio population at 20 cm, we find that the radio population at these flux densities is a mixture of roughly 30%-40% of SF and 50%-60% of AGN galaxies, with a minor contribution (~10%) of QSOs.
Estimation of Standard Error of Regression Effects in Latent Regression Models Using Binder's Linearization. Research Report. ETS RR-07-09

ERIC Educational Resources Information Center

Li, Deping; Oranje, Andreas

2007-01-01

Two versions of a general method for approximating standard error of regression effect estimates within an IRT-based latent regression model are compared. The general method is based on Binder's (1983) approach, accounting for complex samples and finite populations by Taylor series linearization. In contrast, the current National Assessment of…
Population entropies estimates of proteins

NASA Astrophysics Data System (ADS)

Low, Wai Yee

2017-05-01

The Shannon entropy equation provides a way to estimate variability of amino acid sequences in a multiple sequence alignment of proteins. Knowledge of protein variability is useful in many areas such as vaccine design, identification of antibody binding sites, and exploration of protein 3D structural properties. In cases where the population entropies of a protein are of interest but only a small sample size can be obtained, a method based on linear regression and random subsampling can be used to estimate the population entropy. This method is useful for comparisons of entropies where the actual sequence counts differ and thus, correction for alignment size bias is needed. In the current work, an R based package named EntropyCorrect that enables estimation of population entropy is presented and an empirical study on how well this new algorithm performs on simulated datasets of various combinations of population and sample sizes is discussed. The package is available at https://github.com/lloydlow/EntropyCorrect. This article, which was originally published online on 12 May 2017, contained an error in Eq. (1), where the summation sign was missing. The corrected equation appears in the Corrigendum attached to the pdf.
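As a reference point, the plain (uncorrected) Shannon entropy of each alignment column, H = -Σ p_i log2 p_i over the observed residues, can be computed as in the minimal sketch below. It is written in Python rather than R and does not reproduce the regression and subsampling correction of EntropyCorrect. The toy alignment and the choice to ignore gap characters are assumptions made for illustration.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of one alignment column, ignoring '-' gaps."""
    residues = [r for r in column if r != "-"]
    n = len(residues)
    if n == 0:
        return 0.0
    counts = Counter(residues)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def alignment_entropies(sequences):
    """Per-column entropies for a list of equal-length aligned sequences."""
    return [column_entropy(col) for col in zip(*sequences)]


# Hypothetical four-sequence alignment, four columns wide.
alignment = ["ACDE", "ACDF", "AGDE", "ACD-"]
entropies = alignment_entropies(alignment)
```

Fully conserved columns give an entropy of zero, and a column split evenly between two residues gives exactly one bit. Estimates from small alignments tend to be biased downward, which is the alignment size bias the abstract refers to.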
Assessment of the effect of population and diary sampling methods on estimation of school-age children exposure to fine particles.

PubMed

Che, W W; Frey, H Christopher; Lau, Alexis K H

2014-12-01

Population and diary sampling methods are employed in exposure models to sample simulated individuals and their daily activity on each simulation day. Different sampling methods may lead to variations in estimated human exposure. In this study, two population sampling methods (stratified-random and random-random) and three diary sampling methods (random resampling, diversity and autocorrelation, and Markov-chain cluster [MCC]) are evaluated. Their impacts on estimated children's exposure to ambient fine particulate matter (PM2.5) are quantified via case studies for children in Wake County, NC for July 2002. The estimated mean daily average exposure is 12.9 μg/m³ for simulated children using the stratified population sampling method, and 12.2 μg/m³ using the random sampling method. These minor differences are caused by the random sampling among ages within census tracts. Among the three diary sampling methods, there are differences in the estimated number of individuals with multiple days of exposures exceeding a benchmark of concern of 25 μg/m³ due to differences in how multiday longitudinal diaries are estimated. The MCC method is relatively more conservative. In case studies evaluated here, the MCC method led to 10% higher estimation of the number of individuals with repeated exposures exceeding the benchmark. The comparisons help to identify and contrast the capabilities of each method and to offer insight regarding implications of method choice. Exposure simulation results are robust to the two population sampling methods evaluated, and are sensitive to the choice of method for simulating longitudinal diaries, particularly when analyzing results for specific microenvironments or for exposures exceeding a benchmark of concern. © 2014 Society for Risk Analysis.
Genomic scan as a tool for assessing the genetic component of phenotypic variance in wild populations.

PubMed

Herrera, Carlos M

2012-01-01

Methods for estimating quantitative trait heritability in wild populations have been developed in recent years which take advantage of the increased availability of genetic markers to reconstruct pedigrees or estimate relatedness between individuals, but their application to real-world data is not exempt from difficulties. This chapter describes a recent marker-based technique which, by adopting a genomic scan approach and focusing on the relationship between phenotypes and genotypes at the individual level, avoids the problems inherent to marker-based estimators of relatedness. This method allows the quantification of the genetic component of phenotypic variance ("degree of genetic determination" or "heritability in the broad sense") in wild populations and is applicable whenever phenotypic trait values and multilocus data for a large number of genetic markers (e.g., amplified fragment length polymorphisms, AFLPs) are simultaneously available for a sample of individuals from the same population. The method proceeds by first identifying those markers whose variation across individuals is significantly correlated with individual phenotypic differences ("adaptive loci"). The proportion of phenotypic variance in the sample that is statistically accounted for by individual differences in adaptive loci is then estimated by fitting a linear model to the data, with trait value as the dependent variable and scores of adaptive loci as independent ones. The method can be easily extended to accommodate quantitative or qualitative information on biologically relevant features of the environment experienced by each sampled individual, in which case estimates of the environmental and genotype × environment components of phenotypic variance can also be obtained.
HIV Research with Men who Have Sex with Men (MSM): Advantages and Challenges of Different Methods for Most Appropriately Targeting a Key Population

PubMed Central

Gama, Ana; Martins, Maria O.; Dias, Sónia

2017-01-01

The difficulty in accessing hard-to-reach populations such as men who have sex with men presents a dilemma for HIV surveillance, as their omission from surveillance systems leaves significant gaps in our understanding of HIV/AIDS epidemics. Several methods for recruiting difficult-to-access populations and collecting data on trends of HIV prevalence and behavioural factors for surveillance and research purposes have emerged. This paper aims to critically review different sampling approaches, from chain-referral and venue-based to respondent-driven, time-location and internet sampling methods, focusing on their main advantages and challenges for conducting HIV research among key populations, such as men who have sex with men. The benefits of using these approaches to recruit participants must be weighed against privacy concerns inherent in any social situation or health condition. Nevertheless, the methods discussed in this paper represent some of the best efforts to effectively reach most-at-risk subgroups of men who have sex with men, helping to obtain unbiased trends of HIV prevalence and HIV-related risk behaviours among this population group. PMID:29546214

A Third Moment Adjusted Test Statistic for Small Sample Factor Analysis.

PubMed

Lin, Johnny; Bentler, Peter M

2012-01-01

Goodness of fit testing in factor analysis is based on the assumption that the test statistic is asymptotically chi-square; but this property may not hold in small samples even when the factors and errors are normally distributed in the population. Robust methods such as Browne's asymptotically distribution-free method and the Satorra-Bentler mean scaling statistic were developed under the presumption of non-normality in the factors and errors. This paper finds new application to the case where factors and errors are normally distributed in the population but the skewness of the obtained test statistic is still high due to sampling error in the observed indicators. An extension of the Satorra-Bentler statistic is proposed that not only scales the mean but also adjusts the degrees of freedom based on the skewness of the obtained test statistic in order to improve its robustness under small samples. A simple simulation study shows that this third moment adjusted statistic asymptotically performs on par with previously proposed methods, and at a very small sample size offers superior Type I error rates under a properly specified model. Data from Mardia, Kent and Bibby's study of students tested for their ability in five content areas that were either open or closed book were used to illustrate the real-world performance of this statistic.
Analysis of genetic population structure in Acacia caven (Leguminosae, Mimosoideae), comparing one exploratory and two Bayesian-model-based methods.

PubMed

Pometti, Carolina L; Bessega, Cecilia F; Saidman, Beatriz O; Vilardi, Juan C

2014-03-01

Bayesian clustering as implemented in STRUCTURE or GENELAND software is widely used to form genetic groups of populations or individuals. On the other hand, in order to satisfy the need for less computer-intensive approaches, multivariate analyses are specifically devoted to extracting information from large datasets. In this paper, we report the use of a dataset of AFLP markers belonging to 15 sampling sites of Acacia caven for studying the genetic structure and comparing the consistency of three methods: STRUCTURE, GENELAND and DAPC. Of these methods, DAPC was the fastest one and showed accuracy in inferring the K number of populations (K = 12 using the find.clusters option and K = 15 with a priori information of populations). GENELAND in turn, provides information on the area of membership probabilities for individuals or populations in the space, when coordinates are specified (K = 12). STRUCTURE also inferred the number of K populations and the membership probabilities of individuals based on ancestry, presenting the result K = 11 without prior information of populations and K = 15 using the LOCPRIOR option. Finally, in this work all three methods showed high consistency in estimating the population structure, inferring similar numbers of populations and the membership probabilities of individuals to each group, with a high correlation between each other.
Individualized head-related transfer functions based on population grouping.

PubMed

Xu, Song; Li, Zhizhong; Salvendy, Gavriel

2008-11-01

A method is proposed to divide a population into different groups for partial individualization of head-related transfer functions (HRTFs). Borrowing the basic idea in sizing system design, factor analysis is used to identify the most representative measurements which are then in a case study used to group the population. The comparison between the group mean HRTFs and the population mean HRTFs shows that the group mean HRTFs could greatly reduce spectral distortion at most sampled positions.
Methods for sampling geographically mobile female traders in an East African market setting

PubMed Central

Achiro, Lillian; Kwena, Zachary A.; McFarland, Willi; Neilands, Torsten B.; Cohen, Craig R.; Bukusi, Elizabeth A.; Camlin, Carol S.

2018-01-01

Background: The role of migration in the spread of HIV in sub-Saharan Africa is well-documented. Yet migration and HIV research have often focused on HIV risks to male migrants and their partners, or migrants overall, often failing to measure the risks to women via their direct involvement in migration. Inconsistent measures of mobility, gender biases in those measures, and limited data sources for sex-specific population-based estimates of mobility have contributed to a paucity of research on the HIV prevention and care needs of migrant and highly mobile women. This study addresses an urgent need for novel methods for developing probability-based, systematic samples of highly mobile women, focusing on a population of female traders operating out of one of the largest open air markets in East Africa. Our method involves three stages: (1) identification and mapping of all market stall locations using Global Positioning System (GPS) coordinates; (2) using female market vendor stall GPS coordinates to build the sampling frame using replicates; and (3) using maps and GPS data for recruitment of study participants. Results: The locations of 6,390 vendor stalls were mapped using GPS. Of these, 4,064 stalls occupied by women (63.6%) were used to draw four replicates of 128 stalls each, and a fifth replicate of 15 pre-selected random alternates for a total of 527 stalls assigned to one of five replicates. Staff visited 323 stalls from the first three replicates and from these successfully recruited 306 female vendors into the study for a participation rate of 94.7%. Mobilization strategies and involving traders association representatives in participant recruitment were critical to the study’s success. Conclusion: The study’s high participation rate suggests that this geospatial sampling method holds promise for development of probability-based samples in other settings that serve as transport hubs for highly mobile populations. PMID:29324780
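The replicate structure reported in this abstract can be sketched in a few lines. The abstract does not say how stalls were assigned to replicates, so the sketch assumes simple random sampling without replacement from the frame of 4,064 stalls. The integer stall identifiers and the function name draw_replicates are hypothetical.

```python
import random


def draw_replicates(frame, sizes, seed=0):
    """Split one random sample (without replacement) into consecutive replicates."""
    rng = random.Random(seed)
    selected = rng.sample(frame, sum(sizes))  # no stall is drawn twice
    replicates = []
    start = 0
    for size in sizes:
        replicates.append(selected[start:start + size])
        start += size
    return replicates


# Hypothetical frame: one integer ID per stall occupied by a woman.
stall_ids = list(range(1, 4065))
replicates = draw_replicates(stall_ids, [128, 128, 128, 128, 15])

total_stalls = sum(len(r) for r in replicates)    # 4 * 128 + 15
share_female = 4064 / 6390                        # share of mapped stalls in the frame
participation_rate = 306 / 323                    # recruited / visited
```

Because random.sample returns the selection in random order, each consecutive slice is itself a simple random sample of the frame, so replicates can be released one at a time until the target sample size is reached.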
Multiple Imputation in Two-Stage Cluster Samples Using The Weighted Finite Population Bayesian Bootstrap.

PubMed

Zhou, Hanzhi; Elliott, Michael R; Raghunathan, Trivellore E

2016-06-01

Multistage sampling is often employed in survey samples for cost and convenience. However, accounting for clustering features when generating datasets for multiple imputation is a nontrivial task, particularly when, as is often the case, cluster sampling is accompanied by unequal probabilities of selection, necessitating case weights. Thus, multiple imputation often ignores complex sample designs and assumes simple random sampling when generating imputations, even though failing to account for complex sample design features is known to yield biased estimates and confidence intervals that have incorrect nominal coverage. In this article, we extend a recently developed, weighted, finite-population Bayesian bootstrap procedure to generate synthetic populations conditional on complex sample design data that can be treated as simple random samples at the imputation stage, obviating the need to directly model design features for imputation. We develop two forms of this method: one where the probabilities of selection are known at the first and second stages of the design, and the other, more common in public use files, where only the final weight based on the product of the two probabilities is known. We show that this method has advantages in terms of bias, mean square error, and coverage properties over methods where sample designs are ignored, with little loss in efficiency, even when compared with correct fully parametric models. An application is made using the National Automotive Sampling System Crashworthiness Data System, a multistage, unequal probability sample of U.S. passenger vehicle crashes, which suffers from a substantial amount of missing data in "Delta-V," a key crash severity measure.
Multiple Imputation in Two-Stage Cluster Samples Using The Weighted Finite Population Bayesian Bootstrap

PubMed Central

Zhou, Hanzhi; Elliott, Michael R.; Raghunathan, Trivellore E.

2017-01-01

Multistage sampling is often employed in survey samples for cost and convenience. However, accounting for clustering features when generating datasets for multiple imputation is a nontrivial task, particularly when, as is often the case, cluster sampling is accompanied by unequal probabilities of selection, necessitating case weights. Thus, multiple imputation often ignores complex sample designs and assumes simple random sampling when generating imputations, even though failing to account for complex sample design features is known to yield biased estimates and confidence intervals that have incorrect nominal coverage. In this article, we extend a recently developed, weighted, finite-population Bayesian bootstrap procedure to generate synthetic populations conditional on complex sample design data that can be treated as simple random samples at the imputation stage, obviating the need to directly model design features for imputation. We develop two forms of this method: one where the probabilities of selection are known at the first and second stages of the design, and the other, more common in public use files, where only the final weight based on the product of the two probabilities is known. We show that this method has advantages in terms of bias, mean square error, and coverage properties over methods where sample designs are ignored, with little loss in efficiency, even when compared with correct fully parametric models. An application is made using the National Automotive Sampling System Crashworthiness Data System, a multistage, unequal probability sample of U.S. passenger vehicle crashes, which suffers from a substantial amount of missing data in “Delta-V,” a key crash severity measure. PMID:29226161
flowVS: channel-specific variance stabilization in flow cytometry.

PubMed

Azad, Ariful; Rajwa, Bartek; Pothen, Alex

2016-07-28

Comparing phenotypes of heterogeneous cell populations from multiple biological conditions is at the heart of scientific discovery based on flow cytometry (FC). When the biological signal is measured by the average expression of a biomarker, standard statistical methods require that variance be approximately stabilized in populations to be compared. Since the mean and variance of a cell population are often correlated in fluorescence-based FC measurements, a preprocessing step is needed to stabilize the within-population variances. We present a variance-stabilization algorithm, called flowVS, that removes the mean-variance correlations from cell populations identified in each fluorescence channel. flowVS transforms each channel from all samples of a data set by the inverse hyperbolic sine (asinh) transformation. For each channel, the parameters of the transformation are optimally selected by Bartlett's likelihood-ratio test so that the populations attain homogeneous variances. The optimum parameters are then used to transform the corresponding channels in every sample. flowVS is therefore an explicit variance-stabilization method that stabilizes within-population variances in each channel by evaluating the homoskedasticity of clusters with a likelihood-ratio test. With two publicly available datasets, we show that flowVS removes the mean-variance dependence from raw FC data and makes the within-population variance relatively homogeneous. We demonstrate that alternative transformation techniques such as flowTrans, flowScape, logicle, and FCSTrans might not stabilize variance. Besides flow cytometry, flowVS can also be applied to stabilize variance in microarray data. With a publicly available data set we demonstrate that flowVS performs as well as the VSN software, a state-of-the-art approach developed for microarrays. 
The homogeneity of variance in cell populations across FC samples is desirable when extracting features uniformly and comparing cell populations with different levels of marker expressions. The newly developed flowVS algorithm solves the variance-stabilization problem in FC and microarrays by optimally transforming data with the help of Bartlett's likelihood-ratio test. On two publicly available FC datasets, flowVS stabilizes within-population variances more evenly than the available transformation and normalization techniques. flowVS-based variance stabilization can help in performing comparison and alignment of phenotypically identical cell populations across different samples. flowVS and the datasets used in this paper are publicly available in Bioconductor.
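The parameter search described above can be sketched in a few lines. This is a minimal illustration that assumes the transformation has the form asinh(x / c) with a single cofactor c per channel and that candidate cofactors come from a small grid. The function names are hypothetical and this is not the Bioconductor package's API.

```python
import math
import statistics


def bartlett_statistic(groups):
    """Bartlett's test statistic for equal variances across groups."""
    k = len(groups)
    sizes = [len(g) for g in groups]
    variances = [statistics.variance(g) for g in groups]
    total = sum(sizes)
    pooled = sum((n - 1) * v for n, v in zip(sizes, variances)) / (total - k)
    numerator = (total - k) * math.log(pooled) - sum(
        (n - 1) * math.log(v) for n, v in zip(sizes, variances)
    )
    correction = 1 + (sum(1 / (n - 1) for n in sizes) - 1 / (total - k)) / (3 * (k - 1))
    return numerator / correction


def asinh_transform(values, cofactor):
    """Apply asinh(x / cofactor) to every value (assumed parametrization)."""
    return [math.asinh(v / cofactor) for v in values]


def best_cofactor(groups, candidates):
    """Return the candidate cofactor giving the smallest Bartlett statistic."""
    scored = [
        (bartlett_statistic([asinh_transform(g, c) for g in groups]), c)
        for c in candidates
    ]
    return min(scored)[1]


# Two populations whose spread grows with their mean (made-up numbers).
populations = [[90, 100, 110, 95, 105], [900, 1000, 1100, 950, 1050]]
chosen = best_cofactor(populations, [1, 100, 10000])
```

A smaller statistic means the transformed populations have more similar variances, which is the criterion the abstract describes.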
A new method for estimating the demographic history from DNA sequences: an importance sampling approach

PubMed Central

Ait Kaci Azzou, Sadoune; Larribe, Fabrice; Froda, Sorana

2015-01-01

The effective population size over time (demographic history) can be retraced from a sample of contemporary DNA sequences. In this paper, we propose a novel methodology based on importance sampling (IS) for exploring such demographic histories. Our starting point is the generalized skyline plot, with the main difference being that our procedure, the skywis plot, uses a large number of genealogies. The information provided by these genealogies is combined according to the IS weights. Thus, we compute a weighted average of the effective population sizes on specific time intervals (epochs), where the genealogies that agree more with the data are given more weight. We illustrate by a simulation study that the skywis plot correctly reconstructs the recent demographic history under the scenarios most commonly considered in the literature. In particular, our method can capture a change point in the effective population size, and its overall performance is comparable with that of the Bayesian skyline plot. We also introduce the case of serially sampled sequences and illustrate that it is possible to improve the performance of the skywis plot in the case of an exponential expansion of the effective population size. PMID:26300910
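The weighted-average idea can be sketched with the classic skyline estimator, in which an interval of length w with k lineages gives the estimate C(k, 2) * w when time is scaled so that the coalescence rate is C(k, 2) / N. The sketch below averages per coalescent interval rather than on a common grid of epochs, which is a simplification, and it treats the importance weights as given. All names are hypothetical.

```python
from math import comb


def classic_skyline(interval_lengths):
    """Per-interval effective-size estimates for one genealogy.

    interval_lengths[i] is the time during which the genealogy had n - i
    lineages, where n = len(interval_lengths) + 1 sampled sequences.
    """
    n = len(interval_lengths) + 1
    return [comb(n - i, 2) * w for i, w in enumerate(interval_lengths)]


def weighted_skyline(genealogies, weights):
    """Average skyline estimates over genealogies using importance weights."""
    total = sum(weights)
    estimates = [classic_skyline(g) for g in genealogies]
    return [
        sum(w * est[i] for w, est in zip(weights, estimates)) / total
        for i in range(len(estimates[0]))
    ]


# Two made-up genealogies of three sequences, the first weighted three times more.
combined = weighted_skyline([[1.0, 2.0], [2.0, 4.0]], [3, 1])
```

Genealogies with larger weights pull the combined estimate toward their own interval estimates, which mirrors the statement that genealogies agreeing more with the data count for more.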
Hepatitis E Virus in Wild Boar in Northwest Poland: Sensitivity of Methods of Detection.

PubMed

Dorn-In, Samart; Schwaiger, Karin; Twarużek, Magdalena; Grajewski, Jan; Gottschalk, Christoph; Gareis, Manfred

2017-02-01

In northwest Poland, 163 blood and 53 fecal samples of wild boars were collected in winter 2012/13 and 2013/14. All blood samples were tested for the presence of hepatitis E virus (HEV) ribonucleic acid (RNA) by two reverse transcription-polymerase chain reaction (RT-PCR) based methods and by anti-HEV IgG enzyme-linked immunosorbent assay (ELISA). About 17.2% of blood samples were seropositive. One-step nested RT-PCR turned out to be too insensitive (11.6% were positive). Therefore a two-step nested RT-PCR was applied, in which 25.8% of the blood samples tested positive for HEV RNA. About 50.0% of blood samples positive in ELISA were also positive in two-step nested RT-PCR. The prevalence of HEV RNA in feces was 9.4%. Based on the results of blood (ELISA, PCR) and fecal (PCR) tests, the overall prevalence of HEV in wild boars in northwest Poland was 36.8%. There was no correlation between the ELISA results and the presence of HEV RNA in plasma or in feces. According to the sequencing results of 348 bp PCR products of HEV, four different subtypes were identified. Reports on the prevalence of HEV in wild boar populations vary because of the different sensitivities of the detection methods. However, this study reveals, based on a highly sensitive method, that HEV is widespread in wild boar populations in the northwestern region of Poland and poses a potential risk to consumers of game meat.
Reference Intervals of Common Clinical Chemistry Analytes for Adults in Hong Kong.

PubMed

Lo, Y C; Armbruster, David A

2012-04-01

Defining reference intervals is a major challenge because of the difficulty in recruiting volunteers to participate and testing samples from a significant number of healthy reference individuals. Historical literature citation intervals are often suboptimal because they are based on obsolete methods and/or only a small number of poorly defined reference samples. Blood donors in Hong Kong gave permission for additional blood to be collected for reference interval testing. The samples were tested for twenty-five routine analytes on the Abbott ARCHITECT clinical chemistry system. Results were analyzed using the Rhoads EP evaluator software program, which is based on the CLSI/IFCC C28-A guideline, and defines the reference interval as the 95% central range. Method specific reference intervals were established for twenty-five common clinical chemistry analytes for a Chinese ethnic population. The intervals were defined for each gender separately and for genders combined. Gender specific or combined gender intervals were adapted as appropriate for each analyte. A large number of healthy, apparently normal blood donors from a local ethnic population were tested to provide current reference intervals for a new clinical chemistry system. Intervals were determined following an accepted international guideline. Laboratories using the same or similar methodologies may adapt these intervals if validated and deemed suitable for their patient population. Laboratories using different methodologies may be able to successfully adapt the intervals for their facilities using the reference interval transference technique based on a method comparison study.
Adaptive sampling in behavioral surveys.

PubMed

Thompson, S K

1997-01-01

Studies of populations such as drug users encounter difficulties because the members of the populations are rare, hidden, or hard to reach. Conventionally designed large-scale surveys detect relatively few members of the populations so that estimates of population characteristics have high uncertainty. Ethnographic studies, on the other hand, reach suitable numbers of individuals only through the use of link-tracing, chain referral, or snowball sampling procedures that often leave the investigators unable to make inferences from their sample to the hidden population as a whole. In adaptive sampling, the procedure for selecting people or other units to be in the sample depends on variables of interest observed during the survey, so the design adapts to the population as encountered. For example, when self-reported drug use is found among members of the sample, sampling effort may be increased in nearby areas. Types of adaptive sampling designs include ordinary sequential sampling, adaptive allocation in stratified sampling, adaptive cluster sampling, and optimal model-based designs. Graph sampling refers to situations with nodes (for example, people) connected by edges (such as social links or geographic proximity). An initial sample of nodes or edges is selected and edges are subsequently followed to bring other nodes into the sample. Graph sampling designs include network sampling, snowball sampling, link-tracing, chain referral, and adaptive cluster sampling. A graph sampling design is adaptive if the decision to include linked nodes depends on variables of interest observed on nodes already in the sample. Adjustment methods for nonsampling errors such as imperfect detection of drug users in the sample apply to adaptive as well as conventional designs.
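Adaptive cluster sampling, one of the designs listed above, can be sketched on a grid of areas. The sketch assumes a rectangular grid of counts, a four-neighbor notion of "nearby", and a simple threshold as the condition that triggers extra sampling. These choices and the function name are illustrative assumptions, not details from the article.

```python
def adaptive_cluster_sample(grid, initial_cells, threshold=1):
    """Return the set of (row, col) cells visited by adaptive cluster sampling.

    grid          -- 2D list of observed counts per cell (e.g. reported cases)
    initial_cells -- cells chosen by the conventional first-stage sample
    threshold     -- a cell at or above this count triggers sampling of neighbors
    """
    rows, cols = len(grid), len(grid[0])
    sampled = set()
    frontier = list(initial_cells)
    while frontier:
        r, c = frontier.pop()
        if (r, c) in sampled:
            continue
        sampled.add((r, c))
        if grid[r][c] >= threshold:
            # The design adapts: effort is increased around cells of interest.
            for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                nr, nc = r + dr, c + dc
                if 0 <= nr < rows and 0 <= nc < cols and (nr, nc) not in sampled:
                    frontier.append((nr, nc))
    return sampled


counts = [
    [0, 0, 0, 0],
    [0, 2, 3, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 0],
]
visited = adaptive_cluster_sample(counts, [(1, 1), (3, 0)])
```

Starting from two initial cells, the sample grows to cover the whole cluster of nonzero cells and its edge, while the empty initial cell adds nothing further. Unbiased estimation from such a sample needs design-specific estimators, which this sketch does not cover.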
The Factor Structure of ADHD in a General Population of Primary School Children

ERIC Educational Resources Information Center

Ullebo, Anne Karin; Breivik, Kyrre; Gillberg, Christopher; Lundervold, Astri J.; Posserud, Maj-Britt

2012-01-01

Objective: To examine whether a bifactor model with a general ADHD factor and domain specific factors of inattention, hyperactivity and impulsivity was supported in a large general population sample of children. We also explored the utility of forming subscales based on the domain-specific factors. Methods: Child mental health questionnaires were…
Genetic demographic networks: Mathematical model and applications.

PubMed

Kimmel, Marek; Wojdyła, Tomasz

2016-10-01

Recent improvement in the quality of genetic data obtained from extinct human populations and their ancestors encourages searching for answers to basic questions regarding human population history. The most common and successful are model-based approaches, in which genetic data are compared to the data obtained from the assumed demography model. Using such an approach, it is possible to either validate or adjust the assumed demography. Model fit to data can be obtained based on reverse-time coalescent simulations or forward-time simulations. In this paper we introduce a computational method based on a mathematical equation that allows obtaining joint distributions of pairs of individuals under a specified demography model, each of them characterized by a genetic variant at a chosen locus. The two individuals are randomly sampled from either the same or two different populations. The model assumes three types of demographic events (split, merge and migration). Populations evolve according to the time-continuous Moran model with drift and Markov-process mutation. This latter process is described by the Lyapunov-type equation introduced by O'Brien and generalized in our previous works. Application of this equation constitutes an original contribution. In the results section of the paper we present sample applications of our model to both simulated and literature-based demographies. Among others, we include a study of the Slavs-Balts-Finns genetic relationship, in which we model split and migrations between the Balts and Slavs. We also include another example that involves the migration rates between farmers and hunter-gatherers, based on modern and ancient DNA samples. This latter process was previously studied using coalescent simulations. Our results are in general agreement with the previous method, which provides validation of our approach. 
Although our model is not an alternative to simulation methods in the practical sense, it provides an algorithm to compute pairwise distributions of alleles, in the case of haploid non-recombining loci such as mitochondrial and Y-chromosome loci in humans. Copyright © 2016 Elsevier Inc. All rights reserved.
Estimating the Size of the Methamphetamine-Using Population in New York City Using Network Sampling Techniques.

PubMed

Dombrowski, Kirk; Khan, Bilal; Wendel, Travis; McLean, Katherine; Misshula, Evan; Curtis, Ric

2012-12-01

As part of a recent study of the dynamics of the retail market for methamphetamine use in New York City, we used network sampling methods to estimate the size of the total networked population. This process involved sampling from respondents' list of co-use contacts, which in turn became the basis for capture-recapture estimation. Recapture sampling was based on links to other respondents derived from demographic and "telefunken" matching procedures, the latter being an anonymized version of telephone number matching. This paper describes the matching process used to discover the links between the solicited contacts and project respondents, the capture-recapture calculation, the estimation of "false matches", and the development of confidence intervals for the final population estimates. A final population of 12,229 was estimated, with a range of 8,235 to 23,750. The techniques described here have the special virtue of deriving an estimate for a hidden population while retaining respondent anonymity and the anonymity of network alters, but likely require a larger sample size than the 132 persons interviewed to attain acceptable confidence levels for the estimate.
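For readers unfamiliar with capture-recapture estimation, the basic two-sample calculation can be sketched as follows. These are the textbook Lincoln-Petersen and Chapman estimators, shown only as background. The study's own calculation, including its false-match adjustment, is not reproduced here, and the input numbers are made up.

```python
def lincoln_petersen(first_sample, second_sample, recaptured):
    """Classic two-sample population size estimate."""
    if recaptured == 0:
        raise ValueError("no recaptures: the estimate is undefined")
    return first_sample * second_sample / recaptured


def chapman(first_sample, second_sample, recaptured):
    """Chapman's variant, which stays finite when there are no recaptures."""
    return (first_sample + 1) * (second_sample + 1) / (recaptured + 1) - 1


# Made-up counts: 100 people in the first list, 80 in the second, 20 in both.
naive_estimate = lincoln_petersen(100, 80, 20)
adjusted_estimate = chapman(100, 80, 20)
```

The fewer matches found between the two lists, the larger the implied hidden population, which is why false matches need to be accounted for before the estimate is trusted.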
Simulated fissioning of uranium and testing of the fission-track dating method

USGS Publications Warehouse

McGee, V.E.; Johnson, N.M.; Naeser, C.W.

1985-01-01

A computer program (FTD-SIM) faithfully simulates the fissioning of 238U with time and 235U with neutron dose. The simulation is based on first principles of physics where the fissioning of 238U with the flux of time is described by Ns = λf · 238U · t and the fissioning of 235U with the fluence of neutrons is described by Ni = σ · 235U · φ. The Poisson law is used to set the stochastic variation of fissioning within the uranium population. The life history of a given crystal can thus be traced under an infinite variety of age and irradiation conditions. A single dating attempt or up to 500 dating attempts on a given crystal population can be simulated by specifying the age of the crystal population, the size and variation in the areas to be counted, the amount and distribution of uranium, the neutron dose to be used and its variation, and the desired ratio of 238U to 235U. A variety of probability distributions can be applied to uranium and counting-area. The Price and Walker age equation is used to estimate age. The output of FTD-SIM includes the tabulated results of each individual dating attempt (sample) on demand and/or the summary statistics and histograms for multiple dating attempts (samples) including the sampling age. An analysis of the results from FTD-SIM shows that: (1) The external detector method is intrinsically more precise than the population method. (2) For the external detector method a correlation between spontaneous track count, Ns, and induced track count, Ni, results when the population of grains has a stochastic uranium content and/or when the counting areas between grains are stochastic. For the population method no correlation can exist. (3) In the external detector method the sampling distribution of age is independent of the number of grains counted. In the population method the sampling distribution of age is highly dependent on the number of grains counted. 
(4) Grains with zero-track counts, either in Ns or Ni, are an integral part of fissioning theory and under certain circumstances must be included in any estimate of age. (5) In estimating standard error of age the standard error of Ns and Ni and φ must be accurately estimated and propagated through the age equation. Several statistical models are presently available to do so. © 1985.
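The two expectation formulas and the Poisson step can be sketched as below. This is a minimal illustration and not a reconstruction of FTD-SIM: it counts fission events rather than tracks per counted area, it does not implement the age equation, and the numerical constants in the example are illustrative placeholders rather than recommended values.

```python
import random


def poisson_draw(mean, rng):
    """Poisson variate: count unit-rate arrivals falling within [0, mean]."""
    count = 0
    elapsed = rng.expovariate(1.0)
    while elapsed <= mean:
        count += 1
        elapsed += rng.expovariate(1.0)
    return count


def expected_spontaneous(lambda_f, u238_atoms, age_years):
    """Ns = lambda_f * 238U * t (expected spontaneous fissions)."""
    return lambda_f * u238_atoms * age_years


def expected_induced(sigma, u235_atoms, fluence):
    """Ni = sigma * 235U * phi (expected induced fissions)."""
    return sigma * u235_atoms * fluence


def simulate_grain(lambda_f, sigma, u238_atoms, u235_atoms, age_years, fluence, rng):
    """One simulated dating attempt on a single grain: (Ns, Ni) counts."""
    ns = poisson_draw(expected_spontaneous(lambda_f, u238_atoms, age_years), rng)
    ni = poisson_draw(expected_induced(sigma, u235_atoms, fluence), rng)
    return ns, ni


# Illustrative inputs only (per-year decay constant, cm^2 cross-section, n/cm^2 fluence).
ns, ni = simulate_grain(8.46e-17, 5.8e-22, 1e12, 7.25e9, 1e7, 1e15, random.Random(42))
```

Repeating `simulate_grain` many times gives an empirical sampling distribution for the Ns/Ni ratio, which is the kind of output the abstract summarizes with histograms.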
Population-based validation of a German version of the Brief Resilience Scale

PubMed Central

Wenzel, Mario; Stieglitz, Rolf-Dieter; Kunzler, Angela; Bagusat, Christiana; Helmreich, Isabella; Gerlicher, Anna; Kampa, Miriam; Kubiak, Thomas; Kalisch, Raffael; Lieb, Klaus; Tüscher, Oliver

2018-01-01

Smith and colleagues developed the Brief Resilience Scale (BRS) to assess the individual ability to recover from stress despite significant adversity. This study aimed to validate the German version of the BRS. We used data from a population-based (sample 1: n = 1,481) and a representative (sample 2: n = 1,128) sample of participants from the German general population (age ≥ 18) to assess reliability and validity. Confirmatory factor analyses (CFA) were conducted to compare one- and two-factorial models from previous studies with a method-factor model which especially accounts for the wording of the items. Reliability was analyzed. Convergent validity was measured by correlating BRS scores with mental health measures, coping, social support, and optimism. Reliability was good (α = .85, ω = .85 for both samples). The method-factor model showed excellent model fit (sample 1: χ2/df = 7.544; RMSEA = .07; CFI = .99; SRMR = .02; sample 2: χ2/df = 1.166; RMSEA = .01; CFI = 1.00; SRMR = .01) which was significantly better than the one-factor model (Δχ2(4) = 172.71, p < .001) or the two-factor model (Δχ2(3) = 31.16, p < .001). The BRS was positively correlated with well-being, social support, optimism, and the coping strategies active coping, positive reframing, acceptance, and humor. It was negatively correlated with somatic symptoms, anxiety and insomnia, social dysfunction, depression, and the coping strategies religion, denial, venting, substance use, and self-blame. To conclude, our results provide evidence for the reliability and validity of the German adaptation of the BRS as well as the unidimensional structure of the scale once method effects are accounted for. PMID:29438435
Systematic Review of the Use of Online Questionnaires among the Geriatric Population

PubMed Central

Remillard, Meegan L.; Mazor, Kathleen M.; Cutrona, Sarah L.; Gurwitz, Jerry H.; Tjia, Jennifer

2014-01-01

Background/Objectives The use of internet-based questionnaires to collect information from older adults is not well established. This systematic literature review of studies using online questionnaires in older adult populations aims to 1. describe methodologic approaches to population targeting and sampling and 2. summarize limitations of Internet-based questionnaires in geriatric populations. Design, Setting, Participants We identified English language articles using search terms for geriatric, age 65 and over, Internet survey, online survey, Internet questionnaire, and online questionnaire in PubMed and EBSCO host between 1984 and July 2012. Inclusion criteria were: study population mean age ≥65 years old and use of an online questionnaire for research. Review of 336 abstracts yielded 14 articles for full review by 2 investigators; 11 articles met inclusion criteria. Measurements Articles were extracted for study design and setting, patient characteristics, recruitment strategy, country, and study limitations. Results Eleven (11) articles were published after 2001. Studies had populations with a mean age of 65 to 78 years, included descriptive and analytical designs, and were conducted in the United States, Australia, and Japan. Recruiting methods varied widely from paper fliers and personal emails to use of consumer marketing panels. Investigator-reported study limitations included the use of small convenience samples and limited generalizability. Conclusion Online questionnaires are a feasible method of surveying older adults in some geographic regions and for some subsets of older adults, but limited Internet access constrains recruiting methods and often limits study generalizability. PMID:24635138
Coalescent Inference Using Serially Sampled, High-Throughput Sequencing Data from Intrahost HIV Infection

PubMed Central

Dialdestoro, Kevin; Sibbesen, Jonas Andreas; Maretty, Lasse; Raghwani, Jayna; Gall, Astrid; Kellam, Paul; Pybus, Oliver G.; Hein, Jotun; Jenkins, Paul A.

2016-01-01

Human immunodeficiency virus (HIV) is a rapidly evolving pathogen that causes chronic infections, so genetic diversity within a single infection can be very high. High-throughput “deep” sequencing can now measure this diversity in unprecedented detail, particularly since it can be performed at different time points during an infection, and this offers a potentially powerful way to infer the evolutionary dynamics of the intrahost viral population. However, population genomic inference from HIV sequence data is challenging because of high rates of mutation and recombination, rapid demographic changes, and ongoing selective pressures. In this article we develop a new method for inference using HIV deep sequencing data, using an approach based on importance sampling of ancestral recombination graphs under a multilocus coalescent model. The approach further extends recent progress in the approximation of so-called conditional sampling distributions, a quantity of key interest when approximating coalescent likelihoods. The chief novelties of our method are that it is able to infer rates of recombination and mutation, as well as the effective population size, while handling sampling over different time points and missing data without extra computational difficulty. We apply our method to a data set of HIV-1, in which several hundred sequences were obtained from an infected individual at seven time points over 2 years. We find mutation rate and effective population size estimates to be comparable to those produced by the software BEAST. Additionally, our method is able to produce local recombination rate estimates. The software underlying our method, Coalescenator, is freely available. PMID:26857628
Inferring population history with DIY ABC: a user-friendly approach to approximate Bayesian computation.

PubMed

Cornuet, Jean-Marie; Santos, Filipe; Beaumont, Mark A; Robert, Christian P; Marin, Jean-Michel; Balding, David J; Guillemaud, Thomas; Estoup, Arnaud

2008-12-01

Genetic data obtained on population samples convey information about their evolutionary history. Inference methods can extract part of this information but they require sophisticated statistical techniques that have been made available to the biologist community (through computer programs) only for simple and standard situations typically involving a small number of samples. We propose here a computer program (DIY ABC) for inference based on approximate Bayesian computation (ABC), in which scenarios can be customized by the user to fit many complex situations involving any number of populations and samples. Such scenarios involve any combination of population divergences, admixtures and population size changes. DIY ABC can be used to compare competing scenarios, estimate parameters for one or more scenarios and compute bias and precision measures for a given scenario and known values of parameters (the current version applies to unlinked microsatellite data). This article describes key methods used in the program and provides its main features. The analysis of one simulated and one real dataset, both with complex evolutionary scenarios, illustrates the main possibilities of DIY ABC. The software DIY ABC is freely available at http://www.montpellier.inra.fr/CBGP/diyabc.
Gaussian process-based Bayesian nonparametric inference of population size trajectories from gene genealogies.

PubMed

Palacios, Julia A; Minin, Vladimir N

2013-03-01

Changes in population size influence genetic diversity of the population and, as a result, leave a signature of these changes in individual genomes in the population. We are interested in the inverse problem of reconstructing past population dynamics from genomic data. We start with a standard framework based on the coalescent, a stochastic process that generates genealogies connecting randomly sampled individuals from the population of interest. These genealogies serve as a glue between the population demographic history and genomic sequences. It turns out that only the times of genealogical lineage coalescences contain information about population size dynamics. Viewing these coalescent times as a point process, estimating population size trajectories is equivalent to estimating a conditional intensity of this point process. Therefore, our inverse problem is similar to estimating an inhomogeneous Poisson process intensity function. We demonstrate how recent advances in Gaussian process-based nonparametric inference for Poisson processes can be extended to Bayesian nonparametric estimation of population size dynamics under the coalescent. We compare our Gaussian process (GP) approach to one of the state-of-the-art Gaussian Markov random field (GMRF) methods for estimating population trajectories. Using simulated data, we demonstrate that our method has better accuracy and precision. Next, we analyze two genealogies reconstructed from real sequences of hepatitis C and human Influenza A viruses. In both cases, we recover more believed aspects of the viral demographic histories than the GMRF approach. We also find that our GP method produces more reasonable uncertainty estimates than the GMRF method. Copyright © 2013, The International Biometric Society.

Estimating the size of hidden populations using respondent-driven sampling data: Case examples from Morocco

PubMed Central

Johnston, Lisa G; McLaughlin, Katherine R; Rhilani, Houssine El; Latifi, Amina; Toufik, Abdalla; Bennani, Aziza; Alami, Kamal; Elomari, Boutaina; Handcock, Mark S

2015-01-01

Background Respondent-driven sampling is used worldwide to estimate the population prevalence of characteristics such as HIV/AIDS and associated risk factors in hard-to-reach populations. Estimating the total size of these populations is of great interest to national and international organizations; however, reliable measures of population size often do not exist. Methods Successive Sampling-Population Size Estimation (SS-PSE) along with network size imputation allows population size estimates to be made without relying on separate studies or additional data (as in network scale-up, multiplier and capture-recapture methods), which may be biased. Results Ten population size estimates were calculated for people who inject drugs, female sex workers, men who have sex with other men, and migrants from sub-Saharan Africa in six different cities in Morocco. SS-PSE estimates fell within or very close to the likely values provided by experts and the estimates from previous studies using other methods. Conclusions SS-PSE is an effective method for estimating the size of hard-to-reach populations that leverages important information within respondent-driven sampling studies. The addition of a network size imputation method helps to smooth network sizes, allowing for more accurate results. However, caution should be used, particularly when there is reason to believe that clustered subgroups may exist within the population of interest or when the sample size is small in relation to the population. PMID:26258908
An ELISA method to compute endpoint titers to Epstein-Barr virus and cytomegalovirus: application to population-based studies.

PubMed

Stowe, Raymond P; Ruiz, R Jeanne; Fagundes, Christopher P; Stowe, Robin H; Chen, Min; Glaser, Ronald

2014-06-01

Indirect fluorescence analysis (IFA), the gold standard for determining herpesvirus antibody titers, is labor-intensive and poorly suited for large population-based studies. The enzyme-linked immunosorbent assay (ELISA) is used widely for measuring antiviral antibodies but also suffers drawbacks such as reduced specificity and the qualitative nature of the results due to limited interpretation of the optical density (OD) units. This paper describes a method to titer herpesvirus antibodies using microplates coated with virally-infected cells in which a standard curve, derived from IFA-scored samples, allowed OD units to be converted into titers. A LOOKUP function was created in order to report the data as traditional IFA-based (i.e., 2-fold) titers. The modified ELISA correlated significantly with IFA and was subsequently used to compute endpoint antibody titers to Epstein-Barr virus (EBV)-virus capsid antigen (VCA) and cytomegalovirus (CMV) in blood samples taken from 398 pregnant Hispanic women. Four women were EBV negative (1%), while 58 women were CMV negative (14.6%). EBV VCA antibody titers were significantly higher than CMV antibody titers (p<0.001). This method allows titering of herpesvirus antibodies by ELISA suitable for large population-based studies. In addition, the LOOKUP table enables conversion from OD-derived titers into 2-fold titers for comparison of results with other studies. Copyright © 2014 Elsevier B.V. All rights reserved.
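The two conversion steps described above, from OD units to a titer through a standard curve and from that titer to a reported 2-fold titer, can be sketched as follows. The calibration points, the 2-fold series, and the interpolation on a log2 scale are hypothetical choices made for illustration. The paper's actual curve fit and LOOKUP table are not given in the abstract.

```python
import bisect
import math

# Hypothetical 2-fold reporting series and calibration points (OD, IFA titer).
TWOFOLD_TITERS = [20, 40, 80, 160, 320, 640, 1280, 2560]
STANDARD_CURVE = [(0.2, 20), (0.6, 80), (1.0, 320)]


def od_to_titer(od, curve=STANDARD_CURVE):
    """Interpolate log2(titer) linearly in OD, clamping outside the curve."""
    ods = [point[0] for point in curve]
    logs = [math.log2(point[1]) for point in curve]
    if od <= ods[0]:
        return float(curve[0][1])
    if od >= ods[-1]:
        return float(curve[-1][1])
    hi = bisect.bisect_right(ods, od)
    lo = hi - 1
    fraction = (od - ods[lo]) / (ods[hi] - ods[lo])
    return 2 ** (logs[lo] + fraction * (logs[hi] - logs[lo]))


def to_twofold_titer(titer, steps=TWOFOLD_TITERS):
    """Largest 2-fold step not exceeding the titer, like a sorted LOOKUP."""
    index = bisect.bisect_right(steps, titer)
    if index == 0:
        return None  # below the lowest reportable titer
    return steps[index - 1]


reported = to_twofold_titer(od_to_titer(0.5))
```

Rounding down to the nearest step keeps results comparable with dilution-series titers reported by IFA-based studies.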
An evaluation of authentication methods for smartphone based on users’ preferences

NASA Astrophysics Data System (ADS)

Sari, P. K.; Ratnasari, G. S.; Prasetio, A.

2016-04-01

This study discusses smartphone screen lock preferences across several types of authentication methods. The purpose is to determine user behaviours based on perceived security and convenience, as well as preferences for different types of authentication methods. The variables used are the considerations for locking the screen and the types of authentication methods. The population consists of smartphone users, with a total sample of 400 respondents selected by a nonprobability sampling method. The data were analysed using descriptive analysis. The results showed that convenience is still the major consideration for locking smartphone screens. The majority of users chose pattern unlock as the most convenient method to use. Meanwhile, fingerprint unlock is perceived by users as the most secure method and is the method they would choose to use in the future.
Rapid Antibiotic Susceptibility Testing of Uropathogenic E. coli by Tracking Submicron Scale Motion of Single Bacterial Cells.

PubMed

Syal, Karan; Shen, Simon; Yang, Yunze; Wang, Shaopeng; Haydel, Shelley E; Tao, Nongjian

2017-08-25

To combat antibiotic resistance, a rapid antibiotic susceptibility testing (AST) technology that can identify resistant infections at disease onset is required. Current clinical AST technologies take 1-3 days, which is often too slow for accurate treatment. Here we demonstrate a rapid AST method by tracking sub-μm scale bacterial motion with an optical imaging and tracking technique. We apply the method to clinically relevant bacterial pathogens, Escherichia coli O157:H7 and uropathogenic E. coli (UPEC) loosely tethered to a glass surface. By analyzing dose-dependent sub-μm motion changes in a population of bacterial cells, we obtain the minimum bactericidal concentration within 2 h using human urine samples spiked with UPEC. We validate the AST method using the standard culture-based AST methods. In addition to population studies, the method allows single cell analysis, which can identify subpopulations of resistant strains within a sample.
Using effort information with change-in-ratio data for population estimation

USGS Publications Warehouse

Udevitz, Mark S.; Pollock, Kenneth H.

1995-01-01

Most change-in-ratio (CIR) methods for estimating fish and wildlife population sizes have been based only on assumptions about how encounter probabilities vary among population subclasses. When information on sampling effort is available, it is also possible to derive CIR estimators based on assumptions about how encounter probabilities vary over time. This paper presents a generalization of previous CIR models that allows explicit consideration of a range of assumptions about the variation of encounter probabilities among subclasses and over time. Explicit estimators are derived under this model for specific sets of assumptions about the encounter probabilities. Numerical methods are presented for obtaining estimators under the full range of possible assumptions. Likelihood ratio tests for these assumptions are described. Emphasis is on obtaining estimators based on assumptions about variation of encounter probabilities over time.
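As background for the generalization described above, the classical two-sample change-in-ratio estimator can be sketched as follows. It assumes a closed population with two subclasses, known removals between the two surveys, and equal encounter probabilities for both subclasses. The effort-based estimators derived in the paper are not reproduced, and the numbers are made up.

```python
def change_in_ratio(p_before, p_after, removed_x, removed_total):
    """Classical CIR estimate of the population size before the removals.

    p_before, p_after -- observed proportion of x-type animals before and after
    removed_x         -- number of x-type animals removed between surveys
    removed_total     -- total number of animals removed between surveys
    """
    if p_before == p_after:
        raise ValueError("the proportions did not change: estimate is undefined")
    return (removed_x - removed_total * p_after) / (p_before - p_after)


# Made-up example: 40% x-type before; 200 x-type and 50 y-type animals removed;
# the proportion of x-type animals afterwards is 200 / 750.
initial_size = change_in_ratio(0.4, 200 / 750, 200, 250)
```

The estimate is only as good as the observed change in proportions, so removals that barely shift the ratio give unstable results.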
Geographic origin and individual assignment of Shorea platyclados (Dipterocarpaceae) for forensic identification

PubMed Central

Diway, Bibian; Khoo, Eyen

2017-01-01

The development of timber tracking methods based on genetic markers can provide scientific evidence to verify the origin of timber products and fulfill the growing requirement for sustainable forestry practices. In this study, the origin of an important Dark Red Meranti wood, Shorea platyclados, was studied by using a combination of seven chloroplast DNA markers and 15 short tandem repeat (STR) markers. A total of 27 natural populations of S. platyclados were sampled throughout Malaysia to establish population level and individual level identification databases. A haplotype map was generated from chloroplast DNA sequencing for population identification, resulting in 29 multilocus haplotypes, based on 39 informative intraspecific variable sites. Subsequently, a DNA profiling database was developed from 15 STRs allowing for individual identification in Malaysia. Cluster analysis divided the 27 populations into two genetic clusters, corresponding to the regions of Eastern and Western Malaysia. The conservativeness tests showed that the Malaysia database is conservative after removal of bias from population subdivision and sampling effects. Independent self-assignment tests correctly assigned individuals to the database in an overall 60.60−94.95% of cases for identified populations, and in 98.99−99.23% of cases for identified regions. Both the chloroplast DNA database and the STRs appear to be useful for tracking timber originating in Malaysia. Hence, this DNA-based method could serve as an effective additional tool for the existing forensic timber identification system, helping to ensure the sustainable management of this species into the future. PMID:28430826
Intercoalescence time distribution of incomplete gene genealogies in temporally varying populations, and applications in population genetic inference.

PubMed

Chen, Hua

2013-03-01

Tracing back to a specific time T in the past, the genealogy of a sample of haplotypes may not have reached their common ancestor and may leave m lineages extant. For such an incomplete genealogy truncated at a specific time T in the past, the distribution and expectation of the intercoalescence times conditional on T are derived in an exact form in this paper for populations of deterministically time-varying sizes, specifically, for populations growing exponentially. The derived intercoalescence time distribution can be integrated to the coalescent-based joint allele frequency spectrum (JAFS) theory, and is useful for population genetic inference from large-scale genomic data, without relying on computationally intensive approaches, such as importance sampling and Markov Chain Monte Carlo (MCMC) methods. The inference of several important parameters relying on this derived conditional distribution is demonstrated: quantifying population growth rate and onset time, and estimating the number of ancestral lineages at a specific ancient time. Simulation studies confirm validity of the derivation and statistical efficiency of the methods using the derived intercoalescence time distribution. Two examples of real data are given to show the inference of the population growth rate of a European sample from the NIEHS Environmental Genome Project, and the number of ancient lineages of 31 mitochondrial genomes from Tibetan populations. © 2013 Blackwell Publishing Ltd/University College London.
Investigation of real tissue water equivalent path lengths using an efficient dose extinction method

NASA Astrophysics Data System (ADS)

Zhang, Rongxiao; Baer, Esther; Jee, Kyung-Wook; Sharp, Gregory C.; Flanz, Jay; Lu, Hsiao-Ming

2017-07-01

For proton therapy, an accurate conversion of CT HU to relative stopping power (RSP) is essential. Validation of the conversion based on real tissue samples is more direct than the current practice solely based on tissue substitutes and can potentially address variations over the population. Based on a novel dose extinction method, we measured water equivalent path lengths (WEPL) on animal tissue samples to evaluate the accuracy of CT HU to RSP conversion and potential variations over a population. A broad proton beam delivered a spread out Bragg peak to the samples sandwiched between a water tank and a 2D ion-chamber detector. WEPLs of the samples were determined from the transmission dose profiles measured as a function of the water level in the tank. Tissue substitute inserts and Lucite blocks with known WEPLs were used to validate the accuracy. A large number of real tissue samples were measured. Variations of WEPL over different batches of tissue samples were also investigated. The measured WEPLs were compared with those computed from CT scans with the Stoichiometric calibration method. WEPLs were determined within ±0.5% percentage deviation (% std/mean) and ±0.5% error for most of the tissue surrogate inserts and the calibration blocks. For biological tissue samples, percentage deviations were within ±0.3%. No considerable difference (<1%) in WEPL was observed for the same type of tissue from different sources. The differences between measured WEPLs and those calculated from CT were within 1%, except for some bony tissues. Depending on the sample size, each dose extinction measurement took around 5 min to produce ~1000 WEPL values to be compared with calculations. This dose extinction system measures WEPL efficiently and accurately, which allows the validation of CT HU to RSP conversions based on the WEPL measured for a large number of samples and real tissues.
Mark-recapture using tetracycline and genetics reveal record-high bear density

USGS Publications Warehouse

Peacock, E.; Titus, K.; Garshelis, D.L.; Peacock, M.M.; Kuc, M.

2011-01-01

We used tetracycline biomarking, augmented with genetic methods, to estimate the size of an American black bear (Ursus americanus) population on an island in Southeast Alaska. We marked 132 and 189 bears that consumed remote, tetracycline-laced baits in 2 different years, respectively, and observed 39 marks in 692 bone samples subsequently collected from hunters. We genetically analyzed hair samples from bait sites to determine the sex of marked bears, facilitating derivation of sex-specific population estimates. We obtained harvest samples from beyond the study area to correct for emigration. We estimated a density of 155 independent bears/100 km², which is equivalent to the highest recorded for this species. This high density appears to be maintained by abundant, accessible natural food. Our population estimate (approx. 1,000 bears) could be used as a baseline and to set hunting quotas. The refined biomarking method for abundance estimation is a useful alternative where physical captures or DNA-based estimates are precluded by cost or logistics. Copyright © 2011 The Wildlife Society.
Methodological challenges in collecting social and behavioural data regarding the HIV epidemic among gay and other men who have sex with men in Australia.

PubMed

Zablotska, Iryna B; Frankland, Andrew; Holt, Martin; de Wit, John; Brown, Graham; Maycock, Bruce; Fairley, Christopher; Prestage, Garrett

2014-01-01

Behavioural surveillance and research among gay and other men who have sex with men (GMSM) commonly relies on non-random recruitment approaches. Methodological challenges limit their ability to accurately represent the population of adult GMSM. We compared the social and behavioural profiles of GMSM recruited via venue-based, online, and respondent-driven sampling (RDS) and discussed their utility for behavioural surveillance. Data from four studies were selected to reflect each recruitment method. We compared demographic characteristics and the prevalence of key indicators including sexual and HIV testing practices obtained from samples recruited through different methods, and population estimates from respondent-driven sampling partition analysis. Overall, the socio-demographic profile of GMSM was similar across samples, with some differences observed in age and sexual identification. Men recruited through time-location sampling appeared more connected to the gay community, reported a greater number of sexual partners, but engaged in less unprotected anal intercourse with regular (UAIR) or casual partners (UAIC). The RDS sample overestimated the proportion of HIV-positive men and appeared to recruit men with an overall higher number of sexual partners. A single-website survey recruited a sample with characteristics which differed considerably from the population estimates with regard to age, ethnic diversity and behaviour. Data acquired through time-location sampling underestimated the rates of UAIR and UAIC, while RDS and online sampling both generated samples that underestimated UAIR. Simulated composite samples combining recruits from time-location and multi-website online sampling may produce characteristics more consistent with the population estimates, particularly with regard to sexual practices. Respondent-driven sampling produced the sample that was most consistent with population estimates, but this methodology is complex and logistically demanding.
Time-location and online recruitment are more cost-effective and easier to implement; using these approaches in combination may offer the potential to recruit a more representative sample of GMSM.
Population Fisher information matrix and optimal design of discrete data responses in population pharmacodynamic experiments.

PubMed

Ogungbenro, Kayode; Aarons, Leon

2011-08-01

In recent years, interest in the application of experimental design theory to population pharmacokinetic (PK) and pharmacodynamic (PD) experiments has increased. The aim is to improve the efficiency and the precision with which parameters are estimated during data analysis and sometimes to increase the power and reduce the sample size required for hypothesis testing. The population Fisher information matrix (PFIM) has been described for uniresponse and multiresponse population PK experiments for design evaluation and optimisation. Despite these developments and the availability of tools for optimal design of population PK and PD experiments, much of the effort has been focused on repeated continuous variable measurements, with less work being done on repeated discrete type measurements. Discrete data arise mainly in PD, e.g. ordinal, nominal, dichotomous or count measurements. This paper implements expressions for the PFIM for repeated ordinal, dichotomous and count measurements based on analysis by a mixed-effects modelling technique. Three simulation studies were used to investigate the performance of the expressions. Example 1 is based on repeated dichotomous measurements, Example 2 is based on repeated count measurements and Example 3 is based on repeated ordinal measurements. Data simulated in MATLAB were analysed using NONMEM (Laplace method) and the glmmML package in R (Laplace and adaptive Gauss-Hermite quadrature methods). The results obtained for Examples 1 and 2 showed good agreement between the relative standard errors obtained using the PFIM and simulations. The results obtained for Example 3 showed the importance of sampling at the most informative time points. Implementation of these expressions will provide the opportunity for efficient design of population PD experiments that involve discrete type data through design evaluation and optimisation.
Comparison of Sample Size by Bootstrap and by Formulas Based on Normal Distribution Assumption.

PubMed

Wang, Zuozhen

2018-01-01

The bootstrapping technique is distribution-independent, which provides an indirect way to estimate the sample size for a clinical trial based on a relatively smaller sample. In this paper, sample size estimation by a bootstrap procedure for comparing two parallel-design arms with continuous data is presented for various test types (inequality, non-inferiority, superiority, and equivalence). Sample size calculation by mathematical formulas (under the normal distribution assumption) is also carried out for the identical data. The power difference between the two calculation methods is acceptably small for all the test types, which shows that the bootstrap procedure is a credible technique for sample size estimation. We then compared the powers determined using the two methods based on data that violate the normal distribution assumption. To accommodate this feature of the data, the nonparametric Wilcoxon test was applied to compare the two groups during the process of bootstrap power estimation. As a result, the power estimated by the normal distribution-based formula is far larger than that by bootstrap for each specific sample size per group. Hence, for this type of data, it is preferable that the bootstrap method be applied for sample size calculation at the beginning, and that the same statistical method as used in the subsequent statistical analysis be employed for each bootstrap sample during the course of bootstrap sample size estimation, provided that historical true data are available that are well representative of the population to which the proposed trial is planning to extrapolate.
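The general idea of bootstrap power estimation with a rank-based test can be sketched as follows. This is a minimal illustration for the two-sided inequality case only, assuming pilot data for each arm are available; the function names, the normal approximation to the rank-sum test, and the candidate sample sizes are choices made for this sketch, not details taken from the paper.

```python
import math
import random


def rank_sum_p_value(x, y):
    """Two-sided Wilcoxon rank-sum p-value (normal approximation, tie-corrected)."""
    pooled = sorted((v, g) for g, grp in enumerate((x, y)) for v in grp)
    n1, n2 = len(x), len(y)
    n = n1 + n2
    rank_sum_x, tie_term, i = 0.0, 0.0, 0
    while i < n:
        j = i
        while j < n and pooled[j][0] == pooled[i][0]:
            j += 1                      # pooled[i:j] is one block of tied values
        avg_rank = (i + 1 + j) / 2.0    # average of ranks i+1 .. j
        t = j - i
        tie_term += t ** 3 - t
        rank_sum_x += avg_rank * sum(1 for k in range(i, j) if pooled[k][1] == 0)
        i = j
    mean = n1 * (n + 1) / 2.0
    var = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1)))
    if var <= 0:
        return 1.0                      # all values identical: no evidence either way
    z = (rank_sum_x - mean) / math.sqrt(var)
    return math.erfc(abs(z) / math.sqrt(2.0))


def bootstrap_power(pilot_a, pilot_b, n_per_group, n_boot=1000, alpha=0.05, seed=0):
    """Fraction of bootstrap trials of size n_per_group per arm that reject H0."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(n_boot):
        a = rng.choices(pilot_a, k=n_per_group)   # resample with replacement
        b = rng.choices(pilot_b, k=n_per_group)
        if rank_sum_p_value(a, b) < alpha:
            rejections += 1
    return rejections / n_boot


def smallest_n(pilot_a, pilot_b, target_power=0.80, candidates=range(10, 201, 10)):
    """First candidate per-group size whose bootstrap power reaches the target."""
    for n in candidates:
        if bootstrap_power(pilot_a, pilot_b, n) >= target_power:
            return n
    return None
```

The point the abstract makes is reflected in `bootstrap_power`: the test applied to each bootstrap sample should be the same test planned for the final analysis, so swapping in a different test means replacing `rank_sum_p_value`.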
Lot quality assurance sampling (LQAS) for monitoring a leprosy elimination program.

PubMed

Gupte, M D; Narasimhamurthy, B

1999-06-01

In a statistical sense, prevalences of leprosy in different geographical areas can be called very low or rare. Conventional survey methods to monitor leprosy control programs, therefore, need large sample sizes, are expensive, and are time-consuming. Further, with the lowering of prevalence to the near-desired target level, 1 case per 10,000 population at national or subnational levels, the program administrator's concern will be shifted to smaller areas, e.g., districts, for assessment and, if needed, for necessary interventions. In this paper, Lot Quality Assurance Sampling (LQAS), a quality control tool in industry, is proposed to identify districts/regions having a prevalence of leprosy at or above a certain target level, e.g., 1 in 10,000. This technique can also be considered for identifying districts/regions at or below the target level of 1 per 10,000, i.e., areas where the elimination level is attained. For simulating various situations and strategies, a hypothetical computerized population of 10 million persons was created. This population mimics the actual population in terms of the empirical information on rural/urban distributions and the distribution of households by size for the state of Tamil Nadu, India. Various levels with respect to leprosy prevalence are created using this population. The distribution of the number of cases in the population was expected to follow the Poisson process, and this was also confirmed by examination. Sample sizes and corresponding critical values were computed using Poisson approximation. Initially, villages/towns are selected from the population and from each selected village/town households are selected using systematic sampling. Households instead of individuals are used as sampling units. This sampling procedure was simulated 1000 times in the computer from the base population. The results in four different prevalence situations meet the required limits of Type I error of 5% and 90% Power. 
It is concluded that after validation under field conditions, this method can be considered for a rapid assessment of the leprosy situation.
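The Poisson-approximation step for choosing a sample size and critical value can be sketched as below. This is a simplified illustration that samples individuals by simple random sampling, whereas the study sampled households within villages and towns; the function names and the upper prevalence of 5 per 10,000 used in the example are assumptions, not values from the paper.

```python
import math


def poisson_cdf(c, mu):
    """P(X <= c) for X ~ Poisson(mu)."""
    term = math.exp(-mu)
    total = term
    for k in range(1, c + 1):
        term *= mu / k
        total += term
    return total


def lqas_design(p_low, p_high, alpha=0.05, power=0.90, max_c=50):
    """Smallest critical value c, and its sample size n, meeting both error limits.

    Decision rule assumed here: flag the area as high prevalence when the
    number of cases found in a sample of n persons exceeds c.
    """
    for c in range(max_c + 1):
        # Bisection for the expected count at which P(X <= c) equals 1 - power.
        lo, hi = 0.0, 200.0
        for _ in range(100):
            mid = (lo + hi) / 2.0
            if poisson_cdf(c, mid) > 1.0 - power:
                lo = mid
            else:
                hi = mid
        n = math.ceil(hi / p_high)      # sample size giving the required power
        type_one = 1.0 - poisson_cdf(c, n * p_low)
        if type_one <= alpha:
            return n, c
    return None


# Hypothetical thresholds: elimination level 1 per 10,000 versus 5 per 10,000.
design = lqas_design(1e-4, 5e-4)
```

Because the two prevalence levels are both very small, the required sample is large in absolute terms, which is consistent with the abstract's remark that conventional surveys for rare conditions are expensive.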
The use of genetics for the management of a recovering population: temporal assessment of migratory peregrine falcons in North America

USGS Publications Warehouse

Johnson, Jeff A.; Talbot, Sandra L.; Sage, George K.; Burnham, Kurt K.; Brown, Joseph W.; Maechtle, Tom L.; Seegar, William S.; Yates, Michael A.; Anderson, Bud; Mindell, David P.

2010-01-01

Background: Our ability to monitor populations or species that were once threatened or endangered and in the process of recovery is enhanced by using genetic methods to assess overall population stability and size over time. This can be accomplished most directly by obtaining genetic measures from temporally-spaced samples that reflect the overall stability of the population as given by changes in genetic diversity levels (allelic richness and heterozygosity), degree of population differentiation (FST and DEST), and effective population size (Ne). The primary goal of any recovery effort is to produce a long-term self-sustaining population, and these measures provide a metric by which we can gauge our progress and help make important management decisions. Methodology/Principal Findings: The peregrine falcon in North America (Falco peregrinus tundrius and anatum) was delisted in 1994 and 1999, respectively, and its abundance will be monitored by the species Recovery Team every three years until 2015. Although the United States Fish and Wildlife Service makes a distinction between tundrius and anatum subspecies, our genetic results based on eleven microsatellite loci, including those from Brown et al. (2007), suggest no differentiation and warrant delineation of a subspecies in its northern latitudinal distribution from Alaska through Canada into Greenland. Using temporal samples collected at Padre Island, Texas during migration (seven temporal time periods between 1985-2007), no significant differences in genetic diversity or significant population differentiation in allele frequencies between time periods were observed and were indistinguishable from those obtained from tundrius/anatum breeding locations throughout their northern distribution. Estimates of harmonic mean Ne were variable and imprecise, but always greater than 500 when employing multiple temporal genetic methods.
These results, including those from simulations to assess the power of each method to estimate Ne, suggest a stable population consistent with data from field-based monitoring indicating that this species is stable or continuing to increase in abundance. Therefore, historic and continuing efforts to prevent the extinction of the peregrine falcon in North America appear successful, further highlighting the importance of archiving samples for continual assessment of population recovery and long-term viability.
Sexual Abuse among Female High School Students in Istanbul, Turkey

ERIC Educational Resources Information Center

Alikasifoglu, Mujgan; Erginoz, Ethem; Ercan, Oya; Albayrak-Kaymak, Deniz; Uysal, Omer; Ilter, Ozdemir

2006-01-01

Objective: The objective of the study was to determine the prevalence of sexual abuse in female adolescents in Istanbul, Turkey from data collected as part of a school-based population study on health and health behaviors. Method: A stratified cluster sampling procedure was used for this cross-sectional study. The study sample included 1,955…
The Effect of Non-Normal Distributions on the Integrated Moving Average Model of Time-Series Analysis.

ERIC Educational Resources Information Center

Doerann-George, Judith

The Integrated Moving Average (IMA) model of time series, and the analysis of intervention effects based on it, assume random shocks which are normally distributed. To determine the robustness of the analysis to violations of this assumption, empirical sampling methods were employed. Samples were generated from three populations; normal,…
Genetic and Environmental Effects on Vocal Symptoms and Their Intercorrelations

ERIC Educational Resources Information Center

Nybacka, Ida; Simberg, Susanna; Santtila, Pekka; Sala, Eeva; Sandnabba, N. Kenneth

2012-01-01

Purpose: Recently, Simberg et al. (2009) found genetic effects on a composite variable consisting of 6 vocal symptom items measuring dysphonia. The purpose of the present study was to determine genetic and environmental effects on the individual vocal symptoms in a population-based sample of Finnish twins. Method: The sample comprised 1,728 twins…
Estimating uncertainty in respondent-driven sampling using a tree bootstrap method.

PubMed

Baraff, Aaron J; McCormick, Tyler H; Raftery, Adrian E

2016-12-20

Respondent-driven sampling (RDS) is a network-based form of chain-referral sampling used to estimate attributes of populations that are difficult to access using standard survey tools. Although it has grown quickly in popularity since its introduction, the statistical properties of RDS estimates remain elusive. In particular, the sampling variability of these estimates has been shown to be much higher than previously acknowledged, and even methods designed to account for RDS result in misleadingly narrow confidence intervals. In this paper, we introduce a tree bootstrap method for estimating uncertainty in RDS estimates based on resampling recruitment trees. We use simulations from known social networks to show that the tree bootstrap method not only outperforms existing methods but also captures the high variability of RDS, even in extreme cases with high design effects. We also apply the method to data from injecting drug users in Ukraine. Unlike other methods, the tree bootstrap depends only on the structure of the sampled recruitment trees, not on the attributes being measured on the respondents, so correlations between attributes can be estimated as well as variability. Our results suggest that it is possible to accurately assess the high level of uncertainty inherent in RDS.
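The resampling idea described above, drawing seeds with replacement and then drawing each resampled respondent's recruits with replacement down the tree, can be sketched as follows. This is a minimal illustration: the data layout (`seeds`, `recruits`, `trait`) and function names are hypothetical, and a plain sample mean stands in for whichever RDS estimator would be used in practice.

```python
import random


def tree_bootstrap(seeds, recruits, rng):
    """Return one bootstrap replicate as a list of respondent ids (with repeats).

    seeds    -- list of seed respondent ids
    recruits -- dict mapping a respondent id to the list of ids they recruited
    """
    replicate = []

    def resample_subtree(node):
        replicate.append(node)
        children = recruits.get(node, [])
        if children:
            # Resample this respondent's recruits with replacement, then recurse.
            for child in rng.choices(children, k=len(children)):
                resample_subtree(child)

    for seed in rng.choices(seeds, k=len(seeds)):
        resample_subtree(seed)
    return replicate


def bootstrap_interval(seeds, recruits, trait, n_rep=2000, level=0.95, seed=0):
    """Percentile interval for the mean of a 0/1 trait over bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rep):
        ids = tree_bootstrap(seeds, recruits, rng)
        estimates.append(sum(trait[i] for i in ids) / len(ids))
    estimates.sort()
    lower = estimates[int((1 - level) / 2 * n_rep)]
    upper = estimates[int((1 + level) / 2 * n_rep) - 1]
    return lower, upper


# Hypothetical recruitment data: two seeds, each with a small tree.
seeds = ["s1", "s2"]
recruits = {"s1": ["a", "b"], "a": ["c", "d", "e"], "s2": ["f"], "f": ["g", "h"]}
trait = {"s1": 1, "a": 1, "b": 0, "c": 1, "d": 0, "e": 1,
         "s2": 0, "f": 0, "g": 1, "h": 0}
interval = bootstrap_interval(seeds, recruits, trait)
```

Because only the tree structure drives the resampling, the same replicates can be reused for several attributes at once, which matches the abstract's remark that correlations between attributes can be estimated as well as variability.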
Analysis of genetic population structure in Acacia caven (Leguminosae, Mimosoideae), comparing one exploratory and two Bayesian-model-based methods

PubMed Central

Pometti, Carolina L.; Bessega, Cecilia F.; Saidman, Beatriz O.; Vilardi, Juan C.

2014-01-01

Bayesian clustering as implemented in STRUCTURE or GENELAND software is widely used to form genetic groups of populations or individuals. On the other hand, in order to satisfy the need for less computer-intensive approaches, multivariate analyses are specifically devoted to extracting information from large datasets. In this paper, we report the use of a dataset of AFLP markers belonging to 15 sampling sites of Acacia caven for studying the genetic structure and comparing the consistency of three methods: STRUCTURE, GENELAND and DAPC. Of these methods, DAPC was the fastest one and showed accuracy in inferring the K number of populations (K = 12 using the find.clusters option and K = 15 with a priori information of populations). GENELAND in turn, provides information on the area of membership probabilities for individuals or populations in the space, when coordinates are specified (K = 12). STRUCTURE also inferred the number of K populations and the membership probabilities of individuals based on ancestry, presenting the result K = 11 without prior information of populations and K = 15 using the LOCPRIOR option. Finally, in this work all three methods showed high consistency in estimating the population structure, inferring similar numbers of populations and the membership probabilities of individuals to each group, with a high correlation between each other. PMID:24688293
An Empirical Analysis of the Impact of Recruitment Patterns on RDS Estimates among a Socially Ordered Population of Female Sex Workers in China

PubMed Central

Yamanis, Thespina J.; Merli, M. Giovanna; Neely, William Whipple; Tian, Felicia Feng; Moody, James; Tu, Xiaowen; Gao, Ersheng

2013-01-01

Respondent-driven sampling (RDS) is a method for recruiting “hidden” populations through a network-based, chain and peer referral process. RDS recruits hidden populations more effectively than other sampling methods and promises to generate unbiased estimates of their characteristics. RDS’s faithful representation of hidden populations relies on the validity of core assumptions regarding the unobserved referral process. With empirical recruitment data from an RDS study of female sex workers (FSWs) in Shanghai, we assess the RDS assumption that participants recruit nonpreferentially from among their network alters. We also present a bootstrap method for constructing the confidence intervals around RDS estimates. This approach uniquely incorporates real-world features of the population under study (e.g., the sample’s observed branching structure). We then extend this approach to approximate the distribution of RDS estimates under various peer recruitment scenarios consistent with the data as a means to quantify the impact of recruitment bias and of rejection bias on the RDS estimates. We find that the hierarchical social organization of FSWs leads to recruitment biases by constraining RDS recruitment across social classes and introducing bias in the RDS estimates. PMID:24288418

A Third Moment Adjusted Test Statistic for Small Sample Factor Analysis

PubMed Central

Lin, Johnny; Bentler, Peter M.

2012-01-01

Goodness of fit testing in factor analysis is based on the assumption that the test statistic is asymptotically chi-square; but this property may not hold in small samples even when the factors and errors are normally distributed in the population. Robust methods such as Browne’s asymptotically distribution-free method and Satorra Bentler’s mean scaling statistic were developed under the presumption of non-normality in the factors and errors. This paper finds new application to the case where factors and errors are normally distributed in the population but the skewness of the obtained test statistic is still high due to sampling error in the observed indicators. An extension of Satorra Bentler’s statistic is proposed that not only scales the mean but also adjusts the degrees of freedom based on the skewness of the obtained test statistic in order to improve its robustness under small samples. A simple simulation study shows that this third moment adjusted statistic asymptotically performs on par with previously proposed methods, and at a very small sample size offers superior Type I error rates under a properly specified model. Data from Mardia, Kent and Bibby’s study of students tested for their ability in five content areas that were either open or closed book were used to illustrate the real-world performance of this statistic. PMID:23144511
Whither RDS? An investigation of Respondent Driven Sampling as a method of recruiting mainstream marijuana users

PubMed Central

2010-01-01

Background An important challenge in conducting social research of specific relevance to harm reduction programs is locating hidden populations of consumers of substances like cannabis who typically report few adverse or unwanted consequences of their use. Much of the deviant, pathologized perception of drug users is historically derived from, and empirically supported, by a research emphasis on gaining ready access to users in drug treatment or in prison populations with higher incidence of problems of dependence and misuse. Because they are less visible, responsible recreational users of illicit drugs have been more difficult to study. Methods This article investigates Respondent Driven Sampling (RDS) as a method of recruiting experienced marijuana users representative of users in the general population. Based on sampling conducted in a multi-city study (Halifax, Montreal, Toronto, and Vancouver), and compared to samples gathered using other research methods, we assess the strengths and weaknesses of RDS recruitment as a means of gaining access to illicit substance users who experience few harmful consequences of their use. Demographic characteristics of the sample in Toronto are compared with those of users in a recent household survey and a pilot study of Toronto where the latter utilized nonrandom self-selection of respondents. Results A modified approach to RDS was necessary to attain the target sample size in all four cities (i.e., 40 'users' from each site). The final sample in Toronto was largely similar, however, to marijuana users in a random household survey that was carried out in the same city. Whereas well-educated, married, whites and females in the survey were all somewhat overrepresented, the two samples, overall, were more alike than different with respect to economic status and employment. 
Furthermore, comparison with a self-selected sample suggests that (even modified) RDS recruitment is a cost-effective way of gathering respondents who are more representative of users in the general population than nonrandom methods of recruitment ordinarily produce. Conclusions Research on marijuana use, and other forms of drug use hidden in the general population of adults, is important for informing and extending harm reduction beyond its current emphasis on 'at-risk' populations. Expanding harm reduction in a normalizing context, through innovative research on users often overlooked, further challenges assumptions about reducing harm through prohibition of drug use and urges consideration of alternative policies such as decriminalization and legal regulation. PMID:20618944
Identification of a novel interspecific hybrid yeast from a metagenomic spontaneously inoculated beer sample using Hi-C.

PubMed

Smukowski Heil, Caiti; Burton, Joshua N; Liachko, Ivan; Friedrich, Anne; Hanson, Noah A; Morris, Cody L; Schacherer, Joseph; Shendure, Jay; Thomas, James H; Dunham, Maitreya J

2018-01-01

Interspecific hybridization is a common mechanism enabling genetic diversification and adaptation; however, the detection of hybrid species has been quite difficult. The identification of microbial hybrids is made even more complicated, as most environmental microbes are resistant to culturing and must be studied in their native mixed communities. We have previously adapted the chromosome conformation capture method Hi-C to the assembly of genomes from mixed populations. Here, we show the method's application in assembling genomes directly from an uncultured, mixed population from a spontaneously inoculated beer sample. Our assembly method has enabled us to de-convolute four bacterial and four yeast genomes from this sample, including a putative yeast hybrid. Downstream isolation and analysis of this hybrid confirmed its genome to consist of Pichia membranifaciens and that of another related, but undescribed, yeast. Our work shows that Hi-C-based metagenomic methods can overcome the limitation of traditional sequencing methods in studying complex mixtures of genomes. Copyright © 2017 John Wiley & Sons, Ltd.
The program structure does not reliably recover the correct population structure when sampling is uneven: subsampling and new estimators alleviate the problem.

PubMed

Puechmaille, Sebastien J

2016-05-01

Inferences of population structure and more precisely the identification of genetically homogeneous groups of individuals are essential to the fields of ecology, evolutionary biology and conservation biology. Such population structure inferences are routinely investigated via the program structure implementing a Bayesian algorithm to identify groups of individuals at Hardy-Weinberg and linkage equilibrium. While the method is performing relatively well under various population models with even sampling between subpopulations, the robustness of the method to uneven sample size between subpopulations and/or hierarchical levels of population structure has not yet been tested despite being commonly encountered in empirical data sets. In this study, I used simulated and empirical microsatellite data sets to investigate the impact of uneven sample size between subpopulations and/or hierarchical levels of population structure on the detected population structure. The results demonstrated that uneven sampling often leads to wrong inferences on hierarchical structure and downward-biased estimates of the true number of subpopulations. Distinct subpopulations with reduced sampling tended to be merged together, while at the same time, individuals from extensively sampled subpopulations were generally split, despite belonging to the same panmictic population. Four new supervised methods to detect the number of clusters were developed and tested as part of this study and were found to outperform the existing methods using both evenly and unevenly sampled data sets. Additionally, a subsampling strategy aiming to reduce sampling unevenness between subpopulations is presented and tested. These results altogether demonstrate that when sampling evenness is accounted for, the detection of the correct population structure is greatly improved. © 2016 John Wiley & Sons Ltd.
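One simple way to reduce sampling unevenness before a clustering run is to subsample every a priori group down to a common size. The sketch below illustrates that idea only; the abstract does not specify the details of the paper's subsampling strategy, and the function name, the use of known group labels, and the default target size are assumptions made here.

```python
import random
from collections import defaultdict


def even_subsample(individuals, group_of, size=None, seed=0):
    """Randomly keep at most `size` individuals per group.

    individuals -- iterable of individual ids
    group_of    -- dict mapping each id to its a priori group (e.g. sampling site)
    size        -- per-group target; defaults to the size of the smallest group
    """
    rng = random.Random(seed)
    by_group = defaultdict(list)
    for ind in individuals:
        by_group[group_of[ind]].append(ind)
    target = size if size is not None else min(len(m) for m in by_group.values())
    kept = []
    for label in sorted(by_group):          # fixed order keeps runs reproducible
        members = by_group[label]
        kept.extend(rng.sample(members, min(target, len(members))))
    return kept
```

Repeating the subsampling with several seeds and comparing the inferred number of clusters across runs would give a rough check on how sensitive the result is to which individuals were retained.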
Consequences of population topology for studying gene flow using link-based landscape genetic methods.

PubMed

van Strien, Maarten J

2017-07-01

Many landscape genetic studies aim to determine the effect of landscape on gene flow between populations. These studies frequently employ link-based methods that relate pairwise measures of historical gene flow to measures of the landscape and the geographical distance between populations. However, apart from landscape and distance, there is a third important factor that can influence historical gene flow, that is, population topology (i.e., the arrangement of populations throughout a landscape). As the population topology is determined in part by the landscape configuration, I argue that it should play a more prominent role in landscape genetics. Making use of existing literature and theoretical examples, I discuss how population topology can influence results in landscape genetic studies and how it can be taken into account to improve the accuracy of these results. In support of my arguments, I have performed a literature review of landscape genetic studies published during the first half of 2015 as well as several computer simulations of gene flow between populations. First, I argue why one should carefully consider which population pairs should be included in link-based analyses. Second, I discuss several ways in which the population topology can be incorporated in response and explanatory variables. Third, I outline why it is important to sample populations in such a way that a good representation of the population topology is obtained. Fourth, I discuss how statistical testing for link-based approaches could be influenced by the population topology. I conclude the article with six recommendations geared toward better incorporating population topology in link-based landscape genetic studies.
Non-parametric estimation of population size changes from the site frequency spectrum.

PubMed

Waltoft, Berit Lindum; Hobolth, Asger

2018-06-11

Change in population size is a useful quantity for understanding the evolutionary history of a species. Genetic variation within a species can be summarized by the site frequency spectrum (SFS). For a sample of size n, the SFS is a vector of length n - 1 where entry i is the number of sites where the mutant base appears i times and the ancestral base appears n - i times. We present a new method, CubSFS, for estimating the changes in population size of a panmictic population from an observed SFS. First, we provide a straightforward proof for the expression of the expected site frequency spectrum depending only on the population size. Our derivation is based on an eigenvalue decomposition of the instantaneous coalescent rate matrix. Second, we solve the inverse problem of determining the changes in population size from an observed SFS. Our solution is based on a cubic spline for the population size. The cubic spline is determined by minimizing the weighted average of two terms, namely (i) the goodness of fit to the observed SFS, and (ii) a penalty term based on the smoothness of the changes. The weight is determined by cross-validation. The new method is validated on simulated demographic histories and applied to unfolded and folded SFS from 26 different human populations from the 1000 Genomes Project.
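The SFS definition in the abstract above can be made concrete with a short sketch. It assumes the data arrive as a 0/1 matrix (one row per sampled sequence, one column per site, 1 marking the mutant base); the function names are illustrative and are not part of CubSFS. The folded spectrum is included because the abstract mentions both forms.

```python
def site_frequency_spectrum(haplotypes):
    """Unfolded SFS for n sequences: entry i-1 counts the sites where the
    mutant base appears exactly i times (i = 1 .. n-1)."""
    n = len(haplotypes)
    sfs = [0] * (n - 1)
    for site in zip(*haplotypes):      # iterate over columns (sites)
        i = sum(site)                  # number of mutant copies at this site
        if 0 < i < n:                  # skip sites that do not segregate
            sfs[i - 1] += 1
    return sfs


def fold(sfs):
    """Folded SFS: pool classes i and n-i, for use when the ancestral
    state is unknown."""
    n = len(sfs) + 1
    folded = []
    for i in range(1, n // 2 + 1):
        j = n - i
        folded.append(sfs[i - 1] + (sfs[j - 1] if j != i else 0))
    return folded


# Hypothetical toy data: 4 sequences, 5 sites.
example = [
    [1, 0, 0, 1, 0],
    [0, 0, 1, 1, 0],
    [0, 1, 0, 1, 0],
    [0, 0, 0, 0, 0],
]
```

For the toy matrix, three sites carry one mutant copy and one site carries three, so the unfolded spectrum has length n - 1 = 3, as the definition requires.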
Rapid estimation of microbial populations in fish samples by using terminal restriction fragment length polymorphism analysis of 16S rDNA.

PubMed

Tanaka, Yuichiro; Takahashi, Hajime; Kitazawa, Nao; Kimura, Bon

2010-01-01

A rapid system using terminal restriction fragment length polymorphism (T-RFLP) analysis targeting 16S rDNA is described for microbial population analysis in edible fish samples. The defined terminal restriction fragment database was constructed by collecting 102 strains of bacteria representing 53 genera that are associated with fish. Digestion of these 102 strains with two restriction enzymes, HhaI and MspI, formed 54 pattern groups with discrimination to the genus level. This T-RFLP system produced results comparable to those from a culture-based method in six natural fish samples with a qualitative correspondence of 71.4 to 92.3%. Using the T-RFLP system allowed an estimation of the microbial population within 7 h. Rapid assay of the microbial population is advantageous for food manufacturers and testing laboratories; moreover, the strategy presented here allows adaptation to specific testing applications.
Standardizing the double-observer survey method for estimating mountain ungulate prey of the endangered snow leopard.

PubMed

Suryawanshi, Kulbhushansingh R; Bhatnagar, Yash Veer; Mishra, Charudutt

2012-07-01

Mountain ungulates around the world have been threatened by illegal hunting, habitat modification, increased livestock grazing, disease and development. Mountain ungulates play an important functional role in grasslands as primary consumers and as prey for wild carnivores, and monitoring of their populations is important for conservation purposes. However, most of the several currently available methods of estimating wild ungulate abundance are either difficult to implement or too expensive for mountainous terrain. A rigorous method of sampling ungulate abundance in mountainous areas that can allow for some measure of sampling error is therefore much needed. To this end, we used a combination of field data and computer simulations to test the critical assumptions associated with the double-observer technique based on capture-recapture theory. The technique was modified and adapted to estimate the populations of bharal (Pseudois nayaur) and ibex (Capra sibirica) at five different sites. Conducting the two double-observer surveys simultaneously led to underestimation of the population by 15%. We therefore recommend separating the surveys in space or time. The overall detection probability for the two observers was 0.74 and 0.79. Our surveys estimated mountain ungulate populations (± 95% confidence interval) of 735 (± 44), 580 (± 46), 509 (± 53), 184 (± 40) and 30 (± 14) individuals at the five sites, respectively. A detection probability of 0.75 was found to be sufficient to detect a change of 20% in populations of >420 individuals. Based on these results, we believe that this method is sufficiently precise for scientific and conservation purposes and therefore recommend the use of the double-observer approach (with the two surveys separated in time or space) for the estimation and monitoring of mountain ungulate populations.
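The capture-recapture logic behind a double-observer survey can be sketched with the textbook two-sample estimator. The block below uses Chapman's bias-corrected form of the Lincoln-Petersen estimator with its usual variance approximation; this is an assumption for illustration, and the modified technique used in the study may differ. The function name and the counts are hypothetical.

```python
import math


def double_observer_estimate(both, only_first, only_second):
    """Two-observer abundance estimate (Chapman's bias-corrected
    Lincoln-Petersen form).

    both        -- units (for example, groups) seen by both observers
    only_first  -- units seen only by the first observer
    only_second -- units seen only by the second observer
    """
    n1 = both + only_first             # total seen by observer 1
    n2 = both + only_second            # total seen by observer 2
    n_hat = (n1 + 1) * (n2 + 1) / (both + 1) - 1
    # Standard variance approximation for the Chapman estimator.
    var = ((n1 + 1) * (n2 + 1) * only_first * only_second
           / ((both + 1) ** 2 * (both + 2)))
    return {
        "n_hat": n_hat,
        "ci_half_width": 1.96 * math.sqrt(var),   # normal-approximation 95% CI
        "p_first": both / n2,    # detection probability of observer 1
        "p_second": both / n1,   # detection probability of observer 2
    }


# Hypothetical counts, not taken from the study.
result = double_observer_estimate(both=30, only_first=8, only_second=6)
```

Each observer's detection probability is estimated from the units the other observer found, which is why the two surveys need to be independent; this is consistent with the recommendation above to separate them in space or time.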
Bayesian Estimation of Fish Disease Prevalence from Pooled Samples Incorporating Sensitivity and Specificity

NASA Astrophysics Data System (ADS)

Williams, Christopher J.; Moffitt, Christine M.

2003-03-01

An important emerging issue in fisheries biology is the health of free-ranging populations of fish, particularly with respect to the prevalence of certain pathogens. For many years, pathologists focused on captive populations and interest was in the presence or absence of certain pathogens, so it was economically attractive to test pooled samples of fish. Recently, investigators have begun to study individual fish prevalence from pooled samples. Estimation of disease prevalence from pooled samples is straightforward when assay sensitivity and specificity are perfect, but this assumption is unrealistic. Here we illustrate the use of a Bayesian approach for estimating disease prevalence from pooled samples when sensitivity and specificity are not perfect. We also focus on diagnostic plots to monitor the convergence of the Gibbs-sampling-based Bayesian analysis. The methods are illustrated with a sample data set.
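To illustrate why imperfect sensitivity and specificity matter for pooled testing, the sketch below writes out the probability that a pool of k fish tests positive and computes a posterior for individual-level prevalence. It assumes equal pool sizes, a uniform prior, and known sensitivity and specificity, and it uses a simple grid approximation in place of the Gibbs sampler described in the abstract. All names and numbers are hypothetical.

```python
import math


def pool_positive_prob(p, k, se, sp):
    """Probability that a pool of k individuals tests positive when the
    individual prevalence is p, assay sensitivity se and specificity sp."""
    p_all_clean = (1.0 - p) ** k
    return se * (1.0 - p_all_clean) + (1.0 - sp) * p_all_clean


def grid_posterior(x, m, k, se, sp, grid_size=2001):
    """Posterior over prevalence after x of m pools test positive
    (uniform prior, binomial likelihood, evaluated on a grid)."""
    grid = [i / (grid_size - 1) for i in range(grid_size)]
    log_post = []
    for p in grid:
        q = pool_positive_prob(p, k, se, sp)
        q = min(max(q, 1e-12), 1.0 - 1e-12)       # guard against log(0)
        log_post.append(x * math.log(q) + (m - x) * math.log(1.0 - q))
    peak = max(log_post)
    weights = [math.exp(v - peak) for v in log_post]
    total = sum(weights)
    return grid, [w / total for w in weights]


def posterior_mean(grid, post):
    return sum(p * w for p, w in zip(grid, post))


def credible_interval(grid, post, level=0.95):
    """Equal-tailed interval read off the cumulative grid posterior."""
    lo_cut, hi_cut = (1.0 - level) / 2.0, 1.0 - (1.0 - level) / 2.0
    cum, lower, upper = 0.0, grid[0], grid[-1]
    found_lower = False
    for p, w in zip(grid, post):
        cum += w
        if not found_lower and cum >= lo_cut:
            lower, found_lower = p, True
        if cum >= hi_cut:
            upper = p
            break
    return lower, upper
```

A call such as `grid_posterior(x=6, m=40, k=5, se=0.9, sp=0.98)` returns the grid and normalized weights; the grid approach only works because there is a single unknown, whereas a sampler becomes useful once sensitivity and specificity are themselves given priors.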
The use and correlates of illicit silicone or “fillers” in a population-based sample of transwomen, San Francisco, 2013

PubMed Central

Wilson, Erin; Rapues, Jenna; Jin, Harry; Raymond, H. Fisher

2014-01-01

Introduction There is a dearth of studies to quantify the use of illicit fillers by transwomen. Case studies of illicit filler injections have pointed to an array of serious health complications, including death. Aim The aim of this study was to determine the population prevalence of filler use among transwomen, and to identify correlations with filler use. Methods An analysis of data collected in 2013 with a population-based sample of 234 transwomen recruited using respondent driven sampling (RDS). We used RDS weights to conduct bivariate and multivariate analyses of correlates of filler use. Main Outcome measures Main outcome measures were an RDS-weighted population prevalence of filler use among transwomen and differences in demographic characteristics, transition-related care factors and self-esteem related to appearance. Results Weighted filler prevalence among transwomen was 16.7%. Being a transwoman between 30–49 years of age, owning/renting or living with a partner/family/friend, having had and planning to have surgery in the future and having used non-prescribed hormones were all associated with filler use. HIV was not associated with filler use. Conclusions This study provides the first known estimate to date of the prevalence of filler use in a population-based sample of transwomen in San Francisco. Accessing illicit fillers may be the only choice available for many transwomen due to the cost of legal surgeries and other procedures to change one’s appearance. An important next step in this research is to determine the overall prevalence and long-term consequences of filler use among transwomen, to explore how the use of fillers is protective to the safety and wellbeing of transwomen, and to find safe and affordable alternatives to this method that meet important gender-related appearance needs. PMID:24810672
Noninvasive methods for dynamic mapping of microbial populations across the landscape

NASA Astrophysics Data System (ADS)

Meredith, L. K.; Sengupta, A.; Troch, P. A.; Volkmann, T. H. M.

2017-12-01

Soil microorganisms drive key ecosystem processes, and yet characterizing their distribution and activity in soil has been notoriously difficult. This is due, in part, to the heterogeneous nature of their response to changing environmental and nutrient conditions across time and space. These dynamics are challenging to constrain in both natural and experimental systems because of sampling difficulty and constraints. For example, soil microbial sampling at the Landscape Evolution Observatory (LEO) infrastructure in Biosphere 2 is limited in efforts to minimize soil disruption to the long-term experiment that aims to characterize the interacting biological, hydrological, and geochemical processes driving soil evolution. In this and other systems, new methods are needed to monitor soil microbial communities and their genetic potential over time. In this study, we take advantage of the well-defined boundary conditions on hydrological flow at LEO to develop a new method to nondestructively characterize in situ microbial populations. In our approach, we sample microbes from the seepage flow at the base of each of three replicate LEO hillslopes and use hydrological models to 'map back' in situ microbial populations. Over the course of a 3-month periodic rainfall experiment we collected samples from the LEO outflow for DNA extraction and microbial community composition analysis. These data will be used to describe changes in microbial community composition over the course of the experiment. In addition, we will use hydrological flow models to identify the changing source region of discharge water over the course of periodic rainfall pulses, thereby mapping back microbial populations onto their geographic origin in the slope. These predictions of in situ microbial populations will be ground-truthed against those derived from destructive soil sampling at the beginning and end of the rainfall experiment.
Our results will show the suitability of this method for long-term, non-destructive monitoring of the microbial communities that contribute to soil evolution in this large-scale model system. Furthermore, this method may be useful for other study systems with limitations to destructive sampling including other model infrastructures and natural landscapes.
Simultaneous genomic identification and profiling of a single cell using semiconductor-based next generation sequencing.

PubMed

Watanabe, Manabu; Kusano, Junko; Ohtaki, Shinsaku; Ishikura, Takashi; Katayama, Jin; Koguchi, Akira; Paumen, Michael; Hayashi, Yoshiharu

2014-09-01

Combining single-cell methods and next-generation sequencing should provide a powerful means to understand single-cell biology and obviate the effects of sample heterogeneity. Here we report a single-cell identification method and seamless cancer gene profiling using semiconductor-based massively parallel sequencing. A549 cells (adenocarcinomic human alveolar basal epithelial cell line) were used as a model. Single-cell capture was performed using laser capture microdissection (LCM) with an Arcturus® XT system, and a captured single cell and a bulk population of A549 cells (≈ 10^6 cells) were subjected to whole genome amplification (WGA). For cell identification, a multiplex PCR method (AmpliSeq™ SNP HID panel) was used to enrich 136 highly discriminatory SNPs with a genotype concordance probability of 10^(31-35). For cancer gene profiling, we used mutation profiling that was performed in parallel using a hotspot panel for 50 cancer-related genes. Sequencing was performed using a semiconductor-based bench top sequencer. The distribution of sequence reads for both HID and Cancer panel amplicons was consistent across these samples. For the bulk population of cells, the percentages of sequence covered at coverage of more than 100× were 99.04% for the HID panel and 98.83% for the Cancer panel, while for the single cell percentages of sequence covered at coverage of more than 100× were 55.93% for the HID panel and 65.96% for the Cancer panel. Partial amplification failure or randomly distributed non-amplified regions across samples from single cells during the WGA procedures or random allele drop out probably caused these differences. However, comparative analyses showed that this method successfully discriminated a single A549 cancer cell from a bulk population of A549 cells. Thus, our approach provides a powerful means to overcome tumor sample heterogeneity when searching for somatic mutations.
A modified approach to estimating sample size for simple logistic regression with one continuous covariate.

PubMed

Novikov, I; Fund, N; Freedman, L S

2010-01-15

Different methods for the calculation of sample size for simple logistic regression (LR) with one normally distributed continuous covariate give different results. Sometimes the difference can be large. Furthermore, some methods require the user to specify the prevalence of cases when the covariate equals its population mean, rather than the more natural population prevalence. We focus on two commonly used methods and show through simulations that the power for a given sample size may differ substantially from the nominal value for one method, especially when the covariate effect is large, while the other method performs poorly if the user provides the population prevalence instead of the required parameter. We propose a modification of the method of Hsieh et al. that requires specification of the population prevalence and that employs Schouten's sample size formula for a t-test with unequal variances and group sizes. This approach appears to increase the accuracy of the sample size estimates for LR with one continuous covariate.
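For orientation, the sketch below implements the simple approximation usually attributed to Hsieh et al. (1998) for one normally distributed covariate, not the modified method proposed in the abstract. It takes as input the event probability at the covariate mean, which is exactly the less natural parameter the abstract refers to. The function and parameter names are illustrative.

```python
import math
from statistics import NormalDist


def hsieh_sample_size(p_at_mean, odds_ratio_per_sd, alpha=0.05, power=0.80):
    """Approximate total sample size for simple logistic regression with one
    standardized, normally distributed covariate.

    p_at_mean         -- event probability when the covariate equals its mean
                         (not the overall population prevalence)
    odds_ratio_per_sd -- odds ratio for a one-standard-deviation increase
    """
    z = NormalDist()
    z_alpha = z.inv_cdf(1.0 - alpha / 2.0)    # two-sided test
    z_beta = z.inv_cdf(power)
    beta_star = math.log(odds_ratio_per_sd)   # log odds ratio per SD
    n = (z_alpha + z_beta) ** 2 / (p_at_mean * (1.0 - p_at_mean) * beta_star ** 2)
    return math.ceil(n)


# Hypothetical design: 50% events at the covariate mean, odds ratio 1.5 per SD.
n_required = hsieh_sample_size(0.5, 1.5)
```

Passing the overall prevalence in place of `p_at_mean` changes the answer whenever the two differ, which happens when the covariate effect is large; that mismatch is the failure mode the abstract describes.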
Relationship of tooth wear to chronological age among indigenous Amazon populations.

PubMed

Vieira, Elma Pinto; Barbosa, Mayara Silva; Quintão, Cátia Cardoso Abdo; Normando, David

2015-01-01

In indigenous populations, age can be estimated based on family structure and physical examination. However, the accuracy of such methods is questionable. The aim of this cross-sectional study was to evaluate occlusal tooth wear related to estimated age in the remote indigenous populations of the Xingu River, Amazon. Two hundred and twenty three semi-isolated indigenous subjects with permanent dentition from the Arara (n = 117), Xicrin-Kayapó (n = 60) and Assurini (n = 46) villages were examined. The control group consisted of 40 non-indigenous individuals living in an urban area in the Amazon basin (Belem). A modified tooth wear index was applied and then associated with chronological age by linear regression analysis. A strong association was found between tooth wear and chronological age in the indigenous populations (p <0.001). Tooth wear measurements were able to explain 86% of the variation in the ages of the Arara sample, 70% of the Xicrin-Kaiapó sample and 65% of the Assurini sample. In the urban control sample, only 12% of ages could be determined by tooth wear. These findings suggest that tooth wear is a poor estimator of chronological age in the urban population; however, it has a strong association with age for the more remote indigenous populations. Consequently, these findings suggest that a simple tooth wear evaluation method, as described and applied in this study, can be used to provide a straightforward and efficient means to assist in age determination of newly contacted indigenous groups.
Relationship of Tooth Wear to Chronological Age among Indigenous Amazon Populations

PubMed Central

Vieira, Elma Pinto; Barbosa, Mayara Silva; Quintão, Cátia Cardoso Abdo; Normando, David

2015-01-01

In indigenous populations, age can be estimated based on family structure and physical examination. However, the accuracy of such methods is questionable. The aim of this cross-sectional study was to evaluate occlusal tooth wear related to estimated age in the remote indigenous populations of the Xingu River, Amazon. Two hundred and twenty three semi-isolated indigenous subjects with permanent dentition from the Arara (n = 117), Xicrin-Kayapó (n = 60) and Assurini (n = 46) villages were examined. The control group consisted of 40 non-indigenous individuals living in an urban area in the Amazon basin (Belem). A modified tooth wear index was applied and then associated with chronological age by linear regression analysis. A strong association was found between tooth wear and chronological age in the indigenous populations (p <0.001). Tooth wear measurements were able to explain 86% of the variation in the ages of the Arara sample, 70% of the Xicrin-Kaiapó sample and 65% of the Assurini sample. In the urban control sample, only 12% of ages could be determined by tooth wear. These findings suggest that tooth wear is a poor estimator of chronological age in the urban population; however, it has a strong association with age for the more remote indigenous populations. Consequently, these findings suggest that a simple tooth wear evaluation method, as described and applied in this study, can be used to provide a straightforward and efficient means to assist in age determination of newly contacted indigenous groups. PMID:25602501
Zoonoses research in the German National Cohort : feasibility of parallel sampling of pets and owners.

PubMed

Hille, Katja; Möbius, Nadine; Akmatov, Manas K; Verspohl, Jutta; Rabold, Denise; Hartmann, Maria; Günther, Kathrin; Obi, Nadia; Kreienbrock, Lothar

2014-11-01

Cats and dogs live in more than 20 % of German households and the contact between these pets and their owners can be very close. Therefore, a transmission of zoonotic pathogens may occur. To investigate whether zoonotic research questions can be examined in the context of population-based studies like the German National Cohort (GNC), two studies on different study populations were conducted as part of the feasibility tests of the GNC. The aim of the first study was to quantify the actual exposure of participants of the GNC to cats and dogs. In the second study summarised here the feasibility of the sampling of cats and dogs by their owners was tested. To quantify the exposure of participants of the GNC to cats and dogs 744 study participants of the Pretests of the GNC were asked whether they had contact with animals. Currently 10 % have a dog and 14 % have a cat in their household. These figures confirm that a large proportion of the German population has contact with pets and that there is a need for further zoonoses research. To establish the collection of biological samples from cats and dogs in the context of large-scale population-based studies feasible methods are needed. Therefore, a study was conducted to test whether pet owners can take samples from their cats and dogs and whether the quality of these samples is comparable to samples taken by a qualified veterinarian. A total of 82 dog and 18 cat owners were recruited in two veterinary practices in Hannover and the Clinic for Small Animals at the University of Veterinary Medicine in Hannover. Sampling instructions and sample material for nasal and buccal swabs, faecal samples and, in the case of cat owners, a brush for fur samples, were given to the pet owners. The pet owners were asked to take the samples from their pets at home and to send the samples by surface mail. Swab samples were cultured and bacterial growth was quantified independent of bacterial species. 
The growth of Gram-positive and Gram-negative bacteria from samples taken by the veterinarian and the pet owners were compared. For Gram-positive bacteria the agreement of laboratory results was 71 % for nasal swabs and 78 % for oral swabs while for Gram-negative bacteria the agreement of laboratory results was 55 % for nasal swabs and 87 % for oral swabs. In conclusion it has been shown that participants of the GNC are exposed to cats and dogs and that the sampling of cats and dogs by their owners is a feasible method which can be a useful tool for zoonoses research in population-based studies.
Sampling strategies for estimating brook trout effective population size

Treesearch

Andrew R. Whiteley; Jason A. Coombs; Mark Hudy; Zachary Robinson; Keith H. Nislow; Benjamin H. Letcher

2012-01-01

The influence of sampling strategy on estimates of effective population size (Ne) from single-sample genetic methods has not been rigorously examined, though these methods are increasingly used. For headwater salmonids, spatially close kin association among age-0 individuals suggests that sampling strategy (number of individuals and location from...
Estimating mean change in population salt intake using spot urine samples.

PubMed

Petersen, Kristina S; Wu, Jason H Y; Webster, Jacqui; Grimes, Carley; Woodward, Mark; Nowson, Caryl A; Neal, Bruce

2017-10-01

Spot urine samples are easier to collect than 24-h urine samples and have been used with estimating equations to derive the mean daily salt intake of a population. Whether equations using data from spot urine samples can also be used to estimate change in mean daily population salt intake over time is unknown. We compared estimates of change in mean daily population salt intake based upon 24-h urine collections with estimates derived using equations based on spot urine samples. Paired and unpaired 24-h urine samples and spot urine samples were collected from individuals in two Australian populations, in 2011 and 2014. Estimates of change in daily mean population salt intake between 2011 and 2014 were obtained directly from the 24-h urine samples and by applying established estimating equations (Kawasaki, Tanaka, Mage, Toft, INTERSALT) to the data from spot urine samples. Differences between 2011 and 2014 were calculated using mixed models. A total of 1000 participants provided a 24-h urine sample and a spot urine sample in 2011, and 1012 did so in 2014 (paired samples n = 870; unpaired samples n = 1142). The participants were community-dwelling individuals living in the State of Victoria or the town of Lithgow in the State of New South Wales, Australia, with a mean age of 55 years in 2011. The mean (95% confidence interval) difference in population salt intake between 2011 and 2014 determined from the 24-h urine samples was -0.48 g/day (-0.74 to -0.21; P < 0.001). The corresponding result estimated from the spot urine samples was -0.24 g/day (-0.42 to -0.06; P = 0.01) using the Tanaka equation, -0.42 g/day (-0.70 to -0.13; P = 0.004) using the Kawasaki equation, -0.51 g/day (-1.00 to -0.01; P = 0.046) using the Mage equation, -0.26 g/day (-0.42 to -0.10; P = 0.001) using the Toft equation, -0.20 g/day (-0.32 to -0.09; P = 0.001) using the INTERSALT equation and -0.27 g/day (-0.39 to -0.15; P < 0.001) using the INTERSALT equation with potassium.
There was no evidence that the changes detected by the 24-h collections and estimating equations were different (all P > 0.058). Separate analysis of the unpaired and paired data showed that detection of change by the estimating equations was observed only in the paired data. All the estimating equations based upon spot urine samples identified a similar change in daily salt intake to that detected by the 24-h urine samples. Methods based upon spot urine samples may provide an approach to measuring change in mean population salt intake, although further investigation in larger and more diverse population groups is required. © The Author 2016; all rights reserved. Published by Oxford University Press on behalf of the International Epidemiological Association
Sampling Methods and the Accredited Population in Athletic Training Education Research

ERIC Educational Resources Information Center

Carr, W. David; Volberding, Jennifer

2009-01-01

Context: We describe methods of sampling the widely-studied, yet poorly defined, population of accredited athletic training education programs (ATEPs). Objective: There are two purposes to this study; first to describe the incidence and types of sampling methods used in athletic training education research, and second to clearly define the…
Classifier performance prediction for computer-aided diagnosis using a limited dataset.

PubMed

Sahiner, Berkman; Chan, Heang-Ping; Hadjiiski, Lubomir

2008-04-01

In a practical classifier design problem, the true population is generally unknown and the available sample is finite-sized. A common approach is to use a resampling technique to estimate the performance of the classifier that will be trained with the available sample. We conducted a Monte Carlo simulation study to compare the ability of the different resampling techniques in training the classifier and predicting its performance under the constraint of a finite-sized sample. The true population for the two classes was assumed to be multivariate normal distributions with known covariance matrices. Finite sets of sample vectors were drawn from the population. The true performance of the classifier is defined as the area under the receiver operating characteristic curve (AUC) when the classifier designed with the specific sample is applied to the true population. We investigated methods based on the Fukunaga-Hayes and the leave-one-out techniques, as well as three different types of bootstrap methods, namely, the ordinary, 0.632, and 0.632+ bootstrap. The Fisher's linear discriminant analysis was used as the classifier. The dimensionality of the feature space was varied from 3 to 15. The sample size n2 from the positive class was varied between 25 and 60, while the number of cases from the negative class was either equal to n2 or 3n2. Each experiment was performed with an independent dataset randomly drawn from the true population. Using a total of 1000 experiments for each simulation condition, we compared the bias, the variance, and the root-mean-squared error (RMSE) of the AUC estimated using the different resampling techniques relative to the true AUC (obtained from training on a finite dataset and testing on the population). 
Our results indicated that, under the study conditions, there can be a large difference in the RMSE obtained using different resampling methods, especially when the feature space dimensionality is relatively large and the sample size is small. Under this type of conditions, the 0.632 and 0.632+ bootstrap methods have the lowest RMSE, indicating that the difference between the estimated and the true performances obtained using the 0.632 and 0.632+ bootstrap will be statistically smaller than those obtained using the other three resampling methods. Of the three bootstrap methods, the 0.632+ bootstrap provides the lowest bias. Although this investigation is performed under some specific conditions, it reveals important trends for the problem of classifier performance prediction under the constraint of a limited dataset.
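The 0.632 bootstrap weighting mentioned above can be sketched for an AUC estimate as follows. The block uses a two-feature Fisher linear discriminant written out by hand, resamples within each class (an assumption), and combines the resubstitution AUC with the average out-of-bag AUC using the 0.368 and 0.632 weights. It is a simplified illustration of the idea, not the study's protocol, and every name in it is hypothetical.

```python
import random


def auc(neg_scores, pos_scores):
    """Area under the ROC curve as the Mann-Whitney statistic."""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            wins += 1.0 if p > n else 0.5 if p == n else 0.0
    return wins / (len(pos_scores) * len(neg_scores))


def fisher_lda_2d(neg, pos):
    """Fisher discriminant direction w = Sw^-1 (m_pos - m_neg), two features."""
    def mean(rows):
        return [sum(c) / len(rows) for c in zip(*rows)]
    m0, m1 = mean(neg), mean(pos)
    sxx = sxy = syy = 0.0                      # pooled within-class scatter
    for rows, m in ((neg, m0), (pos, m1)):
        for x, y in rows:
            dx, dy = x - m[0], y - m[1]
            sxx += dx * dx
            sxy += dx * dy
            syy += dy * dy
    det = sxx * syy - sxy * sxy
    d0, d1 = m1[0] - m0[0], m1[1] - m0[1]
    return ((syy * d0 - sxy * d1) / det, (sxx * d1 - sxy * d0) / det)


def project(rows, w):
    return [w[0] * x + w[1] * y for x, y in rows]


def bootstrap_632_auc(neg, pos, n_boot=200, seed=0):
    rng = random.Random(seed)
    w = fisher_lda_2d(neg, pos)
    resub = auc(project(neg, w), project(pos, w))   # optimistic estimate
    oob_aucs = []
    for _ in range(n_boot):
        idx0 = [rng.randrange(len(neg)) for _ in neg]
        idx1 = [rng.randrange(len(pos)) for _ in pos]
        seen0, seen1 = set(idx0), set(idx1)
        out0 = [r for i, r in enumerate(neg) if i not in seen0]
        out1 = [r for i, r in enumerate(pos) if i not in seen1]
        if not out0 or not out1:
            continue                           # no out-of-bag cases to test on
        wb = fisher_lda_2d([neg[i] for i in idx0], [pos[i] for i in idx1])
        oob_aucs.append(auc(project(out0, wb), project(out1, wb)))
    oob = sum(oob_aucs) / len(oob_aucs)        # pessimistic estimate
    return 0.368 * resub + 0.632 * oob


def simulate_class(n, shift, rng):
    """Hypothetical data: two independent unit-variance normal features."""
    return [(rng.gauss(shift, 1.0), rng.gauss(shift, 1.0)) for _ in range(n)]
```

The resubstitution term is biased upward because the classifier is tested on its own training cases, while the out-of-bag term is biased downward because each bootstrap classifier sees fewer distinct cases; the fixed weights are a compromise between the two.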

Evaluation and comparison of FTA card and CTAB DNA extraction methods for non-agricultural taxa.

PubMed

Siegel, Chloe S; Stevenson, Florence O; Zimmer, Elizabeth A

2017-02-01

An efficient, effective DNA extraction method is necessary for comprehensive analysis of plant genomes. This study analyzed the quality of DNA obtained using paper FTA cards prepared directly in the field when compared to the more traditional cetyltrimethylammonium bromide (CTAB)-based extraction methods from silica-dried samples. DNA was extracted using FTA cards according to the manufacturer's protocol. In parallel, CTAB-based extractions were done using the automated AutoGen DNA isolation system. DNA quality for both methods was determined for 15 non-agricultural species collected in situ, by gel separation, spectrophotometry, fluorometry, and successful amplification and sequencing of nuclear and chloroplast gene markers. The FTA card extraction method yielded less concentrated, but also less fragmented samples than the CTAB-based technique. The card-extracted samples provided DNA that could be successfully amplified and sequenced. The FTA cards are also useful because the collected samples do not require refrigeration, extensive laboratory expertise, or as many hazardous chemicals as extractions using the CTAB-based technique. The relative success of the FTA card method in our study suggested that this method could be a valuable tool for studies in plant population genetics and conservation biology that may involve screening of hundreds of individual plants. The FTA cards, like the silica gel samples, do not contain plant material capable of propagation, and therefore do not require permits from the U.S. Department of Agriculture (USDA) Animal and Plant Health Inspection Service (APHIS) for transportation.
DNA-based approach to aging martens (Martes americana and M. caurina)

Treesearch

Jonathan N. Pauli; John P. Whiteman; Bruce G. Marcot; Terry M. McClean; Merav Ben-David

2011-01-01

Demographic structure is central to understanding the dynamics of animal populations. However, determining the age of free-ranging mammals is difficult, and currently impossible when sampling with noninvasive, genetic-based approaches. We present a method to estimate age class by combining measures of telomere lengths with other biologically meaningful covariates in a...
Estimating survival of precocial chicks during the prefledging period using a catch-curve analysis and count-based age-class data

USGS Publications Warehouse

McGowan, C.P.; Millspaugh, J.J.; Ryan, M.R.; Kruse, C.D.; Pavelka, G.

2009-01-01

Estimating reproductive success for birds with precocial young can be difficult because chicks leave nests soon after hatching and individuals or broods can be difficult to track. Researchers often turn to estimating survival during the prefledging period and, though effective, mark-recapture based approaches are not always feasible due to cost, time, and animal welfare concerns. Using a threatened population of Piping Plovers (Charadrius melodus) that breeds along the Missouri River, we present an approach for estimating chick survival during the prefledging period using long-term (1993-2005), count-based, age-class data. We used a modified catch-curve analysis, and data collected during three 5-day sampling periods near the middle of the breeding season. The approach has several ecological and statistical assumptions and our analyses were designed to minimize the probability of violating those assumptions. For example, limiting the sampling periods to only 5 days gave reasonable assurance that population size was stable during the sampling period. Annual daily survival estimates ranged from 0.825 (SD = 0.03) to 0.931 (0.02) depending on year and sampling period, with these estimates assuming constant survival during the prefledging period and no change in the age structure of the population. The average probability of survival to fledging ranged from 0.126 to 0.188. Our results are similar to other published estimates for this species in similar habitats. This method of estimating chick survival may be useful for a variety of precocial bird species when mark-recapture methods are not feasible and only count-based age class data are available. © 2009 Association of Field Ornithologists.
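The basic catch-curve idea can be written in a few lines. Under the assumptions stated in the abstract (constant survival and a stable age structure), the count of chicks at age a is proportional to S^a, so the slope of a regression of log count on age estimates ln(S). The sketch below is the textbook, unmodified version; the age-class midpoints, counts, and fledging age are hypothetical.

```python
import math


def catch_curve_daily_survival(ages, counts):
    """Daily survival from count-by-age data: exp of the least-squares slope
    of ln(count) on age (ages in days, all counts > 0)."""
    logs = [math.log(c) for c in counts]
    n = len(ages)
    mean_age = sum(ages) / n
    mean_log = sum(logs) / n
    sxy = sum((a - mean_age) * (l - mean_log) for a, l in zip(ages, logs))
    sxx = sum((a - mean_age) ** 2 for a in ages)
    return math.exp(sxy / sxx)


def survival_to_fledging(daily_survival, days_to_fledging):
    """Probability of surviving the whole prefledging period, assuming the
    same daily survival on every day."""
    return daily_survival ** days_to_fledging


# Hypothetical age-class midpoints (days) and chick counts.
ages = [2, 7, 12, 17, 22]
counts = [120, 71, 40, 26, 14]
daily = catch_curve_daily_survival(ages, counts)
fledge = survival_to_fledging(daily, days_to_fledging=25)  # 25 days is an assumption
```

Because daily survival is raised to the power of the prefledging period, small differences in the daily estimate translate into large differences in survival to fledging.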
Estimating numbers of females with cubs-of-the-year in the Yellowstone grizzly bear population

USGS Publications Warehouse

Keating, K.A.; Schwartz, C.C.; Haroldson, M.A.; Moody, D.

2001-01-01

For grizzly bears (Ursus arctos horribilis) in the Greater Yellowstone Ecosystem (GYE), minimum population size and allowable numbers of human-caused mortalities have been calculated as a function of the number of unique females with cubs-of-the-year (FCUB) seen during a 3-year period. This approach underestimates the total number of FCUB, thereby biasing estimates of population size and sustainable mortality. Also, it does not permit calculation of valid confidence bounds. Many statistical methods can resolve or mitigate these problems, but there is no universal best method. Instead, relative performances of different methods can vary with population size, sample size, and degree of heterogeneity among sighting probabilities for individual animals. We compared 7 nonparametric estimators, using Monte Carlo techniques to assess performances over the range of sampling conditions deemed plausible for the Yellowstone population. Our goal was to estimate the number of FCUB present in the population each year. Our evaluation differed from previous comparisons of such estimators by including sample coverage methods and by treating individual sightings, rather than sample periods, as the sample unit. Consequently, our conclusions also differ from earlier studies. Recommendations regarding estimators and necessary sample sizes are presented, together with estimates of annual numbers of FCUB in the Yellowstone population with bootstrap confidence bounds.
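To illustrate how a nonparametric estimator corrects a raw count of unique animals, here is a minimal sketch of one widely used example, the bias-corrected Chao estimator. The abstract does not name the seven estimators compared, so choosing this one is an assumption made for illustration. The function name and the sighting data are hypothetical.

```python
from collections import Counter

def chao_bias_corrected(sightings_per_animal):
    """Bias-corrected Chao estimate: S_obs + f1 * (f1 - 1) / (2 * (f2 + 1)).

    f1 and f2 are the numbers of animals sighted exactly once and exactly twice.
    """
    seen = [c for c in sightings_per_animal if c > 0]
    freq = Counter(seen)
    f1, f2 = freq[1], freq[2]
    return len(seen) + f1 * (f1 - 1) / (2 * (f2 + 1))

# Hypothetical data: number of sightings for each unique female observed
sightings = [1, 1, 1, 1, 2, 2, 3, 5]
estimate = chao_bias_corrected(sightings)  # 8 observed + 4*3 / (2*3) = 10.0
```

The many animals seen only once signal that others were probably missed entirely, so the estimate exceeds the eight unique animals actually observed.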
Standard methods for sampling North American freshwater fishes

USGS Publications Warehouse

Bonar, Scott A.; Hubert, Wayne A.; Willis, David W.

2009-01-01

This important reference book provides standard sampling methods recommended by the American Fisheries Society for assessing and monitoring freshwater fish populations in North America. Methods apply to ponds, reservoirs, natural lakes, and streams and rivers containing cold and warmwater fishes. Range-wide and eco-regional averages for indices of abundance, population structure, and condition for individual species are supplied to facilitate comparisons of standard data among populations. Provides information on converting nonstandard to standard data, statistical and database procedures for analyzing and storing standard data, and methods to prevent transfer of invasive species while sampling.
Modelling population distribution using remote sensing imagery and location-based data

NASA Astrophysics Data System (ADS)

Song, J.; Prishchepov, A. V.

2017-12-01

A detailed spatial distribution of population density is essential for urban studies such as urban planning, environmental pollution and city emergency management, and for estimating pressure on the environment and human exposure and health risks. However, most studies rely on census data because detailed, dynamic population distributions are difficult to acquire, especially at the microscale. This research describes a method that uses remote sensing imagery and location-based data to model population distribution at the functional zone level. First, urban functional zones within a city were mapped using high-resolution remote sensing images and points of interest (POIs). The functional zone extraction workflow has five parts: (1) urban land use classification; (2) segmentation of images in built-up areas; (3) identification of functional segments by POIs; (4) identification of functional blocks by functional segmentation and weight coefficients; (5) accuracy assessment with validation points. The result is shown in Fig. 1. Second, we applied ordinary least squares (OLS) and geographically weighted regression (GWR) to assess the spatially nonstationary relationship between light digital number (DN) and population density at sampling points. The two methods were used to predict the population distribution over the research area. The R² values of the GWR model were on the order of 0.7 and typically varied more across the region than those of the traditional OLS model. The result is shown in Fig. 2. Validation with sampling points of population density demonstrated that the result predicted by the GWR model correlated well with light value. The result is shown in Fig. 3. Results showed that: (1) population density is not linearly correlated with light brightness under a global model; (2) VIIRS night-time light data, integrated with functional zones, can estimate population density at the city level; (3) GWR is a robust model for mapping population distribution: the adjusted R² values of the GWR models were higher than those of the optimal OLS models, confirming that GWR models offer better prediction accuracy. This method therefore provides detailed population density information for microscale city studies.
Self-Harmful Behaviors in a Population-Based Sample of Young Adults

ERIC Educational Resources Information Center

Nada-Raja, Shyamala; Skegg, Keren; Langley, John; Morrison, Dianne; Sowerby, Paula

2004-01-01

A birth cohort of 472 women and 494 men aged 26 years was interviewed about a range of self-harmful behaviors first and then asked about suicidal intent. Lifetime prevalence of self-harm using traditional methods of suicide (ICD [International Classification of Diseases] self-harm) was 13%, with 9% of the sample describing at least one such…
Vipie: web pipeline for parallel characterization of viral populations from multiple NGS samples.

PubMed

Lin, Jake; Kramna, Lenka; Autio, Reija; Hyöty, Heikki; Nykter, Matti; Cinek, Ondrej

2017-05-15

Next generation sequencing (NGS) technology allows laboratories to investigate virome composition in clinical and environmental samples in a culture-independent way. There is a need for bioinformatic tools capable of parallel processing of virome sequencing data by exactly identical methods: this is especially important in studies of multifactorial diseases, or in parallel comparison of laboratory protocols. We have developed a web-based application allowing direct upload of sequences from multiple virome samples using custom parameters. The samples are then processed in parallel using an identical protocol, and can be easily reanalyzed. The pipeline performs de-novo assembly, taxonomic classification of viruses as well as sample analyses based on user-defined grouping categories. Tables of virus abundance are produced from cross-validation by remapping the sequencing reads to a union of all observed reference viruses. In addition, read sets and reports are created after processing unmapped reads against known human and bacterial ribosome references. Secured interactive results are dynamically plotted with population and diversity charts, clustered heatmaps and a sortable and searchable abundance table. The Vipie web application is a unique tool for multi-sample metagenomic analysis of viral data, producing searchable hits tables, interactive population maps, alpha diversity measures and clustered heatmaps that are grouped in applicable custom sample categories. Known references such as human genome and bacterial ribosomal genes are optionally removed from unmapped ('dark matter') reads. Secured results are accessible and shareable on modern browsers. Vipie is a freely available web-based tool whose code is open source.
Single gene-based distinction of individual microbial genomes from a mixed population of microbial cells.

PubMed

Tamminen, Manu V; Virta, Marko P J

2015-01-01

Recent progress in environmental microbiology has revealed vast populations of microbes in any given habitat that cannot be detected by conventional culturing strategies. The use of sensitive genetic detection methods such as CARD-FISH and in situ PCR have been limited by the cell wall permeabilization requirement that cannot be performed similarly on all cell types without lysing some and leaving some nonpermeabilized. Furthermore, the detection of low copy targets such as genes present in single copies in the microbial genomes, has remained problematic. We describe an emulsion-based procedure to trap individual microbial cells into picoliter-volume polyacrylamide droplets that provide a rigid support for genetic material and therefore allow complete degradation of cellular material to expose the individual genomes. The polyacrylamide droplets are subsequently converted into picoliter-scale reactors for genome amplification. The amplified genomes are labeled based on the presence of a target gene and differentiated from those that do not contain the gene by flow cytometry. Using the Escherichia coli strains XL1 and MC1061, which differ with respect to the presence (XL1), or absence (MC1061) of a single copy of a tetracycline resistance gene per genome, we demonstrate that XL1 genomes present at 0.1% of MC1061 genomes can be differentiated using this method. Using a spiked sediment microbial sample, we demonstrate that the method is applicable to highly complex environmental microbial communities as a target gene-based screen for individual microbes. The method provides a novel tool for enumerating functional cell populations in complex microbial communities. We envision that the method could be optimized for fluorescence-activated cell sorting to enrich genetic material of interest from complex environmental samples.
Sensitivity and specificity of normality tests and consequences on reference interval accuracy at small sample size: a computer-simulation study.

PubMed

Le Boedec, Kevin

2016-12-01

According to international guidelines, parametric methods must be chosen for reference interval (RI) construction when the sample size is small and the distribution is Gaussian. However, normality tests may not be accurate at small sample size. The purpose of the study was to evaluate normality test performance to properly identify samples extracted from a Gaussian population at small sample sizes, and assess the consequences on RI accuracy of applying parametric methods to samples that falsely identified the parent population as Gaussian. Samples of n = 60 and n = 30 values were randomly selected 100 times from simulated Gaussian, lognormal, and asymmetric populations of 10,000 values. The sensitivity and specificity of 4 normality tests were compared. Reference intervals were calculated using 6 different statistical methods from samples that falsely identified the parent population as Gaussian, and their accuracy was compared. Shapiro-Wilk and D'Agostino-Pearson tests were the best performing normality tests. However, their specificity was poor at sample size n = 30 (specificity for P < .05: .51 and .50, respectively). The best significance levels identified when n = 30 were 0.19 for Shapiro-Wilk test and 0.18 for D'Agostino-Pearson test. Using parametric methods on samples extracted from a lognormal population but falsely identified as Gaussian led to clinically relevant inaccuracies. At small sample size, normality tests may lead to erroneous use of parametric methods to build RI. Using nonparametric methods (or alternatively Box-Cox transformation) on all samples regardless of their distribution, or adjusting the significance level of normality tests depending on sample size, would limit the risk of constructing inaccurate RI. © 2016 American Society for Veterinary Clinical Pathology.
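The contrast between parametric and nonparametric reference intervals can be sketched as below. This is not the study's simulation: it draws a single hypothetical lognormal sample of n = 30 with a fixed seed and computes two of the possible interval types, using only the Python standard library. The function names are illustrative assumptions.

```python
import random
import statistics

def parametric_ri(values):
    """Mean +/- 1.96 SD; only appropriate when the data are roughly Gaussian."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)
    return mean - 1.96 * sd, mean + 1.96 * sd

def nonparametric_ri(values):
    """2.5th and 97.5th percentiles, taken as the outer cut points of 40 quantiles."""
    cuts = statistics.quantiles(values, n=40, method="inclusive")
    return cuts[0], cuts[-1]

rng = random.Random(0)  # fixed seed so the sketch is repeatable
sample = [rng.lognormvariate(0.0, 1.0) for _ in range(30)]  # skewed, strictly positive

par_low, par_high = parametric_ri(sample)
np_low, np_high = nonparametric_ri(sample)
```

On skewed data the parametric lower limit can fall below any value the analyte could take, whereas the percentile-based limits always stay within the observed range. With only 30 values the percentile estimates are themselves imprecise.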
Identifying DNA Methylation Biomarkers for Non-Endoscopic Detection of Barrett’s Esophagus

PubMed Central

Moinova, Helen R.; LaFramboise, Thomas; Lutterbaugh, James D.; Chandar, Apoorva Krishna; Dumot, John; Faulx, Ashley; Brock, Wendy; De la Cruz Cabrera, Omar; Guda, Kishore; Barnholtz-Sloan, Jill S.; Iyer, Prasad G.; Canto, Marcia I.; Wang, Jean S.; Shaheen, Nicholas J.; Thota, Prashanti N.; Willis, Joseph E.; Chak, Amitabh; Markowitz, Sanford D.

2018-01-01

We report a biomarker-based non-endoscopic method for detecting Barrett’s esophagus (BE), based on detecting methylated DNAs retrieved via a swallowable balloon-based esophageal sampling device. BE is the precursor of, and a major recognized risk factor for, developing esophageal adenocarcinoma (EAC). Endoscopy, the current standard for BE detection, is not cost-effective for population screening. We performed genome-wide screening to ascertain regions targeted for recurrent aberrant cytosine methylation in BE, identifying high-frequency methylation within the CCNA1 locus. We tested CCNA1 DNA methylation as a BE biomarker in cytology brushings of the distal esophagus from 173 individuals with or without BE. CCNA1 DNA methylation demonstrated an area under the curve (AUC)=0.95 for discriminating BE-related metaplasia and neoplasia cases versus normal individuals, performing identically to methylation of VIM DNA, an established BE biomarker. When combined, the resulting two biomarker panel was 95% sensitive and 91% specific. These results were replicated in an independent validation cohort of 149 individuals, who were assayed using the same cutoff values for test positivity established in the training population. To progress toward non-endoscopic esophageal screening, we engineered a well-tolerated, swallowable, encapsulated balloon device able to selectively sample the distal esophagus within 5 minutes. In balloon samples from 86 individuals, tests of CCNA1 plus VIM DNA methylation detected BE metaplasia with 90.3% sensitivity and 91.7% specificity. Combining the balloon sampling device with molecular assays of CCNA1 plus VIM DNA methylation enables an efficient, well-tolerated, sensitive, and specific method of screening at-risk populations for BE. PMID:29343623
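For readers less familiar with the accuracy measures quoted above, sensitivity and specificity follow directly from the counts of true and false test results. The sketch below uses made-up counts, not the study's data, and the function name is hypothetical.

```python
def sensitivity_specificity(true_pos, false_neg, true_neg, false_pos):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    sensitivity = true_pos / (true_pos + false_neg)
    specificity = true_neg / (true_neg + false_pos)
    return sensitivity, specificity

# Hypothetical cohort: 100 individuals with the condition, 100 without
sens, spec = sensitivity_specificity(true_pos=90, false_neg=10, true_neg=80, false_pos=20)
```

Both measures depend on the cutoff used to call a test positive, which is why the validation cohort in the abstract was assayed with the cutoffs fixed in the training population.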
Incorporating precision, accuracy and alternative sampling designs into a continental monitoring program for colonial waterbirds

USGS Publications Warehouse

Steinkamp, Melanie J.; Peterjohn, B.G.; Keisman, J.L.

2003-01-01

A comprehensive monitoring program for colonial waterbirds in North America has never existed. At smaller geographic scales, many states and provinces conduct surveys of colonial waterbird populations. Periodic regional surveys are conducted at varying times during the breeding season using a variety of survey methods, which complicates attempts to estimate population trends for most species. The US Geological Survey Patuxent Wildlife Research Center has recently started to coordinate colonial waterbird monitoring efforts throughout North America. A centralized database has been developed with an Internet-based data entry and retrieval page. The extent of existing colonial waterbird surveys has been defined, allowing gaps in coverage to be identified and basic inventories completed where desirable. To enable analyses of comparable data at regional or larger geographic scales, sampling populations through statistically sound sampling designs should supersede obtaining counts at every colony. Standardized breeding season survey techniques have been agreed upon and documented in a monitoring manual. Each survey in the manual has associated with it recommendations for bias estimation, and includes specific instructions on measuring detectability. The methods proposed in the manual are for developing reliable, comparable indices of population size to establish trend information at multiple spatial and temporal scales, but they will not result in robust estimates of total population numbers.
Inferring population history with DIY ABC: a user-friendly approach to approximate Bayesian computation

PubMed Central

Cornuet, Jean-Marie; Santos, Filipe; Beaumont, Mark A.; Robert, Christian P.; Marin, Jean-Michel; Balding, David J.; Guillemaud, Thomas; Estoup, Arnaud

2008-01-01

Summary: Genetic data obtained on population samples convey information about their evolutionary history. Inference methods can extract part of this information but they require sophisticated statistical techniques that have been made available to the biologist community (through computer programs) only for simple and standard situations typically involving a small number of samples. We propose here a computer program (DIY ABC) for inference based on approximate Bayesian computation (ABC), in which scenarios can be customized by the user to fit many complex situations involving any number of populations and samples. Such scenarios involve any combination of population divergences, admixtures and population size changes. DIY ABC can be used to compare competing scenarios, estimate parameters for one or more scenarios and compute bias and precision measures for a given scenario and known values of parameters (the current version applies to unlinked microsatellite data). This article describes key methods used in the program and provides its main features. The analysis of one simulated and one real dataset, both with complex evolutionary scenarios, illustrates the main possibilities of DIY ABC. Availability: The software DIY ABC is freely available at http://www.montpellier.inra.fr/CBGP/diyabc. Contact: j.cornuet@imperial.ac.uk Supplementary information: Supplementary data are also available at http://www.montpellier.inra.fr/CBGP/diyabc PMID:18842597
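The core of approximate Bayesian computation can be illustrated with a toy rejection sampler. This is a generic sketch, not the DIY ABC program's algorithm or interface. It assumes a uniform prior on a single success probability and uses a simulated binomial count as the summary statistic. All names and numbers are hypothetical.

```python
import random

def abc_rejection(observed, n_trials, n_draws, tolerance, rng):
    """Keep prior draws whose simulated summary lands within `tolerance` of the data."""
    accepted = []
    for _ in range(n_draws):
        p = rng.random()  # draw the parameter from a uniform(0, 1) prior
        simulated = sum(rng.random() < p for _ in range(n_trials))  # simulate a dataset
        if abs(simulated - observed) <= tolerance:
            accepted.append(p)
    return accepted

rng = random.Random(1)  # fixed seed so the sketch is repeatable
posterior = abc_rejection(observed=30, n_trials=100, n_draws=5000, tolerance=2, rng=rng)
posterior_mean = sum(posterior) / len(posterior)
```

The accepted draws approximate the posterior without ever evaluating a likelihood, which is what makes the approach attractive for complex demographic scenarios where the likelihood is intractable.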
"The solution needs to be complex." Obese adults' attitudes about the effectiveness of individual and population based interventions for obesity

PubMed Central

2010-01-01

Background Previous studies of public perceptions of obesity interventions have been quantitative and based on general population surveys. This study aims to explore the opinions and attitudes of obese individuals towards population and individual interventions for obesity in Australia. Methods Qualitative methods using in-depth semi-structured telephone interviews with a community sample of obese adults (Body Mass Index ≥30). Theoretical, purposive and strategic recruitment techniques were used to ensure a broad sample of obese individuals with different types of experiences with their obesity. Participants were asked about their attitudes towards three population based interventions (regulation, media campaigns, and public health initiatives) and three individual interventions (tailored fitness programs, commercial dieting, and gastric banding surgery), and the effectiveness of these interventions. Results One hundred and forty two individuals (19-75 years) were interviewed. Participants strongly supported non-commercial interventions that were focused on encouraging individuals to make healthy lifestyle changes (regulation, physical activity programs, and public health initiatives). There was less support for interventions perceived to be invasive or high risk (gastric band surgery), stigmatising (media campaigns), or commercially motivated and promoting weight loss techniques (commercial diets and gastric banding surgery). Conclusion Obese adults support non-commercial, non-stigmatising interventions which are designed to improve lifestyles, rather than promote weight loss. PMID:20633250
The allele combinations of three loci based on, liver, stomach cancers, hematencephalon, COPD and normal population: A preliminary study.

PubMed

Gai, Liping; Liu, Hui; Cui, Jing-Hui; Yu, Weijian; Ding, Xiao-Dong

2017-03-20

The purpose of this study was to examine the specific allele combinations of three loci associated with liver cancer, stomach cancer, hematencephalon and chronic obstructive pulmonary disease (COPD), and to explore the feasibility of the research methods. We explored different mathematical methods for statistical analysis to assess the association between genotype and phenotype. We also analysed the statistical results for allele combinations of three loci using the difference value method and the ratio method. DNA blood samples were collected from 50 patients with liver cancer, 75 with stomach cancer, 50 with hematencephalon and 72 with COPD, and from 200 individuals from the normal population. All samples were from Chinese individuals. Alleles from short tandem repeat (STR) loci were determined using the STR Profiler plus PCR amplification kit (15 STR loci). Previous research was based on combinations of single-locus alleles and on cross-locus (two-locus) allele combinations. Allele combinations of three loci were obtained by computer counting, and a stronger genetic signal was obtained. The three-locus allele combination method can help to identify statistically significant differences in allele combinations between patients with liver cancer, stomach cancer, hematencephalon or COPD and the normal population. The probability of illness followed different rules and had apparent specificity. This method can be extended to other diseases and provide a reference for early clinical diagnosis. Copyright © 2016. Published by Elsevier B.V.
Evaluating sampling designs by computer simulation: A case study with the Missouri bladderpod

USGS Publications Warehouse

Morrison, L.W.; Smith, D.R.; Young, C.; Nichols, D.W.

2008-01-01

To effectively manage rare populations, accurate monitoring data are critical. Yet many monitoring programs are initiated without careful consideration of whether chosen sampling designs will provide accurate estimates of population parameters. Obtaining accurate estimates is especially difficult when natural variability is high, or limited budgets determine that only a small fraction of the population can be sampled. The Missouri bladderpod, Lesquerella filiformis Rollins, is a federally threatened winter annual that has an aggregated distribution pattern and exhibits dramatic interannual population fluctuations. Using the simulation program SAMPLE, we evaluated five candidate sampling designs appropriate for rare populations, based on 4 years of field data: (1) simple random sampling, (2) adaptive simple random sampling, (3) grid-based systematic sampling, (4) adaptive grid-based systematic sampling, and (5) GIS-based adaptive sampling. We compared the designs based on the precision of density estimates for fixed sample size, cost, and distance traveled. Sampling fraction and cost were the most important factors determining precision of density estimates, and relative design performance changed across the range of sampling fractions. Adaptive designs did not provide uniformly more precise estimates than conventional designs, in part because the spatial distribution of L. filiformis was relatively widespread within the study site. Adaptive designs tended to perform better as sampling fraction increased and when sampling costs, particularly distance traveled, were taken into account. The rate that units occupied by L. filiformis were encountered was higher for adaptive than for conventional designs. Overall, grid-based systematic designs were more efficient and practically implemented than the others. © 2008 The Society of Population Ecology and Springer.
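Evaluating sampling designs by simulation can be sketched in a few lines. The example below does not use the SAMPLE program or the bladderpod field data. It builds a hypothetical one-dimensional population with an aggregated (patchy) distribution and compares only two of the five designs, simple random and systematic sampling, by the spread of their repeated estimates. All names and parameters are assumptions made for illustration.

```python
import random
import statistics

def make_patchy_population(n_units, n_patches, patch_width, rng):
    """Mostly empty units with a few dense patches, mimicking an aggregated species."""
    pop = [0] * n_units
    for _ in range(n_patches):
        start = rng.randrange(n_units - patch_width)
        for i in range(start, start + patch_width):
            pop[i] += rng.randint(5, 15)
    return pop

def simple_random_estimate(pop, n, rng):
    """Mean density from n units drawn without replacement."""
    return statistics.mean(rng.sample(pop, n))

def systematic_estimate(pop, n, rng):
    """Mean density from every k-th unit after a random start (k = len(pop) // n)."""
    step = len(pop) // n
    start = rng.randrange(step)
    return statistics.mean(pop[start::step][:n])

rng = random.Random(42)  # fixed seed so the sketch is repeatable
pop = make_patchy_population(n_units=1000, n_patches=8, patch_width=20, rng=rng)
true_mean = statistics.mean(pop)

srs_estimates = [simple_random_estimate(pop, 50, rng) for _ in range(500)]
sys_estimates = [systematic_estimate(pop, 50, rng) for _ in range(500)]

# Precision of each design: standard deviation of its estimates across replicates
srs_sd = statistics.stdev(srs_estimates)
sys_sd = statistics.stdev(sys_estimates)
```

Comparing `srs_sd` and `sys_sd` for different sample sizes mirrors, in miniature, the abstract's finding that relative design performance changes with sampling fraction.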
Moving on From Representativeness: Testing the Utility of the Global Drug Survey.

PubMed

Barratt, Monica J; Ferris, Jason A; Zahnow, Renee; Palamar, Joseph J; Maier, Larissa J; Winstock, Adam R

2017-01-01

A decline in response rates in traditional household surveys, combined with increased internet coverage and decreased research budgets, has resulted in increased attractiveness of web survey research designs based on purposive and voluntary opt-in sampling strategies. In the study of hidden or stigmatised behaviours, such as cannabis use, web survey methods are increasingly common. However, opt-in web surveys are often heavily criticised due to their lack of sampling frame and unknown representativeness. In this article, we outline the current state of the debate about the relevance of pursuing representativeness, the state of probability sampling methods, and the utility of non-probability, web survey methods especially for accessing hidden or minority populations. Our article has two aims: (1) to present a comprehensive description of the methodology we use at Global Drug Survey (GDS), an annual cross-sectional web survey and (2) to compare the age and sex distributions of cannabis users who voluntarily completed (a) a household survey or (b) a large web-based purposive survey (GDS), across three countries: Australia, the United States, and Switzerland. We find that within each set of country comparisons, the demographic distributions among recent cannabis users are broadly similar, demonstrating that the age and sex distributions of those who volunteer to be surveyed are not vastly different between these non-probability and probability methods. We conclude that opt-in web surveys of hard-to-reach populations are an efficient way of gaining in-depth understanding of stigmatised behaviours and are appropriate, as long as they are not used to estimate drug use prevalence of the general population.
Moving on From Representativeness: Testing the Utility of the Global Drug Survey

PubMed Central

Barratt, Monica J; Ferris, Jason A; Zahnow, Renee; Palamar, Joseph J; Maier, Larissa J; Winstock, Adam R

2017-01-01

A decline in response rates in traditional household surveys, combined with increased internet coverage and decreased research budgets, has resulted in increased attractiveness of web survey research designs based on purposive and voluntary opt-in sampling strategies. In the study of hidden or stigmatised behaviours, such as cannabis use, web survey methods are increasingly common. However, opt-in web surveys are often heavily criticised due to their lack of sampling frame and unknown representativeness. In this article, we outline the current state of the debate about the relevance of pursuing representativeness, the state of probability sampling methods, and the utility of non-probability, web survey methods especially for accessing hidden or minority populations. Our article has two aims: (1) to present a comprehensive description of the methodology we use at Global Drug Survey (GDS), an annual cross-sectional web survey and (2) to compare the age and sex distributions of cannabis users who voluntarily completed (a) a household survey or (b) a large web-based purposive survey (GDS), across three countries: Australia, the United States, and Switzerland. We find that within each set of country comparisons, the demographic distributions among recent cannabis users are broadly similar, demonstrating that the age and sex distributions of those who volunteer to be surveyed are not vastly different between these non-probability and probability methods. We conclude that opt-in web surveys of hard-to-reach populations are an efficient way of gaining in-depth understanding of stigmatised behaviours and are appropriate, as long as they are not used to estimate drug use prevalence of the general population. PMID:28924351
Estimation and modeling of electrofishing capture efficiency for fishes in wadeable warmwater streams

USGS Publications Warehouse

Price, A.; Peterson, James T.

2010-01-01

Stream fish managers often use fish sample data to inform management decisions affecting fish populations. Fish sample data, however, can be biased by the same factors affecting fish populations. To minimize the effect of sample biases on decision making, biologists need information on the effectiveness of fish sampling methods. We evaluated single-pass backpack electrofishing and seining combined with electrofishing by following a dual-gear, mark–recapture approach in 61 blocknetted sample units within first- to third-order streams. We also estimated fish movement out of unblocked units during sampling. Capture efficiency and fish abundances were modeled for 50 fish species by use of conditional multinomial capture–recapture models. The best-approximating models indicated that capture efficiencies were generally low and differed among species groups based on family or genus. Efficiencies of single-pass electrofishing and seining combined with electrofishing were greatest for Catostomidae and lowest for Ictaluridae. Fish body length and stream habitat characteristics (mean cross-sectional area, wood density, mean current velocity, and turbidity) also were related to capture efficiency of both methods, but the effects differed among species groups. We estimated that, on average, 23% of fish left the unblocked sample units, but net movement varied among species. Our results suggest that (1) common warmwater stream fish sampling methods have low capture efficiency and (2) failure to adjust for incomplete capture may bias estimates of fish abundance. We suggest that managers minimize bias from incomplete capture by adjusting data for site- and species-specific capture efficiency and by choosing sampling gear that provide estimates with minimal bias and variance. Furthermore, if block nets are not used, we recommend that managers adjust the data based on unconditional capture efficiency.
Challenges in projecting clustering results across gene expression-profiling datasets.

PubMed

Lusa, Lara; McShane, Lisa M; Reid, James F; De Cecco, Loris; Ambrogi, Federico; Biganzoli, Elia; Gariboldi, Manuela; Pierotti, Marco A

2007-11-21

Gene expression microarray studies for several types of cancer have been reported to identify previously unknown subtypes of tumors. For breast cancer, a molecular classification consisting of five subtypes based on gene expression microarray data has been proposed. These subtypes have been reported to exist across several breast cancer microarray studies, and they have demonstrated some association with clinical outcome. A classification rule based on the method of centroids has been proposed for identifying the subtypes in new collections of breast cancer samples; the method is based on the similarity of the new profiles to the mean expression profile of the previously identified subtypes. Previously identified centroids of five breast cancer subtypes were used to assign 99 breast cancer samples, including a subset of 65 estrogen receptor-positive (ER+) samples, to five breast cancer subtypes based on microarray data for the samples. The effect of mean centering the genes (i.e., transforming the expression of each gene so that its mean expression is equal to 0) on subtype assignment by method of centroids was assessed. Further studies of the effect of mean centering and of class prevalence in the test set on the accuracy of method of centroids classifications of ER status were carried out using training and test sets for which ER status had been independently determined by ligand-binding assay and for which the proportion of ER+ and ER- samples were systematically varied. When all 99 samples were considered, mean centering before application of the method of centroids appeared to be helpful for correctly assigning samples to subtypes, as evidenced by the expression of genes that had previously been used as markers to identify the subtypes. However, when only the 65 ER+ samples were considered for classification, many samples appeared to be misclassified, as evidenced by an unexpected distribution of ER+ samples among the resultant subtypes. 
When genes were mean centered before classification of samples for ER status, the accuracy of the ER subgroup assignments was highly dependent on the proportion of ER+ samples in the test set; this effect of subtype prevalence was not seen when gene expression data were not mean centered. Simple corrections such as mean centering of genes aimed at microarray platform or batch effect correction can have undesirable consequences because patient population effects can easily be confused with these assay-related effects. Careful thought should be given to the comparability of the patient populations before attempting to force data comparability for purposes of assigning subtypes to independent subjects.
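The prevalence effect described above can be reproduced with a deliberately tiny toy: one gene, two fixed centroids, and a test set made up entirely of ER+ samples. This is a simplified sketch, not the authors' procedure. A real method-of-centroids classifier compares whole expression profiles across many genes. All values and names are hypothetical.

```python
def mean_center(values):
    """Shift values so that their mean is 0 within this dataset."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]

def nearest_centroid(value, centroids):
    """Return the label whose centroid is closest to the value."""
    return min(centroids, key=lambda label: abs(value - centroids[label]))

# Hypothetical centroids from a training set with balanced ER+ and ER- samples
centroids = {"ER+": 2.0, "ER-": -2.0}

# Hypothetical test set in which every sample is truly ER+
test_expression = [1.0, 1.5, 2.5, 3.0]

uncentered_calls = [nearest_centroid(v, centroids) for v in test_expression]
centered_calls = [nearest_centroid(v, centroids) for v in mean_center(test_expression)]
# uncentered_calls -> ['ER+', 'ER+', 'ER+', 'ER+']
# centered_calls   -> ['ER-', 'ER-', 'ER+', 'ER+']
```

Centering forces the test set's mean to zero, so roughly half of a homogeneous ER+ group ends up on the ER- side of the boundary. The error comes from the composition of the patient population, not from the assay.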

Using the Johns Hopkins' Aggregated Diagnosis Groups (ADGs) to predict 1-year mortality in population-based cohorts of patients with diabetes in Ontario, Canada.

PubMed

Austin, P C; Shah, B R; Newman, A; Anderson, G M

2012-09-01

There are limited validated methods to ascertain comorbidities for risk adjustment in ambulatory populations of patients with diabetes using administrative health-care databases. The objective was to examine the ability of the Johns Hopkins' Aggregated Diagnosis Groups to predict mortality in population-based ambulatory samples of both incident and prevalent subjects with diabetes. Retrospective cohorts were constructed using population-based administrative data. The incident cohort consisted of all 346,297 subjects diagnosed with diabetes between 1 April 2004 and 31 March 2008. The prevalent cohort consisted of all 879,849 subjects with pre-existing diabetes on 1 January 2007. The outcome was death within 1 year of the subject's index date. A logistic regression model consisting of age, sex and indicator variables for 22 of the 32 Johns Hopkins' Aggregated Diagnosis Group categories had excellent discrimination for predicting mortality in incident diabetes patients: the c-statistic was 0.87 in an independent validation sample. A similar model had excellent discrimination for predicting mortality in prevalent diabetes patients: the c-statistic was 0.84 in an independent validation sample. Both models demonstrated very good calibration, denoting good agreement between observed and predicted mortality across the range of predicted mortality in which the large majority of subjects lay. For comparative purposes, regression models incorporating the Charlson comorbidity index, age and sex; age and sex; and age alone had poorer discrimination than the model that incorporated the Johns Hopkins' Aggregated Diagnosis Groups. Logistic regression models using age, sex and the Johns Hopkins' Aggregated Diagnosis Groups were able to accurately predict 1-year mortality in population-based samples of patients with diabetes. © 2011 The Authors. Diabetic Medicine © 2011 Diabetes UK.
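The c-statistic reported above measures discrimination: the probability that a randomly chosen subject who died was assigned a higher predicted risk than a randomly chosen subject who survived. A minimal sketch of that pairwise definition follows, with hypothetical predictions and outcomes rather than the study's data.

```python
def c_statistic(predicted_risk, died):
    """Share of (died, survived) pairs ranked correctly; ties count as half."""
    event_risks = [p for p, d in zip(predicted_risk, died) if d]
    nonevent_risks = [p for p, d in zip(predicted_risk, died) if not d]
    concordant = 0.0
    for e in event_risks:
        for n in nonevent_risks:
            if e > n:
                concordant += 1.0
            elif e == n:
                concordant += 0.5
    return concordant / (len(event_risks) * len(nonevent_risks))

# Hypothetical predicted 1-year mortality risks and observed outcomes (1 = died)
predicted = [0.9, 0.4, 0.35, 0.1]
outcome = [1, 0, 1, 0]
c = c_statistic(predicted, outcome)  # 3 of 4 pairs are concordant, so c = 0.75
```

A value of 0.5 corresponds to chance and 1.0 to perfect separation. The pairwise loop is quadratic in cohort size, so cohorts of several hundred thousand subjects would need a rank-based formula instead.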
Invited review: study design considerations for clinical research in veterinary radiology and radiation oncology.

PubMed

Scrivani, Peter V; Erb, Hollis N

2013-01-01

High quality clinical research is essential for advancing knowledge in the areas of veterinary radiology and radiation oncology. Types of clinical research studies may include experimental studies, method-comparison studies, and patient-based studies. Experimental studies explore issues relative to pathophysiology, patient safety, and treatment efficacy. Method-comparison studies evaluate agreement between techniques or between observers. Patient-based studies investigate naturally acquired disease and focus on questions asked in clinical practice that relate to individuals or populations (e.g., risk, accuracy, or prognosis). Careful preplanning and study design are essential in order to achieve valid results. A key point to planning studies is ensuring that the design is tailored to the study objectives. Good design includes a comprehensive literature review, asking suitable questions, selecting the proper sample population, collecting the appropriate data, performing the correct statistical analyses, and drawing conclusions supported by the available evidence. Most study designs are classified by whether they are experimental or observational, longitudinal or cross-sectional, and prospective or retrospective. Additional features (e.g., controlled, randomized, or blinded) may be described that address bias. Two related challenging aspects of study design are defining an important research question and selecting an appropriate sample population. The sample population should represent the target population as much as possible. Furthermore, when comparing groups, it is important that the groups are as alike to each other as possible except for the variables of interest. Medical images are well suited for clinical research because imaging signs are categorical or numerical variables that might be predictors or outcomes of diseases or treatments. © 2013 Veterinary Radiology & Ultrasound.
The HealthNuts population-based study of paediatric food allergy: validity, safety and acceptability.

PubMed

Osborne, N J; Koplin, J J; Martin, P E; Gurrin, L C; Thiele, L; Tang, M L; Ponsonby, A-L; Dharmage, S C; Allen, K J

2010-10-01

The incidence of hospital admissions for food allergy-related anaphylaxis in Australia has increased, in line with world-wide trends. However, a valid measure of food allergy prevalence and risk factor data from a population-based study is still lacking. We aimed to describe the study design and methods used to recruit infants from a population for skin prick testing and oral food challenges, and the use of preliminary data to investigate the extent to which the study sample is representative of the target population. The study sampling frame comprises 12-month-old infants presenting for routine scheduled vaccination at immunization clinics in Melbourne, Australia. We compared demographic features of participating families to population summary statistics from the Victorian Perinatal census database, and administered a survey to those non-responders who chose not to participate in the study. Study design proved acceptable to the community with good uptake (response rate 73.4%), with 2171 participants recruited. Demographic information on the study population mirrored the Victorian population, with most of the population parameters measured falling within our confidence intervals (CI). Use of a non-responder questionnaire revealed that a higher proportion of infants who declined to participate (non-responders) were already eating and tolerating peanuts than those agreeing to participate (54.4%; 95% CI 50.8, 58.0 vs. 27.4%; 95% CI 25.5, 29.3 among participants). A high proportion of individuals approached in a community setting participated in a food allergy study. The study population differed from the eligible sample in relation to family history of allergy and prior consumption and tolerance of peanut, providing some insights into the internal validity of the sample. The study exhibited external validity on general demographics to all births in Victoria. © 2010 Blackwell Publishing Ltd.
Advantage of population pharmacokinetic method for evaluating the bioequivalence and accuracy of parameter estimation of pidotimod.

PubMed

Huang, Jihan; Li, Mengying; Lv, Yinghua; Yang, Juan; Xu, Ling; Wang, Jingjing; Chen, Junchao; Wang, Kun; He, Yingchun; Zheng, Qingshan

2016-09-01

This study aimed to explore the accuracy of the population pharmacokinetic method in evaluating the bioequivalence of pidotimod with sparse data profiles, and whether this method is suitable for bioequivalence evaluation in special populations, such as children, in whom fewer samples can be taken. In this single-dose, two-period crossover study, 20 healthy male Chinese volunteers were randomized 1:1 to receive either the test or reference formulation, with a 1-week washout before receiving the alternative formulation. Noncompartmental and population compartmental pharmacokinetic analyses were conducted. Simulated data were analyzed to graphically evaluate the model and the pharmacokinetic characteristics of the two pidotimod formulations. Various sparse sampling scenarios were generated from the real bioequivalence clinical trial data and evaluated by the population pharmacokinetic method. The 90% confidence intervals (CIs) for AUC0-12h, AUC0-∞, and Cmax were 97.3-118.7%, 96.9-118.7%, and 95.1-109.8%, respectively, within the 80-125% range for bioequivalence using noncompartmental analysis. The population compartmental pharmacokinetics of pidotimod were described using a one-compartment model with first-order absorption and lag time. In the comparison of estimates across different datasets, the random three-point and fixed four-point sampling strategies can provide results similar to those obtained through rich sampling. The nonlinear mixed-effects model requires fewer data points. Moreover, compared with the noncompartmental analysis method, the pharmacokinetic parameters can be more accurately estimated using the nonlinear mixed-effects model. The population pharmacokinetic modeling method was used to assess the bioequivalence of two pidotimod formulations with relatively few sampling points and further validated the bioequivalence of the two formulations. This method may provide useful information for regulating bioequivalence evaluation in special populations.
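Two ideas in the abstract above lend themselves to a short illustration: the bioequivalence criterion (a 90% CI lying entirely within 80-125%) and the structural model (one compartment, first-order absorption, lag time). The sketch below assumes the standard textbook closed form for that model; the function names and all parameter values are hypothetical and are not the fitted estimates from the study.

```python
import math


def within_bioequivalence_limits(ci_low, ci_high, lower=80.0, upper=125.0):
    """True if the whole 90% CI (in percent) lies inside [lower, upper]."""
    return lower <= ci_low and ci_high <= upper


def one_compartment_conc(t, dose, ka, ke, volume, tlag, bioavailability=1.0):
    """Concentration at time t for a one-compartment model with
    first-order absorption (rate ka), first-order elimination (rate ke)
    and an absorption lag time tlag. Requires ka != ke.
    """
    if ka == ke:
        raise ValueError("closed form used here requires ka != ke")
    if t <= tlag:
        return 0.0  # nothing has been absorbed before the lag time elapses
    dt = t - tlag
    scale = bioavailability * dose * ka / (volume * (ka - ke))
    return scale * (math.exp(-ke * dt) - math.exp(-ka * dt))


# The three CIs quoted in the abstract (AUC0-12h, AUC0-inf, Cmax).
for low, high in [(97.3, 118.7), (96.9, 118.7), (95.1, 109.8)]:
    print(low, high, within_bioequivalence_limits(low, high))

# Hypothetical parameters, chosen only to draw a plausible profile.
for hour in [0.0, 0.5, 1.0, 2.0, 4.0, 8.0, 12.0]:
    print(hour, one_compartment_conc(hour, dose=800.0, ka=1.5, ke=0.3,
                                     volume=30.0, tlag=0.25))
```

In a population analysis the parameters `ka`, `ke`, `volume` and `tlag` would each carry between-subject random effects estimated by a nonlinear mixed-effects fit; that estimation step is outside the scope of this sketch.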
Evaluation and comparison of FTA card and CTAB DNA extraction methods for non-agricultural taxa

PubMed Central

Siegel, Chloe S.; Stevenson, Florence O.; Zimmer, Elizabeth A.

2017-01-01

Premise of the study: An efficient, effective DNA extraction method is necessary for comprehensive analysis of plant genomes. This study analyzed the quality of DNA obtained using paper FTA cards prepared directly in the field when compared to the more traditional cetyltrimethylammonium bromide (CTAB)–based extraction methods from silica-dried samples. Methods: DNA was extracted using FTA cards according to the manufacturer’s protocol. In parallel, CTAB-based extractions were done using the automated AutoGen DNA isolation system. DNA quality for both methods was determined for 15 non-agricultural species collected in situ, by gel separation, spectrophotometry, fluorometry, and successful amplification and sequencing of nuclear and chloroplast gene markers. Results: The FTA card extraction method yielded less concentrated, but also less fragmented samples than the CTAB-based technique. The card-extracted samples provided DNA that could be successfully amplified and sequenced. The FTA cards are also useful because the collected samples do not require refrigeration, extensive laboratory expertise, or as many hazardous chemicals as extractions using the CTAB-based technique. Discussion: The relative success of the FTA card method in our study suggested that this method could be a valuable tool for studies in plant population genetics and conservation biology that may involve screening of hundreds of individual plants. The FTA cards, like the silica gel samples, do not contain plant material capable of propagation, and therefore do not require permits from the U.S. Department of Agriculture (USDA) Animal and Plant Health Inspection Service (APHIS) for transportation. PMID:28224056
Probabilistic population projections with migration uncertainty

PubMed Central

Azose, Jonathan J.; Ševčíková, Hana; Raftery, Adrian E.

2016-01-01

We produce probabilistic projections of population for all countries based on probabilistic projections of fertility, mortality, and migration. We compare our projections to those from the United Nations’ Probabilistic Population Projections, which uses similar methods for fertility and mortality but deterministic migration projections. We find that uncertainty in migration projection is a substantial contributor to uncertainty in population projections for many countries. Prediction intervals for the populations of Northern America and Europe are over 70% wider, whereas prediction intervals for the populations of Africa, Asia, and the world as a whole are nearly unchanged. Out-of-sample validation shows that the model is reasonably well calibrated. PMID:27217571
A model-based 'varimax' sampling strategy for a heterogeneous population.

PubMed

Akram, Nuzhat A; Farooqi, Shakeel R

2014-01-01

Sampling strategies are planned to enhance the homogeneity of a sample, hence to minimize confounding errors. A sampling strategy was developed to minimize the variation within population groups. Karachi, the largest urban agglomeration in Pakistan, was used as a model population. Blood groups ABO and Rh factor were determined for 3000 unrelated individuals selected through simple random sampling. Among them five population groups, namely Balochi, Muhajir, Pathan, Punjabi and Sindhi, based on paternal ethnicity were identified. An index was designed to measure the proportion of admixture at parental and grandparental levels. Population models based on index score were proposed. For validation, 175 individuals selected through stratified random sampling were genotyped for the three STR loci CSF1PO, TPOX and TH01. ANOVA showed significant differences across the population groups for blood groups and STR loci distribution. Gene diversity was higher across the sub-population model than in the agglomerated population. At parental level gene diversities are significantly higher across No admixture models than Admixture models. At grandparental level the difference was not significant. A sub-population model with no admixture at parental level was justified for sampling the heterogeneous population of Karachi.
Application of Response Surface Methods To Determine Conditions for Optimal Genomic Prediction

PubMed Central

Howard, Réka; Carriquiry, Alicia L.; Beavis, William D.

2017-01-01

An epistatic genetic architecture can have a significant impact on prediction accuracies of genomic prediction (GP) methods. Machine learning methods predict traits comprised of epistatic genetic architectures more accurately than statistical methods based on additive mixed linear models. The differences between these types of GP methods suggest a diagnostic for revealing genetic architectures underlying traits of interest. In addition to genetic architecture, the performance of GP methods may be influenced by the sample size of the training population, the number of QTL, and the proportion of phenotypic variability due to genotypic variability (heritability). Possible values for these factors and the number of combinations of the factor levels that influence the performance of GP methods can be large. Thus, efficient methods for identifying combinations of factor levels that produce the most accurate GPs are needed. Herein, we employ response surface methods (RSMs) to find the experimental conditions that produce the most accurate GPs. We illustrate RSM with an example of simulated doubled haploid populations and identify the combination of factors that maximizes the difference between prediction accuracies of best linear unbiased prediction (BLUP) and support vector machine (SVM) GP methods. The greatest impact on the response is due to the genetic architecture of the population, heritability of the trait, and the sample size. When epistasis is responsible for all of the genotypic variance and heritability is equal to one and the sample size of the training population is large, the advantage of using the SVM method vs. the BLUP method is greatest. However, except for values close to the maximum, most of the response surface shows little difference between the methods. We also determined that the conditions resulting in the greatest prediction accuracy for BLUP occurred when genetic architecture consists solely of additive effects, and heritability is equal to one. PMID:28720710
Building Better Planet Populations for EXOSIMS

NASA Astrophysics Data System (ADS)

Garrett, Daniel; Savransky, Dmitry

2018-01-01

The Exoplanet Open-Source Imaging Mission Simulator (EXOSIMS) software package simulates ensembles of space-based direct imaging surveys to provide a variety of science and engineering yield distributions for proposed mission designs. These mission simulations rely heavily on assumed distributions of planetary population parameters including semi-major axis, planetary radius, eccentricity, albedo, and orbital orientation to provide heuristics for target selection and to simulate planetary systems for detection and characterization. The distributions are encoded in PlanetPopulation modules within EXOSIMS which are selected by the user in the input JSON script when a simulation is run. The earliest written PlanetPopulation modules available in EXOSIMS are based on planet population models where the planetary parameters are considered to be independent from one another. While independent parameters allow for quick computation of heuristics and sampling for simulated planetary systems, results from planet-finding surveys have shown that many parameters (e.g., semi-major axis/orbital period and planetary radius) are not independent. We present new PlanetPopulation modules for EXOSIMS which are built on models based on planet-finding survey results where semi-major axis and planetary radius are not independent and provide methods for sampling their joint distribution. These new modules enhance the ability of EXOSIMS to simulate realistic planetary systems and give more realistic science yield distributions.
Mean population salt intake estimated from 24-h urine samples and spot urine samples: a systematic review and meta-analysis.

PubMed

Huang, Liping; Crino, Michelle; Wu, Jason H Y; Woodward, Mark; Barzi, Federica; Land, Mary-Anne; McLean, Rachael; Webster, Jacqui; Enkhtungalag, Batsaikhan; Neal, Bruce

2016-02-01

Estimating equations based on spot urine samples have been identified as a possible alternative approach to 24-h urine collections for determining mean population salt intake. This review compares estimates of mean population salt intake based upon spot and 24-h urine samples. We systematically searched for all studies that reported estimates of daily salt intake based upon both spot and 24-h urine samples for the same population. The associations between the two were quantified and compared overall and in subsets of studies. A total of 538 records were identified, 108 were assessed as full text and 29 were included. The included studies involved 10,414 participants from 34 countries and made 71 comparisons available for the primary analysis. Overall average population salt intake estimated from 24-h urine samples was 9.3 g/day compared with 9.0 g/day estimated from the spot urine samples. Estimates based upon spot urine samples had excellent sensitivity (97%) and specificity (100%) at classifying mean population salt intake as above or below the World Health Organization maximum target of 5 g/day. Compared with the 24-h samples, estimates based upon spot urine overestimated intake at lower levels of consumption and underestimated intake at higher levels of consumption. Estimates of mean population salt intake based upon spot urine samples can provide countries with a good indication of mean population salt intake and whether action on salt consumption is required. Published by Oxford University Press on behalf of the International Epidemiological Association 2015. This work is written by US Government employees and is in the public domain in the US.
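The sensitivity and specificity figures in the abstract above refer to classifying a population's mean salt intake as above or below the 5 g/day target, with the 24-h estimate as the reference. As a rough illustration of that calculation (the function name and the paired values are invented for the example, not data from the review):

```python
def sensitivity_specificity(reference, estimate, threshold=5.0):
    """Sensitivity and specificity of `estimate` for classifying values
    as above `threshold`, taking `reference` as the truth.

    reference: mean intakes (g/day) from 24-h urine, one per population.
    estimate: mean intakes (g/day) from spot urine, in the same order.
    """
    tp = fn = tn = fp = 0
    for ref, est in zip(reference, estimate):
        ref_above = ref > threshold
        est_above = est > threshold
        if ref_above and est_above:
            tp += 1
        elif ref_above:
            fn += 1
        elif est_above:
            fp += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Hypothetical paired population means (g/day).
urine_24h = [9.3, 4.2, 6.0, 4.8]
urine_spot = [9.0, 4.5, 4.9, 4.7]
print(sensitivity_specificity(urine_24h, urine_spot))
```

Because spot-urine estimates tend to be pulled toward the middle of the range, as the abstract notes, misclassification would be expected mainly for populations whose true mean sits close to the threshold.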
A panel of microsatellites to individually identify leopards and its application to leopard monitoring in human dominated landscapes.

PubMed

Mondol, Samrat; Navya, R; Athreya, Vidya; Sunagar, Kartik; Selvaraj, Velu Mani; Ramakrishnan, Uma

2009-12-04

Leopards are the most widely distributed of the large cats, ranging from Africa to the Russian Far East. Because of habitat fragmentation, high human population densities and the inherent adaptability of this species, they now occupy landscapes close to human settlements. As a result, they are the most common species involved in human-wildlife conflict in India, necessitating their monitoring. However, their elusive nature makes such monitoring difficult. Recent advances in DNA methods along with non-invasive sampling techniques can be used to monitor populations and individuals across large landscapes including human-dominated ones. In this paper, we describe a DNA-based method for leopard individual identification where we used fecal DNA samples to obtain genetic material. Further, we apply our methods to non-invasive samples collected in a human-dominated landscape to estimate the minimum number of leopards in this human-leopard conflict area in Western India. In this study, 25 of the 29 tested cross-specific microsatellite markers showed positive amplification in 37 wild-caught leopards. These loci revealed varied levels of polymorphism (4-12 alleles) and heterozygosity (0.05-0.79). Combining data on amplification success (including non-invasive samples) and locus-specific polymorphisms, we showed that eight loci provide a sibling probability of identity of 0.0005, suggesting that this panel can be used to discriminate individuals in the wild. When this microsatellite panel was applied to fecal samples collected from a human-dominated landscape, we identified 7 individuals, with a sibling probability of identity of 0.001. Amplification success of field-collected scats was up to 72%, and genotype error ranged from 0-7.4%. Our results demonstrated that the selected panel of eight microsatellite loci can conclusively identify leopards from various kinds of biological samples. Our methods can be used to monitor leopards over small and large landscapes to assess population trends, and could also be tested for population assignment in forensic applications.
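The sibling probability of identity quoted in the abstract above is the chance that two full siblings share the same multilocus genotype; the smaller it is, the more reliably a marker panel separates individuals. The abstract does not state which formula was used, so the sketch below assumes the widely used per-locus expression PIDsib = 0.25 + 0.5·Σp² + 0.5·(Σp²)² − 0.25·Σp⁴, multiplied across loci under an assumption of independence. The allele frequencies are invented for illustration.

```python
def pid_sib_locus(allele_freqs):
    """Per-locus probability of identity among full siblings.

    allele_freqs: allele frequencies at one locus (should sum to 1).
    Assumed formula: 0.25 + 0.5*S2 + 0.5*S2**2 - 0.25*S4,
    where S2 = sum(p**2) and S4 = sum(p**4).
    """
    if abs(sum(allele_freqs) - 1.0) > 1e-6:
        raise ValueError("allele frequencies must sum to 1")
    s2 = sum(p ** 2 for p in allele_freqs)
    s4 = sum(p ** 4 for p in allele_freqs)
    return 0.25 + 0.5 * s2 + 0.5 * s2 ** 2 - 0.25 * s4


def pid_sib_panel(loci):
    """Multilocus value: product of per-locus values, assuming the
    loci are independent (an assumption of this sketch)."""
    product = 1.0
    for freqs in loci:
        product *= pid_sib_locus(freqs)
    return product


# Hypothetical panel of three loci with 4, 5 and 2 alleles.
panel = [
    [0.4, 0.3, 0.2, 0.1],
    [0.2, 0.2, 0.2, 0.2, 0.2],
    [0.5, 0.5],
]
print(pid_sib_panel(panel))
```

Each additional polymorphic locus multiplies the panel value by a factor below one, which is why adding loci drives the probability down toward values such as those reported.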
A panel of microsatellites to individually identify leopards and its application to leopard monitoring in human dominated landscapes

PubMed Central

2009-01-01

Background Leopards are the most widely distributed of the large cats, ranging from Africa to the Russian Far East. Because of habitat fragmentation, high human population densities and the inherent adaptability of this species, they now occupy landscapes close to human settlements. As a result, they are the most common species involved in human-wildlife conflict in India, necessitating their monitoring. However, their elusive nature makes such monitoring difficult. Recent advances in DNA methods along with non-invasive sampling techniques can be used to monitor populations and individuals across large landscapes including human-dominated ones. In this paper, we describe a DNA-based method for leopard individual identification where we used fecal DNA samples to obtain genetic material. Further, we apply our methods to non-invasive samples collected in a human-dominated landscape to estimate the minimum number of leopards in this human-leopard conflict area in Western India. Results In this study, 25 of the 29 tested cross-specific microsatellite markers showed positive amplification in 37 wild-caught leopards. These loci revealed varied levels of polymorphism (4-12 alleles) and heterozygosity (0.05-0.79). Combining data on amplification success (including non-invasive samples) and locus-specific polymorphisms, we showed that eight loci provide a sibling probability of identity of 0.0005, suggesting that this panel can be used to discriminate individuals in the wild. When this microsatellite panel was applied to fecal samples collected from a human-dominated landscape, we identified 7 individuals, with a sibling probability of identity of 0.001. Amplification success of field-collected scats was up to 72%, and genotype error ranged from 0-7.4%. Conclusion Our results demonstrated that the selected panel of eight microsatellite loci can conclusively identify leopards from various kinds of biological samples. Our methods can be used to monitor leopards over small and large landscapes to assess population trends, and could also be tested for population assignment in forensic applications. PMID:19961605
Sex Differences in DSM-IV Posttraumatic Stress Disorder Symptoms Expression Using Item Response Theory: a Population-based Study

PubMed Central

Rivollier, Fabrice; Peyre, Hugo; Hoertel, Nicolas; Blanco, Carlos; Limosin, Frédéric; Delorme, Richard

2015-01-01

Background Whether there are systematic sex differences in posttraumatic stress disorder (PTSD) symptom expression remains debated. Using methods based on item response theory (IRT), we aimed at examining differences in the likelihood of reporting DSM-IV symptoms of PTSD between women and men, while stratifying for major trauma type and equating for PTSD severity. Method We compared data from women and men in a large nationally representative adult sample, the National Epidemiologic Survey on Alcohol and Related Conditions. Analyses were conducted in the full population sample of individuals who met the DSM-IV criterion A (n = 23,860) and in subsamples according to trauma types. Results The clinical presentation of the 17 DSM-IV PTSD symptoms in the general population did not substantially differ in women and men in the full population and by trauma type after equating for levels of PTSD severity. The only exception was the symptom “foreshortened future”, which was more likely endorsed by men at equivalent levels of PTSD severity. Limitations The retrospective nature of the assessment of PTSD symptoms could have led to recall bias. Our sample size was too small to draw conclusions among individuals who experienced war-related traumas. Conclusions Our findings suggest that the clinical presentation of PTSD does not differ substantially between women and men. We also provide additional psychometric support to the exclusion of the symptom “foreshortened future” from the diagnostic criteria for PTSD in the DSM-5. PMID:26342916
State-space modeling of population sizes and trends in Nihoa Finch and Millerbird

USGS Publications Warehouse

Gorresen, P. Marcos; Brinck, Kevin W.; Camp, Richard J.; Farmer, Chris; Plentovich, Sheldon M.; Banko, Paul C.

2016-01-01

Both of the 2 passerines endemic to Nihoa Island, Hawai‘i, USA—the Nihoa Millerbird (Acrocephalus familiaris kingi) and Nihoa Finch (Telespiza ultima)—are listed as endangered by federal and state agencies. Their abundances have been estimated by irregularly implemented fixed-width strip-transect sampling from 1967 to 2012, from which area-based extrapolation of the raw counts produced highly variable abundance estimates for both species. To evaluate an alternative survey method and improve abundance estimates, we conducted variable-distance point-transect sampling between 2010 and 2014. We compared our results to those obtained from strip-transect samples. In addition, we applied state-space models to derive improved estimates of population size and trends from the legacy time series of strip-transect counts. Both species were fairly evenly distributed across Nihoa and occurred in all or nearly all available habitat. Population trends for Nihoa Millerbird were inconclusive because of high within-year variance. Trends for Nihoa Finch were positive, particularly since the early 1990s. Distance-based analysis of point-transect counts produced mean estimates of abundance similar to those from strip-transects but was generally more precise. However, both survey methods produced biologically unrealistic variability between years. State-space modeling of the long-term time series of abundances obtained from strip-transect counts effectively reduced uncertainty in both within- and between-year estimates of population size, and allowed short-term changes in abundance trajectories to be smoothed into a long-term trend.
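The abstract above does not give the model equations. As a minimal sketch of the general idea only, the code below filters a count series with a local-level state-space model (latent abundance follows a random walk, each survey observes it with noise) using a scalar Kalman filter. The function name, the variance values and the counts are all assumptions; the study's actual model may differ substantially, for example by working on a log scale or in a Bayesian framework.

```python
def local_level_filter(counts, process_var, obs_var, init_mean, init_var):
    """Kalman filter for a local-level model.

    counts: yearly abundance estimates; None marks a year with no survey.
    process_var: variance of the year-to-year change in true abundance.
    obs_var: variance of the survey (observation) error.
    Returns a list of (filtered_mean, filtered_variance), one per year.
    """
    mean, var = init_mean, init_var
    filtered = []
    for y in counts:
        # Predict: the random walk keeps the mean and inflates the variance.
        var_pred = var + process_var
        if y is None:
            # No survey this year: carry the prediction forward.
            var = var_pred
        else:
            # Update: weight the new count by the Kalman gain.
            gain = var_pred / (var_pred + obs_var)
            mean = mean + gain * (y - mean)
            var = (1.0 - gain) * var_pred
        filtered.append((mean, var))
    return filtered


# Hypothetical, irregularly surveyed series (None = year without a survey).
series = [2100.0, None, 3400.0, 1800.0, None, None, 2900.0]
for year, (m, v) in enumerate(local_level_filter(series, 200.0 ** 2,
                                                 900.0 ** 2, 2500.0,
                                                 1000.0 ** 2)):
    print(year, round(m, 1), round(v ** 0.5, 1))
```

When the observation variance is large relative to the process variance, the gain is small and noisy single-year counts move the estimate only slightly, which is the smoothing behaviour the abstract describes.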
The impact of sample non-normality on ANOVA and alternative methods.

PubMed

Lantz, Björn

2013-05-01

In this journal, Zimmerman (2004, 2011) has discussed preliminary tests that researchers often use to choose an appropriate method for comparing locations when the assumption of normality is doubtful. The conceptual problem with this approach is that such a two-stage process makes both the power and the significance of the entire procedure uncertain, as type I and type II errors are possible at both stages. A type I error at the first stage, for example, will obviously increase the probability of a type II error at the second stage. Based on the idea of Schmider et al. (2010), which proposes that simulated sets of sample data be ranked with respect to their degree of normality, this paper investigates the relationship between population non-normality and sample non-normality with respect to the performance of the ANOVA, Brown-Forsythe test, Welch test, and Kruskal-Wallis test when used with different distributions, sample sizes, and effect sizes. The overall conclusion is that the Kruskal-Wallis test is considerably less sensitive to the degree of sample normality when populations are distinctly non-normal and should therefore be the primary tool used to compare locations when it is known that populations are not at least approximately normal. © 2012 The British Psychological Society.
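For readers unfamiliar with the Kruskal-Wallis test recommended in the abstract above, the sketch below computes its H statistic from pooled ranks, H = 12/(N(N+1)) · Σ(R_j²/n_j) − 3(N+1), where R_j is the rank sum of group j. Tied values receive midranks, but the usual tie correction factor is deliberately omitted to keep the example short; the function name and the data are assumptions.

```python
def kruskal_wallis_h(*groups):
    """Kruskal-Wallis H statistic (no tie correction).

    groups: two or more non-empty sequences of numeric observations.
    Under the null hypothesis H is approximately chi-square distributed
    with len(groups) - 1 degrees of freedom.
    """
    if len(groups) < 2 or any(len(g) == 0 for g in groups):
        raise ValueError("need at least two non-empty groups")
    pooled = sorted((value, idx) for idx, g in enumerate(groups) for value in g)
    n_total = len(pooled)
    rank_sums = [0.0] * len(groups)
    i = 0
    while i < n_total:
        # Find the run of tied values starting at position i.
        j = i
        while j < n_total and pooled[j][0] == pooled[i][0]:
            j += 1
        midrank = (i + 1 + j) / 2.0  # mean of ranks i+1 .. j
        for k in range(i, j):
            rank_sums[pooled[k][1]] += midrank
        i = j
    weighted = sum(r * r / len(g) for r, g in zip(rank_sums, groups))
    return 12.0 / (n_total * (n_total + 1)) * weighted - 3.0 * (n_total + 1)


# Hypothetical skewed samples from three groups.
a = [1.2, 0.4, 3.9, 0.8, 2.1]
b = [2.5, 6.1, 1.9, 4.4, 9.7]
c = [5.0, 12.3, 7.7, 3.3, 8.8]
print(kruskal_wallis_h(a, b, c))
```

Because the statistic depends only on ranks, extreme values in a skewed sample influence it no more than any other large observation, which is the intuition behind its robustness to non-normality.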
Direct sampling of cystic fibrosis lungs indicates that DNA-based analyses of upper-airway specimens can misrepresent lung microbiota.

PubMed

Goddard, Amanda F; Staudinger, Benjamin J; Dowd, Scot E; Joshi-Datar, Amruta; Wolcott, Randall D; Aitken, Moira L; Fligner, Corinne L; Singh, Pradeep K

2012-08-21

Recent work using culture-independent methods suggests that the lungs of cystic fibrosis (CF) patients harbor a vast array of bacteria not conventionally implicated in CF lung disease. However, sampling lung secretions in living subjects requires that expectorated specimens or collection devices pass through the oropharynx. Thus, contamination could confound results. Here, we compared culture-independent analyses of throat and sputum specimens to samples directly obtained from the lungs at the time of transplantation. We found that CF lungs with advanced disease contained relatively homogenous populations of typical CF pathogens. In contrast, upper-airway specimens from the same subjects contained higher levels of microbial diversity and organisms not typically considered CF pathogens. Furthermore, sputum exhibited day-to-day variation in the abundance of nontypical organisms, even in the absence of clinical changes. These findings suggest that oropharyngeal contamination could limit the accuracy of DNA-based measurements on upper-airway specimens. This work highlights the importance of sampling procedures for microbiome studies and suggests that methods that account for contamination are needed when DNA-based methods are used on clinical specimens.
Inference and Analysis of Population Structure Using Genetic Data and Network Theory

PubMed Central

Greenbaum, Gili; Templeton, Alan R.; Bar-David, Shirli

2016-01-01

Clustering individuals to subpopulations based on genetic data has become commonplace in many genetic studies. Inference about population structure is most often done by applying model-based approaches, aided by visualization using distance-based approaches such as multidimensional scaling. While existing distance-based approaches suffer from a lack of statistical rigor, model-based approaches entail assumptions of prior conditions such as that the subpopulations are at Hardy-Weinberg equilibria. Here we present a distance-based approach for inference about population structure using genetic data by defining population structure using network theory terminology and methods. A network is constructed from a pairwise genetic-similarity matrix of all sampled individuals. The community partition, a partition of a network to dense subgraphs, is equated with population structure, a partition of the population to genetically related groups. Community-detection algorithms are used to partition the network into communities, interpreted as a partition of the population to subpopulations. The statistical significance of the structure can be estimated by using permutation tests to evaluate the significance of the partition’s modularity, a network theory measure indicating the quality of community partitions. To further characterize population structure, a new measure of the strength of association (SA) for an individual to its assigned community is presented. The strength of association distribution (SAD) of the communities is analyzed to provide additional population structure characteristics, such as the relative amount of gene flow experienced by the different subpopulations and identification of hybrid individuals. Human genetic data and simulations are used to demonstrate the applicability of the analyses. The approach presented here provides a novel, computationally efficient model-free method for inference about population structure that does not entail assumption of prior conditions. The method is implemented in the software NetStruct (available at https://giligreenbaum.wordpress.com/software/). PMID:26888080
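The abstract above rests on two computations: the modularity of a community partition on a weighted similarity network, and a permutation test of that modularity. The sketch below is not NetStruct and does not reproduce its permutation scheme, which the abstract does not specify. It assumes the standard Newman modularity for a weighted undirected network and, as a simple stand-in, a null distribution obtained by shuffling community labels among individuals. All names and the toy matrix are hypothetical.

```python
import random


def modularity(weights, labels):
    """Newman modularity Q of a partition of a weighted undirected network.

    weights: symmetric matrix (list of lists) of edge weights, zero diagonal.
    labels: community label of each node, in matrix order.
    """
    n = len(weights)
    strength = [sum(row) for row in weights]
    two_m = sum(strength)  # twice the total edge weight
    if two_m == 0:
        raise ValueError("network has no edges")
    q = 0.0
    for i in range(n):
        for j in range(n):
            if labels[i] == labels[j]:
                q += weights[i][j] - strength[i] * strength[j] / two_m
    return q / two_m


def permutation_p_value(weights, labels, n_perm=999, seed=0):
    """One-sided p-value for the observed modularity, using label
    shuffling as the null model (an assumption of this sketch)."""
    rng = random.Random(seed)
    observed = modularity(weights, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if modularity(weights, shuffled) >= observed - 1e-12:
            hits += 1
    return (hits + 1) / (n_perm + 1)


# Toy similarity network: two tight groups of three individuals each.
w = [
    [0, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 1, 0, 1],
    [0, 0, 0, 1, 1, 0],
]
partition = [0, 0, 0, 1, 1, 1]
print(modularity(w, partition), permutation_p_value(w, partition))
```

In practice the partition would come from a community-detection algorithm run on the genetic-similarity matrix rather than being supplied by hand, and the null model would need to be chosen to match the question being asked.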
Inference and Analysis of Population Structure Using Genetic Data and Network Theory.

PubMed

Greenbaum, Gili; Templeton, Alan R; Bar-David, Shirli

2016-04-01

Clustering individuals to subpopulations based on genetic data has become commonplace in many genetic studies. Inference about population structure is most often done by applying model-based approaches, aided by visualization using distance-based approaches such as multidimensional scaling. While existing distance-based approaches suffer from a lack of statistical rigor, model-based approaches entail assumptions of prior conditions such as that the subpopulations are at Hardy-Weinberg equilibria. Here we present a distance-based approach for inference about population structure using genetic data by defining population structure using network theory terminology and methods. A network is constructed from a pairwise genetic-similarity matrix of all sampled individuals. The community partition, a partition of a network to dense subgraphs, is equated with population structure, a partition of the population to genetically related groups. Community-detection algorithms are used to partition the network into communities, interpreted as a partition of the population to subpopulations. The statistical significance of the structure can be estimated by using permutation tests to evaluate the significance of the partition's modularity, a network theory measure indicating the quality of community partitions. To further characterize population structure, a new measure of the strength of association (SA) for an individual to its assigned community is presented. The strength of association distribution (SAD) of the communities is analyzed to provide additional population structure characteristics, such as the relative amount of gene flow experienced by the different subpopulations and identification of hybrid individuals. Human genetic data and simulations are used to demonstrate the applicability of the analyses. 
The approach presented here provides a novel, computationally efficient model-free method for inference about population structure that does not entail assumption of prior conditions. The method is implemented in the software NetStruct (available at https://giligreenbaum.wordpress.com/software/). Copyright © 2016 by the Genetics Society of America.
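The modularity score mentioned in the abstract above can be illustrated with a small sketch. This is the standard Newman modularity of a given partition, written in plain Python; it is not taken from NetStruct, and the toy network (two disconnected triangles) and the function name `modularity` are assumptions made for illustration only.

```python
from itertools import product

def modularity(adjacency, communities):
    """Newman modularity Q of a partition of an undirected, weighted network.

    adjacency   -- symmetric square matrix (list of lists) of edge weights
    communities -- community label of each node, in node order
    """
    n = len(adjacency)
    degree = [sum(row) for row in adjacency]   # weighted degree k_i
    two_m = sum(degree)                        # 2m: every edge counted from both ends
    q = 0.0
    for i, j in product(range(n), repeat=2):
        if communities[i] == communities[j]:
            # observed weight minus the weight expected under random wiring
            q += adjacency[i][j] - degree[i] * degree[j] / two_m
    return q / two_m

# Hypothetical toy network: two triangles with no edges between them.
triangle_pair = [
    [0, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 1, 0, 1],
    [0, 0, 0, 1, 1, 0],
]
q_split = modularity(triangle_pair, [0, 0, 0, 1, 1, 1])   # one community per triangle
q_merged = modularity(triangle_pair, [0, 0, 0, 0, 0, 0])  # everything in one community
print(q_split, q_merged)
```

In a permutation test of the kind the abstract describes, a score like `q_split` would be compared with the scores obtained after re-partitioning shuffled data; that step is omitted here.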
Approaches to Recruiting ‘Hard-To-Reach’ Populations into Research: A Review of the Literature

PubMed Central

Shaghaghi, Abdolreza; Bhopal, Raj S; Sheikh, Aziz

2011-01-01

Background: ‘Hard-to-reach’ is a term used to describe those sub-groups of the population that may be difficult to reach or involve in research or public health programmes. Applying a single term to these sub-sections of the population implies a homogeneity within distinct groups, which does not necessarily exist. Various sampling techniques have been introduced to recruit hard-to-reach populations. In this article, we have reviewed a range of approaches that have been used to widen participation in studies. Methods: We performed a PubMed and Google search for relevant English language articles using the keywords and phrases: (hard-to-reach AND population* OR sampl*), (hidden AND population* OR sample*) and (“hard to reach” AND population* OR sample*) and consulted the retrieved articles’ bibliographies to extract empirical evidence from publications that discussed or examined the use of sampling techniques to recruit hidden or hard-to-reach populations in health studies. Results: Reviewing the literature identified a range of techniques to recruit hard-to-reach populations, including snowball sampling, respondent-driven sampling (RDS), indigenous field worker sampling (IFWS), facility-based sampling (FBS), targeted sampling (TS), time-location (space) sampling (TLS), conventional cluster sampling (CCS) and capture re-capture sampling (CR). Conclusion: The degree of compliance with a study by a certain ‘hard-to-reach’ group depends on the characteristics of that group, the recruitment technique used and the subject of interest. Irrespective of potential advantages or limitations of the recruitment techniques reviewed, their successful use depends mainly upon our knowledge about specific characteristics of the target populations. 
Thus, in line with attempts to expand the current boundaries of our knowledge about recruitment techniques in health studies and their applications in varying situations, we should also consider all contributing factors that may affect the participation rate within a defined population group. PMID:24688904
Detection of Only Viable Bacterial Spores Using a Live/Dead Indicator in Mixed Populations

NASA Technical Reports Server (NTRS)

Behar, Alberto E.; Stam, Christina N.; Smiley, Ronald

2013-01-01

This method uses a photoaffinity label that recognizes DNA and can be used to distinguish populations of bacterial cells from bacterial spores without the use of heat shocking during conventional culture, and live from dead bacterial spores using molecular-based methods. Biological validation of commercial sterility using traditional and alternative technologies remains challenging. Recovery of viable spores is cumbersome, as the process requires substantial incubation time, and the extended time to results limits the ability to quickly evaluate the efficacy of existing technologies. Nucleic acid amplification approaches such as PCR (polymerase chain reaction) have shown promise for improving time to detection for a wide range of applications. Recent real-time PCR methods are particularly promising, as these methods can be made at least semi-quantitative by correspondence to a standard curve. Nonetheless, PCR-based methods are rarely used for process validation, largely because the DNA from dead bacterial cells is highly stable and hence, DNA-based amplification methods fail to discriminate between live and inactivated microorganisms. Currently, no published method has been shown to effectively distinguish between live and dead bacterial spores. This technology uses a DNA binding photoaffinity label that can be used to distinguish between live and dead bacterial spores with detection limits ranging from 10^9 to 10^2 spores/mL. An environmental sample suspected of containing a mixture of live and dead vegetative cells and bacterial endospores is treated with a photoaffinity label. This step will eliminate any vegetative cells (live or dead) and dead endospores present in the sample. To further determine the bacterial spore viability, DNA is extracted from the spores and total population is quantified by real-time PCR. The current NASA standard assay takes 72 hours for results. 
Part of this procedure requires a heat shock step at 80 °C for 15 minutes before the sample can be plated. Using a photoaffinity label would remove this step from the current assay as the label readily penetrates both live and dead bacterial cells. Secondly, the photoaffinity label can only penetrate dead bacterial spores, leaving behind the viable spore population. This would allow for rapid bacterial spore detection in a matter of hours compared to the several days that it takes for the NASA standard assay.

Respondent-Driven Sampling with Hard-to-Reach Emerging Adults: An Introduction and Case Study with Rural African Americans

ERIC Educational Resources Information Center

Kogan, Steven M.; Wejnert, Cyprian; Chen, Yi-fu; Brody, Gene H.; Slater, LaTrina M.

2011-01-01

Obtaining representative samples from populations of emerging adults who do not attend college is challenging for researchers. This article introduces respondent-driven sampling (RDS), a method for obtaining representative samples of hard-to-reach but socially interconnected populations. RDS combines a prescribed method for chain referral with a…
Serum markers for type II diabetes mellitus

DOEpatents

Metz, Thomas O; Qian, Wei-Jun; Jacobs, Jon M; Polpitiya, Ashoka D; Camp, II, David G; Smith, Richard D

2014-03-18

A method for identifying persons with increased risk of developing type 2 diabetes mellitus utilizing selected biomarkers described hereafter either alone or in combination. The present invention allows for broad based, reliable, screening of large population bases and provides other advantages, including the formulation of effective strategies for characterizing, archiving, and contrasting data from multiple sample types under varying conditions.
Laser desorption mass spectrometry for molecular diagnosis

NASA Astrophysics Data System (ADS)

Chen, C. H. Winston; Taranenko, N. I.; Zhu, Y. F.; Allman, S. L.; Tang, K.; Matteson, K. J.; Chang, L. Y.; Chung, C. N.; Martin, Steve; Haff, Lawrence

1996-04-01

Laser desorption mass spectrometry has been used for molecular diagnosis of cystic fibrosis. Both a 3-base deletion and a single-base point mutation have been successfully detected in clinical samples. This new detection method could speed up diagnosis by one order of magnitude in the future. It may become a new biotechnology technique for population screening of genetic disease.
Quantifying and Mitigating the Effect of Preferential Sampling on Phylodynamic Inference

PubMed Central

Karcher, Michael D.; Palacios, Julia A.; Bedford, Trevor; Suchard, Marc A.; Minin, Vladimir N.

2016-01-01

Phylodynamics seeks to estimate effective population size fluctuations from molecular sequences of individuals sampled from a population of interest. One way to accomplish this task formulates an observed sequence data likelihood exploiting a coalescent model for the sampled individuals’ genealogy and then integrating over all possible genealogies via Monte Carlo or, less efficiently, by conditioning on one genealogy estimated from the sequence data. However, when analyzing sequences sampled serially through time, current methods implicitly assume either that sampling times are fixed deterministically by the data collection protocol or that their distribution does not depend on the size of the population. Through simulation, we first show that, when sampling times do probabilistically depend on effective population size, estimation methods may be systematically biased. To correct for this deficiency, we propose a new model that explicitly accounts for preferential sampling by modeling the sampling times as an inhomogeneous Poisson process dependent on effective population size. We demonstrate that in the presence of preferential sampling our new model not only reduces bias, but also improves estimation precision. Finally, we compare the performance of the currently used phylodynamic methods with our proposed model through clinically-relevant, seasonal human influenza examples. PMID:26938243
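To make the idea of sampling times drawn from an inhomogeneous Poisson process concrete, here is a minimal simulation sketch using thinning (the Lewis-Shedler method). The intensity form `scale * ne(t) ** power`, the seasonal trajectory `seasonal_ne`, and all parameter values are hypothetical choices for illustration; the abstract states only that the intensity depends on effective population size.

```python
import math
import random

def sample_times(ne, t_end, scale, power, ne_max, rng):
    """Draw event times on [0, t_end] from an inhomogeneous Poisson process.

    The intensity scale * ne(t) ** power is an assumed form, with power >= 0.
    ne_max must bound ne(t) from above on [0, t_end] so that thinning is valid.
    """
    lam_max = scale * ne_max ** power
    times, t = [], 0.0
    while True:
        t += rng.expovariate(lam_max)   # next candidate from a homogeneous process
        if t > t_end:
            return times
        # keep the candidate with probability intensity(t) / lam_max
        if rng.random() < scale * ne(t) ** power / lam_max:
            times.append(t)

def seasonal_ne(t):
    """Hypothetical seasonal effective population size, peaking early in each year."""
    return 100.0 + 80.0 * math.sin(2.0 * math.pi * t)

rng = random.Random(1)
times = sample_times(seasonal_ne, t_end=5.0, scale=0.2, power=1.0,
                     ne_max=180.0, rng=rng)
high = sum(1 for t in times if t % 1.0 < 0.5)   # half-years with large population
low = len(times) - high                          # half-years with small population
print(len(times), high, low)
```

Under these assumed settings, more sequences are collected when the population is large, which is the preferential-sampling pattern the abstract describes.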
Non-invasive genetic censusing and monitoring of primate populations.

PubMed

Arandjelovic, Mimi; Vigilant, Linda

2018-03-01

Knowing the density or abundance of primate populations is essential for their conservation management and contextualizing socio-demographic and behavioral observations. When direct counts of animals are not possible, genetic analysis of non-invasive samples collected from wildlife populations allows estimates of population size with higher accuracy and precision than is possible using indirect signs. Furthermore, in contrast to traditional indirect survey methods, prolonged or periodic genetic sampling across months or years enables inference of group membership, movement, dynamics, and some kin relationships. Data may also be used to estimate sex ratios, sex differences in dispersal distances, and detect gene flow among locations. Recent advances in capture-recapture models have further improved the precision of population estimates derived from non-invasive samples. Simulations using these methods have shown that the confidence interval of point estimates includes the true population size when assumptions of the models are met, and therefore this range of population size minima and maxima should be emphasized in population monitoring studies. Innovations such as the use of sniffer dogs or anti-poaching patrols for sample collection are important to ensure adequate sampling, and the expected development of efficient and cost-effective genotyping by sequencing methods for DNAs derived from non-invasive samples will automate and speed analyses. © 2018 Wiley Periodicals, Inc.
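As a point of reference for the capture-recapture estimates discussed above, the sketch below implements Chapman's bias-corrected form of the classic two-session Lincoln-Petersen estimator. It is a textbook estimator, not one of the recent models the abstract refers to, and the genotype counts are hypothetical.

```python
def chapman_estimate(n_first, n_second, n_recaptured):
    """Chapman's bias-corrected Lincoln-Petersen estimate of population size.

    n_first      -- individuals genotyped in the first sampling session
    n_second     -- individuals genotyped in the second session
    n_recaptured -- individuals detected in both sessions
    """
    return (n_first + 1) * (n_second + 1) / (n_recaptured + 1) - 1

# Hypothetical counts of distinct genotypes from two sampling sessions.
estimate = chapman_estimate(n_first=40, n_second=35, n_recaptured=14)
print(round(estimate, 1))
```

The estimator assumes a closed population and equal detection probability for all individuals, which is the kind of model assumption the abstract says must be met for the confidence interval to be trusted.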
Clinical prediction and the idea of a population.

PubMed

Armstrong, David

2017-04-01

Using an analysis of the British Medical Journal over the past 170 years, this article describes how changes in the idea of a population have informed new technologies of medical prediction. These approaches have largely replaced older ideas of clinical prognosis based on understanding the natural histories of the underlying pathologies. The 19th-century idea of a population, which provided a denominator for medical events such as births and deaths, was constrained in its predictive power by its method of enumerating individual bodies. During the 20th century, populations were increasingly constructed through inferential techniques based on patient groups and samples seen to possess variable characteristics. The emergence of these new virtual populations created the conditions for the emergence of predictive algorithms that are used to foretell our medical futures.
Sex determination using humeral dimensions in a sample from KwaZulu-Natal: an osteometric study

PubMed Central

Ogedengbe, Oluwatosin Olalekan; Ajayi, Sunday Adelaja; Komolafe, Omobola Aderibigbe; Zaw, Aung Khaing; Naidu, Edwin Coleridge Stephen

2017-01-01

The morphological characteristics of the humeral bone have been investigated in recent times, with studies showing varying degrees of sexual dimorphism. Osteologists and forensic scientists have shown that sex determination methods based on skeletal measurements are population specific, and these population-specific variations are present in many body dimensions. The present study aims to establish sex identification using osteometric standards for the humerus in a contemporary KwaZulu-Natal population. A total of 11 parameters were measured in a sample of n=211 humeri (males, 113; females, 98) from the osteological collection in the Discipline of Clinical Anatomy, Nelson R. Mandela School of Medicine, University of KwaZulu-Natal, Durban, South Africa. The means for nearly all variables were found to be significantly higher in males than in females (P<0.01), with the most effective single parameter for predicting sex being the vertical head diameter, which had an accuracy of 82.5%. Stepwise discriminant analysis increased the overall accuracy rate to 87.7% when all measurements were jointly applied. We conclude that the humerus is an important bone which can be reliably used for sex determination based on standard metric methods despite minor tribal or ancestral differences amongst an otherwise homogeneous population. PMID:29043096
Sex determination using humeral dimensions in a sample from KwaZulu-Natal: an osteometric study.

PubMed

Ogedengbe, Oluwatosin Olalekan; Ajayi, Sunday Adelaja; Komolafe, Omobola Aderibigbe; Zaw, Aung Khaing; Naidu, Edwin Coleridge Stephen; Okpara Azu, Onyemaechi

2017-09-01

The morphological characteristics of the humeral bone have been investigated in recent times, with studies showing varying degrees of sexual dimorphism. Osteologists and forensic scientists have shown that sex determination methods based on skeletal measurements are population specific, and these population-specific variations are present in many body dimensions. The present study aims to establish sex identification using osteometric standards for the humerus in a contemporary KwaZulu-Natal population. A total of 11 parameters were measured in a sample of n=211 humeri (males, 113; females, 98) from the osteological collection in the Discipline of Clinical Anatomy, Nelson R. Mandela School of Medicine, University of KwaZulu-Natal, Durban, South Africa. The means for nearly all variables were found to be significantly higher in males than in females (P<0.01), with the most effective single parameter for predicting sex being the vertical head diameter, which had an accuracy of 82.5%. Stepwise discriminant analysis increased the overall accuracy rate to 87.7% when all measurements were jointly applied. We conclude that the humerus is an important bone which can be reliably used for sex determination based on standard metric methods despite minor tribal or ancestral differences amongst an otherwise homogeneous population.
[New population curves in Spanish extremely preterm neonates].

PubMed

García-Muñoz Rodrigo, F; García-Alix Pérez, A; Figueras Aloy, J; Saavedra Santana, P

2014-08-01

Most anthropometric reference data for extremely preterm infants used in Spain are outdated and based on non-Spanish populations, or are derived from small hospital-based samples that failed to include neonates of borderline viability. To develop gender-specific, population-based curves for birth weight, length, and head circumference in extremely preterm Caucasian infants, using a large contemporary sample size of Spanish singletons. Anthropometric data from neonates ≤ 28 weeks of gestational age were collected between January 2002 and December 2010 using the Spanish database SEN1500. Gestational age was estimated according to obstetric data (early pregnancy ultrasound). The data were analyzed with the SPSS.20 package, and centile tables were created for males and females using the Cole and Green LMS method. This study presents the first population-based growth curves for extremely preterm infants, including those of borderline viability, in Spain. A sexual dimorphism is evident for all of the studied parameters, starting at early gestation. These new gender-specific and population-based data could be useful for the improvement of growth assessments of extremely preterm infants in our country, for the development of epidemiological studies, for the evaluation of temporal trends, and for clinical or public health interventions seeking to optimize fetal growth. Copyright © 2013 Asociación Española de Pediatría. Published by Elsevier Espana. All rights reserved.
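The LMS method named in the abstract above summarises each age group by three values: L (Box-Cox power), M (median) and S (coefficient of variation). The sketch below shows the standard conversions between a measurement, its z-score and a centile. The L, M and S values are hypothetical and are not taken from the SEN1500 curves.

```python
import math
from statistics import NormalDist

def lms_zscore(y, L, M, S):
    """z-score of measurement y under the LMS model."""
    if L == 0:
        return math.log(y / M) / S
    return ((y / M) ** L - 1.0) / (L * S)

def lms_centile(z, L, M, S):
    """Measurement that corresponds to z-score z under the LMS model."""
    if L == 0:
        return M * math.exp(S * z)
    return M * (1.0 + L * S * z) ** (1.0 / L)

# Hypothetical parameters for one gestational-age group (birth weight in grams).
L, M, S = 0.5, 800.0, 0.15
z97 = NormalDist().inv_cdf(0.97)     # z-score of the 97th centile
p50 = lms_centile(0.0, L, M, S)      # the 50th centile is the median M
p97 = lms_centile(z97, L, M, S)
print(round(p50, 1), round(p97, 1))
```

A centile table is then a set of such values computed for each gestational week, with L, M and S smoothed across weeks.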
Probability of detecting Porcine reproductive and respiratory syndrome virus infection using pen-based swine oral fluid specimens as a function of within-pen prevalence.

PubMed

Olsen, Chris; Wang, Chong; Christopher-Hennings, Jane; Doolittle, Kent; Harmon, Karen M; Abate, Sarah; Kittawornrat, Apisit; Lizano, Sergio; Main, Rodger; Nelson, Eric A; Otterson, Tracy; Panyasing, Yaowalak; Rademacher, Chris; Rauh, Rolf; Shah, Rohan; Zimmerman, Jeffrey

2013-05-01

Pen-based oral fluid sampling has proven to be an efficient method for surveillance of infectious diseases in swine populations. To better interpret diagnostic results, the performance of oral fluid assays (antibody- and nucleic acid-based) must be established for pen-based oral fluid samples. Therefore, the objective of the current study was to determine the probability of detecting Porcine reproductive and respiratory syndrome virus (PRRSV) infection in pen-based oral fluid samples from pens of known PRRSV prevalence. In 1 commercial swine barn, 25 pens were assigned to 1 of 5 levels of PRRSV prevalence (0%, 4%, 12%, 20%, or 36%) by placing a fixed number (0, 1, 3, 5, or 9) of PRRSV-positive pigs (14 days post PRRSV modified live virus vaccination) in each pen. Prior to placement of the vaccinated pigs, 1 oral fluid sample was collected from each pen. Thereafter, 5 oral fluid samples were collected from each pen, for a total of 150 samples. To confirm individual pig PRRSV status, serum samples from the PRRSV-negative pigs (n = 535) and the PRRSV vaccinated pigs (n = 90) were tested for PRRSV antibodies and PRRSV RNA. The 150 pen-based oral fluid samples were assayed for PRRSV antibody and PRRSV RNA at 6 laboratories. Among the 100 samples from pens containing ≥1 positive pig (≥4% prevalence) and tested at the 6 laboratories, the mean positivity was 62% for PRRSV RNA and 61% for PRRSV antibody. These results support the use of pen-based oral fluid sampling for PRRSV surveillance in commercial pig populations.
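The design figures in the abstract above can be cross-checked with a few lines of arithmetic. Two assumptions are made that the abstract does not state outright: each pen holds 25 pigs (implied by 1 positive pig giving 4% prevalence), and the 25 pens are split evenly across the five prevalence levels.

```python
# Design figures quoted in the abstract.
positives_per_level = [0, 1, 3, 5, 9]   # PRRSV-positive pigs placed per pen
total_pens = 25
samples_before, samples_after = 1, 5    # oral fluid samples collected per pen

# Assumptions, not stated in the abstract: 25 pigs per pen and a balanced design.
pigs_per_pen = 25
pens_per_level = total_pens // len(positives_per_level)

prevalence_pct = [100 * k / pigs_per_pen for k in positives_per_level]
total_samples = total_pens * (samples_before + samples_after)
positive_pens = pens_per_level * sum(1 for k in positives_per_level if k >= 1)
samples_from_positive_pens = positive_pens * samples_after
vaccinated_pigs = pens_per_level * sum(positives_per_level)
negative_pigs = total_pens * pigs_per_pen - vaccinated_pigs

print(prevalence_pct, total_samples, samples_from_positive_pens,
      vaccinated_pigs, negative_pigs)
```

Under these assumptions the computed values reproduce the reported prevalence levels, the 150 samples, the 100 samples from pens with at least one positive pig, and the 90 vaccinated and 535 negative pigs.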
CoMet: a workflow using contig coverage and composition for binning a metagenomic sample with high precision.

PubMed

Herath, Damayanthi; Tang, Sen-Lin; Tandon, Kshitij; Ackland, David; Halgamuge, Saman Kumara

2017-12-28

In metagenomics, the separation of nucleotide sequences belonging to an individual or closely matched populations is termed binning. Binning helps the evaluation of underlying microbial population structure as well as the recovery of individual genomes from a sample of uncultivable microbial organisms. Both supervised and unsupervised learning methods have been employed in binning; however, characterizing a metagenomic sample containing multiple strains remains a significant challenge. In this study, we designed and implemented a new workflow, Coverage and composition based binning of Metagenomes (CoMet), for binning contigs in a single metagenomic sample. CoMet utilizes coverage values and the compositional features of metagenomic contigs. The binning strategy in CoMet includes the initial grouping of contigs in guanine-cytosine (GC) content-coverage space and refinement of bins in tetranucleotide frequencies space in a purely unsupervised manner. With CoMet, the clustering algorithm DBSCAN is employed for binning contigs. The performances of CoMet were compared against four existing approaches for binning a single metagenomic sample, including MaxBin, Metawatt, MyCC (default) and MyCC (coverage) using multiple datasets including a sample comprised of multiple strains. Binning methods based on both compositional features and coverages of contigs had higher performances than the method which is based only on compositional features of contigs. CoMet yielded higher or comparable precision in comparison to the existing binning methods on benchmark datasets of varying complexities. MyCC (coverage) had the highest ranking score in F1-score. However, the performances of CoMet were higher than MyCC (coverage) on the dataset containing multiple strains. Furthermore, CoMet recovered contigs of more species and was 18 - 39% higher in precision than the compared existing methods in discriminating species from the sample of multiple strains. 
CoMet resulted in higher precision than MyCC (default) and MyCC (coverage) on a real metagenome. The approach proposed with CoMet for binning contigs improves the precision of binning while characterizing more species in a single metagenomic sample and in a sample containing multiple strains. The F1-scores obtained from different binning strategies vary with different datasets; however, CoMet yields the highest F1-score with a sample comprised of multiple strains.
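The two compositional features named in the CoMet abstract, GC content and tetranucleotide frequencies, can be computed for a contig as in the sketch below. This is a generic illustration, not CoMet's code: the function names and the toy sequence are assumptions, and real pipelines typically also merge reverse-complement tetramers and handle ambiguous bases.

```python
from collections import Counter

def gc_content(seq):
    """Fraction of bases in seq that are G or C."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)

def tetranucleotide_freqs(seq):
    """Relative frequency of each overlapping 4-mer observed in seq."""
    seq = seq.upper()
    counts = Counter(seq[i:i + 4] for i in range(len(seq) - 3))
    total = sum(counts.values())
    return {kmer: count / total for kmer, count in counts.items()}

contig = "ACGTACGT"   # hypothetical toy contig
gc = gc_content(contig)
tnf = tetranucleotide_freqs(contig)
print(gc, tnf)
```

Each contig then becomes a point (GC content, coverage) for the initial grouping and a tetranucleotide-frequency vector for refinement, which a density-based clusterer such as DBSCAN can partition.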
What is a representative brain? Neuroscience meets population science.

PubMed

Falk, Emily B; Hyde, Luke W; Mitchell, Colter; Faul, Jessica; Gonzalez, Richard; Heitzeg, Mary M; Keating, Daniel P; Langa, Kenneth M; Martz, Meghan E; Maslowsky, Julie; Morrison, Frederick J; Noll, Douglas C; Patrick, Megan E; Pfeffer, Fabian T; Reuter-Lorenz, Patricia A; Thomason, Moriah E; Davis-Kean, Pamela; Monk, Christopher S; Schulenberg, John

2013-10-29

The last decades of neuroscience research have produced immense progress in the methods available to understand brain structure and function. Social, cognitive, clinical, affective, economic, communication, and developmental neurosciences have begun to map the relationships between neuro-psychological processes and behavioral outcomes, yielding a new understanding of human behavior and promising interventions. However, a limitation of this fast moving research is that most findings are based on small samples of convenience. Furthermore, our understanding of individual differences may be distorted by unrepresentative samples, undermining findings regarding brain-behavior mechanisms. These limitations are issues that social demographers, epidemiologists, and other population scientists have tackled, with solutions that can be applied to neuroscience. By contrast, nearly all social science disciplines, including social demography, sociology, political science, economics, communication science, and psychology, make assumptions about processes that involve the brain, but have incorporated neural measures to differing, and often limited, degrees; many still treat the brain as a black box. In this article, we describe and promote a perspective--population neuroscience--that leverages interdisciplinary expertise to (i) emphasize the importance of sampling to more clearly define the relevant populations and sampling strategies needed when using neuroscience methods to address such questions; and (ii) deepen understanding of mechanisms within population science by providing insight regarding underlying neural mechanisms. Doing so will increase our confidence in the generalizability of the findings. We provide examples to illustrate the population neuroscience approach for specific types of research questions and discuss the potential for theoretical and applied advances from this approach across areas.
What is a representative brain? Neuroscience meets population science

PubMed Central

Falk, Emily B.; Hyde, Luke W.; Mitchell, Colter; Faul, Jessica; Gonzalez, Richard; Heitzeg, Mary M.; Keating, Daniel P.; Langa, Kenneth M.; Martz, Meghan E.; Maslowsky, Julie; Morrison, Frederick J.; Noll, Douglas C.; Patrick, Megan E.; Pfeffer, Fabian T.; Reuter-Lorenz, Patricia A.; Thomason, Moriah E.; Davis-Kean, Pamela; Monk, Christopher S.; Schulenberg, John

2013-01-01

The last decades of neuroscience research have produced immense progress in the methods available to understand brain structure and function. Social, cognitive, clinical, affective, economic, communication, and developmental neurosciences have begun to map the relationships between neuro-psychological processes and behavioral outcomes, yielding a new understanding of human behavior and promising interventions. However, a limitation of this fast moving research is that most findings are based on small samples of convenience. Furthermore, our understanding of individual differences may be distorted by unrepresentative samples, undermining findings regarding brain–behavior mechanisms. These limitations are issues that social demographers, epidemiologists, and other population scientists have tackled, with solutions that can be applied to neuroscience. By contrast, nearly all social science disciplines, including social demography, sociology, political science, economics, communication science, and psychology, make assumptions about processes that involve the brain, but have incorporated neural measures to differing, and often limited, degrees; many still treat the brain as a black box. In this article, we describe and promote a perspective—population neuroscience—that leverages interdisciplinary expertise to (i) emphasize the importance of sampling to more clearly define the relevant populations and sampling strategies needed when using neuroscience methods to address such questions; and (ii) deepen understanding of mechanisms within population science by providing insight regarding underlying neural mechanisms. Doing so will increase our confidence in the generalizability of the findings. We provide examples to illustrate the population neuroscience approach for specific types of research questions and discuss the potential for theoretical and applied advances from this approach across areas. PMID:24151336
TNO/Centaurs grouping tested with asteroid data sets

NASA Astrophysics Data System (ADS)

Fulchignoni, M.; Birlan, M.; Barucci, M. A.

2001-11-01

Recently, we discussed the possible subdivision into a few groups of a sample of 22 TNOs and Centaurs for which BVRIJ photometry was available (Barucci et al., 2001, A&A, 371, 1150). We obtained these results using the multivariate statistics adopted to define the current asteroid taxonomy, namely Principal Components Analysis and the G-mode method (Tholen & Barucci, 1989, in ASTEROIDS II). How do these methods work with a statistical sample as small as the TNO/Centaur one? Theoretically, the number of degrees of freedom of the sample is adequate: it is 88 in our case and has to be larger than 50 to meet the requirements of the G-mode. Does a random sampling of a small number of members of a large population contain enough information to reveal some structure in the population? We extracted several samples of 22 asteroids out of a database of 86 objects of known taxonomic type for which BVRIJ photometry is available from ECAS (Zellner et al. 1985, ICARUS 61, 355), SMASS II (S.W. Bus, 1999, PhD Thesis, MIT), and the Bell et al. Atlas of the asteroid infrared spectra. The objects constituting the first sample were selected to give a good representation of the major asteroid taxonomic classes (at least three samples per class): C, S, D, A, and G. Both methods were able to distinguish all these groups, confirming the validity of the adopted methods. The S class is hard to identify as a consequence of the choice of the I and J variables, which implies a lack of information on the absorption band at 1 micron. The other samples were obtained by random choice of the objects. Not all the major groups were well represented (fewer than three samples per group), but the general trend of the asteroid taxonomy was always recovered. We conclude that the quoted grouping of TNOs/Centaurs is representative of some physico-chemical structure of the outer solar system small body population.
One-step estimation of networked population size: Respondent-driven capture-recapture with anonymity.

PubMed

Khan, Bilal; Lee, Hsuan-Wei; Fellows, Ian; Dombrowski, Kirk

2018-01-01

Size estimation is particularly important for populations whose members experience disproportionate health issues or pose elevated health risks to the ambient social structures in which they are embedded. Efforts to derive size estimates are often frustrated when the population is hidden or hard-to-reach in ways that preclude conventional survey strategies, as is the case when social stigma is associated with group membership or when group members are involved in illegal activities. This paper extends prior research on the problem of network population size estimation, building on established survey/sampling methodologies commonly used with hard-to-reach groups. Three novel one-step, network-based population size estimators are presented, for use in the context of uniform random sampling, respondent-driven sampling, and when networks exhibit significant clustering effects. We give provably sufficient conditions for the consistency of these estimators in large configuration networks. Simulation experiments across a wide range of synthetic network topologies validate the performance of the estimators, which also perform well on a real-world location-based social networking data set with significant clustering. Finally, the proposed schemes are extended to allow them to be used in settings where participant anonymity is required. Systematic experiments show favorable tradeoffs between anonymity guarantees and estimator performance. Taken together, we demonstrate that reasonable population size estimates are derived from anonymous respondent driven samples of 250-750 individuals, within ambient populations of 5,000-40,000. The method thus represents a novel and cost-effective means for health planners and those agencies concerned with health and disease surveillance to estimate the size of hidden populations. We discuss limitations and future work in the concluding section.
Methods for estimating population coverage of mass distribution programmes: a review of practices in relation to trachoma control.

PubMed

Cromwell, Elizabeth A; Ngondi, Jeremiah; McFarland, Deborah; King, Jonathan D; Emerson, Paul M

2012-10-01

In the context of trachoma control, population coverage with mass drug administration (MDA) using antibiotics is measured using routine data. Due to the limitations of administrative records as well as the potential for bias from incomplete or incorrect records, a literature review of coverage survey methods applied in neglected tropical disease control programmes and immunisation outreach was conducted to inform the design of coverage surveys for trachoma control. Several methods were identified, including the '30 × 7' survey method for the Expanded Programme on Immunization (EPI 30×7), other cluster random sampling (CRS) methods, lot quality assurance sampling (LQAS), purposive sampling and routine data. When compared against one another, the EPI and other CRS methods produced similar population coverage estimates, whilst LQAS, purposive sampling and use of administrative data did not generate estimates consistent with CRS. In conclusion, CRS methods present a consistent approach for MDA coverage surveys despite different methods of household selection. They merit use until standard guidelines are available. CRS methods should be used to verify population coverage derived from LQAS, purposive sampling methods and administrative reports. Copyright © 2012 Royal Society of Tropical Medicine and Hygiene. Published by Elsevier Ltd. All rights reserved.
Sampling Key Populations for HIV Surveillance: Results From Eight Cross-Sectional Studies Using Respondent-Driven Sampling and Venue-Based Snowball Sampling.

PubMed

Rao, Amrita; Stahlman, Shauna; Hargreaves, James; Weir, Sharon; Edwards, Jessie; Rice, Brian; Kochelani, Duncan; Mavimbela, Mpumelelo; Baral, Stefan

2017-10-20

In using regularly collected or existing surveillance data to characterize engagement in human immunodeficiency virus (HIV) services among marginalized populations, differences in sampling methods may produce different pictures of the target population and may therefore result in different priorities for response. The objective of this study was to use existing data to evaluate the sample distribution of eight studies of female sex workers (FSW) and men who have sex with men (MSM), who were recruited using different sampling approaches in two locations within Sub-Saharan Africa: Manzini, Swaziland and Yaoundé, Cameroon. MSM and FSW participants were recruited using either respondent-driven sampling (RDS) or venue-based snowball sampling. Recruitment took place between 2011 and 2016. Participants at each study site were administered a face-to-face survey to assess sociodemographics, along with the prevalence of self-reported HIV status, frequency of HIV testing, stigma, and other HIV-related characteristics. Crude and RDS-adjusted prevalence estimates were calculated. Crude prevalence estimates from the venue-based snowball samples were compared with the overlap of the RDS-adjusted prevalence estimates, between both FSW and MSM in Cameroon and Swaziland. RDS samples tended to be younger (MSM aged 18-21 years in Swaziland: 47.6% [139/310] in RDS vs 24.3% [42/173] in Snowball, in Cameroon: 47.9% [99/306] in RDS vs 20.1% [52/259] in Snowball; FSW aged 18-21 years in Swaziland 42.5% [82/325] in RDS vs 8.0% [20/249] in Snowball; in Cameroon 15.6% [75/576] in RDS vs 8.1% [25/306] in Snowball). 
RDS samples were also less educated (MSM: primary school completed or less in Swaziland 42.6% [109/310] in RDS vs 4.0% [7/173] in Snowball, in Cameroon 46.2% [138/306] in RDS vs 14.3% [37/259] in Snowball; FSW: primary school completed or less in Swaziland 86.6% [281/325] in RDS vs 23.9% [59/247] in Snowball, in Cameroon 87.4% [520/576] in RDS vs 77.5% [238/307] in Snowball) than the snowball samples. In addition, RDS samples indicated lower exposure to HIV prevention information, less knowledge about HIV prevention, limited access to HIV prevention tools such as condoms, and lower reported frequency of sexually transmitted infections (STI) and HIV testing as compared with the venue-based samples. Findings pertaining to the level of disclosure of sexual practices and sexual practice-related stigma were mixed. Samples generated by RDS and venue-based snowball sampling produced significantly different prevalence estimates of several important characteristics. These findings are tempered by limitations to the application of both approaches in practice. Ultimately, these findings provide further context for understanding existing surveillance data and how differences in methods of sampling can influence both the type of individuals captured and whether or not these individuals are representative of the larger target population. These data highlight the need to consider how program coverage estimates of marginalized populations are determined when characterizing the level of unmet need. ©Amrita Rao, Shauna Stahlman, James Hargreaves, Sharon Weir, Jessie Edwards, Brian Rice, Duncan Kochelani, Mpumelelo Mavimbela, Stefan Baral. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 20.10.2017.
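The difference between a crude and an RDS-adjusted prevalence estimate can be sketched as follows. The abstract does not say which RDS estimator was used, so this example assumes the inverse-degree-weighted (RDS-II / Volz-Heckathorn) form; the function names and the toy data are hypothetical.

```python
def crude_prevalence(outcomes):
    """Unweighted share of respondents with the characteristic (1 = yes, 0 = no)."""
    return sum(outcomes) / len(outcomes)

def rds_ii_prevalence(outcomes, degrees):
    """Inverse-degree-weighted prevalence (RDS-II form, assumed here).

    Respondents with large personal networks are more likely to be recruited,
    so each respondent is down-weighted by their reported network size.
    """
    weights = [1.0 / d for d in degrees]
    weighted_yes = sum(w * y for w, y in zip(weights, outcomes))
    return weighted_yes / sum(weights)

# Hypothetical toy data: two low-degree respondents with the characteristic,
# two high-degree respondents without it.
outcomes = [1, 1, 0, 0]
degrees = [2, 2, 10, 10]
crude = crude_prevalence(outcomes)
adjusted = rds_ii_prevalence(outcomes, degrees)
```

When every respondent reports the same network size, the two estimates coincide; they diverge only when the characteristic is associated with network size.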
Tobacco, Marijuana, and Alcohol Use in University Students: A Cluster Analysis

PubMed Central

Primack, Brian A.; Kim, Kevin H.; Shensa, Ariel; Sidani, Jaime E.; Barnett, Tracey E.; Switzer, Galen E.

2012-01-01

Objective Segmentation of populations may facilitate development of targeted substance abuse prevention programs. We aimed to partition a national sample of university students according to profiles based on substance use. Participants We used 2008–2009 data from the National College Health Assessment from the American College Health Association. Our sample consisted of 111,245 individuals from 158 institutions. Method We partitioned the sample using cluster analysis according to current substance use behaviors. We examined the association of cluster membership with individual and institutional characteristics. Results Cluster analysis yielded six distinct clusters. Three individual factors—gender, year in school, and fraternity/sorority membership—were the most strongly associated with cluster membership. Conclusions In a large sample of university students, we were able to identify six distinct patterns of substance abuse. It may be valuable to target specific populations of college-aged substance users based on individual factors. However, comprehensive intervention will require a multifaceted approach. PMID:22686360
Utilizing Big Data and Twitter to Discover Emergent Online Communities of Cannabis Users

PubMed Central

Baumgartner, Peter; Peiper, Nicholas

2017-01-01

Large shifts in medical, recreational, and illicit cannabis consumption in the United States have implications for personalizing treatment and prevention programs to a wide variety of populations. As such, considerable research has investigated clinical presentations of cannabis users in clinical and population-based samples. Studies leveraging big data, social media, and social network analysis have emerged as a promising mechanism to generate timely insights that can inform treatment and prevention research. This study extends a novel method called stochastic block modeling to derive communities of cannabis consumers as part of a complex social network on Twitter. A set of examples illustrate how this method can ascertain candidate samples of medical, recreational, and illicit cannabis users. Implications for research planning, intervention design, and public health surveillance are discussed. PMID:28615950
Sample Size Calculations for Population Size Estimation Studies Using Multiplier Methods With Respondent-Driven Sampling Surveys.

PubMed

Fearon, Elizabeth; Chabata, Sungai T; Thompson, Jennifer A; Cowan, Frances M; Hargreaves, James R

2017-09-14

While guidance exists for obtaining population size estimates using multiplier methods with respondent-driven sampling surveys, we lack specific guidance for making sample size decisions. Our objective was to guide the design of multiplier method population size estimation studies using respondent-driven sampling surveys so as to reduce the random error around the estimate obtained. The population size estimate is obtained by dividing the number of individuals receiving a service or the number of unique objects distributed (M) by the proportion of individuals in a representative survey who report receipt of the service or object (P). We have developed an approach to sample size calculation, interpreting methods to estimate the variance around estimates obtained using multiplier methods in conjunction with research into design effects and respondent-driven sampling. We describe an application to estimate the number of female sex workers in Harare, Zimbabwe. There is high variance in estimates. Random error around the size estimate reflects uncertainty from M and P, particularly when the estimate of P in the respondent-driven sampling survey is low. As expected, sample size requirements are higher when the design effect of the survey is assumed to be greater. We suggest a method for investigating the effects of sample size on the precision of a population size estimate obtained using multiplier methods and respondent-driven sampling. Uncertainty in the size estimate is high, particularly when P is small, so balancing against other potential sources of bias, we advise researchers to consider longer service attendance reference periods and to distribute more unique objects, which is likely to result in a higher estimate of P in the respondent-driven sampling survey. ©Elizabeth Fearon, Sungai T Chabata, Jennifer A Thompson, Frances M Cowan, James R Hargreaves. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 14.09.2017.
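The multiplier estimate N = M / P, and the way its uncertainty widens as P shrinks, can be sketched as follows. This is an illustration under stated assumptions, not the authors' calculation: M is treated as a fixed count, the variance of P is a simple binomial variance inflated by an assumed design effect, the interval comes from a first-order delta-method approximation, and all numbers are hypothetical.

```python
import math

def multiplier_estimate(M, p_hat, n, design_effect=2.0, z=1.96):
    """Population size N = M / P with an approximate delta-method interval.

    M             -- service users or unique objects distributed (treated as fixed)
    p_hat         -- survey proportion reporting receipt of the service or object
    n             -- survey sample size
    design_effect -- assumed variance inflation for the RDS design (hypothetical)
    """
    n_hat = M / p_hat
    var_p = design_effect * p_hat * (1 - p_hat) / n
    se_n = M * math.sqrt(var_p) / p_hat ** 2  # delta method: |dN/dP| = M / P^2
    return n_hat, max(n_hat - z * se_n, 0.0), n_hat + z * se_n

# Same M and n, different P: the relative interval width grows as P falls.
est_high_p = multiplier_estimate(M=1000, p_hat=0.25, n=400)
est_low_p = multiplier_estimate(M=1000, p_hat=0.05, n=400)
```

Under these assumptions the relative standard error of N equals that of P, which is why a larger P (longer reference periods, more objects distributed) tightens the estimate for a given sample size.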

Characterization of Aspergillus section Nigri species populations in vineyard soil using droplet digital PCR.

PubMed

Palumbo, J D; O'Keeffe, T L; Fidelibus, M W

2016-12-01

Identification of populations of Aspergillus section Nigri species in environmental samples using traditional methods is laborious and impractical for large numbers of samples. We developed species-specific primers and probes for quantitative droplet digital PCR (ddPCR) to improve sample throughput and simultaneously detect multiple species in each sample. The ddPCR method was used to distinguish Aspergillus niger, Aspergillus welwitschiae, Aspergillus tubingensis and Aspergillus carbonarius in mixed samples of total DNA. Relative abundance of each species measured by ddPCR agreed with input ratios of template DNAs. Soil samples were collected at six time points over two growing seasons from two raisin vineyards in Fresno County, California. Aspergillus section Nigri strains were detected in these soils in the range of 10²-10⁵ CFU g⁻¹. Relative abundance of each species varied widely among samples, but in 52 of 60 samples, A. niger was the most abundant species, ranging from 38 to 88% of the total population. In combination with total plate counts, this ddPCR method provides a high-throughput method for describing population dynamics of important potential mycotoxin-producing species in environmental samples. This is the first study to demonstrate the utility of ddPCR as a means to quantify species of Aspergillus section Nigri in soil. This method eliminates the need for isolation and sequence identification of individual fungal isolates, and allows for greater throughput in measuring relative population sizes of important (i.e. mycotoxigenic) Aspergillus species within a population of morphologically indistinguishable species. Published 2016. This article is a U.S. Government work and is in the public domain in the USA.
Spatially explicit population estimates for black bears based on cluster sampling

USGS Publications Warehouse

Humm, J.; McCown, J. Walter; Scheick, B.K.; Clark, Joseph D.

2017-01-01

We estimated abundance and density of the 5 major black bear (Ursus americanus) subpopulations (i.e., Eglin, Apalachicola, Osceola, Ocala-St. Johns, Big Cypress) in Florida, USA with spatially explicit capture-mark-recapture (SCR) by extracting DNA from hair samples collected at barbed-wire hair sampling sites. We employed a clustered sampling configuration with sampling sites arranged in 3 × 3 clusters spaced 2 km apart within each cluster and cluster centers spaced 16 km apart (center to center). We surveyed all 5 subpopulations encompassing 38,960 km² during 2014 and 2015. Several landscape variables, most associated with forest cover, helped refine density estimates for the 5 subpopulations we sampled. Detection probabilities were affected by site-specific behavioral responses coupled with individual capture heterogeneity associated with sex. Model-averaged bear population estimates ranged from 120 (95% CI = 59–276) bears or a mean 0.025 bears/km² (95% CI = 0.011–0.44) for the Eglin subpopulation to 1,198 bears (95% CI = 949–1,537) or 0.127 bears/km² (95% CI = 0.101–0.163) for the Ocala-St. Johns subpopulation. The total population estimate for our 5 study areas was 3,916 bears (95% CI = 2,914–5,451). The clustered sampling method coupled with information on land cover was efficient and allowed us to estimate abundance across extensive areas that would not have been possible otherwise. Clustered sampling combined with spatially explicit capture-recapture methods has the potential to provide rigorous population estimates for a wide array of species that are extensive and heterogeneous in their distribution.
The Use of Genetics for the Management of a Recovering Population: Temporal Assessment of Migratory Peregrine Falcons in North America

PubMed Central

Johnson, Jeff A.; Talbot, Sandra L.; Sage, George K.; Burnham, Kurt K.; Brown, Joseph W.; Maechtle, Tom L.; Seegar, William S.; Yates, Michael A.; Anderson, Bud; Mindell, David P.

2010-01-01

Background Our ability to monitor populations or species that were once threatened or endangered and in the process of recovery is enhanced by using genetic methods to assess overall population stability and size over time. This can be accomplished most directly by obtaining genetic measures from temporally-spaced samples that reflect the overall stability of the population as given by changes in genetic diversity levels (allelic richness and heterozygosity), degree of population differentiation (F_ST and D_EST), and effective population size (N_e). The primary goal of any recovery effort is to produce a long-term self-sustaining population, and these genetic measures provide a metric by which we can gauge our progress and help make important management decisions. Methodology/Principal Findings The peregrine falcon in North America (Falco peregrinus tundrius and anatum) was delisted in 1994 and 1999, respectively, and its abundance will be monitored by the species Recovery Team every three years until 2015. Although the United States Fish and Wildlife Service makes a distinction between tundrius and anatum subspecies, our genetic results based on eleven microsatellite loci suggest limited differentiation that can be attributed to an isolation by distance relationship and warrant no delineation of these two subspecies in its northern latitudinal distribution from Alaska through Canada into Greenland. Using temporal samples collected at Padre Island, Texas during migration (seven temporal time periods between 1985–2007), no significant differences in genetic diversity or significant population differentiation in allele frequencies between time periods were observed and were indistinguishable from those obtained from tundrius/anatum breeding locations throughout their northern distribution. Estimates of harmonic mean N_e were variable and imprecise, but always greater than 500 when employing multiple temporal genetic methods.
Conclusions/Significance These results, including those from simulations to assess the power of each method to estimate N_e, suggest a stable or growing population, which is consistent with ongoing field-based monitoring surveys. Therefore, historic and continuing efforts to prevent the extinction of the peregrine falcon in North America appear successful with no indication of recent decline, at least from the northern latitude range-wide perspective. The results also further highlight the importance of archiving samples and their use for continual assessment of population recovery and long-term viability. PMID:21124969
Design Appropriate Models Based on Intelligent Dimension in Fars Education Organization

ERIC Educational Resources Information Center

Goodarzi, Shahbaz; Fallah, Vahid; Saffarian, Saeid

2016-01-01

The purpose of this study is to determine the dimensions of smart schools in the Fars education system and to provide a suitable model. The research method is a descriptive survey. The study population consisted of all school principals in Fars Province in the 2014-2015 academic year, numbering 1,364. The sample size determined using the Cochran method was 302…
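A Cochran-type sample size calculation with a finite population correction can be sketched as follows. The abstract does not state the inputs, so the proportion (0.5), margin of error (0.05) and z value (1.96) below are conventional defaults assumed for illustration; with them the formula gives roughly 300 for a population of 1,364, close to but not identical to the 302 reported, which suggests the study used slightly different inputs or rounding.

```python
import math

def cochran_sample_size(population, p=0.5, margin=0.05, z=1.96):
    """Cochran's formula with finite population correction.

    n0 is the sample size for an effectively infinite population;
    the second step shrinks it for a finite population of the given size.
    All default parameter values are assumptions, not taken from the study.
    """
    n0 = z ** 2 * p * (1 - p) / margin ** 2
    n = n0 / (1 + (n0 - 1) / population)
    return math.ceil(n)

n_principals = cochran_sample_size(1364)
```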
Sampling methods, dispersion patterns, and fixed precision sequential sampling plans for western flower thrips (Thysanoptera: Thripidae) and cotton fleahoppers (Hemiptera: Miridae) in cotton.

PubMed

Parajulee, M N; Shrestha, R B; Leser, J F

2006-04-01

A 2-yr field study was conducted to examine the effectiveness of two sampling methods (visual and plant washing techniques) for western flower thrips, Frankliniella occidentalis (Pergande), and five sampling methods (visual, beat bucket, drop cloth, sweep net, and vacuum) for cotton fleahopper, Pseudatomoscelis seriatus (Reuter), in Texas cotton, Gossypium hirsutum (L.), and to develop sequential sampling plans for each pest. The plant washing technique gave similar results to the visual method in detecting adult thrips, but the washing technique detected significantly higher number of thrips larvae compared with the visual sampling. Visual sampling detected the highest number of fleahoppers followed by beat bucket, drop cloth, vacuum, and sweep net sampling, with no significant difference in catch efficiency between vacuum and sweep net methods. However, based on fixed precision cost reliability, the sweep net sampling was the most cost-effective method followed by vacuum, beat bucket, drop cloth, and visual sampling. Taylor's Power Law analysis revealed that the field dispersion patterns of both thrips and fleahoppers were aggregated throughout the crop growing season. For thrips management decision based on visual sampling (0.25 precision), 15 plants were estimated to be the minimum sample size when the estimated population density was one thrips per plant, whereas the minimum sample size was nine plants when thrips density approached 10 thrips per plant. The minimum visual sample size for cotton fleahoppers was 16 plants when the density was one fleahopper per plant, but the sample size decreased rapidly with an increase in fleahopper density, requiring only four plants to be sampled when the density was 10 fleahoppers per plant. Sequential sampling plans were developed and validated with independent data for both thrips and cotton fleahoppers.
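The way minimum sample size falls as pest density rises can be sketched with the standard fixed-precision formula built on Taylor's power law (variance = a · mean^b), n = a · mean^(b−2) / D², where D is the target precision expressed as standard error divided by the mean. The abstract does not report the fitted coefficients, so the values of `a` and `b` below are hypothetical, and the formula is the textbook form rather than necessarily the exact one the authors applied.

```python
import math

def min_sample_size(mean_density, a, b, precision=0.25):
    """Minimum number of sample units for a fixed precision level.

    Taylor's power law: variance = a * mean**b (b > 1 indicates aggregation).
    Precision D is SE / mean, so n = a * mean**(b - 2) / D**2.
    """
    return math.ceil(a * mean_density ** (b - 2) / precision ** 2)

# Hypothetical coefficients for an aggregated population (b between 1 and 2).
A_COEF, B_COEF = 1.2, 1.4
n_low_density = min_sample_size(1, A_COEF, B_COEF)    # one insect per plant
n_high_density = min_sample_size(10, A_COEF, B_COEF)  # ten insects per plant
```

For any b below 2, the required sample size decreases with density, which matches the pattern described in the abstract for both pests.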
Uncovering a latent multinomial: Analysis of mark-recapture data with misidentification

USGS Publications Warehouse

Link, W.A.; Yoshizaki, J.; Bailey, L.L.; Pollock, K.H.

2010-01-01

Natural tags based on DNA fingerprints or natural features of animals are now becoming very widely used in wildlife population biology. However, classic capture-recapture models do not allow for misidentification of animals which is a potentially very serious problem with natural tags. Statistical analysis of misidentification processes is extremely difficult using traditional likelihood methods but is easily handled using Bayesian methods. We present a general framework for Bayesian analysis of categorical data arising from a latent multinomial distribution. Although our work is motivated by a specific model for misidentification in closed population capture-recapture analyses, with crucial assumptions which may not always be appropriate, the methods we develop extend naturally to a variety of other models with similar structure. Suppose that observed frequencies f are a known linear transformation f = A'x of a latent multinomial variable x with cell probability vector π = π(θ). Given that full conditional distributions [θ | x] can be sampled, implementation of Gibbs sampling requires only that we can sample from the full conditional distribution [x | f, θ], which is made possible by knowledge of the null space of A'. We illustrate the approach using two data sets with individual misidentification, one simulated, the other summarizing recapture data for salamanders based on natural marks. © 2009, The International Biometric Society.
Uncovering a Latent Multinomial: Analysis of Mark-Recapture Data with Misidentification

USGS Publications Warehouse

Link, W.A.; Yoshizaki, J.; Bailey, L.L.; Pollock, K.H.

2009-01-01

Natural tags based on DNA fingerprints or natural features of animals are now becoming very widely used in wildlife population biology. However, classic capture-recapture models do not allow for misidentification of animals which is a potentially very serious problem with natural tags. Statistical analysis of misidentification processes is extremely difficult using traditional likelihood methods but is easily handled using Bayesian methods. We present a general framework for Bayesian analysis of categorical data arising from a latent multinomial distribution. Although our work is motivated by a specific model for misidentification in closed population capture-recapture analyses, with crucial assumptions which may not always be appropriate, the methods we develop extend naturally to a variety of other models with similar structure. Suppose that observed frequencies f are a known linear transformation f=A'x of a latent multinomial variable x with cell probability vector pi= pi(theta). Given that full conditional distributions [theta | x] can be sampled, implementation of Gibbs sampling requires only that we can sample from the full conditional distribution [x | f, theta], which is made possible by knowledge of the null space of A'. We illustrate the approach using two data sets with individual misidentification, one simulated, the other summarizing recapture data for salamanders based on natural marks.
[Prevalence of Variants in the Apolipoprotein E (APOE) Gene in a General Population of Adults from an Urban Area of Medellin (Antioquia)].

PubMed

Arango Viana, Juan Carlos; Valencia, Ana Victoria; Páez, Ana Lucía; Montoya Gómez, Nilton; Palacio, Carlos; Arbeláez, María Patricia; Bedoya Berrío, Gabriel; García Valencia, Jenny

2014-01-01

To determine the allelic and genotype frequencies of the apolipoprotein E (APOE) gene in a representative sample of the adult population of Medellin in 2010. A representative sample of the adult population of Medellin was obtained by means of a multi-stage, stratified cluster sampling method. APOE genotyping was carried out on each of the participants. The sampling design was taken into consideration for the frequencies and association analysis. The frequencies of the APOE alleles E2, E3 and E4 were 3.9, 92.0 and 4.1%, respectively. The frequencies of the different APOE genotypes were as follows: 2/2, 0.2%; 2/3, 6.8%; 2/4, 0.6%; 3/3, 85.0%; 3/4, 7.2%, and 4/4, 0.3%. The allelic and genotype frequencies of APOE in an adult population of Medellin did not differ substantially from other series reported in South America. These data are important to determine the real impact of APOE on the population risk of several psychiatric diseases. Copyright © 2013 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
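The relationship between the genotype and allele frequencies quoted above can be checked by simple allele counting: each homozygote contributes its full frequency to one allele, and each heterozygote contributes half to each of its two alleles. The sketch below uses the genotype percentages from the abstract; it ignores the survey design weighting the authors applied. The result matches the reported E2 and E3 values and gives about 4.2% for E4 against the reported 4.1%, a gap that is plausibly due to rounding of the published percentages (the genotype figures sum to 100.1%).

```python
from collections import defaultdict

# Genotype frequencies (%) as reported in the abstract.
genotype_pct = {
    ("E2", "E2"): 0.2,
    ("E2", "E3"): 6.8,
    ("E2", "E4"): 0.6,
    ("E3", "E3"): 85.0,
    ("E3", "E4"): 7.2,
    ("E4", "E4"): 0.3,
}

def allele_frequencies(genotypes):
    """Allele frequency (%) by allele counting over diploid genotypes."""
    freq = defaultdict(float)
    for (allele_1, allele_2), pct in genotypes.items():
        # Each genotype carries two alleles, so each gets half the frequency.
        freq[allele_1] += pct / 2
        freq[allele_2] += pct / 2
    return dict(freq)

allele_pct = allele_frequencies(genotype_pct)
```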
Population Size Estimation of Men Who Have Sex with Men in Tbilisi, Georgia; Multiple Methods and Triangulation of Findings.

PubMed

Sulaberidze, Lela; Mirzazadeh, Ali; Chikovani, Ivdity; Shengelia, Natia; Tsereteli, Nino; Gotsadze, George

2016-01-01

An accurate estimation of the population size of men who have sex with men (MSM) is critical to the success of HIV program planning and to monitoring of the response to the epidemic as a whole, but is quite often missing. In this study, our aim was to estimate the population size of MSM in Tbilisi, Georgia and compare it with other estimates in the region. In the absence of a gold standard for estimating the population size of MSM, this study reports a range of methods, including network scale-up, mobile/web apps multiplier, service and unique object multiplier, network-based capture-recapture, Handcock RDS-based and Wisdom of Crowds methods. To apply all these methods, two surveys were conducted: first, a household survey among 1,015 adults from the general population, and second, a respondent-driven sample of 210 MSM. We also conducted a literature review of MSM size estimation in Eastern European and Central Asian countries. The median population size of MSM generated from all previously mentioned methods was estimated to be 5,100 (95% Confidence Interval (CI): 3,243–9,088). This corresponds to 1.42% (95% CI: 0.9%–2.53%) of the adult male population in Tbilisi. Our size estimates of the MSM population fall within ranges reported in other Eastern European and Central Asian countries. These estimates can provide valuable information for country level HIV prevention program planning and evaluation. Furthermore, we believe that our results will narrow the gap in data availability on the estimates of the population size of MSM in the region.
Base-Calling Algorithm with Vocabulary (BCV) Method for Analyzing Population Sequencing Chromatograms

PubMed Central

Fantin, Yuri S.; Neverov, Alexey D.; Favorov, Alexander V.; Alvarez-Figueroa, Maria V.; Braslavskaya, Svetlana I.; Gordukova, Maria A.; Karandashova, Inga V.; Kuleshov, Konstantin V.; Myznikova, Anna I.; Polishchuk, Maya S.; Reshetov, Denis A.; Voiciehovskaya, Yana A.; Mironov, Andrei A.; Chulanov, Vladimir P.

2013-01-01

Sanger sequencing is a common method of reading DNA sequences. It is less expensive than high-throughput methods, and it is appropriate for numerous applications including molecular diagnostics. However, sequencing mixtures of similar DNA of pathogens with this method is challenging. This is important because most clinical samples contain such mixtures, rather than pure single strains. The traditional solution is to sequence selected clones of PCR products, a complicated, time-consuming, and expensive procedure. Here, we propose the base-calling with vocabulary (BCV) method that computationally deciphers Sanger chromatograms obtained from mixed DNA samples. The inputs to the BCV algorithm are a chromatogram and a dictionary of sequences that are similar to those we expect to obtain. We apply the base-calling function on a test dataset of chromatograms without ambiguous positions, as well as one with 3–14% sequence degeneracy. Furthermore, we use BCV to assemble a consensus sequence for an HIV genome fragment in a sample containing a mixture of viral DNA variants and to determine the positions of the indels. Finally, we detect drug-resistant Mycobacterium tuberculosis strains carrying frameshift mutations mixed with wild-type bacteria in the pncA gene, and roughly characterize bacterial communities in clinical samples by direct 16S rRNA sequencing. PMID:23382983
Differential resistance of drinking water bacterial populations to monochloramine disinfection.

PubMed

Chiao, Tzu-Hsin; Clancy, Tara M; Pinto, Ameet; Xi, Chuanwu; Raskin, Lutgarde

2014-04-01

The impact of monochloramine disinfection on the complex bacterial community structure in drinking water systems was investigated using culture-dependent and culture-independent methods. Changes in viable bacterial diversity were monitored using culture-independent methods that distinguish between live and dead cells based on membrane integrity, providing a highly conservative measure of viability. Samples were collected from lab-scale and full-scale drinking water filters exposed to monochloramine for a range of contact times. Culture-independent detection of live cells was based on propidium monoazide (PMA) treatment to selectively remove DNA from membrane-compromised cells. Quantitative PCR (qPCR) and pyrosequencing of 16S rRNA genes were used to quantify the DNA of live bacteria and characterize the bacterial communities, respectively. The inactivation rate determined by the culture-independent PMA-qPCR method (1.5-log removal at 664 mg·min/L) was lower than the inactivation rate measured by the culture-based methods (4-log removal at 66 mg·min/L). Moreover, drastic changes in the live bacterial community structure were detected during monochloramine disinfection using PMA-pyrosequencing, while the community structure appeared to remain stable when pyrosequencing was performed on samples that were not subject to PMA treatment. Genera that increased in relative abundance during monochloramine treatment include Legionella, Escherichia, and Geobacter in the lab-scale system and Mycobacterium, Sphingomonas, and Coxiella in the full-scale system. These results demonstrate that bacterial populations in drinking water exhibit differential resistance to monochloramine, and that the disinfection process selects for resistant bacterial populations.
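The two inactivation figures quoted above can be put on a common scale by dividing log removal by the CT value (disinfectant concentration × contact time). The sketch below assumes simple first-order (Chick-Watson-type) kinetics, in which log removal is proportional to CT; that assumption is ours, not the abstract's, and the function name is hypothetical.

```python
def log_removal_per_ct(log_removal, ct_mg_min_per_l):
    """Log10 removal achieved per unit CT (mg·min/L), assuming proportionality."""
    return log_removal / ct_mg_min_per_l

# Figures quoted in the abstract.
pma_qpcr_rate = log_removal_per_ct(1.5, 664)  # culture-independent PMA-qPCR
culture_rate = log_removal_per_ct(4.0, 66)    # culture-based methods

# How many times faster inactivation appears when measured by culturing.
apparent_ratio = culture_rate / pma_qpcr_rate
```

Under that assumption, culture-based measurement suggests inactivation roughly 27 times faster per unit CT than PMA-qPCR does, which illustrates how strongly the choice of viability assay shapes the apparent disinfection efficacy.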
Nose profile morphology and accuracy study of nose profile estimation method in Scottish subadult and Indonesian adult populations.

PubMed

Sarilita, Erli; Rynn, Christopher; Mossey, Peter A; Black, Sue; Oscandar, Fahmi

2018-05-01

This study investigated nose profile morphology and its relationship to the skull in Scottish subadult and Indonesian adult populations, with the aim of improving the accuracy of forensic craniofacial reconstruction. Samples of 86 lateral head cephalograms from Dundee Dental School (mean age, 11.8 years) and 335 lateral head cephalograms from the Universitas Padjadjaran Dental Hospital, Bandung, Indonesia (mean age 24.2 years), were measured. The method of nose profile estimation based on skull morphology previously proposed by Rynn and colleagues in 2010 (FSMP 6:20-34) was tested in this study. Following this method, three nasal aperture-related craniometrics and six nose profile dimensions were measured from the cephalograms. To assess the accuracy of the method, six nose profile dimensions were estimated from the three craniometric parameters using the published method and then compared to the actual nose profile dimensions. In the Scottish subadult population, no sexual dimorphism was evident in the measured dimensions. In contrast, sexual dimorphism of the Indonesian adult population was evident in all craniometric and nose profile dimensions; notably, males exhibited statistically significantly larger values than females. The published method by Rynn and colleagues (FSMP 6:20-34, 2010) performed better in the Scottish subadult population (mean difference of maximum, 2.35 mm) compared to the Indonesian adult population (mean difference of maximum, 5.42 mm in males and 4.89 mm in females). In addition, regression formulae were derived to estimate nose profile dimensions based on the craniometric measurements for the Indonesian adult population. The published method is not sufficiently accurate for use on the Indonesian population, so the derived method should be used. The accuracy of the published method by Rynn and colleagues (FSMP 6:20-34, 2010) was sufficiently reliable to be applied in the Scottish subadult population.
Robust Identification of Local Adaptation from Allele Frequencies

PubMed Central

Günther, Torsten; Coop, Graham

2013-01-01

Comparing allele frequencies among populations that differ in environment has long been a tool for detecting loci involved in local adaptation. However, such analyses are complicated by an imperfect knowledge of population allele frequencies and neutral correlations of allele frequencies among populations due to shared population history and gene flow. Here we develop a set of methods to robustly test for unusual allele frequency patterns and correlations between environmental variables and allele frequencies while accounting for these complications based on a Bayesian model previously implemented in the software Bayenv. Using this model, we calculate a set of “standardized allele frequencies” that allows investigators to apply tests of their choice to multiple populations while accounting for sampling and covariance due to population history. We illustrate this first by showing that these standardized frequencies can be used to detect nonparametric correlations with environmental variables; these correlations are also less prone to spurious results due to outlier populations. We then demonstrate how these standardized allele frequencies can be used to construct a test to detect SNPs that deviate strongly from neutral population structure. This test is conceptually related to FST and is shown to be more powerful, as we account for population history. We also extend the model to next-generation sequencing of population pools—a cost-efficient way to estimate population allele frequencies, but one that introduces an additional level of sampling noise. The utility of these methods is demonstrated in simulations and by reanalyzing human SNP data from the Human Genome Diversity Panel populations and pooled next-generation sequencing data from Atlantic herring. An implementation of our method is available from http://gcbias.org. PMID:23821598
Impact of Bioreactor Environment and Recovery Method on the Profile of Bacterial Populations from Water Distribution Systems.

PubMed

Luo, Xia; Jellison, Kristen L; Huynh, Kevin; Widmer, Giovanni

2015-01-01

Multiple rotating annular reactors were seeded with biofilms flushed from water distribution systems to assess (1) whether biofilms grown in bioreactors are representative of biofilms flushed from the water distribution system in terms of bacterial composition and diversity, and (2) whether the biofilm sampling method affects the population profile of the attached bacterial community. Biofilms were grown in bioreactors until thickness stabilized (9 to 11 weeks) and harvested from reactor coupons by sonication, stomaching, bead-beating, and manual scraping. High-throughput sequencing of 16S rRNA amplicons was used to profile bacterial populations from flushed biofilms seeded into bioreactors as well as biofilms recovered from bioreactor coupons by different methods. β diversity between flushed and reactor biofilms was compared to β diversity between (i) biofilms harvested from different reactors and (ii) biofilms harvested by different methods from the same reactor. These analyses showed that average diversity between flushed and bioreactor biofilms was double the diversity between biofilms from different reactors operated in parallel. The diversity between bioreactors was larger than the diversity associated with different biofilm recovery methods. Compared to other experimental variables, the method used to recover biofilms had a negligible impact on the outcome of water biofilm analyses based on 16S amplicon sequencing. Results from this study show that biofilms grown in reactors over 9 to 11 weeks are not representative models of the microbial populations flushed from a distribution system. Furthermore, the bacterial population profiles of biofilms grown in replicate reactors from the same flushed water are likely to diverge. However, four common sampling protocols, which differ with respect to disruption of bacterial cells, provide similar information with respect to the 16S rRNA population profile of the biofilm community.
Identifying currents in the gene pool for bacterial populations using an integrative approach.

PubMed

Tang, Jing; Hanage, William P; Fraser, Christophe; Corander, Jukka

2009-08-01

The evolution of bacterial populations has recently become considerably better understood due to large-scale sequencing of population samples. It has become clear that DNA sequences from a multitude of genes, as well as a broad sample coverage of a target population, are needed to obtain a relatively unbiased view of its genetic structure and the patterns of ancestry connected to the strains. However, the traditional statistical methods for evolutionary inference, such as phylogenetic analysis, are associated with several difficulties under such an extensive sampling scenario, in particular when a considerable amount of recombination is anticipated to have taken place. To meet the needs of large-scale analyses of population structure for bacteria, we introduce here several statistical tools for the detection and representation of recombination between populations. Also, we introduce a model-based description of the shape of a population in sequence space, in terms of its molecular variability and affinity towards other populations. Extensive real data from the genus Neisseria are utilized to demonstrate the potential of an approach where these population genetic tools are combined with a phylogenetic analysis. The statistical tools introduced here are freely available in BAPS 5.2 software, which can be downloaded from http://web.abo.fi/fak/mnf/mate/jc/software/baps.html.
Accounting for missing data in the estimation of contemporary genetic effective population size (N(e)).

PubMed

Peel, D; Waples, R S; Macbeth, G M; Do, C; Ovenden, J R

2013-03-01

Theoretical models are often applied to population genetic data sets without fully considering the effect of missing data. Researchers can deal with missing data by removing individuals that have failed to yield genotypes and/or by removing loci that have failed to yield allelic determinations, but despite their best efforts, most data sets still contain some missing data. As a consequence, realized sample size differs among loci, and this poses a problem for unbiased methods that must explicitly account for random sampling error. One commonly used solution for the calculation of contemporary effective population size (N(e)) is to calculate the effective sample size as an unweighted mean or harmonic mean across loci. This is not ideal because it fails to account for the fact that loci with different numbers of alleles have different information content. Here we consider this problem for genetic estimators of contemporary effective population size (N(e)). To evaluate bias and precision of several statistical approaches for dealing with missing data, we simulated populations with known N(e) and various degrees of missing data. Across all scenarios, one method of correcting for missing data (fixed-inverse variance-weighted harmonic mean) consistently performed the best for both single-sample and two-sample (temporal) methods of estimating N(e) and outperformed some methods currently in widespread use. The approach adopted here may be a starting point to adjust other population genetics methods that include per-locus sample size components. © 2012 Blackwell Publishing Ltd.
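The difference between the averaging schemes mentioned in this abstract can be illustrated with a small sketch. The per-locus sample sizes, allele counts, and the weighting rule (alleles minus one) below are hypothetical and chosen only for illustration; they are not the fixed-inverse variance weights evaluated by the authors.

```python
from statistics import harmonic_mean, mean


def weighted_harmonic_mean(sizes, weights):
    """Weighted harmonic mean: sum(w) / sum(w / n)."""
    if len(sizes) != len(weights) or not sizes:
        raise ValueError("sizes and weights must be non-empty and equal in length")
    return sum(weights) / sum(w / n for w, n in zip(weights, sizes))


# Hypothetical realized sample sizes at four loci (missing data lowers some of them).
sizes = [50, 48, 30, 50]
# Hypothetical allele counts per locus; (alleles - 1) is used here only as a
# crude stand-in for "information content", not as the published weighting.
alleles = [2, 5, 8, 3]
weights = [k - 1 for k in alleles]

arithmetic = mean(sizes)                              # unweighted mean
harmonic = harmonic_mean(sizes)                       # unweighted harmonic mean
weighted = weighted_harmonic_mean(sizes, weights)     # locus-weighted harmonic mean

print(f"arithmetic={arithmetic:.2f} harmonic={harmonic:.2f} weighted={weighted:.2f}")
```

In this toy data set the most informative locus also has the most missing data, so the weighted value falls below both unweighted averages.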
Rarity and Incomplete Sampling in DNA-Based Species Delimitation.

PubMed

Ahrens, Dirk; Fujisawa, Tomochika; Krammer, Hans-Joachim; Eberle, Jonas; Fabrizi, Silvia; Vogler, Alfried P

2016-05-01

DNA-based species delimitation may be compromised by limited sampling effort and species rarity, including "singleton" representatives of species, which hampers estimates of intra- versus interspecies evolutionary processes. In a case study of southern African chafers (beetles in the family Scarabaeidae), many species and subclades were poorly represented and 48.5% of species were singletons. Using cox1 sequences from >500 specimens and ∼100 species, the Generalized Mixed Yule Coalescent (GMYC) analysis as well as various other approaches for DNA-based species delimitation (Automatic Barcode Gap Discovery (ABGD), Poisson tree processes (PTP), Species Identifier, Statistical Parsimony), frequently produced poor results if analyzing a narrow target group only, but the performance improved when several subclades were combined. Hence, low sampling may be compensated for by "clade addition" of lineages outside of the focal group. Similar findings were obtained in reanalysis of published data sets of taxonomically poorly known species assemblages of insects from Madagascar. The low performance of undersampled trees is not due to high proportions of singletons per se, as shown in simulations (with 13%, 40% and 52% singletons). However, the GMYC method was highly sensitive to variable effective population size (Ne), which was exacerbated by variable species abundances in the simulations. Hence, low sampling success and rarity of species affect the power of the GMYC method only if they reflect great differences in Ne among species. Potential negative effects of skewed species abundances and prevalence of singletons are ultimately an issue about the variation in Ne and the degree to which this is correlated with the census population size and sampling success. Clade addition beyond a limited study group can overcome poor sampling for the GMYC method, in particular under variable Ne. This effect was less pronounced for methods of species delimitation not based on coalescent models. © The Author(s) 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
Evaluating diagnosis-based risk-adjustment methods in a population with spinal cord dysfunction.

PubMed

Warner, Grace; Hoenig, Helen; Montez, Maria; Wang, Fei; Rosen, Amy

2004-02-01

To examine performance of models in predicting health care utilization for individuals with spinal cord dysfunction. Regression models compared 2 diagnosis-based risk-adjustment methods, the adjusted clinical groups (ACGs) and diagnostic cost groups (DCGs). To improve prediction, we added to our model: (1) spinal cord dysfunction-specific diagnostic information, (2) limitations in self-care function, and (3) both 1 and 2. Models were replicated in 3 populations. Samples from 3 populations: (1) 40% of veterans using Veterans Health Administration services in fiscal year 1997 (FY97) (N=1,046,803), (2) veteran sample with spinal cord dysfunction identified by codes from the International Statistical Classification of Diseases, 9th Revision, Clinical Modifications (N=7666), and (3) veteran sample identified in Veterans Affairs Spinal Cord Dysfunction Registry (N=5888). Not applicable. Inpatient, outpatient, and total days of care in FY97. The DCG models (R(2) range, .22-.38) performed better than ACG models (R(2) range, .04-.34) for all outcomes. Spinal cord dysfunction-specific diagnostic information improved prediction more in the ACG model than in the DCG model (R(2) range for ACG, .14-.34; R(2) range for DCG, .24-.38). Information on self-care function slightly improved performance (R(2) range increased from 0 to .04). The DCG risk-adjustment models predicted health care utilization better than ACG models. ACG model prediction was improved by adding information.
Distinguishing Heterodera filipjevi and H. avenae using polymerase chain reaction-restriction fragment length polymorphism and cyst morphology.

PubMed

Yan, Guiping; Smiley, Richard W

2010-03-01

The cereal cyst nematodes Heterodera filipjevi and H. avenae impede wheat production in the Pacific Northwest (PNW). Accurate identification of cyst nematode species and awareness of high population density in affected fields are essential for designing effective control measures. Morphological methods for differentiating these species are laborious. These species were differentiated using polymerase chain reaction restriction fragment length polymorphism (PCR-RFLP) of internal transcribed spacer (ITS)-ribosomal (r)DNA with up to six restriction endonucleases (TaqI, HinfI, PstI, HaeIII, RsaI, and AluI). The method was validated by inspecting underbridge structures of cyst vulval cones. Grid soil sampling of an Oregon field infested by both species revealed that H. filipjevi was present at most of the infested grid sites but mixtures of H. avenae and H. filipjevi also occurred. These procedures also detected and differentiated H. filipjevi and H. avenae in soil samples from nearby fields in Oregon and H. avenae in samples from Idaho and Washington. Intraspecific polymorphism was not observed within H. filipjevi or PNW H. avenae populations based on the ITS-rDNA. However, intraspecific variation was observed between H. avenae populations occurring in the PNW and France. Methods described here will improve detection and identification efficiencies for cereal cyst nematodes in wheat fields.
Improvement of Predictive Ability by Uniform Coverage of the Target Genetic Space

PubMed Central

Bustos-Korts, Daniela; Malosetti, Marcos; Chapman, Scott; Biddulph, Ben; van Eeuwijk, Fred

2016-01-01

Genome-enabled prediction provides breeders with the means to increase the number of genotypes that can be evaluated for selection. One of the major challenges in genome-enabled prediction is how to construct a training set of genotypes from a calibration set that represents the target population of genotypes, where the calibration set is composed of a training and validation set. A random sampling protocol of genotypes from the calibration set will lead to low quality coverage of the total genetic space by the training set when the calibration set contains population structure. As a consequence, predictive ability will be affected negatively, because some parts of the genotypic diversity in the target population will be under-represented in the training set, whereas other parts will be over-represented. Therefore, we propose a training set construction method that uniformly samples the genetic space spanned by the target population of genotypes, thereby increasing predictive ability. To evaluate our method, we constructed training sets and identified the corresponding genomic prediction models for four genotype panels that differed in the amount of population structure they contained (maize Flint, maize Dent, wheat, and rice). Training sets were constructed using uniform sampling, stratified-uniform sampling, stratified sampling and random sampling. We compared these methods with a method that maximizes the generalized coefficient of determination (CD). Several training set sizes were considered. We investigated four genomic prediction models: multi-locus QTL models, GBLUP models, combinations of QTL and GBLUPs, and Reproducing Kernel Hilbert Space (RKHS) models. For the maize and wheat panels, construction of the training set under uniform sampling led to a larger predictive ability than under stratified and random sampling. The results of our methods were similar to those of the CD method. For the rice panel, all training set construction methods led to similar predictive ability, a reflection of the very strong population structure in this panel. PMID:27672112

Estimating population genetic parameters and comparing model goodness-of-fit using DNA sequences with error

PubMed Central

Liu, Xiaoming; Fu, Yun-Xin; Maxwell, Taylor J.; Boerwinkle, Eric

2010-01-01

It is known that sequencing error can bias estimation of evolutionary or population genetic parameters. This problem is more prominent in deep resequencing studies because of their large sample size n, and a higher probability of error at each nucleotide site. We propose a new method based on the composite likelihood of the observed SNP configurations to infer population mutation rate θ = 4Neμ, population exponential growth rate R, and error rate ɛ, simultaneously. Using simulation, we show the combined effects of the parameters, θ, n, ɛ, and R on the accuracy of parameter estimation. We compared our maximum composite likelihood estimator (MCLE) of θ with other θ estimators that take into account the error. The results show the MCLE performs well when the sample size is large or the error rate is high. Using parametric bootstrap, composite likelihood can also be used as a statistic for testing the model goodness-of-fit of the observed DNA sequences. The MCLE method is applied to sequence data on the ANGPTL4 gene in 1832 African American and 1045 European American individuals. PMID:19952140
People with dementia in nursing home research: a methodological review of the definition and identification of the study population.

PubMed

Palm, Rebecca; Jünger, Saskia; Reuther, Sven; Schwab, Christian G G; Dichter, Martin N; Holle, Bernhard; Halek, Margareta

2016-04-05

There are various definitions and diagnostic criteria for dementia, leading to discrepancies in case ascertainment in both clinical practice and research. We reviewed the different definitions, approaches and measurements used to operationalize dementia in health care studies in German nursing homes with the aim of discussing the implications of different approaches. We conducted a systematic search of the MEDLINE and CINAHL databases to identify pre-2016 studies conducted in German nursing homes that focused on residents with dementia or cognitive impairment. Inclusion or exclusion of studies was agreed upon by all authors; data extraction was independently carried out by 2 authors (RP, SJ). The studies' sampling methods were compared with respect to their inclusion criteria, assessment tools and methods used to identify the study population. We summarized case ascertainment methods from 64 studies. Study participants were identified based on a diagnosis that was evaluated during the study, or a recorded medical dementia diagnosis, or a recorded medical diagnosis either with additional cognitive screenings or using screening tests exclusively. The descriptions of the diagnostics that were applied to assess a diagnosis of dementia were not fully transparent in most of the studies with respect to either a clear reference definition of dementia or applied diagnostic criteria. If reported, various neuropsychological tests were used, mostly without a clear rationale for their selection. Pragmatic considerations often determine the sampling strategy; they also may explain the variances we detected in the different studies. Variations in sampling methods impede the comparability of study results. There is a need to agree on case ascertainment strategies in dementia studies in health service research in nursing homes. These strategies should consider resource constraints and ethical issues that are related to the vulnerable population of nursing home residents. Additionally, reporting about dementia studies in nursing homes needs to be improved. If a diagnosis cannot be evaluated based on either ICD or DSM criteria, the study population should not be reported as having dementia. If a diagnosis is evaluated based on ICD or DSM criteria within the study, there is a need for more transparency of the diagnostic process.
cpDNA microsatellite markers for Lemna minor (Araceae): Phylogeographic implications

PubMed Central

Wani, Gowher A.; Shah, Manzoor A.; Reshi, Zafar A.; Atangana, Alain R.; Khasa, Damase P.

2014-01-01

• Premise of the study: A lack of genetic markers impedes our understanding of the population biology of Lemna minor. Thus, the development of appropriate genetic markers for L. minor promises to be highly useful for population genetic studies and for addressing other life history questions regarding the species. • Methods and Results: For the first time, we characterized nine polymorphic and 24 monomorphic chloroplast microsatellite markers in L. minor using DNA samples of 26 individuals sampled from five populations in Kashmir and of 17 individuals from three populations in Quebec. Initially, we designed 33 primer pairs, which were tested on genomic DNA from natural populations. Nine loci provided markers with two alleles. Based on genotyping of the chloroplast DNA fragments from 43 sampled individuals, we identified one haplotype in Quebec and 11 haplotypes in Kashmir, of which one occurs in 56% of the genotypes, one in 8%, and nine in 4%, respectively. There was a maximum of two alleles per locus. • Conclusions: These new chloroplast microsatellite markers for L. minor and haplotype distribution patterns indicate a complex phylogeographic history that merits further investigation. PMID:25202636
Molecular – genetic variance of RH blood group system within human population of Bosnia and Herzegovina

PubMed Central

Lasić, Lejla; Lojo-Kadrić, Naida; Silajdžić, Elma; Pojskić, Lejla; Hadžiselimović, Rifat; Pojskić, Naris

2013-01-01

There are two major theories for the inheritance of the Rh blood group system: the Fisher-Race theory and the Wiener theory. The aim of this study was to identify the frequency of RHDCE alleles in the Bosnian-Herzegovinian population and to introduce this method for Rh phenotype screening in B&H, since this type of analysis had not previously been used for blood typing in B&H. Rh blood group was typed by polymerase chain reaction, using protocols and primers previously established by other authors, followed by electrophoresis in 2-3% agarose gel. The percentage of Rh-positive individuals in our sample was 84.48%, and the percentage of Rh-negative individuals was 15.52%. The inter-rater agreement statistic showed perfect agreement (K=1) between the results of Rh blood system detection based on serological and molecular-genetic methods. In conclusion, molecular-genetic methods are suitable for prenatal genotyping and specific cases, while the standard serological method is suitable for high-throughput sample processing. PMID:23448604
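The agreement statistic K reported above is presumably Cohen's kappa (an assumption; the abstract does not name it). A minimal sketch of how such a statistic could be computed for two typing methods follows. The function name and the toy typing results are hypothetical, not data from the study.

```python
from collections import Counter


def cohens_kappa(method_a, method_b):
    """Cohen's kappa for two equal-length lists of categorical calls."""
    if len(method_a) != len(method_b) or not method_a:
        raise ValueError("inputs must be non-empty and equal in length")
    n = len(method_a)
    # Observed agreement: share of samples where both methods give the same call.
    observed = sum(a == b for a, b in zip(method_a, method_b)) / n
    # Chance agreement: product of marginal frequencies, summed over categories.
    counts_a, counts_b = Counter(method_a), Counter(method_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / n**2
    if expected == 1.0:
        raise ValueError("kappa is undefined when only one category occurs")
    return (observed - expected) / (1.0 - expected)


# Hypothetical calls: serological and molecular typing agree on every sample.
serological = ["pos", "pos", "pos", "pos", "pos", "neg"]
molecular = ["pos", "pos", "pos", "pos", "pos", "neg"]
kappa_perfect = cohens_kappa(serological, molecular)

# Hypothetical calls where agreement is no better than chance.
kappa_chance = cohens_kappa(["pos", "pos", "neg", "neg"],
                            ["pos", "neg", "pos", "neg"])

print(kappa_perfect, kappa_chance)
```

A kappa of 1 means the two methods never disagree, while a value near 0 means agreement is what chance alone would produce.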
Use of experience sampling method to understand the wilderness experience

Treesearch

Lynn Anderson

2002-01-01

There is a growing body of research documenting the benefits of outdoor adventure and wilderness-based programs with a variety of special populations. Criticisms of this body of research are that it is not grounded in theory and it is outcome-based, with no investigation into the processes causing the behavior change in individuals. This study attempted to investigate...
Childhood Predictors of Male Criminality: A Prospective Population-Based Follow-up Study from Age 8 to Late Adolescence

ERIC Educational Resources Information Center

Sourander, Andre; Elonheimo, Henrik; Niemela, Solja; Nuutila, Ari-Matti; Helenius, Hans; Sillanmaki, Lauri; Piha, Jorma; Tamminen, Tuula; Kumpulainen, Kirsti; Moilanen, Irma; Almqvist, Frederik

2006-01-01

Objective: To study childhood predictors for late adolescence criminality. Method: The follow-up sample included 2,713 Finnish boys born in 1981. Information about the 8-year-old boys' problem behavior was obtained from parents, teachers, and the children themselves. The follow-up information about criminal offenses was based on the national…
The Role of Perceived Control in Explaining Depressive Symptoms Associated with Driving Cessation in a Longitudinal Study

ERIC Educational Resources Information Center

Windsor, Timothy D.; Anstey, Kaarin J.; Butterworth, Peter; Luszcz, Mary A.; Andrews, Gary R.

2007-01-01

Purpose: The purpose of this article was to investigate the role of control beliefs in mediating the relationship between driving cessation and change in depressive symptoms in a population-based sample of older adults. Design and Methods: We report results from a prospective, community-based cohort study that included two waves of data collected…
The Contribution of Early Language Development to Children's Emotional and Behavioural Functioning at 6 Years: An Analysis of Data from the Children in Focus Sample from the ALSPAC Birth Cohort

ERIC Educational Resources Information Center

Clegg, Judy; Law, James; Rush, Robert; Peters, Tim J.; Roulstone, Susan

2015-01-01

Background: An association between children's early language development and their emotional and behavioural functioning is reported in the literature. The nature of the association remains unclear and it has not been established if such an association is found in a population-based cohort in addition to clinical populations. Methods: This study…
Elucidation of Seventeen Human Peripheral Blood B cell Subsets and Quantification of the Tetanus Response Using a Density-Based Method for the Automated Identification of Cell Populations in Multidimensional Flow Cytometry Data

PubMed Central

Qian, Yu; Wei, Chungwen; Lee, F. Eun-Hyung; Campbell, John; Halliley, Jessica; Lee, Jamie A.; Cai, Jennifer; Kong, Megan; Sadat, Eva; Thomson, Elizabeth; Dunn, Patrick; Seegmiller, Adam C.; Karandikar, Nitin J.; Tipton, Chris; Mosmann, Tim; Sanz, Iñaki; Scheuermann, Richard H.

2011-01-01

Background Advances in multi-parameter flow cytometry (FCM) now allow for the independent detection of larger numbers of fluorochromes on individual cells, generating data with increasingly higher dimensionality. The increased complexity of these data has made it difficult to identify cell populations from high-dimensional FCM data using traditional manual gating strategies based on single-color or two-color displays. Methods To address this challenge, we developed a novel program, FLOCK (FLOw Clustering without K), that uses a density-based clustering approach to algorithmically identify biologically relevant cell populations from multiple samples in an unbiased fashion, thereby eliminating operator-dependent variability. Results FLOCK was used to objectively identify seventeen distinct B cell subsets in a human peripheral blood sample and to identify and quantify novel plasmablast subsets responding transiently to tetanus and other vaccinations in peripheral blood. FLOCK has been implemented in the publicly available Immunology Database and Analysis Portal – ImmPort (http://www.immport.org) for open use by the immunology research community. Conclusions FLOCK is able to identify cell subsets in experiments that use multi-parameter flow cytometry through an objective, automated computational approach. The use of algorithms like FLOCK for FCM data analysis obviates the need for subjective and labor intensive manual gating to identify and quantify cell subsets. Novel populations identified by these computational approaches can serve as hypotheses for further experimental study. PMID:20839340
Review of sampling hard-to-reach and hidden populations for HIV surveillance.

PubMed

Magnani, Robert; Sabin, Keith; Saidel, Tobi; Heckathorn, Douglas

2005-05-01

Adequate surveillance of hard-to-reach and 'hidden' subpopulations is crucial to containing the HIV epidemic in low prevalence settings and in slowing the rate of transmission in high prevalence settings. For a variety of reasons, however, conventional facility and survey-based surveillance data collection strategies are ineffective for a number of key subpopulations, particularly those whose behaviors are illegal or illicit. This paper critically reviews alternative sampling strategies for undertaking behavioral or biological surveillance surveys of such groups. Non-probability sampling approaches such as facility-based sentinel surveillance and snowball sampling are the simplest to carry out, but are subject to a high risk of sampling/selection bias. Most of the probability sampling methods considered are limited in that they are adequate only under certain circumstances and for some groups. One relatively new method, respondent-driven sampling, an adaptation of chain-referral sampling, appears to be the most promising for general applications. However, as its applicability to HIV surveillance in resource-poor settings has yet to be established, further field trials are needed before a firm conclusion can be reached.
HacDivSel: Two new methods (haplotype-based and outlier-based) for the detection of divergent selection in pairs of populations

PubMed Central

2017-01-01

The detection of genomic regions involved in local adaptation is an important topic in current population genetics. There are several detection strategies available depending on the kind of genetic and demographic information at hand. A common drawback is the high risk of false positives. In this study we introduce two complementary methods for the detection of divergent selection from populations connected by migration. Both methods have been developed with the aim of being robust to false positives. The first method combines haplotype information with inter-population differentiation (FST). Evidence of divergent selection is concluded only when both the haplotype pattern and the FST value support it. The second method is developed for independently segregating markers i.e. there is no haplotype information. In this case, the power to detect selection is attained by developing a new outlier test based on detecting a bimodal distribution. The test computes the FST outliers and then assumes that those of interest would have a different mode. We demonstrate the utility of the two methods through simulations and the analysis of real data. The simulation results showed power ranging from 60–95% in several of the scenarios whilst the false positive rate was controlled below the nominal level. The analysis of real samples consisted of phased data from the HapMap project and unphased data from intertidal marine snail ecotypes. The results illustrate that the proposed methods could be useful for detecting locally adapted polymorphisms. The software HacDivSel implements the methods explained in this manuscript. PMID:28423003
Efficacy of "Dimodent" sex predictive equation assessed in an Indian population.

PubMed

Bharti, A; Angadi, P V; Kale, A D; Hallikerimath, S R

2011-07-01

Teeth are considered a useful adjunct for sex assessment and may play an important role in constructing a post-mortem profile. The Dimodent method is based on the high degree of sex discrimination obtained with the mandibular canine and the high correlation coefficients between mandibular canine and lateral incisor mesiodistal (MD) and buccolingual (BL) dimensions. This has been evaluated in the French and Lebanese, but no study exists on its efficacy in Indians. Here, we have applied the 'Dimodent' equation on an Indian sample (100 males, 100 females; age range of 19-27 years). Additionally, a population-specific Dimodent equation was derived using logistic regression analysis and applied to our sample. Also, the sex determination potential of MD and BL measurements of mandibular lateral incisors and canines, individually, was assessed. We found a poor sex assessment accuracy using the Dimodent equation of Fronty (34.5%) in our Indian sample, but the population-specific Dimodent equation gave a better accuracy (72%). Thus, it appears that sexual dimorphism in teeth is population-specific; consequently the Dimodent equation has to be derived individually in different populations for use in sex assessment. The mesiodistal measurement of the mandibular canine alone gave a marginally higher accuracy (72.5%); therefore, we suggest the use of mandibular canines alone rather than the Dimodent method.
Heritability of Autism Spectrum Disorder in a UK Population-Based Twin Sample

PubMed Central

Colvert, Emma; Tick, Beata; McEwen, Fiona; Stewart, Catherine; Curran, Sarah R.; Woodhouse, Emma; Gillan, Nicola; Hallett, Victoria; Lietz, Stephanie; Garnett, Tracy; Ronald, Angelica; Plomin, Robert; Rijsdijk, Frühling; Happé, Francesca; Bolton, Patrick

2016-01-01

IMPORTANCE Most evidence to date highlights the importance of genetic influences on the liability to autism and related traits. However, most of these findings are derived from clinically ascertained samples, possibly missing individuals with subtler manifestations, and obtained estimates may not be representative of the population. OBJECTIVES To establish the relative contributions of genetic and environmental factors in liability to autism spectrum disorder (ASD) and a broader autism phenotype in a large population-based twin sample and to ascertain the genetic/environmental relationship between dimensional trait measures and categorical diagnostic constructs of ASD. DESIGN, SETTING, AND PARTICIPANTS We used data from the population-based cohort Twins Early Development Study, which included all twin pairs born in England and Wales from January 1, 1994, through December 31, 1996. We performed joint continuous-ordinal liability threshold model fitting using the full information maximum likelihood method to estimate genetic and environmental parameters of covariance. Twin pairs underwent the following assessments: the Childhood Autism Spectrum Test (CAST) (6423 pairs; mean age, 7.9 years), the Development and Well-being Assessment (DAWBA) (359 pairs; mean age, 10.3 years), the Autism Diagnostic Observation Schedule (ADOS) (203 pairs; mean age, 13.2 years), the Autism Diagnostic Interview–Revised (ADI-R) (205 pairs; mean age, 13.2 years), and a best-estimate diagnosis (207 pairs). MAIN OUTCOMES AND MEASURES Participants underwent screening using a population-based measure of autistic traits (CAST assessment), structured diagnostic assessments (DAWBA, ADI-R, and ADOS), and a best-estimate diagnosis. RESULTS On all ASD measures, correlations among monozygotic twins (range, 0.77-0.99) were significantly higher than those for dizygotic twins (range, 0.22-0.65), giving heritability estimates of 56% to 95%. 
The covariance of CAST and ASD diagnostic status (DAWBA, ADOS and best-estimate diagnosis) was largely explained by additive genetic factors (76%-95%). For the ADI-R only, shared environmental influences were significant (30% [95% CI, 8%-47%]) but smaller than genetic influences (56% [95% CI, 37%-82%]). CONCLUSIONS AND RELEVANCE The liability to ASD and a more broadly defined high-level autism trait phenotype in this large population-based twin sample derives primarily from additive genetic and, to a lesser extent, nonshared environmental effects. The largely consistent results across different diagnostic tools suggest that the results are generalizable across multiple measures and assessment methods. Genetic factors underpinning individual differences in autismlike traits show considerable overlap with genetic influences on diagnosed ASD. PMID:25738232
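As a rough illustration of why a higher monozygotic than dizygotic twin correlation points to genetic influence, the classical Falconer approximation can be sketched as below. This is not the liability threshold model fitting used in the study; it is a simpler back-of-the-envelope decomposition, and the correlations in the example are hypothetical values, not results from the paper.

```python
def falconer_ace(r_mz, r_dz):
    """Falconer-style estimates from twin correlations.

    a2: additive genetic variance share        = 2 * (r_mz - r_dz)
    c2: shared environment variance share      = 2 * r_dz - r_mz
    e2: nonshared environment variance share   = 1 - r_mz
    """
    a2 = 2.0 * (r_mz - r_dz)
    c2 = 2.0 * r_dz - r_mz
    e2 = 1.0 - r_mz
    return a2, c2, e2


# Hypothetical correlations chosen only to show the arithmetic.
a2, c2, e2 = falconer_ace(r_mz=0.80, r_dz=0.45)
print(f"a2={a2:.2f} c2={c2:.2f} e2={e2:.2f}")
```

The three shares sum to 1 by construction. With some correlation pairs the formula returns values outside the 0 to 1 range, which is one reason model-based fitting is generally preferred.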
Differences in Movement Pattern and Detectability between Males and Females Influence How Common Sampling Methods Estimate Sex Ratio.

PubMed

Rodrigues, João Fabrício Mota; Coelho, Marco Túlio Pacheco

2016-01-01

Sampling biodiversity is an essential step for conservation, and understanding the efficiency of sampling methods allows us to estimate the quality of our biodiversity data. Sex ratio is an important population characteristic, but until now, no study has evaluated how efficient the sampling methods commonly used in biodiversity surveys are at estimating the sex ratio of populations. We used a virtual ecologist approach to investigate whether active and passive capture methods are able to accurately sample a population's sex ratio and whether differences in movement pattern and detectability between males and females produce biased estimates of sex ratios when using these methods. Our simulation allowed the recognition of individuals, similar to mark-recapture studies. We found that differences in both movement patterns and detectability between males and females produce biased estimates of sex ratios. However, increasing the sampling effort or the number of sampling days improves the ability of passive or active capture methods to properly sample sex ratio. Thus, prior knowledge regarding movement patterns and detectability for species is important information to guide field studies aiming to understand sex ratio related patterns.
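The two findings in this abstract (biased sex ratios under unequal detectability, and reduced bias with more sampling days) can be illustrated with a much simpler analytic sketch than the authors' virtual ecologist simulation. It assumes that individuals are recognizable, that each individual is detected independently on each day with a fixed sex-specific probability, and that movement is ignored. All numbers are hypothetical.

```python
def detected_fraction(daily_detectability, days):
    """Probability that an individual is detected at least once in `days` days."""
    return 1.0 - (1.0 - daily_detectability) ** days


def expected_male_proportion(n_males, n_females, p_male, p_female, days):
    """Expected share of males among distinct individuals detected."""
    males_seen = n_males * detected_fraction(p_male, days)
    females_seen = n_females * detected_fraction(p_female, days)
    return males_seen / (males_seen + females_seen)


# Hypothetical population with a true 1:1 sex ratio, where males are twice as
# detectable as females on any given day.
one_day = expected_male_proportion(100, 100, p_male=0.6, p_female=0.3, days=1)
ten_days = expected_male_proportion(100, 100, p_male=0.6, p_female=0.3, days=10)

print(f"male share after 1 day: {one_day:.3f}, after 10 days: {ten_days:.3f}")
```

Under these assumptions a single day overstates the male share, and the estimate moves toward the true value of 0.5 as sampling days accumulate.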
Double sampling to estimate density and population trends in birds

USGS Publications Warehouse

Bart, Jonathan; Earnst, Susan L.

2002-01-01

We present a method for estimating density of nesting birds based on double sampling. The approach involves surveying a large sample of plots using a rapid method such as uncorrected point counts, variable circular plot counts, or the recently suggested double-observer method. A subsample of those plots is also surveyed using intensive methods to determine actual density. The ratio of the mean count on those plots (using the rapid method) to the mean actual density (as determined by the intensive searches) is used to adjust results from the rapid method. The approach works well when results from the rapid method are highly correlated with actual density. We illustrate the method with three years of shorebird surveys from the tundra in northern Alaska. In the rapid method, surveyors covered ~10 ha h-1 and surveyed each plot a single time. The intensive surveys involved three thorough searches, required ~3 h ha-1, and took 20% of the study effort. Surveyors using the rapid method detected an average of 79% of birds present. That detection ratio was used to convert the index obtained in the rapid method into an essentially unbiased estimate of density. Trends estimated from several years of data would also be essentially unbiased. Other advantages of double sampling are that (1) the rapid method can be changed as new methods become available, (2) domains can be compared even if detection rates differ, (3) total population size can be estimated, and (4) valuable ancillary information (e.g. nest success) can be obtained on intensive plots with little additional effort. We suggest that double sampling be used to test the assumption that rapid methods, such as variable circular plot and double-observer methods, yield density estimates that are essentially unbiased. The feasibility of implementing double sampling in a range of habitats needs to be evaluated.
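The ratio adjustment described in this abstract can be sketched as follows. This is an illustration only, not the authors' code: the function name and all plot counts are hypothetical, chosen so that the detection ratio comes out near the 79% reported above.

```python
from statistics import mean

def double_sampling_density(rapid_all, rapid_sub, intensive_sub):
    """Adjust rapid-method counts by the detection ratio from a subsample.

    rapid_all:     rapid counts on every plot in the large sample
    rapid_sub:     rapid counts on the intensively surveyed subsample
    intensive_sub: actual numbers found on those same subsample plots
    """
    if len(rapid_sub) != len(intensive_sub):
        raise ValueError("subsample lists must describe the same plots")
    # Detection ratio: mean rapid count / mean actual density on the subsample.
    detection_ratio = mean(rapid_sub) / mean(intensive_sub)
    adjusted_density = mean(rapid_all) / detection_ratio
    return adjusted_density, detection_ratio

# Hypothetical counts of birds per plot.
rapid_all = [4, 6, 5, 3, 7, 5, 4, 6]
rapid_sub = [4, 6, 5]
intensive_sub = [5, 8, 6]
density, ratio = double_sampling_density(rapid_all, rapid_sub, intensive_sub)
print(f"detection ratio {ratio:.3f}, adjusted density {density:.2f} birds per plot")
```

Here the mean rapid count of 5 birds per plot is divided by a detection ratio of about 0.79, giving an adjusted density of about 6.33 birds per plot.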
Chamomile Consumption and Mortality: A Prospective Study of Mexican Origin Older Adults

PubMed Central

Howrey, Bret T.; Peek, M. Kristen; McKee, Juliet M.; Raji, Mukaila A.; Ottenbacher, Kenneth J.; Markides, Kyriakos S.

2016-01-01

Purpose: Approximately 20% of adults use some kind of herbal product; however, little data exists from population-based studies or clinical trials to support the effectiveness of most herbal products. Chamomile is a commonly used herb among older adults of Mexican origin. We examined the effects of herbal chamomile consumption on mortality among older adults of Mexican origin. Methods and Design. We used a sample from the Hispanic Established Populations for Epidemiologic Study of the Elderly, a population-based study of noninstitutionalized Mexican Americans aged 65 and older from five Southwestern states (Texas, California, New Mexico, Colorado, and Arizona). We included all men and women from 2000 to 2007 (n = 1,677). Results. Chamomile was used by 14% of the sample. Cox proportional hazards regression analyses showed that chamomile was associated with a decreased risk of mortality in the total sample (hazard ratio [HR] 0.71, 95% confidence interval [CI] 0.55–0.92) and for women (HR 0.67, 95% CI 0.49–0.92) but not for men. In models adjusted for sociodemographic variables, health behaviors, and chronic conditions, chamomile remained significantly associated with reduced mortality in women (HR 0.72, 95% CI 0.53–0.98). Implications. The use of chamomile shows protective effects against mortality among women in this sample of older adults of Mexican origin. Further research is warranted in other populations to determine if these effects are consistent. PMID:26035879
Parkinson's disease and occupation: differences in associations by case identification method suggest referral bias.

PubMed

Teschke, Kay; Marion, Stephen A; Tsui, Joseph K C; Shen, Hui; Rugbjerg, Kathrine; Harris, M Anne

2014-02-01

We used a population-based sample of 403 Parkinson's disease cases and 405 controls to examine risks by occupation. Results were compared to a previous clinic-based analysis. With censoring of jobs held within 10 years of diagnosis, the following had significantly or strongly increased risks: social science, law and library jobs (OR = 1.8); farming and horticulture jobs (OR = 2.0); gas station jobs (OR = 2.6); and welders (OR = 3.0). The following had significantly decreased risks: management and administration jobs (OR = 0.70); and other health care jobs (OR = 0.44). These results were consistent with other findings for social science and farming occupations. Risks for teaching, medicine and health occupations were not elevated, unlike our previous clinic-based study. This underscores the value of population-based over clinic-based samples. Occupational studies may be particularly susceptible to referral bias because social networks may spread preferentially via jobs. © 2013 Wiley Periodicals, Inc.
[The research protocol III. Study population].

PubMed

Arias-Gómez, Jesús; Villasís-Keever, Miguel Ángel; Miranda-Novales, María Guadalupe

2016-01-01

The study population is defined as a set of cases, determined, limited, and accessible, that will constitute the subjects for the selection of the sample, and must fulfill several characteristics and distinct criteria. The objectives of this manuscript are focused on specifying each one of the elements required to make the selection of the participants of a research project, during the elaboration of the protocol, including the concepts of study population, sample, selection criteria and sampling methods. After delineating the study population, the researcher must specify the criteria that each participant has to comply with. The criteria that include the specific characteristics are denominated selection or eligibility criteria. These criteria are inclusion, exclusion and elimination, and will delineate the eligible population. The sampling methods are divided into two large groups: 1) probabilistic or random sampling and 2) non-probabilistic sampling. The difference lies in the employment of statistical methods to select the subjects. In every research project, it is necessary to establish at the beginning the specific number of participants to be included to achieve the objectives of the study. This number is the sample size, and can be calculated or estimated with mathematical formulas and statistical software.
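As one example of the kind of formula the abstract alludes to, the sketch below computes the sample size needed to estimate a proportion, n = z^2 p(1 - p) / d^2, with an optional finite population correction. The abstract does not name a specific formula, so this choice, the function name and the example inputs are assumptions made for illustration.

```python
from math import ceil
from statistics import NormalDist

def sample_size_for_proportion(p, margin, confidence=0.95, population=None):
    """Sample size to estimate a proportion p to within +/- margin.

    Uses n0 = z^2 * p * (1 - p) / margin^2 and, if a finite population size
    is given, the correction n = n0 / (1 + (n0 - 1) / population).
    """
    # Two-sided critical value of the standard normal distribution.
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    n0 = z * z * p * (1 - p) / (margin * margin)
    if population is not None:
        n0 = n0 / (1 + (n0 - 1) / population)
    return ceil(n0)

# Worst-case proportion (p = 0.5), 5 percentage point margin, 95% confidence.
print(sample_size_for_proportion(0.5, 0.05))                   # 385
print(sample_size_for_proportion(0.5, 0.05, population=2000))  # 323
```

Other study designs (comparisons of means, case-control studies, cluster samples) need different formulas; this one applies only to a single proportion under simple random sampling.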
Community duplicate diet methodology: A new tool for estimating dietary exposure to pesticides

EPA Science Inventory

An observational field study was conducted to assess the feasibility of a community duplicate diet collection method; a dietary monitoring procedure that is population-based. The purpose was to establish an alternative procedure to duplicate diet sampling that would be more effi...
Recruiting and retaining youth and young adults: challenges and opportunities in survey research for tobacco control.

PubMed

Cantrell, Jennifer; Hair, Elizabeth C; Smith, Alexandria; Bennett, Morgane; Rath, Jessica Miller; Thomas, Randall K; Fahimi, Mansour; Dennis, J Michael; Vallone, Donna

2018-03-01

Evaluation studies of population-based tobacco control interventions often rely on large-scale survey data from numerous respondents across many geographic areas to provide evidence of their effectiveness. Significant challenges for survey research have emerged with the evolving communications landscape, particularly for surveying hard-to-reach populations such as youth and young adults. This study combines the comprehensive coverage of an address-based sampling (ABS) frame with the timeliness of online data collection to develop a nationally representative longitudinal cohort of young people aged 15-21. We constructed an ABS frame, partially supplemented with auxiliary data, to recruit this hard-to-reach sample. Branded and tested mail-based recruitment materials were designed to bring respondents online for screening, consent and surveying. Once enrolled, respondents completed online surveys every 6 months via computer, tablet or smartphone. Numerous strategies were utilized to enhance retention and representativeness. Results detail sample performance, representativeness and retention rates as well as device utilization trends for survey completion among youth and young adult respondents. Panel development efforts resulted in a large, nationally representative sample with high retention rates. This study is among the first to employ this hybrid ABS-to-online methodology to recruit and retain youth and young adults in a probability-based online cohort panel. The approach is particularly valuable for conducting research among younger populations as it capitalizes on their increasing access to and comfort with digital communication. We discuss challenges and opportunities of panel recruitment and retention methods in an effort to provide valuable information for tobacco control researchers seeking to obtain representative, population-based samples of youth and young adults in the U.S. as well as across the globe.
© Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

Systematic review of proposed definitions of nocturnal polyuria and population-based evidence of their diagnostic accuracy.

PubMed

Olesen, Tine Kold; Denys, Marie-Astrid; Vande Walle, Johan; Everaert, Karel

2018-02-06

Background Evidence of diagnostic accuracy for proposed definitions of nocturnal polyuria is currently unclear. Purpose Systematic review to determine population-based evidence of the diagnostic accuracy of proposed definitions of nocturnal polyuria based on data from frequency-volume charts. Methods Seventeen pre-specified search terms identified 351 unique investigations published from 1990 to 2016 in BIOSIS, Embase, Embase Alerts, International Pharmaceutical Abstract, Medline, and Cochrane. Thirteen original communications were included in this review based on pre-specified exclusion criteria. Data were extracted from each paper regarding subject age, sex, ethnicity, health status, sample size, data collection methods, and diagnostic discrimination of proposed definitions including sensitivity, specificity, positive and negative predictive value. Results The sample size of study cohorts, participant age, sex, ethnicity, and health status varied considerably in 13 studies reporting on the diagnostic performance of seven different definitions of nocturnal polyuria using frequency-volume chart data from 4968 participants. Most study cohorts were small, mono-ethnic, including only Caucasian males aged 50 or higher with primary or secondary polyuria that were compared to a control group of healthy men without nocturia in prospective or retrospective settings. Proposed definitions had poor discriminatory accuracy in evaluations based on data from subjects independent from the original study cohorts with findings being similar regarding the most widely evaluated definition endorsed by ICS. Conclusions Diagnostic performance characteristics for proposed definitions of nocturnal polyuria show poor to modest discrimination and are not based on sufficient level of evidence from representative, multi-ethnic population-based data from both females and males of all adult ages.
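The four measures of diagnostic discrimination named in this abstract all derive from a 2x2 table of test result against reference status. The sketch below shows the standard definitions; the function name and the counts are hypothetical and are not taken from any of the reviewed studies.

```python
def diagnostic_accuracy(tp, fp, fn, tn):
    """Standard accuracy measures from a 2x2 table of counts.

    tp/fn: reference-positive subjects classified positive/negative
    fp/tn: reference-negative subjects classified positive/negative
    """
    return {
        "sensitivity": tp / (tp + fn),  # positives correctly identified
        "specificity": tn / (tn + fp),  # negatives correctly identified
        "ppv": tp / (tp + fp),          # positive predictive value
        "npv": tn / (tn + fn),          # negative predictive value
    }

# Hypothetical counts for one candidate definition.
for name, value in diagnostic_accuracy(tp=40, fp=30, fn=20, tn=60).items():
    print(f"{name}: {value:.2f}")
```

Unlike sensitivity and specificity, the predictive values depend on how common the condition is in the sample, which is one reason results from small, selected cohorts may not transfer to the general population.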
cloncase: Estimation of sex frequency and effective population size by clonemate resampling in partially clonal organisms.

PubMed

Ali, Sajid; Soubeyrand, Samuel; Gladieux, Pierre; Giraud, Tatiana; Leconte, Marc; Gautier, Angélique; Mboup, Mamadou; Chen, Wanquan; de Vallavieille-Pope, Claude; Enjalbert, Jérôme

2016-07-01

Inferring reproductive and demographic parameters of populations is crucial to our understanding of species ecology and evolutionary potential but can be challenging, especially in partially clonal organisms. Here, we describe a new and accurate method, cloncase, for estimating both the rate of sexual vs. asexual reproduction and the effective population size, based on the frequency of clonemate resampling across generations. Simulations showed that our method provides reliable estimates of sex frequency and effective population size for a wide range of parameters. The cloncase method was applied to Puccinia striiformis f.sp. tritici, a fungal pathogen causing stripe/yellow rust, an important wheat disease. This fungus is highly clonal in Europe but has been suggested to recombine in Asia. Using two temporally spaced samples of P. striiformis f.sp. tritici in China, the estimated sex frequency was 75% (i.e. three-quarter of individuals being sexually derived during the yearly sexual cycle), indicating strong contribution of sexual reproduction to the life cycle of the pathogen in this area. The inferred effective population size of this partially clonal organism (Nc = 998) was in good agreement with estimates obtained using methods based on temporal variations in allelic frequencies. The cloncase estimator presented herein is the first method allowing accurate inference of both sex frequency and effective population size from population data without knowledge of recombination or mutation rates. cloncase can be applied to population genetic data from any organism with cyclical parthenogenesis and should in particular be very useful for improving our understanding of pest and microbial population biology. © 2016 John Wiley & Sons Ltd.
On the Simultaneous Identification and Quantification of Microalgae Populations Based on Fluorometric Techniques.

PubMed

Gsponer, Natalia S; Rodríguez, María Claudia; Palacios, Rodrigo E; Chesta, Carlos A

2018-05-16

In this study, the phytoplankton structure of a freshwater reservoir located in central Argentina (Embalse Río Tercero) was analyzed using Beutler's method (Photosynthesis Research 72: 39-53, 2002), aiming to provide water quality control agencies with a reliable tool for early detection of algae blooms, particularly cyanobacteria. The method estimated the concentration of chlorophyll a (Chl a) contributed by individual algal groups in a real sample by fitting its fluorescence excitation spectrum to a linear combination of norm spectra of relevant algae groups. To this purpose, norm spectra for five algae genera usually found in Embalse Río Tercero, Microcystis, Chlorella, Cyclotella, Ceratium and Porphyridium, were constructed and posteriorly used to analyze samples collected in the reservoir in years 2014-2016. Results showed that the method worked well for the quick identification of the algae present in the samples, but it tended to overestimate its Chl a contents. This error was attributed to the large heterogeneity of the algal populations due to the aging of cells grown in environmental conditions. © 2018 The American Society of Photobiology.
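The fitting step described in this abstract, expressing a sample spectrum as a linear combination of norm spectra, can be sketched with ordinary least squares. This is an illustration under stated assumptions rather than the published procedure: the spectra are invented five-point vectors, the function name is hypothetical, and no non-negativity constraint is applied, which a real implementation might require.

```python
def fit_norm_spectra(norm_spectra, sample):
    """Least-squares coefficients c with sum_k c[k] * norm_spectra[k] ~ sample."""
    k = len(norm_spectra)

    def dot(u, v):
        return sum(a * b for a, b in zip(u, v))

    # Normal equations G c = r, with G[i][j] = <s_i, s_j> and r[i] = <s_i, sample>.
    g = [[dot(norm_spectra[i], norm_spectra[j]) for j in range(k)] for i in range(k)]
    r = [dot(norm_spectra[i], sample) for i in range(k)]

    # Gaussian elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda row: abs(g[row][col]))
        if abs(g[pivot][col]) < 1e-12:
            raise ValueError("norm spectra are linearly dependent")
        g[col], g[pivot] = g[pivot], g[col]
        r[col], r[pivot] = r[pivot], r[col]
        for row in range(col + 1, k):
            factor = g[row][col] / g[col][col]
            for j in range(col, k):
                g[row][j] -= factor * g[col][j]
            r[row] -= factor * r[col]

    # Back substitution.
    coeffs = [0.0] * k
    for i in reversed(range(k)):
        known = sum(g[i][j] * coeffs[j] for j in range(i + 1, k))
        coeffs[i] = (r[i] - known) / g[i][i]
    return coeffs

# Invented norm spectra at five excitation wavelengths for two algal groups.
group_a = [0.1, 0.2, 0.4, 1.0, 0.6]
group_b = [1.0, 0.8, 0.3, 0.2, 0.1]
# Synthetic sample: 2.0 units of group A plus 0.5 units of group B.
sample = [2.0 * a + 0.5 * b for a, b in zip(group_a, group_b)]
print(fit_norm_spectra([group_a, group_b], sample))
```

On this noise-free synthetic sample the fit recovers the two mixing coefficients. With field samples whose cells differ from the cultures used to build the norm spectra, the coefficients can be biased, which is the kind of overestimation the abstract reports.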
Expanding sexually transmitted infection screening among women and men engaging in transactional sex: the feasibility of field-based self-collection.

PubMed

Roth, A M; Rosenberger, J G; Reece, M; Van Der Pol, B

2013-04-01

Routine screening is a key component of sexually transmitted infection (STI) prevention and control; however, traditional programmes often fail to effectively reach men and women in hidden communities. To reduce prevalence, we must understand the programmatic features that would encourage utilization of services among asymptomatic individuals. Using incentivized snowball sampling, 44 women and men recently engaging in transactional sex were recruited (24 women, 20 men); median age 37 years. Respondents were offered the opportunity to collect genital, oropharyngeal and rectal samples for STI testing and completed a face-to-face interview about their experience with self-obtained sampling. Interviews were analysed using qualitative methods. Participants were unaware of potential risk for STI, but found self-sampling in non-clinical settings to be acceptable and preferable to clinic-based testing. All participants collected genital specimens; 96% and 4% collected oropharyngeal and rectal specimens, respectively. The burden of disease in this population was high: 38% tested positive for at least one STI. We detected multiple concomitant infections. Incorporating field collection of self-obtained samples into STI control programmes may increase utilization among high-risk populations unlikely to access clinic-based services. High infection rates indicate that individuals engaging in transactional sex would benefit from, and be responsive to, community-based self-sampling for STI screening.
Conduct of a personal radiofrequency electromagnetic field measurement study: proposed study protocol

PubMed Central

2010-01-01

Background The development of new wireless communication technologies that emit radio frequency electromagnetic fields (RF-EMF) is ongoing, but little is known about the RF-EMF exposure distribution in the general population. Previous attempts to measure personal exposure to RF-EMF have used different measurement protocols and analysis methods making comparisons between exposure situations across different study populations very difficult. As a result, observed differences in exposure levels between study populations may not reflect real exposure differences but may be in part, or wholly due to methodological differences. Methods The aim of this paper is to develop a study protocol for future personal RF-EMF exposure studies based on experience drawn from previous research. Using the current knowledge base, we propose procedures for the measurement of personal exposure to RF-EMF, data collection, data management and analysis, and methods for the selection and instruction of study participants. Results We have identified two basic types of personal RF-EMF measurement studies: population surveys and microenvironmental measurements. In the case of a population survey, the unit of observation is the individual and a randomly selected representative sample of the population is needed to obtain reliable results. For microenvironmental measurements, study participants are selected in order to represent typical behaviours in different microenvironments. These two study types require different methods and procedures. Conclusion Applying our proposed common core procedures in future personal measurement studies will allow direct comparisons of personal RF-EMF exposures in different populations and study areas. PMID:20487532
Diurnal activity of four species of thrips (Thysanoptera: Thripidae) and efficiencies of three nondestructive sampling techniques for thrips in mango inflorescences.

PubMed

Aliakbarpour, H; Rawi, Che Salmah Md

2010-06-01

Thrips cause considerable economic loss to mango, Mangifera indica L., in Penang, Malaysia. Three nondestructive sampling techniques--shaking mango panicles over a moist plastic tray, washing the panicles with ethanol, and immobilization of thrips by using CO2--were evaluated for their precision to determine the most effective technique to capture mango flower thrips (Thysanoptera: Thripidae) in an orchard located at Balik Pulau, Penang, Malaysia, during two flowering seasons from December 2008 to February 2009 and from August to September 2009. The efficiency of each of the three sampling techniques was compared with absolute population counts on whole panicles as a reference. Diurnal flight activity of thrips species was assessed using yellow sticky traps. All three sampling methods and sticky traps were used at two-hour intervals from 0800 to 1800 hours to get insight into diurnal periodicity of thrips abundance in the orchard. Based on pooled data for the two seasons, the CO2 method was the most efficient procedure, extracting 80.7% of adults and 74.5% of larvae. The CO2 method had the lowest relative variation and was the most accurate procedure compared with the absolute method as shown by regression analysis. All collection techniques showed that the numbers of all thrips species in mango panicles increased after 0800 hours, reaching a peak between 1200 and 1400 hours. Adult thrips captured on the sticky traps were the most abundant between 0800-1000 and 1400-1600 hours. According to results of this study, the CO2 method is recommended for sampling of thrips in the field. It is a nondestructive sampling procedure that neither damages flowers nor diminishes fruit production. Management of thrips populations in mango orchards with insecticides would be more effectively carried out during their peak population abundance on the flower panicles at midday to 1400 hours.
Adaptive cluster sampling: An efficient method for assessing inconspicuous species

Treesearch

Andrea M. Silletti; Joan Walker

2003-01-01

Restorationists typically evaluate the success of a project by estimating the population sizes of species that have been planted or seeded. Because total census is rarely feasible, they must rely on sampling methods for population estimates. However, traditional random sampling designs may be inefficient for species that, for one reason or another, are challenging to...
kWIP: The k-mer weighted inner product, a de novo estimator of genetic similarity.

PubMed

Murray, Kevin D; Webers, Christfried; Ong, Cheng Soon; Borevitz, Justin; Warthmann, Norman

2017-09-01

Modern genomics techniques generate overwhelming quantities of data. Extracting population genetic variation demands computationally efficient methods to determine genetic relatedness between individuals (or "samples") in an unbiased manner, preferably de novo. Rapid estimation of genetic relatedness directly from sequencing data has the potential to overcome reference genome bias, and to verify that individuals belong to the correct genetic lineage before conclusions are drawn using mislabelled, or misidentified samples. We present the k-mer Weighted Inner Product (kWIP), an assembly-, and alignment-free estimator of genetic similarity. kWIP combines a probabilistic data structure with a novel metric, the weighted inner product (WIP), to efficiently calculate pairwise similarity between sequencing runs from their k-mer counts. It produces a distance matrix, which can then be further analysed and visualised. Our method does not require prior knowledge of the underlying genomes and applications include establishing sample identity and detecting mix-up, non-obvious genomic variation, and population structure. We show that kWIP can reconstruct the true relatedness between samples from simulated populations. By re-analysing several published datasets we show that our results are consistent with marker-based analyses. kWIP is written in C++, licensed under the GNU GPL, and is available from https://github.com/kdmurray91/kwip.
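The idea of a weighted inner product over k-mer counts can be sketched as below. This is not the kWIP implementation: it counts k-mers exactly rather than with a probabilistic data structure, and the entropy-based weighting is an assumption made for illustration, since the abstract does not specify how the weights are computed. All function names and sequences are hypothetical.

```python
from collections import Counter
from math import log2, sqrt

def kmer_counts(seq, k):
    """Exact counts of every k-mer in a sequence."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))

def entropy_weights(count_list):
    """Assumed weighting: Shannon entropy of each k-mer's presence frequency.

    K-mers present in every sample (or none) carry no information and get 0.
    """
    n = len(count_list)
    weights = {}
    for kmer in set().union(*count_list):
        p = sum(1 for counts in count_list if kmer in counts) / n
        weights[kmer] = 0.0 if p in (0.0, 1.0) else -(p * log2(p) + (1 - p) * log2(1 - p))
    return weights

def weighted_inner_product(a, b, weights):
    return sum(weights[kmer] * a[kmer] * b[kmer] for kmer in a.keys() & b.keys())

def wip_similarity(a, b, weights):
    """Weighted inner product normalised so that self-similarity is 1."""
    denom = sqrt(weighted_inner_product(a, a, weights) * weighted_inner_product(b, b, weights))
    return weighted_inner_product(a, b, weights) / denom if denom else 0.0

# Three short made-up "samples"; real input would be sequencing reads.
samples = ["ACGTACGTAC", "ACGTACGTTT", "TTTTGGGGCC"]
counts = [kmer_counts(s, 3) for s in samples]
weights = entropy_weights(counts)
for i in range(len(samples)):
    print([round(wip_similarity(counts[i], counts[j], weights), 3) for j in range(len(samples))])
```

The printed matrix is a similarity matrix; a distance matrix of the kind the abstract mentions could be derived from it, for example as sqrt(2 - 2 * similarity), though that conversion is also an assumption here.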
Automated quantitative cytological analysis using portable microfluidic microscopy.

PubMed

Jagannadh, Veerendra Kalyan; Murthy, Rashmi Sreeramachandra; Srinivasan, Rajesh; Gorthi, Sai Siva

2016-06-01

In this article, a portable microfluidic microscopy based approach for automated cytological investigations is presented. Inexpensive optical and electronic components have been used to construct a simple microfluidic microscopy system. In contrast to the conventional slide-based methods, the presented method employs microfluidics to enable automated sample handling and image acquisition. The approach involves the use of simple in-suspension staining and automated image acquisition to enable quantitative cytological analysis of samples. The applicability of the presented approach to research in cellular biology is shown by performing an automated cell viability assessment on a given population of yeast cells. Further, the relevance of the presented approach to clinical diagnosis and prognosis has been demonstrated by performing detection and differential assessment of malaria infection in a given sample. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Sodium and potassium content of 24 h urinary collections: a comparison between field- and laboratory-based analysers.

PubMed

Yin, Xuejun; Neal, Bruce; Tian, Maoyi; Li, Zhifang; Petersen, Kristina; Komatsu, Yuichiro; Feng, Xiangxian; Wu, Yangfeng

2018-04-01

Measurement of mean population Na and K intakes typically uses laboratory-based assays, which can add significant logistical burden and costs. A valid field-based measurement method would be a significant advance. In the current study, we used 24 h urine samples to compare estimates of Na, K and Na:K ratio based upon assays done using the field-based Horiba twin meter v. laboratory-based methods. The performance of the Horiba twin meter was determined by comparing field-based estimates of mean Na and K against those obtained using laboratory-based methods. The reported 95 % limits of agreement of Bland-Altman plots were calculated based on a regression approach for non-uniform differences. The 24 h urine samples were collected as part of an ongoing study being done in rural China. One hundred and sixty-six complete 24 h urine samples were qualified for estimating 24 h urinary Na and K excretion. Mean Na and K excretion were estimated as 170·4 and 37·4 mmol/d, respectively, using the meter-based assays; and 193·4 and 43·8 mmol/d, respectively, using the laboratory-based assays. There was excellent relative reliability (intraclass correlation coefficient) for both Na (0·986) and K (0·986). Bland-Altman plots showed moderate-to-good agreement between the two methods. Na and K intake estimations were moderately underestimated using assays based upon the Horiba twin meter. Compared with standard laboratory-based methods, the portable device was more practical and convenient.
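For readers unfamiliar with Bland-Altman analysis, the sketch below computes the classic limits of agreement: the mean difference between two methods plus or minus 1.96 standard deviations of the differences. Note that the study above reports limits from a regression approach for non-uniform differences, which this simpler version does not implement. The function name and the paired values are hypothetical.

```python
from statistics import mean, stdev

def bland_altman_limits(method_a, method_b, z=1.96):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)            # systematic difference between the methods
    spread = z * stdev(diffs)     # half-width of the 95% limits of agreement
    return bias, bias - spread, bias + spread

# Hypothetical paired 24 h urinary Na values (mmol/d): field meter vs laboratory.
meter = [160, 175, 150, 190, 170]
lab = [180, 200, 172, 215, 195]
bias, lower, upper = bland_altman_limits(meter, lab)
print(f"bias {bias:.1f} mmol/d, limits of agreement {lower:.1f} to {upper:.1f}")
```

A negative bias in this made-up example means the meter reads lower than the laboratory on average, the same direction as the underestimation described in the abstract.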
A population-based evolutionary search approach to the multiple minima problem in de novo protein structure prediction

PubMed Central

2013-01-01

Background Elucidating the native structure of a protein molecule from its sequence of amino acids, a problem known as de novo structure prediction, is a long standing challenge in computational structural biology. Difficulties in silico arise due to the high dimensionality of the protein conformational space and the ruggedness of the associated energy surface. The issue of multiple minima is a particularly troublesome hallmark of energy surfaces probed with current energy functions. In contrast to the true energy surface, these surfaces are weakly-funneled and rich in comparably deep minima populated by non-native structures. For this reason, many algorithms seek to be inclusive and obtain a broad view of the low-energy regions through an ensemble of low-energy (decoy) conformations. Conformational diversity in this ensemble is key to increasing the likelihood that the native structure has been captured. Methods We propose an evolutionary search approach to address the multiple-minima problem in decoy sampling for de novo structure prediction. Two population-based evolutionary search algorithms are presented that follow the basic approach of treating conformations as individuals in an evolving population. Coarse graining and molecular fragment replacement are used to efficiently obtain protein-like child conformations from parents. Potential energy is used both to bias parent selection and determine which subset of parents and children will be retained in the evolving population. The effect on the decoy ensemble of sampling minima directly is measured by additionally mapping a conformation to its nearest local minimum before considering it for retainment. The resulting memetic algorithm thus evolves not just a population of conformations but a population of local minima. Results and conclusions Results show that both algorithms are effective in terms of sampling conformations in proximity of the known native structure. 
The additional minimization is shown to be key to enhancing sampling capability and obtaining a diverse ensemble of decoy conformations, circumventing premature convergence to sub-optimal regions in the conformational space, and approaching the native structure with proximity that is comparable to state-of-the-art decoy sampling methods. The results are shown to be robust and valid when using two representative state-of-the-art coarse-grained energy functions. PMID:24565020
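The memetic idea in this abstract, evolving a population in which every candidate is first mapped to its nearest local minimum, can be shown on a toy problem. The sketch below is not the authors' algorithm: it replaces protein conformations with a single number, the energy function with a rugged one-dimensional test function, fragment replacement with Gaussian perturbation, and energy-biased parent selection with "every parent produces one child". All names and parameter values are hypothetical.

```python
import math
import random

def energy(x):
    """Toy rugged energy surface with many local minima; global minimum at x = 0."""
    return x * x + 10.0 * (1.0 - math.cos(2.0 * math.pi * x))

def local_minimize(x, step=0.1, tol=1e-6):
    """Greedy descent: move to a lower neighbour, otherwise halve the step."""
    while step > tol:
        best = min((x - step, x + step), key=energy)
        if energy(best) < energy(x):
            x = best
        else:
            step /= 2.0
    return x

def memetic_search(pop_size=20, generations=30, sigma=1.0, seed=0):
    """Evolve a population of local minima; returns it sorted by energy."""
    rng = random.Random(seed)
    population = [local_minimize(rng.uniform(-5.0, 5.0)) for _ in range(pop_size)]
    for _ in range(generations):
        # Each parent yields one perturbed child, mapped to its local minimum.
        children = [local_minimize(p + rng.gauss(0.0, sigma)) for p in population]
        # Truncation selection over parents and children together.
        population = sorted(population + children, key=energy)[:pop_size]
    return population

minima = memetic_search()
print(f"best x = {minima[0]:.4f}, energy = {energy(minima[0]):.6f}")
```

Plain truncation selection, as used here, tends to fill the population with copies of the best minimum. Keeping a diverse ensemble, which the abstract stresses, would need an explicit diversity-preserving selection rule.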
A population-based nested case control study on recurrent pneumonias in children with severe generalized cerebral palsy: ethical considerations of the design and representativeness of the study sample.

PubMed

Veugelers, Rebekka; Calis, Elsbeth A C; Penning, Corine; Verhagen, Arianne; Bernsen, Roos; Bouquet, Jan; Benninga, Marc A; Merkus, Peter J F M; Arets, Hubertus G M; Tibboel, Dick; Evenhuis, Heleen M

2005-07-19

In children with severe generalized cerebral palsy, pneumonias are a major health issue. Malnutrition, dysphagia, gastro-oesophageal reflux, impaired respiratory function and constipation are hypothesized risk factors. Still, no data are available on the relative contribution of these possible risk factors in the described population. This paper describes the initiation of a study in 194 children with severe generalized cerebral palsy, on the prevalence and on the impact of these hypothesized risk factors of recurrent pneumonias. A nested case-control design with 18 months follow-up was chosen. Dysphagia, respiratory function and constipation will be assessed at baseline, malnutrition and gastro-oesophageal reflux at the end of the follow-up. The study population consists of a representative population sample of children with severe generalized cerebral palsy. Inclusion was done through care-centres in a predefined geographical area and not through hospitals. All measurements will be done on-site which sets high demands on all measurements. If these demands were not met in "gold standard" methods, other methods were chosen. Although the inclusion period was prolonged, the desired sample size of 300 children was not met. With a consent rate of 33%, nearly 10% of all eligible children in The Netherlands are included (n = 194). The study population is subtly different from the non-participants with regard to severity of dysphagia and prevalence rates of pneumonias and gastro-oesophageal reflux. Ethical issues complicated the study design. Assessment of malnutrition and gastro-oesophageal reflux at baseline was considered unethical, since these conditions can be easily treated. Therefore, we postponed these diagnostics until the end of the follow-up. In order to include a representative sample, all eligible children in a predefined geographical area had to be contacted. 
To increase the consent rate, on-site measurements are of first choice, but timely inclusion is jeopardized. The initiation of this first study among children with severe neurological impairment led to specific, unexpected problems. Despite small differences between participants and non-participating children, our sample is as representative as can be expected from any population-based study and will provide important, new information to bring us further towards effective interventions to prevent pneumonias in this population.
Single-virion sequencing of lamivudine-treated HBV populations reveal population evolution dynamics and demographic history.

PubMed

Zhu, Yuan O; Aw, Pauline P K; de Sessions, Paola Florez; Hong, Shuzhen; See, Lee Xian; Hong, Lewis Z; Wilm, Andreas; Li, Chen Hao; Hue, Stephane; Lim, Seng Gee; Nagarajan, Niranjan; Burkholder, William F; Hibberd, Martin

2017-10-27

Viral populations are complex, dynamic, and fast evolving. The evolution of groups of closely related viruses in a competitive environment is termed quasispecies. To fully understand the role that quasispecies play in viral evolution, characterizing the trajectories of viral genotypes in an evolving population is the key. In particular, long-range haplotype information for thousands of individual viruses is critical; yet generating this information is non-trivial. Popular deep sequencing methods generate relatively short reads that do not preserve linkage information, while third generation sequencing methods have higher error rates that make detection of low frequency mutations a bioinformatics challenge. Here we applied BAsE-Seq, an Illumina-based single-virion sequencing technology, to eight samples from four chronic hepatitis B (CHB) patients - once before antiviral treatment and once after viral rebound due to resistance. With single-virion sequencing, we obtained 248-8796 single-virion sequences per sample, which allowed us to find evidence for both hard and soft selective sweeps. We were able to reconstruct population demographic history that was independently verified by clinically collected data. We further verified four of the samples independently through PacBio SMRT and Illumina Pooled deep sequencing. Overall, we showed that single-virion sequencing yields insight into viral evolution and population dynamics in an efficient and high throughput manner. We believe that single-virion sequencing is widely applicable to the study of viral evolution in the context of drug resistance and host adaptation, allows differentiation between soft or hard selective sweeps, and may be useful in the reconstruction of intra-host viral population demographic history.
Inferring modes of colonization for pest species using heterozygosity comparisons and a shared-allele test.

PubMed

Sved, J A; Yu, H; Dominiak, B; Gilchrist, A S

2003-02-01

Long-range dispersal of a species may involve either a single long-distance movement from a core population or spreading via unobserved intermediate populations. Where the new populations originate as small propagules, genetic drift may be extreme and gene frequency or assignment methods may not prove useful in determining the relation between the core population and outbreak samples. We describe computationally simple resampling methods for use in this situation to distinguish between the different modes of dispersal. First, estimates of heterozygosity can be used to test for direct sampling from the core population and to estimate the effective size of intermediate populations. Second, a test of sharing of alleles, particularly rare alleles, can show whether outbreaks are related to each other rather than arriving as independent samples from the core population. The shared-allele statistic also serves as a genetic distance measure that is appropriate for small samples. These methods were applied to data on a fruit fly pest species, Bactrocera tryoni, which is quarantined from some horticultural areas in Australia. We concluded that the outbreaks in the quarantine zone came from a heterogeneous set of genetically differentiated populations, possibly ones that overwinter in the vicinity of the quarantine zone.
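
The heterozygosity comparison described in this abstract can be illustrated with a short resampling sketch. It is not the authors' procedure: the allele frequencies, sample size, and function names below are hypothetical, and the test shown is only one plausible reading of "direct sampling from the core population". It draws outbreak-sized samples from assumed core-population allele frequencies and reports how often the resampled gene diversity is as low as the observed one.

```python
import random

def expected_heterozygosity(alleles):
    """Gene diversity 1 - sum(p_i^2) computed from a list of sampled allele labels."""
    n = len(alleles)
    counts = {}
    for a in alleles:
        counts[a] = counts.get(a, 0) + 1
    return 1.0 - sum((c / n) ** 2 for c in counts.values())

def resample_p_value(core_freqs, outbreak_alleles, n_reps=2000, seed=0):
    """Fraction of direct draws from the core whose heterozygosity is
    less than or equal to that of the outbreak sample (hypothetical test)."""
    rng = random.Random(seed)
    labels = list(core_freqs)
    weights = [core_freqs[a] for a in labels]
    observed = expected_heterozygosity(outbreak_alleles)
    n = len(outbreak_alleles)
    hits = 0
    for _ in range(n_reps):
        draw = rng.choices(labels, weights=weights, k=n)
        if expected_heterozygosity(draw) <= observed:
            hits += 1
    return hits / n_reps
```

A small value suggests the outbreak is less diverse than a direct sample from the core would be, which is the pattern expected if it passed through a small intermediate population.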
Reaching the Hard-to-Reach: A Probability Sampling Method for Assessing Prevalence of Driving under the Influence after Drinking in Alcohol Outlets

PubMed Central

De Boni, Raquel; do Nascimento Silva, Pedro Luis; Bastos, Francisco Inácio; Pechansky, Flavio; de Vasconcellos, Mauricio Teixeira Leite

2012-01-01

Drinking alcoholic beverages in places such as bars and clubs may be associated with harmful consequences such as violence and impaired driving. However, methods for obtaining probabilistic samples of drivers who drink at these places remain a challenge – since there is no a priori information on this mobile population – and must be continually improved. This paper describes the procedures adopted in the selection of a population-based sample of drivers who drank at alcohol selling outlets in Porto Alegre, Brazil, which we used to estimate the prevalence of intention to drive under the influence of alcohol. The sampling strategy comprises a stratified three-stage cluster sampling: 1) census enumeration areas (CEA) were stratified by alcohol outlets (AO) density and sampled with probability proportional to the number of AOs in each CEA; 2) combinations of outlets and shifts (COS) were stratified by prevalence of alcohol-related traffic crashes and sampled with probability proportional to their squared duration in hours; and, 3) drivers who drank at the selected COS were stratified by their intention to drive and sampled using inverse sampling. Sample weights were calibrated using a post-stratification estimator. 3,118 individuals were approached and 683 drivers interviewed, leading to an estimate that 56.3% (SE = 3.5%) of the drivers intended to drive after drinking in less than one hour after the interview. Prevalence was also estimated by sex and broad age groups. The combined use of stratification and inverse sampling enabled a good trade-off between resource and time allocation, while preserving the ability to generalize the findings. The current strategy can be viewed as a step forward in the efforts to improve surveys and estimation for hard-to-reach, mobile populations. PMID:22514620
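
Two ingredients of the design above, selection with probability proportional to size and post-stratification of weights, can be sketched as follows. This is a minimal illustration under stated assumptions: the abstract does not say which PPS scheme was used, so systematic PPS is swapped in here, and the unit names, sizes, and population totals are hypothetical.

```python
import random

def pps_systematic(sizes, n, rng):
    """Select n units with probability proportional to size (systematic PPS).
    sizes maps unit -> size measure, e.g. number of outlets per enumeration area.
    A unit larger than the sampling interval can be selected more than once."""
    total = sum(sizes.values())
    step = total / n
    start = rng.random() * step          # random start in [0, step)
    targets = [start + k * step for k in range(n)]
    selected, cum, i = [], 0.0, 0
    for unit, size in sizes.items():
        cum += size                      # running cumulative size
        while i < n and targets[i] < cum:
            selected.append(unit)
            i += 1
    return selected

def post_stratify(design_weights, strata, population_totals):
    """Rescale weights so each post-stratum's weights sum to its known total."""
    sums = {}
    for w, s in zip(design_weights, strata):
        sums[s] = sums.get(s, 0.0) + w
    return [w * population_totals[s] / sums[s]
            for w, s in zip(design_weights, strata)]
```

For example, `pps_systematic({"cea1": 10, "cea2": 30, "cea3": 60}, 2, random.Random(1))` always includes `cea3`, because its size exceeds the sampling interval of 50.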
Sampling in health geography: reconciling geographical objectives and probabilistic methods. An example of a health survey in Vientiane (Lao PDR)

PubMed Central

Vallée, Julie; Souris, Marc; Fournet, Florence; Bochaton, Audrey; Mobillion, Virginie; Peyronnie, Karine; Salem, Gérard

2007-01-01

Background Geographical objectives and probabilistic methods are difficult to reconcile in a unique health survey. Probabilistic methods focus on individuals to provide estimates of a variable's prevalence with a certain precision, while geographical approaches emphasise the selection of specific areas to study interactions between spatial characteristics and health outcomes. A sample selected from a small number of specific areas creates statistical challenges: the observations are not independent at the local level, and this results in poor statistical validity at the global level. Therefore, it is difficult to construct a sample that is appropriate for both geographical and probability methods. Methods We used a two-stage selection procedure with a first non-random stage of selection of clusters. Instead of randomly selecting clusters, we deliberately chose a group of clusters, which as a whole would contain all the variation in health measures in the population. As there was no health information available before the survey, we selected a priori determinants that can influence the spatial homogeneity of the health characteristics. This method yields a distribution of variables in the sample that closely resembles that in the overall population, something that cannot be guaranteed with randomly-selected clusters, especially if the number of selected clusters is small. In this way, we were able to survey specific areas while minimising design effects and maximising statistical precision. Application We applied this strategy in a health survey carried out in Vientiane, Lao People's Democratic Republic. We selected well-known health determinants with unequal spatial distribution within the city: nationality and literacy. We deliberately selected a combination of clusters whose distribution of nationality and literacy is similar to the distribution in the general population. 
Conclusion This paper describes the conceptual reasoning behind the construction of the survey sample and shows that it can be advantageous to choose clusters using reasoned hypotheses, based on both probability and geographical approaches, in contrast to a conventional, random cluster selection strategy. PMID:17543100
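
The idea of deliberately choosing a combination of clusters whose pooled distribution resembles the population can be sketched with a brute-force search. This is only an illustration, not the authors' procedure: it uses a single hypothetical binary determinant rather than the two (nationality and literacy) named in the abstract, and the cluster names and counts are invented.

```python
from itertools import combinations

def best_cluster_set(clusters, population_share, k):
    """clusters maps name -> (n_people, n_with_trait).
    Return the k clusters whose pooled trait share is closest to population_share."""
    def gap(names):
        people = sum(clusters[c][0] for c in names)
        trait = sum(clusters[c][1] for c in names)
        return abs(trait / people - population_share)
    # Exhaustive search; feasible only for a modest number of candidate clusters.
    return min(combinations(sorted(clusters), k), key=gap)
```

With several determinants, the gap function would need to combine the discrepancies on each one, and the exhaustive search would likely be replaced by a heuristic.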
Assessing the Generalizability of Randomized Trial Results to Target Populations

PubMed Central

Stuart, Elizabeth A.; Bradshaw, Catherine P.; Leaf, Philip J.

2014-01-01

Recent years have seen increasing interest in and attention to evidence-based practices, where the “evidence” generally comes from well-conducted randomized trials. However, while those trials yield accurate estimates of the effect of the intervention for the participants in the trial (known as “internal validity”), they do not always yield relevant information about the effects in a particular target population (known as “external validity”). This may be due to a lack of specification of a target population when designing the trial, difficulties recruiting a sample that is representative of a pre-specified target population, or to interest in considering a target population somewhat different from the population directly targeted by the trial. This paper first provides an overview of existing design and analysis methods for assessing and enhancing the ability of a randomized trial to estimate treatment effects in a target population. It then provides a case study using one particular method, which weights the subjects in a randomized trial to match the population on a set of observed characteristics. The case study uses data from a randomized trial of School-wide Positive Behavioral Interventions and Supports (PBIS); our interest is in generalizing the results to the state of Maryland. In the case of PBIS, after weighting, estimated effects in the target population were similar to those observed in the randomized trial. The paper illustrates that statistical methods can be used to assess and enhance the external validity of randomized trials, making the results more applicable to policy and clinical questions. However, there are also many open research questions; future research should focus on questions of treatment effect heterogeneity and further developing these methods for enhancing external validity. 
Researchers should think carefully about the external validity of randomized trials and be cautious about extrapolating results to specific populations unless they are confident of the similarity between the trial sample and that target population. PMID:25307417
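
The weighting approach mentioned in this abstract can be illustrated with a deliberately simplified sketch. It assumes a single categorical characteristic with known population shares, whereas the case study matches on a set of observed characteristics. The function names and data are hypothetical.

```python
def cell_weights(trial_cells, population_shares):
    """Weight each trial subject by (population share of its cell) / (trial share of its cell)."""
    n = len(trial_cells)
    counts = {}
    for c in trial_cells:
        counts[c] = counts.get(c, 0) + 1
    return [population_shares[c] / (counts[c] / n) for c in trial_cells]

def weighted_effect(outcomes, treated, weights):
    """Weighted difference in mean outcome, treated minus control."""
    def wmean(flag):
        num = sum(w * y for y, t, w in zip(outcomes, treated, weights) if t == flag)
        den = sum(w for t, w in zip(treated, weights) if t == flag)
        return num / den
    return wmean(True) - wmean(False)
```

If the effect differs between cells and the trial over-represents one of them, the weighted estimate moves toward the population-average effect. If effects are homogeneous, weighted and unweighted estimates agree, as the abstract reports for PBIS.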
Assessing the generalizability of randomized trial results to target populations.

PubMed

Stuart, Elizabeth A; Bradshaw, Catherine P; Leaf, Philip J

2015-04-01

Recent years have seen increasing interest in and attention to evidence-based practices, where the "evidence" generally comes from well-conducted randomized trials. However, while those trials yield accurate estimates of the effect of the intervention for the participants in the trial (known as "internal validity"), they do not always yield relevant information about the effects in a particular target population (known as "external validity"). This may be due to a lack of specification of a target population when designing the trial, difficulties recruiting a sample that is representative of a prespecified target population, or to interest in considering a target population somewhat different from the population directly targeted by the trial. This paper first provides an overview of existing design and analysis methods for assessing and enhancing the ability of a randomized trial to estimate treatment effects in a target population. It then provides a case study using one particular method, which weights the subjects in a randomized trial to match the population on a set of observed characteristics. The case study uses data from a randomized trial of school-wide positive behavioral interventions and supports (PBIS); our interest is in generalizing the results to the state of Maryland. In the case of PBIS, after weighting, estimated effects in the target population were similar to those observed in the randomized trial. The paper illustrates that statistical methods can be used to assess and enhance the external validity of randomized trials, making the results more applicable to policy and clinical questions. However, there are also many open research questions; future research should focus on questions of treatment effect heterogeneity and further developing these methods for enhancing external validity. 
Researchers should think carefully about the external validity of randomized trials and be cautious about extrapolating results to specific populations unless they are confident of the similarity between the trial sample and that target population.
Prevalence and Predictors of Sexual Risks Among Homeless Youth

ERIC Educational Resources Information Center

Halcon, Linda L.; Lifson, Alan R.

2004-01-01

This study examined prevalence of sexual risks among homeless adolescents and described factors associated with those risks. Community-based outreach methods were used successfully to access this difficult-to-reach population. The sample included 203 homeless youth aged 15-22 recruited from community sites. Questionnaire items addressed…
Early Vocabulary Delay and Behavioral/Emotional Problems in Early Childhood: The Generation R Study

ERIC Educational Resources Information Center

Henrichs, Jens; Rescorla, Leslie; Donkersloot, Cootje; Schenk, Jacqueline J.; Raat, Hein; Jaddoe, Vincent W. V.; Hofman, Albert; Verhulst, Frank C.; Tiemeier, Henning

2013-01-01

Purpose: The authors tested associations between (a) parent-reported temporary vs. persistent vocabulary delay and (b) parent-reported behavioral/emotional problems in a sample of 5,497 young Dutch children participating in a prospective population-based study. Method: Mothers completed the MacArthur Communicative Development…

Marital Distress and Mental Health Care Service Utilization

ERIC Educational Resources Information Center

Schonbrun, Yael Chatav; Whisman, Mark A.

2010-01-01

Objective: This study was designed to evaluate the association between marital distress and mental health service utilization in a population-based sample of men and women (N = 1,601). Method: The association between marital distress and mental health care service utilization was evaluated for overall mental health service utilization and for…
Determining the linkage of disease-resistance genes to molecular markers: the LOD-SCORE method revisited with regard to necessary sample sizes.

PubMed

Hühn, M

1995-05-01

Some approaches to molecular marker-assisted linkage detection for a dominant disease-resistance trait based on a segregating F2 population are discussed. Analysis of two-point linkage is carried out by the traditional measure of maximum lod score. It depends on (1) the maximum-likelihood estimate of the recombination fraction between the marker and the disease-resistance gene locus, (2) the observed absolute frequencies, and (3) the unknown number of tested individuals. If one replaces the absolute frequencies by expressions depending on the unknown sample size and the maximum-likelihood estimate of recombination value, the conventional rule for significant linkage (maximum lod score exceeds a given linkage threshold) can be resolved for the sample size. For each sub-population used for linkage analysis [susceptible (= recessive) individuals, resistant (= dominant) individuals, complete F2] this approach gives a lower bound for the necessary number of individuals required for the detection of significant two-point linkage by the lod-score method.
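
The step of resolving the linkage rule for the sample size can be illustrated in a simpler setting than the dominant-trait F2 case treated in the paper. Assuming n fully informative, phase-known gametes with estimated recombination fraction r, the maximum lod score is n[r log10(r) + (1 - r) log10(1 - r) + log10(2)], so requiring it to reach a threshold T gives a lower bound on n. The function names and the default threshold of 3 are assumptions made for this sketch.

```python
import math

def lod_per_individual(r):
    """Maximum lod score contributed per fully informative gamete at
    recombination fraction estimate r, for 0 <= r < 0.5."""
    if not 0.0 <= r < 0.5:
        raise ValueError("r must satisfy 0 <= r < 0.5")
    first = 0.0 if r == 0.0 else r * math.log10(r)   # limit of r*log10(r) at 0 is 0
    return first + (1.0 - r) * math.log10(1.0 - r) + math.log10(2.0)

def min_sample_size(r, threshold=3.0):
    """Smallest n for which n * lod_per_individual(r) reaches the threshold."""
    return math.ceil(threshold / lod_per_individual(r))
```

Under these assumptions, complete linkage (r = 0) needs at least 10 individuals and r = 0.1 needs at least 19. The F2 sub-populations discussed in the abstract carry less information per individual, so their bounds would be larger.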
Efficient evaluation of sampling quality of molecular dynamics simulations by clustering of dihedral torsion angles and Sammon mapping.

PubMed

Frickenhaus, Stephan; Kannan, Srinivasaraghavan; Zacharias, Martin

2009-02-01

A direct conformational clustering and mapping approach for peptide conformations based on backbone dihedral angles has been developed and applied to compare conformational sampling of Met-enkephalin using two molecular dynamics (MD) methods. Efficient clustering in dihedrals has been achieved by evaluating all combinations resulting from independent clustering of each dihedral angle distribution, thus resolving all conformational substates. In contrast, Cartesian clustering was unable to accurately distinguish between all substates. Projection of clusters on dihedral principal component (PCA) subspaces did not result in efficient separation of highly populated clusters. However, representation in a nonlinear metric by Sammon mapping was able to separate well the 48 highest populated clusters in just two dimensions. In addition, this approach also allowed us to visualize the transition frequencies between clusters efficiently. Significantly, higher transition frequencies between more distinct conformational substates were found for a recently developed biasing-potential replica exchange MD simulation method allowing faster sampling of possible substates compared to conventional MD simulations. Although the number of theoretically possible clusters grows exponentially with peptide length, in practice, the number of clusters is only limited by the sampling size (typically much smaller), and therefore the method is well suited also for large systems. The approach could be useful to rapidly and accurately evaluate conformational sampling during MD simulations, to compare different sampling strategies and eventually to detect kinetic bottlenecks in folding pathways.
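
The combination of independent per-dihedral clusterings into conformational substates can be sketched as follows. This is a minimal illustration, not the authors' implementation: the cut points separating the one-dimensional clusters are assumed to be given, and all names and values are hypothetical.

```python
from bisect import bisect_right
from collections import Counter

def angle_label(angle_deg, boundaries):
    """1-D cluster index of a dihedral angle in [-180, 180).
    boundaries are sorted cut points; angles past the last cut wrap around
    into cluster 0 together with angles before the first cut (periodicity)."""
    return bisect_right(boundaries, angle_deg) % len(boundaries)

def composite_states(frames, boundaries_per_angle):
    """Map each frame (a tuple of dihedral angles) to a tuple of per-angle labels."""
    return [tuple(angle_label(a, b) for a, b in zip(frame, boundaries_per_angle))
            for frame in frames]

def transition_counts(states):
    """Count transitions between distinct consecutive composite states."""
    return Counter((s, t) for s, t in zip(states, states[1:]) if s != t)
```

Only composite states that actually occur in the trajectory are stored, which is why the number of clusters stays bounded by the number of frames even though the number of possible label combinations grows exponentially with the number of dihedrals.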
Association between pregnancy complications and small-for-gestational-age birth weight defined by customized fetal growth standard versus a population-based standard.

PubMed

Odibo, Anthony O; Francis, Andre; Cahill, Alison G; Macones, George A; Crane, James P; Gardosi, Jason

2011-03-01

To derive coefficients for developing a customized growth chart for a Mid-Western US population, and to estimate the association between pregnancy outcomes and smallness for gestational age (SGA) defined by the customized growth chart compared with a population-based growth chart for the USA. A retrospective cohort study of an ultrasound database using 54,433 pregnancies meeting inclusion criteria was conducted. Coefficients for customized centiles were derived using 42,277 pregnancies and compared with those obtained from other populations. Two adverse outcome indicators were defined (greater than 7 day stay in the neonatal unit and stillbirth [SB]), and the risk for each outcome was calculated for the groups of pregnancies defined as SGA by the population standard and SGA by the customized standard using 12,456 pregnancies for the validation sample. The growth potential expressed as weight at 40 weeks in this population was 3524 g (standard error: 402 g). In the validation population, 4055 cases of SGA were identified using both population and customized standards. The cases additionally identified as SGA by the customized method had a significantly increased risk of each of the adverse outcome categories. The sensitivity and specificity of those identified as SGA by customized method only for detecting pregnancies at risk for SB was 32.7% (95% confidence interval [CI] 27.0-38.8%) and 95.1% (95% CI: 94.7-95.0%) versus 0.8% (95% CI 0.1-2.7%) and 98.0% (95% CI 97.8-98.2%) for those identified by only the population-based method, respectively. SGA defined by customized growth potential is able to identify substantially more pregnancies at risk for adverse outcome than the currently used national standard for fetal growth.
Harnessing Data to Assess Equity of Care by Race, Ethnicity and Language

PubMed Central

Gracia, Amber; Cheirif, Jorge; Veliz, Juana; Reyna, Melissa; Vecchio, Mara; Aryal, Subhash

2015-01-01

Objective: Determine any disparities in care based on race, ethnicity and language (REaL) by utilizing inpatient (IP) core measures at Texas Health Resources, a large, faith-based, non-profit health care delivery system located in a large, ethnically diverse metropolitan area in Texas. These measures, which were established by the U.S. Centers for Medicare and Medicaid Services (CMS) and The Joint Commission (TJC), help to ensure better accountability for patient outcomes throughout the U.S. health care system. Methods: Sample analysis to understand the architecture of race, ethnicity and language (REaL) variables within the Texas Health clinical database, followed by development of the logic, method and framework for isolating populations and evaluating disparities by race (non-Hispanic White, non-Hispanic Black, Native American/Native Hawaiian/Pacific Islander, Asian and Other); ethnicity (Hispanic and non-Hispanic); and preferred language (English and Spanish). The study is based on use of existing clinical data for four inpatient (IP) core measures: Acute Myocardial Infarction (AMI), Congestive Heart Failure (CHF), Pneumonia (PN) and Surgical Care (SCIP), representing 100% of the sample population. These comprise a high number of cases presenting in our acute care facilities. Findings are based on a sample of clinical data (N = 19,873 cases) for the four inpatient (IP) core measures derived from 13 of Texas Health’s wholly-owned facilities, formulating a set of baseline data. Results: Based on applied method, Texas Health facilities consistently scored high with no discernable race, ethnicity and language (REaL) disparities as evidenced by a low percentage difference to the reference point (non-Hispanic White) on IP core measures, including: AMI (0.3%–1.2%), CHF (0.7%–3.0%), PN (0.5%–3.7%), and SCIP (0–0.7%). PMID:26703665
The demand control model and circadian saliva cortisol variations in a Swedish population based sample (The PART study)

PubMed Central

Alderling, Magnus; Theorell, Töres; de la Torre, Bartolomé; Lundberg, Ingvar

2006-01-01

Background Previous studies of the relationship between job strain and blood or saliva cortisol levels have been small and based on selected occupational groups. Our aim was to examine the association between job strain and saliva cortisol levels in a population-based study in which a number of potential confounders could be adjusted for. Methods The material derives from a population-based study in Stockholm on mental health and its potential determinants. Two data collections were performed three years apart with more than 8500 subjects responding to a questionnaire in both waves. In this paper our analyses are based on 529 individuals who held a job, participated in both waves as well as in an interview linked to the second wave. They gave saliva samples at awakening, half an hour later, at lunchtime and before going to bed on a weekday in close connection with the interview. Job control and job demands were assessed from the questionnaire in the second wave. Mixed models were used to analyse the association between the demand control model and saliva cortisol. Results Women in low strain jobs (high control and low demands) had significantly lower cortisol levels half an hour after awakening than women in high strain (low control and high demands), active (high control and high demands) or passive jobs (low control and low demands). There were no significant differences between the groups during other parts of the day and furthermore there was no difference between the job strain, active and passive groups. For men, no differences were found between demand control groups. Conclusion This population-based study, on a relatively large sample, weakly supports the hypothesis that the demand control model is associated with saliva cortisol concentrations. PMID:17129377
DOE Office of Scientific and Technical Information (OSTI.GOV)

Belinsky, Steven A; Palmisano, William A

A molecular marker-based method for monitoring and detecting cancer in humans. Aberrant methylation of gene promoters is a marker for cancer risk in humans. A two-stage, or "nested" polymerase chain reaction method is disclosed for detecting methylated DNA sequences at sufficiently high levels of sensitivity to permit cancer screening in biological fluid samples, such as sputum, obtained non-invasively. The method is for detecting the aberrant methylation of the p16 gene, O6-methylguanine-DNA methyltransferase gene, Death-associated protein kinase gene, RAS-associated family 1 gene, or other gene promoters. The method offers a potentially powerful approach to population-based screening for the detection of lung and other cancers.
Using Panel Vendors for Recruitment Into a Web-Based Family Prevention Program: Methodological Considerations.

PubMed

Wang-Schweig, Meme; Miller, Brenda A; Buller, David B; Byrnes, Hilary F; Bourdeau, Beth; Rogers, Veronica

2017-01-01

Use of online panel vendors in research has grown over the past decade. Panel vendors are organizations that recruit participants into a panel to take part in web-based surveys and match panelists to a target audience for data collection. We used two panel vendors to recruit families (N = 411) with a 16- to 17-year-old teen to participate in a randomized control trial (RCT) of an online family-based program to prevent underage drinking and risky sexual behaviors. Our article addresses the following research questions: (1) How well do panel vendors provide a sample of families who meet our inclusion criteria to participate in an RCT? (2) How well do panel vendors provide a sample of families who reflect the characteristics of the general population? and (3) Does the choice of vendor influence the characteristics of families that we engage in research? Despite the screening techniques used by the panel vendors to identify families who met our inclusion criteria, 23.8% were found ineligible when research staff verified their eligibility by direct telephone contact. Compared to the general U.S. population, our sample had more Whites and more families with higher education levels. Finally, across the two panel vendors, there were no significant differences in the characteristics of families, except for mean age. The online environment provides opportunities for new methods to recruit participants in research studies. However, innovative recruitment methods need careful study to ensure the quality of their samples.
Regression trees for predicting mortality in patients with cardiovascular disease: What improvement is achieved by using ensemble-based methods?

PubMed Central

Austin, Peter C; Lee, Douglas S; Steyerberg, Ewout W; Tu, Jack V

2012-01-01

In biomedical research, the logistic regression model is the most commonly used method for predicting the probability of a binary outcome. While many clinical researchers have expressed an enthusiasm for regression trees, this method may have limited accuracy for predicting health outcomes. We aimed to evaluate the improvement that is achieved by using ensemble-based methods, including bootstrap aggregation (bagging) of regression trees, random forests, and boosted regression trees. We analyzed 30-day mortality in two large cohorts of patients hospitalized with either acute myocardial infarction (N = 16,230) or congestive heart failure (N = 15,848) in two distinct eras (1999–2001 and 2004–2005). We found that both the in-sample and out-of-sample prediction of ensemble methods offered substantial improvement in predicting cardiovascular mortality compared to conventional regression trees. However, conventional logistic regression models that incorporated restricted cubic smoothing splines had even better performance. We conclude that ensemble methods from the data mining and machine learning literature increase the predictive performance of regression trees, but may not lead to clear advantages over conventional logistic regression models for predicting short-term mortality in population-based samples of subjects with cardiovascular disease. PMID:22777999
Examining biological continuity across the late holocene occupation of the Aleutian Islands using cranial morphometrics and quantitative genetic permutation.

PubMed

Maley, Blaine

2016-05-01

The number of distinct human migrations into the Aleutian Islands during the Holocene has been a recurrent debate in the anthropological literature. Stemming from Hrdlička's sorting of the prehistoric remains into two distinct populations based on archaeological context and cranial measurements, the human occupation of the Aleutian Islands has long been thought to be the consequence of two distinct human migrations, a Paleo-Aleut migration that provided the initial settlement of the islands, and a Neo-Aleut migration that replaced the original settlers around 1000 BP. This study examines the relationship of the Aleut cranial assemblages in the context of greater Alaskan population variability to assess the evidence for a substantial migration into the Aleutian Islands during the late Holocene. A battery of 29 cranial measurements that quantify global cranial shape were analyzed using Euclidean morphometric methods and quantitative genetic permutation methods to examine the plausibility for two distinct Aleut occupations ("Paleo-Aleut" and "Neo-Aleut"), the latter of which is held to share closer phenetic affinities to mainland Alaskan populations than the former. The Aleut skeletal assemblages were arranged according to temporal association, geographic location, and cranial typology, and analyzed within a comparative framework of mainland Alaskan samples using principal coordinates, biological distance and random skewers permutation methods. Regardless of how the Aleut assemblages are divided, they show greater similarity to each other than to any of the mainland Alaskan assemblages. These findings are consistent across the methodological approaches. 
The results obtained in this study provide no support for a cranial morphology-based subdivision of the Aleuts into two distinct samples. Hence, there is no evidence for a substantial population migration of so-called Neo-Aleuts, nor for a population replacement event of an extant Paleo-Aleut population by a mainland-affiliated Neo-Aleut population at or after 1000 BP. © 2016 Wiley Periodicals, Inc.
Design and implementation of estimation-based monitoring programs for flora and fauna: A case study on the Cherokee National Forest

USGS Publications Warehouse

Klimstra, J.D.; O'Connell, A.F.; Pistrang, M.J.; Lewis, L.M.; Herrig, J.A.; Sauer, J.R.

2007-01-01

Science-based monitoring of biological resources is important for a greater understanding of ecological systems and for assessment of the target population using theoretic-based management approaches. When selecting variables to monitor, managers first need to carefully consider their objectives, the geographic and temporal scale at which they will operate, and the effort needed to implement the program. Generally, monitoring can be divided into two categories: index and inferential. Although index monitoring is usually easier to implement, analysis of index data requires strong assumptions about consistency in detection rates over time and space, and parameters are often biased when detectability and spatial variation are not accounted for. In most cases, individuals are not always available for detection during sampling periods, and the entire area of interest cannot be sampled. Conversely, inferential monitoring is more rigorous because it is based on nearly unbiased estimators of spatial distribution. Thus, we recommend that detectability and spatial variation be considered for all monitoring programs that intend to make inferences about the target population or the area of interest. Application of these techniques is especially important for the monitoring of Threatened and Endangered (T&E) species because it is critical to determine if population size is increasing or decreasing with some level of certainty. Use of estimation-based methods and probability sampling will reduce many of the biases inherently associated with index data and provide meaningful information with respect to changes that occur in target populations. We incorporated inferential monitoring into protocols for T&E species spanning a wide range of taxa on the Cherokee National Forest in the Southern Appalachian Mountains. 
We review the various approaches employed for different taxa and discuss design issues, sampling strategies, data analysis, and the details of estimating detectability using site occupancy. These techniques provide a science-based approach for monitoring and can be of value to all resource managers responsible for management of T&E species.
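
Why index data are biased when detection is imperfect can be shown with a small arithmetic sketch. It assumes a constant per-visit detection probability p that is already known, which is a strong simplification: site-occupancy models estimate occupancy and detectability jointly from repeat-visit data. The function names and numbers are hypothetical.

```python
def detect_at_least_once(p, visits):
    """Probability that a species present at a site is detected on at least
    one of `visits` independent surveys with per-visit detection probability p."""
    return 1.0 - (1.0 - p) ** visits

def corrected_occupancy(sites_with_detection, n_sites, p, visits):
    """Naive occupancy (share of sites with a detection) divided by the
    probability of detecting the species at an occupied site, capped at 1."""
    naive = sites_with_detection / n_sites
    return min(1.0, naive / detect_at_least_once(p, visits))
```

With p = 0.5 and two visits, a quarter of occupied sites yield no detection, so a naive occupancy of 0.30 corresponds to a corrected value of about 0.40 under these assumptions.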
Sex differences in fingerprint ridge density in the Mataco-Mataguayo population.

PubMed

Gutiérrez-Redomero, E; Alonso, M C; Dipierri, J E

2011-12-01

Ridge density (RD), the number of digital ridges per unit area, varies according to sex, age, and population origin. The main objective of this study was to determine the extent of sexual dimorphism in RD and to set the age at which it appears, in an Amerindian sample from the Mataco-Mataguayo population. The sample studied for this research consisted of 99 males and 110 females, between 6 and 25 years old, which amounts to a total of 2090 fingerprints. Ridge count was carried out on distal radial and distal ulnar and on proximal regions of each finger to explore the RD patterns in order to identify similarities and differences among samples, areas, age groups, and sexes. RD decreased with age and, at all ages, RD was higher on the distal (radial and ulnar) areas, followed by the proximal sides. Females were found to have higher RD than males when older than 12 years, but not when younger. In the radial area, the Mataco-Mataguayo population, in both sexes, presented the RD similar to Spanish samples, but higher than all other populations analysed to date using this method. Variations in RD in the Amerindian population based on sex, age, and topology were confirmed in this work, and it is postulated that these variations are due to developmental differences among individuals and populations. A comparison between the Mataco-Mataguayo and Spanish populations is presented. Copyright © 2011 Elsevier GmbH. All rights reserved.
'Aussie normals': an a priori study to develop clinical chemistry reference intervals in a healthy Australian population.

PubMed

Koerbin, G; Cavanaugh, J A; Potter, J M; Abhayaratna, W P; West, N P; Glasgow, N; Hawkins, C; Armbruster, D; Oakman, C; Hickman, P E

2015-02-01

Development of reference intervals is difficult, time consuming, expensive and beyond the scope of most laboratories. The Aussie Normals study is a direct a priori study to determine reference intervals in healthy Australian adults. All volunteers completed a health and lifestyle questionnaire and exclusion was based on conditions such as pregnancy, diabetes, renal or cardiovascular disease. Up to 91 biochemical analyses were undertaken on a variety of analytical platforms using serum samples collected from 1856 volunteers. We report on our findings for 40 of these analytes and two calculated parameters performed on the Abbott ARCHITECTci8200/ci16200 analysers. Not all samples were analysed for all assays due to volume requirements or assay/instrument availability. Results with elevated interference indices and those deemed unsuitable after clinical evaluation were removed from the database. Reference intervals were partitioned based on the method of Harris and Boyd into three scenarios, combined gender, males and females and age and gender. We have performed a detailed reference interval study on a healthy Australian population considering the effects of sex, age and body mass. These reference intervals may be adapted to other manufacturer's analytical methods using method transference.
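A common nonparametric convention for a reference interval is the central 95% of results from healthy subjects, that is, the 2.5th to 97.5th percentiles. The sketch below shows only that calculation; the abstract does not state the exact percentile method used, the partitioning step (Harris and Boyd) is not reproduced, and the function names and data are hypothetical.

```python
def percentile(sorted_vals, q):
    """Percentile by linear interpolation between order statistics; q in [0, 1]."""
    pos = q * (len(sorted_vals) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    frac = pos - lo
    return sorted_vals[lo] * (1 - frac) + sorted_vals[hi] * frac

def reference_interval(values, coverage=0.95):
    """Central `coverage` interval, e.g. 2.5th to 97.5th percentile for 0.95."""
    s = sorted(values)
    tail = (1 - coverage) / 2
    return percentile(s, tail), percentile(s, 1 - tail)

# Hypothetical analyte results (arbitrary units), one per healthy volunteer.
results = list(range(1, 202))
low, high = reference_interval(results)
print(f"reference interval: {low:.1f} to {high:.1f}")
```

In practice the same function would be applied separately to each partition (for example males and females) once a partitioning rule indicates that separate intervals are warranted.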
Non-parametric cell-based photometric proxies for galaxy morphology: methodology and application to the morphologically defined star formation-stellar mass relation of spiral galaxies in the local universe

NASA Astrophysics Data System (ADS)

Grootes, M. W.; Tuffs, R. J.; Popescu, C. C.; Robotham, A. S. G.; Seibert, M.; Kelvin, L. S.

2014-02-01

We present a non-parametric cell-based method of selecting highly pure and largely complete samples of spiral galaxies using photometric and structural parameters as provided by standard photometric pipelines and simple shape fitting algorithms. The performance of the method is quantified for different parameter combinations, using purely human-based classifications as a benchmark. The discretization of the parameter space allows a markedly superior selection than commonly used proxies relying on a fixed curve or surface of separation. Moreover, we find structural parameters derived using passbands longwards of the g band and linked to older stellar populations, especially the stellar mass surface density μ* and the r-band effective radius re, to perform at least equally well as parameters more traditionally linked to the identification of spirals by means of their young stellar populations, e.g. UV/optical colours. In particular, the distinct bimodality in the parameter μ*, consistent with expectations of different evolutionary paths for spirals and ellipticals, represents an often overlooked yet powerful parameter in differentiating between spiral and non-spiral/elliptical galaxies. We use the cell-based method for the optical parameter set including re in combination with the Sérsic index n and the i-band magnitude to investigate the intrinsic specific star formation rate-stellar mass relation (ψ*-M*) for a morphologically defined volume-limited sample of local Universe spiral galaxies. The relation is found to be well described by ψ_* ∝ M_*^{-0.5} over the range of 10^{9.5} ≤ M_* ≤ 10^{11} M⊙ with a mean interquartile range of 0.4 dex. This is somewhat steeper than previous determinations based on colour-selected samples of star-forming galaxies, primarily due to the inclusion in the sample of red quiescent discs.
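As a worked reading of the quoted power law, taking logarithms gives a straight line of slope -0.5, so the specific star formation rate falls by 0.75 dex across the 1.5 dex mass range quoted. The constant c is an unspecified normalisation introduced here only for illustration.

```latex
\psi_* \propto M_*^{-0.5}
\;\Longrightarrow\;
\log_{10}\psi_* = -0.5\,\log_{10} M_* + c,
\qquad
\frac{\psi_*(10^{11}\,M_\odot)}{\psi_*(10^{9.5}\,M_\odot)}
  = \left(10^{1.5}\right)^{-0.5} = 10^{-0.75} \approx 0.18
```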
Sampling methods to detect and estimate populations of Tyrophagus putrescentiae (Schrank) (Sarcoptiformes: Acaridae) infesting dry-cured hams

USDA-ARS's Scientific Manuscript database

Spatial and temporal dynamics of pest populations is an important aspect of effective pest management. However, absolute sampling of some pest populations such as the ham mite, Tyrophagus putrescentiae (Schrank) (Sarcoptiformes: Acaridae), a serious pest of dry-cured ham, can be difficult. Sampling ...
Evaluation of nitrite contamination in baby foods and infant formulas marketed in Turkey.

PubMed

Erkekoglu, Pinar; Baydar, Terken

2009-05-01

Nitrites are responsible for methemoglobinemia, to which infants younger than 6 months are thought to be the most susceptible population. This study aimed to detect whether there was any nitrite contamination in infant formulas and baby foods marketed in Turkey and to estimate possible toxicological risks in this sensitive physiological period. For this purpose, the samples were randomly collected and divided into four groups: milk-based, cereal-based, vegetable-based, and fruit-based. An easy and reliable spectrophotometric method was used by modifying the Griess method. The average nitrite contamination was found to be 204.07 ± 65.80 µg/g in 42 samples, with a maximum of 1,073 µg/g. According to the results, baby and infant formulas include various nitrite levels; nitrite contamination might come from several sources during manufacturing, and so extreme attention must be given throughout the manufacturing process of food for infants.
Inference for multivariate regression model based on multiply imputed synthetic data generated via posterior predictive sampling

NASA Astrophysics Data System (ADS)

Moura, Ricardo; Sinha, Bimal; Coelho, Carlos A.

2017-06-01

The recent popularity of synthetic data as a Statistical Disclosure Control technique has enabled the development of several methods of generating and analyzing such data, but these almost always rely on asymptotic distributions and are consequently not adequate for small sample datasets. Thus, a likelihood-based exact inference procedure is derived for the matrix of regression coefficients of the multivariate regression model, for multiply imputed synthetic data generated via Posterior Predictive Sampling. Since it is based on exact distributions, this procedure may even be used in small sample datasets. Simulation studies compare the results obtained from the proposed exact inferential procedure with the results obtained from an adaptation of Reiter's combination rule to multiply imputed synthetic datasets, and an application to the 2000 Current Population Survey is discussed.
Day-to-day associations between subjective sleep and affect in regard to future depression in a female population-based sample.

PubMed

de Wild-Hartmann, Jessica A; Wichers, Marieke; van Bemmel, Alex L; Derom, Catherine; Thiery, Evert; Jacobs, Nele; van Os, Jim; Simons, Claudia J P

2013-06-01

Poor sleep is a risk factor for depression, but little is known about the underlying mechanisms. This study aimed to disentangle potential mechanisms by which sleep may be related to depression by zooming down to the 'micro-level' of within-person daily life patterns of subjective sleep and affect using the experience sampling method (ESM). A population-based twin sample consisting of 553 women underwent a 5-day baseline ESM protocol assessing subjective sleep and affect together with four follow-up assessments of depression. Sleep was associated with affect during the next day, especially positive affect. Daytime negative affect was not associated with subsequent night-time sleep. Baseline sleep predicted depressive symptoms across the follow-up period. The subtle, repetitive impact of sleep on affect on a daily basis, rather than the subtle repetitive impact of affect on sleep, may be one of the factors on the pathway to depression in women.
Shell productivity of the large benthic foraminifer Baculogypsina sphaerulata, based on the population dynamics in a tropical reef environment

NASA Astrophysics Data System (ADS)

Fujita, Kazuhiko; Otomaru, Maki; Lopati, Paeniu; Hosono, Takashi; Kayanne, Hajime

2016-03-01

Carbonate production by large benthic foraminifers is sometimes comparable to that of corals and coralline algae, and contributes to sedimentation on reef islands and beaches in the tropical Pacific. Population dynamic data, such as population density and size structure (size-frequency distribution), are vital for an accurate estimation of shell production of foraminifers. However, previous production estimates in tropical environments were based on a limited sampling period with no consideration of seasonality. In addition, no comparisons were made of various estimation methods to determine more accurate estimates. Here we present the annual gross shell production rate of Baculogypsina sphaerulata, estimated based on population dynamics studied over a 2-yr period on an ocean reef flat of Funafuti Atoll (Tuvalu, tropical South Pacific). The population density of B. sphaerulata increased from January to March, when northwest winds predominated and the study site was on the leeward side of reef islands, compared to other seasons when southeast trade winds predominated and the study site was on the windward side. This result suggested that wind-driven flows controlled the population density at the study site. The B. sphaerulata population had a relatively stationary size-frequency distribution throughout the study period, indicating no definite intensive reproductive period in the tropical population. Four methods were applied to estimate the annual gross shell production rates of B. sphaerulata. The production rates estimated by three of the four methods (using monthly biomass, life tables and growth increment rates) were in the order of hundreds of g CaCO3 m-2 yr-1 or cm-3 m-2 yr-1, and the simple method using turnover rates overestimated the values. This study suggests that seasonal surveys should be undertaken of population density and size structure as these can produce more accurate estimates of shell productivity of large benthic foraminifers.
Adélie Penguin Population Diet Monitoring by Analysis of Food DNA in Scats

PubMed Central

Jarman, Simon N.; McInnes, Julie C.; Faux, Cassandra; Polanowski, Andrea M.; Marthick, James; Deagle, Bruce E.; Southwell, Colin; Emmerson, Louise

2013-01-01

The Adélie penguin is the most important animal currently used for ecosystem monitoring in the Southern Ocean. The diet of this species is generally studied by visual analysis of stomach contents; or ratios of isotopes of carbon and nitrogen incorporated into the penguin from its food. There are significant limitations to the information that can be gained from these methods. We evaluated population diet assessment by analysis of food DNA in scats as an alternative method for ecosystem monitoring with Adélie penguins as an indicator species. Scats were collected at four locations, three phases of the breeding cycle, and in four different years. A novel molecular diet assay and bioinformatics pipeline based on nuclear small subunit ribosomal RNA gene (SSU rDNA) sequencing was used to identify prey DNA in 389 scats. Analysis of the twelve population sample sets identified spatial and temporal dietary change in Adélie penguin population diet. Prey diversity was found to be greater than previously thought. Krill, fish, copepods and amphipods were the most important food groups, in general agreement with other Adélie penguin dietary studies based on hard part or stable isotope analysis. However, our DNA analysis estimated that a substantial portion of the diet was gelatinous groups such as jellyfish and comb jellies. A range of other prey not previously identified in the diet of this species were also discovered. The diverse prey identified by this DNA-based scat analysis confirms that the generalist feeding of Adélie penguins makes them a useful indicator species for prey community composition in the coastal zone of the Southern Ocean. Scat collection is a simple and non-invasive field sampling method that allows DNA-based estimation of prey community differences at many temporal and spatial scales and provides significant advantages over alternative diet analysis approaches. PMID:24358158

Nested methylation-specific polymerase chain reaction cancer detection method

DOEpatents

Belinsky, Steven A [Albuquerque, NM]; Palmisano, William A [Edgewood, NM]

2007-05-08

A molecular marker-based method for monitoring and detecting cancer in humans. Aberrant methylation of gene promoters is a marker for cancer risk in humans. A two-stage, or "nested", polymerase chain reaction method is disclosed for detecting methylated DNA sequences at sufficiently high levels of sensitivity to permit cancer screening in biological fluid samples, such as sputum, obtained non-invasively. The method is for detecting the aberrant methylation of the p16 gene, O6-methylguanine-DNA methyltransferase gene, Death-associated protein kinase gene, RAS-associated family 1 gene, or other gene promoters. The method offers a potentially powerful approach to population-based screening for the detection of lung and other cancers.
What is a species? A new universal method to measure differentiation and assess the taxonomic rank of allopatric populations, using continuous variables

PubMed Central

Donegan, Thomas M.

2018-01-01

Existing models for assigning species, subspecies, or no taxonomic rank to populations which are geographically separated from one another were analyzed. This was done by subjecting over 3,000 pairwise comparisons of vocal or biometric data based on birds to a variety of statistical tests that have been proposed as measures of differentiation. One current model which aims to test diagnosability (Isler et al. 1998) is highly conservative, applying a hard cut-off, which excludes from consideration differentiation below diagnosis. It also includes non-overlap as a requirement, a measure which penalizes increases to sample size. The “species scoring” model of Tobias et al. (2010) involves less drastic cut-offs, but unlike Isler et al. (1998), does not control adequately for sample size and attributes scores in many cases to differentiation which is not statistically significant. Four different models of assessing effect sizes were analyzed: using both pooled and unpooled standard deviations and controlling for sample size using t-distributions or omitting to do so. Pooled standard deviations produced more conservative effect sizes when uncontrolled for sample size but less conservative effect sizes when so controlled. Pooled models require assumptions to be made that are typically elusive or unsupported for taxonomic studies. Modifications to improve these frameworks are proposed, including: (i) introducing statistical significance as a gateway to attributing any weighting to findings of differentiation; (ii) abandoning non-overlap as a test; (iii) recalibrating Tobias et al. (2010) scores based on effect sizes controlled for sample size using t-distributions. A new universal method is proposed for measuring differentiation in taxonomy using continuous variables and a formula is proposed for ranking allopatric populations. This is based first on calculating effect sizes using unpooled standard deviations, controlled for sample size using t-distributions, for a series of different variables. All non-significant results are excluded by scoring them as zero. Distance between any two populations is calculated using Euclidean summation of non-zeroed effect size scores. If the score of an allopatric pair exceeds that of a related sympatric pair, then the allopatric population can be ranked as species and, if not, then at most subspecies rank should be assigned. A spreadsheet has been programmed and is being made available which allows this and other tests of differentiation and rank studied in this paper to be rapidly analyzed. PMID:29780266
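The scoring steps named in the abstract (per-variable effect sizes with unpooled standard deviations, zeroing of non-significant results, Euclidean summation) can be sketched roughly as follows. This is a simplified illustration and not the paper's formula: the sample-size control via t-distributions is not reproduced, significance is approximated by comparing a Welch t statistic with a caller-supplied critical value `t_crit`, and all names and measurements are hypothetical.

```python
import math
from statistics import mean, stdev

def effect_size(a, b, t_crit=2.0):
    """Unpooled effect size for one variable, or 0.0 if not significant.

    t_crit is an assumed critical value; a real analysis would look it up
    from the t-distribution for the relevant degrees of freedom.
    """
    m1, m2 = mean(a), mean(b)
    s1, s2 = stdev(a), stdev(b)
    welch_t = abs(m1 - m2) / math.sqrt(s1 ** 2 / len(a) + s2 ** 2 / len(b))
    if welch_t < t_crit:
        return 0.0  # non-significant differences are scored as zero
    # unpooled: average the two variances without weighting by sample size
    return abs(m1 - m2) / math.sqrt((s1 ** 2 + s2 ** 2) / 2)

def differentiation_score(pop_a, pop_b, t_crit=2.0):
    """Euclidean summation of the per-variable effect sizes."""
    return math.sqrt(sum(effect_size(pop_a[v], pop_b[v], t_crit) ** 2 for v in pop_a))

# Hypothetical measurements for two allopatric populations.
pop_a = {"wing_mm": [70, 71, 72, 73, 74], "song_hz": [3000, 3100, 3050, 2950, 3020]}
pop_b = {"wing_mm": [80, 81, 82, 83, 84], "song_hz": [3010, 3090, 3040, 2960, 3030]}

score = differentiation_score(pop_a, pop_b)
print(f"differentiation score = {score:.2f}")
```

Under the ranking rule in the abstract, this score would then be compared with the score obtained the same way for a related sympatric pair.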
Authentication of Closely Related Fish and Derived Fish Products Using Tandem Mass Spectrometry and Spectral Library Matching.

PubMed

Nessen, Merel A; van der Zwaan, Dennis J; Grevers, Sander; Dalebout, Hans; Staats, Martijn; Kok, Esther; Palmblad, Magnus

2016-05-11

Proteomics methodology has seen increased application in food authentication, including tandem mass spectrometry of targeted species-specific peptides in raw, processed, or mixed food products. We have previously described an alternative principle that uses untargeted data acquisition and spectral library matching, essentially spectral counting, to compare and identify samples without the need for genomic sequence information in food species populations. Here, we present an interlaboratory comparison demonstrating how a method based on this principle performs in a realistic context. We also increasingly challenge the method by using data from different types of mass spectrometers, by trying to distinguish closely related and commercially important flatfish, and by analyzing heavily contaminated samples. The method was found to be robust in different laboratories, and 94-97% of the analyzed samples were correctly identified, including all processed and contaminated samples.
A Simple Sampling Method for Estimating the Accuracy of Large Scale Record Linkage Projects.

PubMed

Boyd, James H; Guiver, Tenniel; Randall, Sean M; Ferrante, Anna M; Semmens, James B; Anderson, Phil; Dickinson, Teresa

2016-05-17

Record linkage techniques allow different data collections to be brought together to provide a wider picture of the health status of individuals. Ensuring high linkage quality is important to guarantee the quality and integrity of research. Current methods for measuring linkage quality typically focus on precision (the proportion of links that are correct), given the difficulty of measuring the proportion of false negatives. The aim of this work is to introduce and evaluate a sampling-based method to estimate both precision and recall following record linkage. In the sampling-based method, record-pairs from each threshold (including those below the identified cut-off for acceptance) are sampled and clerically reviewed. These results are then applied to the entire set of record-pairs, providing estimates of false positives and false negatives. This method was evaluated on a synthetically generated dataset, where the true match status (which records belonged to the same person) was known. The sampled estimates of linkage quality were relatively close to actual linkage quality metrics calculated for the whole synthetic dataset. The precision and recall measures for seven reviewers were very consistent, with little variation in the clerical assessment results (overall agreement using the Fleiss' kappa statistic was 0.601). This method presents as a possible means of accurately estimating matching quality and refining linkages in population-level linkage studies. The sampling approach is especially important for large linkage projects where the number of record pairs produced may be very large, often running into the millions.
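The extrapolation step described above, from clerically reviewed samples to the full set of record-pairs, can be sketched as follows. The band layout, field names and counts are hypothetical; the sketch assumes each score band is sampled at random and that the reviewed match rate applies to the whole band.

```python
def estimate_linkage_quality(bands):
    """Estimate precision and recall from per-band clerical review samples.

    Each band is a dict with:
      total    - number of record-pairs in the band
      sampled  - number of pairs clerically reviewed
      matches  - reviewed pairs judged to be true matches
      accepted - True if the band lies above the acceptance cut-off
    """
    tp = fp = fn = 0.0
    for b in bands:
        est_matches = b["total"] * b["matches"] / b["sampled"]
        if b["accepted"]:
            tp += est_matches               # accepted and truly matching
            fp += b["total"] - est_matches  # accepted but not matching
        else:
            fn += est_matches               # true matches left below the cut-off
    return tp / (tp + fp), tp / (tp + fn)

# Hypothetical review results for four score bands.
bands = [
    {"total": 1000, "sampled": 100, "matches": 99, "accepted": True},
    {"total": 500, "sampled": 100, "matches": 90, "accepted": True},
    {"total": 400, "sampled": 100, "matches": 20, "accepted": False},
    {"total": 2000, "sampled": 100, "matches": 1, "accepted": False},
]

precision, recall = estimate_linkage_quality(bands)
print(f"precision = {precision:.3f}, recall = {recall:.3f}")
```

Sampling the bands below the cut-off is what makes the recall estimate possible, since those are the only places false negatives can be observed.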
Modeling misidentification errors that result from use of genetic tags in capture-recapture studies

USGS Publications Warehouse

Yoshizaki, J.; Brownie, C.; Pollock, K.H.; Link, W.A.

2011-01-01

Misidentification of animals is potentially important when naturally existing features (natural tags) such as DNA fingerprints (genetic tags) are used to identify individual animals. For example, when misidentification leads to multiple identities being assigned to an animal, traditional estimators tend to overestimate population size. Accounting for misidentification in capture-recapture models requires detailed understanding of the mechanism. Using genetic tags as an example, we outline a framework for modeling the effect of misidentification in closed population studies when individual identification is based on natural tags that are consistent over time (non-evolving natural tags). We first assume a single sample is obtained per animal for each capture event, and then generalize to the case where multiple samples (such as hair or scat samples) are collected per animal per capture occasion. We introduce methods for estimating population size and, using a simulation study, we show that our new estimators perform well for cases with moderately high capture probabilities or high misidentification rates. In contrast, conventional estimators can seriously overestimate population size when errors due to misidentification are ignored. © 2009 Springer Science+Business Media, LLC.
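Why ignored misidentification inflates a conventional estimate can be seen with the classical two-occasion Lincoln-Petersen estimator, used here purely as an illustration (it is not one of the new estimators introduced in the paper). The counts are hypothetical.

```python
def lincoln_petersen(n1, n2, m2):
    """Closed-population size estimate from two capture occasions.

    n1: animals identified on occasion 1
    n2: animals identified on occasion 2
    m2: animals on occasion 2 recognised as recaptures
    """
    return n1 * n2 / m2

# Hypothetical study: 100 animals sampled on each occasion, 40 true recaptures.
n_without_error = lincoln_petersen(100, 100, 40)

# If 4 of the 40 recaptures are genotyped incorrectly, they look like new
# animals, so only 36 recaptures are recorded and the estimate rises.
n_with_error = lincoln_petersen(100, 100, 36)

print(f"no misidentification: {n_without_error:.0f}")
print(f"with misidentification: {n_with_error:.0f}")
```

Each misread tag removes a recapture from the denominator, which is the overestimation mechanism the abstract describes.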
Lipid Vesicle Shape Analysis from Populations Using Light Video Microscopy and Computer Vision

PubMed Central

Zupanc, Jernej; Drašler, Barbara; Boljte, Sabina; Kralj-Iglič, Veronika; Iglič, Aleš; Erdogmus, Deniz; Drobne, Damjana

2014-01-01

We present a method for giant lipid vesicle shape analysis that combines manually guided large-scale video microscopy and computer vision algorithms to enable analyzing vesicle populations. The method retains the benefits of light microscopy and enables non-destructive analysis of vesicles from suspensions containing up to several thousands of lipid vesicles (1–50 µm in diameter). For each sample, image analysis was employed to extract data on vesicle quantity and size distributions of their projected diameters and isoperimetric quotients (measure of contour roundness). This process enables a comparison of samples from the same population over time, or the comparison of a treated population to a control. Although vesicles in suspensions are heterogeneous in sizes and shapes and have distinctively non-homogeneous distribution throughout the suspension, this method allows for the capture and analysis of repeatable vesicle samples that are representative of the population inspected. PMID:25426933
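The isoperimetric quotient is conventionally defined as Q = 4πA/P², where A is the enclosed area and P the perimeter, so that a circle scores 1 and less round contours score lower. The sketch below computes it for a contour given as polygon vertices; whether the authors computed it in exactly this way is an assumption, and the contours are synthetic.

```python
import math

def isoperimetric_quotient(contour):
    """Q = 4*pi*A / P**2 for a closed polygonal contour [(x, y), ...]."""
    n = len(contour)
    twice_area = 0.0
    perimeter = 0.0
    for i in range(n):
        x0, y0 = contour[i]
        x1, y1 = contour[(i + 1) % n]  # wrap around to close the contour
        twice_area += x0 * y1 - x1 * y0  # shoelace formula term
        perimeter += math.hypot(x1 - x0, y1 - y0)
    area = abs(twice_area) / 2
    return 4 * math.pi * area / perimeter ** 2

# Synthetic contours: a unit square and a 360-vertex approximation of a circle.
square = [(0, 0), (1, 0), (1, 1), (0, 1)]
circle = [(math.cos(math.radians(a)), math.sin(math.radians(a))) for a in range(360)]

q_square = isoperimetric_quotient(square)
q_circle = isoperimetric_quotient(circle)
print(f"square Q = {q_square:.4f}, near-circle Q = {q_circle:.4f}")
```

Applied to every detected vesicle contour in a sample, this yields the kind of per-population roundness distribution that can be compared between time points or treatments.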
Detecting concerted demographic response across community assemblages using hierarchical approximate Bayesian computation.

PubMed

Chan, Yvonne L; Schanzenbach, David; Hickerson, Michael J

2014-09-01

Methods that integrate population-level sampling from multiple taxa into a single community-level analysis are an essential addition to the comparative phylogeographic toolkit. Detecting how species within communities have demographically tracked each other in space and time is important for understanding the effects of future climate and landscape changes and the resulting acceleration of extinctions, biological invasions, and potential surges in adaptive evolution. Here, we present a statistical framework for such an analysis based on hierarchical approximate Bayesian computation (hABC) with the goal of detecting concerted demographic histories across an ecological assemblage. Our method combines population genetic data sets from multiple taxa into a single analysis to estimate: 1) the proportion of a community sample that demographically expanded in a temporally clustered pulse and 2) when the pulse occurred. To validate the accuracy and utility of this new approach, we use simulation cross-validation experiments and subsequently analyze an empirical data set of 32 avian populations from Australia that are hypothesized to have expanded from smaller refugia populations in the late Pleistocene. The method can accommodate data set heterogeneity such as variability in effective population size, mutation rates, and sample sizes across species and exploits the statistical strength from the simultaneous analysis of multiple species. This hABC framework used in a multitaxa demographic context can increase our understanding of the impact of historical climate change by determining what proportion of the community responded in concert or independently and can be used with a wide variety of comparative phylogeographic data sets as biota-wide DNA barcoding data sets accumulate. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Surveying immigrants without sampling frames - evaluating the success of alternative field methods.

PubMed

Reichel, David; Morales, Laura

2017-01-01

This paper evaluates the sampling methods of an international survey, the Immigrant Citizens Survey, which aimed at surveying immigrants from outside the European Union (EU) in 15 cities in seven EU countries. In five countries, no sample frame was available for the target population. Consequently, alternative ways to obtain a representative sample had to be found. In three countries 'location sampling' was employed, while in two countries traditional methods were used with adaptations to reach the target population. The paper assesses the main methodological challenges of carrying out a survey among a group of immigrants for whom no sampling frame exists. The samples of the survey in these five countries are compared to results of official statistics in order to assess the accuracy of the samples obtained through the different sampling methods. It can be shown that alternative sampling methods can provide meaningful results in terms of core demographic characteristics although some estimates differ to some extent from the census results.
IMa2p - Parallel MCMC and inference of ancient demography under the Isolation with Migration (IM) model

PubMed Central

Sethuraman, Arun; Hey, Jody

2015-01-01

IMa2 and related programs are used to study the divergence of closely related species and of populations within species. These methods are based on the sampling of genealogies using MCMC, and they can proceed quite slowly for larger data sets. We describe a parallel implementation, called IMa2p, that provides a nearly linear increase in genealogy sampling rate with the number of processors in use. IMa2p is written in OpenMPI and C++, and scales well for demographic analyses of a large number of loci and populations, which are difficult to study using the serial version of the program. PMID:26059786
Sampling in health geography: reconciling geographical objectives and probabilistic methods. An example of a health survey in Vientiane (Lao PDR).

PubMed

Vallée, Julie; Souris, Marc; Fournet, Florence; Bochaton, Audrey; Mobillion, Virginie; Peyronnie, Karine; Salem, Gérard

2007-06-01

Geographical objectives and probabilistic methods are difficult to reconcile in a unique health survey. Probabilistic methods focus on individuals to provide estimates of a variable's prevalence with a certain precision, while geographical approaches emphasise the selection of specific areas to study interactions between spatial characteristics and health outcomes. A sample selected from a small number of specific areas creates statistical challenges: the observations are not independent at the local level, and this results in poor statistical validity at the global level. Therefore, it is difficult to construct a sample that is appropriate for both geographical and probability methods. We used a two-stage selection procedure with a first non-random stage of selection of clusters. Instead of randomly selecting clusters, we deliberately chose a group of clusters, which as a whole would contain all the variation in health measures in the population. As there was no health information available before the survey, we selected a priori determinants that can influence the spatial homogeneity of the health characteristics. This method yields a distribution of variables in the sample that closely resembles that in the overall population, something that cannot be guaranteed with randomly-selected clusters, especially if the number of selected clusters is small. In this way, we were able to survey specific areas while minimising design effects and maximising statistical precision. We applied this strategy in a health survey carried out in Vientiane, Lao People's Democratic Republic. We selected well-known health determinants with unequal spatial distribution within the city: nationality and literacy. We deliberately selected a combination of clusters whose distribution of nationality and literacy is similar to the distribution in the general population. 
This paper describes the conceptual reasoning behind the construction of the survey sample and shows that it can be advantageous to choose clusters using reasoned hypotheses, based on both probability and geographical approaches, in contrast to a conventional, random cluster selection strategy.
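The idea of deliberately choosing a combination of clusters whose pooled characteristics resemble the whole population can be sketched as a small search. The exhaustive search, the absolute-difference criterion, the field names and the cluster figures are all assumptions for illustration; the abstract does not state how the combination was actually chosen.

```python
from itertools import combinations

def pooled_share(clusters, key):
    """Share of people with characteristic `key` across the given clusters."""
    return sum(c[key] for c in clusters) / sum(c["pop"] for c in clusters)

def best_cluster_set(clusters, k, keys):
    """Pick the k clusters whose pooled shares are closest to the population's."""
    target = {key: pooled_share(clusters, key) for key in keys}

    def gap(subset):
        return sum(abs(pooled_share(subset, key) - target[key]) for key in keys)

    return min(combinations(clusters, k), key=gap)

# Hypothetical clusters: population, literate residents, foreign nationals.
clusters = [
    {"name": "A", "pop": 100, "literate": 90, "foreign": 5},
    {"name": "B", "pop": 100, "literate": 50, "foreign": 25},
    {"name": "C", "pop": 100, "literate": 80, "foreign": 10},
    {"name": "D", "pop": 100, "literate": 60, "foreign": 30},
    {"name": "E", "pop": 100, "literate": 70, "foreign": 30},
]

chosen = best_cluster_set(clusters, k=2, keys=["literate", "foreign"])
chosen_names = [c["name"] for c in chosen]
print(chosen_names)
```

With many clusters an exhaustive search becomes impractical, and a reasoned shortlist of candidate areas, as the paper describes, would take its place.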
Rapid diagnosis of common deletional α-thalassemia in the Chinese population by qPCR based on identical primer homologous fragments.

PubMed

Long, Ju

2016-05-01

In China, -(SEA), -α(3.7) and -α(4.2) are common deletional α-thalassemia alleles. Gap-PCR is the currently used detection method for these alleles; its disadvantages include a time-consuming procedure and an increased potential for PCR product contamination. Therefore, this detection method needs to be improved. Based on identical-primer homologous fragments, a qPCR system was developed for deletional α-thalassemia genotyping, which was composed of a group of quantitatively-related primers and their corresponding probes plus two groups of qualitatively-related primers and their corresponding probes. In order to verify the accuracy of the qPCR system, known genotype samples and random samples were employed. The standard curve result demonstrated that the designed primers and probes all yielded good amplification efficiency. In the tests of known genotype samples and random samples, sample detection results were consistent with verification results. The αα, -(SEA), -α(3.7) and -α(4.2) alleles were accurately detected by this method. In addition, this method offers a wider detection range, greater speed and reduced PCR product contamination risk when compared with current common gap-PCR detection reagents. Copyright © 2016 Elsevier B.V. All rights reserved.
Prevalence and distribution of glucose-6-phosphate dehydrogenase (G6PD) variants in Thai and Burmese populations in malaria endemic areas of Thailand

PubMed Central

2011-01-01

Background G6PD deficiency is common in malaria endemic regions and is estimated to affect more than 400 million people worldwide. Treatment of malaria patients with the anti-malarial drug primaquine or other 8-aminoquinolines may be associated with potential haemolytic anaemia. The aim of the present study was to investigate the prevalence of G6PD variants in Thai population who resided in malaria endemic areas (western, northern, north-eastern, southern, eastern and central regions) of Thailand, as well as the Burmese population who resided in areas along the Thai-Myanmar border. Methods The ten common G6PD variants were investigated in dried blood spot samples collected from 317 Thai (84 males, 233 females) and 183 Burmese (11 males, 172 females) populations residing in malaria endemic areas of Thailand using PCR-RFLP method. Results Seven and four G6PD variants were observed in samples collected from the Thai and Burmese populations, with prevalence of 6.6% (21/317) and 14.2% (26/183), respectively. Almost all (96.2%) of G6PD mutation samples collected from Burmese population carried G6PD Mahidol variant; only one sample (3.8%) carried G6PD Kaiping variant. For the Thai population, G6PD Mahidol (8/21: 38.1%) was the most common variant detected, followed by G6PD Viangchan (4/21: 19.0%), G6PD Chinese 4 (3/21: 14.3%), G6PD Canton (2/21: 9.5%), G6PD Union (2/21: 9.5%), G6PD Kaiping (1/21: 4.8%), and G6PD Gaohe (1/21: 4.8%). No G6PD Chinese 3, Chinese 5 and Coimbra variants were found. With this limited sample size, there appeared to be variation in G6PD mutation variants in samples obtained from Thai population in different regions particularly in the western region. Conclusions Results indicate difference in the prevalence and distribution of G6PD gene variants among the Thai and Burmese populations in different malaria endemic areas. 
Dosage regimen of primaquine for treatment of both Plasmodium falciparum and Plasmodium vivax malaria may need to be optimized, based on endemic areas with supporting data on G6PD variants. Larger sample size from different malaria endemic areas is required to obtain accurate genetic mapping of G6PD variants in Burmese and Thai population residing in malaria endemic areas of Thailand. PMID:22171972
Inferring modes of colonization for pest species using heterozygosity comparisons and a shared-allele test.

PubMed Central

Sved, J A; Yu, H; Dominiak, B; Gilchrist, A S

2003-01-01

Long-range dispersal of a species may involve either a single long-distance movement from a core population or spreading via unobserved intermediate populations. Where the new populations originate as small propagules, genetic drift may be extreme and gene frequency or assignment methods may not prove useful in determining the relation between the core population and outbreak samples. We describe computationally simple resampling methods for use in this situation to distinguish between the different modes of dispersal. First, estimates of heterozygosity can be used to test for direct sampling from the core population and to estimate the effective size of intermediate populations. Second, a test of sharing of alleles, particularly rare alleles, can show whether outbreaks are related to each other rather than arriving as independent samples from the core population. The shared-allele statistic also serves as a genetic distance measure that is appropriate for small samples. These methods were applied to data on a fruit fly pest species, Bactrocera tryoni, which is quarantined from some horticultural areas in Australia. We concluded that the outbreaks in the quarantine zone came from a heterogeneous set of genetically differentiated populations, possibly ones that overwinter in the vicinity of the quarantine zone. PMID:12618417
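The heterozygosity comparison can be sketched as a simple resampling test. The abstract does not give the authors' exact procedure, so this is only an assumed version: gene copies are redrawn from the core population's allele pool, and the heterozygosity of each redrawn sample is compared with that of the outbreak sample. The allele labels and counts are hypothetical.

```python
import random


def expected_heterozygosity(alleles):
    """Unbiased gene diversity, n/(n-1) * (1 - sum of squared allele frequencies)."""
    n = len(alleles)
    counts = {}
    for allele in alleles:
        counts[allele] = counts.get(allele, 0) + 1
    return n / (n - 1) * (1.0 - sum((c / n) ** 2 for c in counts.values()))


def resampling_p_value(core, outbreak, n_resamples=2000, seed=1):
    """Share of core resamples whose heterozygosity is as low as the outbreak's."""
    rng = random.Random(seed)
    observed = expected_heterozygosity(outbreak)
    hits = 0
    for _ in range(n_resamples):
        draw = [rng.choice(core) for _ in outbreak]  # same size as the outbreak sample
        if expected_heterozygosity(draw) <= observed:
            hits += 1
    return (hits + 1) / (n_resamples + 1)


# Hypothetical single-locus data: gene copies labelled by allele.
core = ["a"] * 40 + ["b"] * 30 + ["c"] * 20 + ["d"] * 10
fixed_outbreak = ["a"] * 12                # no variation left
diverse_outbreak = list("abcdabcabaab")    # variation similar to the core

p_fixed = resampling_p_value(core, fixed_outbreak)
p_diverse = resampling_p_value(core, diverse_outbreak)
print(p_fixed, p_diverse)
```

A small p-value suggests the outbreak sample is less variable than a direct draw from the core population would be, which is the pattern expected if it passed through a small intermediate population.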
Possible overestimation of surface disinfection efficiency by assessment methods based on liquid sampling procedures as demonstrated by in situ quantification of spore viability.

PubMed

Grand, I; Bellon-Fontaine, M-N; Herry, J-M; Hilaire, D; Moriconi, F-X; Naïtali, M

2011-09-01

The standard test methods used to assess the efficiency of a disinfectant applied to surfaces are often based on counting the microbial survivors sampled in a liquid, but total cell removal from surfaces is seldom achieved. One might therefore wonder whether evaluations of microbial survivors in liquid-sampled cells are representative of the levels of survivors in whole populations. The present study was thus designed to determine the "damaged/undamaged" status induced by a peracetic acid disinfection for Bacillus atrophaeus spores deposited on glass coupons directly on this substrate and to compare it to the status of spores collected in liquid by a sampling procedure. The method utilized to assess the viability of both surface-associated and liquid-sampled spores included fluorescence labeling with a combination of Syto 61 and Chemchrome V6 dyes and quantifications by analyzing the images acquired by confocal laser scanning microscopy. The principal result of the study was that the viability of spores sampled in the liquid was found to be poorer than that of surface-associated spores. For example, after 2 min of peracetic acid disinfection, less than 17% ± 5% of viable cells were detected among liquid-sampled cells compared to 79% ± 5% or 47% ± 4%, respectively, when the viability was evaluated on the surface after or without the sampling procedure. Moreover, assessments of the survivors collected in the liquid phase, evaluated using the microscopic method and standard plate counts, were well correlated. Evaluations based on the determination of survivors among the liquid-sampled cells can thus overestimate the efficiency of surface disinfection procedures.
Possible Overestimation of Surface Disinfection Efficiency by Assessment Methods Based on Liquid Sampling Procedures as Demonstrated by In Situ Quantification of Spore Viability

PubMed Central

Grand, I.; Bellon-Fontaine, M.-N.; Herry, J.-M.; Hilaire, D.; Moriconi, F.-X.; Naïtali, M.

2011-01-01

The standard test methods used to assess the efficiency of a disinfectant applied to surfaces are often based on counting the microbial survivors sampled in a liquid, but total cell removal from surfaces is seldom achieved. One might therefore wonder whether evaluations of microbial survivors in liquid-sampled cells are representative of the levels of survivors in whole populations. The present study was thus designed to determine the “damaged/undamaged” status induced by a peracetic acid disinfection for Bacillus atrophaeus spores deposited on glass coupons directly on this substrate and to compare it to the status of spores collected in liquid by a sampling procedure. The method utilized to assess the viability of both surface-associated and liquid-sampled spores included fluorescence labeling with a combination of Syto 61 and Chemchrome V6 dyes and quantifications by analyzing the images acquired by confocal laser scanning microscopy. The principal result of the study was that the viability of spores sampled in the liquid was found to be poorer than that of surface-associated spores. For example, after 2 min of peracetic acid disinfection, less than 17% ± 5% of viable cells were detected among liquid-sampled cells compared to 79% ± 5% or 47% ± 4%, respectively, when the viability was evaluated on the surface after or without the sampling procedure. Moreover, assessments of the survivors collected in the liquid phase, evaluated using the microscopic method and standard plate counts, were well correlated. Evaluations based on the determination of survivors among the liquid-sampled cells can thus overestimate the efficiency of surface disinfection procedures. PMID:21742922
Comparing sexual minority cancer survivors recruited through a cancer registry to convenience methods of recruitment.

PubMed

Boehmer, Ulrike; Clark, Melissa A; Timm, Alison; Glickman, Mark; Sullivan, Mairead

2011-01-01

Sexual minority women, defined as having a lesbian or bisexual identity or reporting a preference for a female partner, are not considered by cancer surveillance. This study assesses the representativeness of sexual minority breast cancer survivors, defined as having a lesbian or bisexual identity or reporting a preference for a female partner, who were recruited into a convenience sample compared with a population-based registry sample of sexual minority breast cancer survivors. Long-term survivors of non-metastatic breast cancer who self-reported as sexual minority were recruited from a cancer registry and subsequently from the community using convenience recruitment methods. Sexual minority breast cancer survivors who screened eligible participated in a telephone survey about their quality of life and factors associated therewith. Participants in the convenience sample were similar to the registry-based sample with respect to adjustment to cancer, physical health, trust in physician, coping, social support, and sexual minority experiences. Compared with the convenience sample, breast cancer survivors in the registry sample were more likely married, more educated, diagnosed more recently, at an earlier stage of cancer, and more likely treated with breast-conserving surgery; they differed on adjuvant therapies. Because sexual minority breast cancer survivors who volunteered for the community-based sample shared most characteristics of the sample recruited from the cancer registry, we concluded that the community sample had comparable representational quality. In the absence of cancer surveillance of sexual minorities, thoughtful convenience recruitment methods provide good representational quality convenience samples. Copyright © 2011 Jacobs Institute of Women's Health. Published by Elsevier Inc. All rights reserved.
The German version of the Material Values Scale

PubMed Central

Müller, Astrid; Smits, Dirk J. M.; Claes, Laurence; Gefeller, Olaf; Hinz, Andreas; de Zwaan, Martina

2013-01-01

Aim: The Material Values Scale is an instrument to assess beliefs about the importance to own material things. This instrument originally consists of the three subscales: ‘centrality’, ‘success’, and ‘happiness’. The present study investigated the psychometric properties of the German version of the MVS (G-MVS). Method: A population-based sample of 2,295 adult Germans completed the questionnaire in order to investigate the factorial structure. To test construct validity, additional samples were gathered among patients with compulsive buying (N=52) and medical students (N=347) who also answered the Compulsive Buying Scale (CBS) and the Patient Health Questionnaire depression scale (PHQ-8). Results: In the German population-based sample we could not confirm the 3-factor model but rather suggest a 2-factor solution with a first collapsed factor ‘centrality/success’, and the second factor ‘happiness’. Patients with compulsive buying showed the highest scores on the G-MVS. While G-MVS scores among compulsive buyers and medical students were significantly related to compulsive buying scores, the correlation between the G-MVS and the depression measure appeared substantially lower. We did not find any gender differences regarding materialism, neither in the population-based sample nor in the students’ or compulsive buyers’ samples. However, age was negatively related to G-MVS scores. Conclusion: Confirmatory factor analyses suggest a 2-factor model of the G-MVS. Overall, the results indicate the use of the G-MVS as a brief, psychometrically sound, and potentially valid measure for the assessment of material values. PMID:23802017
Handling nonresponse in surveys: analytic corrections compared with converting nonresponders.

PubMed

Jenkins, Paul; Earle-Richardson, Giulia; Burdick, Patrick; May, John

2008-02-01

A large health survey was combined with a simulation study to contrast the reduction in bias achieved by double sampling versus two weighting methods based on propensity scores. The survey used a census of one New York county and double sampling in six others. Propensity scores were modeled as a logistic function of demographic variables and were used in conjunction with a random uniform variate to simulate response in the census. These data were used to estimate the prevalence of chronic disease in a population whose parameters were defined as values from the census. Significant (p < 0.0001) predictors in the logistic function included multiple (vs. single) occupancy (odds ratio (OR) = 1.3), bank card ownership (OR = 2.1), gender (OR = 1.5), home ownership (OR = 1.3), head of household's age (OR = 1.4), and income >$18,000 (OR = 0.8). The model likelihood ratio chi-square was significant (p < 0.0001), with the area under the receiver operating characteristic curve = 0.59. Double-sampling estimates were marginally closer to population values than those from either weighting method. However, the variance was also greater (p < 0.01). The reduction in bias for point estimation from double sampling may be more than offset by the increased variance associated with this method.
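The simulation step, in which a logistic propensity is compared with a uniform random variate to decide who responds, can be sketched as follows, together with one inverse-propensity weighting estimator. Only the two odds ratios (1.3 for home ownership, 2.1 for bank card ownership) come from the abstract; the intercept, the covariate frequencies, and the disease rates are hypothetical. The weights use the true propensities, whereas a real analysis would have to estimate them.

```python
import math
import random


def response_propensity(owns_home, has_bank_card, intercept=-0.5):
    """Logistic response propensity; the intercept is an assumed value."""
    z = intercept + math.log(1.3) * owns_home + math.log(2.1) * has_bank_card
    return 1.0 / (1.0 + math.exp(-z))


def simulate(n=100_000, seed=0):
    """Return (census prevalence, unweighted responder estimate, weighted estimate)."""
    rng = random.Random(seed)
    census_cases = 0
    responders = responder_cases = 0
    weighted_cases = weight_sum = 0.0
    for _ in range(n):
        owns_home = rng.random() < 0.6
        has_card = rng.random() < 0.5
        # Hypothetical link between the covariate and chronic disease.
        diseased = rng.random() < (0.10 if has_card else 0.40)
        census_cases += diseased
        p = response_propensity(owns_home, has_card)
        if rng.random() < p:  # uniform variate below the propensity: a response
            responders += 1
            responder_cases += diseased
            weighted_cases += diseased / p
            weight_sum += 1.0 / p
    return census_cases / n, responder_cases / responders, weighted_cases / weight_sum


truth, naive, weighted = simulate()
print(f"census {truth:.3f}  unweighted {naive:.3f}  weighted {weighted:.3f}")
```

Because people with a bank card respond more often and are less often diseased in this toy setup, the unweighted responder estimate is biased downward, and the weighting removes most of that bias.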
Shared Genetic Influences on Negative Emotionality and Major Depression/Conduct Disorder Comorbidity

ERIC Educational Resources Information Center

Tackett, Jennifer L.; Waldman, Irwin D.; Van Hulle, Carol A.; Lahey, Benjamin B.

2011-01-01

Objective: To investigate whether genetic contributions to major depressive disorder and conduct disorder comorbidity are shared with genetic influences on negative emotionality. Method: Primary caregivers of 2,022 same- and opposite-sex twin pairs 6 to 18 years of age comprised a population-based sample. Participants were randomly selected across…

Investigating the Role of Salivary Cortisol on Vocal Symptoms

ERIC Educational Resources Information Center

Holmqvist-Jämsén, Sofia; Johansson, Ada; Santtila, Pekka; Westberg, Lars; von der Pahlen, Bettina; Simberg, Susanna

2017-01-01

Purpose: We investigated whether participants who reported more often occurring vocal symptoms showed higher salivary cortisol levels and if such possible associations were different for men and women. Method: The participants (N = 170; men n = 49, women n = 121) consisted of a population-based sample of Finnish twins born between 1961 and 1989.…
Nonsuicidal Self-Injury in a College Population: General Trends and Sex Differences

ERIC Educational Resources Information Center

Whitlock, Janis; Muehlenkamp, Jennifer; Purington, Amanda; Eckenrode, John; Barreira, Paul; Abrams, Gina Baral; Marchell, Tim; Kress, Victoria; Girard, Kristine; Chin, Calvin; Knox, Kerry

2011-01-01

Objective: To describe basic nonsuicidal self-injury (NSSI) characteristics and to explore sex differences. Methods: A random sample from 8 universities were invited to participate in a Web-based survey in 2006-2007; 38.9% (n = 14,372) participated. Analysis assessed sex differences in NSSI prevalence, practices, severity, perceived dependency,…
New parsimonious simulation methods and tools to assess future food and environmental security of farm populations

PubMed Central

Antle, John M.; Stoorvogel, Jetse J.; Valdivia, Roberto O.

2014-01-01

This article presents conceptual and empirical foundations for new parsimonious simulation models that are being used to assess future food and environmental security of farm populations. The conceptual framework integrates key features of the biophysical and economic processes on which the farming systems are based. The approach represents a methodological advance by coupling important behavioural processes, for example, self-selection in adaptive responses to technological and environmental change, with aggregate processes, such as changes in market supply and demand conditions or environmental conditions as climate. Suitable biophysical and economic data are a critical limiting factor in modelling these complex systems, particularly for the characterization of out-of-sample counterfactuals in ex ante analyses. Parsimonious, population-based simulation methods are described that exploit available observational, experimental, modelled and expert data. The analysis makes use of a new scenario design concept called representative agricultural pathways. A case study illustrates how these methods can be used to assess food and environmental security. The concluding section addresses generalizations of parametric forms and linkages of regional models to global models. PMID:24535388
New parsimonious simulation methods and tools to assess future food and environmental security of farm populations.

PubMed

Antle, John M; Stoorvogel, Jetse J; Valdivia, Roberto O

2014-04-05

This article presents conceptual and empirical foundations for new parsimonious simulation models that are being used to assess future food and environmental security of farm populations. The conceptual framework integrates key features of the biophysical and economic processes on which the farming systems are based. The approach represents a methodological advance by coupling important behavioural processes, for example, self-selection in adaptive responses to technological and environmental change, with aggregate processes, such as changes in market supply and demand conditions or environmental conditions as climate. Suitable biophysical and economic data are a critical limiting factor in modelling these complex systems, particularly for the characterization of out-of-sample counterfactuals in ex ante analyses. Parsimonious, population-based simulation methods are described that exploit available observational, experimental, modelled and expert data. The analysis makes use of a new scenario design concept called representative agricultural pathways. A case study illustrates how these methods can be used to assess food and environmental security. The concluding section addresses generalizations of parametric forms and linkages of regional models to global models.
Physiogenomic analysis of the Puerto Rican population

PubMed Central

Ruaño, Gualberto; Duconge, Jorge; Windemuth, Andreas; Cadilla, Carmen L; Kocherla, Mohan; Villagra, David; Renta, Jessica; Holford, Theodore; Santiago-Borrero, Pedro J

2009-01-01

Aims Admixture in the population of the island of Puerto Rico is of general interest with regards to pharmacogenetics to develop comprehensive strategies for personalized healthcare in Latin Americans. This research was aimed at determining the frequencies of SNPs in key physiological, pharmacological and biochemical genes to infer population structure and ancestry in the Puerto Rican population. Materials & methods A noninterventional, cross-sectional, retrospective study design was implemented following a controlled, stratified-by-region, random sampling protocol. The sample was based on birthrates in each region of the island of Puerto Rico, according to the 2004 National Birth Registry. Genomic DNA samples from 100 newborns were obtained from the Puerto Rico Newborn Screening Program in dried-blood spot cards. Genotyping using a physiogenomic array was performed for 332 SNPs from 196 cardiometabolic and neuroendocrine genes. Population structure was examined using a Bayesian clustering approach as well as by allelic dissimilarity as a measure of allele sharing. Results The Puerto Rican sample was found to be broadly heterogeneous. We observed three main clusters in the population, which we hypothesize to reflect the historical admixture in the Puerto Rican population from Amerindian, African and European ancestors. We present evidence for this interpretation by comparing allele frequencies for the three clusters with those for the same SNPs available from the International HapMap project for Asian, African and European populations. Conclusion Our results demonstrate that population analysis can be performed with a physiogenomic array of cardiometabolic and neuroendocrine genes to facilitate the translation of genome diversity into personalized medicine. PMID:19374515
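A stratified-by-region draw with sample sizes proportional to regional births can be sketched as below. The region names, birth counts, and the largest-remainder rounding rule are assumptions made for illustration; the abstract states only that 100 newborn samples were drawn according to regional birthrates.

```python
import random


def proportional_allocation(births_by_region, total):
    """Split `total` across regions in proportion to births (largest-remainder rounding)."""
    grand_total = sum(births_by_region.values())
    quotas = {r: total * b / grand_total for r, b in births_by_region.items()}
    allocation = {r: int(q) for r, q in quotas.items()}
    leftover = total - sum(allocation.values())
    by_remainder = sorted(quotas, key=lambda r: quotas[r] - allocation[r], reverse=True)
    for region in by_remainder[:leftover]:
        allocation[region] += 1
    return allocation


def stratified_sample(ids_by_region, allocation, seed=0):
    """Simple random sample without replacement inside each region."""
    rng = random.Random(seed)
    return {r: rng.sample(ids_by_region[r], k) for r, k in allocation.items()}


# Hypothetical regions and annual birth counts.
births = {"North": 12000, "South": 9000, "East": 7000, "West": 15000, "Metro": 8000}
allocation = proportional_allocation(births, 100)
cards = {region: list(range(n)) for region, n in births.items()}  # stand-in card IDs
sample = stratified_sample(cards, allocation)
print(allocation)
```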
Identification of forensic samples by using an infrared-based automatic DNA sequencer.

PubMed

Ricci, Ugo; Sani, Ilaria; Klintschar, Michael; Cerri, Nicoletta; De Ferrari, Francesco; Giovannucci Uzielli, Maria Luisa

2003-06-01

We have recently introduced a new protocol for analyzing all core loci of the Federal Bureau of Investigation's (FBI) Combined DNA Index System (CODIS) with an infrared (IR) automatic DNA sequencer (LI-COR 4200). The amplicons were labeled with forward oligonucleotide primers, covalently linked to a new infrared fluorescent molecule (IRDye 800). The alleles were displayed as familiar autoradiogram-like images with real-time detection. This protocol was employed for paternity testing, population studies, and identification of degraded forensic samples. We extensively analyzed some simulated forensic samples and mixed stains (blood, semen, saliva, bones, and fixed archival embedded tissues), comparing the results with donor samples. Sensitivity studies were also performed for the four multiplex systems. Our results show the efficiency, reliability, and accuracy of the IR system for the analysis of forensic samples. We also compared the efficiency of the multiplex protocol with ultraviolet (UV) technology. Paternity tests, undegraded DNA samples, and real forensic samples were analyzed with this approach based on IR technology and with UV-based automatic sequencers in combination with commercially-available kits. The comparability of the results with the widespread UV methods suggests that it is possible to exchange data between laboratories using the same core group of markers but different primer sets and detection methods.
Variation in the cranial base orientation and facial skeleton in dry skulls sampled from three major populations.

PubMed

Kuroe, Kazuto; Rosas, Antonio; Molleson, Theya

2004-04-01

The aim of this study was to analyse the effects of cranial base orientation on the morphology of the craniofacial system in human populations. Three geographically distant populations from Europe (72), Africa (48) and Asia (24) were chosen. Five angular and two linear variables from the cranial base component and six angular and six linear variables from the facial component based on two reference lines of the vertical posterior maxillary and Frankfort horizontal planes were measured. The European sample presented dolichofacial individuals with a larger face height and a smaller face depth derived from a raised cranial base and facial cranium orientation which tended to be similar to the Asian sample. The African sample presented brachyfacial individuals with a reduced face height and a larger face depth as a result of a lowered cranial base and facial cranium orientation. The Asian sample presented dolichofacial individuals with a larger face height and depth due to a raised cranial base and facial cranium orientation. The findings of this study suggest that cranial base orientation and posterior cranial base length appear to be valid discriminating factors between different human populations.
Optimized methods for total nucleic acid extraction and quantification of the bat white-nose syndrome fungus, Pseudogymnoascus destructans, from swab and environmental samples.

PubMed

Verant, Michelle L; Bohuski, Elizabeth A; Lorch, Jeffery M; Blehert, David S

2016-03-01

The continued spread of white-nose syndrome and its impacts on hibernating bat populations across North America has prompted nationwide surveillance efforts and the need for high-throughput, noninvasive diagnostic tools. Quantitative real-time polymerase chain reaction (qPCR) analysis has been increasingly used for detection of the causative fungus, Pseudogymnoascus destructans, in both bat- and environment-associated samples and provides a tool for quantification of fungal DNA useful for research and monitoring purposes. However, precise quantification of nucleic acid from P. destructans is dependent on effective and standardized methods for extracting nucleic acid from various relevant sample types. We describe optimized methodologies for extracting fungal nucleic acids from sediment, guano, and swab-based samples using commercial kits together with a combination of chemical, enzymatic, and mechanical modifications. Additionally, we define modifications to a previously published intergenic spacer-based qPCR test for P. destructans to refine quantification capabilities of this assay. © 2016 The Author(s).
Optimized methods for total nucleic acid extraction and quantification of the bat white-nose syndrome fungus, Pseudogymnoascus destructans, from swab and environmental samples

USGS Publications Warehouse

Verant, Michelle; Bohuski, Elizabeth A.; Lorch, Jeffrey M.; Blehert, David

2016-01-01

The continued spread of white-nose syndrome and its impacts on hibernating bat populations across North America has prompted nationwide surveillance efforts and the need for high-throughput, noninvasive diagnostic tools. Quantitative real-time polymerase chain reaction (qPCR) analysis has been increasingly used for detection of the causative fungus, Pseudogymnoascus destructans, in both bat- and environment-associated samples and provides a tool for quantification of fungal DNA useful for research and monitoring purposes. However, precise quantification of nucleic acid from P. destructans is dependent on effective and standardized methods for extracting nucleic acid from various relevant sample types. We describe optimized methodologies for extracting fungal nucleic acids from sediment, guano, and swab-based samples using commercial kits together with a combination of chemical, enzymatic, and mechanical modifications. Additionally, we define modifications to a previously published intergenic spacer–based qPCR test for P. destructans to refine quantification capabilities of this assay.
Improved diagnosis of Trichuris trichiura by using a bead-beating procedure on ethanol preserved stool samples prior to DNA isolation and the performance of multiplex real-time PCR for intestinal parasites.

PubMed

Kaisar, Maria M M; Brienen, Eric A T; Djuardi, Yenny; Sartono, Erliyani; Yazdanbakhsh, Maria; Verweij, Jaco J; Supali, Taniawati; van Lieshout, Lisette

2017-06-01

For the majority of intestinal parasites, real-time PCR-based diagnosis outperforms microscopy. However, the data for Trichuris trichiura have been less convincing and most comparative studies have been performed in populations with low prevalence. This study aims to improve detection of T. trichiura DNA in human stool by evaluating four sample preparation methods. Faecal samples (n = 60) were collected at Flores island, Indonesia and examined by microscopy. Aliquots were taken and a bead-beating procedure was used both on directly frozen stool and on material preserved with 96% ethanol. PCR on frozen samples showed 40% to be positive for T. trichiura, compared with 45% positive by microscopy. The percentage positive increased when using ethanol preservation (45·0%), bead-beating (51·7%) and a combination (55·0%) and all three methods showed significantly higher DNA loads. The various procedures had a less pronounced effect on the PCR results of nine other parasite targets tested. Most prevalent were Ascaris lumbricoides (≈60%), Necator americanus (≈60%), Dientamoeba fragilis (≈50%) and Giardia lamblia (≈12%). To validate the practicality of the procedure, bead-beating was applied in a population-based survey testing 910 stool samples. Findings confirmed bead-beating before DNA extraction to be a highly efficient procedure for the detection of T. trichiura DNA in stool.
Effectiveness of Family, Child, and Family-Child Based Intervention on ADHD Symptoms of Students with Disabilities

ERIC Educational Resources Information Center

Malekpour, Mokhtar; Aghababaei, Sara; Hadi, Samira

2014-01-01

The aim of the present study was to investigate and compare the effectiveness of family, child, and family-child based intervention on the rate of ADHD symptoms in third grade students. The population for this study was all of students with ADHD diagnoses in the city of Isfahan, Iran. The multistage random sampling method was used to select the 60…
Genetic algorithms and MCML program for recovery of optical properties of homogeneous turbid media

PubMed Central

Morales Cruzado, Beatriz; Vázquez y Montiel, Sergio; Delgado Atencio, José Alberto

2013-01-01

In this paper, we present and validate a new method for optical properties recovery of turbid media with slab geometry. This method is an iterative method that compares diffuse reflectance and transmittance, measured using integrating spheres, with those obtained using the known algorithm MCML. The search procedure is based on the evolution of a population due to selection of the best individuals, i.e., using a genetic algorithm. This new method includes several corrections such as non-linear effects in integrating spheres measurements and loss of light due to the finite size of the sample. As a potential application and proof-of-principle experiment of this new method, we use this new algorithm in the recovery of optical properties of blood samples at different degrees of coagulation. PMID:23504404
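The search loop can be illustrated with a minimal genetic-algorithm sketch. It is not the authors' implementation: `toy_forward` is a made-up, non-physical stand-in for an MCML run, and the population size, elite fraction, mutation schedule, and parameter ranges are all assumptions. Candidates are pairs of absorption and scattering coefficients (`mu_a`, `mu_s`), and fitness is the squared mismatch between modelled and measured reflectance and transmittance.

```python
import math
import random


def toy_forward(mu_a, mu_s, thickness=1.0):
    """Hypothetical stand-in for a Monte Carlo forward model (not MCML, not physical)."""
    mu_t = mu_a + mu_s
    transmittance = math.exp(-mu_t * thickness)
    reflectance = (mu_s / mu_t) * (1.0 - transmittance)
    return reflectance, transmittance


def mismatch(candidate, measured):
    """Squared distance between modelled and measured (reflectance, transmittance)."""
    reflectance, transmittance = toy_forward(*candidate)
    return (reflectance - measured[0]) ** 2 + (transmittance - measured[1]) ** 2


def recover(measured, generations=200, pop_size=40, seed=0):
    """Evolve (mu_a, mu_s) candidates: keep the best quarter, refill by crossover + mutation."""
    rng = random.Random(seed)
    population = [(rng.uniform(0.01, 5.0), rng.uniform(0.01, 5.0))
                  for _ in range(pop_size)]
    sigma = 1.0  # mutation width, shrunk every generation
    for _ in range(generations):
        population.sort(key=lambda c: mismatch(c, measured))
        elites = population[: pop_size // 4]
        children = []
        while len(elites) + len(children) < pop_size:
            parent_a, parent_b = rng.sample(elites, 2)
            child = tuple(max(1e-3, rng.choice(genes) + rng.gauss(0.0, sigma))
                          for genes in zip(parent_a, parent_b))
            children.append(child)
        population = elites + children
        sigma *= 0.97
    return min(population, key=lambda c: mismatch(c, measured))


measured = toy_forward(0.5, 2.0)  # synthetic "measurement" with known answer
estimate = recover(measured)
print(estimate)
```

In a real application each fitness evaluation would be a full Monte Carlo simulation, so the number of candidates and generations dominates the running time.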
Comparing Study Populations of Men Who Have Sex with Men: Evaluating Consistency Within Repeat Studies and Across Studies in the Seattle Area Using Different Recruitment Methodologies

PubMed Central

Burt, Richard D.; Oster, Alexandra M.; Golden, Mathew R.; Thiede, Hanne

2013-01-01

There is no gold standard for recruiting unbiased samples of men who have sex with men (MSM). To assess differing recruitment methods, we compared Seattle-area MSM samples from: venue-day-time sampling-based National HIV Behavioral Surveillance (NHBS) surveys in 2008 and 2011, random-digit-dialed (RDD) surveys in 2003 and 2006, and STD clinic patient data 2001–2011. We compared sociodemographics, sexual and drug-associated behavior, and HIV status and testing. There was generally good consistency between the two NHBS surveys and within STD clinic data across time. NHBS participants reported higher levels of drug-associated and lower levels of sexual risk than STD clinic patients. RDD participants differed from the other study populations in sociodemographics and some risk behaviors. While neither NHBS nor the STD clinic study populations may be representative of all MSM, both appear to provide consistent samples of MSM subpopulations across time that can provide useful information to guide HIV prevention. PMID:23900958
Estimation of infection prevalence and sensitivity in a stratified two-stage sampling design employing highly specific diagnostic tests when there is no gold standard.

PubMed

Miller, Ezer; Huppert, Amit; Novikov, Ilya; Warburg, Alon; Hailu, Asrat; Abbasi, Ibrahim; Freedman, Laurence S

2015-11-10

In this work, we describe a two-stage sampling design to estimate the infection prevalence in a population. In the first stage, an imperfect diagnostic test was performed on a random sample of the population. In the second stage, a different imperfect test was performed in a stratified random sample of the first sample. To estimate infection prevalence, we assumed conditional independence between the diagnostic tests and develop method of moments estimators based on expectations of the proportions of people with positive and negative results on both tests that are functions of the tests' sensitivity, specificity, and the infection prevalence. A closed-form solution of the estimating equations was obtained assuming a specificity of 100% for both tests. We applied our method to estimate the infection prevalence of visceral leishmaniasis according to two quantitative polymerase chain reaction tests performed on blood samples taken from 4756 patients in northern Ethiopia. The sensitivities of the tests were also estimated, as well as the standard errors of all estimates, using a parametric bootstrap. We also examined the impact of departures from our assumptions of 100% specificity and conditional independence on the estimated prevalence. Copyright © 2015 John Wiley & Sons, Ltd.
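Under 100% specificity and conditional independence, the moment equations have a simple closed form in the special case where both tests are applied to the same random sample: P(T1+) = πs1, P(T2+) = πs2 and P(T1+, T2+) = πs1s2, so π = P(T1+)P(T2+)/P(T1+, T2+). The sketch below covers only that simplified case; it omits the stratified second-stage sampling of the paper, and the counts are hypothetical.

```python
def moment_estimates(n, n_pos_1, n_pos_2, n_pos_both):
    """Method-of-moments estimates when both tests are run on all n people.

    Assumes 100% specificity for both tests and conditional independence
    of the tests given infection status.
    """
    if n_pos_both == 0:
        raise ValueError("no double positives: the estimators are undefined")
    return {
        "prevalence": n_pos_1 * n_pos_2 / (n * n_pos_both),
        "sensitivity_1": n_pos_both / n_pos_2,
        "sensitivity_2": n_pos_both / n_pos_1,
    }


# Hypothetical counts built from prevalence 0.2 and sensitivities 0.8 and 0.6:
# 200 infected of 1000, giving 160 and 120 single-test positives and 96 double positives.
estimates = moment_estimates(n=1000, n_pos_1=160, n_pos_2=120, n_pos_both=96)
print(estimates)
```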
Unified framework to evaluate panmixia and migration direction among multiple sampling locations.

PubMed

Beerli, Peter; Palczewski, Michal

2010-05-01

For many biological investigations, groups of individuals are genetically sampled from several geographic locations. These sampling locations often do not reflect the genetic population structure. We describe a framework using marginal likelihoods to compare and order structured population models, such as testing whether the sampling locations belong to the same randomly mating population or comparing unidirectional and multidirectional gene flow models. In the context of inferences employing Markov chain Monte Carlo methods, the accuracy of the marginal likelihoods depends heavily on the approximation method used to calculate the marginal likelihood. Two methods, modified thermodynamic integration and a stabilized harmonic mean estimator, are compared. With finite Markov chain Monte Carlo run lengths, the harmonic mean estimator may not be consistent. Thermodynamic integration, in contrast, delivers considerably better estimates of the marginal likelihood. The choice of prior distributions does not influence the order and choice of the better models when the marginal likelihood is estimated using thermodynamic integration, whereas with the harmonic mean estimator the influence of the prior is pronounced and the order of the models changes. The approximation of marginal likelihood using thermodynamic integration in MIGRATE allows the evaluation of complex population genetic models, not only of whether sampling locations belong to a single panmictic population, but also of competing complex structured population models.
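As a rough illustration of thermodynamic integration, the sketch below uses a toy conjugate model that is an assumption made for this example and is unrelated to MIGRATE or to population genetic models: observations y_i ~ N(θ, 1) with prior θ ~ N(0, τ²). The log marginal likelihood equals the integral over t in [0, 1] of the expected log-likelihood under the "power posterior" (likelihood raised to the power t, times the prior). In this toy model that expectation is available analytically, so no MCMC is needed, and the result can be checked against the exact answer. All function names are hypothetical.

```python
import math


def exact_log_marginal(y, tau2):
    """Exact log marginal likelihood for y_i ~ N(theta, 1), theta ~ N(0, tau2)."""
    n, s = len(y), sum(y)
    ss = sum(v * v for v in y)
    return (-0.5 * n * math.log(2 * math.pi)
            - 0.5 * math.log(1 + n * tau2)
            - 0.5 * (ss - tau2 * s * s / (1 + n * tau2)))


def expected_log_lik(y, tau2, t):
    """E[log L(theta)] under the power posterior at temperature t (analytic here)."""
    n, s = len(y), sum(y)
    prec = 1.0 / tau2 + t * n     # power-posterior precision
    mean = t * s / prec           # power-posterior mean
    var = 1.0 / prec              # power-posterior variance
    sq = sum((v - mean) ** 2 for v in y)
    return -0.5 * n * math.log(2 * math.pi) - 0.5 * (sq + n * var)


def thermodynamic_integration(y, tau2, steps=2000):
    """Trapezoidal integration of E_t[log L] over t in [0, 1]."""
    h = 1.0 / steps
    vals = [expected_log_lik(y, tau2, k * h) for k in range(steps + 1)]
    return h * (sum(vals) - 0.5 * (vals[0] + vals[-1]))


y = [0.3, -1.2, 0.8, 1.5, 0.1]    # illustrative data only
print(thermodynamic_integration(y, 1.0), exact_log_marginal(y, 1.0))
```

In a realistic setting the expectation at each temperature would be estimated from a separate MCMC run, which is where the accuracy questions discussed in the abstract arise.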
Calculating p-values and their significances with the Energy Test for large datasets

NASA Astrophysics Data System (ADS)

Barter, W.; Burr, C.; Parkes, C.

2018-04-01

The energy test method is a multi-dimensional test of whether two samples are consistent with arising from the same underlying population, through the calculation of a single test statistic (called the T-value). The method has recently been used in particle physics to search for samples that differ due to CP violation. The generalised extreme value function has previously been used to describe the distribution of T-values under the null hypothesis that the two samples are drawn from the same underlying population. We show that, in a simple test case, the distribution is not sufficiently well described by the generalised extreme value function. We present a new method, where the distribution of T-values under the null hypothesis when comparing two large samples can be found by scaling the distribution found when comparing small samples drawn from the same population. This method can then be used to quickly calculate the p-values associated with the results of the test.
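To make the T-value concrete, here is a minimal sketch of one commonly used form of the energy test statistic, with a Gaussian distance weighting of width `delta`. The weighting function, the default `delta`, and all function names are assumptions for illustration; the abstract does not specify them. The p-value is obtained by brute-force permutation of the pooled sample, which is the slow baseline rather than the scaling method the abstract proposes.

```python
import math
import random


def psi(a, b, delta):
    """Gaussian weighting of the squared Euclidean distance (assumed form)."""
    d2 = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-d2 / (2 * delta ** 2))


def t_value(s1, s2, delta=0.5):
    """Energy test statistic for two samples of equal-length tuples."""
    n, m = len(s1), len(s2)
    if n < 2 or m < 2:
        raise ValueError("each sample needs at least two points")
    w1 = sum(psi(s1[i], s1[j], delta) for i in range(n) for j in range(i + 1, n))
    w2 = sum(psi(s2[i], s2[j], delta) for i in range(m) for j in range(i + 1, m))
    cross = sum(psi(a, b, delta) for a in s1 for b in s2)
    return w1 / (n * (n - 1)) + w2 / (m * (m - 1)) - cross / (n * m)


def permutation_p_value(s1, s2, n_perm=200, delta=0.5, seed=0):
    """Fraction of random relabellings with T at least as large as observed."""
    rng = random.Random(seed)
    observed = t_value(s1, s2, delta)
    pooled = list(s1) + list(s2)
    n = len(s1)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if t_value(pooled[:n], pooled[n:], delta) >= observed:
            exceed += 1
    return (exceed + 1) / (n_perm + 1)
```

Each T evaluation costs on the order of the squared sample size, and the permutation loop multiplies that by `n_perm`, which is why a faster route to the null distribution matters for large datasets.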
Characterization of Enterobius vermicularis in a human population, employing a molecular-based method from adhesive tape samples.

PubMed

Piperaki, Evangelia-Theophano; Spanakos, Gregory; Patsantara, Giannoula; Vassalou, Evdokia; Vakalis, Nikolaos; Tsakris, Athanassios

2011-01-01

Human infection with the parasitic nematode Enterobius vermicularis occurs worldwide, particularly in children. Although its prevalence may exceed 35% in some parts of the world, molecular studies of E. vermicularis in humans are limited. The aim of the present study was to investigate the genetic variation within E. vermicularis in a human population. For this purpose, 77 adhesive tape samples taken from Greek children infested with E. vermicularis were tested. New primers were designed to amplify a segment of the mitochondrial cytochrome c oxidase subunit 1 (cox1) gene of E. vermicularis from adhesive tape samples. Thirty-six amplicons were sequenced and eleven different haplotypes were identified. All sequences clustered within the type previously characterized (type B), only reported to date from captive chimpanzees. To the best of our knowledge, this is the first study of E. vermicularis genotypes from a human population. Copyright © 2011 Elsevier Ltd. All rights reserved.
Pharmacokinetic Studies in Neonates: The Utility of an Opportunistic Sampling Design.

PubMed

Leroux, Stéphanie; Turner, Mark A; Guellec, Chantal Barin-Le; Hill, Helen; van den Anker, Johannes N; Kearns, Gregory L; Jacqz-Aigrain, Evelyne; Zhao, Wei

2015-12-01

The use of an opportunistic (also called scavenged) sampling strategy in a prospective pharmacokinetic study combined with population pharmacokinetic modelling has been proposed as an alternative strategy to conventional methods for accomplishing pharmacokinetic studies in neonates. However, the reliability of this approach in this particular paediatric population has not been evaluated. The objective of the present study was to evaluate the performance of an opportunistic sampling strategy for a population pharmacokinetic estimation, as well as dose prediction, and compare this strategy with a predetermined pharmacokinetic sampling approach. Three population pharmacokinetic models were derived for ciprofloxacin from opportunistic blood samples (SC model), predetermined (i.e. scheduled) samples (TR model) and all samples (full model used to previously characterize ciprofloxacin pharmacokinetics), using NONMEM software. The predictive performance of developed models was evaluated in an independent group of patients. Pharmacokinetic data from 60 newborns were obtained with a total of 430 samples available for analysis; 265 collected at predetermined times and 165 that were scavenged from those obtained as part of clinical care. All datasets were fit using a two-compartment model with first-order elimination. The SC model could identify the most significant covariates and provided reasonable estimates of population pharmacokinetic parameters (clearance and steady-state volume of distribution) compared with the TR and full models. Their predictive performances were further confirmed in an external validation by Bayesian estimation, and showed similar results. Monte Carlo simulation based on area under the concentration-time curve from zero to 24 h (AUC24)/minimum inhibitory concentration (MIC) using either the SC or the TR model gave similar dose prediction for ciprofloxacin. 
Blood samples scavenged in the course of caring for neonates can be used to estimate ciprofloxacin pharmacokinetic parameters and therapeutic dose requirements.
An adaptive two-stage sequential design for sampling rare and clustered populations

USGS Publications Warehouse

Brown, J.A.; Salehi, M.M.; Moradi, M.; Bell, G.; Smith, D.R.

2008-01-01

How to design an efficient large-area survey continues to be an interesting question for ecologists. In sampling large areas, as is common in environmental studies, adaptive sampling can be efficient because it ensures survey effort is targeted to subareas of high interest. In two-stage sampling, higher density primary sample units are usually of more interest than lower density primary units when populations are rare and clustered. Two-stage sequential sampling has been suggested as a method for allocating second stage sample effort among primary units. Here, we suggest a modification: adaptive two-stage sequential sampling. In this method, the adaptive part of the allocation process means the design is more flexible in how much extra effort can be directed to higher-abundance primary units. We discuss how best to design an adaptive two-stage sequential sample. © 2008 The Society of Population Ecology and Springer.
Classification of stellar populations in globular clusters

NASA Astrophysics Data System (ADS)

Wang, Yue; Zhao, Gang; Li, Hai-Ning

2017-04-01

Possessing multiple stellar populations has been accepted as a common feature of globular clusters (GCs). Different stellar populations manifest themselves with different chemical features, e.g. the well-known O-Na anti-correlation. Generally, the first (primordial) population has O and Na abundances consistent with those of field stars with similar metallicity; while the second (polluted) population is identified by their Na overabundance and O deficiency. The fraction of the populations is an important constraint on the GC formation scenario. Several methods have been proposed for the classification of GC populations. Here we examine a criterion derived based on the distribution of Galactic field stars, which relies on Na abundance as a function of [Fe/H], to distinguish first and second stellar populations in GCs. By comparing the first population fractions of 17 GCs estimated by the field star criterion with those in the literature derived by methods related to individual GCs, we find that the field star criterion tends to overestimate the first population fractions. The population separation methods, which are related to an individual GC sample, are recommended because the diversity of GCs can be taken into consideration. Currently, more caution should be exercised if one wants to regard field stars as a reference for the identification of a GC population. However, further study on the connection between field stars and GCs populations is still needed.

MaCH-Admix: Genotype Imputation for Admixed Populations

PubMed Central

Liu, Eric Yi; Li, Mingyao; Wang, Wei; Li, Yun

2012-01-01

Imputation in admixed populations is an important problem but challenging due to the complex linkage disequilibrium (LD) pattern. The emergence of large reference panels such as that from the 1,000 Genomes Project enables more accurate imputation in general, and in particular for admixed populations and for uncommon variants. To efficiently benefit from these large reference panels, one key issue to consider in a modern genotype imputation framework is the selection of effective reference panels. In this work, we consider a number of methods for effective reference panel construction inside a hidden Markov model and specific to each target individual. These methods fall into two categories: identity-by-state (IBS) based and ancestry-weighted approaches. We evaluated the performance on individuals from recently admixed populations. Our target samples include 8,421 African Americans and 3,587 Hispanic Americans from the Women’s Health Initiative, which allow assessment of imputation quality for uncommon variants. Our experiments include both large and small reference panels; large, medium, and small target samples; and genome regions of varying levels of LD. We also include BEAGLE and IMPUTE2 for comparison. Experimental results with a large reference panel suggest that our novel piecewise IBS method yields consistently higher imputation quality than other methods/software. The advantage is particularly noteworthy among uncommon variants where we observe up to 5.1% information gain with the difference being highly significant (Wilcoxon signed rank test P-value < 0.0001). Our work is the first that considers various sensible approaches for imputation in admixed populations and presents a comprehensive comparison. PMID:23074066
Stratified Sampling Design Based on Data Mining

PubMed Central

Kim, Yeonkook J.; Oh, Yoonhwan; Park, Sunghoon; Cho, Sungzoon

2013-01-01

Objectives To explore classification rules based on data mining methodologies which are to be used in defining strata in stratified sampling of healthcare providers with improved sampling efficiency. Methods We performed k-means clustering to group providers with similar characteristics, then, constructed decision trees on cluster labels to generate stratification rules. We assessed the variance explained by the stratification proposed in this study and by conventional stratification to evaluate the performance of the sampling design. We constructed a study database from health insurance claims data and providers' profile data made available to this study by the Health Insurance Review and Assessment Service of South Korea, and population data from Statistics Korea. From our database, we used the data for single specialty clinics or hospitals in two specialties, general surgery and ophthalmology, for the year 2011 in this study. Results Data mining resulted in five strata in general surgery with two stratification variables, the number of inpatients per specialist and population density of provider location, and five strata in ophthalmology with two stratification variables, the number of inpatients per specialist and number of beds. The percentages of variance in annual changes in the productivity of specialists explained by the stratification in general surgery and ophthalmology were 22% and 8%, respectively, whereas conventional stratification by the type of provider location and number of beds explained 2% and 0.2% of variance, respectively. Conclusions This study demonstrated that data mining methods can be used in designing efficient stratified sampling with variables readily available to the insurer and government; it offers an alternative to the existing stratification method that is widely used in healthcare provider surveys in South Korea. PMID:24175117
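The abstract above compares stratifications by the percentage of variance they explain, but does not state how that percentage was computed. One plausible reading, assumed here, is the ratio of the between-strata sum of squares to the total sum of squares. The sketch below shows that calculation; the function name and the toy inputs are hypothetical.

```python
def variance_explained(values, strata):
    """Share of total variance in `values` accounted for by stratum labels.

    Computed as between-strata sum of squares / total sum of squares
    (an assumed definition; the abstract does not specify one).
    """
    if len(values) != len(strata) or not values:
        raise ValueError("values and strata must be non-empty and equal length")
    groups = {}
    for v, s in zip(values, strata):
        groups.setdefault(s, []).append(v)
    grand_mean = sum(values) / len(values)
    total_ss = sum((v - grand_mean) ** 2 for v in values)
    if total_ss == 0:
        raise ValueError("values have zero variance")
    between_ss = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2
                     for g in groups.values())
    return between_ss / total_ss


# Toy example: strata that separate the values perfectly vs. not at all.
print(variance_explained([1, 1, 5, 5], ["a", "a", "b", "b"]))
print(variance_explained([1, 1, 5, 5], ["a", "b", "a", "b"]))
```

A stratification with a higher ratio leaves less variance within strata, which is what makes a stratified sample more efficient for the same size.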
Ultrasensitive Genotypic Detection of Antiviral Resistance in Hepatitis B Virus Clinical Isolates

PubMed Central

Fang, Jie; Wichroski, Michael J.; Levine, Steven M.; Baldick, Carl J.; Mazzucco, Charles E.; Walsh, Ann W.; Kienzle, Bernadette K.; Rose, Ronald E.; Pokornowski, Kevin A.; Colonno, Richard J.; Tenney, Daniel J.

2009-01-01

Amino acid substitutions that confer reduced susceptibility to antivirals arise spontaneously through error-prone viral polymerases and are selected as a result of antiviral therapy. Resistance substitutions first emerge in a fraction of the circulating virus population, below the limit of detection by nucleotide sequencing of either the population or limited sets of cloned isolates. These variants can expand under drug pressure to dominate the circulating virus population. To enhance detection of these viruses in clinical samples, we established a highly sensitive quantitative, real-time allele-specific PCR assay for hepatitis B virus (HBV) DNA. Sensitivity was accomplished using a high-fidelity DNA polymerase and oligonucleotide primers containing locked nucleic acid bases. Quantitative measurement of resistant and wild-type variants was accomplished using sequence-matched standards. Detection methodology that was not reliant on hybridization probes, and assay modifications, minimized the effect of patient-specific sequence polymorphisms. The method was validated using samples from patients chronically infected with HBV through parallel sequencing of large numbers of cloned isolates. Viruses with resistance to lamivudine and other l-nucleoside analogs and entecavir, involving 17 different nucleotide substitutions, were reliably detected at levels at or below 0.1% of the total population. The method worked across HBV genotypes. Longitudinal analysis of patient samples showed earlier emergence of resistance on therapy than was seen with sequencing methodologies, including some cases of resistance that existed prior to treatment. In summary, we established and validated an ultrasensitive method for measuring resistant HBV variants in clinical specimens, which enabled earlier, quantitative measurement of resistance to therapy. PMID:19433559
Value of information methods to design a clinical trial in a small population to optimise a health economic utility function.

PubMed

Pearce, Michael; Hee, Siew Wan; Madan, Jason; Posch, Martin; Day, Simon; Miller, Frank; Zohar, Sarah; Stallard, Nigel

2018-02-08

Most confirmatory randomised controlled clinical trials (RCTs) are designed with specified power, usually 80% or 90%, for a hypothesis test conducted at a given significance level, usually 2.5% for a one-sided test. Approval of the experimental treatment by regulatory agencies is then based on the result of such a significance test with other information to balance the risk of adverse events against the benefit of the treatment to future patients. In the setting of a rare disease, recruiting sufficient patients to achieve conventional error rates for clinically reasonable effect sizes may be infeasible, suggesting that the decision-making process should reflect the size of the target population. We considered the use of a decision-theoretic value of information (VOI) method to obtain the optimal sample size and significance level for confirmatory RCTs in a range of settings. We assume the decision maker represents society. For simplicity we assume the primary endpoint to be normally distributed with unknown mean following some normal prior distribution representing information on the anticipated effectiveness of the therapy available before the trial. The method is illustrated by an application in an RCT in haemophilia A. We explicitly specify the utility in terms of improvement in primary outcome and compare this with the costs of treating patients, both financial and in terms of potential harm, during the trial and in the future. The optimal sample size for the clinical trial decreases as the size of the population decreases. For non-zero cost of treating future patients, either monetary or in terms of potential harmful effects, stronger evidence is required for approval as the population size increases, though this is not the case if the costs of treating future patients are ignored. 
Decision-theoretic VOI methods offer a flexible approach with both type I error rate and power (or equivalently trial sample size) depending on the size of the future population for whom the treatment under investigation is intended. This might be particularly suitable for small populations when there is considerable information about the patient population.
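For context, the conventional design that the abstract contrasts with the VOI approach fixes the significance level and power in advance. A minimal sketch of the textbook per-arm sample size for a two-arm trial with a normally distributed endpoint follows. It assumes a known common standard deviation `sigma`, equal allocation, and a target difference `delta`; these names and the example values are illustrative and do not come from the paper.

```python
import math
from statistics import NormalDist


def per_arm_sample_size(delta, sigma, alpha=0.025, power=0.9):
    """Per-arm n for a one-sided z-test comparing two means.

    n = 2 * ((z_{1-alpha} + z_{power}) * sigma / delta) ** 2, rounded up.
    """
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha)
    z_power = z.inv_cdf(power)
    return math.ceil(2 * ((z_alpha + z_power) * sigma / delta) ** 2)


# Half a standard deviation difference at one-sided 2.5%.
print(per_arm_sample_size(0.5, 1.0, power=0.9))  # 85
print(per_arm_sample_size(0.5, 1.0, power=0.8))  # 63
```

Note that this formula does not depend on the size of the future patient population, whereas in the decision-theoretic approach described above both the sample size and the significance level vary with it.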
A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa

PubMed Central

Petegrosso, Raphael; Tolar, Jakub

2018-01-01

Single-cell RNA sequencing (scRNA-seq) has been widely applied to discover new cell types by detecting sub-populations in a heterogeneous group of cells. Since scRNA-seq experiments have lower read coverage/tag counts and introduce more technical biases compared to bulk RNA-seq experiments, the limited number of sampled cells combined with the experimental biases and other dataset specific variations presents a challenge to cross-dataset analysis and discovery of relevant biological variations across multiple cell populations. In this paper, we introduce a method of variance-driven multitask clustering of single-cell RNA-seq data (scVDMC) that utilizes multiple single-cell populations from biological replicates or different samples. scVDMC clusters single cells in multiple scRNA-seq experiments of similar cell types and markers but varying expression patterns such that the scRNA-seq data are better integrated than typical pooled analyses which only increase the sample size. By controlling the variance among the cell clusters within each dataset and across all the datasets, scVDMC detects cell sub-populations in each individual experiment with shared cell-type markers but varying cluster centers among all the experiments. Applied to two real scRNA-seq datasets with several replicates and one large-scale droplet-based dataset on three patient samples, scVDMC more accurately detected cell populations and known cell markers than pooled clustering and other recently proposed scRNA-seq clustering methods. In the case study applied to in-house Recessive Dystrophic Epidermolysis Bullosa (RDEB) scRNA-seq data, scVDMC revealed several new cell types and unknown markers validated by flow cytometry. MATLAB/Octave code available at https://github.com/kuanglab/scVDMC. PMID:29630593
Comparison of Relative Bias, Precision, and Efficiency of Sampling Methods for Natural Enemies of Soybean Aphid (Hemiptera: Aphididae).

PubMed

Bannerman, J A; Costamagna, A C; McCornack, B P; Ragsdale, D W

2015-06-01

Generalist natural enemies play an important role in controlling soybean aphid, Aphis glycines (Hemiptera: Aphididae), in North America. Several sampling methods are used to monitor natural enemy populations in soybean, but there has been little work investigating their relative bias, precision, and efficiency. We compare five sampling methods: quadrats, whole-plant counts, sweep-netting, walking transects, and yellow sticky cards to determine the most practical methods for sampling the three most prominent species, which included Harmonia axyridis (Pallas), Coccinella septempunctata L. (Coleoptera: Coccinellidae), and Orius insidiosus (Say) (Hemiptera: Anthocoridae). We show an important time by sampling method interaction indicated by diverging community similarities within and between sampling methods as the growing season progressed. Similarly, correlations between sampling methods for the three most abundant species over multiple time periods indicated differences in relative bias between sampling methods and suggests that bias is not consistent throughout the growing season, particularly for sticky cards and whole-plant samples. Furthermore, we show that sticky cards produce strongly biased capture rates relative to the other four sampling methods. Precision and efficiency differed between sampling methods and sticky cards produced the most precise (but highly biased) results for adult natural enemies, while walking transects and whole-plant counts were the most efficient methods for detecting coccinellids and O. insidiosus, respectively. Based on bias, precision, and efficiency considerations, the most practical sampling methods for monitoring in soybean include walking transects for coccinellid detection and whole-plant counts for detection of small predators like O. insidiosus. Sweep-netting and quadrat samples are also useful for some applications, when efficiency is not paramount. © The Authors 2015. 
Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
ROLE OF LABORATORY SAMPLING DEVICES AND LABORATORY SUBSAMPLING METHODS IN OPTIMIZING REPRESENTATIVENESS STRATEGIES

EPA Science Inventory

Sampling is the act of selecting items from a specified population in order to estimate the parameters of that population (e.g., selecting soil samples to characterize the properties at an environmental site). Sampling occurs at various levels and times throughout an environmenta...
Conduct of a personal radiofrequency electromagnetic field measurement study: proposed study protocol.

PubMed

Röösli, Martin; Frei, Patrizia; Bolte, John; Neubauer, Georg; Cardis, Elisabeth; Feychting, Maria; Gajsek, Peter; Heinrich, Sabine; Joseph, Wout; Mann, Simon; Martens, Luc; Mohler, Evelyn; Parslow, Roger C; Poulsen, Aslak Harbo; Radon, Katja; Schüz, Joachim; Thuroczy, György; Viel, Jean-François; Vrijheid, Martine

2010-05-20

The development of new wireless communication technologies that emit radio frequency electromagnetic fields (RF-EMF) is ongoing, but little is known about the RF-EMF exposure distribution in the general population. Previous attempts to measure personal exposure to RF-EMF have used different measurement protocols and analysis methods making comparisons between exposure situations across different study populations very difficult. As a result, observed differences in exposure levels between study populations may not reflect real exposure differences but may be in part, or wholly due to methodological differences. The aim of this paper is to develop a study protocol for future personal RF-EMF exposure studies based on experience drawn from previous research. Using the current knowledge base, we propose procedures for the measurement of personal exposure to RF-EMF, data collection, data management and analysis, and methods for the selection and instruction of study participants. We have identified two basic types of personal RF-EMF measurement studies: population surveys and microenvironmental measurements. In the case of a population survey, the unit of observation is the individual and a randomly selected representative sample of the population is needed to obtain reliable results. For microenvironmental measurements, study participants are selected in order to represent typical behaviours in different microenvironments. These two study types require different methods and procedures. Applying our proposed common core procedures in future personal measurement studies will allow direct comparisons of personal RF-EMF exposures in different populations and study areas.
Aetiology for the covariation between combined type ADHD and reading difficulties in a family study: the role of IQ

PubMed Central

Cheung, Celeste H.M.; Wood, Alexis C.; Paloyelis, Yannis; Arias-Vasquez, Alejandro; Buitelaar, Jan K.; Franke, Barbara; Miranda, Ana; Mulas, Fernando; Rommelse, Nanda; Sergeant, Joseph A.; Sonuga-Barke, Edmund J.; Faraone, Stephen V.; Asherson, Philip; Kuntsi, Jonna

2012-01-01

Background Twin studies using both clinical and population-based samples suggest that the frequent co-occurrence of attention deficit hyperactivity disorder (ADHD) and reading ability/disability (RD) is largely driven by shared genetic influences. While both disorders are associated with lower IQ, recent twin data suggest that the shared genetic variability between reading difficulties and ADHD inattention symptoms is largely independent from genetic influences contributing to general cognitive ability. The current study aimed to extend the previous findings that were based on rating scale measures in a population sample by examining the generalizability of the findings to a clinical population, and by measuring reading difficulties both with a rating scale and with an objective task. We therefore investigated the familial relationships between ADHD, reading difficulties and IQ in a sample of individuals diagnosed with ADHD combined type, their siblings and control sibling pairs. Methods We ran multivariate familial models on data from 1789 individuals at ages 6 to 19. Reading difficulties were measured with both rating scale and an objective task. IQ was obtained using the Wechsler Intelligence Scales (WISC-III / WAIS-III). Results Significant phenotypic (0.2–0.4) and familial (0.3–0.5) correlations were observed among ADHD, reading difficulties and IQ. Yet 53% to 72% of the overlapping familial influences between ADHD and reading difficulties were not shared with IQ. Conclusions Our finding that familial influences shared with general cognitive ability, though present, do not account for the majority of the overlapping familial influences on ADHD and reading difficulties extends previous findings from a population-based study to a clinically-ascertained sample with combined type ADHD. PMID:22324316
An unusual haplotype structure on human chromosome 8p23 derived from the inversion polymorphism.

PubMed

Deng, Libin; Zhang, Yuezheng; Kang, Jian; Liu, Tao; Zhao, Hongbin; Gao, Yang; Li, Chaohua; Pan, Hao; Tang, Xiaoli; Wang, Dunmei; Niu, Tianhua; Yang, Huanming; Zeng, Changqing

2008-10-01

Chromosomal inversion is an important type of genomic variation involved in both evolution and disease pathogenesis. Here, we describe the refined genetic structure of a 3.8-Mb inversion polymorphism at chromosome 8p23. Using HapMap data of 1,073 SNPs generated from 209 unrelated samples from CEPH-Utah residents with ancestry from northern and western Europe (CEU); Yoruba in Ibadan, Nigeria (YRI); and Asian (ASN) samples, which comprised Han Chinese from Beijing, China (CHB) and Japanese from Tokyo, Japan (JPT), we successfully deduced the inversion orientations of all their 418 haplotypes. In particular, distinct haplotype subgroups were identified based on principal component analysis (PCA). Such genetic substructures were consistent with clustering patterns based on neighbor-joining tree reconstruction, which revealed a total of four haplotype clades across all samples. Metaphase fluorescence in situ hybridization (FISH) in a subset of 10 HapMap samples verified their inversion orientations predicted by PCA or phylogenetic tree reconstruction. Positioning of the outgroup haplotype within one of the YRI clades suggested that Human NCBI Build 36-inverted order is most likely the ancestral orientation. Furthermore, the population differentiation test and the relative extended haplotype homozygosity (REHH) analysis in this region discovered multiple selection signals, also in a population-specific manner. A positive selection signal was detected at XKR6 in the ASN population. These results revealed the correlation of inversion polymorphisms to population-specific genetic structures, and various selection patterns as possible mechanisms for the maintenance of a large chromosomal rearrangement at the 8p23 region during evolution. In addition, our study also showed that haplotype-based clustering methods, such as PCA, can be applied in scanning for cryptic inversion polymorphisms at a genome-wide scale.
Demonstration Report for Visual Sample Plan (VSP) Verification Sampling Methods at the Navy/DRI Site

DTIC Science & Technology

2011-08-01

population of 537,197 with an overall population density of 608 people per square mile (people/mi²). However, the population density in the vicinity...Preliminary Assessment Findings approximately 12 people/mi². Population density is expected to greatly increase following development of the site
Simultaneous determination of seven informative Y chromosome SNPs to differentiate East Asian, European, and African populations.

PubMed

Muro, Tomonori; Iida, Reiko; Fujihara, Junko; Yasuda, Toshihiro; Watanabe, Yukina; Imamura, Shinji; Nakamura, Hiroaki; Kimura-Kataoka, Kaori; Yuasa, Isao; Toga, Tomoko; Takeshita, Haruo

2011-05-01

Identification of the population origin of an individual is very useful for crime investigators who need to narrow down a suspect based on specimens left at a crime scene. Single nucleotide polymorphisms of the Y chromosome (Y-SNPs) are a class of markers of interest to forensic investigators because many of the markers indicate regional specificity, thus providing useful information about the geographic origin of a subject. We selected seven informative Y-SNPs (M168, M130, JST021355, M96, P126, P196, and P234) to differentiate the three major population groups (East Asian, European, and African) and used them to develop a forensic application. SNP genotyping was carried out by multiplex PCR and a multiplex single base extension (MSBE) reaction followed by capillary electrophoresis of extension products. This method can be used to assign a haplogroup from both degraded male DNA samples and DNA samples containing a mixture of female and male DNA through PCR primers that generate small amplicons (less than about 150 bp) and are highly specific for targets on the Y chromosome. The allelic state of each marker was definitively determined from a total of 791 males from the three major population groups. As expected, samples from the three major population groups showed Y-haplogroups common in the region of provenance: Y haplogroups C, D, and O for East Asians; IJ and R1 for Europeans; and AB and E for Africans. Published by Elsevier Ireland Ltd.
Prevalence of Diabetes and Intermediate Hyperglycemia Among Adults From the First Multinational Study of Noncommunicable Diseases in Six Central American Countries

PubMed Central

Barcelo, Alberto; Gregg, Edward W.; Gerzoff, Robert B.; Wong, Roy; Perez Flores, Enrique; Ramirez-Zea, Manuel; Cafiero, Elizabeth; Altamirano, Lesbia; Ascencio Rivera, Melanie; de Cosio, Gerardo; de Maza, Martha Dinorah; del Aguila, Roberto; Emanuel, Englebert; Gil, Enrique; Gough, Ethan; Jenkins, Valerie; Orellana, Patrícia; Palma, Ruben; Palomo, Ruben; Pastora, Martha; Peña, Rodolfo; Pineda, Elia; Rodriguez, Bismark; Tacsan, Luis; Thompson, Loraine; Villagra, Lucy

2012-01-01

OBJECTIVE The increasing burdens of obesity and diabetes are two of the most prominent threats to the health of populations of developed and developing countries alike. The Central America Diabetes Initiative (CAMDI) is the first study to examine the prevalence of diabetes in Central America. RESEARCH DESIGN AND METHODS The CAMDI survey was a cross-sectional survey based on a probabilistic sample of the noninstitutionalized population of five Central American populations conducted between 2003 and 2006. The total sample population was 10,822, of whom 7,234 (67%) underwent anthropometry measurement and a fasting blood glucose or 2-h oral glucose tolerance test. RESULTS The total prevalence of diabetes was 8.5%, but was higher in Belize (12.9%) and lower in Honduras (5.4%). Of the screened population, 18.6% had impaired glucose tolerance/impaired fasting glucose. CONCLUSIONS As this population ages, the prevalence of diabetes is likely to continue to rise in a dramatic and devastating manner. Preventive strategies must be quickly introduced. PMID:22323417
Microbial populations in contaminant plumes

USGS Publications Warehouse

Haack, S.K.; Bekins, B.A.

2000-01-01

Efficient biodegradation of subsurface contaminants requires two elements: (1) microbial populations with the necessary degradative capabilities, and (2) favorable subsurface geochemical and hydrological conditions. Practical constraints on experimental design and interpretation in both the hydrogeological and microbiological sciences have resulted in limited knowledge of the interaction between hydrogeological and microbiological features of subsurface environments. These practical constraints include: (1) inconsistencies between the scales of investigation in the hydrogeological and microbiological sciences, and (2) practical limitations on the ability to accurately define microbial populations in environmental samples. However, advances in application of small-scale sampling methods and interdisciplinary approaches to site investigations are beginning to significantly improve understanding of hydrogeological and microbiological interactions. Likewise, culture-based and molecular analyses of microbial populations in subsurface contaminant plumes have revealed significant adaptation of microbial populations to plume environmental conditions. Results of recent studies suggest that variability in subsurface geochemical and hydrological conditions significantly influences subsurface microbial-community structure. Combined investigations of site conditions and microbial-community structure provide the knowledge needed to understand interactions between subsurface microbial populations, plume geochemistry, and contaminant biodegradation.
Predictors of Disordered Eating in Adolescence and Young Adulthood: A Population-Based, Longitudinal Study of Females and Males in Norway

ERIC Educational Resources Information Center

Abebe, Dawit Shawel; Torgersen, Leila; Lien, Lars; Hafstad, Gertrud S.; von Soest, Tilmann

2014-01-01

We investigated longitudinal predictors for disordered eating from early adolescence to young adulthood (12-34 years) across gender and different developmental phases among Norwegian young people. Survey data from a population-based sample were collected at four time points (T) over a 13-year time span. A population-based sample of 5,679 females…
Hierarchical models of animal abundance and occurrence

USGS Publications Warehouse

Royle, J. Andrew; Dorazio, R.M.

2006-01-01

Much of animal ecology is devoted to studies of abundance and occurrence of species, based on surveys of spatially referenced sample units. These surveys frequently yield sparse counts that are contaminated by imperfect detection, making direct inference about abundance or occurrence based on observational data infeasible. This article describes a flexible hierarchical modeling framework for estimation and inference about animal abundance and occurrence from survey data that are subject to imperfect detection. Within this framework, we specify models of abundance and detectability of animals at the level of the local populations defined by the sample units. Information at the level of the local population is aggregated by specifying models that describe variation in abundance and detection among sites. We describe likelihood-based and Bayesian methods for estimation and inference under the resulting hierarchical model. We provide two examples of the application of hierarchical models to animal survey data, the first based on removal counts of stream fish and the second based on avian quadrat counts. For both examples, we provide a Bayesian analysis of the models using the software WinBUGS.
Y-chromosomal diversity of the Valachs from the Czech Republic: model for isolated population in Central Europe

PubMed Central

Ehler, Edvard; Vaněk, Daniel; Stenzl, Vlastimil; Vančata, Václav

2011-01-01

Aim To evaluate Y-chromosomal diversity of the Moravian Valachs of the Czech Republic and compare them with a Czech population sample and other samples from Central and South-Eastern Europe, and to evaluate the effects of genetic isolation and sampling. Methods The first sample set of the Valachs consisted of 94 unrelated male donors from the Valach region in northeastern Czech Republic border-area. The second sample set of the Valachs consisted of 79 men who originated from 7 paternal lineages defined by surname. No close relatives were sampled. The third sample set consisted of 273 unrelated men from the whole of the Czech Republic and was used for comparison, as well as published data for 27 other populations. The total number of samples was 3244. Y-short tandem repeat (STR) markers were typed by standard methods using PowerPlex® Y System (Promega) and Yfiler® Amplification Kit (Applied Biosystems) kits. Y-chromosomal haplogroups were estimated from the haplotype information. Haplotype diversity and other intra- and inter-population statistics were computed. Results The Moravian Valachs showed a lower genetic variability of Y-STR markers than other Central European populations, more closely resembling the isolated Balkan populations (Aromuns, Csango, Bulgarian, and Macedonian Roma) than the surrounding populations (Czechs, Slovaks, Poles, Saxons). We illustrated the effect of sampling on Valach paternal lineages, which includes reduction of discrimination capacity and variability inside Y-chromosomal haplogroups. The Valach modal haplotype belongs to the R1a haplogroup and was not detected in the Czech population. Conclusion The Moravian Valachs display strong substructure and isolation in their Y chromosomal markers. They represent a unique Central European population model for population genetics. PMID:21674832
Worldwide F(ST) estimates relative to five continental-scale populations.

PubMed

Steele, Christopher D; Court, Denise Syndercombe; Balding, David J

2014-11-01

We estimate the population genetics parameter FST (also referred to as the fixation index) from short tandem repeat (STR) allele frequencies, comparing many worldwide human subpopulations at approximately the national level with continental-scale populations. FST is commonly used to measure population differentiation, and is important in forensic DNA analysis to account for remote shared ancestry between a suspect and an alternative source of the DNA. We estimate FST comparing subpopulations with a hypothetical ancestral population, which is the approach most widely used in population genetics, and also compare a subpopulation with a sampled reference population, which is more appropriate for forensic applications. Both estimation methods are likelihood-based, in which FST is related to the variance of the multinomial-Dirichlet distribution for allele counts. Overall, we find low FST values, with posterior 97.5 percentiles < 3% when comparing a subpopulation with the most appropriate population, and even for inter-population comparisons we find FST < 5%. These are much smaller than single nucleotide polymorphism-based inter-continental FST estimates, and are also about half the magnitude of STR-based estimates from population genetics surveys that focus on distinct ethnic groups rather than a general population. Our findings support the use of FST up to 3% in forensic calculations, which corresponds to some current practice.
DNA motif alignment by evolving a population of Markov chains.

PubMed

Bi, Chengpeng

2009-01-30

Deciphering cis-regulatory elements or de novo motif-finding in genomes still remains elusive although much algorithmic effort has been expended. The Markov chain Monte Carlo (MCMC) method such as Gibbs motif samplers has been widely employed to solve the de novo motif-finding problem through sequence local alignment. Nonetheless, the MCMC-based motif samplers still suffer from local maxima like EM. Therefore, as a prerequisite for finding good local alignments, these motif algorithms are often independently run a multitude of times, but without information exchange between different chains. Hence it would be worth a new algorithm design enabling such information exchange. This paper presents a novel motif-finding algorithm by evolving a population of Markov chains with information exchange (PMC), each of which is initialized as a random alignment and run by the Metropolis-Hastings sampler (MHS). It is progressively updated through a series of local alignments stochastically sampled. Explicitly, the PMC motif algorithm performs stochastic sampling as specified by a population-based proposal distribution rather than individual ones, and adaptively evolves the population as a whole towards a global maximum. The alignment information exchange is accomplished by taking advantage of the pooled motif site distributions. A distinct method for running multiple independent Markov chains (IMC) without information exchange, or dubbed as the IMC motif algorithm, is also devised to compare with its PMC counterpart. Experimental studies demonstrate that the performance could be improved if pooled information were used to run a population of motif samplers. The new PMC algorithm was able to improve the convergence and outperformed other popular algorithms tested using simulated and biological motif sequences.
Recalibration of the Klales et al. (2012) method of sexing the human innominate for Mexican populations.

PubMed

Gómez-Valdés, Jorge A; Menéndez Garmendia, Antinea; García-Barzola, Lizbeth; Sánchez-Mejorada, Gabriela; Karam, Carlos; Baraybar, José Pablo; Klales, Alexandra

2017-03-01

The aim of this study was to test the accuracy of the Klales et al. (2012) equation for sex estimation in a contemporary Mexican population. Our investigation was carried out on a sample of 203 left innominates of identified adult skeletons from the UNAM-Collection and the Santa María Xigui Cemetery, in Central Mexico. The original Klales equation produces a bias against males in sex estimation (86-92% accuracy versus 100% accuracy in females). Based on these results, the Klales et al. (2012) method was recalibrated with a new cut-off point for sex estimation in contemporary Mexican populations. The results show cross-validated classification accuracy rates as high as 100% after recalibrating the original logistic regression equation. Recalibration improved classification accuracy and eliminated sex bias. This new formula will improve sex estimation for Mexican contemporary populations. © 2017 Wiley Periodicals, Inc.

Clinical evaluation of high-risk HPV detection on self-samples using the indicating FTA-elute solid-carrier cartridge.

PubMed

Geraets, D T; van Baars, R; Alonso, I; Ordi, J; Torné, A; Melchers, W J G; Meijer, C J L M; Quint, W G V

2013-06-01

High-risk human papillomavirus (hrHPV) testing in cervical screening is usually performed on physician-taken cervical smears in liquid-based medium. However, solid-state specimen carriers allow easy, non-hazardous storage and transportation and might be suitable for self-collection by non-responders in screening and in low-resource settings. We evaluated the adequacy of self-collected cervicovaginal (c/v) samples using a Viba-brush stored on an Indicating FTA-elute cartridge (FTA-based self-sampling) for hrHPV testing in women referred to a gynecology clinic due to an abnormal smear. 182 women accepted to self-collect a c/v sample. After self-sampling, a physician obtained a conventional liquid-based cervical smear. Finally, women were examined by colposcopy and a biopsy was taken when clinically indicated. Self-samples required only simple DNA elution, and DNA was extracted from physician-obtained samples. Both samples were tested for 14 hrHPVs by GP5+/6+-EIA-LQ Test and SPF(10)-DEIA-LiPA(25). Both assays detected significantly more hrHPV in physician-collected specimens than in self-collected samples (75.3% and 67.6% by SPF(10); 63.3% and 53.3% by GP5+/6+, respectively). The combination of physician-collected specimen and GP5+/6+ testing demonstrated the optimal balance in sensitivity (98.0%) and specificity (48.1%) for CIN2+ detection in this referral population. A test system of FTA-based self-collection and SPF(10) hrHPV detection approached this sensitivity (95.9%) and specificity (42.9%). These results show that the clinical performance of hrHPV detection is determined by both the sample collection system and the test method. FTA-based self-collection with SPF(10) testing might be valuable when a liquid-based medium cannot be used, but requires further investigation in screening populations. Copyright © 2013 Elsevier B.V. All rights reserved.
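The sensitivity and specificity figures quoted in the abstract above follow the usual 2x2 definitions. The sketch below shows that arithmetic only; the function name and the counts in the usage line are hypothetical illustrations and are not taken from the study.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from 2x2 diagnostic counts.

    tp: diseased and test-positive    fn: diseased and test-negative
    tn: healthy and test-negative     fp: healthy and test-positive
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one diseased and one non-diseased case")
    sensitivity = tp / (tp + fn)  # share of true cases the test detects
    specificity = tn / (tn + fp)  # share of non-cases the test clears
    return sensitivity, specificity


# Hypothetical counts, chosen only to make the arithmetic easy to follow.
sens, spec = sensitivity_specificity(tp=90, fn=10, tn=40, fp=60)
print(f"sensitivity={sens:.1%}, specificity={spec:.1%}")
```

In a referral population such as the one described, most women are hrHPV positive, so a high sensitivity paired with a modest specificity is the expected pattern.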
International Space Station environmental microbiome - microbial inventories of ISS filter debris.

PubMed

Venkateswaran, Kasthuri; Vaishampayan, Parag; Cisneros, Jessica; Pierson, Duane L; Rogers, Scott O; Perry, Jay

2014-01-01

Despite an expanding array of molecular approaches for detecting microorganisms in a given sample, rapid and robust means of assessing the differential viability of the microbial cells, as a function of phylogenetic lineage, remain elusive. A propidium monoazide (PMA) treatment coupled with downstream quantitative polymerase chain reaction (qPCR) and pyrosequencing analyses was carried out to better understand the frequency, diversity, and distribution of viable microorganisms associated with debris collected from the crew quarters of the International Space Station (ISS). Cultured bacterial counts in the ISS samples were higher than cultured fungal counts. The rapid molecular analyses targeted to estimate the viable population showed a 5-fold higher bacterial (qPCR-PMA assay) and a 25-fold higher microbial (adenosine triphosphate assay) burden than the cultured bacterial population. The ribosomal nucleic acid-based identification of cultivated strains revealed the presence of only four to eight bacterial species in the ISS samples; however, the viable bacterial diversity detected by the PMA-pyrosequencing method was far greater (12 to 23 bacterial taxa), with the majority consisting of members of actinobacterial genera (Propionibacterium, Corynebacterium) and Staphylococcus. Sample fractions not treated with PMA (inclusive of both live and dead cells) yielded a great abundance of highly diverse bacterial (94 to 118 taxa) and fungal lineages (41 taxa). Even though the deep sequencing capability of the molecular analysis widened the understanding of the microbial diversity, the cultivation assay also proved to be essential since some of the spore-forming microorganisms were detected only by the culture-based method. Presented here are the findings of the first comprehensive effort to assess the viability of microbial cells associated with ISS surfaces, and correlate differential viability with phylogenetic affiliation.
A pilot study of a computer-assisted cell-phone interview (CACI) methodology to survey respondents in households without telephones about alcohol use.

PubMed

Wilkins, Chris; Casswell, Sally; Barnes, Helen Moewaka; Pledger, Megan

2003-06-01

An intrinsic drawback with the use of a computer-assisted telephone interview (CATI) survey methodology is that people who live in households without a connected landline telephone are excluded from the survey sample. This paper presents a pilot of the feasibility of a computer-assisted cell-phone interview (CACI) methodology designed to survey people living in households without a telephone about alcohol use and be compatible with a larger telephone based alcohol sample. The CACI method was found to be an efficient and cost competitive method to reach non-telephone households. Telephone ownership was found to make a difference to the typical occasion amount of alcohol consumed, with respondents from households without telephones drinking significantly more than those with telephones even when consumption levels were controlled for socio-economic status. Although high levels of telephone ownership in the general population mean these differences may not have any impact on population alcohol measures they may be important in sub-populations where telephone ownership is lower.
Change-in-ratio methods for estimating population size

USGS Publications Warehouse

Udevitz, Mark S.; Pollock, Kenneth H.; McCullough, Dale R.; Barrett, Reginald H.

2002-01-01

Change-in-ratio (CIR) methods can provide an effective, low cost approach for estimating the size of wildlife populations. They rely on being able to observe changes in proportions of population subclasses that result from the removal of a known number of individuals from the population. These methods were first introduced in the 1940’s to estimate the size of populations with 2 subclasses under the assumption of equal subclass encounter probabilities. Over the next 40 years, closed population CIR models were developed to consider additional subclasses and use additional sampling periods. Models with assumptions about how encounter probabilities vary over time, rather than between subclasses, also received some attention. Recently, all of these CIR models have been shown to be special cases of a more general model. Under the general model, information from additional samples can be used to test assumptions about the encounter probabilities and to provide estimates of subclass sizes under relaxations of these assumptions. These developments have greatly extended the applicability of the methods. CIR methods are attractive because they do not require the marking of individuals, and subclass proportions often can be estimated with relatively simple sampling procedures. However, CIR methods require a carefully monitored removal of individuals from the population, and the estimates will be of poor quality unless the removals induce substantial changes in subclass proportions. In this paper, we review the state of the art for closed population estimation with CIR methods. Our emphasis is on the assumptions of CIR methods and on identifying situations where these methods are likely to be effective. We also identify some important areas for future CIR research.
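As an illustration of the two-subclass case described above, the sketch below implements the classical closed-population CIR estimator, N1 = (Rx - R * p2) / (p1 - p2), where p1 and p2 are the proportions of subclass x before and after the removal, Rx is the number of x-type animals removed, and R is the total number removed. The function name and the example numbers are hypothetical; the formula assumes equal encounter probabilities for both subclasses and a known removal, as the abstract notes.

```python
def cir_population_estimate(p1, p2, removed_x, removed_total):
    """Estimate pre-removal population size with the two-subclass CIR formula.

    p1, p2        : observed proportion of subclass x before / after removal
    removed_x     : known number of subclass-x individuals removed
    removed_total : known total number of individuals removed
    """
    if p1 == p2:
        # No change in ratio: the estimator is undefined, and a small
        # change gives a very imprecise estimate.
        raise ValueError("removal must change the subclass proportion")
    return (removed_x - removed_total * p2) / (p1 - p2)


# Hypothetical example: 40% of animals are subclass x before the removal,
# 200 animals (all of subclass x) are removed, and 25% are x afterwards.
n1 = cir_population_estimate(p1=0.40, p2=0.25, removed_x=200, removed_total=200)
print(round(n1))
```

The denominator p1 - p2 makes the abstract's caveat concrete: when the removal barely shifts the subclass proportion, small sampling errors in p1 and p2 are greatly amplified in the estimate.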
Estimating population abundance and mapping distribution of wintering sea ducks in coastal waters of the mid-Atlantic

USGS Publications Warehouse

Koneff, M.D.; Royle, J. Andrew; Forsell, D.J.; Wortham, J.S.; Boomer, G.S.; Perry, M.C.

2005-01-01

Survey design for wintering scoters (Melanitta sp.) and other sea ducks that occur in offshore waters is challenging because these species have large ranges, are subject to distributional shifts among years and within a season, and can occur in aggregations. Interest in winter sea duck population abundance surveys has grown in recent years. This interest stems from concern over the population status of some sea ducks, limitations of extant breeding waterfowl survey programs in North America and logistical challenges and costs of conducting surveys in northern breeding regions, high winter area philopatry in some species and potential conservation implications, and increasing concern over offshore development and other threats to sea duck wintering habitats. The efficiency and practicality of statistically-rigorous monitoring strategies for mobile, aggregated wintering sea duck populations have not been sufficiently investigated. This study evaluated a 2-phase adaptive stratified strip transect sampling plan to estimate wintering population size of scoters, long-tailed ducks (Clangula hyemalis), and other sea ducks and provide information on distribution. The sampling plan results in an optimal allocation of a fixed sampling effort among offshore strata in the U.S. mid-Atlantic coast region. Phase 1 transect selection probabilities were based on historic distribution and abundance data, while Phase 2 selection probabilities were based on observations made during Phase 1 flights. Distance sampling methods were used to estimate detection rates. Environmental variables thought to affect detection rates were recorded during the survey and post-stratification and covariate modeling were investigated to reduce the effect of heterogeneity on detection estimation. We assessed cost-precision tradeoffs under a number of fixed-cost sampling scenarios using Monte Carlo simulation. We discuss advantages and limitations of this sampling design for estimating wintering sea duck abundance and mapping distribution and suggest improvements for future surveys.
Kinetic modeling of PET-FDG in the brain without blood sampling.

PubMed

Bentourkia, M'hamed

2006-12-01

The aim in this work is to report a new method to calculate parametric images from a single scan acquisition with positron emission tomography (PET) and fluorodeoxyglucose (FDG) in the human brain without blood sampling. It is usually practical for research or clinical purposes to inject the patient in an isolated room and to start the PET acquisition only for some 10-20 min, about 30 min after FDG injection. In order to calculate the cerebral metabolic rates for glucose (CMRG), usually several blood samples are required. The proposed method considers the relation between the uptake of the tracer in the cerebellum as a reference tissue and the population based input curve. Similar results were obtained for CMRG values with the present method in comparison to the usual autoradiographic and the non-linear least squares fitting of regions of interest.
A microplate reader-based method to quantify NADH-cytochrome b5 reductase activity for diagnosis of recessive congenital methaemoglobinemia.

PubMed

Kedar, Prabhakar; Desai, Anand; Warang, Prashant; Colah, Roshan

2017-05-01

Congenital methemoglobinemia due to NADH-cytochrome b5 reductase 3 (CYB5R3) deficiencies is an autosomal recessive disorder that occurs sporadically worldwide. A sensitive, accurate, and rapid analysis of NADH-CYB5R enzyme concentrations is necessary for the diagnosis of recessive congenital methemoglobinemia (RCM). Here we present an alternative microplate method that is based on a standard 96-well microplate format and a microplate reader and that simplifies the quantification of NADH-CYB5R activity. A TECAN (Infinite 200 PRO series) microplate reader with Tecan's proven Magellan™ software measured the NADH-CYB5R enzyme activity in 250 normal controls and 25 previously diagnosed cases of RCM due to NADH-CYB5R deficiency in the Indian population, using 96-well microplates with 200 μl of total reaction mixture; the results were also compared with the standard spectrophotometric assay. We also studied the stability of the hemolysate stored at 4 and -20°C. Enzyme activity in all 25 samples ranged from 6.09 to 10.07 IU/g Hb (mean ± SD: 8.08 ± 1.99 IU/g Hb), whereas that of the normal controls (n = 250) ranged between 13.42 and 21.58 IU/g Hb (mean ± SD: 17.5 ± 4.08 IU/g Hb). Data obtained from the microplate reader were compared with the standard spectrophotometer method and showed 100% concordance between the two methods. The microplate method allows differentiation between normal, deficient and intermediate enzyme activity. Samples showed a significant loss of activity when stored at 4°C and retained stable activity at -20°C for 1 week. Our new method, which incorporates the whole enzyme assay process into a microplate format, is readily applicable and allows rapid monitoring of the enzyme assay. It is readily applicable to quantitative assays on pediatric samples as well as to large numbers of samples for population screening.
Diagnosis of glutaric aciduria type 1 by measuring 3-hydroxyglutaric acid in dried urine spots by liquid chromatography tandem mass spectrometry.

PubMed

Al-Dirbashi, Osama Y; Kölker, Stefan; Ng, Dione; Fisher, Lawrence; Rupar, Tony; Lepage, Nathalie; Rashed, Mohamed S; Santa, Tomofumi; Goodman, Stephen I; Geraghty, Michael T; Zschocke, Johannes; Christensen, Ernst; Hoffmann, Georg F; Chakraborty, Pranesh

2011-02-01

Accumulation of glutaric acid (GA) and 3-hydroxyglutaric acid (3HGA) in body fluids is the biochemical hallmark of type 1 glutaric aciduria (GA1), a disorder characterized by acute striatal degeneration and a subsequent dystonia. To date, methods for quantification of 3HGA are mainly based on stable isotope dilution gas chromatography mass spectrometry (GC-MS) and require extensive sample preparation. Here we describe a simple liquid chromatography tandem MS (LC-MS/MS) method to quantify this important metabolite in dried urine spots (DUS). This method is based on derivatization with 4-[2-(N,N-dimethylamino)ethylaminosulfonyl]-7-(2-aminoethylamino)-2,1,3-benzoxadiazole (DAABD-AE). Derivatization was adopted to improve the chromatographic and mass spectrometric properties of the studied analytes. Derivatization was performed directly on a 3.2-mm disc of DUS as a sample without extraction. Sample mixture was heated at 60°C for 45 min, and 5 μl of the reaction solution was analyzed by LC-MS/MS. Reference ranges obtained were in excellent agreement with the literature. The method was applied retrospectively for the analysis of DUS samples from established low- and high-excreter GA1 patients as well as controls (n = 100). Comparison of results obtained versus those obtained by GC-MS was satisfactory (n = 14). In populations with a high risk of GA1, this approach will be useful as a primary screening method for high- or low-excreter variants. In these populations, however, DUS analysis should not be implemented before completing a parallel comparative study with the standard screening method (i.e., molecular testing). In addition, follow-up DUS GA and 3HGA testing of babies with elevated dried blood spot C5DC acylcarnitines will be useful as a first-tier diagnostic test, thus reducing the number of cases requiring enzymatic and molecular analyses to establish or refute the diagnosis of GA1.
Prevalence of Dementia and Cognitive Complaints in the Context of High Cognitive Reserve: A Population-Based Study

PubMed Central

Perquin, Magali; Diederich, Nico; Pastore, Jessica; Lair, Marie-Lise; Stranges, Saverio; Vaillant, Michel

2015-01-01

Objectives This study aimed to assess the prevalence of dementia and cognitive complaints in a cross-sectional sample of Luxembourg seniors, and to discuss the results in the societal context of high cognitive reserve resulting from multilingualism. Methods A population sample of 1,377 people representative of Luxembourg residents aged over 64 years was initially identified via the national social insurance register. There were three different levels of contribution: full participation in the study, partial participation, and non-participation. We examined the profiles of these three different samples so that we could infer the prevalence estimates in the Luxembourgish senior population as a whole using the prevalence estimates obtained in this study. Results After careful attention to potential bias and to the possibility of underestimation, we considered the obtained prevalence estimates of 3.8% for dementia (with corresponding 95% confidence limits (CL) of 2.8% and 4.8%) and 26.1% for cognitive complaints (CL = [17.8–34.3]) as trustworthy. Conclusion Based on these findings, we postulate that high cognitive reserve may result in surprisingly low prevalence estimates of cognitive complaints and dementia in adults over the age of 64 years, which thereby corroborates the longer disability-free life expectancy observed in the Luxembourg population. To the best of our knowledge, this study is the first to report such Luxembourgish public health data. PMID:26390288
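For readers unfamiliar with how confidence limits of this kind are commonly obtained, the sketch below computes a normal-approximation (Wald) 95% interval for a proportion. It is not the authors' method, which the abstract does not specify. The denominator of 1,377 used in the example is an assumption made for illustration only, since the abstract does not state how many people contributed to each estimate.

```python
import math


def wald_interval(p, n, z=1.96):
    """Normal-approximation confidence interval for a proportion.

    p : observed proportion (between 0 and 1)
    n : number of people the proportion is based on
    z : standard normal quantile (1.96 for a 95% interval)
    """
    if not 0.0 <= p <= 1.0 or n <= 0:
        raise ValueError("p must be in [0, 1] and n must be positive")
    half_width = z * math.sqrt(p * (1.0 - p) / n)
    return p - half_width, p + half_width


# Assumed denominator (illustration only): the 1,377 sampled residents.
low, high = wald_interval(p=0.038, n=1377)
print(f"{low:.1%} to {high:.1%}")
```

The much wider limits reported for cognitive complaints suggest that estimate rests on a smaller effective sample or a different weighting, which a simple Wald interval on the full sample would not reproduce.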
Point-Sampling and Line-Sampling Probability Theory, Geometric Implications, Synthesis

Treesearch

L.R. Grosenbaugh

1958-01-01

Foresters concerned with measuring tree populations on definite areas have long employed two well-known methods of representative sampling. In list or enumerative sampling the entire tree population is tallied with a known proportion being randomly selected and measured for volume or other variables. In area sampling all trees on randomly located plots or strips...
Recruiting hard-to-reach United States population sub-groups via adaptations of snowball sampling strategy

PubMed Central

Sadler, Georgia Robins; Lee, Hau-Chen; Seung-Hwan Lim, Rod; Fullerton, Judith

2011-01-01

Nurse researchers and educators often engage in outreach to narrowly defined populations. This article offers examples of how variations on the snowball sampling recruitment strategy can be applied in the creation of culturally appropriate, community-based information dissemination efforts related to recruitment to health education programs and research studies. Examples from the primary author’s program of research are provided to demonstrate how adaptations of snowball sampling can be effectively used in the recruitment of members of traditionally underserved or vulnerable populations. The adaptation of snowball sampling techniques, as described in this article, helped the authors to gain access to each of the more vulnerable population groups of interest. The use of culturally sensitive recruitment strategies is both appropriate and effective in enlisting the involvement of members of vulnerable populations. Adaptations of snowball sampling strategies should be considered when recruiting participants for education programs or subjects for research studies when recruitment of a population based sample is not essential. PMID:20727089
Recruitment of hard-to-reach population subgroups via adaptations of the snowball sampling strategy.

PubMed

Sadler, Georgia Robins; Lee, Hau-Chen; Lim, Rod Seung-Hwan; Fullerton, Judith

2010-09-01

Nurse researchers and educators often engage in outreach to narrowly defined populations. This article offers examples of how variations on the snowball sampling recruitment strategy can be applied in the creation of culturally appropriate, community-based information dissemination efforts related to recruitment to health education programs and research studies. Examples from the primary author's program of research are provided to demonstrate how adaptations of snowball sampling can be used effectively in the recruitment of members of traditionally underserved or vulnerable populations. The adaptation of snowball sampling techniques, as described in this article, helped the authors to gain access to each of the more-vulnerable population groups of interest. The use of culturally sensitive recruitment strategies is both appropriate and effective in enlisting the involvement of members of vulnerable populations. Adaptations of snowball sampling strategies should be considered when recruiting participants for education programs or for research studies when the recruitment of a population-based sample is not essential.
Population Structure, Diversity and Reproductive Mode of the Grape Phylloxera (Daktulosphaira vitifoliae) across Its Native Range

PubMed Central

Walker, M. Andrew

2017-01-01

Grape Phylloxera, Daktulosphaira vitifoliae, is a gall-forming insect that feeds on the leaves and roots of many Vitis species. The roots of the cultivated V. vinifera cultivars and hybrids are highly susceptible to grape phylloxera feeding damage. The native range of this insect covers most of North America, and it is particularly abundant in the eastern and central United States. Phylloxera was introduced from North America to almost all grape-growing regions across five of the temperate zone continents. It devastated vineyards in each of these regions causing large-scale disruptions to grape growers, wine makers and national economies. In order to understand the population diversity of grape phylloxera in its native range, more than 500 samples from 19 States and 34 samples from the introduced range (northern California, Europe and South America) were genotyped with 32 simple sequence repeat markers. STRUCTURE, a model-based clustering method, identified five populations within these samples. The five populations were confirmed by a neighbor-joining tree and principal coordinate analysis (PCoA). These populations were distinguished by their Vitis species hosts and their geographic locations. Samples collected from California, Europe and South America traced back to phylloxera sampled in the northeastern United States on V. riparia, with some influence from phylloxera collected along the Atlantic Coast and Central Plains on V. vulpina. Reproductive statistics conclusively confirmed that sexual reproduction is common in the native range and is combined with cyclical parthenogenesis. Native grape phylloxera populations were identified to be under Hardy-Weinberg equilibrium. The identification of admixed samples between many of these populations indicates that shared environments facilitate sexual reproduction between different host associated populations to create new genotypes of phylloxera. This study also found that assortative mating might occur across the sympatric range of the V. vulpina west and V. cinerea populations. PMID:28125736
Label-Free, Flow-Imaging Methods for Determination of Cell Concentration and Viability.

PubMed

Sediq, A S; Klem, R; Nejadnik, M R; Meij, P; Jiskoot, Wim

2018-05-30

To investigate the potential of two flow imaging microscopy (FIM) techniques (Micro-Flow Imaging (MFI) and FlowCAM) to determine total cell concentration and cell viability. B-lineage acute lymphoblastic leukemia (B-ALL) cells from two different donors were exposed to ambient conditions. Samples were taken on different days and measured with MFI, FlowCAM, hemocytometry and automated cell counting. Dead and live cells from a fresh B-ALL cell suspension were fractionated by flow cytometry in order to derive software filters based on morphological parameters of separate cell populations with MFI and FlowCAM. The filter sets were used to assess cell viability in the measured samples. All techniques gave fairly similar cell concentration values over the whole incubation period. MFI proved superior with respect to precision, whereas FlowCAM provided particle images with a higher resolution. Moreover, both FIM methods provided cell viability results similar to those of the conventional methods (hemocytometry and automated cell counting). FIM-based methods may be advantageous over conventional cell methods for determining total cell concentration and cell viability, as FIM measures much larger sample volumes, does not require labeling, is less laborious and provides images of individual cells.
Determining the Population Size of Pond Phytoplankton.

ERIC Educational Resources Information Center

Hummer, Paul J.

1980-01-01

Discusses methods for determining the population size of pond phytoplankton, including water sampling techniques, laboratory analysis of samples, and additional studies worthy of investigation in class or as individual projects. (CS)
Colonoscopy screening for colorectal cancer: the outcomes of two recruitment methods.

PubMed

Corbett, Mike; Chambers, Sharon L; Shadbolt, Bruce; Hillman, Lybus C; Taupin, Doug

2004-10-18

To determine the response to colorectal cancer (CRC) screening by colonoscopy, through direct invitation or through invitation by general practitioners. Two-way comparison of randomised population sampling versus cluster sampling of a representative general practice population in the Australian Capital Territory, May 2002 to January 2004. Invitation to screen, assessment for eligibility, interview, and colonoscopy. 881 subjects aged 55-74 years were invited to screen: 520 from the electoral roll (ER) sample and 361 from the general practice (GP) cluster sample. Response rate, participation rate, and rate of adenomatous polyps in the screened group. Participation was similar in the ER arm (35.1%; 95% CI, 30.2%-40.3%) and the GP arm (40.1%; 95% CI, 29.2%-51.0%) after correcting for ineligibility, which was higher in the ER arm. Superior eligibility in the GP arm was offset by the labour of manual record review. Response rates after two invitations were similar for the two groups (ER arm: 78.8%; 95% CI, 75.1%-82.1%; GP arm: 81.7%; 95% CI, 73.8%-89.6%). Overall, 53.4% ineligibility arose from having a colonoscopy in the past 10 years (ER arm, 98/178; GP arm, 42/84). Of 231 colonoscopies performed, 229 were complete, with 32% of subjects screened having adenomatous polyps. Colonoscopy-based CRC screening yields similar response and participation rates with either random population sampling or general practice cluster sampling, with population sampling through the electoral roll providing greater ease of recruitment.
Optimizing the creation of base populations for aquaculture breeding programs using phenotypic and genomic data and its consequences on genetic progress.

PubMed

Fernández, Jesús; Toro, Miguel Á; Sonesson, Anna K; Villanueva, Beatriz

2014-01-01

The success of an aquaculture breeding program critically depends on the way in which the base population of breeders is constructed since all the genetic variability for the traits included originally in the breeding goal as well as those to be included in the future is contained in the initial founders. Traditionally, base populations were created from a number of wild strains by sampling equal numbers from each strain. However, for some aquaculture species improved strains are already available and, therefore, mean phenotypic values for economically important traits can be used as a criterion to optimize the sampling when creating base populations. Also, the increasing availability of genome-wide genotype information in aquaculture species could help to refine the estimation of relationships within and between candidate strains and, thus, to optimize the percentage of individuals to be sampled from each strain. This study explores the advantages of using phenotypic and genome-wide information when constructing base populations for aquaculture breeding programs in terms of initial and subsequent trait performance and genetic diversity level. Results show that a compromise solution between diversity and performance can be found when creating base populations. Up to 6% higher levels of phenotypic performance can be achieved at the same level of global diversity in the base population by optimizing the selection of breeders instead of sampling equal numbers from each strain. The higher performance observed in the base population persisted during 10 generations of phenotypic selection applied in the subsequent breeding program.
Systematic review of the use of online questionnaires of older adults.

PubMed

Remillard, Meegan L; Mazor, Kathleen M; Cutrona, Sarah L; Gurwitz, Jerry H; Tjia, Jennifer

2014-04-01

To describe methodological approaches to population targeting and sampling and to summarize limitations of Internet-based questionnaires in older adults. Systematic literature review. Studies using online questionnaires in older adult populations. English-language articles using search terms for geriatric, age 65 and over, Internet survey, online survey, Internet questionnaire, and online questionnaire in PubMed and EBSCO host between 1984 and July 2012. Inclusion criteria were study population mean age 65 and older and use of an online questionnaire for research. Review of 336 abstracts yielded 14 articles for full review by two investigators; 11 articles met inclusion criteria. Articles were extracted for study design and setting, participant characteristics, recruitment strategy, country, and study limitations. Eleven articles were published after 2001. Studies had populations with a mean age of 65 to 78, included descriptive and analytical designs, and were conducted in the United States, Australia, and Japan. Recruiting methods varied widely from paper fliers and personal e-mails to use of consumer marketing panels. Investigator-reported study limitations included the use of small convenience samples and limited generalizability. Online questionnaires are a feasible method of surveying older adults in some geographic regions and for some subsets of older adults, but limited Internet access constrains recruiting methods and often limits study generalizability. © 2014, Copyright the Authors Journal compilation © 2014, The American Geriatrics Society.
[Use of psychoactive substances and contraceptive methods by the Brazilian urban population, 2005].

PubMed

Bastos, Francisco I; Cunha, Cynthia B; Bertoni, Neilane

2008-06-01

To analyze the relationship between utilization patterns for condoms and other contraceptive methods and the consumption of alcohol and drugs. Exploratory study based on data from a probabilistic sample of 5,040 interviewees aged 16 to 65 years living in large urban regions of Brazil in 2005. The data were collected by means of questionnaires. The chi-square automatic interaction classification tree technique was used to study the use of condoms among interviewees of both sexes and other contraceptive methods among women, at the time of the last vaginal sexual intercourse. Among young and middle-aged adults of both sexes and young men in stable relationships, condom use was less frequent among those who said they used psychoactive substances (alcohol and/or illegal drugs). The possible modulating effect of psychoactive substances on contraceptive practices among mature women seems to be more straightforward, compared to the more subtle effects observed among younger women, for whom the different social classes they belonged to seemed to play a more important role. Despite the limitations resulting from an exploratory study, the fact that this was a representative sample of the urban population of Brazil and not from vulnerable populations, reinforces the need to implement integrated public policies directed towards the general population, with regard to preventing drug consumption, alcohol abuse, sexually transmitted infections, HIV/AIDS and unwanted pregnancy and promoting sexual and reproductive health.
Lincoln estimates of mallard (Anas platyrhynchos) abundance in North America.

PubMed

Alisauskas, Ray T; Arnold, Todd W; Leafloor, James O; Otis, David L; Sedinger, James S

2014-01-01

Estimates of range-wide abundance, harvest, and harvest rate are fundamental for sound inferences about the role of exploitation in the dynamics of free-ranging wildlife populations, but reliability of existing survey methods for abundance estimation is rarely assessed using alternative approaches. North American mallard populations have been surveyed each spring since 1955 using internationally coordinated aerial surveys, but population size can also be estimated with Lincoln's method using banding and harvest data. We estimated late summer population size of adult and juvenile male and female mallards in western, midcontinent, and eastern North America using Lincoln's method of dividing (i) total estimated harvest, [Formula: see text], by estimated harvest rate, [Formula: see text], calculated as (ii) direct band recovery rate, [Formula: see text], divided by the (iii) band reporting rate, [Formula: see text]. Our goal was to compare estimates based on Lincoln's method with traditional estimates based on aerial surveys. Lincoln estimates of adult males and females alive in the period June-September were 4.0 (range: 2.5-5.9), 1.8 (range: 0.6-3.0), and 1.8 (range: 1.3-2.7) times larger than respective aerial survey estimates for the western, midcontinent, and eastern mallard populations, and the two population estimates were only modestly correlated with each other (western: r = 0.70, 1993-2011; midcontinent: r = 0.54, 1961-2011; eastern: r = 0.50, 1993-2011). Higher Lincoln estimates are predictable given that the geographic scope of inference from Lincoln estimates is the entire population range, whereas sampling frames for aerial surveys are incomplete. Although each estimation method has a number of important potential biases, our review suggests that underestimation of total population size by aerial surveys is the most likely explanation. 
In addition to providing measures of total abundance, Lincoln's method provides estimates of fecundity and population sex ratio and could be used in integrated population models to provide greater insights about population dynamics and management of North American mallards and most other harvested species.
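
The arithmetic of Lincoln's method described in this abstract can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function name `lincoln_estimate` and all input numbers are hypothetical, chosen only to show how the three quantities combine.

```python
def lincoln_estimate(total_harvest, direct_recovery_rate, reporting_rate):
    """Lincoln-style abundance estimate (illustrative sketch).

    harvest rate = direct band recovery rate / band reporting rate
    abundance    = total estimated harvest / harvest rate
    """
    if not 0 < reporting_rate <= 1:
        raise ValueError("reporting_rate must be in (0, 1]")
    if not 0 < direct_recovery_rate <= 1:
        raise ValueError("direct_recovery_rate must be in (0, 1]")
    harvest_rate = direct_recovery_rate / reporting_rate
    return total_harvest / harvest_rate


# Hypothetical inputs, not values from the study.
n_hat = lincoln_estimate(
    total_harvest=1_000_000,      # birds harvested (assumed)
    direct_recovery_rate=0.06,    # bands recovered in the first season (assumed)
    reporting_rate=0.75,          # share of recovered bands that get reported (assumed)
)
print(round(n_hat))
```

With these made-up inputs the implied harvest rate is 0.08, so the estimate is the harvest divided by 0.08. Uncertainty in each input would need to be propagated separately; the sketch returns a point estimate only.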

USING A COMMERCIAL TELEPHONE DIRECTORY TO IDENTIFY A POPULATION-BASED SAMPLE OF WOMEN OF REPRODUCTIVE AGE

EPA Science Inventory

Using a commercial telephone directory to identify a population-based sample of women of reproductive age
*DT Lobdell, GM Buck, JM Weiner, P Mendola (United States Environmental Protection Agency, Research Triangle Park, NC 27711)

In the United States, sampling women o...
Brief communication: the relation between standard error of the estimate and sample size of histomorphometric aging methods.

PubMed

Hennig, Cheryl; Cooper, David

2011-08-01

Histomorphometric aging methods report varying degrees of precision, measured through Standard Error of the Estimate (SEE). These techniques have been developed from variable sample sizes (n) and the impact of n on reported aging precision has not been rigorously examined in the anthropological literature. This brief communication explores the relation between n and SEE through a review of the literature (abstracts, articles, book chapters, theses, and dissertations), predictions based upon sampling theory and a simulation. Published SEE values for age prediction, derived from 40 studies, range from 1.51 to 16.48 years (mean 8.63; sd: 3.81 years). In general, these values are widely distributed for smaller samples and the distribution narrows as n increases, a pattern expected from sampling theory. For the two studies that have samples in excess of 200 individuals, the SEE values are very similar (10.08 and 11.10 years) with a mean of 10.59 years. Assuming this mean value is a 'true' characterization of the error at the population level, the 95% confidence intervals for SEE values from samples of 10, 50, and 150 individuals are on the order of ± 4.2, 1.7, and 1.0 years, respectively. While numerous sources of variation potentially affect the precision of different methods, the impact of sample size cannot be overlooked. The uncertainty associated with SEE values derived from smaller samples complicates the comparison of approaches based upon different methodology and/or skeletal elements. Meaningful comparisons require larger samples than have frequently been used and should ideally be based upon standardized samples. Copyright © 2011 Wiley-Liss, Inc.
Detecting microbial dysbiosis associated with Pediatric Crohn’s disease despite the high variability of the gut microbiota

PubMed Central

Wang, Feng; Kaplan, Jess L.; Gold, Benjamin D.; Bhasin, Manoj K.; Ward, Naomi L.; Kellermayer, Richard; Kirschner, Barbara S.; Heyman, Melvin B.; Dowd, Scot E.; Cox, Stephen B.; Dogan, Haluk; Steven, Blaire; Ferry, George D.; Cohen, Stanley A.; Baldassano, Robert N.; Moran, Christopher J.; Garnett, Elizabeth A.; Drake, Lauren; Otu, Hasan H.; Mirny, Leonid A.; Libermann, Towia A.; Winter, Harland S.; Korolev, Kirill

2016-01-01

SUMMARY The relationship between the host and its microbiota is challenging to understand because both microbial communities and their environment are highly variable. We developed a set of techniques to address this challenge based on population dynamics and information theory. These methods identified additional bacterial taxa associated with pediatric Crohn's disease and could detect significant changes in microbial communities with fewer samples than previous statistical approaches. We also substantially improved the accuracy of the diagnosis based on the microbiota from stool samples and found that the ecological niche of a microbe predicts its role in Crohn’s disease. Bacteria typically residing in the lumen of healthy patients decrease in disease while bacteria typically residing on the mucosa of healthy patients increase in disease. Our results also show that the associations with Crohn’s disease are evolutionarily conserved and provide a mutual-information-based method to visualize dysbiosis. PMID:26804920
Monitoring population exposure to pesticides based on liquid chromatography-tandem mass spectrometry measurement of their urinary metabolites in urban wastewater: A novel biomonitoring approach.

PubMed

Rousis, Nikolaos I; Zuccato, Ettore; Castiglioni, Sara

2016-11-15

Biomonitoring studies have documented the high exposure of the population to pesticides which are widely used for crop protection, industrial and household purposes. This is the first study which has measured human urinary metabolites of pesticides in urban wastewater as biomarkers of population exposure. A liquid chromatography-tandem mass spectrometry (LC-MS/MS) method was developed to measure fifteen urinary metabolites selected from the major classes of pesticides. Raw wastewater samples were processed by solid phase extraction (SPE) or direct injection into the LC-MS/MS system. Recoveries ranged from 75 to 115% and the limits of quantification were 1-15ng/L for the SPE method and 40-800ng/L for direct injection. The method was employed for the analysis of 44 composite 24-h wastewater samples collected in seven Italian cities. Most of the target substances were detected at concentrations ranging from 1.1ng/L to 1.6μg/L. The highest concentrations were for some common metabolites of alkyl phosphates and pyrethroids and the specific metabolite of chlorpyrifos and chlorpyrifos-methyl (3,5,6-trichloro-2-pyridinol). The frequency of detection and abundance of most of the measured metabolites were in line with the profiles reported in urine biomonitoring studies. This method is therefore proposed as a novel "biomonitoring approach" for obtaining objective, direct information on the levels of exposure of a specific population to pesticides, and current research is addressed to validate the method identifying the most reliable biomarkers. Copyright © 2016 Elsevier B.V. All rights reserved.
Relationship between blood manganese and blood pressure in the Korean general population according to KNHANES 2008

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lee, Byung-Kook; Kim, Yangho

Introduction: We present data on the association of manganese (Mn) level with hypertension in a representative sample of the adult Korean population who participated in the Korean National Health and Nutrition Examination Survey (KNHANES) 2008. Methods: This study was based on the data obtained by KNHANES 2008, which was conducted for three years (2007-2009) using a rolling sampling design involving a complex, stratified, multistage, probability-cluster survey of a representative sample of the noninstitutionalized civilian population of South Korea. Results: Multiple regression analysis after controlling for covariates, including gender, age, regional area, education level, smoking, drinking status, hemoglobin, and serum creatinine, showed that the beta coefficients of log blood Mn were 3.514, 1.878, and 2.517 for diastolic blood pressure, and 3.593, 2.449, and 2.440 for systolic blood pressure in female, male, and all participants, respectively. Multiple regression analysis including three other blood metals, lead, mercury, and cadmium, revealed no significant effects of the three metals on blood pressure and showed no effect on the association between blood Mn and blood pressure. In addition, doubling the blood Mn increased the risk of hypertension 1.828, 1.573, and 1.567 fold in women, men, and all participants, respectively, after adjustment for covariates. The addition of blood lead, mercury, and cadmium as covariates did not affect the association between blood Mn and the prevalence of hypertension. Conclusion: Blood Mn level was associated with an increased risk of hypertension in a representative sample of the Korean adult population. Highlights: We showed the association of manganese with hypertension in the Korean population. This study was based on the data obtained by KNHANES 2008. Blood manganese level was associated with an increased risk of hypertension.
Sleep Complaints in the Adult Brazilian Population: A National Survey Based on Screening Questions

PubMed Central

Bittencourt, Lia Rita A.; Santos-Silva, Rogerio; Taddei, Jose A.; Andersen, Monica L.; de Mello, Marco T.; Tufik, Sergio

2009-01-01

Study Objectives: The aim of the current survey was to investigate the prevalence of sleep complaints in a randomized cluster sample of the Brazilian population. Methods: A 3-stage cluster sampling technique was utilized to randomly select Brazilian subjects older than 16 years, of both genders and all socioeconomic classes. The final sample of 2,110 subjects from 150 different cities was enough to estimate prevalence in the Brazilian population with a sampling error of ± 2%. Questions about sleep complaints were administered face-to-face by Instituto Datafolha interviewers on March 26 and 27, 2008. Data were expanded using a weighted variable. Results: Of all interviewed subjects, 63% reported at least one sleep-related complaint. Sleep complaint prevalence increased with age and was similar among inhabitants of different Brazilian regions, as well as between metropolitan areas and smaller cities. Insomnia and nightmares were significantly more prevalent in women (40% and 25%, respectively), and snoring was more prevalent in men (35%). For sleep complaints with frequencies greater than 3 times per week, we found the following prevalence: 61% for snoring, 35% for insomnia, 17% for nightmares, 53% for leg kicking, and 37% for breathing pauses. Conclusions: Because sleep disorders affect a high proportion of the population and are known to be correlated with decreased well-being and productivity, more detailed national surveys are necessary to provide relevant information to develop approaches to prevention and treatment. Citation: Bittencourt LRA; Santos-Silva R; Taddei JA; Andersen ML; de Mello MT; Tufik S. Sleep complaints in the adult Brazilian population: a national survey based on screening questions. J Clin Sleep Med 2009;5(5):459-463. PMID:19961032
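
The stated sampling error of about ± 2% for 2,110 subjects is consistent with the textbook margin of error for a proportion. The sketch below is a rough check under simplifying assumptions that are not taken from the study: simple random sampling (no design effect from the 3-stage clustering), the worst-case proportion p = 0.5, and a 95% confidence level. The function name `margin_of_error` is hypothetical.

```python
import math


def margin_of_error(n, p=0.5, z=1.96):
    """Half-width of a normal-approximation confidence interval for a proportion.

    Assumes simple random sampling; a cluster design would usually widen
    this by a design effect that is not modeled here.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    return z * math.sqrt(p * (1.0 - p) / n)


moe = margin_of_error(2110)
print(f"{moe:.3f}")  # about 0.021, i.e. roughly 2 percentage points
```

A real analysis of a clustered survey would replace this with a design-based variance estimate.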
Estimating parasitic sea lamprey abundance in Lake Huron from heterogenous data sources

USGS Publications Warehouse

Young, Robert J.; Jones, Michael L.; Bence, James R.; McDonald, Rodney B.; Mullett, Katherine M.; Bergstedt, Roger A.

2003-01-01

The Great Lakes Fishery Commission uses time series of transformer, parasitic, and spawning population estimates to evaluate the effectiveness of its sea lamprey (Petromyzon marinus) control program. This study used an inverse variance weighting method to integrate Lake Huron sea lamprey population estimates derived from two estimation procedures: 1) prediction of the lake-wide spawning population from a regression model based on stream size and, 2) whole-lake mark and recapture estimates. In addition, we used a re-sampling procedure to evaluate the effect of trading off sampling effort between the regression and mark-recapture models. Population estimates derived from the regression model ranged from 132,000 to 377,000 while mark-recapture estimates of marked recently metamorphosed juveniles and parasitic sea lampreys ranged from 536,000 to 634,000 and 484,000 to 1,608,000, respectively. The precision of the estimates varied greatly among estimation procedures and years. The integrated estimate of the mark-recapture and spawner regression procedures ranged from 252,000 to 702,000 transformers. The re-sampling procedure indicated that the regression model is more sensitive to reduction in sampling effort than the mark-recapture model. Reliance on either the regression or mark-recapture model alone could produce misleading estimates of abundance of sea lampreys and the effect of the control program on sea lamprey abundance. These analyses indicate that the precision of the lakewide population estimate can be maximized by re-allocating sampling effort from marking sea lampreys to trapping additional streams.
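
Inverse variance weighting, the integration method named in this abstract, combines independent estimates by weighting each one by the reciprocal of its variance. The sketch below shows the generic technique only. It is not the authors' implementation, the function name `inverse_variance_combine` is hypothetical, and the two estimates and their standard errors are invented for illustration.

```python
def inverse_variance_combine(estimates, variances):
    """Combine independent estimates using inverse-variance weights.

    Returns the weighted mean and its variance (1 / sum of weights).
    """
    if len(estimates) != len(variances) or not estimates:
        raise ValueError("need equally sized, non-empty inputs")
    if any(v <= 0 for v in variances):
        raise ValueError("variances must be positive")
    weights = [1.0 / v for v in variances]
    total_weight = sum(weights)
    combined = sum(w * x for w, x in zip(weights, estimates)) / total_weight
    return combined, 1.0 / total_weight


# Hypothetical inputs: a regression-based and a mark-recapture estimate.
regression_est, regression_se = 300_000.0, 50_000.0
recapture_est, recapture_se = 600_000.0, 100_000.0

combined, combined_var = inverse_variance_combine(
    [regression_est, recapture_est],
    [regression_se ** 2, recapture_se ** 2],
)
print(round(combined), round(combined_var ** 0.5))
```

The more precise estimate dominates the result, and the combined variance is smaller than either input variance, which is why precision of the integrated estimate depends on how sampling effort is split between the two procedures.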
Horvitz-Thompson survey sample methods for estimating large-scale animal abundance

USGS Publications Warehouse

Samuel, M.D.; Garton, E.O.

1994-01-01

Large-scale surveys to estimate animal abundance can be useful for monitoring population status and trends, for measuring responses to management or environmental alterations, and for testing ecological hypotheses about abundance. However, large-scale surveys may be expensive and logistically complex. To ensure resources are not wasted on unattainable targets, the goals and uses of each survey should be specified carefully and alternative methods for addressing these objectives always should be considered. During survey design, the importance of each survey error component (spatial design, proportion of detected animals, precision in detection) should be considered carefully to produce a complete statistically based survey. Failure to address these three survey components may produce population estimates that are inaccurate (biased low), have unrealistic precision (too precise) and do not satisfactorily meet the survey objectives. Optimum survey design requires trade-offs in these sources of error relative to the costs of sampling plots and detecting animals on plots, considerations that are specific to the spatial logistics and survey methods. The Horvitz-Thompson estimators provide a comprehensive framework for considering all three survey components during the design and analysis of large-scale wildlife surveys. Problems of spatial and temporal (especially survey to survey) heterogeneity in detection probabilities have received little consideration, but failure to account for heterogeneity produces biased population estimates. The goal of producing unbiased population estimates is in conflict with the increased variation from heterogeneous detection in the population estimate. One solution to this conflict is to use an MSE-based approach to achieve a balance between bias reduction and increased variation. 
Further research is needed to develop methods that address spatial heterogeneity in detection, evaluate the effects of temporal heterogeneity on survey objectives and optimize decisions related to survey bias and variance. Finally, managers and researchers involved in the survey design process must realize that obtaining the best survey results requires an interactive and recursive process of survey design, execution, analysis and redesign. Survey refinements will be possible as further knowledge is gained on the actual abundance and distribution of the population and on the most efficient techniques for detecting animals.
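
The core of a Horvitz-Thompson estimator is to divide each observed count by the probability that it was observed. The sketch below is a simplified illustration, not the estimators developed in the paper. It assumes each sampled plot has a known inclusion probability and a known detection probability, and the function name `horvitz_thompson_total` and all numbers are hypothetical.

```python
def horvitz_thompson_total(counts, inclusion_probs, detection_probs=None):
    """Estimate total abundance from plot counts.

    Each count is inflated by 1 / (plot inclusion probability x detection
    probability). With detection_probs omitted, perfect detection is assumed.
    """
    if detection_probs is None:
        detection_probs = [1.0] * len(counts)
    if not (len(counts) == len(inclusion_probs) == len(detection_probs)):
        raise ValueError("inputs must have equal length")
    total = 0.0
    for count, incl, det in zip(counts, inclusion_probs, detection_probs):
        if not (0 < incl <= 1 and 0 < det <= 1):
            raise ValueError("probabilities must be in (0, 1]")
        total += count / (incl * det)
    return total


# Hypothetical survey: three sampled plots.
counts = [12, 8, 20]               # animals seen on each plot
inclusion_probs = [0.1, 0.1, 0.2]  # chance each plot entered the sample
detection_probs = [0.8, 0.8, 0.5]  # chance an animal present was detected

n_hat = horvitz_thompson_total(counts, inclusion_probs, detection_probs)
n_hat_naive = horvitz_thompson_total(counts, inclusion_probs)
print(round(n_hat), round(n_hat_naive))
```

Comparing the two results shows the low bias the abstract warns about: ignoring imperfect detection yields a smaller total than the detection-adjusted estimate. In practice the detection probabilities are themselves estimated, which adds variance that this point estimate does not capture.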
Chemiluminescent microparticle immunoassay based detection and prevalence of HCV infection in district Peshawar Pakistan

PubMed Central

2014-01-01

Background Because of the high rate of asymptomatic infections, an advanced screening assay is urgently needed for the clinical diagnosis of HCV. Early detection of anti-HCV is the first step in the management of chronic hepatitis and in the selection of patients needing treatment. In the current study we used, for the first time, an advanced serological diagnostic technique, Chemiluminescent Microparticle Immuno Assay (CMIA), for the detection of HCV infection in Peshawar, Pakistan. Methods A total of 982 samples were collected from members of the general public living in different areas of district Peshawar. The samples were centrifuged at high speed to obtain a clear supernatant serum. All the samples were run on the Architect system, a fully automated immunoanalyzer based on CMIA technology. Results Out of 982 blood samples analyzed in this study, 160 (15.9%) were confirmed to be positive for active HCV infection. The overall prevalence was found to be 13.4%. Gender-wise prevalence was higher in males (19.1%) than in females (12.7%). The age group 21-30 years was identified as the highest risk group among the studied population. Conclusion Among the tested samples, the overall prevalence of active HCV infection was found to be 13.4% in the general population of Peshawar, Pakistan. The young middle-aged population of this region was at higher risk of HCV ailments compared with the other age groups. PMID:25016473
Exploring cardiovascular health: the Healthy Life in Suriname (HELISUR) study. A protocol of a cross-sectional study

PubMed Central

Diemer, Frederieke S; Aartman, Jet Q; Karamat, Fares A; Baldew, Sergio M; Jarbandhan, Ameerani V; van Montfrans, Gert A; Oehlers, Glenn P; Brewster, Lizzy M

2014-01-01

Introduction Obesity, hypertension and diabetes are on a dramatic rise in low-income and middle-income countries, and this foretells an overwhelming increase in chronic disease burden from cardiovascular disease. Therefore, rapid action should be taken through preventive population-based programmes. However, in these regions, data on the population distribution of cardiovascular risk factors, and of intermediate and final end points for cardiovascular disease are scarce. The Healthy Life in Suriname (HELISUR) study is a cardiovascular population study in Suriname, which is part of the Caribbean Community. The HELISUR study is dedicated to provide data on risk factors and prevalent cardiovascular disease in the multiethnic population, which is mainly of African and Asian descent. Methods and analysis In a cross-sectional, observational population-based setting, a random representative sample of 1800 citizens aged between 18 and 70 years will be selected using a cluster household sampling method. Self-reported demographic, socioeconomic and (cardiovascular) health-related data will be collected. Physical examination will include the assessment of cardiovascular risk factors and prevalent cardiovascular disease. In addition, we will study cardiovascular haemodynamics non-invasively, as a novel intermediate outcome. Finally, fasting blood and overnight urine samples will be collected to monitor cardiometabolic risk factors. The main outcome will be descriptive in reporting the prevalence of risk factors and measures of (sub) clinical end organ damage, stratified for ethnicity and sex-age groups. Ethics and dissemination Ethical approval has been obtained from the State Secretary of Health. Data analysis and manuscript submission are scheduled for 2016. Findings will be disseminated in peer-reviewed journals, and at national, regional and international scientific meetings. 
Importantly, data will be presented to Surinamese policymakers and healthcare workers, to develop preventive strategies to combat the rapid rise of cardiovascular disease. PMID:25537786
Unlocking Diversity in Germplasm Collections via Genomic Selection: A Case Study Based on Quantitative Adult Plant Resistance to Stripe Rust in Spring Wheat.

PubMed

Muleta, Kebede T; Bulli, Peter; Zhang, Zhiwu; Chen, Xianming; Pumphrey, Michael

2017-11-01

Harnessing diversity from germplasm collections is more feasible today because of the development of lower-cost and higher-throughput genotyping methods. However, the cost of phenotyping is still generally high, so efficient methods of sampling and exploiting useful diversity are needed. Genomic selection (GS) has the potential to enhance the use of desirable genetic variation in germplasm collections through predicting the genomic estimated breeding values (GEBVs) for all traits that have been measured. Here, we evaluated the effects of various scenarios of population genetic properties and marker density on the accuracy of GEBVs in the context of applying GS for wheat (Triticum aestivum L.) germplasm use. Empirical data for adult plant resistance to stripe rust (Puccinia striiformis f. sp. tritici) collected on 1163 spring wheat accessions and genotypic data based on the wheat 9K single nucleotide polymorphism (SNP) iSelect assay were used for various genomic prediction tests. Unsurprisingly, the results of the cross-validation tests demonstrated that prediction accuracy increased with an increase in training population size and marker density. It was evident that using all the available markers (5619) was unnecessary for capturing the trait variation in the germplasm collection, with no further gain in prediction accuracy beyond 1 SNP per 3.2 cM (∼1850 markers), which is close to the linkage disequilibrium decay rate in this population. Collectively, our results suggest that larger germplasm collections may be efficiently sampled via lower-density genotyping methods, whereas genetic relationships between the training and validation populations remain critical when exploiting GS to select from germplasm collections. Copyright © 2017 Crop Science Society of America.
Walking and the Preservation of Cognitive Function in Older Populations

ERIC Educational Resources Information Center

Prohaska, Thomas R.; Eisenstein, Amy R.; Satariano, William A.; Hunter, Rebecca; Bayles, Constance M.; Kurtovich, Elaine; Kealey, Melissa; Ivey, Susan L.

2009-01-01

Purpose: This cross-sectional study takes a unique look at the association between patterns of walking and cognitive functioning by examining whether older adults with mild cognitive impairment differ in terms of the community settings where they walk and the frequency, intensity, or duration of walking. Design and Methods: The sample was based on…
Late Language Emergence in 24-Month-Old Twins: Heritable and Increased Risk for Late Language Emergence in Twins

ERIC Educational Resources Information Center

Rice, Mabel L.; Zubrick, Stephen R.; Taylor, Catherine L.; Gayán, Javier; Bontempo, Daniel E.

2014-01-01

Purpose: This study investigated the etiology of late language emergence (LLE) in 24-month-old twins, considering possible twinning, zygosity, gender, and heritability effects for vocabulary and grammar phenotypes. Method: A population-based sample of 473 twin pairs participated. Multilevel modeling estimated means and variances of vocabulary and…
Analysis of Sampling Methodologies for Noise Pollution Assessment and the Impact on the Population.

PubMed

Rey Gozalo, Guillermo; Barrigón Morillas, Juan Miguel

2016-05-11

Today, noise pollution is an increasing environmental stressor. Noise maps are recognised as the main tool for assessing and managing environmental noise, but their accuracy largely depends on the sampling method used. The sampling methods most commonly used by different researchers (grid, legislative road types and categorisation methods) were analysed and compared using the city of Talca (Chile) as a test case. The results show that the stratification of sound values in road categories has a significantly lower prediction error and a higher capacity for discrimination and prediction than in the legislative road types used by the Ministry of Transport and Telecommunications in Chile. Also, the use of one or another method implies significant differences in the assessment of population exposure to noise pollution. Thus, the selection of a suitable method for performing noise maps through measurements is essential to achieve an accurate assessment of the impact of noise pollution on the population.
Sequence-Based Genotyping for Marker Discovery and Co-Dominant Scoring in Germplasm and Populations

PubMed Central

Truong, Hoa T.; Ramos, A. Marcos; Yalcin, Feyruz; de Ruiter, Marjo; van der Poel, Hein J. A.; Huvenaars, Koen H. J.; Hogers, René C. J.; van Enckevort, Leonora. J. G.; Janssen, Antoine; van Orsouw, Nathalie J.; van Eijk, Michiel J. T.

2012-01-01

Conventional marker-based genotyping platforms are widely available, but not without their limitations. In this context, we developed Sequence-Based Genotyping (SBG), a technology for simultaneous marker discovery and co-dominant scoring, using next-generation sequencing. SBG offers users several advantages including a generic sample preparation method, a highly robust genome complexity reduction strategy to facilitate de novo marker discovery across entire genomes, and a uniform bioinformatics workflow strategy to achieve genotyping goals tailored to individual species, regardless of the availability of a reference sequence. The most distinguishing features of this technology are the ability to genotype any population structure, regardless of whether parental data is included, and the ability to co-dominantly score SNP markers segregating in populations. To demonstrate the capabilities of SBG, we performed marker discovery and genotyping in Arabidopsis thaliana and lettuce, two plant species of diverse genetic complexity and backgrounds. Initially we obtained 1,409 SNPs for Arabidopsis and 5,583 SNPs for lettuce. Further filtering of the SNP dataset produced over 1,000 high quality SNP markers for each species. We obtained a genotyping rate of 201.2 genotypes/SNP and 58.3 genotypes/SNP for Arabidopsis (n = 222 samples) and lettuce (n = 87 samples), respectively. Linkage mapping using these SNPs resulted in stable map configurations. We have therefore shown that the SBG approach presented provides users with the utmost flexibility in garnering high quality markers that can be directly used for genotyping and downstream applications. Until advances and costs allow for routine whole-genome sequencing of populations, we expect that sequence-based genotyping technologies such as SBG will be essential for genotyping of model and non-model genomes alike. PMID:22662172
[Study on ITS sequences of Aconitum vilmorinianum and its medicinal adulterant].

PubMed

Zhang, Xiao-nan; Du, Chun-hua; Fu, De-huan; Gao, Li; Zhou, Pei-jun; Wang, Li

2012-09-01

This study analyzed and compared the ITS sequences of Aconitum vilmorinianum and its medicinal adulterant Aconitum austroyunnanense. Total genomic DNA was extracted from sample materials by an improved CTAB method; ITS sequences were amplified by PCR, directly sequenced, and analyzed using the software DNAStar, ClustalX 1.81 and MEGA 4.0. In the ITS1 sequences, 299 consistent sites, 19 variable sites and 13 informative sites were found; in the 5.8S sequences, 162 consistent sites, 2 variable sites and 1 informative site; and in the ITS2 sequences, 217 consistent sites, 3 variable sites and 1 informative site. Comparison of the ITS sequence data matrix showed no base transitions or transversions in the 5.8S sequences, 2 transitions and 1 transversion in the ITS1 sequences, and only 1 transversion in the ITS2 sequences. By analyzing the ITS sequence data matrix from 2 populations of Aconitum vilmorinianum and 3 populations of Aconitum austroyunnanense, we found a stable informative site at the 596th base in the ITS2 sequences: in all samples of Aconitum vilmorinianum the base was C, and in all samples of Aconitum austroyunnanense the base was A. Aconitum vilmorinianum and Aconitum austroyunnanense can be distinguished by their ITS sequence characters, and the ITS1 sequences contain more variable sites than the ITS2 sequences.
Genetic analysis of floating Enteromorpha prolifera in the Yellow Sea with AFLP marker

NASA Astrophysics Data System (ADS)

Liu, Cui; Zhang, Jing; Sun, Xiaoyu; Li, Jian; Zhang, Xi; Liu, Tao

2011-09-01

Extremely large accumulations of the green alga Enteromorpha prolifera have floated along China's Yellow Sea coast since the summer of 2008. Amplified Fragment Length Polymorphism (AFLP) analysis was applied to assess the genetic diversity and relationships among E. prolifera samples collected from 9 affected areas of the Yellow Sea. Two hundred reproducible fragments were generated with 8 AFLP primer combinations, of which 194 (97%) were polymorphic. The average Nei's genetic diversity, the coefficient of genetic differentiation (Gst), and the average gene flow estimated from Gst in the 9 populations were 0.4018, 0.6404 and 0.2807, respectively. Cluster analysis based on the unweighted pair group method with arithmetic averages (UPGMA) showed that the genetic relationships within one population or among different populations were all related to their collecting locations and sampling time. Large genetic differentiation was detected among the populations. The E. prolifera samples originated from different areas and were in the process of mixing.
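As a worked check on the figures in this abstract, the reported gene flow (0.2807) is consistent with deriving Nm from Gst as Nm = 0.5(1 − Gst)/Gst. The abstract does not say which estimator the authors used, so the formula and the function name `gene_flow_from_gst` below are assumptions made for illustration only.

```python
def gene_flow_from_gst(gst: float) -> float:
    """Estimate gene flow (Nm) from Gst.

    ASSUMPTION: uses Nm = 0.5 * (1 - Gst) / Gst. The abstract does not
    state the estimator; this form simply reproduces the reported value.
    """
    if not 0.0 < gst <= 1.0:
        raise ValueError("Gst must lie in the interval (0, 1]")
    return 0.5 * (1.0 - gst) / gst


# Gst as reported in the abstract (rounded to four decimals there).
nm = gene_flow_from_gst(0.6404)
print(f"Nm is approximately {nm:.2f}")
```

Under this assumed formula the result is about 0.28, which agrees with the reported 0.2807 to within the rounding of Gst. A value well below 1 is in line with the abstract's statement that genetic differentiation among populations was large.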
Estimating abundance

USGS Publications Warehouse

Sutherland, Chris; Royle, Andy

2016-01-01

This chapter provides a non-technical overview of ‘closed population capture–recapture’ models, a class of well-established models that are widely applied in ecology, such as removal sampling, covariate models, and distance sampling. These methods are regularly adopted for studies of reptiles, in order to estimate abundance from counts of marked individuals while accounting for imperfect detection. Thus, the chapter describes some classic closed population models for estimating abundance, with considerations for some recent extensions that provide a spatial context for the estimation of abundance, and therefore density. Finally, the chapter suggests some software for use in data analysis, such as the Windows-based program MARK, and provides an example of estimating abundance and density of reptiles using an artificial cover object survey of Slow Worms (Anguis fragilis).
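To make the idea of a closed population capture–recapture estimate concrete, here is a minimal sketch of the simplest two-occasion case, using Chapman's bias-corrected form of the Lincoln–Petersen estimator. This is a textbook estimator chosen for illustration, not necessarily one of the models covered in the chapter, and the counts in the example are hypothetical rather than taken from the Slow Worm survey.

```python
def chapman_estimate(n1: int, n2: int, m2: int) -> float:
    """Chapman's bias-corrected Lincoln-Petersen abundance estimate.

    n1: animals caught and marked on the first occasion
    n2: animals caught on the second occasion
    m2: animals caught on the second occasion that were already marked
    Assumes a closed population (no births, deaths, or movement
    between the two occasions) and equal catchability.
    """
    if min(n1, n2, m2) < 0:
        raise ValueError("counts must be non-negative")
    if m2 > min(n1, n2):
        raise ValueError("recaptures cannot exceed either sample size")
    return (n1 + 1) * (n2 + 1) / (m2 + 1) - 1


# Hypothetical counts: 40 marked, 50 caught later, 10 of them marked.
n_hat = chapman_estimate(n1=40, n2=50, m2=10)
print(f"Estimated abundance: {n_hat:.1f}")
```

The recapture fraction m2/n2 stands in for the detection probability, which is how imperfect detection enters the abundance estimate. Spatial models of the kind mentioned in the abstract extend this idea by letting detection depend on the distance between an individual and a trap.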
Estimating abundance: Chapter 27

USGS Publications Warehouse

Royle, J. Andrew

2016-01-01

This chapter provides a non-technical overview of ‘closed population capture–recapture’ models, a class of well-established models that are widely applied in ecology, such as removal sampling, covariate models, and distance sampling. These methods are regularly adopted for studies of reptiles, in order to estimate abundance from counts of marked individuals while accounting for imperfect detection. Thus, the chapter describes some classic closed population models for estimating abundance, with considerations for some recent extensions that provide a spatial context for the estimation of abundance, and therefore density. Finally, the chapter suggests some software for use in data analysis, such as the Windows-based program MARK, and provides an example of estimating abundance and density of reptiles using an artificial cover object survey of Slow Worms (Anguis fragilis).
3-dimensional examination of the adult mouse subventricular zone reveals lineage-specific microdomains.

PubMed

Azim, Kasum; Fiorelli, Roberto; Zweifel, Stefan; Hurtado-Chong, Anahi; Yoshikawa, Kazuaki; Slomianka, Lutz; Raineteau, Olivier

2012-01-01

Recent studies suggest that the subventricular zone (SVZ) of the lateral ventricle is populated by heterogeneous populations of stem and progenitor cells that, depending on their exact location, are biased to acquire specific neuronal fates. This newly described heterogeneity of SVZ stem and progenitor cells underlines the necessity to develop methods for the accurate quantification of SVZ stem and progenitor subpopulations. In this study, we provide 3-dimensional topographical maps of slow cycling "stem" cells and progenitors based on their unique cell cycle properties. These maps revealed that both cell populations are present throughout the lateral ventricle wall as well as in discrete regions of the dorsal wall. Immunodetection of transcription factors expressed in defined progenitor populations further reveals that divergent lineages have clear regional enrichments in the rostro-caudal as well as in the dorso-ventral span of the lateral ventricle. Thus, progenitors expressing Tbr2 and Dlx2 were confined to dorsal and dorso-lateral regions of the lateral ventricle, respectively, while Mash1+ progenitors were more homogeneously distributed. All cell populations were enriched in the rostral-most region of the lateral ventricle. This diversity and uneven distribution greatly impede the accurate quantification of SVZ progenitor populations. This is illustrated by measuring the coefficient of error of estimates obtained by using increasing section sampling interval. Based on our empirical data, we provide such estimates for all progenitor populations investigated in this study. These can be used in future studies as guidelines to judge if the precision obtained with a sampling scheme is sufficient to detect statistically significant differences between experimental groups if a biological effect is present. 
Altogether, our study underlines the need to consider the SVZ of the lateral ventricle as a complex 3D structure and define methods to accurately assess neural stem cells or progenitor diversity and population sizes in physiological or experimental paradigms.
Long-term frozen storage of urine samples: a trouble to get PCR results in Schistosoma spp. DNA detection?

PubMed

Fernández-Soto, Pedro; Velasco Tirado, Virginia; Carranza Rodríguez, Cristina; Pérez-Arellano, José Luis; Muro, Antonio

2013-01-01

Human schistosomiasis remains a serious worldwide public health problem. At present, a sensitive and specific assay for routine diagnosis of schistosome infection is not yet available. The potential for detecting schistosome-derived DNA by PCR-based methods in human clinical samples is currently being investigated as a diagnostic tool with potential application in routine schistosomiasis diagnosis. Collection of diagnostic samples such as stool or blood is often difficult in some populations. Urine, by contrast, can be collected non-invasively, is easy to obtain from people of all ages, and is easy to handle, yet it is still not widely used as a sample for PCR diagnosis. This could be due to the high variability in reported detection efficiency, which results from wide variation in urine storage and handling conditions and in DNA preservation and extraction methods. We evaluated different commercial DNA extraction methods on a series of long-term frozen human urine samples from patients with parasitologically confirmed schistosomiasis in order to assess the effectiveness of PCR for Schistosoma spp. detection. Patients' urine samples had been frozen for 18 months to 7 years before use. Results were compared with those obtained in PCR assays using fresh healthy human urine artificially contaminated with Schistosoma mansoni DNA and urine samples from mice experimentally infected with S. mansoni cercariae stored frozen for at least 12 months before use. PCR on fresh, artificially contaminated human urine samples, using different DNA extraction methods, was much more effective than PCR on long-term frozen human urine samples used as the source of DNA template. Long-term frozen human urine samples are probably not a good source for DNA extraction for use as a template in PCR detection of Schistosoma spp., regardless of the DNA extraction method used.
Spatially explicit inference for open populations: estimating demographic parameters from camera-trap studies

USGS Publications Warehouse

Gardner, Beth; Reppucci, Juan; Lucherini, Mauro; Royle, J. Andrew

2010-01-01

We develop a hierarchical capture–recapture model for demographically open populations when auxiliary spatial information about location of capture is obtained. Such spatial capture–recapture data arise from studies based on camera trapping, DNA sampling, and other situations in which a spatial array of devices records encounters of unique individuals. We integrate an individual-based formulation of a Jolly-Seber type model with recently developed spatially explicit capture–recapture models to estimate density and demographic parameters for survival and recruitment. We adopt a Bayesian framework for inference under this model using the method of data augmentation which is implemented in the software program WinBUGS. The model was motivated by a camera trapping study of Pampas cats Leopardus colocolo from Argentina, which we present as an illustration of the model in this paper. We provide estimates of density and the first quantitative assessment of vital rates for the Pampas cat in the High Andes. The precision of these estimates is poor due likely to the sparse data set. Unlike conventional inference methods which usually rely on asymptotic arguments, Bayesian inferences are valid in arbitrary sample sizes, and thus the method is ideal for the study of rare or endangered species for which small data sets are typical.
Spatially explicit inference for open populations: estimating demographic parameters from camera-trap studies.

PubMed

Gardner, Beth; Reppucci, Juan; Lucherini, Mauro; Royle, J Andrew

2010-11-01

We develop a hierarchical capture-recapture model for demographically open populations when auxiliary spatial information about location of capture is obtained. Such spatial capture-recapture data arise from studies based on camera trapping, DNA sampling, and other situations in which a spatial array of devices records encounters of unique individuals. We integrate an individual-based formulation of a Jolly-Seber type model with recently developed spatially explicit capture-recapture models to estimate density and demographic parameters for survival and recruitment. We adopt a Bayesian framework for inference under this model using the method of data augmentation which is implemented in the software program WinBUGS. The model was motivated by a camera trapping study of Pampas cats Leopardus colocolo from Argentina, which we present as an illustration of the model in this paper. We provide estimates of density and the first quantitative assessment of vital rates for the Pampas cat in the High Andes. The precision of these estimates is poor due likely to the sparse data set. Unlike conventional inference methods which usually rely on asymptotic arguments, Bayesian inferences are valid in arbitrary sample sizes, and thus the method is ideal for the study of rare or endangered species for which small data sets are typical.
Probing messenger RNA conformational heterogeneity using single-molecule fluorescence anisotropy

NASA Astrophysics Data System (ADS)

Sinha, Deepak; Sastry, Srikanth; Shivashankar, G. V.

2006-03-01

In this letter we describe a method to probe biomolecular conformations and their dynamics at the single molecule level. We show, using fluorescence anisotropy based methods, that the hydrodynamic volume of biomolecules captures the intrinsic heterogeneity within a population. Population distributions of conformations and their dynamics are studied by making anisotropy measurements on one molecule at a time within a confocal volume. The mean anisotropy of mRNA is lowered on addition of salt while the spread remains the same. The intrinsic heterogeneity is revealed when conformational transitions are frozen, resulting in a drastic increase in the spread of the anisotropy. These studies reveal that mRNA samples a broad range of conformations.
Geospatial techniques for developing a sampling frame of watersheds across a region

USGS Publications Warehouse

Gresswell, Robert E.; Bateman, Douglas S.; Lienkaemper, George; Guy, T.J.

2004-01-01

Current land-management decisions that affect the persistence of native salmonids are often influenced by studies of individual sites that are selected based on judgment and convenience. Although this approach is useful for some purposes, extrapolating results to areas that were not sampled is statistically inappropriate because the sampling design is usually biased. Therefore, in recent investigations of coastal cutthroat trout (Oncorhynchus clarki clarki) located above natural barriers to anadromous salmonids, we used a methodology for extending the statistical scope of inference. The purpose of this paper is to apply geospatial tools to identify a population of watersheds and develop a probability-based sampling design for coastal cutthroat trout in western Oregon, USA. The population of mid-size watersheds (500-5800 ha) west of the Cascade Range divide was derived from watershed delineations based on digital elevation models. Because a database with locations of isolated populations of coastal cutthroat trout did not exist, a sampling frame of isolated watersheds containing cutthroat trout had to be developed. After the sampling frame of watersheds was established, isolated watersheds with coastal cutthroat trout were stratified by ecoregion and erosion potential based on dominant bedrock lithology (i.e., sedimentary and igneous). A stratified random sample of 60 watersheds was selected with proportional allocation in each stratum. By comparing watershed drainage areas of streams in the general population to those in the sampling frame and the resulting sample (n = 60), we were able to evaluate how representative the subset of watersheds was in relation to the population of watersheds. Geospatial tools provided a relatively inexpensive means to generate the information necessary to develop a statistically robust, probability-based sampling design.
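The stratified random sample with proportional allocation described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions: the stratum names, stratum sizes, and watershed identifiers are hypothetical (the abstract gives only the total sample size of 60), and the largest-remainder rounding rule is one reasonable choice rather than the authors' documented procedure.

```python
import random
from typing import Dict, List


def proportional_allocation(stratum_sizes: Dict[str, int], n: int) -> Dict[str, int]:
    """Split a total sample of n across strata in proportion to stratum size.

    Fractional quotas are rounded with the largest-remainder rule so the
    allocations always sum to exactly n (an assumed rounding convention).
    """
    total = sum(stratum_sizes.values())
    if not 0 <= n <= total:
        raise ValueError("n must be between 0 and the frame size")
    exact = {s: n * size / total for s, size in stratum_sizes.items()}
    alloc = {s: int(quota) for s, quota in exact.items()}
    leftover = n - sum(alloc.values())
    # Give the remaining units to the strata with the largest fractional parts.
    by_remainder = sorted(exact, key=lambda s: exact[s] - alloc[s], reverse=True)
    for s in by_remainder[:leftover]:
        alloc[s] += 1
    return alloc


def stratified_sample(frame: Dict[str, List[str]], n: int, seed: int = 0) -> Dict[str, List[str]]:
    """Draw a simple random sample without replacement inside each stratum."""
    rng = random.Random(seed)
    alloc = proportional_allocation({s: len(units) for s, units in frame.items()}, n)
    return {s: rng.sample(units, alloc[s]) for s, units in frame.items()}


# Hypothetical sampling frame: ecoregion crossed with bedrock lithology.
frame = {
    "ecoregion_A/sedimentary": [f"A-sed-{i}" for i in range(120)],
    "ecoregion_A/igneous": [f"A-ign-{i}" for i in range(80)],
    "ecoregion_B/sedimentary": [f"B-sed-{i}" for i in range(60)],
    "ecoregion_B/igneous": [f"B-ign-{i}" for i in range(40)],
}
sample = stratified_sample(frame, n=60)
for stratum, units in sample.items():
    print(stratum, len(units))
```

Because every watershed in the frame has a known, non-zero selection probability, results from the sample can be extrapolated to the frame, which is the point the abstract makes against judgment-based site selection.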
Baseline assessment of prevalence and geographical distribution of HPV types in Chile using self-collected vaginal samples

PubMed Central

Ferreccio, Catterina; Corvalán, Alejandro; Margozzini, Paula; Viviani, Paola; González, Claudia; Aguilera, Ximena; Gravitt, Patti E

2008-01-01

Background Chile has broad variations in weather, economics and population from the far desert north (Region 1) to the cold, icy south (Region 12). A home-based self-collected vaginal sampling was nested in the 2003 Chilean population-based health survey in order to explore the possibility of a type-specific geographical variation for human papillomavirus. Methods The population was a national probability sample of people 17 years of age and over. Consenting women provided self-collected cervicovaginal swabs in universal collection media (UCM). DNA was extracted and typed to 37 HPV genotypes using PGMY consensus PCR and line blot assay. Weighted prevalence rates and adjusted OR were calculated. Results Of the 1,883 women participating in the health survey, 1,219 (64.7%) provided a cervicovaginal sample and in 1,110 (56.2% of participants and 66.5% of those eligible) the samples were adequate for analysis. Refusal rate was 16.9%. HPV prevalence was 29.2% (15.1% high-risk HPV and 14.1% low-risk HPV). Predominant high-risk types were HPV 16, 52, 51, 56 and 58. Predominant low-risk HPVs were HPV 84, CP6108, 62, 53 and 61. High-risk and low-risk HPV rates were inversely correlated between the regions. High-risk HPV prevalence was highest among the youngest women, whereas low-risk HPV increased slightly with age. Conclusion Self-obtained vaginal sampling is adequate for monitoring HPV in the community, for identifying high-risk areas, and for surveying the long term impact of interventions. PMID:18304362
Respondent-driven sampling and the recruitment of people with small injecting networks.

PubMed

Paquette, Dana; Bryant, Joanne; de Wit, John

2012-05-01

Respondent-driven sampling (RDS) is a form of chain-referral sampling, similar to snowball sampling, which was developed to reach hidden populations such as people who inject drugs (PWID). RDS is said to reach members of a hidden population that may not be accessible through other sampling methods. However, less attention has been paid as to whether there are segments of the population that are more likely to be missed by RDS. This study examined the ability of RDS to capture people with small injecting networks. A study of PWID, using RDS, was conducted in 2009 in Sydney, Australia. The size of participants' injecting networks was examined by recruitment chain and wave. Participants' injecting network characteristics were compared to those of participants from a separate pharmacy-based study. A logistic regression analysis was conducted to examine the characteristics independently associated with having small injecting networks, using the combined RDS and pharmacy-based samples. In comparison with the pharmacy-recruited participants, RDS participants were almost 80% less likely to have small injecting networks, after adjusting for other variables. RDS participants were also more likely to have their injecting networks form a larger proportion of those in their social networks, and to have acquaintances as part of their injecting networks. Compared to those with larger injecting networks, individuals with small injecting networks were equally likely to engage in receptive sharing of injecting equipment, but less likely to have had contact with prevention services. These findings suggest that those with small injecting networks are an important group to recruit, and that RDS is less likely to capture these individuals.
A comparative evaluation between real time Roche COBas TAQMAN 48 HCV and bDNA Bayer Versant HCV 3.0.

PubMed

Giraldi, Cristina; Noto, Alessandra; Tenuta, Robert; Greco, Francesca; Perugini, Daniela; Spadafora, Mario; Bianco, Anna Maria Lo; Savino, Olga; Natale, Alfonso

2006-10-01

The HCV virus is a common human pathogen with a single-stranded RNA genome of 9600 nt. This work compared two different commercial methods used for HCV viral load, the bDNA Bayer Versant HCV 3.0 and the RealTime Roche COBAS TaqMan 48 HCV. We compared the reproducibility and linearity of the two methods. Seventy-five plasma samples with genotypes 1 to 4, which represent the population (45% genotype 1; 24% genotype 2; 13% genotype 3; 18% genotype 4), were directly processed with the Versant method, which is based upon signal amplification; the same samples were first extracted (COBAS Ampliprep - TNAI) and then amplified using RealTime PCR (COBAS TaqMan 48). The results indicate equivalent performance of the two methods on genotype 1 samples, but on samples with genotypes 2, 3 and 4 the Roche RealTime PCR method underestimated viral load relative to the Bayer bDNA assay.
The Impact of Using Different Methods to Assess Completeness of 24-Hour Urine Collection on Estimating Dietary Sodium.

PubMed

Wielgosz, Andreas; Robinson, Christopher; Mao, Yang; Jiang, Ying; Campbell, Norm R C; Muthuri, Stella; Morrison, Howard

2016-06-01

The standard for population-based surveillance of dietary sodium intake is 24-hour urine testing; however, this may be affected by incomplete urine collection. The impact of different indirect methods of assessing completeness of collection on estimated sodium ingestion has not been established. The authors enlisted 507 participants from an existing community study in 2009 to collect 24-hour urine samples. Several methods of assessing completeness of urine collection were tested. Mean sodium intake varied between 3648 mg/24 h and 7210 mg/24 h depending on the method used. Excluding urine samples collected for longer or shorter than 24 hours increased the estimated urine sodium excretion, even when corrections for the variation in timed collections were applied. Until an accurate method of indirectly assessing completeness of urine collection is identified, the gold standard of administering para-aminobenzoic acid is recommended. Efforts to ensure participants collect complete urine samples are also warranted. ©2015 Wiley Periodicals, Inc.
The role of stressful life events preceding death by suicide: Evidence from two samples of suicide decedents.

PubMed

Buchman-Schmitt, Jennifer M; Chu, Carol; Michaels, Matthew S; Hames, Jennifer L; Silva, Caroline; Hagan, Christopher R; Ribeiro, Jessica D; Selby, Edward A; Joiner, Thomas E

2017-10-01

Stressful life events (SLEs) are associated with increased risk for suicidal behavior. Less is known regarding the intensity of SLEs and how this may vary as a function of suicide attempt history. As a large percentage of suicide decedents do not have a history of suicidal behavior, SLEs precipitating suicide may help characterize suicidality in this understudied population. This paper examines the intensity, number, and accumulation of SLEs preceding death by suicide among decedents with varying suicide attempt histories. Suicide attempts, SLEs, and suicide methods were examined in two samples: 62 prison-based and 117 community-based suicide decedents. Regression was used to compare the level of stressor precipitating death by suicide in decedents who died on a first attempt versus multiple previous attempts. A non-significant trend was observed in the prison population which was supported by significant findings in the community-based sample. Decedents who died on a first attempt experienced a stressor of a lower magnitude when compared to decedents with multiple previous suicide attempts. We discuss the implications of these findings in relation to the stress-diathesis model for suicide. Copyright © 2017 Elsevier B.V. All rights reserved.
Temporal and social contexts of heroin-using populations. An illustration of the snowball sampling technique.

PubMed

Kaplan, C D; Korf, D; Sterk, C

1987-09-01

Snowball sampling is a method that has been used in the social sciences to study sensitive topics, rare traits, personal networks, and social relationships. The method involves the selection of samples utilizing "insider" knowledge and referral chains among subjects who possess common traits that are of research interest. It is especially useful in generating samples for which clinical sampling frames may be difficult to obtain or are biased in some way. In this paper, snowball samples of heroin users in two Dutch cities have been analyzed for the purpose of providing descriptions and limited inferences about the temporal and social contexts of their lifestyles. Two distinct heroin-using populations have been discovered who are distinguished by their life cycle stage. Significant contextual explanations have been found involving the passage from adolescent peer group to criminal occupation, the functioning of network "knots" and "outcroppings," and the frequency of social contact. It is suggested that the snowball sampling method may have utility in studying the temporal and social contexts of other populations of clinical interest.
Monitoring for airborne allergens

DOE Office of Scientific and Technical Information (OSTI.GOV)

Burge, H.A.

1992-07-01

Monitoring for allergens can provide some information on the kinds and levels of exposure experienced by local patient populations, provided that volumetric methods are used for sample collection and that analysis is accurate and consistent. Such data can also be used to develop standards for the specific environment and to begin to develop predictive models. Comparing outdoor allergen aerosols between different monitoring sites requires identical collection and analysis methods and some kind of rational standard, whether arbitrary or based on recognized health effects. 32 references.
COMPARISON OF SAMPLING TECHNIQUES USED IN STUDYING LEPIDOPTERA POPULATION DYNAMICS

EPA Science Inventory

Four methods (light traps, foliage samples, canvas bands, and gypsy moth egg mass surveys) that are used to study the population dynamics of foliage-feeding Lepidoptera were compared for 10 species, including gypsy moth, Lymantria dispar L. Samples were collected weekly at 12 sit...
Characterization of Aspergillus section Nigri species populations in vineyard soil using droplet digital PCR

USDA-ARS's Scientific Manuscript database

Identification of populations of Aspergillus section Nigri species in environmental samples using traditional methods is laborious and impractical for large numbers of samples. We developed species-specific primers and probes for quantitative droplet digital PCR (ddPCR) to improve sample throughput ...
Population-Based Preference Weights for the EQ-5D Health States Using the Visual Analogue Scale (VAS) in Iran.

PubMed

Goudarzi, Reza; Zeraati, Hojjat; Akbari Sari, Ali; Rashidian, Arash; Mohammad, Kazem

2016-02-01

Health-related quality of life (HRQoL) is used as a measure to value healthcare interventions and guide policy making. The EuroQol EQ-5D is a widely used generic preference-based instrument to measure HRQoL. The objective of this study was to develop a value set of the EQ-5D health states for an Iranian population. This cross-sectional study included 869 participants, who were selected using a stratified probability sampling method. The sample was drawn from individuals living in the city of Tehran between July and November 2013 and was stratified by age and gender. Respondents valued 13 health states using the visual analogue scale (VAS) of the EQ-5D. Several fixed effects regression models were tested to predict the full set of health states. We selected the final model based on the logical consistency of the estimates, the sign and magnitude of the regression coefficients, goodness of fit, and parsimony. We also compared predicted values with value sets from similar studies in the UK and other countries. Our results show that HRQoL does not vary among socioeconomic groups. Models at the individual level resulted in an additive model with all coefficients being statistically significant, R² = 0.55, a value of 0.75 for the best health state (11112), and a value of -0.074 for the worst health state (33333). The value set obtained for the study sample differs markedly from those elicited in developed countries. This study is the first estimate of the EQ-5D value set based on the VAS in Iran. Given the importance of locally adapted value sets, this value set can be recommended for future studies in Iran and in the EMRO region.
Pharmacokinetic Modeling and Limited Sampling Strategies Based on Healthy Volunteers for Monitoring of Ertapenem in Patients with Multidrug-Resistant Tuberculosis.

PubMed

van Rijn, S P; Zuur, M A; van Altena, R; Akkerman, O W; Proost, J H; de Lange, W C M; Kerstjens, H A M; Touw, D J; van der Werf, T S; Kosterink, J G W; Alffenaar, J W C

2017-04-01

Ertapenem is a broad-spectrum carbapenem antibiotic whose activity against Mycobacterium tuberculosis is being explored. Carbapenems have antibacterial activity when the plasma concentration exceeds the MIC at least 40% of the time (40% TMIC). To assess the 40% TMIC in multidrug-resistant tuberculosis (MDR-TB) patients, a limited sampling strategy was developed using a population pharmacokinetic model based on data for healthy volunteers. A two-compartment population pharmacokinetic model was developed with data for 42 healthy volunteers using an iterative two-stage Bayesian method. External validation was performed by Bayesian fitting of the model developed with data for volunteers to the data for individual MDR-TB patients (in which the fitted values of the area under the concentration-time curve from 0 to 24 h [AUC0-24, fit values] were used) using the population model developed for volunteers as a prior. A Monte Carlo simulation (n = 1,000) was used to evaluate limited sampling strategies. Additionally, the 40% TMIC with the free fraction (f 40% TMIC) of ertapenem in MDR-TB patients was estimated with the population pharmacokinetic model. The population pharmacokinetic model that was developed was shown to overestimate the area under the concentration-time curve from 0 to 24 h (AUC0-24) in MDR-TB patients by 6.8% (range, -17.2 to 30.7%). The best-performing limited sampling strategy, which had a time restriction of 0 to 6 h, was found to be sampling at 1 and 5 h (r² = 0.78, mean prediction error = -0.33%, root mean square error = 5.5%). Drug exposure was overestimated by a mean percentage of 4.2% (range, -15.2 to 23.6%). When a free fraction of 5% was considered and the MIC was set at 0.5 mg/liter, the minimum f 40% TMIC would have been exceeded in 9 out of 12 patients.
A population pharmacokinetic model and limited sampling strategy, developed using data from healthy volunteers, were shown to be adequate to predict ertapenem exposure in MDR-TB patients. Copyright © 2017 American Society for Microbiology.
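As a rough illustration of the time-above-MIC target described in this abstract, the sketch below estimates the fraction of an observation window during which a concentration curve exceeds the MIC, using linear interpolation between sampled points. The function name, the sampling times, and the concentration values are hypothetical and are not taken from the study; only the 5% free fraction and the 0.5 mg/liter MIC mirror figures quoted above.

```python
def fraction_time_above_mic(times, concs, mic):
    """Fraction of [times[0], times[-1]] where the linearly
    interpolated concentration is above `mic`."""
    above = 0.0
    points = list(zip(times, concs))
    for (t0, c0), (t1, c1) in zip(points, points[1:]):
        if c0 > mic and c1 > mic:
            # Whole segment lies above the MIC.
            above += t1 - t0
        elif c0 > mic or c1 > mic:
            # Segment crosses the MIC once; locate the crossing time.
            tc = t0 + (mic - c0) * (t1 - t0) / (c1 - c0)
            above += (tc - t0) if c0 > mic else (t1 - tc)
    return above / (times[-1] - times[0])


# Hypothetical total plasma concentrations (mg/liter) over 24 h.
times_h = [0, 1, 2, 4, 8, 12, 24]
total_conc = [0, 150, 120, 80, 40, 20, 4]

# Assumed 5% free (unbound) fraction, as considered in the abstract.
free_conc = [0.05 * c for c in total_conc]

f_t_mic = fraction_time_above_mic(times_h, free_conc, mic=0.5)
print(f"f T>MIC over 24 h: {100 * f_t_mic:.1f}% (target: at least 40%)")
```

Linear interpolation is a simplification; a fitted pharmacokinetic model would normally supply the concentration curve between samples.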
Pharmacokinetic Modeling and Limited Sampling Strategies Based on Healthy Volunteers for Monitoring of Ertapenem in Patients with Multidrug-Resistant Tuberculosis

PubMed Central

van Rijn, S. P.; Zuur, M. A.; van Altena, R.; Akkerman, O. W.; Proost, J. H.; de Lange, W. C. M.; Kerstjens, H. A. M.; Touw, D. J.; van der Werf, T. S.; Kosterink, J. G. W.

2017-01-01

ABSTRACT Ertapenem is a broad-spectrum carbapenem antibiotic whose activity against Mycobacterium tuberculosis is being explored. Carbapenems have antibacterial activity when the plasma concentration exceeds the MIC at least 40% of the time (40% TMIC). To assess the 40% TMIC in multidrug-resistant tuberculosis (MDR-TB) patients, a limited sampling strategy was developed using a population pharmacokinetic model based on data for healthy volunteers. A two-compartment population pharmacokinetic model was developed with data for 42 healthy volunteers using an iterative two-stage Bayesian method. External validation was performed by Bayesian fitting of the model developed with data for volunteers to the data for individual MDR-TB patients (in which the fitted values of the area under the concentration-time curve from 0 to 24 h [AUC0–24, fit values] were used) using the population model developed for volunteers as a prior. A Monte Carlo simulation (n = 1,000) was used to evaluate limited sampling strategies. Additionally, the 40% TMIC with the free fraction (f 40% TMIC) of ertapenem in MDR-TB patients was estimated with the population pharmacokinetic model. The population pharmacokinetic model that was developed was shown to overestimate the area under the concentration-time curve from 0 to 24 h (AUC0–24) in MDR-TB patients by 6.8% (range, −17.2 to 30.7%). The best-performing limited sampling strategy, which had a time restriction of 0 to 6 h, was found to be sampling at 1 and 5 h (r² = 0.78, mean prediction error = −0.33%, root mean square error = 5.5%). Drug exposure was overestimated by a mean percentage of 4.2% (range, −15.2 to 23.6%). When a free fraction of 5% was considered and the MIC was set at 0.5 mg/liter, the minimum f 40% TMIC would have been exceeded in 9 out of 12 patients.
A population pharmacokinetic model and limited sampling strategy, developed using data from healthy volunteers, were shown to be adequate to predict ertapenem exposure in MDR-TB patients. PMID:28137814
Drivers of Disparity: Differences in Socially-Based Risk Factors of Self-injurious and Suicidal Behaviors Among Sexual Minority College Students

PubMed Central

Blosnich, John; Bossarte, Robert

2012-01-01

Lesbian, gay, and bisexual (i.e., sexual minority) populations have increased prevalence of both self-injurious and suicidal behaviors, but reasons for these disparities are poorly understood. Objective To test the association between socially-based stressors (e.g., victimization, discrimination) and self-injurious behavior, suicide ideation, and suicide attempt. Participants A national sample of college-attending 18- to 24-year-olds. Methods Random or census samples from post-secondary educational institutions that administered the National College Health Assessment during the Fall 2008 and Spring 2009 semesters. Results Sexual minorities reported more socially-based stressors than heterosexuals. Bisexuals exhibited the greatest prevalence of self-injurious and suicidal behaviors. In adjusted models, intimate partner violence was most consistently associated with self-injurious behaviors. Conclusions Sexual minorities' elevated risks of self-injurious and suicidal behaviors may stem from higher exposure to socially-based stressors. Within-group differences among sexual minorities offer insight into specific risk factors that may contribute to elevated self-injurious and suicidal behaviors in sexual minority populations. PMID:22316411
PopAffiliator: online calculator for individual affiliation to a major population group based on 17 autosomal short tandem repeat genotype profile.

PubMed

Pereira, Luísa; Alshamali, Farida; Andreassen, Rune; Ballard, Ruth; Chantratita, Wasun; Cho, Nam Soo; Coudray, Clotilde; Dugoujon, Jean-Michel; Espinoza, Marta; González-Andrade, Fabricio; Hadi, Sibte; Immel, Uta-Dorothee; Marian, Catalin; Gonzalez-Martin, Antonio; Mertens, Gerhard; Parson, Walther; Perone, Carlos; Prieto, Lourdes; Takeshita, Haruo; Rangel Villalobos, Héctor; Zeng, Zhaoshu; Zhivotovsky, Lev; Camacho, Rui; Fonseca, Nuno A

2011-09-01

Because of their sensitivity and high level of discrimination, short tandem repeat (STR) marker systems are currently the method of choice in routine forensic casework and data banking, usually in multiplexes of up to 15-17 loci. Constraints related to sample amount and quality, frequently encountered in forensic casework, make it unlikely that this picture will change in the near future, notwithstanding technological developments. In this study, we present a free online calculator named PopAffiliator ( http://cracs.fc.up.pt/popaffiliator ) for individual population affiliation in the three main population groups, Eurasian, East Asian and sub-Saharan African, based on genotype profiles for the common set of STRs used in forensics. This calculator performs affiliation based on a model constructed using machine learning techniques. The model was constructed using a data set of approximately fifteen thousand individuals collected for this work. The accuracy of individual population affiliation is approximately 86%, showing that the common set of STRs routinely used in forensics provides a considerable amount of information for population assignment, in addition to being excellent for individual identification.
Prevalence of migraine in a diverse community—electronic methods for migraine ascertainment in a large integrated health plan

PubMed Central

Pressman, Alice; Jacobson, Alice; Eguilos, Roderick; Gelfand, Amy; Huynh, Cynthia; Hamilton, Luisa; Avins, Andrew; Bakshi, Nandini; Merikangas, Kathleen

2016-01-01

Introduction The growing availability of electronic health data provides an opportunity to ascertain diagnosis-specific cases via systematic methods for sample recruitment for clinical research and health services evaluation. We developed and implemented a migraine probability algorithm (MPA) to identify migraine from electronic health records (EHR) in an integrated health plan. Methods We identified all migraine outpatient diagnoses and all migraine-specific prescriptions for a five-year period (April 2008–March 2013) from the Kaiser Permanente, Northern California (KPNC) EHR. We developed and evaluated the MPA in two independent samples, and derived prevalence estimates of medically-ascertained migraine in KPNC by age, sex, and race. Results The period prevalence of medically-ascertained migraine among KPNC adults during April 2008–March 2013 was 10.3% (women: 15.5%, men: 4.5%). Estimates peaked with age in women but remained flat for men. Prevalence among Asians was half that of whites. Conclusions We demonstrate the feasibility of an EHR-based algorithm to identify cases of diagnosed migraine and determine that prevalence patterns by our methods yield results comparable to aggregate estimates of treated migraine based on direct interviews in population-based samples. This inexpensive, easily applied EHR-based algorithm provides a new opportunity for monitoring changes in migraine prevalence and identifying potential participants for research studies. PMID:26069243

From blackbirds to black holes: Investigating capture-recapture methods for time domain astronomy

NASA Astrophysics Data System (ADS)

Laycock, Silas G. T.

2017-07-01

In time domain astronomy, recurrent transients present a special problem: how to infer total populations from limited observations. Monitoring observations may give a biased view of the underlying population due to limitations on observing time, visibility and instrumental sensitivity. A similar problem exists in the life sciences, where animal populations (such as migratory birds) or disease prevalence must be estimated from sparse and incomplete data. The class of methods termed Capture-Recapture is used to reconstruct population estimates from time-series records of encounters with the study population. This paper investigates the performance of Capture-Recapture methods in astronomy via a series of numerical simulations. The Blackbirds code simulates monitoring of populations of transients, in this case accreting binary stars (neutron star or black hole accreting from a stellar companion) under a range of observing strategies. We first generate realistic light-curves for populations of binaries with contrasting orbital period distributions. These models are then randomly sampled at observing cadences typical of existing and planned monitoring surveys. The classical capture-recapture methods, Lincoln-Petersen, Schnabel estimators, related techniques, and newer methods implemented in the Rcapture package are compared. A general exponential model based on the radioactive decay law is introduced which is demonstrated to recover (at 95% confidence) the underlying population abundance and duty cycle, in a fraction of the observing visits (10-50%) required to discover all the sources in the simulation. Capture-Recapture is a promising addition to the toolbox of time domain astronomy, and methods implemented in R by the biostats community can be readily called from within Python.
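To make the classical estimator named in this abstract concrete, the following is a minimal sketch of the two-visit Lincoln-Petersen estimate and its Chapman bias-corrected variant. It is not the Blackbirds code or the Rcapture package; the function names and the example counts are hypothetical, and the two-visit, closed-population framing is a simplifying assumption.

```python
def lincoln_petersen(n1, n2, m2):
    """Two-visit population estimate.

    n1: sources detected (marked) on the first visit
    n2: sources detected on the second visit
    m2: sources detected on both visits (recaptures)
    """
    if m2 == 0:
        raise ValueError("no recaptures: estimate is undefined")
    return n1 * n2 / m2


def chapman(n1, n2, m2):
    """Chapman's variant, less biased for small samples and
    defined even when there are no recaptures."""
    return (n1 + 1) * (n2 + 1) / (m2 + 1) - 1


# Hypothetical counts: 50 sources seen on visit 1, 40 on visit 2,
# 10 of them seen on both visits.
print(lincoln_petersen(50, 40, 10))
print(round(chapman(50, 40, 10), 1))
```

Both estimators assume a closed population and equal detectability on each visit, which the duty cycles of real transients may violate.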
Ambulatory cancer and US general population reference values and cutoff scores for the functional assessment of cancer therapy.

PubMed

Pearman, Timothy; Yanez, Betina; Peipert, John; Wortman, Katy; Beaumont, Jennifer; Cella, David

2014-09-15

Health-related quality of life (HRQOL) measures are commonly used in oncology research. Interest in their use for monitoring or screening is increasing. The Functional Assessment of Cancer Therapy (FACT) is one of the most widely used HRQOL instruments. Consequently, oncology researchers and practitioners have an increasing need for reference values for the Functional Assessment of Cancer Therapy-General (FACT-G) and its 7-item rapid version, the Functional Assessment of Cancer Therapy-General 7 (FACT-G7), to compare FACT scores across specific subgroups of patients in research trials and practice. The objectives of this study are to provide 1) reference values from a sample of the general US adult population and a sample of adults diagnosed with cancer and 2) cutoff scores for quality of life. A sample of the general US population (N = 1075) and a sample of patients with cancer from 12 studies (N = 5065) were analyzed. Cutoff scores were established using distribution- and anchor-based methods. Mean values for the cancer sample were analyzed by performance status, cancer type, and disease status. Also, t tests and established criteria for meaningful differences were used to compare values. FACT-G and FACT-G7 scores in the general US population sample and cancer sample were generally comparable. Among the sample of patients with cancer, FACT-G and FACT-G7 scores worsened with declining performance status and increasing disease status. These data will aid interpretation of the magnitude and meaning of FACT scores, and allow for comparisons of scores across studies. © 2014 American Cancer Society.
Methods and background characteristics of the TOHNN study: a population-based study of oral health conditions in northern Norway

PubMed Central

Holde, Gro Eirin; Oscarson, Nils; Tillberg, Anders; Marstrander, Peter; Jönsson, Birgitta

2016-01-01

Objectives The aim of the Tromstannen – Oral Health in Northern Norway (TOHNN) study was to investigate oral health and dental-related diseases in an adult population. This article provides an overview of the background of the study and a description of the sample characteristics and methods employed in data collection. Study design Cross-sectional population-based study including a questionnaire and clinical dental examination. Methods A randomly selected sample of 2,909 individuals (20–79 years old) drawn from the population register was invited to participate in the study. The data were collected between October 2013 and November 2014 in Troms County in northern Norway. The questionnaire focused on oral health-related behaviours and attitudes, oral health-related quality of life, sense of coherence, dental anxiety and symptoms from the temporomandibular joint. The dental examinations, including radiographs, were conducted by 11 dental teams in 5 dental offices. The examination comprised registration of dental caries, full mouth periodontal status, temporomandibular disorders, mucosal lesions and height and weight. The participants were grouped by age (20–34, 35–49, 50–64 and 65–79) and ethnicity (Norwegian, Sámi, other European and other world). Results From the original sample of 2,909 individuals, 1,986 (68.3%) people participated, of whom 1,019 (51.3%) were women. The highest attendance rate was among women 20–34 years old (80.3%) and the lowest in the oldest age group of women (55.4%). There was no difference in response rate between rural and urban areas. There was a positive correlation between population size and household gross income (p < 0.001) and education level (p < 0.001). The majority of Sámi resided in smaller municipalities. In larger cities, most participants used private dental health care services, whereas, in rural areas, most participants used the public dental health care service.
Conclusion The TOHNN study has the potential to generate new knowledge on a wide range of oral health conditions beneficial to the population in Troms County. Due to the high participation rate, generalization both nationally and to the circumpolar area ought to be possible. PMID:26900910
National Survey on Access, Use and Promotion of Rational Use of Medicines (PNAUM): household survey component methods

PubMed Central

Mengue, Sotero Serrate; Bertoldi, Andréa Dâmaso; Boing, Alexandra Crispim; Tavares, Noemia Urruth Leão; Pizzol, Tatiane da Silva Dal; Oliveira, Maria Auxiliadora; Arrais, Paulo Sérgio Dourado; Ramos, Luiz Roberto; Farias, Mareni Rocha; Luiza, Vera Lucia; Bernal, Regina Tomie Ivata; de Barros, Aluísio Jardim Dornellas

2016-01-01

ABSTRACT OBJECTIVE To describe methodological aspects of the household survey National Survey on Access, Use and Promotion of Rational Use of Medicines (PNAUM) related to sampling design and implementation, the actual obtained sample, instruments and fieldwork. METHODS A cross-sectional, population-based study with probability sampling in three stages of the population living in households located in Brazilian urban areas. Fieldwork was carried out between September 2013 and February 2014. The data collection instrument included questions related to: information about households, residents and respondents; chronic diseases and medicines used; use of health services; acute diseases and events treated with drugs; use of contraceptives; use of pharmacy services; behaviors that may affect drug use; package inserts and packaging; lifestyle and health insurance. RESULTS In total, 41,433 interviews were carried out in 20,404 households and 576 urban clusters corresponding to 586 census tracts distributed in the five Brazilian regions, according to eight domains defined by age and gender. CONCLUSIONS The results of the survey may be used as a baseline for future studies aiming to assess the impact of government action on drug access and use. For local studies using a compatible method, PNAUM may serve as a reference point to evaluate variations in space and population. With a comprehensive evaluation of drug-related aspects, PNAUM is a major source of data for a variety of analyses to be carried out both at academic and government level. PMID:27982381
Group-regularized individual prediction: theory and application to pain.

PubMed

Lindquist, Martin A; Krishnan, Anjali; López-Solà, Marina; Jepma, Marieke; Woo, Choong-Wan; Koban, Leonie; Roy, Mathieu; Atlas, Lauren Y; Schmidt, Liane; Chang, Luke J; Reynolds Losin, Elizabeth A; Eisenbarth, Hedwig; Ashar, Yoni K; Delk, Elizabeth; Wager, Tor D

2017-01-15

Multivariate pattern analysis (MVPA) has become an important tool for identifying brain representations of psychological processes and clinical outcomes using fMRI and related methods. Such methods can be used to predict or 'decode' psychological states in individual subjects. Single-subject MVPA approaches, however, are limited by the amount and quality of individual-subject data. In spite of higher spatial resolution, predictive accuracy from single-subject data often does not exceed what can be accomplished using coarser, group-level maps, because single-subject patterns are trained on limited amounts of often-noisy data. Here, we present a method that combines population-level priors, in the form of biomarker patterns developed on prior samples, with single-subject MVPA maps to improve single-subject prediction. Theoretical results and simulations motivate a weighting based on the relative variances of biomarker-based prediction (based on population-level predictive maps from prior groups) and individual-subject, cross-validated prediction. Empirical results predicting pain using brain activity on a trial-by-trial basis (single-trial prediction) across 6 studies (N=180 participants) confirm the theoretical predictions. Regularization based on a population-level biomarker (in this case, the Neurologic Pain Signature, NPS) improved single-subject prediction accuracy compared with idiographic maps based on the individuals' data alone. The regularization scheme that we propose, which we term group-regularized individual prediction (GRIP), can be applied broadly to within-person MVPA-based prediction. We also show how GRIP can be used to evaluate data quality and provide benchmarks for the appropriateness of population-level maps like the NPS for a given individual or study. Copyright © 2015 Elsevier Inc. All rights reserved.
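The idea of weighting two predictions by their relative variances can be sketched with a generic inverse-variance (precision-weighted) combination. This is only an illustration of the general principle, assuming two independent, unbiased predictions; it is not claimed to be the exact GRIP estimator, and the function name and numbers are hypothetical.

```python
def precision_weighted(pred_group, var_group, pred_indiv, var_indiv):
    """Combine a group-level (biomarker-based) prediction with an
    individual-level prediction, weighting each by the inverse of
    its variance. Returns (combined prediction, weight on group)."""
    # The noisier the individual prediction, the more weight the
    # group-level prediction receives, and vice versa.
    w_group = var_indiv / (var_group + var_indiv)
    combined = w_group * pred_group + (1.0 - w_group) * pred_indiv
    return combined, w_group


# Hypothetical pain ratings: the individual-level map is noisier
# (variance 3.0) than the group-level biomarker (variance 1.0).
combined, w = precision_weighted(2.0, 1.0, 4.0, 3.0)
print(combined, w)
```

With little individual data the individual variance is large and the combination leans on the population-level map; as individual data accumulate, the weight shifts toward the idiographic prediction.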
Comparison of Online Survey Recruitment Platforms for Hard-to-Reach Pregnant Smoking Populations: Feasibility Study

PubMed Central

Agas, Jessica Marie; Lee, Melissa; Pan, Julia Lily; Buttenheim, Alison Meredith

2018-01-01

Background Recruiting hard-to-reach populations for health research is challenging. Web-based platforms offer one way to recruit specific samples for research purposes, but little is known about the feasibility of online recruitment and the representativeness and comparability of samples recruited through different Web-based platforms. Objective The objectives of this study were to determine the feasibility of recruiting a hard-to-reach population (pregnant smokers) using 4 different Web-based platforms and to compare participants recruited through each platform. Methods A screener and survey were distributed online through Qualtrics Panel, Soapbox Sample, Reddit, and Amazon Mechanical Turk (mTurk). Descriptive statistics were used to summarize results of each recruitment platform, including eligibility yield, quality yield, income, race, age, and gestational age. Results Of the 3847 participants screened for eligibility across all 4 Web-based platforms, 535 were eligible and 308 completed the survey. Amazon mTurk yielded the fewest completed responses (n=9), 100% (9/9) of which passed several quality metrics verifying pregnancy and smoking status. Qualtrics Panel yielded 14 completed responses, 86% (12/14) of which passed the quality screening. Soapbox Sample produced 107 completed surveys, 67% (72/107) of which were found to be quality responses. Advertising through Reddit produced the highest completion rate (n=178), but only 29.2% (52/178) of those surveys passed the quality metrics. We found significant differences in eligibility yield, quality yield, age, number of previous pregnancies, age of smoking initiation, current smokers, race, education, and income (P<.001). Conclusions Although each platform successfully recruited pregnant smokers, results varied in quality, cost, and percentage of complete responses. 
Moving forward, investigators should pay careful attention to the percentage yield and cost of online recruitment platforms to maximize internal and external validity. PMID:29661751
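The yield figures reported in this abstract can be reproduced with a few lines of arithmetic. The sketch below uses only the counts quoted above; the variable and function names are illustrative.

```python
# (completed responses, responses passing quality screening) per platform,
# as reported in the abstract.
platforms = {
    "Amazon mTurk": (9, 9),
    "Qualtrics Panel": (14, 12),
    "Soapbox Sample": (107, 72),
    "Reddit": (178, 52),
}

screened, eligible = 3847, 535


def percent(part, whole):
    """Percentage of `whole` represented by `part`."""
    return 100.0 * part / whole


total_completed = sum(done for done, _ in platforms.values())
print(f"Eligibility yield: {percent(eligible, screened):.1f}%")
print(f"Completed surveys: {total_completed}")
for name, (done, quality) in platforms.items():
    print(f"{name}: quality yield {percent(quality, done):.1f}% ({quality}/{done})")
```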
Near-Native Protein Loop Sampling Using Nonparametric Density Estimation Accommodating Sparcity

PubMed Central

Day, Ryan; Lennox, Kristin P.; Sukhanov, Paul; Dahl, David B.; Vannucci, Marina; Tsai, Jerry

2011-01-01

Unlike with the core structural elements of a protein, such as regular secondary structure, template-based modeling (TBM) has difficulty with loop regions due to their variability in sequence and structure as well as the sparse sampling from a limited number of homologous templates. We present a novel, knowledge-based method for loop sampling that leverages homologous torsion angle information to estimate a continuous joint backbone dihedral angle density at each loop position. The φ,ψ distributions are estimated via a Dirichlet process mixture of hidden Markov models (DPM-HMM). Models are quickly generated based on samples from these distributions and are enriched using an end-to-end distance filter. The performance of the DPM-HMM method was evaluated against a diverse test set in a leave-one-out approach. Candidates as low as 0.45 Å RMSD and with a worst case of 3.66 Å were produced. For the canonical loops like the immunoglobulin complementarity-determining regions (mean RMSD <2.0 Å), the DPM-HMM method performs as well as or better than the best templates, demonstrating that our automated method recaptures these canonical loops without inclusion of any IgG specific terms or manual intervention. In cases with poor or few good templates (mean RMSD >7.0 Å), this sampling method produces a population of loop structures to around 3.66 Å for loops up to 17 residues. In a direct comparison of sampling with the Loopy algorithm, our method demonstrates the ability to sample nearer native structures for both the canonical CDRH1 and non-canonical CDRH3 loops. Lastly, in the realistic test conditions of the CASP9 experiment, successful application of DPM-HMM for 90 loops from 45 TBM targets shows the general applicability of our sampling method to the loop modeling problem. These results demonstrate that our DPM-HMM produces an advantage by consistently sampling near native loop structure.
The software used in this analysis is available for download at http://www.stat.tamu.edu/~dahl/software/cortorgles/. PMID:22028638
Sampling Methods in Cardiovascular Nursing Research: An Overview.

PubMed

Kandola, Damanpreet; Banner, Davina; O'Keefe-McCarthy, Sheila; Jassal, Debbie

2014-01-01

Cardiovascular nursing research covers a wide array of topics from health services to psychosocial patient experiences. The selection of specific participant samples is an important part of the research design and process. The sampling strategy employed is of utmost importance to ensure that a representative sample of participants is chosen. There are two main categories of sampling methods: probability and non-probability. Probability sampling is the random selection of elements from the population, where each element of the population has an equal and independent chance of being included in the sample. There are five main types of probability sampling: simple random sampling, systematic sampling, stratified sampling, cluster sampling, and multi-stage sampling. Non-probability sampling methods are those in which elements are chosen through non-random methods for inclusion into the research study and include convenience sampling, purposive sampling, and snowball sampling. Each approach offers distinct advantages and disadvantages and must be considered critically. In this research column, we provide an introduction to these key sampling techniques and draw on examples from cardiovascular research. Understanding the differences in sampling techniques may aid nurses in effective appraisal of research literature and provide a reference point for nurses who engage in cardiovascular research.
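Three of the probability sampling designs listed in this abstract can be sketched with the Python standard library. This is a minimal illustration, not taken from the article; the function names, the proportional allocation rule, and the toy population are assumptions made for the example.

```python
import random


def simple_random_sample(population, n, rng):
    """Every unit has the same chance of selection (without replacement)."""
    return rng.sample(population, n)


def systematic_sample(population, n, rng):
    """Pick a random start, then take every k-th unit from the list."""
    k = len(population) // n          # sampling interval
    start = rng.randrange(k)          # random start within the first interval
    return [population[start + i * k] for i in range(n)]


def stratified_sample(population, key, fraction, rng):
    """Sample the same fraction independently within each stratum."""
    strata = {}
    for unit in population:
        strata.setdefault(key(unit), []).append(unit)
    sample = []
    for members in strata.values():
        size = max(1, round(fraction * len(members)))
        sample.extend(rng.sample(members, size))
    return sample


# Toy population of 100 numbered participants; parity stands in for a
# stratification variable such as sex or age group (hypothetical).
rng = random.Random(0)
people = list(range(100))
srs = simple_random_sample(people, 10, rng)
sys_sample = systematic_sample(people, 10, rng)
strat = stratified_sample(people, key=lambda p: p % 2, fraction=0.1, rng=rng)
print(srs, sys_sample, strat, sep="\n")
```

Cluster and multi-stage designs follow the same pattern, with the random draw applied first to groups of units and then, optionally, to units within the selected groups.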
Sampling bee communities using pan traps: alternative methods increase sample size

USDA-ARS's Scientific Manuscript database

Monitoring of the status of bee populations and inventories of bee faunas require systematic sampling. Efficiency and ease of implementation has encouraged the use of pan traps to sample bees. Efforts to find an optimal standardized sampling method for pan traps have focused on pan trap color. Th...
The association between Internet addiction and personality disorders in a general population-based sample.

PubMed

Zadra, Sina; Bischof, Gallus; Besser, Bettina; Bischof, Anja; Meyer, Christian; John, Ulrich; Rumpf, Hans-Jürgen

2016-12-01

Background and aims Data on Internet addiction (IA) and its association with personality disorder are rare. Previous studies are largely restricted to clinical samples and insufficient measurement of IA. Methods Cross-sectional analysis data are based on a German sub-sample (n = 168; 86 males; 71 meeting criteria for IA) with increased levels of excessive Internet use derived from a general population sample (n = 15,023). IA was assessed with a comprehensive standardized interview using the structure of the Composite International Diagnostic Interview and the criteria of Internet Gaming Disorder as suggested in DSM-5. Impulsivity, attention deficit hyperactivity disorder, and self-esteem were assessed with the widely used questionnaires. Results Participants with IA showed higher frequencies of personality disorders (29.6%) compared to those without IA (9.3%; p < .001). In males with IA, Cluster C personality disorders were more prevalent than among non-addicted males. Compared to participants who had IA only, lower rates of remission of IA were found among participants with IA and additional cluster B personality disorder. Personality disorders were significantly associated with IA in multivariate analysis. Comorbidity of IA and personality disorders must be considered in prevention and treatment.
Facebook Recruitment of Vaccine-Hesitant Canadian Parents: Cross-Sectional Study

PubMed Central

2017-01-01

Background There is concern over the increase in the number of “vaccine-hesitant” parents, which contributes to under-vaccinated populations and reduced herd immunity. Traditional studies investigating parental immunization beliefs and practices have relied on random digit dialing (RDD); however, this method presents increasing limitations. Facebook is the most used social media platform in Canada and presents an opportunity to recruit vaccine-hesitant parents in a novel manner. Objective The study aimed to explore the use of Facebook as a tool to reach vaccine-hesitant parents, as compared with RDD methods. Methods We recruited Canadian parents over 4 weeks in 2013-14 via targeted Facebook advertisements linked to a Web-based survey. We compared methodological parameters, key parental demographics, and three vaccine hesitancy indicators to an RDD sample of Canadian parents. Two raters categorized respondent reasons for difficulties in deciding to vaccinate, according to the model of determinants of vaccine hesitancy developed by the World Health Organization’s Strategic Advisory Group of Experts on Immunization. Results The Facebook campaign received a total of 4792 clicks from unique users, of whom 1696 started the Web-based survey. The total response rate of fully completed unique Web-based surveys was 22.89% (1097/4792) and the survey completion rate was 64.68% (1097/1696). The total cost including incentives was reasonable (Can $4861.19). The Web-based sample yielded younger parents, with 85.69% (940/1097) under the age of 40 years as compared with 23.38% (408/1745) in the RDD sample; 91.43% (1003/1097) of the Facebook respondents were female as compared with 59.26% (1034/1745) in the RDD sample. Facebook respondents had a lower median age of their youngest child (1 year vs 8 years for RDD). 
When compared with the RDD sample, the Web-based sample yielded a significantly higher proportion of respondents reporting vaccines as moderately safe to not safe (26.62% [292/1097] vs 18.57% [324/1745]), partially or not at all up-to-date vaccination status of youngest child (22.06% [242/1097] vs 9.57% [167/1745]), and difficulty in making the decision to vaccinate their youngest child (21.06% [231/1097] vs 10.09% [176/1745]). Out of the Web-based respondents who reported reasons for the difficulties in deciding to vaccinate, 37.2% (83/223) reported lack of knowledge or trust due to conflicting information and 23.8% (53/223) reported the perception of the risk of the adverse effects of vaccines being higher than the risk of disease acquisition. Conclusions We successfully recruited a large sample of our target population at low cost and achieved a high survey completion rate using Facebook. When compared with the RDD sampling strategy, we reached more vaccine-hesitant parents and younger parents with younger children—a population more likely to be making decisions on childhood immunizations. Facebook is a promising economical modality for reaching vaccine-hesitant parents for studies on the determinants of vaccine uptake. PMID:28739557
Promethazine Misuse among Methadone Maintenance Patients and Community-Based Injection Drug Users

PubMed Central

Shapiro, Brad J.; Lynch, Kara L.; Toochinda, Tab; Lutnick, Alexandra; Cheng, Helen Y.; Kral, Alex H.

2013-01-01

Objective Promethazine has been reported to be misused in conjunction with opioids in several settings. Promethazine misuse by itself or in conjunction with opioids may have serious adverse health effects. To date, no prevalence data for the nonmedical use of promethazine has been reported. This study examines the prevalence and correlates of promethazine use in two different populations in San Francisco, California, USA: methadone maintenance clinic patients and community-based injection drug users (IDUs). Methods We analyzed urine samples for the presence of promethazine and reviewed the clinical records for 334 methadone maintenance patients at the county methadone clinic. Separately, we used targeted sampling methods to recruit and survey 139 community-based opioid IDUs about their use of promethazine. We assessed prevalence and factors associated with promethazine use with bivariate and multivariate statistics. Results The prevalence of promethazine positive urine samples among the methadone maintenance patients was 26 percent. Only 15 percent of promethazine positive patients had an active prescription for promethazine. Among IDUs reporting injection of opiates in the community-based survey, 17 percent reported having used promethazine in the past month; 24 percent of the IDUs who reported being enrolled in methadone treatment reported using promethazine in the past month. Conclusions The finding that one quarter of methadone maintenance patients in a clinic or recruited in community settings have recently used promethazine provides compelling evidence of significant nonmedical use of promethazine in this patient population. Further research is needed to establish the extent and nature of nonmedical use of promethazine. PMID:23385449
A Spatial Statistical Model for Landscape Genetics

PubMed Central

Guillot, Gilles; Estoup, Arnaud; Mortier, Frédéric; Cosson, Jean François

2005-01-01

Landscape genetics is a new discipline that aims to provide information on how landscape and environmental features influence population genetic structure. The first key step of landscape genetics is the spatial detection and location of genetic discontinuities between populations. However, efficient methods for achieving this task are lacking. In this article, we first clarify what is conceptually involved in the spatial modeling of genetic data. Then we describe a Bayesian model implemented in a Markov chain Monte Carlo scheme that allows inference of the location of such genetic discontinuities from individual geo-referenced multilocus genotypes, without a priori knowledge on populational units and limits. In this method, the global set of sampled individuals is modeled as a spatial mixture of panmictic populations, and the spatial organization of populations is modeled through the colored Voronoi tessellation. In addition to spatially locating genetic discontinuities, the method quantifies the amount of spatial dependence in the data set, estimates the number of populations in the studied area, assigns individuals to their population of origin, and detects individual migrants between populations, while taking into account uncertainty on the location of sampled individuals. The performance of the method is evaluated through the analysis of simulated data sets. Results show good performances for standard data sets (e.g., 100 individuals genotyped at 10 loci with 10 alleles per locus), with high but also low levels of population differentiation (e.g., FST < 0.05). The method is then applied to a set of 88 individuals of wolverines (Gulo gulo) sampled in the northwestern United States and genotyped at 10 microsatellites. PMID:15520263
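As a rough illustration of the colored Voronoi tessellation mentioned above, the sketch below assigns each sampled individual to the population label (color) of its nearest nucleus. The nucleus coordinates, the labels, and the function name `assign_populations` are hypothetical, and the Bayesian MCMC step that would infer the nuclei and their colors is not shown.

```python
import math

def assign_populations(individuals, nuclei, colors):
    """Label each individual with the color of the nearest Voronoi nucleus."""
    labels = []
    for point in individuals:
        nearest = min(range(len(nuclei)), key=lambda i: math.dist(point, nuclei[i]))
        labels.append(colors[nearest])
    return labels

# Hypothetical nuclei: two cells colored as population 1, two as population 2.
nuclei = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
colors = [1, 1, 2, 2]
individuals = [(0.1, 0.2), (0.9, 0.1), (0.2, 0.8), (0.7, 0.9)]

labels = assign_populations(individuals, nuclei, colors)
print(labels)  # [1, 1, 2, 2]
```

Because several cells can share a color, a single population may occupy an irregular region, and a genetic discontinuity corresponds to a boundary between cells of different colors.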
Temperament, Parenting, and Depressive Symptoms in a Population Sample of Preadolescents

ERIC Educational Resources Information Center

Oldehinkel, Albertine J.; Veenstra, Rene; Ormel, Johan; De Winter, Andrea F.; Verhulst, Frank C.

2006-01-01

Background: Depressive symptoms can be triggered by negative social experiences and individuals' processing of these experiences. This study focuses on the interaction between temperament, perceived parenting, and gender in relation to depressive problems in a Dutch population sample of preadolescents. Methods: The sample consisted of 2230…
Long-Term Frozen Storage of Urine Samples: A Trouble to Get PCR Results in Schistosoma spp. DNA Detection?

PubMed Central

Fernández-Soto, Pedro; Velasco Tirado, Virginia; Carranza Rodríguez, Cristina; Pérez-Arellano, José Luis; Muro, Antonio

2013-01-01

Background Human schistosomiasis remains a serious worldwide public health problem. At present, a sensitive and specific assay for routine diagnosis of schistosome infection is not yet available. The potential for detecting schistosome-derived DNA by PCR-based methods in human clinical samples is currently being investigated as a diagnostic tool with potential application in routine schistosomiasis diagnosis. Collection of diagnostic samples such as stool or blood is often difficult in some populations. However, urine is a biological sample that can be collected non-invasively, is easy to obtain from people of all ages, and is easy to handle, yet it is still not widely used as a sample for PCR diagnosis. This could be due to the high variability in the reported efficiency of detection, resulting from the high variation in urine sample storage or handling conditions and in DNA preservation and extraction methods. Methodology/Principal Findings We evaluated different commercial DNA extraction methods on a series of long-term frozen human urine samples from patients with parasitologically confirmed schistosomiasis in order to assess PCR effectiveness for Schistosoma spp. detection. Patients' urine samples had been frozen for 18 months to 7 years before use. Results were compared with those obtained in PCR assays using fresh healthy human urine artificially contaminated with Schistosoma mansoni DNA and urine samples from mice experimentally infected with S. mansoni cercariae stored frozen for at least 12 months before use. PCR on the fresh, artificially contaminated human urine samples, using different DNA extraction methods, was much more effective than PCR using long-term frozen human urine samples as the source of DNA template.
Conclusions/Significance Long-term frozen human urine samples are probably not a good source for DNA extraction for use as a template in PCR detection of Schistosoma spp., regardless of the DNA method of extraction used. PMID:23613907
Stratified sampling design based on data mining.

PubMed

Kim, Yeonkook J; Oh, Yoonhwan; Park, Sunghoon; Cho, Sungzoon; Park, Hayoung

2013-09-01

This study explored classification rules, based on data mining methodologies, for defining strata in stratified sampling of healthcare providers with improved sampling efficiency. We performed k-means clustering to group providers with similar characteristics, then constructed decision trees on cluster labels to generate stratification rules. We assessed the variance explained by the stratification proposed in this study and by conventional stratification to evaluate the performance of the sampling design. We constructed a study database from health insurance claims data and providers' profile data made available to this study by the Health Insurance Review and Assessment Service of South Korea, and population data from Statistics Korea. From our database, we used the data for single specialty clinics or hospitals in two specialties, general surgery and ophthalmology, for the year 2011. Data mining resulted in five strata in general surgery with two stratification variables, the number of inpatients per specialist and population density of provider location, and five strata in ophthalmology with two stratification variables, the number of inpatients per specialist and number of beds. The percentages of variance in annual changes in the productivity of specialists explained by the stratification in general surgery and ophthalmology were 22% and 8%, respectively, whereas conventional stratification by the type of provider location and number of beds explained 2% and 0.2% of variance, respectively. This study demonstrated that data mining methods can be used in designing efficient stratified sampling with variables readily available to the insurer and government; it offers an alternative to the existing stratification method that is widely used in healthcare provider surveys in South Korea.
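The comparison of stratifications above rests on the share of variance that a set of strata explains. A minimal sketch of that quantity follows, computed as between-stratum sum of squares over total sum of squares. The abstract does not give the exact formula the authors used, so this definition is an assumption, and the data values and the name `variance_explained` are hypothetical.

```python
from statistics import mean

def variance_explained(values, strata):
    """Between-stratum sum of squares divided by total sum of squares."""
    grand = mean(values)
    total_ss = sum((v - grand) ** 2 for v in values)
    groups = {}
    for v, s in zip(values, strata):
        groups.setdefault(s, []).append(v)
    between_ss = sum(len(g) * (mean(g) - grand) ** 2 for g in groups.values())
    return between_ss / total_ss

# Hypothetical outcome values for six providers under two candidate stratifications.
values = [1.0, 2.0, 3.0, 7.0, 8.0, 9.0]
homogeneous_strata = ["A", "A", "A", "B", "B", "B"]  # strata group similar providers
mixed_strata = ["A", "B", "A", "B", "A", "B"]        # strata mix dissimilar providers

r2_good = variance_explained(values, homogeneous_strata)
r2_poor = variance_explained(values, mixed_strata)
print(f"{r2_good:.3f} vs {r2_poor:.3f}")
```

A stratification that groups similar units leaves little variance within strata, which is what makes stratified sampling more efficient than simple random sampling at the same sample size.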
Cell purification: a new challenge for biobanks.

PubMed

Almeida, Maria; García-Montero, Andres C; Orfao, Alberto

2014-01-01

Performing '-omics' analyses on heterogeneous biological tissue samples, such as blood or bone marrow, can lead to biased or even erroneous results, particularly when the targeted cells and/or molecules are present at relatively low percentages/amounts. In such cases, whole sample analysis will most probably dilute and mask the features of the cells and/or molecules of interest, and this will negatively impact the results and their interpretation. Therefore, it is frequently critical to have well-characterized and high-quality purified cell populations for the reliable detection of subtle variations in their specific features, such as gene expression profile, protein expression pattern and metabolic status. Biobanks are technological platforms which aim to provide researchers access to a large number of high-quality biological samples and their associated data, particularly to support high-quality scientific and clinical research projects, and such projects will benefit enormously from access to high-quality purified cell populations or their biological components (e.g. DNA, RNA, proteins). Therefore, a clear opportunity exists for preparative cell sorting techniques in biobanks. Although multiple different cell purification approaches exist or are under development (e.g. cell purification techniques based on cell adherence, density and/or cell size properties, methods based on antibody binding as well as new lab-on-a-chip purification techniques), the choice of a specific technology depends on multiple variables, including cell recovery, purity and yield, among others. In addition, most cell purification approaches are not well suited for high-throughput (HT) purification of multiple cell populations coexisting in a sample. Here we review the cell sorting methods that are currently most used and that may be applied for sample preparation in biobanks.
For the different approaches, technical considerations about their advantages and limitations are highlighted, and the requirements to be met by an HT cell sorting technology to be used in biobanks are also discussed.
Improved detection of CXCR4-using HIV by V3 genotyping: application of population-based and "deep" sequencing to plasma RNA and proviral DNA.

PubMed

Swenson, Luke C; Moores, Andrew; Low, Andrew J; Thielen, Alexander; Dong, Winnie; Woods, Conan; Jensen, Mark A; Wynhoven, Brian; Chan, Dennison; Glascock, Christopher; Harrigan, P Richard

2010-08-01

Tropism testing should rule out CXCR4-using HIV before treatment with CCR5 antagonists. Currently, the recombinant phenotypic Trofile assay (Monogram) is most widely utilized; however, genotypic tests may represent alternative methods. Independent triplicate amplifications of the HIV gp120 V3 region were made from either plasma HIV RNA or proviral DNA. These underwent standard, population-based sequencing with an ABI3730 (RNA n = 63; DNA n = 40), or "deep" sequencing with a Roche/454 Genome Sequencer-FLX (RNA n = 12; DNA n = 12). Position-specific scoring matrices (PSSMX4/R5) (-6.96 cutoff) and geno2pheno[coreceptor] (5% false-positive rate) inferred tropism from V3 sequence. These methods were then independently validated with a separate, blinded dataset (n = 278) of screening samples from the maraviroc MOTIVATE trials. Standard sequencing of HIV RNA with PSSM yielded 69% sensitivity and 91% specificity, relative to Trofile. The validation dataset gave 75% sensitivity and 83% specificity. Proviral DNA plus PSSM gave 77% sensitivity and 71% specificity. "Deep" sequencing of HIV RNA detected >2% inferred-CXCR4-using virus in 8/8 samples called non-R5 by Trofile, and <2% in 4/4 samples called R5. Triplicate analyses of V3 standard sequence data detect greater proportions of CXCR4-using samples than previously achieved. Sequencing proviral DNA and "deep" V3 sequencing may also be useful tools for assessing tropism.
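Sensitivity and specificity figures like those above come from comparing genotypic calls with the reference assay sample by sample. The sketch below shows that calculation on a small set of hypothetical calls (True meaning "non-R5"). The data and the function name are illustrative assumptions and do not reproduce the study's datasets.

```python
def sensitivity_specificity(reference, predicted):
    """Compare predicted calls with reference calls (True = non-R5 / CXCR4-using)."""
    tp = sum(r and p for r, p in zip(reference, predicted))
    fn = sum(r and not p for r, p in zip(reference, predicted))
    tn = sum(not r and not p for r, p in zip(reference, predicted))
    fp = sum(not r and p for r, p in zip(reference, predicted))
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical calls for ten samples: reference assay vs genotypic prediction.
reference = [True, True, True, True, False, False, False, False, False, False]
predicted = [True, True, True, False, False, False, False, False, False, True]

sens, spec = sensitivity_specificity(reference, predicted)
print(f"sensitivity = {sens:.2f}, specificity = {spec:.2f}")
```

Sensitivity here is the fraction of reference non-R5 samples that the genotypic method also flags, and specificity is the fraction of reference R5 samples that it leaves unflagged.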
The late Neandertal supraorbital fossils from Vindija Cave, Croatia: a biased sample?

PubMed

Ahern, James C M; Lee, Sang-Hee; Hawks, John D

2002-09-01

The late Neandertal sample from Vindija (Croatia) has been described as transitional between the earlier Central European Neandertals from Krapina (Croatia) and modern humans. However, the morphological differences indicating this transition may rather be the result of different sex and/or age compositions between the samples. This study tests the hypothesis that the metric differences between the Krapina and Vindija supraorbital samples are due to sampling bias. We focus upon the supraorbital region because past studies have posited this region as particularly indicative of the Vindija sample's transitional nature. Furthermore, the supraorbital region varies significantly with both age and sex. We analyzed four chords and two derived indices of supraorbital torus form as defined by Smith & Ranyard (1980, Am. J. phys. Anthrop. 93, pp. 589-610). For each variable, we analyzed relative sample bias of the Krapina and Vindija samples using three sampling methods. In order to test the hypothesis that the Vindija sample contains an over-representation of females and/or young while the Krapina sample is normal or also female/young biased, we determined the probability of drawing a sample of the same size as and with a mean equal to or less than Vindija's from a Krapina-based population. In order to test the hypothesis that the Vindija sample is female/young biased while the Krapina sample is male/old biased, we determined the probability of drawing a sample of the same size as and with a mean equal to or less than Vindija's from a generated population whose mean is halfway between Krapina's and Vindija's. Finally, in order to test the hypothesis that the Vindija sample is normal while the Krapina sample contains an over-representation of males and/or old, we determined the probability of drawing a sample of the same size as and with a mean equal to or greater than Krapina's from a Vindija-based population.
Unless we assume that the Vindija sample is female/young and the Krapina sample is male/old biased, our results falsify the hypothesis that the metric differences between the Krapina and Vindija samples are due to sample bias.
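The first of the three tests described above can be sketched as a simple Monte Carlo resampling: draw many samples of the smaller sample's size from a reference population and count how often the sample mean is at or below the observed mean. The measurements, the sample size, the use of sampling with replacement, and the function name are all hypothetical. The abstract does not specify how the authors generated their populations.

```python
import random

def prob_mean_at_most(population, sample_size, observed_mean, draws=10000, seed=0):
    """Estimate P(sample mean <= observed_mean) by resampling with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    hits = 0
    for _ in range(draws):
        sample = rng.choices(population, k=sample_size)
        if sum(sample) / sample_size <= observed_mean:
            hits += 1
    return hits / draws

# Hypothetical chord measurements (mm) standing in for a Krapina-based population.
krapina_like = [10.2, 11.5, 12.1, 9.8, 13.0, 11.1, 12.6, 10.9]

# Hypothetical smaller sample: five specimens with a mean of 10.5 mm.
p = prob_mean_at_most(krapina_like, sample_size=5, observed_mean=10.5)
print(f"estimated probability: {p:.3f}")
```

A small probability would suggest that the smaller sample is unlikely to be an unbiased draw from the reference population, which is the logic behind rejecting a sampling-bias explanation.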
Density dependence and climate effects in Rocky Mountain elk: an application of regression with instrumental variables for population time series with sampling error.

PubMed

Creel, Scott; Creel, Michael

2009-11-01

1. Sampling error in annual estimates of population size creates two widely recognized problems for the analysis of population growth. First, if sampling error is mistakenly treated as process error, one obtains inflated estimates of the variation in true population trajectories (Staples, Taper & Dennis 2004). Second, treating sampling error as process error is thought to overestimate the importance of density dependence in population growth (Viljugrein et al. 2005; Dennis et al. 2006). 2. In ecology, state-space models are used to account for sampling error when estimating the effects of density and other variables on population growth (Staples et al. 2004; Dennis et al. 2006). In econometrics, regression with instrumental variables is a well-established method that addresses the problem of correlation between regressors and the error term, but requires fewer assumptions than state-space models (Davidson & MacKinnon 1993; Cameron & Trivedi 2005). 3. We used instrumental variables to account for sampling error and fit a generalized linear model to 472 annual observations of population size for 35 Elk Management Units in Montana, from 1928 to 2004. We compared this model with state-space models fit with the likelihood function of Dennis et al. (2006). We discuss the general advantages and disadvantages of each method. Briefly, regression with instrumental variables is valid with fewer distributional assumptions, but state-space models are more efficient when their distributional assumptions are met. 4. Both methods found that population growth was negatively related to population density and winter snow accumulation. Summer rainfall and wolf (Canis lupus) presence had much weaker effects on elk (Cervus elaphus) dynamics [though limitation by wolves is strong in some elk populations with well-established wolf populations (Creel et al. 2007; Creel & Christianson 2008)]. 5. Coupled with predictions for Montana from global and regional climate models, our results predict a substantial reduction in the limiting effect of snow accumulation on Montana elk populations in the coming decades. If other limiting factors do not operate with greater force, population growth rates would increase substantially.
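To illustrate why instrumental variables help when a regressor is measured with error, the sketch below simulates a generic errors-in-variables setting and compares the ordinary least squares slope with a simple one-instrument estimator, cov(z, y) / cov(z, x). This is not the authors' model or their choice of instruments. The simulated data, the true slope of -0.5, and the use of a second noisy measurement as the instrument are all assumptions made for the example.

```python
import random

def cov(a, b):
    """Sample covariance of two equal-length sequences."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    return sum((x - ma) * (y - mb) for x, y in zip(a, b)) / (len(a) - 1)

def ols_slope(x, y):
    return cov(x, y) / cov(x, x)

def iv_slope(x, y, z):
    """Single-instrument IV estimator: cov(z, y) / cov(z, x)."""
    return cov(z, y) / cov(z, x)

rng = random.Random(1)
n = 5000
true_slope = -0.5
truth = [rng.gauss(0, 1) for _ in range(n)]   # true, unobserved regressor
x = [t + rng.gauss(0, 1) for t in truth]      # regressor observed with sampling error
z = [t + rng.gauss(0, 1) for t in truth]      # instrument: independent noisy measurement
y = [true_slope * t + rng.gauss(0, 0.5) for t in truth]

b_ols = ols_slope(x, y)
b_iv = iv_slope(x, y, z)
print(f"OLS: {b_ols:.3f}  IV: {b_iv:.3f}  true: {true_slope}")
```

In this setup the least squares slope is pulled toward zero by the measurement error, while the instrumental variable estimate stays close to the true slope because the instrument is correlated with the true regressor but not with the error in `x`.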
