Science.gov

Sample records for 250k snp array

  1. SNP Arrays

    PubMed Central

    Louhelainen, Jari

    2016-01-01

    The papers published in this Special Issue “SNP arrays” (Single Nucleotide Polymorphism Arrays) focus on several perspectives associated with arrays of this type. The range of papers vary from a case report to reviews, thereby targeting wider audiences working in this field. The research focus of SNP arrays is often human cancers but this Issue expands that focus to include areas such as rare conditions, animal breeding and bioinformatics tools. Given the limited scope, the spectrum of papers is nothing short of remarkable and even from a technical point of view these papers will contribute to the field at a general level. Three of the papers published in this Special Issue focus on the use of various SNP array approaches in the analysis of three different cancer types. Two of the papers concentrate on two very different rare conditions, applying the SNP arrays slightly differently. Finally, two other papers evaluate the use of the SNP arrays in the context of genetic analysis of livestock. The findings reported in these papers help to close gaps in the current literature and also to give guidelines for future applications of SNP arrays. PMID:27792140

  2. 250 kA compact linear transformer driver for wire array z-pinch loads

    NASA Astrophysics Data System (ADS)

    Bott, S. C.; Haas, D. M.; Madden, R. E.; Ueda, U.; Eshaq, Y.; Collins, G., IV; Gunasekera, K.; Mariscal, D.; Peebles, J.; Beg, F. N.; Mazarakis, M.; Struve, K.; Sharpe, R.

    2011-05-01

    We present the application of a short rise (˜150ns) 250 kA linear transformer driver (LTD) to wire array z-pinch loads for the first time. The generator is a modification of a previous driver in which a new conical power feed provides a low inductance coupling to wire loads. Performance of the new design using both short circuit and plasma loads is presented and discussed. The final design delivers ˜200kA to a wire array load which is in good agreement with SCREAMER calculations using a simplified representative circuit. Example results demonstrate successful experiments using cylindrical, conical, and inverse wire arrays as well as previously published work on x-pinch loads.

  3. Genome Wide Linkage Study, Using a 250K SNP Map, of Plasmodium falciparum Infection and Mild Malaria Attack in a Senegalese Population

    PubMed Central

    Milet, Jacqueline; Nuel, Gregory; Watier, Laurence; Courtin, David; Slaoui, Yousri; Senghor, Paul; Migot-Nabias, Florence; Gaye, Oumar; Garcia, André

    2010-01-01

    Multiple factors are involved in the variability of host's response to P. falciparum infection, like the intensity and seasonality of malaria transmission, the virulence of parasite and host characteristics like age or genetic make-up. Although admitted nowadays, the involvement of host genetic factors remains unclear. Discordant results exist, even concerning the best-known malaria resistance genes that determine the structure or function of red blood cells. Here we report on a genome-wide linkage and association study for P. falciparum infection intensity and mild malaria attack among a Senegalese population of children and young adults from 2 to 18 years old. A high density single nucleotide polymorphisms (SNP) genome scan (Affimetrix GeneChip Human Mapping 250K-nsp) was performed for 626 individuals: i.e. 249 parents and 377 children out of the 504 ones included in the follow-up. The population belongs to a unique ethnic group and was closely followed-up during 3 years. Genome-wide linkage analyses were performed on four clinical and parasitological phenotypes and association analyses using the family based association tests (FBAT) method were carried out in regions previously linked to malaria phenotypes in literature and in the regions for which we identified a linkage peak. Analyses revealed three strongly suggestive evidences for linkage: between mild malaria attack and both the 6p25.1 and the 12q22 regions (empirical p-value = 5×10−5 and 9×10−5 respectively), and between the 20p11q11 region and the prevalence of parasite density in asymptomatic children (empirical p-value = 1.5×10−4). Family based association analysis pointed out one significant association between the intensity of plasmodial infection and a polymorphism located in ARHGAP26 gene in the 5q31–q33 region (p-value = 3.7×10−5). This study identified three candidate regions, two of them containing genes that could point out new pathways implicated in the response to

  4. SNP Array in Hematopoietic Neoplasms: A Review

    PubMed Central

    Song, Jinming; Shao, Haipeng

    2015-01-01

    Cytogenetic analysis is essential for the diagnosis and prognosis of hematopoietic neoplasms in current clinical practice. Many hematopoietic malignancies are characterized by structural chromosomal abnormalities such as specific translocations, inversions, deletions and/or numerical abnormalities that can be identified by karyotype analysis or fluorescence in situ hybridization (FISH) studies. Single nucleotide polymorphism (SNP) arrays offer high-resolution identification of copy number variants (CNVs) and acquired copy-neutral loss of heterozygosity (LOH)/uniparental disomy (UPD) that are usually not identifiable by conventional cytogenetic analysis and FISH studies. As a result, SNP arrays have been increasingly applied to hematopoietic neoplasms to search for clinically-significant genetic abnormalities. A large numbers of CNVs and UPDs have been identified in a variety of hematopoietic neoplasms. CNVs detected by SNP array in some hematopoietic neoplasms are of prognostic significance. A few specific genes in the affected regions have been implicated in the pathogenesis and may be the targets for specific therapeutic agents in the future. In this review, we summarize the current findings of application of SNP arrays in a variety of hematopoietic malignancies with an emphasis on the clinically significant genetic variants. PMID:27600067

  5. Analyzing cancer samples with SNP arrays.

    PubMed

    Van Loo, Peter; Nilsen, Gro; Nordgard, Silje H; Vollan, Hans Kristian Moen; Børresen-Dale, Anne-Lise; Kristensen, Vessela N; Lingjærde, Ole Christian

    2012-01-01

    Single nucleotide polymorphism (SNP) arrays are powerful tools to delineate genomic aberrations in cancer genomes. However, the analysis of these SNP array data of cancer samples is complicated by three phenomena: (a) aneuploidy: due to massive aberrations, the total DNA content of a cancer cell can differ significantly from its normal two copies; (b) nonaberrant cell admixture: samples from solid tumors do not exclusively contain aberrant tumor cells, but always contain some portion of nonaberrant cells; (c) intratumor heterogeneity: different cells in the tumor sample may have different aberrations. We describe here how these phenomena impact the SNP array profile, and how these can be accounted for in the analysis. In an extended practical example, we apply our recently developed and further improved ASCAT (allele-specific copy number analysis of tumors) suite of tools to analyze SNP array data using data from a series of breast carcinomas as an example. We first describe the structure of the data, how it can be plotted and interpreted, and how it can be segmented. The core ASCAT algorithm next determines the fraction of nonaberrant cells and the tumor ploidy (the average number of DNA copies), and calculates an ASCAT profile. We describe how these ASCAT profiles visualize both copy number aberrations as well as copy-number-neutral events. Finally, we touch upon regions showing intratumor heterogeneity, and how they can be detected in ASCAT profiles. All source code and data described here can be found at our ASCAT Web site ( http://www.ifi.uio.no/forskning/grupper/bioinf/Projects/ASCAT/).

  6. A SNP genotyping array for hexaploid oat

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Recognizing a need in cultivated hexaploid oat (Avena sativa L.) for a reliable set of reference SNPs, we have developed a 6K BeadChip design containing 257 Infinium I and 5,486 Infinium II designs corresponding to 5,743 SNPs. Of those, 4,975 SNPs yielded successful assays after array manufacturing...

  7. Software solutions for the livestock genomics SNP array revolution.

    PubMed

    Nicolazzi, E L; Biffani, S; Biscarini, F; Orozco Ter Wengel, P; Caprera, A; Nazzicari, N; Stella, A

    2015-08-01

    Since the beginning of the genomic era, the number of available single nucleotide polymorphism (SNP) arrays has grown considerably. In the bovine species alone, 11 SNP chips not completely covered by intellectual property are currently available, and the number is growing. Genomic/genotype data are not standardized, and this hampers its exchange and integration. In addition, software used for the analyses of these data usually requires not standard (i.e. case specific) input files which, considering the large amount of data to be handled, require at least some programming skills in their production. In this work, we describe a software toolkit for SNP array data management, imputation, genome-wide association studies, population genetics and genomic selection. However, this toolkit does not solve the critical need for standardization of the genotypic data and software input files. It only highlights the chaotic situation each researcher has to face on a daily basis and gives some helpful advice on the currently available tools in order to navigate the SNP array data complexity.

  8. Development of SNP-genotyping arrays in two shellfish species.

    PubMed

    Lapègue, S; Harrang, E; Heurtebise, S; Flahauw, E; Donnadieu, C; Gayral, P; Ballenghien, M; Genestout, L; Barbotte, L; Mahla, R; Haffray, P; Klopp, C

    2014-07-01

    Use of SNPs has been favoured due to their abundance in plant and animal genomes, accompanied by the falling cost and rising throughput capacity for detection and genotyping. Here, we present in vitro (obtained from targeted sequencing) and in silico discovery of SNPs, and the design of medium-throughput genotyping arrays for two oyster species, the Pacific oyster, Crassostrea gigas, and European flat oyster, Ostrea edulis. Two sets of 384 SNP markers were designed for two Illumina GoldenGate arrays and genotyped on more than 1000 samples for each species. In each case, oyster samples were obtained from wild and selected populations and from three-generation families segregating for traits of interest in aquaculture. The rate of successfully genotyped polymorphic SNPs was about 60% for each species. Effects of SNP origin and quality on genotyping success (Illumina functionality Score) were analysed and compared with other model and nonmodel species. Furthermore, a simulation was made based on a subset of the C. gigas SNP array with a minor allele frequency of 0.3 and typical crosses used in shellfish hatcheries. This simulation indicated that at least 150 markers were needed to perform an accurate parental assignment. Such panels might provide valuable tools to improve our understanding of the connectivity between wild (and selected) populations and could contribute to future selective breeding programmes.

  9. Automated SNP genotype clustering algorithm to improve data completeness in high-throughput SNP genotyping datasets from custom arrays.

    PubMed

    Smith, Edward M; Littrell, Jack; Olivier, Michael

    2007-12-01

    High-throughput SNP genotyping platforms use automated genotype calling algorithms to assign genotypes. While these algorithms work efficiently for individual platforms, they are not compatible with other platforms, and have individual biases that result in missed genotype calls. Here we present data on the use of a second complementary SNP genotype clustering algorithm. The algorithm was originally designed for individual fluorescent SNP genotyping assays, and has been optimized to permit the clustering of large datasets generated from custom-designed Affymetrix SNP panels. In an analysis of data from a 3K array genotyped on 1,560 samples, the additional analysis increased the overall number of genotypes by over 45,000, significantly improving the completeness of the experimental data. This analysis suggests that the use of multiple genotype calling algorithms may be advisable in high-throughput SNP genotyping experiments. The software is written in Perl and is available from the corresponding author.

  10. Genome-wide SNP detection, validation, and development of an 8K SNP array for apple

    Technology Transfer Automated Retrieval System (TEKTRAN)

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide...

  11. A user guide to the Brassica 60K Illumina Infinium™ SNP genotyping array.

    PubMed

    Mason, Annaliese S; Higgins, Erin E; Snowdon, Rod J; Batley, Jacqueline; Stein, Anna; Werner, Christian; Parkin, Isobel A P

    2017-02-20

    The Brassica napus 60K Illumina Infinium™ SNP array has had huge international uptake in the rapeseed community due to the revolutionary speed of acquisition and ease of analysis of this high-throughput genotyping data, particularly when coupled with the newly available reference genome sequence. However, further utilization of this valuable resource can be optimized by better understanding the promises and pitfalls of SNP arrays. We outline how best to analyze Brassica SNP marker array data for diverse applications, including linkage and association mapping, genetic diversity and genomic introgression studies. We present data on which SNPs are locus-specific in winter, semi-winter and spring B. napus germplasm pools, rather than amplifying both an A-genome and a C-genome locus or multiple loci. Common issues that arise when analyzing array data will be discussed, particularly those unique to SNP markers and how to deal with these for practical applications in Brassica breeding applications.

  12. Construction of a versatile SNP array for pyramiding useful genes of rice.

    PubMed

    Kurokawa, Yusuke; Noda, Tomonori; Yamagata, Yoshiyuki; Angeles-Shim, Rosalyn; Sunohara, Hidehiko; Uehara, Kanako; Furuta, Tomoyuki; Nagai, Keisuke; Jena, Kshirod Kumar; Yasui, Hideshi; Yoshimura, Atsushi; Ashikari, Motoyuki; Doi, Kazuyuki

    2016-01-01

    DNA marker-assisted selection (MAS) has become an indispensable component of breeding. Single nucleotide polymorphisms (SNP) are the most frequent polymorphism in the rice genome. However, SNP markers are not readily employed in MAS because of limitations in genotyping platforms. Here the authors report a Golden Gate SNP array that targets specific genes controlling yield-related traits and biotic stress resistance in rice. As a first step, the SNP genotypes were surveyed in 31 parental varieties using the Affymetrix Rice 44K SNP microarray. The haplotype information for 16 target genes was then converted to the Golden Gate platform with 143-plex markers. Haplotypes for the 14 useful allele are unique and can discriminate among all other varieties. The genotyping consistency between the Affymetrix microarray and the Golden Gate array was 92.8%, and the accuracy of the Golden Gate array was confirmed in 3 F2 segregating populations. The concept of the haplotype-based selection by using the constructed SNP array was proofed.

  13. Optimal design of low-density SNP arrays for genomic prediction: algorithm and applications

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for their optimal design. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optim...

  14. Measuring diversity in Gossypium hirsutum using the CottonSNP63K Array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A CottonSNP63K array and accompanying cluster file has been developed and includes 45,104 intra-specific SNPs and 17,954 inter-specific SNPs for automated genotyping of cotton (Gossypium spp.) samples. Development of the cluster file included genotyping of 1,156 samples, a subset of which were iden...

  15. Fine mapping of copy number variations on two cattle genome assemblies using high density SNP array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Btau_4.0 and UMD3.1 are two distinct cattle reference genome assemblies. In our previous study using the low density BovineSNP50 array, we reported a copy number variation (CNV) analysis on Btau_4.0 with 521 animals of 21 cattle breeds, yielding 682 CNV regions with a total length of 139.8 megabases...

  16. Development of maizeSNP3072, a high-throughput compatible SNP array, for DNA fingerprinting identification of Chinese maize varieties.

    PubMed

    Tian, Hong-Li; Wang, Feng-Ge; Zhao, Jiu-Ran; Yi, Hong-Mei; Wang, Lu; Wang, Rui; Yang, Yang; Song, Wei

    Single nucleotide polymorphisms (SNPs) are abundant and evenly distributed throughout the maize (Zea mays L.) genome. SNPs have several advantages over simple sequence repeats, such as ease of data comparison and integration, high-throughput processing of loci, and identification of associated phenotypes. SNPs are thus ideal for DNA fingerprinting, genetic diversity analysis, and marker-assisted breeding. Here, we developed a high-throughput and compatible SNP array, maizeSNP3072, containing 3072 SNPs developed from the maizeSNP50 array. To improve genotyping efficiency, a high-quality cluster file, maizeSNP3072_GT.egt, was constructed. All 3072 SNP loci were localized within different genes, where they were distributed in exons (43 %), promoters (21 %), 3' untranslated regions (UTRs; 22 %), 5' UTRs (9 %), and introns (5 %). The average genotyping failure rate using these SNPs was only 6 %, or 3 % using the cluster file to call genotypes. The genotype consistency of repeat sample analysis on Illumina GoldenGate versus Infinium platforms exceeded 96.4 %. The minor allele frequency (MAF) of the SNPs averaged 0.37 based on data from 309 inbred lines. The 3072 SNPs were highly effective for distinguishing among 276 examined hybrids. Comparative analysis using Chinese varieties revealed that the 3072SNP array showed a better marker success rate and higher average MAF values, evaluation scores, and variety-distinguishing efficiency than the maizeSNP50K array. The maizeSNP3072 array thus can be successfully used in DNA fingerprinting identification of Chinese maize varieties and shows potential as a useful tool for germplasm resource evaluation and molecular marker-assisted breeding.

  17. Japonica array: improved genotype imputation by designing a population-specific SNP array with 1070 Japanese individuals

    PubMed Central

    Kawai, Yosuke; Mimori, Takahiro; Kojima, Kaname; Nariai, Naoki; Danjoh, Inaho; Saito, Rumiko; Yasuda, Jun; Yamamoto, Masayuki; Nagasaki, Masao

    2015-01-01

    The Tohoku Medical Megabank Organization constructed the reference panel (referred to as the 1KJPN panel), which contains >20 million single nucleotide polymorphisms (SNPs), from whole-genome sequence data from 1070 Japanese individuals. The 1KJPN panel contains the largest number of haplotypes of Japanese ancestry to date. Here, from the 1KJPN panel, we designed a novel custom-made SNP array, named the Japonica array, which is suitable for whole-genome imputation of Japanese individuals. The array contains 659 253 SNPs, including tag SNPs for imputation, SNPs of Y chromosome and mitochondria, and SNPs related to previously reported genome-wide association studies and pharmacogenomics. The Japonica array provides better imputation performance for Japanese individuals than the existing commercially available SNP arrays with both the 1KJPN panel and the International 1000 genomes project panel. For common SNPs (minor allele frequency (MAF)>5%), the genomic coverage of the Japonica array (r2>0.8) was 96.9%, that is, almost all common SNPs were covered by this array. Nonetheless, the coverage of low-frequency SNPs (0.5%array reached 67.2%, which is higher than those of the existing arrays. In addition, we confirmed the high quality genotyping performance of the Japonica array using the 288 samples in 1KJPN; the average call rate 99.7% and the average concordance rate 99.7% to the genotypes obtained from high-throughput sequencer. As demonstrated in this study, the creation of custom-made SNP arrays based on a population-specific reference panel is a practical way to facilitate further association studies through genome-wide genotype imputations. PMID:26108142

  18. Microfluidic linear hydrogel array for multiplexed single nucleotide polymorphism (SNP) detection.

    PubMed

    Jung, Yun Kyung; Kim, Jungkyu; Mathies, Richard A

    2015-03-17

    A PDMS-based microfluidic linear hydrogel array is developed for multiplexed single nucleotide polymorphism (SNP) detection. A sequence of three-dimensional (3D) hydrogel plugs containing the desired DNA probes is prepared by UV polymerization within a PDMS microchannel system. The fluorescently labeled target DNA is then electrophoresed through the sequence of hydrogel plugs for hybridization. Continued electrophoresis provides an electrophoretic wash that removes nonspecific binders. The capture gel array is imaged after washing at various temperatures (temperature gradient electrophoresis) to further distinguish perfect matches from mismatches. The ability of this microdevice to perform multiplex SNP genotyping is demonstrated by analyzing a mixture of model E. coli bacterial targets. This microfluidic hydrogel array is ∼1000 times more sensitive than planar microarrays due to the 3D gel capture, the hybridization time is much shorter due to electrophoretic control of the transport properties, and the stringent wash with temperature gradient electrophoresis enables analysis of single nucleotide mismatches with high specificity.

  19. Single nucleotide polymorphism (SNP) detection using microelectrode biochip array

    NASA Astrophysics Data System (ADS)

    Choi, Yong-Sung; Lee, Kyung-Sup; Park, Dae-Hee

    2005-10-01

    In this paper, a microelectrode array DNA chip was fabricated on a glass slide using photolithography technology. Several probe DNAs with mercaptohexyl moiety at their 5' end were immobilized on the gold electrodes by a DNA arrayer utilizing the affinity between gold and sulfur. Then target DNAs were hybridized and reacted with Hoechst 33258, which is a DNA minor groove binder and electrochemically active dye. Cyclic voltammetry in a 5 mM ferricyanide/ferrocyanide solution at 100 mV s-1 confirmed the immobilization of probe DNA on the gold electrodes. Linear sweep voltammetry or cyclic voltammetry showed a difference between target DNA and control DNA in the anodic peak current values. This was derived from Hoechst 33258 concentrated at the electrode surface through association with formed hybrids. It is suggested that this DNA chip could recognize sequence specific genes. It is also suggested that a multichannel electrochemical DNA microarray is useful to develop a portable device for a clinical gene diagnostic system.

  20. A Customized Pigmentation SNP Array Identifies a Novel SNP Associated with Melanoma Predisposition in the SLC45A2 Gene

    PubMed Central

    Alonso, Santos; Boyano, M. Dolores; Peña-Chilet, Maria; Pita, Guillermo; Aviles, Jose A.; Mayor, Matias; Gomez-Fernandez, Cristina; Casado, Beatriz; Martin-Gonzalez, Manuel; Izagirre, Neskuts; De la Rua, Concepcion; Asumendi, Aintzane; Perez-Yarza, Gorka; Arroyo-Berdugo, Yoana; Boldo, Enrique; Lozoya, Rafael; Torrijos-Aguilar, Arantxa; Pitarch, Ana; Pitarch, Gerard; Sanchez-Motilla, Jose M.; Valcuende-Cavero, Francisca; Tomas-Cabedo, Gloria; Perez-Pastor, Gemma; Diaz-Perez, Jose L.; Gardeazabal, Jesus; de Lizarduy, Iñigo Martinez; Sanchez-Diez, Ana; Valdes, Carlos; Pizarro, Angel; Casado, Mariano; Carretero, Gregorio; Botella-Estrada, Rafael; Nagore, Eduardo; Lazaro, Pablo; Lluch, Ana; Benitez, Javier; Martinez-Cadenas, Conrado; Ribas, Gloria

    2011-01-01

    As the incidence of Malignant Melanoma (MM) reflects an interaction between skin colour and UV exposure, variations in genes implicated in pigmentation and tanning response to UV may be associated with susceptibility to MM. In this study, 363 SNPs in 65 gene regions belonging to the pigmentation pathway have been successfully genotyped using a SNP array. Five hundred and ninety MM cases and 507 controls were analyzed in a discovery phase I. Ten candidate SNPs based on a p-value threshold of 0.01 were identified. Two of them, rs35414 (SLC45A2) and rs2069398 (SILV/CKD2), were statistically significant after conservative Bonferroni correction. The best six SNPs were further tested in an independent Spanish series (624 MM cases and 789 controls). A novel SNP located on the SLC45A2 gene (rs35414) was found to be significantly associated with melanoma in both phase I and phase II (P<0.0001). None of the other five SNPs were replicated in this second phase of the study. However, three SNPs in TYR, SILV/CDK2 and ADAMTS20 genes (rs17793678, rs2069398 and rs1510521 respectively) had an overall p-value<0.05 when considering the whole DNA collection (1214 MM cases and 1296 controls). Both the SLC45A2 and the SILV/CDK2 variants behave as protective alleles, while the TYR and ADAMTS20 variants seem to function as risk alleles. Cumulative effects were detected when these four variants were considered together. Furthermore, individuals carrying two or more mutations in MC1R, a well-known low penetrance melanoma-predisposing gene, had a decreased MM risk if concurrently bearing the SLC45A2 protective variant. To our knowledge, this is the largest study on Spanish sporadic MM cases to date. PMID:21559390

  1. Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications.

    PubMed

    Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R; Taylor, Jeremy F; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart

    2016-01-01

    Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. The

  2. Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications

    PubMed Central

    Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R.; Taylor, Jeremy F.; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart

    2016-01-01

    Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. The

  3. Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao

    PubMed Central

    Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos

    2015-01-01

    Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity. PMID:26070980

  4. Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao.

    PubMed

    Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos

    2015-08-01

    Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity.

  5. Sensitive Quantification of Mosaicism Using High Density SNP Arrays and the Cumulative Distribution Function

    PubMed Central

    Markello, Thomas C.; Carlson-Donohoe, Hannah; Sincan, Murat; Adams, David; Bodine, David M.; Farrar, Jason E.; Vlachos, Adrianna; Lipton, Jeffrey M.; Auerbach, Arleen D.; Ostrander, Elaine A.; Chandrasekharappa, Settara C.; Boerkoel, Cornelius F.; Gahl, William A.

    2012-01-01

    Medicine is rapidly applying exome and genome sequencing to the diagnosis and management of human disease. Somatic mosaicism, however, is not readily detectable by these means, and yet it accounts for a significant portion of undiagnosed disease. We present a rapid and sensitive method, the Continuous Distribution Function as applied to single nucleotide polymorphism (SNP) array data, to quantify somatic mosaicism throughout the genome. We also demonstrate application of the method to novel diseases and mechanisms. PMID:22277120

  6. Evaluating the performance of Affymetrix SNP Array 6.0 platform with 400 Japanese individuals

    PubMed Central

    Nishida, Nao; Koike, Asako; Tajima, Atsushi; Ogasawara, Yuko; Ishibashi, Yoshimi; Uehara, Yasuka; Inoue, Ituro; Tokunaga, Katsushi

    2008-01-01

    Background With improvements in genotyping technologies, genome-wide association studies with hundreds of thousands of SNPs allow the identification of candidate genetic loci for multifactorial diseases in different populations. However, genotyping errors caused by genotyping platforms or genotype calling algorithms may lead to inflation of false associations between markers and phenotypes. In addition, the number of SNPs available for genome-wide association studies in the Japanese population has been investigated using only 45 samples in the HapMap project, which could lead to an inaccurate estimation of the number of SNPs with low minor allele frequencies. We genotyped 400 Japanese samples in order to estimate the number of SNPs available for genome-wide association studies in the Japanese population and to examine the performance of the current SNP Array 6.0 platform and the genotype calling algorithm "Birdseed". Results About 20% of the 909,622 SNP markers on the array were revealed to be monomorphic in the Japanese population. Consequently, 661,599 SNPs were available for genome-wide association studies in the Japanese population, after excluding the poorly behaving SNPs. The Birdseed algorithm accurately determined the genotype calls of each sample with a high overall call rate of over 99.5% and a high concordance rate of over 99.8% using more than 48 samples after removing low-quality samples by adjusting QC criteria. Conclusion Our results confirmed that the SNP Array 6.0 platform reached the level reported by the manufacturer, and thus genome-wide association studies using the SNP Array 6.0 platform have considerable potential to identify candidate susceptibility or resistance genetic factors for multifactorial diseases in the Japanese population, as well as in other populations. PMID:18803882

  7. Development and Evaluation of a 9K SNP Array for Peach by Internationally Coordinated SNP Detection and Validation in Breeding Germplasm

    PubMed Central

    Scalabrin, Simone; Gilmore, Barbara; Lawley, Cynthia T.; Gasic, Ksenija; Micheletti, Diego; Rosyara, Umesh R.; Cattonaro, Federica; Vendramin, Elisa; Main, Dorrie; Aramini, Valeria; Blas, Andrea L.; Mockler, Todd C.; Bryant, Douglas W.; Wilhelm, Larry; Troggio, Michela; Sosinski, Bryon; Aranzana, Maria José; Arús, Pere; Iezzoni, Amy; Morgante, Michele; Peace, Cameron

    2012-01-01

    Although a large number of single nucleotide polymorphism (SNP) markers covering the entire genome are needed to enable molecular breeding efforts such as genome wide association studies, fine mapping, genomic selection and marker-assisted selection in peach [Prunus persica (L.) Batsch] and related Prunus species, only a limited number of genetic markers, including simple sequence repeats (SSRs), have been available to date. To address this need, an international consortium (The International Peach SNP Consortium; IPSC) has pursued a coordinated effort to perform genome-scale SNP discovery in peach using next generation sequencing platforms to develop and characterize a high-throughput Illumina Infinium® SNP genotyping array platform. We performed whole genome re-sequencing of 56 peach breeding accessions using the Illumina and Roche/454 sequencing technologies. Polymorphism detection algorithms identified a total of 1,022,354 SNPs. Validation with the Illumina GoldenGate® assay was performed on a subset of the predicted SNPs, verifying ∼75% of genic (exonic and intronic) SNPs, whereas only about a third of intergenic SNPs were verified. Conservative filtering was applied to arrive at a set of 8,144 SNPs that were included on the IPSC peach SNP array v1, distributed over all eight peach chromosomes with an average spacing of 26.7 kb between SNPs. Use of this platform to screen a total of 709 accessions of peach in two separate evaluation panels identified a total of 6,869 (84.3%) polymorphic SNPs. The almost 7,000 SNPs verified as polymorphic through extensive empirical evaluation represent an excellent source of markers for future studies in genetic relatedness, genetic mapping, and dissecting the genetic architecture of complex agricultural traits. The IPSC peach SNP array v1 is commercially available and we expect that it will be used worldwide for genetic studies in peach and related stone fruit and nut species. PMID:22536421

  8. SNP Discovery and Development of a High-Density Genotyping Array for Sunflower

    PubMed Central

    Bachlava, Eleni; Taylor, Christopher A.; Tang, Shunxue; Bowers, John E.; Mandel, Jennifer R.; Burke, John M.; Knapp, Steven J.

    2012-01-01

    Recent advances in next-generation DNA sequencing technologies have made possible the development of high-throughput SNP genotyping platforms that allow for the simultaneous interrogation of thousands of single-nucleotide polymorphisms (SNPs). Such resources have the potential to facilitate the rapid development of high-density genetic maps, and to enable genome-wide association studies as well as molecular breeding approaches in a variety of taxa. Herein, we describe the development of a SNP genotyping resource for use in sunflower (Helianthus annuus L.). This work involved the development of a reference transcriptome assembly for sunflower, the discovery of thousands of high quality SNPs based on the generation and analysis of ca. 6 Gb of transcriptome re-sequencing data derived from multiple genotypes, the selection of 10,640 SNPs for inclusion in the genotyping array, and the use of the resulting array to screen a diverse panel of sunflower accessions as well as related wild species. The results of this work revealed a high frequency of polymorphic SNPs and relatively high level of cross-species transferability. Indeed, greater than 95% of successful SNP assays revealed polymorphism, and more than 90% of these assays could be successfully transferred to related wild species. Analysis of the polymorphism data revealed patterns of genetic differentiation that were largely congruent with the evolutionary history of sunflower, though the large number of markers allowed for finer resolution than has previously been possible. PMID:22238659

  9. SNP discovery and development of a high-density genotyping array for sunflower.

    PubMed

    Bachlava, Eleni; Taylor, Christopher A; Tang, Shunxue; Bowers, John E; Mandel, Jennifer R; Burke, John M; Knapp, Steven J

    2012-01-01

    Recent advances in next-generation DNA sequencing technologies have made possible the development of high-throughput SNP genotyping platforms that allow for the simultaneous interrogation of thousands of single-nucleotide polymorphisms (SNPs). Such resources have the potential to facilitate the rapid development of high-density genetic maps, and to enable genome-wide association studies as well as molecular breeding approaches in a variety of taxa. Herein, we describe the development of a SNP genotyping resource for use in sunflower (Helianthus annuus L.). This work involved the development of a reference transcriptome assembly for sunflower, the discovery of thousands of high quality SNPs based on the generation and analysis of ca. 6 Gb of transcriptome re-sequencing data derived from multiple genotypes, the selection of 10,640 SNPs for inclusion in the genotyping array, and the use of the resulting array to screen a diverse panel of sunflower accessions as well as related wild species. The results of this work revealed a high frequency of polymorphic SNPs and relatively high level of cross-species transferability. Indeed, greater than 95% of successful SNP assays revealed polymorphism, and more than 90% of these assays could be successfully transferred to related wild species. Analysis of the polymorphism data revealed patterns of genetic differentiation that were largely congruent with the evolutionary history of sunflower, though the large number of markers allowed for finer resolution than has previously been possible.

  10. Construction and evaluation of a high-density SNP array for the Pacific oyster (Crassostrea gigas)

    PubMed Central

    Li, Chunyan; Wang, Wei; Li, Busu; Li, Li

    2017-01-01

    Single nucleotide polymorphisms (SNPs) are widely used in genetics and genomics research. The Pacific oyster (Crassostrea gigas) is an economically and ecologically important marine bivalve, and it possesses one of the highest levels of genomic DNA variation among animal species. Pacific oyster SNPs have been extensively investigated; however, the mechanisms by which these SNPs may be used in a high-throughput, transferable, and economical manner remain to be elucidated. Here, we constructed an oyster 190K SNP array using Affymetrix Axiom genotyping technology. We designed 190,420 SNPs on the chip; these SNPs were selected from 54 million SNPs identified through re-sequencing of 472 Pacific oysters collected in China, Japan, Korea, and Canada. Our genotyping results indicated that 133,984 (70.4%) SNPs were polymorphic and successfully converted on the chip. The SNPs were distributed evenly throughout the oyster genome, located in 3,595 scaffolds with a length of ~509.4 million; the average interval spacing was 4,210 bp. In addition, 111,158 SNPs were distributed in 21,050 coding genes, with an average of 5.3 SNPs per gene. In comparison with genotypes obtained through re-sequencing, ~69% of the converted SNPs had a concordance rate of >0.971; the mean concordance rate was 0.966. Evaluation based on genotypes of full-sib family individuals revealed that the average genotyping accuracy rate was 0.975. Carrying 133 K polymorphic SNPs, our oyster 190K SNP array is the first commercially available high-density SNP chip for mollusks, with the highest throughput. It represents a valuable tool for oyster genome-wide association studies, fine linkage mapping, and population genetics. PMID:28328985

  11. Tomato breeding in the genomics era: insights from a SNP array

    PubMed Central

    2013-01-01

    Background The major bottle neck in genetic and linkage studies in tomato has been the lack of a sufficient number of molecular markers. This has radically changed with the application of next generation sequencing and high throughput genotyping. A set of 6000 SNPs was identified and 5528 of them were used to evaluate tomato germplasm at the level of species, varieties and segregating populations. Results From the 5528 SNPs, 1980 originated from 454-sequencing, 3495 from Illumina Solexa sequencing and 53 were additional known markers. Genotyping different tomato samples allowed the evaluation of the level of heterozygosity and introgressions among commercial varieties. Cherry tomatoes were especially different from round/beefs in chromosomes 4, 5 and 12. We were able to identify a set of 750 unique markers distinguishing S. lycopersicum ‘Moneymaker’ from all its distantly related wild relatives. Clustering and neighbour joining analysis among varieties and species showed expected grouping patterns, with S. pimpinellifolium as the most closely related to commercial tomatoesearlier results. Conclusions Our results show that a SNP search in only a few breeding lines already provides generally applicable markers in tomato and its wild relatives. It also shows that the Illumina bead array generated data are highly reproducible. Our SNPs can roughly be divided in two categories: SNPs of which both forms are present in the wild relatives and in domesticated tomatoes (originating from common ancestors) and SNPs unique for the domesticated tomato (originating from after the domestication event). The SNPs can be used for genotyping, identification of varieties, comparison of genetic and physical linkage maps and to confirm (phylogenetic) relations. In the SNPs used for the array there is hardly any overlap with the SolCAP array and it is strongly recommended to combine both SNP sets and to select a core collection of robust SNPs completely covering the entire tomato

  12. Syndromic ciliopathies: From single gene to multi gene analysis by SNP arrays and next generation sequencing.

    PubMed

    Knopp, C; Rudnik-Schöneborn, S; Eggermann, T; Bergmann, C; Begemann, M; Schoner, K; Zerres, K; Ortiz Brüchle, N

    2015-10-01

    Joubert syndrome (JS) and related disorders (JSRD), Meckel syndrome (MKS) and Bardet-Biedl syndrome (BBS) are autosomal recessive ciliopathies with a broad clinical and genetic overlap. In our multiethnic cohort of 88 MKS, 61 JS/JSRD and 66 BBS families we performed genetic analyses and were able to determine mutation frequencies and detection rates for the most frequently mutated MKS genes. On the basis of determined mutation frequencies, a next generation gene panel for JS/JSRD and MKS was established. Furthermore 35 patients from 26 unrelated consanguineous families were investigated by SNP array-based homozygosity mapping and subsequent DNA sequencing of known candidate genes according to runs of homozygosity size in descending order. This led to the identification of the causative homozygous mutation in 62% of unrelated index cases. Based on our data we discuss various strategies for diagnostic mutation detection in the syndromic ciliopathies JS/JSRD, MKS and BBS.

  13. Mapping of Genetic Abnormalities of Primary Tumours from Metastatic CRC by High-Resolution SNP Arrays

    PubMed Central

    Sayagués, José María; Fontanillo, Celia; Abad, María del Mar; González-González, María; Sarasquete, María Eugenia; del Carmen Chillon, Maria; Garcia, Eva; Bengoechea, Oscar; Fonseca, Emilio; Gonzalez-Diaz, Marcos; De Las Rivas, Javier

    2010-01-01

    Background For years, the genetics of metastatic colorectal cancer (CRC) have been studied using a variety of techniques. However, most of the approaches employed so far have a relatively limited resolution which hampers detailed characterization of the common recurrent chromosomal breakpoints as well as the identification of small regions carrying genetic changes and the genes involved in them. Methodology/Principal Findings Here we applied 500K SNP arrays to map the most common chromosomal lesions present at diagnosis in a series of 23 primary tumours from sporadic CRC patients who had developed liver metastasis. Overall our results confirm that the genetic profile of metastatic CRC is defined by imbalanced gains of chromosomes 7, 8q, 11q, 13q, 20q and X together with losses of the 1p, 8p, 17p and 18q chromosome regions. In addition, SNP-array studies allowed the identification of small (<1.3 Mb) and extensive/large (>1.5 Mb) altered DNA sequences, many of which contain cancer genes known to be involved in CRC and the metastatic process. Detailed characterization of the breakpoint regions for the altered chromosomes showed four recurrent breakpoints at chromosomes 1p12, 8p12, 17p11.2 and 20p12.1; interestingly, the most frequently observed recurrent chromosomal breakpoint was localized at 17p11.2 and systematically targeted the FAM27L gene, whose role in CRC deserves further investigations. Conclusions/Significance In summary, in the present study we provide a detailed map of the genetic abnormalities of primary tumours from metastatic CRC patients, which confirm and extend on previous observations as regards the identification of genes potentially involved in development of CRC and the metastatic process. PMID:21060790

  14. MA-SNP--A new genotype calling method for oligonucleotide SNP arrays modeling the batch effect with a normal mixture model.

    PubMed

    Wen, Yalu; Li, Ming; Fu, Wenjiang J

    2011-08-30

    Genome-wide association studies hold great promise in identifying disease-susceptibility variants and understanding the genetic etiology of complex diseases. Microarray technology enables the genotyping of millions of single nucleotide polymorphisms. Many factors in microarray studies, such as probe selection, sample quality, and experimental process and batch, have substantial effect on the genotype calling accuracy, which is crucial for downstream analyses. Failure to account for the variability of these sources may lead to inaccurate genotype calls and false positive and false negative findings. In this study, we develop a SNP-specific genotype calling algorithm based on the probe intensity composite representation (PICR) model, while using a normal mixture model to account for the variability of batch effect on the genotype calls. We demonstrate our method with SNP array data in a few studies, including the HapMap project, the coronary heart disease and the UK Blood Service Control studies by the Wellcome Trust Case-Control Consortium, and a methylation profiling study. Our single array based approach outperforms PICR and is comparable to the best multi-array genotype calling methods.

  15. A large maize (Zea Mays L.) SNP genotyping array: development and germplasm genotyping, and genetic mapping to compare with the B73 reference genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection for accelerated breeding. We report the establishment of a large SNP array for maize and i...

  16. Development and evaluation of a genome-wide 6K SNP array for diploid sweet cherry and tetraploid sour cherry

    Technology Transfer Automated Retrieval System (TEKTRAN)

    High-throughput genome scans are important tools for genetic studies and breeding applications. Here, a 6K SNP array for use with the Illumina Infinium® system was developed for diploid sweet cherry (Prunus avium) and allotetraploid sour cherry (P. cerasus). This effort was led by RosBREED, a commun...

  17. Development and application of a novel genome-wide SNP array reveals domestication history in soybean.

    PubMed

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-02-09

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean.

  18. 250-kW CW klystron amplifier for planetary radar

    NASA Technical Reports Server (NTRS)

    Cormier, Reginald A.; Mizuhara, Albert

    1992-01-01

    The design, construction, and performance testing is described of two Varian klystrons, model VKX-7864A, which replaced the aging and less efficient VA-949J klystrons in the X band planetary radar transmitter on the Goldstone, CA, 70 meter antenna. The project was carried out jointly by the JPL and Varian Assoc. Output power was increased from 200 to 250 kW continuous wave per klystron, and full dc beam power is dissipated in the collector (it was not possible to operate the VA-949J klystrons without RF drive because of limited collector dissipation capability). Replacement were made with a minimum of transmitter modifciations. The planetary radar transmitter is now operating successfully with these two klystrons.

  19. Re-evaluating data quality of dog mitochondrial, Y chromosomal, and autosomal SNPs genotyped by SNP array.

    PubMed

    O Otecko, Newton; Peng, Min-Sheng; Yang, He-Chuan; Zhang, Ya-Ping; Wang, Guo-Dong

    2016-11-18

    Quality deficiencies in single nucleotide polymorphism (SNP) analyses have important implications. We used missingness rates to investigate the quality of a recently published dataset containing 424 mitochondrial, 211 Y chromosomal, and 160 432 autosomal SNPs generated by a semicustom Illumina SNP array from 5 392 dogs and 14 grey wolves. Overall, the individual missingness rate for mitochondrial SNPs was ~43.8%, with 980 (18.1%) individuals completely missing mitochondrial SNP genotyping (missingness rate=1). In males, the genotype missingness rate was ~28.8% for Y chromosomal SNPs, with 374 males recording rates above 0.96. These 374 males also exhibited completely failed mitochondrial SNPs genotyping, indicative of a batch effect. Individual missingness rates for autosomal markers were greater than zero, but less than 0.5. Neither mitochondrial nor Y chromosomal SNPs achieved complete genotyping (locus missingness rate=0), whereas 5.9% of autosomal SNPs had a locus missingness rate=1. The high missingness rates and possible batch effect show that caution and rigorous measures are vital when genotyping and analyzing SNP array data for domestic animals. Further improvements of these arrays will be helpful to future studies.

  20. Re-evaluating data quality of dog mitochondrial, Y chromosomal, and autosomal SNPs genotyped by SNP array

    PubMed Central

    OTECKO, Newton O.; PENG, Min-Sheng; YANG, He-Chuan; ZHANG, Ya-Ping; WANG, Guo-Dong

    2016-01-01

    Quality deficiencies in single nucleotide polymorphism (SNP) analyses have important implications. We used missingness rates to investigate the quality of a recently published dataset containing 424 mitochondrial, 211 Y chromosomal, and 160 432 autosomal SNPs generated by a semicustom Illumina SNP array from 5 392 dogs and 14 grey wolves. Overall, the individual missingness rate for mitochondrial SNPs was ~43.8%, with 980 (18.1%) individuals completely missing mitochondrial SNP genotyping (missingness rate=1). In males, the genotype missingness rate was ~28.8% for Y chromosomal SNPs, with 374 males recording rates above 0.96. These 374 males also exhibited completely failed mitochondrial SNPs genotyping, indicative of a batch effect. Individual missingness rates for autosomal markers were greater than zero, but less than 0.5. Neither mitochondrial nor Y chromosomal SNPs achieved complete genotyping (locus missingness rate=0), whereas 5.9% of autosomal SNPs had a locus missingness rate=1. The high missingness rates and possible batch effect show that caution and rigorous measures are vital when genotyping and analyzing SNP array data for domestic animals. Further improvements of these arrays will be helpful to future studies. PMID:28105800

  1. Development of an Alfalfa SNP Array and Its Use to Evaluate Patterns of Population Structure and Linkage Disequilibrium

    PubMed Central

    Li, Xuehui; Han, Yuanhong; Wei, Yanling; Acharya, Ananta; Farmer, Andrew D.; Ho, Julie; Monteros, Maria J.; Brummer, E. Charles

    2014-01-01

    A large set of genome-wide markers and a high-throughput genotyping platform can facilitate the genetic dissection of complex traits and accelerate molecular breeding applications. Previously, we identified about 0.9 million SNP markers by sequencing transcriptomes of 27 diverse alfalfa genotypes. From this SNP set, we developed an Illumina Infinium array containing 9,277 SNPs. Using this array, we genotyped 280 diverse alfalfa genotypes and several genotypes from related species. About 81% (7,476) of the SNPs met the criteria for quality control and showed polymorphisms. The alfalfa SNP array also showed a high level of transferability for several closely related Medicago species. Principal component analysis and model-based clustering showed clear population structure corresponding to subspecies and ploidy levels. Within cultivated tetraploid alfalfa, genotypes from dormant and nondormant cultivars were largely assigned to different clusters; genotypes from semidormant cultivars were split between the groups. The extent of linkage disequilibrium (LD) across all genotypes rapidly decayed to 26 Kbp at r2 = 0.2, but the rate varied across ploidy levels and subspecies. A high level of consistency in LD was found between and within the two subpopulations of cultivated dormant and nondormant alfalfa suggesting that genome-wide association studies (GWAS) and genomic selection (GS) could be conducted using alfalfa genotypes from throughout the fall dormancy spectrum. However, the relatively low LD levels would require a large number of markers to fully saturate the genome. PMID:24416217

  2. A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer

    PubMed Central

    Li, Ming; Wen, Yalu; Fu, Wenjiang

    2014-01-01

    Cumulative evidence has shown that structural variations, due to insertions, deletions, and inversions of DNA, may contribute considerably to the development of complex human diseases, such as breast cancer. High-throughput genotyping technologies, such as Affymetrix high density single-nucleotide polymorphism (SNP) arrays, have produced large amounts of genetic data for genome-wide SNP genotype calling and copy number estimation. Meanwhile, there is a great need for accurate and efficient statistical methods to detect copy number variants. In this article, we introduce a hidden-Markov-model (HMM)-based method, referred to as the PICR-CNV, for copy number inference. The proposed method first estimates copy number abundance for each single SNP on a single array based on the raw fluorescence values, and then standardizes the estimated copy number abundance to achieve equal footing among multiple arrays. This method requires no between-array normalization, and thus, maintains data integrity and independence of samples among individual subjects. In addition to our efforts to apply new statistical technology to raw fluorescence values, the HMM has been applied to the standardized copy number abundance in order to reduce experimental noise. Through simulations, we show our refined method is able to infer copy number variants accurately. Application of the proposed method to a breast cancer dataset helps to identify genomic regions significantly associated with the disease. PMID:26279618

  3. Development of high-density SNP genotyping arrays for white spruce (Picea glauca) and transferability to subtropical and nordic congeners.

    PubMed

    Pavy, Nathalie; Gagnon, France; Rigault, Philippe; Blais, Sylvie; Deschênes, Astrid; Boyle, Brian; Pelgas, Betty; Deslauriers, Marie; Clément, Sébastien; Lavigne, Patricia; Lamothe, Manuel; Cooke, Janice E K; Jaramillo-Correa, Juan P; Beaulieu, Jean; Isabel, Nathalie; Mackay, John; Bousquet, Jean

    2013-03-01

    High-density SNP genotyping arrays can be designed for any species given sufficient sequence information of high quality. Two high-density SNP arrays relying on the Infinium iSelect technology (Illumina) were designed for use in the conifer white spruce (Picea glauca). One array contained 7338 segregating SNPs representative of 2814 genes of various molecular functional classes for main uses in genetic association and population genetics studies. The other one contained 9559 segregating SNPs representative of 9543 genes for main uses in population genetics, linkage mapping of the genome and genomic prediction. The SNPs assayed were discovered from various sources of gene resequencing data. SNPs predicted from high-quality sequences derived from genomic DNA reached a genotyping success rate of 64.7%. Nonsingleton in silico SNPs (i.e. a sequence polymorphism present in at least two reads) predicted from expressed sequenced tags obtained with the Roche 454 technology and Illumina GAII analyser resulted in a similar genotyping success rate of 71.6% when the deepest alignment was used and the most favourable SNP probe per gene was selected. A variable proportion of these SNPs was shared by other nordic and subtropical spruce species from North America and Europe. The number of shared SNPs was inversely proportional to phylogenetic divergence and standing genetic variation in the recipient species, but positively related to allele frequency in P. glauca natural populations. These validated SNP resources should open up new avenues for population genetics and comparative genetic mapping at a genomic scale in spruce species.

  4. Interim report on updated microarray probes for the LLNL Burkholderia pseudomallei SNP array

    SciTech Connect

    Gardner, S; Jaing, C

    2012-03-27

    The overall goal of this project is to forensically characterize 100 unknown Burkholderia isolates in the US-Australia collaboration. We will identify genome-wide single nucleotide polymorphisms (SNPs) from B. pseudomallei and near neighbor species including B. mallei, B. thailandensis and B. oklahomensis. We will design microarray probes to detect these SNP markers and analyze 100 Burkholderia genomic DNAs extracted from environmental, clinical and near neighbor isolates from Australian collaborators on the Burkholderia SNP microarray. We will analyze the microarray genotyping results to characterize the genetic diversity of these new isolates and triage the samples for whole genome sequencing. In this interim report, we described the SNP analysis and the microarray probe design for the Burkholderia SNP microarray.

  5. Bivariate segmentation of SNP-array data for allele-specific copy number analysis in tumour samples

    PubMed Central

    2013-01-01

    Background SNP arrays output two signals that reflect the total genomic copy number (LRR) and the allelic ratio (BAF), which in combination allow the characterisation of allele-specific copy numbers (ASCNs). While methods based on hidden Markov models (HMMs) have been extended from array comparative genomic hybridisation (aCGH) to jointly handle the two signals, only one method based on change-point detection, ASCAT, performs bivariate segmentation. Results In the present work, we introduce a generic framework for bivariate segmentation of SNP array data for ASCN analysis. For the matter, we discuss the characteristics of the typically applied BAF transformation and how they affect segmentation, introduce concepts of multivariate time series analysis that are of concern in this field and discuss the appropriate formulation of the problem. The framework is implemented in a method named CnaStruct, the bivariate form of the structural change model (SCM), which has been successfully applied to transcriptome mapping and aCGH. Conclusions On a comprehensive synthetic dataset, we show that CnaStruct outperforms the segmentation of existing ASCN analysis methods. Furthermore, CnaStruct can be integrated into the workflows of several ASCN analysis tools in order to improve their performance, specially on tumour samples highly contaminated by normal cells. PMID:23497144

  6. A HapMap leads to a Capsicum annuum SNP infinium array: a new tool for pepper breeding.

    PubMed

    Hulse-Kemp, Amanda M; Ashrafi, Hamid; Plieske, Joerg; Lemm, Jana; Stoffel, Kevin; Hill, Theresa; Luerssen, Hartmut; Pethiyagoda, Charit L; Lawley, Cindy T; Ganal, Martin W; Van Deynze, Allen

    2016-01-01

    The Capsicum genus (Pepper) is a part of the Solanacae family. It has been important in many cultures worldwide for its key nutritional components and uses as spices, medicines, ornamentals and vegetables. Worldwide population growth is associated with demand for more nutritionally valuable vegetables while contending with decreasing resources and available land. These conditions require increased efficiency in pepper breeding to deal with these imminent challenges. Through resequencing of inbred lines we have completed a valuable haplotype map (HapMap) for the pepper genome based on single-nucleotide polymorphisms (SNP). The identified SNPs were annotated and classified based on their gene annotation in the pepper draft genome sequence and phenotype of the sequenced inbred lines. A selection of one marker per gene model was utilized to create the PepperSNP16K array, which simultaneously genotyped 16 405 SNPs, of which 90.7% were found to be informative. A set of 84 inbred and hybrid lines and a mapping population of 90 interspecific F2 individuals were utilized to validate the array. Diversity analysis of the inbred lines shows a distinct separation of bell versus chile/hot pepper types and separates them into five distinct germplasm groups. The interspecific population created between Tabasco (C. frutescens chile type) and P4 (C. annuum blocky type) produced a linkage map with 5546 markers separated into 1361 bins on twelve 12 linkage groups representing 1392.3 cM. This publically available genotyping platform can be used to rapidly assess a large number of markers in a reproducible high-throughput manner for pepper. As a standardized tool for genetic analyses, the PepperSNP16K can be used worldwide to share findings and analyze QTLs for important traits leading to continued improvement of pepper for consumers. Data and information on the array are available through the Solanaceae Genomics Network.

  7. A HapMap leads to a Capsicum annuum SNP infinium array: a new tool for pepper breeding

    PubMed Central

    Hulse-Kemp, Amanda M; Ashrafi, Hamid; Plieske, Joerg; Lemm, Jana; Stoffel, Kevin; Hill, Theresa; Luerssen, Hartmut; Pethiyagoda, Charit L; Lawley, Cindy T; Ganal, Martin W; Van Deynze, Allen

    2016-01-01

    The Capsicum genus (Pepper) is a part of the Solanacae family. It has been important in many cultures worldwide for its key nutritional components and uses as spices, medicines, ornamentals and vegetables. Worldwide population growth is associated with demand for more nutritionally valuable vegetables while contending with decreasing resources and available land. These conditions require increased efficiency in pepper breeding to deal with these imminent challenges. Through resequencing of inbred lines we have completed a valuable haplotype map (HapMap) for the pepper genome based on single-nucleotide polymorphisms (SNP). The identified SNPs were annotated and classified based on their gene annotation in the pepper draft genome sequence and phenotype of the sequenced inbred lines. A selection of one marker per gene model was utilized to create the PepperSNP16K array, which simultaneously genotyped 16 405 SNPs, of which 90.7% were found to be informative. A set of 84 inbred and hybrid lines and a mapping population of 90 interspecific F2 individuals were utilized to validate the array. Diversity analysis of the inbred lines shows a distinct separation of bell versus chile/hot pepper types and separates them into five distinct germplasm groups. The interspecific population created between Tabasco (C. frutescens chile type) and P4 (C. annuum blocky type) produced a linkage map with 5546 markers separated into 1361 bins on twelve 12 linkage groups representing 1392.3 cM. This publically available genotyping platform can be used to rapidly assess a large number of markers in a reproducible high-throughput manner for pepper. As a standardized tool for genetic analyses, the PepperSNP16K can be used worldwide to share findings and analyze QTLs for important traits leading to continued improvement of pepper for consumers. Data and information on the array are available through the Solanaceae Genomics Network. PMID:27602231

  8. Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao.

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ~4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification pr...

  9. A high-throughput SNP array in the amphidiploid species Brassica napus shows diversity in resistance genes.

    PubMed

    Dalton-Morgan, Jessica; Hayward, Alice; Alamery, Salman; Tollenaere, Reece; Mason, Annaliese S; Campbell, Emma; Patel, Dhwani; Lorenc, Michał T; Yi, Bin; Long, Yan; Meng, Jinling; Raman, Rosy; Raman, Harsh; Lawley, Cindy; Edwards, David; Batley, Jacqueline

    2014-12-01

    Single-nucleotide polymorphisms (SNPs)are molecular markers based on nucleotide variation and can be used for genotyping assays across populations and to track genomic inheritance. SNPs offer a comprehensive genotyping alternative to whole-genome sequencing for both agricultural and research purposes including molecular breeding and diagnostics, genome evolution and genetic diversity analyses, genetic mapping, and trait association studies. Here genomic SNPs were discovered between four cultivars of the important amphidiploid oilseed species Brassica napus and used to develop a B. napus Infinium™ array containing 5,306 SNPs randomly dispersed across the genome. Assay success was high, with >94 % of these producing a reproducible, polymorphic genotype in the 1,070 samples screened. Although the assay was designed to B. napus, successful SNP amplification was achieved in the B. napus progenitor species, Brassica rapa and Brassica oleracea, and to a lesser extent in the related species Brassica nigra. Phylogenetic analysis was consistent with the expected relationships between B. napus individuals. This study presents an efficient custom SNP assay development pipeline in the complex polyploid Brassica genome and demonstrates the utility of the array for high-throughput genotyping in a number of related Brassica species. It also demonstrates the utility of this assay in genotyping resistance genes on chromosome A7, which segregate amongst the 1,070 samples.

  10. A 34K SNP genotyping array for Populus trichocarpa: design, application to the study of natural populations and transferability to other Populus species.

    PubMed

    Geraldes, A; Difazio, S P; Slavov, G T; Ranjan, P; Muchero, W; Hannemann, J; Gunter, L E; Wymore, A M; Grassa, C J; Farzaneh, N; Porth, I; McKown, A D; Skyba, O; Li, E; Fujita, M; Klápště, J; Martin, J; Schackwitz, W; Pennacchio, C; Rokhsar, D; Friedmann, M C; Wasteneys, G O; Guy, R D; El-Kassaby, Y A; Mansfield, S D; Cronk, Q C B; Ehlting, J; Douglas, C J; Tuskan, G A

    2013-03-01

    Genetic mapping of quantitative traits requires genotypic data for large numbers of markers in many individuals. For such studies, the use of large single nucleotide polymorphism (SNP) genotyping arrays still offers the most cost-effective solution. Herein we report on the design and performance of a SNP genotyping array for Populus trichocarpa (black cottonwood). This genotyping array was designed with SNPs pre-ascertained in 34 wild accessions covering most of the species latitudinal range. We adopted a candidate gene approach to the array design that resulted in the selection of 34 131 SNPs, the majority of which are located in, or within 2 kb of, 3543 candidate genes. A subset of the SNPs on the array (539) was selected based on patterns of variation among the SNP discovery accessions. We show that more than 95% of the loci produce high quality genotypes and that the genotyping error rate for these is likely below 2%. We demonstrate that even among small numbers of samples (n = 10) from local populations over 84% of loci are polymorphic. We also tested the applicability of the array to other species in the genus and found that the number of polymorphic loci decreases rapidly with genetic distance, with the largest numbers detected in other species in section Tacamahaca. Finally, we provide evidence for the utility of the array to address evolutionary questions such as intraspecific studies of genetic differentiation, species assignment and the detection of natural hybrids.

  11. Development of a 63K SNP Array for Cotton and High-Density Mapping of Intraspecific and Interspecific Populations of Gossypium spp.

    PubMed

    Hulse-Kemp, Amanda M; Lemm, Jana; Plieske, Joerg; Ashrafi, Hamid; Buyyarapu, Ramesh; Fang, David D; Frelichowski, James; Giband, Marc; Hague, Steve; Hinze, Lori L; Kochan, Kelli J; Riggs, Penny K; Scheffler, Jodi A; Udall, Joshua A; Ulloa, Mauricio; Wang, Shirley S; Zhu, Qian-Hao; Bag, Sumit K; Bhardwaj, Archana; Burke, John J; Byers, Robert L; Claverie, Michel; Gore, Michael A; Harker, David B; Islam, Md S; Jenkins, Johnie N; Jones, Don C; Lacape, Jean-Marc; Llewellyn, Danny J; Percy, Richard G; Pepper, Alan E; Poland, Jesse A; Mohan Rai, Krishan; Sawant, Samir V; Singh, Sunil Kumar; Spriggs, Andrew; Taylor, Jen M; Wang, Fei; Yourstone, Scott M; Zheng, Xiuting; Lawley, Cindy T; Ganal, Martin W; Van Deynze, Allen; Wilson, Iain W; Stelly, David M

    2015-04-22

    High-throughput genotyping arrays provide a standardized resource for plant breeding communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal × Seemann, G. mustelinum Miers × Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community.

  12. Development of a 63K SNP Array for Cotton and High-Density Mapping of Intraspecific and Interspecific Populations of Gossypium spp.

    PubMed Central

    Hulse-Kemp, Amanda M.; Lemm, Jana; Plieske, Joerg; Ashrafi, Hamid; Buyyarapu, Ramesh; Fang, David D.; Frelichowski, James; Giband, Marc; Hague, Steve; Hinze, Lori L.; Kochan, Kelli J.; Riggs, Penny K.; Scheffler, Jodi A.; Udall, Joshua A.; Ulloa, Mauricio; Wang, Shirley S.; Zhu, Qian-Hao; Bag, Sumit K.; Bhardwaj, Archana; Burke, John J.; Byers, Robert L.; Claverie, Michel; Gore, Michael A.; Harker, David B.; Islam, Md S.; Jenkins, Johnie N.; Jones, Don C.; Lacape, Jean-Marc; Llewellyn, Danny J.; Percy, Richard G.; Pepper, Alan E.; Poland, Jesse A.; Mohan Rai, Krishan; Sawant, Samir V.; Singh, Sunil Kumar; Spriggs, Andrew; Taylor, Jen M.; Wang, Fei; Yourstone, Scott M.; Zheng, Xiuting; Lawley, Cindy T.; Ganal, Martin W.; Van Deynze, Allen; Wilson, Iain W.; Stelly, David M.

    2015-01-01

    High-throughput genotyping arrays provide a standardized resource for plant breeding communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal × Seemann, G. mustelinum Miers × Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community. PMID:25908569

  13. Development of a SNP array and its application to genetic mapping and diversity assessment in pepper (Capsicum spp.)

    PubMed Central

    Cheng, Jiaowen; Qin, Cheng; Tang, Xin; Zhou, Huangkai; Hu, Yafei; Zhao, Zicheng; Cui, Junjie; Li, Bo; Wu, Zhiming; Yu, Jiping; Hu, Kailin

    2016-01-01

    The development and application of single nucleotide polymorphisms (SNPs) is in its infancy for pepper. Here, a set of 15,000 SNPs were chosen from the resequencing data to develop an array for pepper with 12,720 loci being ultimately synthesized. Of these, 8,199 (~64.46%) SNPs were found to be scorable and covered ~81.18% of the whole genome. With this array, a high-density interspecific genetic map with 5,569 SNPs was constructed using 297 F2 individuals, and genetic diversity of a panel of 399 pepper elite/landrace lines was successfully characterized. Based on the genetic map, one major QTL, named Up12.1, was detected for the fruit orientation trait. A total of 65 protein-coding genes were predicted within this QTL region based on the current annotation of the Zunla-1 genome. In summary, the thousands of well-validated SNP markers, high-density genetic map and genetic diversity information will be useful for molecular genetics and innovative breeding in pepper. Furthermore, the mapping results lay foundation for isolating the genes underlying variation in fruit orientation of Capsicum. PMID:27623541

  14. Significance of genome-wide analysis of copy number alterations and UPD in myelodysplastic syndromes using combined CGH - SNP arrays.

    PubMed

    Ahmad, Ausaf; Iqbal, M Anwar

    2012-01-01

    Genetic information is an extremely valuable data source in characterizing the personal nature of cancer. Chromosome instability is a hallmark of most cancer cells. Chromosomal abnormalities are correlated with poor prognosis, disease classification, risk stratification, and treatment selection. Copy number alterations (CNAs) are an important molecular signature in cancer initiation, development, and progression. Recent application of whole-genome tools to characterize normal and cancer genomes provides the powerful molecular cytogenetic means to enumerate the multiple somatic, genetic and epigenetic alterations that occur in cancer. Combined array comparative genomic hybridization (aCGH) with single nucleotide polymorphism (SNP) array is a useful technique allowing detection of CNAs and loss of heterozygosity (LOH) or uni-parental disomy (UPD) together in a single experiment. It also provides allelic information on deletions, duplications, and amplifications. UPD can result in an abnormal phenotype when the chromosomes involved are imprinted. Myelodysplastic syndromes (MDS) are the most common clonal stem cell hematologic malignancy characterized by ineffective hematopoiesis, which leads to rapid progression into acute myeloid leukemia. UPD that occurs without concurrent changes in the gene copy number is a common chromosomal defect in hematologic malignancies, especially in MDS. Approximately 40-50% of MDS patients do not have karyotypic abnormalities that are detectable using classical metaphase cytogenetic techniques (MC) because of inherent limitations of MC, low resolution and the requirement of having dividing cells. In this review, we highlight advances in the clinical application of microarray technology in MDS and discuss the clinical potential of microarray.

  15. Development of a dense SNP-based linkage map of an apple rootstock progeny using the Malus Infinium whole genome genotyping array

    PubMed Central

    2012-01-01

    Background A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Results Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the ‘Golden Delicious’ genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the ‘Golden Delicious’ pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. Conclusions We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and

  16. Identification of the mechanism underlying a human chimera by SNP array analysis.

    PubMed

    Shin, So Youn; Yoo, Han-Wook; Lee, Beom Hee; Kim, Kun Suk; Seo, Eul-Ju

    2012-09-01

    Human chimerism resulting from the fusion of two different zygotes is a rare phenomenon. Two mechanisms of chimerism have been hypothesized: dispermic fertilization of an oocyte and its second polar body and dispermic fertilization of two identical gametes from parthenogenetic activation, and these can be identified and discriminated using DNA polymorphism. In the present study we describe a patient with chimerism presenting as a true hermaphrodite and applied single nucleotide polymorphism array analysis to demonstrate dispermic fertilization of two identical gametes from parthenogenetic activation as the underlying mechanism at the whole chromosome level. We suggest that application of genotyping array analysis to the diagnostic process in patients with disorders of sex development will help identify more human chimera patients and increase our understanding of the underlying mechanisms.

  17. Regions of homozygosity identified by oligonucleotide SNP arrays: evaluating the incidence and clinical utility.

    PubMed

    Wang, Jia-Chi; Ross, Leslie; Mahon, Loretta W; Owen, Renius; Hemmat, Morteza; Wang, Boris T; El Naggar, Mohammed; Kopita, Kimberly A; Randolph, Linda M; Chase, John M; Matas Aguilera, Maria J; Siles, Juan López; Church, Joseph A; Hauser, Natalie; Shen, Joseph J; Jones, Marilyn C; Wierenga, Klaas J; Jiang, Zhijie; Haddadin, Mary; Boyar, Fatih Z; Anguiano, Arturo; Strom, Charles M; Sahoo, Trilochan

    2015-05-01

    Copy neutral segments with allelic homozygosity, also known as regions of homozygosity (ROHs), are frequently identified in cases interrogated by oligonucleotide single-nucleotide polymorphism (oligo-SNP) microarrays. Presence of ROHs may be because of parental relatedness, chromosomal recombination or rearrangements and provides important clues regarding ancestral homozygosity, consanguinity or uniparental disomy. In this study of 14 574 consecutive cases, 832 (6%) were found to harbor one or more ROHs over 10 Mb, of which 651 cases (78%) had multiple ROHs, likely because of identity by descent (IBD), and 181 cases (22%) with ROHs involving a single chromosome. Parental relatedness was predicted to be first degree or closer in 5%, second in 9% and third in 19%. Of the 181 cases, 19 had ROHs for a whole chromosome revealing uniparental isodisomy (isoUPD). In all, 25 cases had significant ROHs involving a single chromosome; 5 cases were molecularly confirmed to have a mixed iso- and heteroUPD15 and 1 case each with segmental UPD9pat and segmental UPD22mat; 17 cases were suspected to have a mixed iso- and heteroUPD including 2 cases with small supernumerary marker and 2 cases with mosaic trisomy. For chromosome 15, 12 (92%) of 13 molecularly studied cases had either Prader-Willi or Angelman syndrome. Autosomal recessive disorders were confirmed in seven of nine cases from eight families because of the finding of suspected gene within a ROH. This study demonstrates that ROHs are much more frequent than previously recognized and often reflect parental relatedness, ascertain autosomal recessive diseases or unravel UPD in many cases.

  18. Utility of SNP arrays in detecting, quantifying, and determining meiotic origin of tetrasomy 12p in blood from individuals with Pallister-Killian syndrome.

    PubMed

    Conlin, Laura K; Kaur, Maninder; Izumi, Kosuke; Campbell, Lindsey; Wilkens, Alisha; Clark, Dinah; Deardorff, Matthew A; Zackai, Elaine H; Pallister, Phillip; Hakonarson, Hakon; Spinner, Nancy B; Krantz, Ian D

    2012-12-01

    Identification of the isochromosome 12p (i(12p)) associated with Pallister-Killian syndrome is complicated by the low frequency of this supernumerary chromosome in PHA stimulated peripheral blood lymphocytes, and frequently requires cytogenetic analysis of fibroblast cells. Recently, it has been shown that array CGH techniques are able to detect tetrasomy 12p in peripheral blood, even when not identified by traditional cytogenetic techniques. We studied 15 patients with a previous cytogenetic and clinical diagnosis of Pallister-Killian syndrome using genome-wide SNP arrays to investigate the ability of this platform to identify the i(12p) in blood and tissue. Array analysis verified tetrasomy 12p in all samples from fibroblasts, but was only able to detect it in 46% of blood samples. The genotyping information available from the SNP arrays allowed for the detection of as low as 5% mosaicism, as well as suggesting a Meiosis II origin for the isochromosome in the majority of patients. Analysis of the percentage of abnormal cells with patient age at time of study suggests that the frequency of the i(12p) decreased with age in blood, but not in fibroblasts. These highlight the power of SNP arrays in detecting and characterizing the isochromosome 12p in Pallister-Killian syndrome as well as underscoring the important utility of traditional cytogenetic techniques.

  19. Design and coverage of high throughput genotyping arrays optimized for individuals of East Asian, African American, and Latino race/ethnicity using imputation and a novel hybrid SNP selection algorithm.

    PubMed

    Hoffmann, Thomas J; Zhan, Yiping; Kvale, Mark N; Hesselson, Stephanie E; Gollub, Jeremy; Iribarren, Carlos; Lu, Yontao; Mei, Gangwu; Purdy, Matthew M; Quesenberry, Charles; Rowell, Sarah; Shapero, Michael H; Smethurst, David; Somkin, Carol P; Van den Eeden, Stephen K; Walter, Larry; Webster, Teresa; Whitmer, Rachel A; Finn, Andrea; Schaefer, Catherine; Kwok, Pui-Yan; Risch, Neil

    2011-12-01

    Four custom Axiom genotyping arrays were designed for a genome-wide association (GWA) study of 100,000 participants from the Kaiser Permanente Research Program on Genes, Environment and Health. The array optimized for individuals of European race/ethnicity was previously described. Here we detail the development of three additional microarrays optimized for individuals of East Asian, African American, and Latino race/ethnicity. For these arrays, we decreased redundancy of high-performing SNPs to increase SNP capacity. The East Asian array was designed using greedy pairwise SNP selection. However, removing SNPs from the target set based on imputation coverage is more efficient than pairwise tagging. Therefore, we developed a novel hybrid SNP selection method for the African American and Latino arrays utilizing rounds of greedy pairwise SNP selection, followed by removal from the target set of SNPs covered by imputation. The arrays provide excellent genome-wide coverage and are valuable additions for large-scale GWA studies.

  20. High-density SNP-based genetic maps for the parents of an outcrossed and a selfed tetraploid garden rose cross, inferred from admixed progeny using the 68k rose SNP array

    PubMed Central

    Vukosavljev, Mirjana; Arens, Paul; Voorrips, Roeland E; van ‘t Westende, Wendy PC; Esselink, GD; Bourke, Peter M; Cox, Peter; van de Weg, W Eric; Visser, Richard GF; Maliepaard, Chris; Smulders, Marinus JM

    2016-01-01

    Dense genetic maps create a base for QTL analysis of important traits and future implementation of marker-assisted breeding. In tetraploid rose, the existing linkage maps include <300 markers to cover 28 linkage groups (4 homologous sets of 7 chromosomes). Here we used the 68k WagRhSNP Axiom single-nucleotide polymorphism (SNP) array for rose, in combination with SNP dosage calling at the tetraploid level, to genotype offspring from the garden rose cultivar ‘Red New Dawn’. The offspring proved to be not from a single bi-parental cross. In rose breeding, crosses with unintended parents occur regularly. We developed a strategy to separate progeny into putative populations, even while one of the parents was unknown, using principle component analysis on pairwise genetic distances based on sets of selected SNP markers that were homozygous, and therefore uninformative for one parent. One of the inferred populations was consistent with self-fertilization of ‘Red New Dawn’. Subsequently, linkage maps were generated for a bi-parental and a self-pollinated population with ‘Red New Dawn’ as the common maternal parent. The densest map, for the selfed parent, had 1929 SNP markers on 25 linkage groups, covering 1765.5 cM at an average marker distance of 0.9 cM. Synteny with the strawberry (Fragaria vesca) genome was extensive. Rose ICM1 corresponded to F. vesca pseudochromosome 7 (Fv7), ICM4 to Fv4, ICM5 to Fv3, ICM6 to Fv2 and ICM7 to Fv5. Rose ICM2 corresponded to parts of F. vesca pseudochromosomes 1 and 6, whereas ICM3 is syntenic to the remainder of Fv6. PMID:27818777

  1. The Psychological Challenges of Replacing Conventional Karyotyping with Genomic SNP Array Analysis in Prenatal Testing.

    PubMed

    Riedijk, Sam; Diderich, Karin E M; van der Steen, Sanne L; Govaerts, Lutgarde C P; Joosten, Marieke; Knapen, Maarten F C M; de Vries, Femke A T; van Opstal, Diane; Tibben, Aad; Galjaard, Robert-Jan H

    2014-07-03

    Pregnant couples tend to prefer a maximum of information about the health of their fetus. Therefore, we implemented whole genome microarray instead of conventional karyotyping (CK) for all indications for prenatal diagnosis (PND). The array detects more clinically relevant anomalies, including early onset disorders, not related to the indication and more genetic anomalies of yet unquantifiable risk, so-called susceptibility loci (SL) for mainly neurodevelopmental disorders. This manuscript highlights the psychological challenges in prenatal genetic counselling when using the array and provides counselling suggestions. First, we suggest that pre-test decision counselling should emphasize deliberation about what pregnant couples wish to learn about the future health of their fetus more than information about possible outcomes. Second, pregnant couples need support in dealing with SL. Therefore, in order to consider the SL in a proportionate perspective, the presence of phenotypes associated with SL in the family, the incidence of a particular SL in control populations and in postnatally ascertained patients needs highlighting during post-test genetic counselling. Finally, the decision that couples need to make about the course of their pregnancy is more complicated when the expected phenotype is variable and not quantifiable. Therefore, during post-test psychological counseling, couples should concretize the options of continuing and ending their pregnancy; all underlying feelings and thoughts should be made explicit, as well as the couple's resources, in order to attain adequate decision-making. As such, pre- and post-test counselling aids pregnant couples in handling the uncertainties that may accompany offering a broader scope of genetic PND using the array.

  2. The Psychological Challenges of Replacing Conventional Karyotyping with Genomic SNP Array Analysis in Prenatal Testing

    PubMed Central

    Riedijk, Sam; Diderich, Karin E. M.; van der Steen, Sanne L.; Govaerts, Lutgarde C. P.; Joosten, Marieke; Knapen, Maarten F. C. M.; de Vries, Femke A. T.; van Opstal, Diane; Tibben, Aad; Galjaard, Robert-Jan H.

    2014-01-01

    Pregnant couples tend to prefer a maximum of information about the health of their fetus. Therefore, we implemented whole genome microarray instead of conventional karyotyping (CK) for all indications for prenatal diagnosis (PND). The array detects more clinically relevant anomalies, including early onset disorders, not related to the indication and more genetic anomalies of yet unquantifiable risk, so-called susceptibility loci (SL) for mainly neurodevelopmental disorders. This manuscript highlights the psychological challenges in prenatal genetic counselling when using the array and provides counselling suggestions. First, we suggest that pre-test decision counselling should emphasize deliberation about what pregnant couples wish to learn about the future health of their fetus more than information about possible outcomes. Second, pregnant couples need support in dealing with SL. Therefore, in order to consider the SL in a proportionate perspective, the presence of phenotypes associated with SL in the family, the incidence of a particular SL in control populations and in postnatally ascertained patients needs highlighting during post-test genetic counselling. Finally, the decision that couples need to make about the course of their pregnancy is more complicated when the expected phenotype is variable and not quantifiable. Therefore, during post-test psychological counseling, couples should concretize the options of continuing and ending their pregnancy; all underlying feelings and thoughts should be made explicit, as well as the couple’s resources, in order to attain adequate decision-making. As such, pre- and post-test counselling aids pregnant couples in handling the uncertainties that may accompany offering a broader scope of genetic PND using the array. PMID:26237473

  3. Development and validation of a 20K single nucleotide polymorphism (SNP) whole genome genotyping array for apple (Malus × domestica Borkh).

    PubMed

    Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela

    2014-01-01

    High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs.

  4. Development and Validation of a 20K Single Nucleotide Polymorphism (SNP) Whole Genome Genotyping Array for Apple (Malus × domestica Borkh)

    PubMed Central

    Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela

    2014-01-01

    High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs. PMID:25303088

  5. Homozygosity mapping with SNP arrays identifies TRIM32, an E3 ubiquitin ligase, as a Bardet–Biedl syndrome gene (BBS11)

    PubMed Central

    Chiang, Annie P.; Beck, John S.; Yen, Hsan-Jan; Tayeh, Marwan K.; Scheetz, Todd E.; Swiderski, Ruth E.; Nishimura, Darryl Y.; Braun, Terry A.; Kim, Kwang-Youn A.; Huang, Jian; Elbedour, Khalil; Carmi, Rivka; Slusarski, Diane C.; Casavant, Thomas L.; Stone, Edwin M.; Sheffield, Val C.

    2006-01-01

    The identification of mutations in genes that cause human diseases has largely been accomplished through the use of positional cloning, which relies on linkage mapping. In studies of rare diseases, the resolution of linkage mapping is limited by the number of available meioses and informative marker density. One recent advance is the development of high-density SNP microarrays for genotyping. The SNP arrays overcome low marker informativity by using a large number of markers to achieve greater coverage at finer resolution. We used SNP microarray genotyping for homozygosity mapping in a small consanguineous Israeli Bedouin family with autosomal recessive Bardet–Biedl syndrome (BBS; obesity, pigmentary retinopathy, polydactyly, hypogonadism, renal and cardiac abnormalities, and cognitive impairment) in which previous linkage studies using short tandem repeat polymorphisms failed to identify a disease locus. SNP genotyping revealed a homozygous candidate region. Mutation analysis in the region of homozygosity identified a conserved homozygous missense mutation in the TRIM32 gene, a gene coding for an E3 ubiquitin ligase. Functional analysis of this gene in zebrafish and expression correlation analyses among other BBS genes in an expression quantitative trait loci data set demonstrate that TRIM32 is a BBS gene. This study shows the value of high-density SNP genotyping for homozygosity mapping and the use of expression correlation data for evaluation of candidate genes and identifies the proteasome degradation pathway as a pathway involved in BBS. PMID:16606853

  6. Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.).

    PubMed

    Koning-Boucoiran, Carole F S; Esselink, G Danny; Vukosavljev, Mirjana; van 't Westende, Wendy P C; Gitonga, Virginia W; Krens, Frans A; Voorrips, Roeland E; van de Weg, W Eric; Schulz, Dietmar; Debener, Thomas; Maliepaard, Chris; Arens, Paul; Smulders, Marinus J M

    2015-01-01

    In order to develop a versatile and large SNP array for rose, we set out to mine ESTs from diverse sets of rose germplasm. For this RNA-Seq libraries containing about 700 million reads were generated from tetraploid cut and garden roses using Illumina paired-end sequencing, and from diploid Rosa multiflora using 454 sequencing. Separate de novo assemblies were performed in order to identify single nucleotide polymorphisms (SNPs) within and between rose varieties. SNPs among tetraploid roses were selected for constructing a genotyping array that can be employed for genetic mapping and marker-trait association discovery in breeding programs based on tetraploid germplasm, both from cut roses and from garden roses. In total 68,893 SNPs were included on the WagRhSNP Axiom array. Next, an orthology-guided assembly was performed for the construction of a non-redundant rose transcriptome database. A total of 21,740 transcripts had significant hits with orthologous genes in the strawberry (Fragaria vesca L.) genome. Of these 13,390 appeared to contain the full-length coding regions. This newly established transcriptome resource adds considerably to the currently available sequence resources for the Rosaceae family in general and the genus Rosa in particular.

  7. A 34K SNP genotyping array for Populus trichocarpa: design, application to the study of natural populations and transferability to other Populus species

    SciTech Connect

    Geraldes, Armando; Hannemann, Jan; Grassa, Chris; Farzaneh, Nima; Porth, Ilga; McKown, Athena; Skyba, Oleksandr; Li, Eryang; Mike, Fujita; Friedmann, Michael; Wasteneys, Geoffrey; Guy, Robert; El-Kassaby, Yousry; Mansfield, Shawn; Cronk, Quentin; Ehlting, Juergen; Douglas, Carl; DiFazio, Stephen P; Slavov, Gancho; Ranjan, Priya; Muchero, Wellington; Gunter, Lee E; Wymore, Ann; Tuskan, Gerald A; Martin, Joel; Schackwitz, Wendy; Pennacchio, Christa; Rokhsar, Daniel

    2013-01-01

    Genetic mapping of quantitative traits requires genotypic data for large numbers of markers in many individuals. Despite the declining costs of genotyping by sequencing, for most studies, the use of large SNP genotyping arrays still offers the most cost-effective solution for large-scale targeted genotyping. Here we report on the design and performance of a SNP genotyping array for Populus trichocarpa (black cottonwood). This genotyping array was designed with SNPs pre-ascertained in 34 wild accessions covering most of the species range. Due to the rapid decay of linkage disequilibrium in P. trichocarpa we adopted a candidate gene approach to the array design that resulted in the selection of 34,131 SNPs, the majority of which are located in, or within 2 kb, of 3,543 candidate genes. A subset of the SNPs (539) was selected based on patterns of variation among the SNP discovery accessions. We show that more than 95% of the loci produce high quality genotypes and that the genotyping error rate for these is likely below 2%, indicating that high-quality data are generated with this array. We demonstrate that even among small numbers of samples (n=10) from local populations over 84% of loci are polymorphic. We also tested the applicability of the array to other species in the genus and found that due to ascertainment bias the number of polymorphic loci decreases rapidly with genetic distance, with the largest numbers detected in other species in section Tacamahaca (P. balsamifera and P. angustifolia). Finally, we provide evidence for the utility of the array for intraspecific studies of genetic differentiation and for species assignment and the detection of natural hybrids.

  8. High-density SNP genotyping array for hexaploid wheat and its secondary and tertiary gene pool.

    PubMed

    Winfield, Mark O; Allen, Alexandra M; Burridge, Amanda J; Barker, Gary L A; Benbow, Harriet R; Wilkinson, Paul A; Coghill, Jane; Waterfall, Christy; Davassi, Alessandro; Scopes, Geoff; Pirani, Ali; Webster, Teresa; Brew, Fiona; Bloor, Claire; King, Julie; West, Claire; Griffiths, Simon; King, Ian; Bentley, Alison R; Edwards, Keith J

    2016-05-01

    In wheat, a lack of genetic diversity between breeding lines has been recognized as a significant block to future yield increases. Species belonging to bread wheat's secondary and tertiary gene pools harbour a much greater level of genetic variability, and are an important source of genes to broaden its genetic base. Introgression of novel genes from progenitors and related species has been widely employed to improve the agronomic characteristics of hexaploid wheat, but this approach has been hampered by a lack of markers that can be used to track introduced chromosome segments. Here, we describe the identification of a large number of single nucleotide polymorphisms that can be used to genotype hexaploid wheat and to identify and track introgressions from a variety of sources. We have validated these markers using an ultra-high-density Axiom(®) genotyping array to characterize a range of diploid, tetraploid and hexaploid wheat accessions and wheat relatives. To facilitate the use of these, both the markers and the associated sequence and genotype information have been made available through an interactive web site.

  9. Whole-exome SNP array identifies 15 new susceptibility loci for psoriasis.

    PubMed

    Zuo, Xianbo; Sun, Liangdan; Yin, Xianyong; Gao, Jinping; Sheng, Yujun; Xu, Jinhua; Zhang, Jianzhong; He, Chundi; Qiu, Ying; Wen, Guangdong; Tian, Hongqing; Zheng, Xiaodong; Liu, Shengxiu; Wang, Wenjun; Li, Weiran; Cheng, Yuyan; Liu, Longdan; Chang, Yan; Wang, Zaixing; Li, Zenggang; Li, Longnian; Wu, Jianping; Fang, Ling; Shen, Changbing; Zhou, Fusheng; Liang, Bo; Chen, Gang; Li, Hui; Cui, Yong; Xu, Aie; Yang, Xueqin; Hao, Fei; Xu, Limin; Fan, Xing; Li, Yuzhen; Wu, Rina; Wang, Xiuli; Liu, Xiaoming; Zheng, Min; Song, Shunpeng; Ji, Bihua; Fang, Hong; Yu, Jianbin; Sun, Yongxin; Hui, Yan; Zhang, Furen; Yang, Rongya; Yang, Sen; Zhang, Xuejun

    2015-04-09

    Genome-wide association studies (GWASs) have reproducibly associated ∼40 susceptibility loci with psoriasis. However, the missing heritability is evident and the contributions of coding variants have not yet been systematically evaluated. Here, we present a large-scale whole-exome array analysis for psoriasis consisting of 42,760 individuals. We discover 16 SNPs within 15 new genes/loci associated with psoriasis, including C1orf141, ZNF683, TMC6, AIM2, IL1RL1, CASR, SON, ZFYVE16, MTHFR, CCDC129, ZNF143, AP5B1, SYNE2, IFNGR2 and 3q26.2-q27 (P<5.00 × 10(-08)). In addition, we also replicate four known susceptibility loci TNIP1, NFKBIA, IL12B and LCE3D-LCE3E. These susceptibility variants identified in the current study collectively account for 1.9% of the psoriasis heritability. The variant within AIM2 is predicted to impact protein structure. Our findings increase the number of genetic risk factors for psoriasis and highlight new and plausible biological pathways in psoriasis.

  10. Fast detection of de novo copy number variants from SNP arrays for case-parent trios

    PubMed Central

    2012-01-01

    Background In studies of case-parent trios, we define copy number variants (CNVs) in the offspring that differ from the parental copy numbers as de novo and of interest for their potential functional role in disease. Among the leading array-based methods for discovery of de novo CNVs in case-parent trios is the joint hidden Markov model (HMM) implemented in the PennCNV software. However, the computational demands of the joint HMM are substantial and the extent to which false positive identifications occur in case-parent trios has not been well described. We evaluate these issues in a study of oral cleft case-parent trios. Results Our analysis of the oral cleft trios reveals that genomic waves represent a substantial source of false positive identifications in the joint HMM, despite a wave-correction implementation in PennCNV. In addition, the noise of low-level summaries of relative copy number (log R ratios) is strongly associated with batch and correlated with the frequency of de novo CNV calls. Exploiting the trio design, we propose a univariate statistic for relative copy number referred to as the minimum distance that can reduce technical variation from probe effects and genomic waves. We use circular binary segmentation to segment the minimum distance and maximum a posteriori estimation to infer de novo CNVs from the segmented genome. Compared to PennCNV on simulated data, MinimumDistance identifies fewer false positives on average and is comparable to PennCNV with respect to false negatives. Genomic waves contribute to discordance of PennCNV and MinimumDistance for high coverage de novo calls, while highly concordant calls on chromosome 22 were validated by quantitative PCR. Computationally, MinimumDistance provides a nearly 8-fold increase in speed relative to the joint HMM in a study of oral cleft trios. Conclusions Our results indicate that batch effects and genomic waves are important considerations for case-parent studies of de novo CNV, and that the

  11. A survey and new measurements of ice vapor pressure at temperatures between 170 and 250K

    NASA Technical Reports Server (NTRS)

    Marti, James; Mauersberger, Konrad

    1993-01-01

    New measurements of ice vapor pressures at temperatures between 170 and 250 K are presented and published vapor pressure data are summarized. An empirical vapor pressure equation was derived and allows prediction of vapor pressures between 170 k and the triple point of water with an accuracy of approximately 2 percent. Predictions obtained agree, within experimental uncertainty, with the most reliable equation derived from thermodynamic principles.

  12. The infrared optical constants of sulfuric acid at 250 K. [spectral reflectance measurement of aqueous solutions

    NASA Technical Reports Server (NTRS)

    Pinkley, L. W.; Williams, D.

    1976-01-01

    Results are presented for measurements of the IR spectral reflectance at near-normal incidence of aqueous solutions of sulfuric acid with acid concentrations of 75% and 95.6% by weight. Kramers-Kronig analyses of the reflectance data are employed to obtain values of the optical constants n(nu) and k(nu) in the spectral range from 400 to 6000 cm to the -1 power. The optical constants of these solutions at 250 K and 300 K are compared. It is found that in spectral regions remote from strong absorption bands, the values of the n(nu) indices obtained at 250 K agree with the values given by Lorentz-Lorenz correction of the same indices at 300 K. All absorption bands observed at 300 K are found to be present at 250 K with slight shifts in frequency and with significant differences in the k(nu) indices at the band maxima. Based on these results, it is concluded that the clouds of Venus probably consist of droplets of aqueous solutions of sulfuric acid with acid concentrations of about 75% by weight.

  13. Development of a 690 K SNP array in catfish and its application for genetic mapping and validation of the reference genome sequence

    PubMed Central

    Zeng, Qifan; Fu, Qiang; Li, Yun; Waldbieser, Geoff; Bosworth, Brian; Liu, Shikai; Yang, Yujia; Bao, Lisui; Yuan, Zihao; Li, Ning; Liu, Zhanjiang

    2017-01-01

    Single nucleotide polymorphisms (SNPs) are capable of providing the highest level of genome coverage for genomic and genetic analysis because of their abundance and relatively even distribution in the genome. Such a capacity, however, cannot be achieved without an efficient genotyping platform such as SNP arrays. In this work, we developed a high-density SNP array with 690,662 unique SNPs (herein 690 K array) that were relatively evenly distributed across the entire genome, and covered 98.6% of the reference genome sequence. Here we also report linkage mapping using the 690 K array, which allowed mapping of over 250,000 SNPs on the linkage map, the highest marker density among all the constructed linkage maps. These markers were mapped to 29 linkage groups (LGs) with 30,591 unique marker positions. This linkage map anchored 1,602 scaffolds of the reference genome sequence to LGs, accounting for over 97% of the total genome assembly. A total of 1,007 previously unmapped scaffolds were placed to LGs, allowing validation and in few instances correction of the reference genome sequence assembly. This linkage map should serve as a valuable resource for various genetic and genomic analyses, especially for GWAS and QTL mapping for genes associated with economically important traits. PMID:28079141

  14. Development of a 690 K SNP array in catfish and its application for genetic mapping and validation of the reference genome sequence.

    PubMed

    Zeng, Qifan; Fu, Qiang; Li, Yun; Waldbieser, Geoff; Bosworth, Brian; Liu, Shikai; Yang, Yujia; Bao, Lisui; Yuan, Zihao; Li, Ning; Liu, Zhanjiang

    2017-01-12

    Single nucleotide polymorphisms (SNPs) are capable of providing the highest level of genome coverage for genomic and genetic analysis because of their abundance and relatively even distribution in the genome. Such a capacity, however, cannot be achieved without an efficient genotyping platform such as SNP arrays. In this work, we developed a high-density SNP array with 690,662 unique SNPs (herein 690 K array) that were relatively evenly distributed across the entire genome, and covered 98.6% of the reference genome sequence. Here we also report linkage mapping using the 690 K array, which allowed mapping of over 250,000 SNPs on the linkage map, the highest marker density among all the constructed linkage maps. These markers were mapped to 29 linkage groups (LGs) with 30,591 unique marker positions. This linkage map anchored 1,602 scaffolds of the reference genome sequence to LGs, accounting for over 97% of the total genome assembly. A total of 1,007 previously unmapped scaffolds were placed to LGs, allowing validation and in few instances correction of the reference genome sequence assembly. This linkage map should serve as a valuable resource for various genetic and genomic analyses, especially for GWAS and QTL mapping for genes associated with economically important traits.

  15. Three gangliogliomas: results of GTG-banding, SKY, genome-wide high resolution SNP-array, gene expression and review of the literature.

    PubMed

    Xu, Li-Xin; Holland, Heidrun; Kirsten, Holger; Ahnert, Peter; Krupp, Wolfgang; Bauer, Manfred; Schober, Ralf; Mueller, Wolf; Fritzsch, Dominik; Meixensberger, Jürgen; Koschny, Ronald

    2015-04-01

    According to the World Health Organization gangliogliomas are classified as well-differentiated and slowly growing neuroepithelial tumors, composed of neoplastic mature ganglion and glial cells. It is the most frequent tumor entity observed in patients with long-term epilepsy. Comprehensive cytogenetic and molecular cytogenetic data including high-resolution genomic profiling (single nucleotide polymorphism (SNP)-array) of gangliogliomas are scarce but necessary for a better oncological understanding of this tumor entity. For a detailed characterization at the single cell and cell population levels, we analyzed genomic alterations of three gangliogliomas using trypsin-Giemsa banding (GTG-banding) and by spectral karyotyping (SKY) in combination with SNP-array and gene expression array experiments. By GTG and SKY, we could confirm frequently detected chromosomal aberrations (losses within chromosomes 10, 13 and 22; gains within chromosomes 5, 7, 8 and 12), and identify so far unknown genetic aberrations like the unbalanced non-reciprocal translocation t(1;18)(q21;q21). Interestingly, we report on the second so far detected ganglioglioma with ring chromosome 1. Analyses of SNP-array data from two of the tumors and respective germline DNA (peripheral blood) identified few small gains and losses and a number of copy-neutral regions with loss of heterozygosity (LOH) in germline and in tumor tissue. In comparison to germline DNA, tumor tissues did not show substantial regions with significant loss or gain or with newly developed LOH. Gene expression analyses of tumor-specific genes revealed similarities in the profile of the analyzed samples regarding different relevant pathways. Taken together, we describe overlapping but also distinct and novel genetic aberrations of three gangliogliomas.

  16. A High Density SNP Array for the Domestic Horse and Extant Perissodactyla: Utility for Association Mapping, Genetic Diversity, and Phylogeny Studies

    PubMed Central

    McCue, Molly E.; Bannasch, Danika L.; Petersen, Jessica L.; Gurr, Jessica; Bailey, Ernie; Binns, Matthew M.; Distl, Ottmar; Guérin, Gérard; Hasegawa, Telhisa; Hill, Emmeline W.; Leeb, Tosso; Lindgren, Gabriella; Penedo, M. Cecilia T.; Røed, Knut H.; Ryder, Oliver A.; Swinburne, June E.; Tozaki, Teruaki; Valberg, Stephanie J.; Vaudin, Mark; Lindblad-Toh, Kerstin

    2012-01-01

    An equine SNP genotyping array was developed and evaluated on a panel of samples representing 14 domestic horse breeds and 18 evolutionarily related species. More than 54,000 polymorphic SNPs provided an average inter-SNP spacing of ∼43 kb. The mean minor allele frequency across domestic horse breeds was 0.23, and the number of polymorphic SNPs within breeds ranged from 43,287 to 52,085. Genome-wide linkage disequilibrium (LD) in most breeds declined rapidly over the first 50–100 kb and reached background levels within 1–2 Mb. The extent of LD and the level of inbreeding were highest in the Thoroughbred and lowest in the Mongolian and Quarter Horse. Multidimensional scaling (MDS) analyses demonstrated the tight grouping of individuals within most breeds, close proximity of related breeds, and less tight grouping in admixed breeds. The close relationship between the Przewalski's Horse and the domestic horse was demonstrated by pair-wise genetic distance and MDS. Genotyping of other Perissodactyla (zebras, asses, tapirs, and rhinoceros) was variably successful, with call rates and the number of polymorphic loci varying across taxa. Parsimony analysis placed the modern horse as sister taxa to Equus przewalski. The utility of the SNP array in genome-wide association was confirmed by mapping the known recessive chestnut coat color locus (MC1R) and defining a conserved haplotype of ∼750 kb across all breeds. These results demonstrate the high quality of this SNP genotyping resource, its usefulness in diverse genome analyses of the horse, and potential use in related species. PMID:22253606

  17. SNP-array based whole genome homozygosity mapping: a quick and powerful tool to achieve an accurate diagnosis in LGMD2 patients.

    PubMed

    Papić, Lea; Fischer, Dirk; Trajanoski, Slave; Höftberger, Romana; Fischer, Carina; Ströbel, Thomas; Schmidt, Wolfgang M; Bittner, Reginald E; Schabhüttl, Maria; Gruber, Karin; Pieber, Thomas R; Janecke, Andreas R; Auer-Grumbach, Michaela

    2011-01-01

    A large number of novel disease genes have been identified by homozygosity mapping and the positional candidate approach. In this study we used single nucleotide polymorphism (SNP) array-based, whole genome homozygosity mapping as the first step to a molecular diagnosis in the highly heterogeneous muscle disease, limb girdle muscular dystrophy (LGMD). In a consanguineous family, both affected siblings showed homozygous blocks on chromosome 15 corresponding to the LGMD2A locus. Direct sequencing of CAPN3, encoding calpain-3, identified a homozygous deletion c.483delG (p.Ile162SerfsX17). In a sporadic LGMD patient complete absence of caveolin-3 on Western blot was observed. However, a mutation in CAV3 could not be detected. Homozygosity mapping revealed a large homozygous block at the LGMD2I locus, and direct sequencing of FKRP encoding fukutin-related-protein detected the common homozygous c.826 C>A (p.Leu276Ile) mutation. Subsequent re-examination of this patient's muscle biopsy showed aberrant α-dystroglycan glycosylation. In summary, we show that whole-genome homozygosity mapping using low cost SNP arrays provides a fast and non-invasive method to identify disease-causing mutations in sporadic patients or sibs from consanguineous families in LGMD2. Furthermore, this is the first study describing that in addition to PTRF, encoding polymerase I and transcript release factor, FKRP mutations may cause secondary caveolin-3 deficiency.

  18. Identification of genomic aberrations associated with disease transformation by means of high-resolution SNP array analysis in patients with myeloproliferative neoplasm.

    PubMed

    Rumi, Elisa; Harutyunyan, Ashot; Elena, Chiara; Pietra, Daniela; Klampfl, Thorsten; Bagienski, Klaudia; Berg, Tiina; Casetti, Ilaria; Pascutto, Cristiana; Passamonti, Francesco; Kralovics, Robert; Cazzola, Mario

    2011-12-01

    Myeloproliferative neoplasms (MPN) include polycythemia vera (PV), essential thrombocythemia (ET), and primary myelofibrosis (PMF). These disorders may undergo phenotypic shifts, and may specifically evolve into secondary myelofibrosis (MF) or acute myeloid leukemia (AML). We studied genomic changes associated with these transformations in 29 patients who had serial samples collected in different phases of disease. Genomic DNA from granulocytes, i.e., the myeloproliferative genome, was processed and hybridized to genome-wide human SNP 6.0 arrays. Most patients in chronic phase had chromosomal regions with uniparental disomy (UPD) and/or copy number changes. Disease progression to secondary MF or AML was associated with the acquisition of additional chromosomal aberrations in granulocytes (P = 0.002). A close relationship was observed between aberrations of chromosome 9p (UPD and/or gain) and progression from PV to post-PV MF (P = 0.002). The acquisition of one or more aberrations involving chromosome 5, 7, or 17p was specifically associated with progression to AML (OR 5.9, 95% CI 1.2-27.7, P = 0.006), and significantly affected overall survival (HR 18, 95% CI 1.9-164, P = 0.01). These observations indicate that disease progression from chronic-phase MPN to secondary MF or AML is associated with specific chromosomal aberrations that can be detected by means of high-resolution SNP array analysis of granulocyte DNA.

  19. Design and development of collector for C-band 250 kW CW Klystron

    NASA Astrophysics Data System (ADS)

    Baloda, Suman; Lamba, O. S.; Kaushik, Meenu; Richa; Bansal, Prachi; Kumud; Pradeep; Kant, D.; Joshi, L. M.

    2012-11-01

    The paper presents the design and development of collector for C-band 250 kW high power klystron. The design criteria for the collector assembly is selection of material, vacuum and high temperature compatibility, proper electron beam dispersion, minimum back scattering of electrons and thermal design for proper cooling at high power dissipation. All these aspects have been discussed for collector development in details. The collector has been designed in TRAK and then beam propagation has been analyzed in MAGIC 2D software. The thermal simulation has been done using ANSYS 11.0 (multi-physics). The outer surface of the collector has been grooved to facilitate its proper cooling. Design results are presented for water cooling with different flow rates and channel dimensions. OFHC copper material is chosen for collector which is suitable for vacuum and hydrogen brazing operations and good thermal properties for efficient cooling.

  20. Three-dimensional magnetic cloak working from d.c. to 250 kHz

    NASA Astrophysics Data System (ADS)

    Zhu, Jianfei; Jiang, Wei; Liu, Yichao; Yin, Ge; Yuan, Jun; He, Sailing; Ma, Yungui

    2015-11-01

    Invisible cloaking is one of the major outcomes of the metamaterial research, but the practical potential, in particular for high frequencies (for example, microwave to visible light), is fatally challenged by the complex material properties they usually demand. On the other hand, it will be advantageous and also technologically instrumental to design cloaking devices for applications at low frequencies where electromagnetic components are favourably uncoupled. In this work, we vastly develop the bilayer approach to create a three-dimensional magnetic cloak able to work in both static and dynamic fields. Under the quasi-static approximation, we demonstrate a perfect magnetic cloaking device with a large frequency band from 0 to 250 kHz. The practical potential of our device is experimentally verified by using a commercial metal detector, which may lead us to having a real cloaking application where the dynamic magnetic field can be manipulated in desired ways.

  1. The 250-kW CW klystron amplifier for planetary radar

    NASA Technical Reports Server (NTRS)

    Cormier, R.; Mizuhara, A.

    1992-01-01

    The design, construction, and performance testing is described of two Varian klystrons, model VKX-7864A, which replaced the aging and less efficient VA-949J klystrons in the X band planetary radar transmitter on the Goldstone, CA, 70 meter antenna. The project was carried out jointly by the JPL and Varian Assoc. Output power was increased from 200 to 250 kW continuous wave per klystron, and full dc beam power is dissipated in the collector (it was not possible to operate the VA-949J klystrons without RF drive because of limited collector dissipation capability). Replacements were made with a minimum of transmitter modifications. The planetary radar transmitter is now operating successfully with these two klystrons.

  2. 250 kW flywheel with HTS magnetic bearing for industrial use

    NASA Astrophysics Data System (ADS)

    Werfel, F. N.; Floegel-Delor, U.; Riedel, T.; Rothfeld, R.; Wippich, D.; Goebel, B.; Reiner, G.; Wehlau, N.

    2008-02-01

    A 250 kW / 5 kWh engineering prototype Flywheel Energy Storage System (FESS) was designed, fabricated and component tested by Adelwitz Technologiezentrum GmbH (ATZ) and L-3 Communications Magnet - Motor GmbH (MM). A heavy - load vertical 0.6 ton rotor is suspended totally magnetically by an HTS radial-passive bearing on the top together with a PM bearing at the bottom. Further features are the flywheel rotor body which is manufactured from carbon fibre reinforced plastics (CFRP) in a multi-rim version and combined with an integrated high-power motor/generator. A 35 W/77 K single- stage Gifford McMahon cryo-cooler is cooling the HTS bearing to a temperature of 45 - 60 K. Functionality and efficiency of the magnetic bearing configurations, rotor control concepts and motor / generator power electric system is considered and established. Bearing stiffness parameters, damping performance, and rotational friction are measured. Testing of further components under vacuum conditions confirmed that low bearing drag and wear- free operation can be attained. The motor-generator operates with a power in excess of 250 kW and an efficiency of > 92%, including the losses of the inverters. A redundant mechanical touchdown bearing system can be activated to restore the rotor position. The separately tested flywheel components are now in the assembling status expecting first machine tests in November 2007. After studying and measuring all FESS parameters in -house the dynamical storage device will be tested in a German E.ON power station under industrial conditions.

  3. High resolution SNP array genomic profiling of peripheral T cell lymphomas, not otherwise specified, identifies a subgroup with chromosomal aberrations affecting the REL locus.

    PubMed

    Hartmann, Sylvia; Gesk, Stefan; Scholtysik, René; Kreuz, Markus; Bug, Stefanie; Vater, Inga; Döring, Claudia; Cogliatti, Sergio; Parrens, Marie; Merlio, Jean-Philippe; Kwiecinska, Anna; Porwit, Anna; Piccaluga, Pier Paolo; Pileri, Stefano; Hoefler, Gerald; Küppers, Ralf; Siebert, Reiner; Hansmann, Martin-Leo

    2010-02-01

    Little is known about genomic aberrations in peripheral T cell lymphoma, not otherwise specified (PTCL NOS). We studied 47 PTCL NOS by 250k GeneChip single nucleotide polymorphism arrays and detected genomic imbalances in 22 of the cases. Recurrent gains and losses were identified, including gains of chromosome regions 1q32-43, 2p15-16, 7, 8q24, 11q14-25, 17q11-21 and 21q11-21 (> or = 5 cases each) as well as losses of chromosome regions 1p35-36, 5q33, 6p22, 6q16, 6q21-22, 8p21-23, 9p21, 10p11-12, 10q11-22, 10q25-26, 13q14, 15q24, 16q22, 16q24, 17p11, 17p13 and Xp22 (> or = 4 cases each). Genomic imbalances affected several regions containing members of nuclear factor-kappaB signalling and genes involved in cell cycle control. Gains of 2p15-16 were confirmed in each of three cases analysed by fluorescence in situ hybridization (FISH) and were associated with breakpoints at the REL locus in two of these cases. Three additional cases with gains of the REL locus were detected by FISH among 18 further PTCL NOS. Five of 27 PTCL NOS investigated showed nuclear expression of the REL protein by immunohistochemistry, partly associated with genomic gains of the REL locus. Therefore, in a subgroup of PTCL NOS gains/rearrangements of REL and expression of REL protein may be of pathogenetic relevance.

  4. Genomic Variation by Whole-Genome SNP Mapping Arrays Predicts Time-to-Event Outcome in Patients with Chronic Lymphocytic Leukemia

    PubMed Central

    Schweighofer, Carmen D.; Coombes, Kevin R.; Majewski, Tadeusz; Barron, Lynn L.; Lerner, Susan; Sargent, Rachel L.; O'Brien, Susan; Ferrajoli, Alessandra; Wierda, William G.; Czerniak, Bogdan A.; Medeiros, L. Jeffrey; Keating, Michael J.; Abruzzo, Lynne V.

    2013-01-01

    Genomic abnormalities, such as deletions in 11q22 or 17p13, are associated with poorer prognosis in patients with chronic lymphocytic leukemia (CLL). We hypothesized that unknown regions of copy number variation (CNV) affect clinical outcome and can be detected by array-based single-nucleotide polymorphism (SNP) genotyping. We compared SNP genotypes from 168 untreated patients with CLL with genotypes from 73 white HapMap controls. We identified 322 regions of recurrent CNV, 82 of which occurred significantly more often in CLL than in HapMap (CLL-specific CNV), including regions typically aberrant in CLL: deletions in 6q21, 11q22, 13q14, and 17p13 and trisomy 12. In univariate analyses, 35 of total and 11 of CLL-specific CNVs were associated with unfavorable time-to-event outcomes, including gains or losses in chromosomes 2p, 4p, 4q, 6p, 6q, 7q, 11p, 11q, and 17p. In multivariate analyses, six CNVs (ie, CLL-specific variations in 11p15.1-15.4 or 6q27) predicted time-to-treatment or overall survival independently of established markers of prognosis. Moreover, genotypic complexity (ie, the number of independent CNVs per patient) significantly predicted prognosis, with a median time-to-treatment of 64 months versus 23 months in patients with zero to one versus two or more CNVs, respectively (P = 3.3 × 10−8). In summary, a comparison of SNP genotypes from patients with CLL with HapMap controls allowed us to identify known and unknown recurrent CNVs and to determine regions and rates of CNV that predict poorer prognosis in patients with CLL. PMID:23273604

  5. Genetic differentiation of brackish water populations of cod Gadus morhua in the southern Baltic, inferred from genotyping using SNP-arrays.

    PubMed

    Poćwierz-Kotus, A; Kijewska, A; Petereit, C; Bernaś, R; Więcaszek, B; Arnyasi, M; Lien, S; Kent, M P; Wenne, R

    2015-02-01

    The Baltic is a semi-enclosed sea characterised by decreasing salinity in the eastern and northern direction with only the deeper parts of the southern Baltic suitable as spawning grounds for marine species like cod. Baltic cod exhibits various adaptations to brackish water conditions, yet the inflow of salty North Sea water near the bottom remains an influence on the spawning success of the Baltic cod. The eastern Baltic population has been very weakly studied in comparison with the western population. The aim of this study is to demonstrate for the first time genetic differentiation by the use of a large number of SNPs between eastern and western Baltic populations existing in differentiated salinity conditions. Two cod samples were collected from the Bay of Gdańsk, Poland and one from the Kiel Bight, Germany. Samples were genotyped using a cod derived SNP-array (Illumina) with 10 913 SNPs. A selection of diagnostic SNPs was performed. A set of 7944 validated SNPs were analysed to assess the differentiation of three samples of cod. Results indicated a clear distinctness of the Kiel Bight from the populations of the eastern Baltic. FST comparison between both eastern samples was non-significant. Clustering analysis, principal coordinates analysis and assignment test clearly indicated that the eastern samples should be considered as one subpopulation, well differentiated from the western subpopulation. With the SNP approach, no differentiation between groups containing 'healthy' and 'non-healthy' cod individuals was observed.

  6. A Large Maize (Zea mays L.) SNP Genotyping Array: Development and Germplasm Genotyping, and Genetic Mapping to Compare with the B73 Reference Genome

    PubMed Central

    Ganal, Martin W.; Durstewitz, Gregor; Polley, Andreas; Bérard, Aurélie; Buckler, Edward S.; Charcosset, Alain; Clarke, Joseph D.; Graner, Eva-Maria; Hansen, Mark; Joets, Johann; Le Paslier, Marie-Christine; McMullen, Michael D.; Montalent, Pierre; Rose, Mark; Schön, Chris-Carolin; Sun, Qi; Walter, Hildrun; Martin, Olivier C.; Falque, Matthieu

    2011-01-01

    SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection. We report the establishment of a large maize SNP array and its use for diversity analysis and high density linkage mapping. The markers, taken from more than 800,000 SNPs, were selected to be preferentially located in genes and evenly distributed across the genome. The array was tested with a set of maize germplasm including North American and European inbred lines, parent/F1 combinations, and distantly related teosinte material. A total of 49,585 markers, including 33,417 within 17,520 different genes and 16,168 outside genes, were of good quality for genotyping, with an average failure rate of 4% and rates up to 8% in specific germplasm. To demonstrate this array's use in genetic mapping and for the independent validation of the B73 sequence assembly, two intermated maize recombinant inbred line populations – IBM (B73×Mo17) and LHRF (F2×F252) – were genotyped to establish two high density linkage maps with 20,913 and 14,524 markers respectively. 172 mapped markers were absent in the current B73 assembly and their placement can be used for future improvements of the B73 reference sequence. Colinearity of the genetic and physical maps was mostly conserved with some exceptions that suggest errors in the B73 assembly. Five major regions containing non-colinearities were identified on chromosomes 2, 3, 6, 7 and 9, and are supported by both independent genetic maps. Four additional non-colinear regions were found on the LHRF map only; they may be due to a lower density of IBM markers in those regions or to true structural rearrangements between lines. Given the array's high quality, it will be a valuable resource for maize genetics and many aspects of maize breeding. PMID:22174790

  7. A large maize (Zea mays L.) SNP genotyping array: development and germplasm genotyping, and genetic mapping to compare with the B73 reference genome.

    PubMed

    Ganal, Martin W; Durstewitz, Gregor; Polley, Andreas; Bérard, Aurélie; Buckler, Edward S; Charcosset, Alain; Clarke, Joseph D; Graner, Eva-Maria; Hansen, Mark; Joets, Johann; Le Paslier, Marie-Christine; McMullen, Michael D; Montalent, Pierre; Rose, Mark; Schön, Chris-Carolin; Sun, Qi; Walter, Hildrun; Martin, Olivier C; Falque, Matthieu

    2011-01-01

    SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection. We report the establishment of a large maize SNP array and its use for diversity analysis and high density linkage mapping. The markers, taken from more than 800,000 SNPs, were selected to be preferentially located in genes and evenly distributed across the genome. The array was tested with a set of maize germplasm including North American and European inbred lines, parent/F1 combinations, and distantly related teosinte material. A total of 49,585 markers, including 33,417 within 17,520 different genes and 16,168 outside genes, were of good quality for genotyping, with an average failure rate of 4% and rates up to 8% in specific germplasm. To demonstrate this array's use in genetic mapping and for the independent validation of the B73 sequence assembly, two intermated maize recombinant inbred line populations - IBM (B73×Mo17) and LHRF (F2×F252) - were genotyped to establish two high density linkage maps with 20,913 and 14,524 markers respectively. 172 mapped markers were absent in the current B73 assembly and their placement can be used for future improvements of the B73 reference sequence. Colinearity of the genetic and physical maps was mostly conserved with some exceptions that suggest errors in the B73 assembly. Five major regions containing non-colinearities were identified on chromosomes 2, 3, 6, 7 and 9, and are supported by both independent genetic maps. Four additional non-colinear regions were found on the LHRF map only; they may be due to a lower density of IBM markers in those regions or to true structural rearrangements between lines. Given the array's high quality, it will be a valuable resource for maize genetics and many aspects of maize breeding.

  8. Conceptual study of a 250 kW planar SOFC system for CHP application

    NASA Astrophysics Data System (ADS)

    Fontell, E.; Kivisaari, T.; Christiansen, N.; Hansen, J.-B.; Pålsson, J.

    In August 2002, Wärtsilä Corporation and Haldor Topsøe A/S entered into a co-operation agreement to start joint development program within the planar SOFC technology. The development program aims to bring to the market highly efficient, clean and cost competitive fuel cell systems with power outputs above 200 kW for distributed power generation with CHP and for marine applications. In this study, the product concept for a 250 kW natural gas-fuelled atmospheric SOFC plant has been studied. The process has been calculated and optimised for high electrical efficiency. In the calculations, system efficiencies more than 55-85% (electrical co-generation) have been reached. The necessary balance of plant (BoP) components have been identified and the concept for grid connection has been defined. The BoP includes fuel and air supply, anode re-circulation, start-up steam, purge gas, exhaust gas heat recovery, back-up power, power electronics and control system. Based on the analysed system and component information, a conceptual design and cost break down structure for the product have been made. The cost breakdown shows that the stack, system control and power electronics are the major cost factors, while the remaining BoP equipment stands for a minor share of the manufacturing cost. Finally, the feasibility of the SOFC plants has been compared to gas engines.

  9. A High-Resolution SNP Array-Based Linkage Map Anchors a New Domestic Cat Draft Genome Assembly and Provides Detailed Patterns of Recombination

    PubMed Central

    Li, Gang; Hillier, LaDeana W.; Grahn, Robert A.; Zimin, Aleksey V.; David, Victor A.; Menotti-Raymond, Marilyn; Middleton, Rondo; Hannah, Steven; Hendrickson, Sher; Makunin, Alex; O’Brien, Stephen J.; Minx, Pat; Wilson, Richard K.; Lyons, Leslie A.; Warren, Wesley C.; Murphy, William J.

    2016-01-01

    High-resolution genetic and physical maps are invaluable tools for building accurate genome assemblies, and interpreting results of genome-wide association studies (GWAS). Previous genetic and physical maps anchored good quality draft assemblies of the domestic cat genome, enabling the discovery of numerous genes underlying hereditary disease and phenotypes of interest to the biomedical science and breeding communities. However, these maps lacked sufficient marker density to order thousands of shorter scaffolds in earlier assemblies, which instead relied heavily on comparative mapping with related species. A high-resolution map would aid in validating and ordering chromosome scaffolds from existing and new genome assemblies. Here, we describe a high-resolution genetic linkage map of the domestic cat genome based on genotyping 453 domestic cats from several multi-generational pedigrees on the Illumina 63K SNP array. The final maps include 58,055 SNP markers placed relative to 6637 markers with unique positions, distributed across all autosomes and the X chromosome. Our final sex-averaged maps span a total autosomal length of 4464 cM, the longest described linkage map for any mammal, confirming length estimates from a previous microsatellite-based map. The linkage map was used to order and orient the scaffolds from a substantially more contiguous domestic cat genome assembly (Felis catus v8.0), which incorporated ∼20 × coverage of Illumina fragment reads. The new genome assembly shows substantial improvements in contiguity, with a nearly fourfold increase in N50 scaffold size to 18 Mb. We use this map to report probable structural errors in previous maps and assemblies, and to describe features of the recombination landscape, including a massive (∼50 Mb) recombination desert (of virtually zero recombination) on the X chromosome that parallels a similar desert on the porcine X chromosome in both size and physical location. PMID:27172201

  10. A High-Resolution SNP Array-Based Linkage Map Anchors a New Domestic Cat Draft Genome Assembly and Provides Detailed Patterns of Recombination.

    PubMed

    Li, Gang; Hillier, LaDeana W; Grahn, Robert A; Zimin, Aleksey V; David, Victor A; Menotti-Raymond, Marilyn; Middleton, Rondo; Hannah, Steven; Hendrickson, Sher; Makunin, Alex; O'Brien, Stephen J; Minx, Pat; Wilson, Richard K; Lyons, Leslie A; Warren, Wesley C; Murphy, William J

    2016-06-01

    High-resolution genetic and physical maps are invaluable tools for building accurate genome assemblies, and interpreting results of genome-wide association studies (GWAS). Previous genetic and physical maps anchored good quality draft assemblies of the domestic cat genome, enabling the discovery of numerous genes underlying hereditary disease and phenotypes of interest to the biomedical science and breeding communities. However, these maps lacked sufficient marker density to order thousands of shorter scaffolds in earlier assemblies, which instead relied heavily on comparative mapping with related species. A high-resolution map would aid in validating and ordering chromosome scaffolds from existing and new genome assemblies. Here, we describe a high-resolution genetic linkage map of the domestic cat genome based on genotyping 453 domestic cats from several multi-generational pedigrees on the Illumina 63K SNP array. The final maps include 58,055 SNP markers placed relative to 6637 markers with unique positions, distributed across all autosomes and the X chromosome. Our final sex-averaged maps span a total autosomal length of 4464 cM, the longest described linkage map for any mammal, confirming length estimates from a previous microsatellite-based map. The linkage map was used to order and orient the scaffolds from a substantially more contiguous domestic cat genome assembly (Felis catus v8.0), which incorporated ∼20 × coverage of Illumina fragment reads. The new genome assembly shows substantial improvements in contiguity, with a nearly fourfold increase in N50 scaffold size to 18 Mb. We use this map to report probable structural errors in previous maps and assemblies, and to describe features of the recombination landscape, including a massive (∼50 Mb) recombination desert (of virtually zero recombination) on the X chromosome that parallels a similar desert on the porcine X chromosome in both size and physical location.

  11. Development and implementation of a highly-multiplexed SNP array for genetic mapping in maritime pine and comparative mapping with loblolly pine

    PubMed Central

    2011-01-01

    Background Single nucleotide polymorphisms (SNPs) are the most abundant source of genetic variation among individuals of a species. New genotyping technologies allow examining hundreds to thousands of SNPs in a single reaction for a wide range of applications such as genetic diversity analysis, linkage mapping, fine QTL mapping, association studies, marker-assisted or genome-wide selection. In this paper, we evaluated the potential of highly-multiplexed SNP genotyping for genetic mapping in maritime pine (Pinus pinaster Ait.), the main conifer used for commercial plantation in southwestern Europe. Results We designed a custom GoldenGate assay for 1,536 SNPs detected through the resequencing of gene fragments (707 in vitro SNPs/Indels) and from Sanger-derived Expressed Sequenced Tags assembled into a unigene set (829 in silico SNPs/Indels). Offspring from three-generation outbred (G2) and inbred (F2) pedigrees were genotyped. The success rate of the assay was 63.6% and 74.8% for in silico and in vitro SNPs, respectively. A genotyping error rate of 0.4% was further estimated from segregating data of SNPs belonging to the same gene. Overall, 394 SNPs were available for mapping. A total of 287 SNPs were integrated with previously mapped markers in the G2 parental maps, while 179 SNPs were localized on the map generated from the analysis of the F2 progeny. Based on 98 markers segregating in both pedigrees, we were able to generate a consensus map comprising 357 SNPs from 292 different loci. Finally, the analysis of sequence homology between mapped markers and their orthologs in a Pinus taeda linkage map, made it possible to align the 12 linkage groups of both species. Conclusions Our results show that the GoldenGate assay can be used successfully for high-throughput SNP genotyping in maritime pine, a conifer species that has a genome seven times the size of the human genome. This SNP-array will be extended thanks to recent sequencing effort using new generation

  12. Development of two major resources for pea genomics: the GenoPea 13.2K SNP Array and a high-density, high-resolution consensus genetic map.

    PubMed

    Tayeh, Nadim; Aluome, Christelle; Falque, Matthieu; Jacquin, Françoise; Klein, Anthony; Chauveau, Aurélie; Bérard, Aurélie; Houtin, Hervé; Rond, Céline; Kreplak, Jonathan; Boucherot, Karen; Martin, Chantal; Baranger, Alain; Pilet-Nayel, Marie-Laure; Warkentin, Thomas D; Brunel, Dominique; Marget, Pascal; Le Paslier, Marie-Christine; Aubert, Grégoire; Burstin, Judith

    2015-12-01

    Single nucleotide polymorphism (SNP) arrays represent important genotyping tools for innovative strategies in both basic research and applied breeding. Pea is an important food, feed and sustainable crop with a large (about 4.45 Gbp) but not yet available genome sequence. In the present study, 12 pea recombinant inbred line populations were genotyped using the newly developed GenoPea 13.2K SNP Array. Individual and consensus genetic maps were built providing insights into the structure and organization of the pea genome. Largely collinear genetic maps of 3918-8503 SNPs were obtained from all mapping populations, and only two of these exhibited putative chromosomal rearrangement signatures. Similar distortion patterns in different populations were noted. A total of 12 802 transcript-derived SNP markers placed on a 15 079-marker high-density, high-resolution consensus map allowed the identification of ohnologue-rich regions within the pea genome and the localization of local duplicates. Dense syntenic networks with sequenced legume genomes were further established, paving the way for the identification of the molecular bases of important agronomic traits segregating in the mapping populations. The information gained on the structure and organization of the genome from this research will undoubtedly contribute to the understanding of the evolution of the pea genome and to its assembly. The GenoPea 13.2K SNP Array and individual and consensus genetic maps are valuable genomic tools for plant scientists to strengthen pea as a model for genetics and physiology and enhance breeding.

  13. Mosaic maternal uniparental disomy of chromosome 15 in Prader-Willi syndrome: utility of genome-wide SNP array.

    PubMed

    Izumi, Kosuke; Santani, Avni B; Deardorff, Matthew A; Feret, Holly A; Tischler, Tanya; Thiel, Brian D; Mulchandani, Surabhi; Stolle, Catherine A; Spinner, Nancy B; Zackai, Elaine H; Conlin, Laura K

    2013-01-01

    Prader-Willi syndrome is caused by the loss of paternal gene expression on 15q11.2-q13.2, and one of the mechanisms resulting in Prader-Willi syndrome phenotype is maternal uniparental disomy of chromosome 15. Various mechanisms including trisomy rescue, monosomy rescue, and post fertilization errors can lead to uniparental disomy, and its mechanism can be inferred from the pattern of uniparental hetero and isodisomy. Detection of a mosaic cell line provides a unique opportunity to understand the mechanism of uniparental disomy; however, mosaic uniparental disomy is a rare finding in patients with Prader-Willi syndrome. We report on two infants with Prader-Willi syndrome caused by mosaic maternal uniparental disomy 15. Patient 1 has mosaic uniparental isodisomy of the entire chromosome 15, and Patient 2 has mosaic uniparental mixed iso/heterodisomy 15. Genome-wide single-nucleotide polymorphism array was able to demonstrate the presence of chromosomally normal cell line in the Patient 1 and trisomic cell line in Patient 2, and provide the evidence that post-fertilization error and trisomy rescue as a mechanism of uniparental disomy in each case, respectively. Given its ability of detecting small percent mosaicism as well as its capability of identifying the loss of heterozygosity of chromosomal regions, genome-wide single-nucleotide polymorphism array should be utilized as an adjunct to the standard methylation analysis in the evaluation of Prader-Willi syndrome.

  14. Genome-wide detection of CNVs in Chinese indigenous sheep with different types of tails using ovine high-density 600K SNP arrays

    PubMed Central

    Zhu, Caiye; Fan, Hongying; Yuan, Zehu; Hu, Shijin; Ma, Xiaomeng; Xuan, Junli; Wang, Hongwei; Zhang, Li; Wei, Caihong; Zhang, Qin; Zhao, Fuping; Du, Lixin

    2016-01-01

    Chinese indigenous sheep can be classified into three types based on tail morphology: fat-tailed, fat-rumped, and thin-tailed sheep, of which the typical breeds are large-tailed Han sheep, Altay sheep, and Tibetan sheep, respectively. To unravel the genetic mechanisms underlying the phenotypic differences among Chinese indigenous sheep with tails of three different types, we used ovine high-density 600K SNP arrays to detect genome-wide copy number variation (CNV). In large-tailed Han sheep, Altay sheep, and Tibetan sheep, 371, 301, and 66 CNV regions (CNVRs) with lengths of 71.35 Mb, 51.65 Mb, and 10.56 Mb, respectively, were identified on autosomal chromosomes. Ten CNVRs were randomly chosen for confirmation, of which eight were successfully validated. The detected CNVRs harboured 3130 genes, including genes associated with fat deposition, such as PPARA, RXRA, KLF11, ADD1, FASN, PPP1CA, PDGFA, and PEX6. Moreover, multilevel bioinformatics analyses of the detected candidate genes were significantly enriched for involvement in fat deposition, GTPase regulator, and peptide receptor activities. This is the first high-resolution sheep CNV map for Chinese indigenous sheep breeds with three types of tails. Our results provide valuable information that will support investigations of genomic structural variation underlying traits of interest in sheep. PMID:27282145

  15. SNP array and phenotype correlation shows that FLI1 deletion per se is not responsible for thrombocytopenia development in Jacobsen syndrome.

    PubMed

    Trkova, Marie; Becvarova, Vera; Hynek, Martin; Hnykova, Lenka; Hlavova, Eva; Kreckova, Gabriela; Kulovany, Eduard; Cutka, David; Zatloukalova, Jitka; Markova, Kristyna; Sukova, Martina; Horacek, Jiri; Stejskal, David

    2012-10-01

    Jacobsen syndrome (JBS) is a rare chromosomal disorder caused by terminal deletion of the long arm of chromosome 11. We report on four prenatally diagnosed patients with JBS with variable prenatal and postnatal phenotypes and 11q deletions of varying sizes. Precise characterization of the deleted region in three patients was performed by SNP arrays. The severity of both the prenatal and postnatal phenotypes did not correlate with the size of the haploinsufficient region. Despite the large difference in the deletion size (nearly 6 Mb), both of the live-born patients had similar phenotypes corresponding to JBS. However, one of the most prominent features of JBS, thrombocytopenia, was only present in the live-born boy. The girl, who had a significantly longer deletion spanning all four genes suspected of being causative of JBS-related thrombocytopenia (FLI1, ETS1, NFRKB, and JAM3), did not manifest a platelet phenotype. Therefore, our findings do not support the traditional view of deletion size correlation in JBS or the causative role of FLI1, ETS1, NFRKB, and JAM3 deletion per se for the development of disease-related thrombocytopenia.

  16. 250 kV 6 mA compact Cockcroft-Walton high-voltage power supply.

    PubMed

    Ma, Zhan-Wen; Su, Xiao-Dong; Lu, Xiao-Long; Wei, Zhen; Wang, Jun-Run; Huang, Zhi-Wu; Miao, Tian-You; Su, Tong-Ling; Yao, Ze-En

    2016-08-01

    A compact power supply system for a compact neutron generator has been developed. A 4-stage symmetrical Cockcroft-Walton circuit is adopted to produce 250 kV direct current high-voltage. A 2-stage 280 kV isolation transformer system is used to drive the ion source power supply. For a compact structure, safety, and reliability during the operation, the Cockcroft-Walton circuit and the isolation transformer system are enclosed in an epoxy vessel containing the transformer oil whose size is about ∅350 mm × 766 mm. Test results indicate that the maximum output voltage of the power supply is 282 kV, and the stability of the output voltage is better than 0.63% when the high voltage power supply is operated at 250 kV, 6.9 mA with the input voltage varying ±10%.

  17. 250 kV 6 mA compact Cockcroft-Walton high-voltage power supply

    NASA Astrophysics Data System (ADS)

    Ma, Zhan-Wen; Su, Xiao-Dong; Lu, Xiao-Long; Wei, Zhen; Wang, Jun-Run; Huang, Zhi-Wu; Miao, Tian-You; Su, Tong-Ling; Yao, Ze-En

    2016-08-01

    A compact power supply system for a compact neutron generator has been developed. A 4-stage symmetrical Cockcroft-Walton circuit is adopted to produce 250 kV direct current high-voltage. A 2-stage 280 kV isolation transformer system is used to drive the ion source power supply. For a compact structure, safety, and reliability during the operation, the Cockcroft-Walton circuit and the isolation transformer system are enclosed in an epoxy vessel containing the transformer oil whose size is about ∅350 mm × 766 mm. Test results indicate that the maximum output voltage of the power supply is 282 kV, and the stability of the output voltage is better than 0.63% when the high voltage power supply is operated at 250 kV, 6.9 mA with the input voltage varying ±10%.

  18. Performance and Design Analysis of a 250-kW, Grid-Connected Battery Energy Storage System

    SciTech Connect

    Ball, Greg J.; Norris, Benjamin L.

    1999-06-01

    This report documents the assessment of performance and design of a 250-kW prototype battery energy storage system developed by Omnion Power Engineering Company and tested by Pacific Gas and Electric Company, both in collaboration with Sandia National Laboratories. The assess- ment included system performance, operator interface, and reliability. The report also discusses how to detect failed battery strings with strategically located voltage measurements.

  19. Best diagnostic approach for the genetic evaluation of fetuses after intrauterine death in first, second or third trimester: QF-PCR, karyotyping and/or genome wide SNP array analysis

    PubMed Central

    2014-01-01

    Background The aim of this study was to evaluate the best diagnostic approach for the genetic analysis of samples from first, second and third trimester intrauterine fetal deaths (IUFDs). We examined a total of 417 IUFD samples from fetuses with and without congenital anomalies. On 414 samples, karyotyping (N = 46) and/or rapid aneuploidy testing by QF-PCR (N = 371) was performed). One hundred sixty eight samples with a normal test result were subsequently tested by genome wide Single Nucleotide Polymorphism (SNP) array analysis. Three samples were only analyzed by array. Results In 50 (12.0%) samples an aneuploidy was detected by QF-PCR and/or karyotyping, representing 47.1% of first, 13.2% of second and 3.4% of third trimester pregnancies. Karyotyping and QF-PCR failed in 4 (8.7%) and 7 (1.9%) samples, respectively, concerning mostly contaminated amniotic fluid samples from third trimester pregnancies. Clinically relevant aberrations were identified in 4.2% (all fetuses with malformations) of the 168 samples tested by SNP array. Inherited copy number variants (CNVs) were detected in 5.4% and 8.9% showed CNVs of unknown clinical relevance as parental inheritance could not be studied yet. In a sample from a fetus suspect for Meckel-Grüber syndrome, the genotype information from the SNP array revealed various stretches of homozygosity, including one stretch encompassing the CEP290 gene. Subsequent CEP290 mutation analysis revealed a homozygous, pathogenic mutation in this gene. Conclusions Based on our experience we recommend QF-PCR as the first-line test in IUFD samples of first and second trimester pregnancies to exclude aneuploidy before performing array analysis. The chance to detect aneuploidy in third trimester pregnancies is relatively low and therefore array analysis can be performed as a first-tier test. A tissue sample, instead of amniotic fluid, is preferred because of a higher success rate in testing. We emphasize the need for analysis of parental

  20. Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality.

    PubMed

    Ali, Shahin S; Shao, Jonathan; Strem, Mary D; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W; Bailey, Bryan A

    2015-01-01

    Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri.

  1. Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality

    PubMed Central

    Ali, Shahin S.; Shao, Jonathan; Strem, Mary D.; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W.; Bailey, Bryan A.

    2015-01-01

    Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri. PMID:26379633

  2. Construction and start-up of a 250 kW natural gas fueled MCFC demonstration power plant

    SciTech Connect

    Figueroa, R.A.; Carter, J.; Rivera, R.; Otahal, J.

    1996-12-31

    San Diego Gas & Electric (SDG&E) is participating with M-C Power in the development and commercialization program of their internally manifolded heat exchanger (IMHEX{reg_sign}) carbonate fuel cell technology. Development of the IMHEX technology base on the UNOCAL test facility resulted in the demonstration of a 250 kW thermally integrated power plant located at the Naval Air Station at Miramar, California. The members of the commercialization team lead by M-C Power (MCP) include Bechtel Corporation, Stewart & Stevenson Services, Inc., and Ishikawajima-Harima Heavy Industries (IHI). MCP produced the fuel cell stack, Bechtel was responsible for the process engineering including the control system, Stewart & Stevenson was responsible for packaging the process equipment in a skid (pumps, desulfurizer, gas heater, turbo, heat exchanger and stem generator), IHI produced a compact flat plate catalytic reformer operating on natural gas, and SDG&E assumed responsibility for plant construction, start-up and operation of the plant.

  3. Dilatometer setup for low coefficient of thermal expansion materials measurements in the 140 K-250 K temperature range.

    PubMed

    Spannagel, Ruven; Hamann, Ines; Sanjuan, Josep; Schuldt, Thilo; Gohlke, Martin; Johann, Ulrich; Weise, Dennis; Braxmaier, Claus

    2016-10-01

    Space applications demand light weight materials with excellent dimensional stability for telescopes, optical benches, optical resonators, etc. Glass-ceramics and composite materials can be tuned to reach very low coefficient of thermal expansion (CTE) at different temperatures. In order to determine such CTEs, very accurate setups are needed. Here we present a dilatometer that is able to measure the CTE of a large variety of materials in the temperature range of 140 K to 250 K. The dilatometer is based on a heterodyne interferometer with nanometer noise levels to measure the expansion of a sample when applying small amplitude controlled temperature signals. In this article, the CTE of a carbon fiber reinforced polymer sample has been determined with an accuracy in the 10(-8) K(-1) range.

  4. Dilatometer setup for low coefficient of thermal expansion materials measurements in the 140 K-250 K temperature range

    NASA Astrophysics Data System (ADS)

    Spannagel, Ruven; Hamann, Ines; Sanjuan, Josep; Schuldt, Thilo; Gohlke, Martin; Johann, Ulrich; Weise, Dennis; Braxmaier, Claus

    2016-10-01

    Space applications demand light weight materials with excellent dimensional stability for telescopes, optical benches, optical resonators, etc. Glass-ceramics and composite materials can be tuned to reach very low coefficient of thermal expansion (CTE) at different temperatures. In order to determine such CTEs, very accurate setups are needed. Here we present a dilatometer that is able to measure the CTE of a large variety of materials in the temperature range of 140 K to 250 K. The dilatometer is based on a heterodyne interferometer with nanometer noise levels to measure the expansion of a sample when applying small amplitude controlled temperature signals. In this article, the CTE of a carbon fiber reinforced polymer sample has been determined with an accuracy in the 10-8 K-1 range.

  5. SKM-SNP: SNP markers detection method.

    PubMed

    Liu, Yang; Li, Mark; Cheung, Yiu M; Sham, Pak C; Ng, Michael K

    2010-04-01

    SKM-SNP, SNP markers detection program, is proposed to identify a set of relevant SNPs for the association between a disease and multiple marker genotypes. We employ a subspace categorical clustering algorithm to compute a weight for each SNP in the group of patient samples and the group of normal samples, and use the weights to identify the subsets of relevant SNPs that categorize these two groups. The experiments on both Schizophrenia and Parkinson Disease data sets containing genome-wide SNPs are reported to demonstrate the program. Results indicate that our method can find some relevant SNPs that categorize the disease samples. The online SKM-SNP program is available at http://www.math.hkbu.edu.hk/~mng/SKM-SNP/SKM-SNP.html.

  6. Effect of Bubbles on Liquid Nitrogen Breakdown in Plane-Plane Electrode Geometry From 100-250 kPa

    SciTech Connect

    Sauers, Isidor; James, David Randy; Tuncer, Enis; Polyzos, Georgios; Pace, Marshall O

    2011-01-01

    Liquid nitrogen (LN(2)) is used as the cryogen and dielectric for many high temperature superconducting, high voltage applications. When a quench in the superconductor occurs, bubbles are generated which can affect the dielectric breakdown properties of the LN(2). Experiments were performed using plane-plane electrode geometry where bubbles were introduced into the gap through a pinhole in the ground electrode. Bubbles were generated using one or more kapton heaters producing heater powers up to 30 W. Pressure was varied from 100-250 kPa. Breakdown strength was found to be relatively constant up to a given heater power and pressure at which the breakdown strength drops to a low value depending on the pressure. After the drop the breakdown strength continues to drop gradually at higher heater power. This is particularly illustrated at 100 kPa. After the drop in breakdown strength the breakdown is believed to be due to the formation of a vapor bridge. Also the heater power at which the breakdown strength changes from that of LN(2) to that of gaseous nitrogen increases with increasing pressure. The data can provide design constraints for high temperature superconducting fault current limiters (FCLs) so that the formation of a vapor bridge can be suppressed or avoided.

  7. Genomic variation by whole-genome SNP mapping arrays predicts time-to-event outcome in patients with chronic lymphocytic leukemia: a comparison of CLL and HapMap genotypes.

    PubMed

    Schweighofer, Carmen D; Coombes, Kevin R; Majewski, Tadeusz; Barron, Lynn L; Lerner, Susan; Sargent, Rachel L; O'Brien, Susan; Ferrajoli, Alessandra; Wierda, William G; Czerniak, Bogdan A; Medeiros, L Jeffrey; Keating, Michael J; Abruzzo, Lynne V

    2013-03-01

    Genomic abnormalities, such as deletions in 11q22 or 17p13, are associated with poorer prognosis in patients with chronic lymphocytic leukemia (CLL). We hypothesized that unknown regions of copy number variation (CNV) affect clinical outcome and can be detected by array-based single-nucleotide polymorphism (SNP) genotyping. We compared SNP genotypes from 168 untreated patients with CLL with genotypes from 73 white HapMap controls. We identified 322 regions of recurrent CNV, 82 of which occurred significantly more often in CLL than in HapMap (CLL-specific CNV), including regions typically aberrant in CLL: deletions in 6q21, 11q22, 13q14, and 17p13 and trisomy 12. In univariate analyses, 35 of total and 11 of CLL-specific CNVs were associated with unfavorable time-to-event outcomes, including gains or losses in chromosomes 2p, 4p, 4q, 6p, 6q, 7q, 11p, 11q, and 17p. In multivariate analyses, six CNVs (ie, CLL-specific variations in 11p15.1-15.4 or 6q27) predicted time-to-treatment or overall survival independently of established markers of prognosis. Moreover, genotypic complexity (ie, the number of independent CNVs per patient) significantly predicted prognosis, with a median time-to-treatment of 64 months versus 23 months in patients with zero to one versus two or more CNVs, respectively (P = 3.3 × 10(-8)). In summary, a comparison of SNP genotypes from patients with CLL with HapMap controls allowed us to identify known and unknown recurrent CNVs and to determine regions and rates of CNV that predict poorer prognosis in patients with CLL.

  8. Characterization of a Wheat Breeders' Array suitable for high-throughput SNP genotyping of global accessions of hexaploid bread wheat (Triticum aestivum).

    PubMed

    Allen, Alexandra M; Winfield, Mark O; Burridge, Amanda J; Downie, Rowena C; Benbow, Harriet R; Barker, Gary L A; Wilkinson, Paul A; Coghill, Jane; Waterfall, Christy; Davassi, Alessandro; Scopes, Geoff; Pirani, Ali; Webster, Teresa; Brew, Fiona; Bloor, Claire; Griffiths, Simon; Bentley, Alison R; Alda, Mark; Jack, Peter; Phillips, Andrew L; Edwards, Keith J

    2017-03-01

    Targeted selection and inbreeding have resulted in a lack of genetic diversity in elite hexaploid bread wheat accessions. Reduced diversity can be a limiting factor in the breeding of high yielding varieties and crucially can mean reduced resilience in the face of changing climate and resource pressures. Recent technological advances have enabled the development of molecular markers for use in the assessment and utilization of genetic diversity in hexaploid wheat. Starting with a large collection of 819 571 previously characterized wheat markers, here we describe the identification of 35 143 single nucleotide polymorphism-based markers, which are highly suited to the genotyping of elite hexaploid wheat accessions. To assess their suitability, the markers have been validated using a commercial high-density Affymetrix Axiom(®) genotyping array (the Wheat Breeders' Array), in a high-throughput 384 microplate configuration, to characterize a diverse global collection of wheat accessions including landraces and elite lines derived from commercial breeding communities. We demonstrate that the Wheat Breeders' Array is also suitable for generating high-density genetic maps of previously uncharacterized populations and for characterizing novel genetic diversity produced by mutagenesis. To facilitate the use of the array by the wheat community, the markers, the associated sequence and the genotype information have been made available through the interactive web site 'CerealsDB'.

  9. Development of a 63K SNP array for Gossypium and high-density mapping of intra- and inter-specific populations of cotton (G. hirsutum L.)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    High-throughput genotyping arrays provide a standardized resource for crop research communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), candidate marker and quantitative trait loci (QTL) ide...

  10. SNP-VISTA

    SciTech Connect

    Shah, Nameeta; Teplitsky, Michael; Minovitsky, Simon; Dubchak, Inna

    2005-11-07

    SNP-VISTA aids in analyses of the following types of data: A. Large-scale re-sequence data of disease-related genes for discovery of associated and/or causative alleles (GeneSNP-VISTA). B. Massive amounts of ecogenomics data for studying homologous recombination in microbial populations (EcoSNP-VISTA). The main features and capabilities of SNP-VISTA are: 1) Mapping of SNPs to gene structure; 2) classification of SNPs, based on their location in the gene, frequency of occurrence in samples and allele composition; 3) clustering, based on user-defined subsets of SNPs, highlighting haplotypes as well as recombinant sequences; 4) integration of protein conservation visualization; and 5) display of automatically calculated recombination points that are user-editable. The main strength of SNP-VISTA is its graphical interface and use of visual representations, which support interactive exploration and hence better understanding of large-scale SNPs data.

  11. DASH-2: flexible, low-cost, and high-throughput SNP genotyping by dynamic allele-specific hybridization on membrane arrays.

    PubMed

    Jobs, Magnus; Howell, W Mathias; Stromqvist, Linda; Mayr, Torsten; Brookes, Anthony J

    2003-05-01

    Genotyping technologies need to be continually improved in terms of their flexibility, cost-efficiency, and throughput, to push forward genome variation analysis. To this end, we have leveraged the inherent simplicity of dynamic allele-specific hybridization (DASH) and coupled it to recent innovations of centrifugal arrays and iFRET. We have thereby created a new genotyping platform we term DASH-2, which we demonstrate and evaluate in this report. The system is highly flexible in many ways (any plate format, PCR multiplexing, serial and parallel array processing, spectral-multiplexing of hybridization probes), thus supporting a wide range of application scales and objectives. Precision is demonstrated to be in the range 99.8-100%, and assay costs are 0.05 USD or less per genotype assignment. DASH-2 thus provides a powerful new alternative for genotyping practice, which can be used without the need for expensive robotics support.

  12. Lipids, obesity and gallbladder disease in women: insights from genetic studies using the cardiovascular gene-centric 50K SNP array

    PubMed Central

    Rodriguez, Santiago; Gaunt, Tom R; Guo, Yiran; Zheng, Jie; Barnes, Michael R; Tang, Weihang; Danish, Fazal; Johnson, Andrew; Castillo, Berta A; Li, Yun R; Hakonarson, Hakon; Buxbaum, Sarah G; Palmer, Tom; Tsai, Michael Y; Lange, Leslie A; Ebrahim, Shah; Davey Smith, George; Lawlor, Debbie A; Folsom, Aaron R; Hoogeveen, Ron; Reiner, Alex; Keating, Brendan; Day, Ian NM

    2016-01-01

    Gallbladder disease (GBD) has an overall prevalence of 10–40% depending on factors such as age, gender, population, obesity and diabetes, and represents a major economic burden. Although gallstones are composed of cholesterol by-products and are associated with obesity, presumed causal pathways remain unproven, although BMI reduction is typically recommended. We performed genetic studies to discover candidate genes and define pathways involved in GBD. We genotyped 15 241 women of European ancestry from three cohorts, including 3216 with GBD, using the Human cardiovascular disease (HumanCVD) BeadChip containing up to ~53 000 single-nucleotide polymorphisms (SNPs). Effect sizes with P-values for development of GBD were generated. We identify two new loci associated with GBD, GCKR rs1260326:T>C (P=5.88 × 10−7, ß=−0.146) and TTC39B rs686030:C>A (P=6.95x10−7, ß=0.271) and detect four independent SNP effects in ABCG8 rs4953023:G>A (P=7.41 × 10−47, ß=0.734), ABCG8 rs4299376:G>T (P=2.40 × 10−18, ß=0.278), ABCG5 rs6544718:T>C (P=2.08 × 10−14, ß=0.044) and ABCG5 rs6720173:G>C (P=3.81 × 10−12, ß=0.262) in conditional analyses taking genotypes of rs4953023:G>A as a covariate. We also delineate the risk effects among many genotypes known to influence lipids. These data, from the largest GBD genetic study to date, show that specific, mainly hepatocyte-centred, components of lipid metabolism are important to GBD risk in women. We discuss the potential pharmaceutical implications of our findings. PMID:25920552

  13. Lipids, obesity and gallbladder disease in women: insights from genetic studies using the cardiovascular gene-centric 50K SNP array.

    PubMed

    Rodriguez, Santiago; Gaunt, Tom R; Guo, Yiran; Zheng, Jie; Barnes, Michael R; Tang, Weihang; Danish, Fazal; Johnson, Andrew; Castillo, Berta A; Li, Yun R; Hakonarson, Hakon; Buxbaum, Sarah G; Palmer, Tom; Tsai, Michael Y; Lange, Leslie A; Ebrahim, Shah; Davey Smith, George; Lawlor, Debbie A; Folsom, Aaron R; Hoogeveen, Ron; Reiner, Alex; Keating, Brendan; Day, Ian N M

    2016-01-01

    Gallbladder disease (GBD) has an overall prevalence of 10-40% depending on factors such as age, gender, population, obesity and diabetes, and represents a major economic burden. Although gallstones are composed of cholesterol by-products and are associated with obesity, presumed causal pathways remain unproven, although BMI reduction is typically recommended. We performed genetic studies to discover candidate genes and define pathways involved in GBD. We genotyped 15,241 women of European ancestry from three cohorts, including 3216 with GBD, using the Human cardiovascular disease (HumanCVD) BeadChip containing up to ~ 53,000 single-nucleotide polymorphisms (SNPs). Effect sizes with P-values for development of GBD were generated. We identify two new loci associated with GBD, GCKR rs1260326:T>C (P = 5.88 × 10(-7), ß = -0.146) and TTC39B rs686030:C>A (P = 6.95 x 10(-7), ß = 0.271) and detect four independent SNP effects in ABCG8 rs4953023:G>A (P=7.41 × 10(-47), ß = 0.734), ABCG8 rs4299376:G(>)T (P = 2.40 × 10(-18), ß = 0.278), ABCG5 rs6544718:T>C (P = 2.08 × 10(-14), ß = 0.044) and ABCG5 rs6720173:G>C (P = 3.81 × 10(-12), ß(=)0.262) in conditional analyses taking genotypes of rs4953023:G>A as a covariate. We also delineate the risk effects among many genotypes known to influence lipids. These data, from the largest GBD genetic study to date, show that specific, mainly hepatocyte-centred, components of lipid metabolism are important to GBD risk in women. We discuss the potential pharmaceutical implications of our findings.

  14. Test of a 250 kVA Battery-Inverter System Micro-Grid: Cooperative Research and Development Final Report, CRADA Number CRD-11-460

    SciTech Connect

    Kramer, William; Martin, Greg; Lundstrom, Blake

    2013-12-01

    Portland General Electric (PGE) is installing a 5-megawatt (MW) lithium-ion-based battery-inverter system (BIS) in Salem, Oregon, as part of the Pacific Northwest Smart Grid Demonstration Project. NREL will assist PGE in testing a 250-kilovolt-ampere (kVA) portion of the BIS in order to verify correct operation and minimize risk to subsequent demonstrations. In this project NREL will providetechnical support for the 250-kVA test and will work with PGE to write a test plan and evaluate the system in the lab before deployment in the field.

  15. A comparison of the reactivating and therapeutic efficacy of newly developed bispyridinium oximes (K250, K251) with commonly used oximes against tabun in rats and mice.

    PubMed

    Kassa, Jiri; Karasova, Jana; Bajgar, Jiri; Kuca, Kamil; Musilek, Kamil; Kopelikova, Irena

    2009-08-01

    The potency of newly developed bispyridinium compounds (K250, K251) in reactivating tabun-inhibited acetylcholinesterase and reducing tabun-induced lethal toxic effects was compared with currently available oximes (obidoxime, trimedoxime, the oxime HI-6) using in vivo methods. Studies determined percentage of reactivation of tabun-inhibited blood and tissue AChE in poisoned rats and showed that the reactivating efficacy of both newly developed oximes is comparable with the oxime HI-6 but it is significantly lower than the reactivating effects of obidoxime and trimedoxime, especially in diaphragm and brain. Both newly developed oximes were also found to be able to slightly reduce lethal toxic effects in tabun-poisoned mice. Their therapeutic efficacy is higher than the potency of the oxime HI-6 but it is lower than the therapeutic effects of trimedoxime and obidoxime. Thus, the reactivating and therapeutic potency of both newly developed oximes (K250, K251) does not prevail over the effectiveness of currently available oximes and, therefore, they are not suitable for their replacement for the treatment of acute tabun poisoning.

  16. Inference of kinship coefficients from Korean SNP genotyping data.

    PubMed

    Park, Seong-Jin; Yang, Jin Ok; Kim, Sang Cheol; Kwon, Jekeun; Lee, Sanghyuk; Lee, Byungwook

    2013-06-01

    The determination of relatedness between individuals in a family is crucial in analysis of common complex diseases. We present a method to infer close inter-familial relationships based on SNP genotyping data and provide the relationship coefficient of kinship in Korean families. We obtained blood samples from 43 Korean individuals in two families. SNP data was obtained using the Affymetrix Genome-wide Human SNP array 6.0 and the Illumina Human 1M-Duo chip. To measure the kinship coefficient with the SNP genotyping data, we considered all possible pairs of individuals in each family. The genetic distance between two individuals in a pair was determined using the allele sharing distance method. The results show that genetic distance is proportional to the kinship coefficient and that a close degree of kinship can be confirmed with SNP genotyping data. This study represents the first attempt to identify the genetic distance between very closely related individuals.

  17. Delineation of a de novo 7q21.3q31.1 Deletion by CGH-SNP Arrays in a Girl with Multiple Congenital Anomalies Including Severe Glaucoma

    PubMed Central

    Martínez-Jacobo, L.; Córdova-Fletes, C.; Ortiz-López, R.; Rivas, F.; Saucedo-Carrasco, C.; Rojas-Martínez, A.

    2013-01-01

    In this study, we present a female patient with a constitutional de novo deletion in 7q21.3q31.1 as determined by G-banding and CGH-SNP arrays. She exhibited, among other features, psychomotor retardation, congenital severe bilateral glaucoma, a cleft palate, and heart defect. Microarray assay disclosed a deleted 12.5-Mb region roughly 88 kb downstream the ectrodactyly critical region; thus, the patient's final karyotype was 46,XX.arr 7q21.3q31.1(96,742,140-109,246,085)×1 dn. This girl represents the fourth patient described so far with congenital glaucoma and a deletion encompassing or overlapping the 7q21.3q31.1 region, and confirms the presence of a locus or loci related to such a clinical feature. According to our results, the proneness to ocular defects secondary to 7q intermediate deletions could be caused by co-deletion of TAC1, HBP1, and a small cluster of cytochrome P450 genes (subfamily 3A). This conclusion is supported by their functional roles and expression locations as well as because TAC1 is related to the functional pathway of the MYOC gene whose mutations are linked to glaucoma. Moreover, given that this girl is clinically reminiscent of several phenotypes related to diverse deletions within 7q21q32, our results and observations offer a general overview of the gene content of deletions/phenotypes overlapping 7q21.3q31.1 and confirm that loci distal to DLX genes including the CUX1 gene and potential regulatory elements downstream from DLX5 are unrelated to ectrodactyly. PMID:24167464

  18. SNP panels/Imputation

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Participants from thirteen countries discussed services that Interbull can perform or recommendations that Interbull can make to promote harmonization and assist member countries in improving their genomic evaluations in regard to SNP panels and imputation. The panel recommended: A mechanism to shar...

  19. Comparison of a 250 kV single-stage accelerator mass spectrometer with a 5 MV tandem accelerator mass spectrometer--fitness for purpose in bioanalysis.

    PubMed

    Young, G C; Corless, S; Felgate, C C; Colthup, P V

    2008-12-01

    The introduction of 'compact' accelerator mass spectrometers into biomedical science, including use in drug metabolism and bioanalytical applications, is an exciting recent development. Comparisons are presented here between a more established and relatively large tandem accelerator which operates at up to 5 MV and a conventional laboratory-sized 250 kV single-stage accelerator mass spectrometer. Biological samples were enriched with low levels of radiocarbon, then converted into graphite prior to analysis on each of the two instruments. The data obtained showed the single-stage instrument to be capable of delivering comparable results, and thus able to provide similar study support, with that provided by the 5 MV instrument, without the significant overheads and complexities which are inherent to the operation of the larger instrument. We believe that the advent of these laboratory-sized accelerator mass spectrometry (AMS) instruments represents a real turning point in the potential for application of AMS by a wider user group.

  20. Analysis of population structure and genetic history of cattle breeds based on high-density SNP data

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Advances in single nucleotide polymorphism (SNP) genotyping microarrays have facilitated a new understanding of population structure and evolutionary history for several species. Most existing studies in livestock were based on low density SNP arrays. The first wave of low density SNP studies on cat...

  1. Fine-scaled human genetic structure revealed by SNP microarrays.

    PubMed

    Xing, Jinchuan; Watkins, W Scott; Witherspoon, David J; Zhang, Yuhua; Guthery, Stephen L; Thara, Rangaswamy; Mowry, Bryan J; Bulayeva, Kazima; Weiss, Robert B; Jorde, Lynn B

    2009-05-01

    We report an analysis of more than 240,000 loci genotyped using the Affymetrix SNP microarray in 554 individuals from 27 worldwide populations in Africa, Asia, and Europe. To provide a more extensive and complete sampling of human genetic variation, we have included caste and tribal samples from two states in South India, Daghestanis from eastern Europe, and the Iban from Malaysia. Consistent with observations made by Charles Darwin, our results highlight shared variation among human populations and demonstrate that much genetic variation is geographically continuous. At the same time, principal components analyses reveal discernible genetic differentiation among almost all identified populations in our sample, and in most cases, individuals can be clearly assigned to defined populations on the basis of SNP genotypes. All individuals are accurately classified into continental groups using a model-based clustering algorithm, but between closely related populations, genetic and self-classifications conflict for some individuals. The 250K data permitted high-level resolution of genetic variation among Indian caste and tribal populations and between highland and lowland Daghestani populations. In particular, upper-caste individuals from Tamil Nadu and Andhra Pradesh form one defined group, lower-caste individuals from these two states form another, and the tribal Irula samples form a third. Our results emphasize the correlation of genetic and geographic distances and highlight other elements, including social factors that have contributed to population structure.

  2. Pulsed Yb:fiber system capable of >250kW peak power with tunable pulses in the 50ps to 1.5ns range

    NASA Astrophysics Data System (ADS)

    McComb, Timothy S.; Lowder, Tyson L.; Leadbetter, Vickie; Reynolds, Mitch; Saracco, Matthieu J.; Hutchinson, Joel; Green, Jared; McCal, Dennis; Burkholder, Gary; Kutscha, Tim; Dittli, Adam; Hamilton, Chuck; Kliner, Dahv A. V.; Randall, Matthew; Fanning, Geoff; Bell, Jake

    2013-03-01

    We have demonstrated a pulsed 1064 nm PM Yb:fiber laser system incorporating a seed source with a tunable pulse repetition rate and pulse duration and a multistage fiber amplifier, ending in a large core (>650 μm2 mode field area), tapered fiber amplifier. The amplifier chain is all-fiber, with the exception of the final amplifier's pump combiner, allowing robust, compact packaging. The air-cooled laser system is rated for >60 W of average power and beam quality of M2 < 1.3 at repetition rates below 100 kHz to 10's of MHz, with pulses discretely tunable over a range spanning 50 ps to greater than 1.5 ns. Maximum pulse energies, limited by the onset of self phase modulation and stimulated Raman scattering, are greater than 12.5 μJ at 50 ps and 375 μJ at 1.5 ns , corresponding to >250 kW peak power across the pulse tuning range. We present frequency conversion to 532 nm with efficiency greater than 70% and conversion to UV via frequency tripling, with initial feasibility experiments showing >30% UV conversion efficiency. Application results of the laser in scribing, thin film removal and micro-machining will be discussed.

  3. High throughput SNP discovery and validation in the pig: towards the development of a high density swine SNP chip

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Recent developments in sequencing technology have allowed the generation of millions of short read sequences in a fast and inexpensive way. This enables the cost effective large scale identification of hundreds of thousands of SNPs needed for the development of high density SNP arrays. Currently, a ...

  4. Ultrahigh-density linkage map for cultivated cucumber (Cucumis sativus L.) using a single-nucleotide polymorphism genotyping array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    With the low cost of single nucleotide polymorphism (SNP) discovery, use of SNP markers for SNP array development is becoming more affordable. The SNP array is a very useful tool for high throughput genotyping and has a number of applications such as genome-wide association studies (GWAS). Since the...

  5. SNP genotyping by heteroduplex analysis.

    PubMed

    Paniego, Norma; Fusari, Corina; Lia, Verónica; Puebla, Andrea

    2015-01-01

    Heteroduplex-based genotyping methods have proven to be technologically effective and economically efficient for low- to medium-range throughput single-nucleotide polymorphism (SNP) determination. In this chapter we describe two protocols that were successfully applied for SNP detection and haplotype analysis of candidate genes in association studies. The protocols involve (1) enzymatic mismatch cleavage with endonuclease CEL1 from celery, associated with fragment separation using capillary electrophoresis (CEL1 cleavage), and (2) differential retention of the homo/heteroduplex DNA molecules under partial denaturing conditions on ion pair reversed-phase liquid chromatography (dHPLC). Both methods are complementary since dHPLC is more versatile than CEL1 cleavage for identifying multiple SNP per target region, and the latter is easily optimized for sequences with fewer SNPs or small insertion/deletion polymorphisms. Besides, CEL1 cleavage is a powerful method to localize the position of the mutation when fragment resolution is done using capillary electrophoresis.

  6. SNP uniqueness problem: a proof-of-principle in HapMap SNPs.

    PubMed

    Doron, Shany; Shweiki, Dorit

    2011-04-01

    SNP-based research strongly affects our biomedical and clinically associated knowledge. Nonunique and false-positive SNP existence in commonly used datasets may thus lead to biased, inaccurate clinically associated conclusions. We designed a computational study to reveal the degree of nonunique/false-positive SNPs in the HapMap dataset. Two sets of SNP flanking sequences were used as queries for BLAT analysis against the human genome. 4.2% and 11.9% of HapMap SNPs align to the genome nonuniquely (long and short, respectively). Furthermore, an average of 7.9% nonunique SNPs are included in common commercial genotyping arrays (according to our designed probes). Nonunique SNPs identified in this study are represented to various degrees in clinically associated databases, stressing the consequence of inaccurate SNP annotation and hence SNP utilization. Unfortunately, our results question some disease-related genotyping analyses, raising a worrisome concern on their validity.

  7. UASIS: Universal Automatic SNP Identification System

    PubMed Central

    2011-01-01

    Background SNP (Single Nucleotide Polymorphism), the most common genetic variations between human beings, is believed to be a promising way towards personalized medicine. As more and more research on SNPs are being conducted, non-standard nomenclatures may generate potential problems. The most serious issue is that researchers cannot perform cross referencing among different SNP databases. This will result in more resources and time required to track SNPs. It could be detrimental to the entire academic community. Results UASIS (Universal Automated SNP Identification System) is a web-based server for SNP nomenclature standardization and translation at DNA level. Three utilities are available. They are UASIS Aligner, Universal SNP Name Generator and SNP Name Mapper. UASIS maps SNPs from different databases, including dbSNP, GWAS, HapMap and JSNP etc., into an uniform view efficiently using a proposed universal nomenclature and state-of-art alignment algorithms. UASIS is freely available at http://www.uasis.tk with no requirement of log-in. Conclusions UASIS is a helpful platform for SNP cross referencing and tracking. By providing an informative, unique and unambiguous nomenclature, which utilizes unique position of a SNP, we aim to resolve the ambiguity of SNP nomenclatures currently practised. Our universal nomenclature is a good complement to mainstream SNP notations such as rs# and HGVS guidelines. UASIS acts as a bridge to connect heterogeneous representations of SNPs. PMID:22369494

  8. Linear reduction methods for tag SNP selection.

    PubMed

    He, Jingwu; Zelikovsky, Alex

    2004-01-01

    It is widely hoped that constructing a complete human haplotype map will help to associate complex diseases with certain SNP's. Unfortunately, the number of SNP's is huge and it is very costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNP's that should be sequenced to considerably small number of informative representatives, so called tag SNP's. In this paper, we propose a new linear algebra based method for selecting and using tag SNP's. Our method is purely combinatorial and can be combined with linkage disequilibrium (LD) and block based methods. We measure the quality of our tag SNP selection algorithm by comparing actual SNP's with SNP's linearly predicted from linearly chosen tag SNP's. We obtain an extremely good compression and prediction rates. For example, for long haplotypes (>25000 SNP's), knowing only 0.4% of all SNP's we predict the entire unknown haplotype with 2% accuracy while the prediction method is based on a 10% sample of the population.

  9. Wrinkled single-layer graphenes fabricated by silicon nanopillar arrays

    NASA Astrophysics Data System (ADS)

    Li, Zibo; Wu, Yutong; Nan, Jingjie; Tang, Xiaoduo; Zhang, Junhu; Yang, Bai

    2016-11-01

    The degree of crumpling affects the optoelectronic properties of graphene, which are very important for the performance of graphene-based devices and materials. In this article, we report an approach to tune the formation of wrinkles on single-layer graphene (SLG) by silicon nanopillar (SNP) arrays. By using gold nanoparticles as an etching mask, SNP arrays with different heights could be prepared by tuning the duration of etching. The formation of wrinkles on these SNP arrays was studied systematically. We found that thermal treatment could lead to a wrapping behavior of graphene around SNP arrays, which was accompanied by the emergence of many more wrinkles. Controllable wettability, conductivity and transmittance were demonstrated. This ability to tune wrinkles using SNP arrays can be employed to engineer the fabrication of graphene-related devices and other optoelectronic applications.

  10. LiF:Mg,Ti TLD response as a function of photon energy for moderately filtered x-ray spectra in the range of 20-250 kVp relative to {sup 60}Co

    SciTech Connect

    Nunn, A. A.; Davis, S. D.; Micka, J. A.; DeWerd, L. A.

    2008-05-15

    The response of LiF:Mg,Ti thermoluminescent dosimeters (TLDs) as a function of photon energy was determined using irradiations with moderately filtered x-ray beams in the energy range of 20-250 kVp relative to the response to irradiations with {sup 60}Co photons. To determine if the relative light output from LiF:Mg,Ti TLDs per unit air kerma as a function of photon energy can be predicted using calculations such as Monte Carlo (MC) simulations, measurements from the x-ray beam irradiations were compared with MC calculated results, similar to the methodology used by Davis et al. [Radiat. Prot. Dosim. 106, 33-43 (2003)]. TLDs were irradiated in photon beams with well-known air kerma rates using the National Institute of Standards and Technology traceable M-series x-ray beams in the range of 20-250 kVp. For each x-ray beam, several sets of TLDs were irradiated for times corresponding to different air kerma levels to take into account any dose nonlinearity. TLD light output was then compared to that from several sets of TLDs irradiated at similar corresponding air kerma levels using a {sup 60}Co irradiator. The MC code MCNP5 was used to account for photon scatter and attenuation in the holder and TLDs and was used to calculate the predicted relative TLD light output per unit air kerma for irradiations with each of the experimentally used photon beams. The measured relative TLD response as a function of photon energy differed by up to 13% from the MC calculations. We conclude that MC calculations do not accurately predict the relative response of TLDs as a function of photon energy, consistent with the conclusions of Davis et al. [Radiat. Prot. Dosim. 106, 33-43 (2003)]. This is likely due to complications in the solid state physics of the thermoluminescence process that are not incorporated into the simulation.

  11. A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation

    PubMed Central

    2013-01-01

    Background Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. Results We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Conclusions Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array—more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation

  12. Genome-wide SNP typing reveals signatures of population history.

    PubMed

    Hughes, Austin L; Welch, Robert; Puri, Vinita; Matthews, Casey; Haque, Kashif; Chanock, Stephen J; Yeager, Meredith

    2008-07-01

    Single-nucleotide polymorphism (SNP) arrays have become a popular technology for disease-association studies, but they also have potential for studying the genetic differentiation of human populations. Application of the Affymetrix GeneChip Human Mapping 500K Array Set to a population of 102 individuals representing the major ethnic groups in the United States (African, Asian, European, and Hispanic) revealed patterns of gene diversity and genetic distance that reflected population history. We analyzed allelic frequencies at 388,654 autosomal SNP sites that showed some variation in our study population and 10% or fewer missing values. Despite the small size (23-31 individuals) of each subpopulation, there were no fixed differences at any site between any two subpopulations. As expected from the African origin of modern humans, greater gene diversity was seen in Africans than in either Asians or Europeans, and the genetic distance between the Asian and the European populations was significantly lower than that between either of these two populations and Africans. Principal components analysis applied to a correlation matrix among individuals was able to separate completely the major continental groups of humans (Africans, Asians, and Europeans), while Hispanics overlapped all three of these groups. Genes containing two or more markers with extraordinarily high genetic distance between subpopulations were identified as candidate genes for health differences between subpopulations. The results show that, even with modest sample sizes, genome-wide SNP genotyping technologies have great promise for capturing signatures of gene frequency difference between human subpopulations, with applications in areas as diverse as forensics and the study of ethnic health disparities.

  13. Detecting Susceptibility to Breast Cancer with SNP-SNP Interaction Using BPSOHS and Emotional Neural Networks

    PubMed Central

    Wang, Xiao; Fan, Yue

    2016-01-01

    Studies for the association between diseases and informative single nucleotide polymorphisms (SNPs) have received great attention. However, most of them just use the whole set of useful SNPs and fail to consider the SNP-SNP interactions, while these interactions have already been proven in biology experiments. In this paper, we use a binary particle swarm optimization with hierarchical structure (BPSOHS) algorithm to improve the effective of PSO for the identification of the SNP-SNP interactions. Furthermore, in order to use these SNP interactions in the susceptibility analysis, we propose an emotional neural network (ENN) to treat SNP interactions as emotional tendency. Different from the normal architecture, just as the emotional brain, this architecture provides a specific path to treat the emotional value, by which the SNP interactions can be considered more quickly and directly. The ENN helps us use the prior knowledge about the SNP interactions and other influence factors together. Finally, the experimental results prove that the proposed BPSOHS_ENN algorithm can detect the informative SNP-SNP interaction and predict the breast cancer risk with a much higher accuracy than existing methods. PMID:27294121

  14. Crystallization of 21.25Gd 2O 3-63.75MoO 3-15B 2O 3 glass induced by femtosecond laser at the repetition rate of 250 kHz

    NASA Astrophysics Data System (ADS)

    Zhong, M. J.; Han, Y. M.; Liu, L. P.; Zhou, P.; Du, Y. Y.; Guo, Q. T.; Ma, H. L.; Dai, Y.

    2010-12-01

    We report the formation of β'-Gd 2(MoO 4) 3 (GMO) crystal on the surface of the 21.25Gd 2O 3-63.75MoO 3-15B 2O 3 glass, induced by 250 kHz, 800 nm femtosecond laser irradiation. The morphology of the modified region in the glass was clearly examined by scanning electron microscopy (SEM). By micro-Raman spectra, the laser-induced crystals were confirmed to be GMO phases and it is found that these crystals have a strong dependence on the number and power of the femtosecond laser pulses. When the irradiation laser power was 900 mW, not only the Raman peaks of GMO crystals but also some new peaks at 214 cm -1, 240 cm -1, 466 cm -1, 664 cm -1 and 994 cm -1which belong to the MoO 3 crystals were observed. The possible mechanisms are proposed to explain these phenomena.

  15. Determination of the Sensibility Factors for TLD-100 Powder on the Energy of X-Ray of 50, 250 kVp; 192Ir, 137Cs and 60Co

    SciTech Connect

    Loaiza, Sandra P.; Alvarez, Jose T.

    2006-09-08

    TLD-100 powder is calibrated in terms of absorbed dose to water Dw, using the protocols AAPM TG61, AAPM TG43 and IAEA-TRS 398, for the energy of RX 50, 250 kVp, 137Cs and 60Co respectively. The calibration curves, TLD Response R versus Dw, are fitted by weighted least square by a quadratic polynomials; which are validated with the lack of fit and the Anderson-Darling normality test. The slope of these curves corresponds to the sensibility factor: Fs R/DW, [Fs] = nC Gy-1. The expanded uncertainties U's for these factors are obtained from the ANOVA tables. Later, the Fs' values are interpolated using the effective energy hvefec for the 192Ir. The SSDL sent a set of capsules with powder TLD-100 for two Hospitals. These irradiated them a nominal dose of Dw = 2 Gy. The results determined at SSDL are: for the Hospital A the Dw is overestimated in order to 4.8% and the Hospital B underestimates it in the range from -1.4% to -17.5%.

  16. SNP Cutter: a comprehensive tool for SNP PCR–RFLP assay design

    PubMed Central

    Zhang, Ruifang; Zhu, Zanhua; Zhu, Hongming; Nguyen, Tu; Yao, Fengxia; Xia, Kun; Liang, Desheng; Liu, Chunyu

    2005-01-01

    The Polymerase chain reaction–restriction fragment length polymorphism (PCR–RFLP) is a relatively simple and inexpensive method for genotyping single nucleotide polymorphisms (SNPs). It requires minimal investment in instrumentation. Here, we describe a web application, ‘SNP Cutter,’ which designs PCR–RFLP assays on a batch of SNPs from the human genome. NCBI dbSNP rs IDs or formatted SNPs are submitted into the SNP Cutter which then uses restriction enzymes from a pre-selected list to perform enzyme selection. The program is capable of designing primers for either natural PCR–RFLP or mismatch PCR–RFLP, depending on the SNP sequence data. SNP Cutter generates the information needed to evaluate and perform genotyping experiments, including a PCR primers list, sizes of original amplicons and different allelic fragment after enzyme digestion. Some output data is tab-delimited, therefore suitable for database archiving. The SNP Cut-ter is available at . PMID:15980518

  17. SNP genotyping by DNA photoligation: application to SNP detection of genes from food crops

    NASA Astrophysics Data System (ADS)

    Yoshimura, Yoshinaga; Ohtake, Tomoko; Okada, Hajime; Ami, Takehiro; Tsukaguchi, Tadashi; Fujimoto, Kenzo

    2009-06-01

    We describe a simple and inexpensive single-nucleotide polymorphism (SNP) typing method, using DNA photoligation with 5-carboxyvinyl-2'-deoxyuridine and two fluorophores. This SNP-typing method facilitates qualitative determination of genes from indica and japonica rice, and showed a high degree of single nucleotide specificity up to 10 000. This method can be used in the SNP typing of actual genomic DNA samples from food crops.

  18. A SNP-Based Molecular Barcode for Characterization of Common Wheat

    PubMed Central

    Gao, LiFeng; Jia, JiZeng; Kong, XiuYing

    2016-01-01

    Wheat is grown as a staple crop worldwide. It is important to develop an effective genotyping tool for this cereal grain both to identify germplasm diversity and to protect the rights of breeders. Single-nucleotide polymorphism (SNP) genotyping provides a means for developing a practical, rapid, inexpensive and high-throughput assay. Here, we investigated SNPs as robust markers of genetic variation for typing wheat cultivars. We identified SNPs from an array of 9000 across a collection of 429 well-known wheat cultivars grown in China, of which 43 SNP markers with high minor allele frequency and variations discriminated the selected wheat varieties and their wild ancestors. This SNP-based barcode will allow for the rapid and precise identification of wheat germplasm resources and newly released varieties and will further assist in the wheat breeding program. PMID:26985664

  19. HapRice, an SNP haplotype database and a web tool for rice.

    PubMed

    Yonemaru, Jun-ichi; Ebana, Kaworu; Yano, Masahiro

    2014-01-01

    Genome-wide single nucleotide polymorphism (SNP) analysis is a promising tool to examine the genetic diversity of rice populations and genetic traits of scientific and economic importance. Next-generation sequencing technology has accelerated the re-sequencing of diverse rice varieties and the discovery of genome-wide SNPs. Notably, validation of these SNPs by a high-throughput genotyping system, such as an SNP array, could provide a manageable and highly accurate SNP set. To enhance the potential utility of genome-wide SNPs for geneticists and breeders, analysis tools need to be developed. Here, we constructed an SNP haplotype database, which allows visualization of the allele frequency of all SNPs in the genome browser. We calculated the allele frequencies of 3,334 SNPs in 76 accessions from the world rice collection and 3,252 SNPs in 177 Japanese rice accessions; all these SNPs have been validated in our previous studies. The SNP haplotypes were defined by the allele frequency in each cultivar group (aus, indica, tropical japonica and temperate japonica) for the world rice accessions, and in non-irrigated and three irrigated groups (three variety registration periods) for Japanese rice accessions. We also developed web tools for finding polymorphic SNPs between any two rice accessions and for the primer design to develop cleaved amplified polymorphic sequence markers at any SNP. The 'HapRice' database and the web tools can be accessed at http://qtaro.abr.affrc.go.jp/index.html. In addition, we established a core SNP set consisting of 768 SNPs uniformly distributed in the rice genome; this set is of a practically appropriate size for use in rice genetic analysis.

  20. SNPMeta: SNP annotation and SNP metadata collection without a reference genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The increase in availability of resequencing data is greatly accelerating SNP discovery and has facilitated the development of SNP genotyping assays. This, in turn, is increasing interest in annotation of individual SNPs. Currently, these data are only available through curation, or comparison to a ...

  1. Characterization of the Streptomyces sp. Strain C5 snp Locus and Development of snp-Derived Expression Vectors

    PubMed Central

    DeSanti, Charles L.; Strohl, William R.

    2003-01-01

    The Streptomyces sp. strain C5 snp locus is comprised of two divergently oriented genes: snpA, a metalloproteinase gene, and snpR, which encodes a LysR-like activator of snpA transcription. The transcriptional start point of snpR is immediately downstream of a strong T-N11-A inverted repeat motif likely to be the SnpR binding site, while the snpA transcriptional start site overlaps the ATG start codon, generating a leaderless snpA transcript. By using the aphII reporter gene of pIJ486 as a reporter, the plasmid-borne snpR-activated snpA promoter was ca. 60-fold more active than either the nonactivated snpA promoter or the melC1 promoter of pIJ702. The snpR-activated snpA promoter produced reporter protein levels comparable to those of the up-mutated ermE∗ promoter. The SnpR-activated snpA promoter was built into a set of transcriptional and translational fusion expression vectors which have been used for the intracellular expression of numerous daunomycin biosynthesis pathway genes from Streptomyces sp. strain C5 as well as the expression and secretion of soluble recombinant human endostatin. PMID:12620855

  2. SNP markers-based map construction and genome-wide linkage analysis in Brassica napus.

    PubMed

    Raman, Harsh; Dalton-Morgan, Jessica; Diffey, Simon; Raman, Rosy; Alamery, Salman; Edwards, David; Batley, Jacqueline

    2014-09-01

    An Illumina Infinium array comprising 5306 single nucleotide polymorphism (SNP) markers was used to genotype 175 individuals of a doubled haploid population derived from a cross between Skipton and Ag-Spectrum, two Australian cultivars of rapeseed (Brassica napus L.). A genetic linkage map based on 613 SNP and 228 non-SNP (DArT, SSR, SRAP and candidate gene markers) covering 2514.8 cM was constructed and further utilized to identify loci associated with flowering time and resistance to blackleg, a disease caused by the fungus Leptosphaeria maculans. Comparison between genetic map positions of SNP markers and the sequenced Brassica rapa (A) and Brassica oleracea (C) genome scaffolds showed several genomic rearrangements in the B. napus genome. A major locus controlling resistance to L. maculans was identified at both seedling and adult plant stages on chromosome A07. QTL analyses revealed that up to 40.2% of genetic variation for flowering time was accounted for by loci having quantitative effects. Comparative mapping showed Arabidopsis and Brassica flowering genes such as Phytochrome A/D, Flowering Locus C and agamous-Like MADS box gene AGL1 map within marker intervals associated with flowering time in a DH population from Skipton/Ag-Spectrum. Genomic regions associated with flowering time and resistance to L. maculans had several SNP markers mapped within 10 cM. Our results suggest that SNP markers will be suitable for various applications such as trait introgression, comparative mapping and high-resolution mapping of loci in B. napus.

  3. Elucidation of the ‘Honeycrisp’ pedigree through haplotype analysis with a multi-family integrated SNP linkage map and a large apple (Malus×domestica) pedigree-connected SNP data set

    PubMed Central

    Howard, Nicholas P; van de Weg, Eric; Bedford, David S; Peace, Cameron P; Vanderzande, Stijn; Clark, Matthew D; Teh, Soon Li; Cai, Lichun; Luby, James J

    2017-01-01

    The apple (Malus×domestica) cultivar Honeycrisp has become important economically and as a breeding parent. An earlier study with SSR markers indicated the original recorded pedigree of ‘Honeycrisp’ was incorrect and ‘Keepsake’ was identified as one putative parent, the other being unknown. The objective of this study was to verify ‘Keepsake’ as a parent and identify and genetically describe the unknown parent and its grandparents. A multi-family based dense and high-quality integrated SNP map was created using the apple 8 K Illumina Infinium SNP array. This map was used alongside a large pedigree-connected data set from the RosBREED project to build extended SNP haplotypes and to identify pedigree relationships. ‘Keepsake’ was verified as one parent of ‘Honeycrisp’ and ‘Duchess of Oldenburg’ and ‘Golden Delicious’ were identified as grandparents through the unknown parent. Following this finding, siblings of ‘Honeycrisp’ were identified using the SNP data. Breeding records from several of these siblings suggested that the previously unreported parent is a University of Minnesota selection, MN1627. This selection is no longer available, but now is genetically described through imputed SNP haplotypes. We also present the mosaic grandparental composition of ‘Honeycrisp’ for each of its 17 chromosome pairs. This new pedigree and genetic information will be useful in future pedigree-based genetic studies to connect ‘Honeycrisp’ with other cultivars used widely in apple breeding programs. The created SNP linkage map will benefit future research using the data from the Illumina apple 8 and 20 K and Affymetrix 480 K SNP arrays. PMID:28243452

  4. [Research progress on the phenotype informative SNP in forensic science].

    PubMed

    Liu, Yu-Xuan; Hu, Qing-Qing; Ma, Hong-Du; Huang, Dai-Xin

    2014-10-01

    Single nucleotide polymorphism (SNP) refers to the single base sequence variation in specific location of the human genome. Phenotype informative SNP has gradually become one of the research hot spots in forensic science. In this paper, the forensic research situation and application prospect of phenotype informative SNP in the characteristics of hair, eye and skin color, height, and facial feature are reviewed.

  5. A novel approach to analyzing fMRI and SNP data via parallel independent component analysis

    NASA Astrophysics Data System (ADS)

    Liu, Jingyu; Pearlson, Godfrey; Calhoun, Vince; Windemuth, Andreas

    2007-03-01

    There is current interest in understanding genetic influences on brain function in both the healthy and the disordered brain. Parallel independent component analysis, a new method for analyzing multimodal data, is proposed in this paper and applied to functional magnetic resonance imaging (fMRI) and a single nucleotide polymorphism (SNP) array. The method aims to identify the independent components of each modality and the relationship between the two modalities. We analyzed 92 participants, including 29 schizophrenia (SZ) patients, 13 unaffected SZ relatives, and 50 healthy controls. We found a correlation of 0.79 between one fMRI component and one SNP component. The fMRI component consists of activations in cingulate gyrus, multiple frontal gyri, and superior temporal gyrus. The related SNP component is contributed to significantly by 9 SNPs located in sets of genes, including those coding for apolipoprotein A-I, and C-III, malate dehydrogenase 1 and the gamma-aminobutyric acid alpha-2 receptor. A significant difference in the presences of this SNP component is found between the SZ group (SZ patients and their relatives) and the control group. In summary, we constructed a framework to identify the interactions between brain functional and genetic information; our findings provide new insight into understanding genetic influences on brain function in a common mental disorder.

  6. is-rSNP: a novel technique for in silico regulatory SNP detection

    PubMed Central

    Macintyre, Geoff; Bailey, James; Haviv, Izhak; Kowalczyk, Adam

    2010-01-01

    Motivation: Determining the functional impact of non-coding disease-associated single nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) is challenging. Many of these SNPs are likely to be regulatory SNPs (rSNPs): variations which affect the ability of a transcription factor (TF) to bind to DNA. However, experimental procedures for identifying rSNPs are expensive and labour intensive. Therefore, in silico methods are required for rSNP prediction. By scoring two alleles with a TF position weight matrix (PWM), it can be determined which SNPs are likely rSNPs. However, predictions in this manner are noisy and no method exists that determines the statistical significance of a nucleotide variation on a PWM score. Results: We have designed an algorithm for in silico rSNP detection called is-rSNP. We employ novel convolution methods to determine the complete distributions of PWM scores and ratios between allele scores, facilitating assignment of statistical significance to rSNP effects. We have tested our method on 41 experimentally verified rSNPs, correctly predicting the disrupted TF in 28 cases. We also analysed 146 disease-associated SNPs with no known functional impact in an attempt to identify candidate rSNPs. Of the 11 significantly predicted disrupted TFs, 9 had previous evidence of being associated with the disease in the literature. These results demonstrate that is-rSNP is suitable for high-throughput screening of SNPs for potential regulatory function. This is a useful and important tool in the interpretation of GWAS. Availability: is-rSNP software is available for use at: www.genomics.csse.unimelb.edu.au/is-rSNP Contact: gmaci@csse.unimelb.edu.au; adam.kowalczyk@nicta.com.au Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20823317

  7. A Bayesian Framework for SNP Identification

    SciTech Connect

    Webb-Robertson, Bobbie-Jo M.; Havre, Susan L.; Payne, Deborah A.

    2005-07-01

    Current proteomics techniques, such as mass spectrometry, focus on protein identification, usually ignoring most types of modifications beyond post-translational modifications, with the assumption that only a small number of peptides have to be matched to a protein for a positive identification. However, not all proteins are being identified with current techniques and improved methods to locate points of mutation are becoming a necessity. In the case when single-nucleotide polymorphisms (SNPs) are observed, brute force is the most common method to locate them, quickly becoming computationally unattractive as the size of the database associated with the model organism grows. We have developed a Bayesian model for SNPs, BSNP, incorporating evolutionary information at both the nucleotide and amino acid levels. Formulating SNPs as a Bayesian inference problem allows probabilities of interest to be easily obtained, for example the probability of a specific SNP or specific type of mutation over a gene or entire genome. Three SNP databases were observed in the evaluation of the BSNP model; the first SNP database is a disease specific gene in human, hemoglobin, the second is also a disease specific gene in human, p53, and the third is a more general SNP database for multiple genes in mouse. We validate that the BSNP model assigns higher posterior probabilities to the SNPs defined in all three separate databases than can be attributed to chance under specific evolutionary information, for example the amino acid model described by Majewski and Ott in conjunction with either the four-parameter nucleotide model by Bulmer or seven-parameter nucleotide model by Majewski and Ott.

  8. SNP genotyping in melons: genetic variation, population structure, and linkage disequilibrium.

    PubMed

    Esteras, Cristina; Formisano, Gelsomina; Roig, Cristina; Díaz, Aurora; Blanca, José; Garcia-Mas, Jordi; Gómez-Guillamón, María Luisa; López-Sesé, Ana Isabel; Lázaro, Almudena; Monforte, Antonio J; Picó, Belén

    2013-05-01

    Novel sequencing technologies were recently used to generate sequences from multiple melon (Cucumis melo L.) genotypes, enabling the in silico identification of large single nucleotide polymorphism (SNP) collections. In order to optimize the use of these markers, SNP validation and large-scale genotyping are necessary. In this paper, we present the first validated design for a genotyping array with 768 SNPs that are evenly distributed throughout the melon genome. This customized Illumina GoldenGate assay was used to genotype a collection of 74 accessions, representing most of the botanical groups of the species. Of the assayed loci, 91 % were successfully genotyped. The array provided a large number of polymorphic SNPs within and across accessions. This set of SNPs detected high levels of variation in accessions from this crop's center of origin as well as from several other areas of melon diversification. Allele distribution throughout the genome revealed regions that distinguished between the two main groups of cultivated accessions (inodorus and cantalupensis). Population structure analysis showed a subdivision into five subpopulations, reflecting the history of the crop. A considerably low level of LD was detected, which decayed rapidly within a few kilobases. Our results show that the GoldenGate assay can be used successfully for high-throughput SNP genotyping in melon. Since many of the genotyped accessions are currently being used as the parents of breeding populations in various programs, this set of mapped markers could be used for future mapping and breeding efforts.

  9. Variable Selection in Logistic Regression for Detecting SNP-SNP Interactions: the Rheumatoid Arthritis Example

    PubMed Central

    Lin, H. Y.; Desmond, R.; Liu, Y. H.; Bridges, S. L.; Soong, S. J.

    2013-01-01

    Summary Many complex disease traits are observed to be associated with single nucleotide polymorphism (SNP) interactions. In testing small-scale SNP-SNP interactions, variable selection procedures in logistic regressions are commonly used. The empirical evidence of variable selection for testing interactions in logistic regressions is limited. This simulation study was designed to compare nine variable selection procedures in logistic regressions for testing SNP-SNP interactions. Data on 10 SNPs were simulated for 400 and 1000 subjects (case/control ratio=1). The simulated model included one main effect and two 2-way interactions. The variable selection procedures included automatic selection (stepwise, forward and backward), common 2-step selection, AIC- and BIC-based selection. The hierarchical rule effect, in which all main effects and lower order terms of the highest-order interaction term are included in the model regardless of their statistical significance, was also examined. We found that the stepwise variable selection without the hierarchical rule which had reasonably high authentic (true positive) proportion and low noise (false positive) proportion, is a better method compared to other variable selection procedures. The procedure without the hierarchical rule requires fewer terms in testing interactions, so it can accommodate more SNPs than the procedure with the hierarchical rule. For testing interactions, the procedures without the hierarchical rule had higher authentic proportion and lower noise proportion compared with ones with the hierarchical rule. These variable selection procedures were also applied and compared in a rheumatoid arthritis study. PMID:18231122

  10. Genome-wide SNP association-based localization of a dwarfism gene in Friesian dwarf horses.

    PubMed

    Orr, N; Back, W; Gu, J; Leegwater, P; Govindarajan, P; Conroy, J; Ducro, B; Van Arendonk, J A M; MacHugh, D E; Ennis, S; Hill, E W; Brama, P A J

    2010-12-01

    The recent completion of the horse genome and commercial availability of an equine SNP genotyping array has facilitated the mapping of disease genes. We report putative localization of the gene responsible for dwarfism, a trait in Friesian horses that is thought to have a recessive mode of inheritance, to a 2-MB region of chromosome 14 using just 10 affected animals and 10 controls. We successfully genotyped 34,429 SNPs that were tested for association with dwarfism using chi-square tests. The most significant SNP in our study, BIEC2-239376 (P(2df)=4.54 × 10(-5), P(rec)=7.74 × 10(-6)), is located close to a gene implicated in human dwarfism. Fine-mapping and resequencing analyses did not aid in further localization of the causative variant, and replication of our findings in independent sample sets will be necessary to confirm these results.

  11. Genomic Array as Compared to Karyotyping in Myelodysplastic Syndromes in a Prospective Clinical trial.

    PubMed

    Stevens-Kroef, Marian J; Olde Weghuis, Daniel; ElIdrissi-Zaynoun, Najat; van der Reijden, Bert; Cremers, Eline M P; Alhan, Canan; Westers, Theresia M; Visser-Wisselaar, Heleen A; Chitu, Dana A; Cunha, Sonia M; Vellenga, Edo; Klein, Saskia K; Wijermans, Pierre; de Greef, Georgine E; Schaafsma, M R; Muus, Petra; Ossenkoppele, Gert J; van de Loosdrecht, Arjan A; Jansen, Joop H

    2017-02-25

    Karyotyping is considered as the gold standard in the genetic subclassification of myelodysplastic syndrome (MDS). Oligo/SNP-based genomic array profiling is a high-resolution tool that also enables genome wide analysis. We compared karyotyping with oligo/SNP-based array profiling in 104 MDS patients from the HOVON-89 study. Oligo/SNP-array identified all cytogenetically defined genomic lesions, except for subclones in two cases and balanced translocations in three cases. On the other hand oligo/SNP-based genomic array profiling had a higher success rate, showing 55 abnormal cases, while an abnormal karyotype was found in only 35 patients. In 9 patients whose karyotyping was unsuccessful because of insufficient metaphases or failure, oligo/SNP-based array analysis was successful. Based on cytogenetic visible abnormalities as identified by oligo/SNP-based genomic array prognostic scores based on IPSS/-R were assigned. These prognostic scores were identical to the IPSS/-R scores as obtained with karyotyping in 95-96% of the patients. In addition to the detection of cytogenetically defined lesions, oligo/SNP-based genomic profiling identified focal copy number abnormalities or regions of copy neutral loss of heterozygosity that were out of the scope of karyotyping and fluorescence in situ hybridization. Of interest, in 26 patients we demonstrated such cytogenetic invisible abnormalities. These abnormalities often involved regions that are recurrently affected in hematological malignancies, and may therefore be of clinical relevance. Our findings indicate that oligo/SNP-based genomic array can be used to identify the vast majority of recurrent cytogenetic abnormalities in MDS. Furthermore, oligo/SNP-based array profiling yields additional genetic abnormalities that may be of clinical importance. This article is protected by copyright. All rights reserved.

  12. Surface invasive cleavage assay on a maskless light-directed diamond DNA microarray for genome-wide human SNP mapping.

    PubMed

    Nie, Bei; Yang, Min; Fu, Weiling; Liang, Zhiqing

    2015-07-07

    The surface invasive cleavage assay, because of its innate accuracy and ability for self-signal amplification, provides a potential route for the mapping of hundreds of thousands of human SNP sites. However, its performance on a high density DNA array has not yet been established, due to the unusual "hairpin" probe design on the microarray and the lack of chemical stability of commercially available substrates. Here we present an applicable method to implement a nanocrystalline diamond thin film as an alternative substrate for fabricating an addressable DNA array using maskless light-directed photochemistry, producing the most chemically stable and biocompatible system for genetic analysis and enzymatic reactions. The surface invasive cleavage reaction, followed by degenerated primer ligation and post-rolling circle amplification is consecutively performed on the addressable diamond DNA array, accurately mapping SNP sites from PCR-amplified human genomic target DNA. Furthermore, a specially-designed DNA array containing dual probes in the same pixel is fabricated by following a reverse light-directed DNA synthesis protocol. This essentially enables us to decipher thousands of SNP alleles in a single-pot reaction by the simple addition of enzyme, target and reaction buffers.

  13. SNP-based discovery of salinity-tolerant QTLs in a bi-parental population of rice (Oryza sativa).

    PubMed

    Gimhani, D R; Gregorio, Glenn B; Kottearachchi, N S; Samarasinghe, W L G

    2016-12-01

    Breeding for salt tolerance is the most promising approach to enhance the productivity of saline prone areas. However, polygenic inheritance of salt tolerance in rice acts as a bottleneck in conventional breeding for salt tolerance. Hence, we set our goals to construct a single nucleotide polymorphism (SNP)-based molecular map employing high-throughput SNP marker technology and to investigate salinity tolerant QTLs with closest flanking markers using an elite rice background. Seedling stage salinity responses were assessed in a population of 281 recombinant inbred lines (RILs) derived from the cross between At354 (salt tolerant) and Bg352 (salt susceptible), by 11 morpho-physiological indices under a hydroponic system. Selected extreme 94 RILs were genotyped using Illumina Infinium rice 6K SNP array and densely saturated molecular map spanning 1460.81 cM of the rice genome with an average interval of 1.29 cM between marker loci was constructed using 1135 polymorphic SNP markers. The results revealed 83 significant QTLs for 11 salt responsive traits explaining 12.5-46.7 % of phenotypic variation in respective traits. Of them, 72 QTLs responsible for 10 traits were co-localized together forming 14 QTL hotspots at 14 different genomic regions. The all QTL hotspots were flanked less than 1 Mb intervals and therefore the SNP loci associated with these QTL hotspots would be important in candidate gene discovery for salt tolerance.

  14. dbSNP: the NCBI database of genetic variation.

    PubMed

    Sherry, S T; Ward, M H; Kholodov, M; Baker, J; Phan, L; Smigielski, E M; Sirotkin, K

    2001-01-01

    In response to a need for a general catalog of genome variation to address the large-scale sampling designs required by association studies, gene mapping and evolutionary biology, the National Center for Biotechnology Information (NCBI) has established the dbSNP database [S.T.Sherry, M.Ward and K. Sirotkin (1999) Genome Res., 9, 677-679]. Submissions to dbSNP will be integrated with other sources of information at NCBI such as GenBank, PubMed, LocusLink and the Human Genome Project data. The complete contents of dbSNP are available to the public at website: http://www.ncbi.nlm.nih.gov/SNP. The complete contents of dbSNP can also be downloaded in multiple formats via anonymous FTP at ftp://ncbi.nlm.nih.gov/snp/.

  15. TRM: a powerful two-stage machine learning approach for identifying SNP-SNP interactions.

    PubMed

    Lin, Hui-Yi; Chen, Y Ann; Tsai, Ya-Yu; Qu, Xiaotao; Tseng, Tung-Sung; Park, Jong Y

    2012-01-01

    Studies have shown that interactions of single nucleotide polymorphisms (SNPs) may play an important role in understanding the causes of complex disease. We have proposed an integrated machine learning method that combines two machine-learning methods-Random Forests (RF) and Multivariate Adaptive Regression Splines (MARS)-to identify a subset of important SNPs and detect interaction patterns more effectively and efficiently. In this two-stage RF-MARS (TRM) approach, RF is first applied to detect a predictive subset of SNPs, and then MARS is used to identify the interaction patterns. We evaluated the TRM performances in four models. RF variable selection was based on out-of-bag classification error rate (OOB) and variable important spectrum (IS). Our results support that RF(OOB) had better performance than MARS and RF(IS) in detecting important variables. This study demonstrates that TRM(OOB) , which is RF(OOB) plus MARS, has combined the strengths of RF and MARS in identifying SNP-SNP interactions in a scenario of 100 candidate SNPs. TRM(OOB) had greater true positive rate and lower false positive rate compared with MARS, particularly for searching interactions with a strong association with the outcome. Therefore, the use of TRM(OOB) is favored for exploring SNP-SNP interactions in a large-scale genetic variation study.

  16. SNP2CAPS: a SNP and INDEL analysis tool for CAPS marker development.

    PubMed

    Thiel, Thomas; Kota, Raja; Grosse, Ivo; Stein, Nils; Graner, Andreas

    2004-01-02

    With the influx of various SNP genotyping assays in recent years, there has been a need for an assay that is robust, yet cost effective, and could be performed using standard gel-based procedures. In this context, CAPS markers have been shown to meet these criteria. However, converting SNPs to CAPS markers can be a difficult process if done manually. In order to address this problem, we describe a computer program, SNP2CAPS, that facilitates the computational conversion of SNP markers into CAPS markers. 413 multiple aligned sequences derived from barley ESTs were analysed for the presence of polymorphisms in 235 distinct restriction sites. 282 (90%) of 314 alignments that contain sequence variation due to SNPs and InDels revealed at least one polymorphic restriction site. After reducing the number of restriction enzymes from 235 to 10, 31% of the polymorphic sites could still be detected. In order to demonstrate the usefulness of this tool for marker development, we experimentally validated some of the results predicted by SNP2CAPS.

  17. SNP marker detection and genotyping in tilapia.

    PubMed

    Van Bers, N E M; Crooijmans, R P M A; Groenen, M A M; Dibbits, B W; Komen, J

    2012-09-01

    We have generated a unique resource consisting of nearly 175 000 short contig sequences and 3569 SNP markers from the widely cultured GIFT (Genetically Improved Farmed Tilapia) strain of Nile tilapia (Oreochromis niloticus). In total, 384 SNPs were selected to monitor the wider applicability of the SNPs by genotyping tilapia individuals from different strains and different geographical locations. In all strains and species tested (O. niloticus, O. aureus and O. mossambicus), the genotyping assay was working for a similar number of SNPs (288-305 SNPs). The actual number of polymorphic SNPs was, as expected, highest for individuals from the GIFT population (255 SNPs). In the individuals from an Egyptian strain and in individuals caught in the wild in the basin of the river Volta, 197 and 163 SNPs were polymorphic, respectively. A pairwise calculation of Nei's genetic distance allowed the discrimination of the individual strains and species based on the genotypes determined with the SNP set. We expect that this set will be widely applicable for use in tilapia aquaculture, e.g. for pedigree reconstruction. In addition, this set is currently used for assaying the genetic diversity of native Nile tilapia in areas where tilapia is, or will be, introduced in aquaculture projects. This allows the tracing of escapees from aquaculture and the monitoring of effects of introgression and hybridization.

  18. Whole-genome single-nucleotide polymorphism (SNP) marker discovery and association analysis with the eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content in Larimichthys crocea

    PubMed Central

    Xiao, Shijun; Wang, Panpan; Dong, Linsong; Zhang, Yaguang; Han, Zhaofang; Wang, Qiurong

    2016-01-01

    Whole-genome single-nucleotide polymorphism (SNP) markers are valuable genetic resources for the association and conservation studies. Genome-wide SNP development in many teleost species are still challenging because of the genome complexity and the cost of re-sequencing. Genotyping-By-Sequencing (GBS) provided an efficient reduced representative method to squeeze cost for SNP detection; however, most of recent GBS applications were reported on plant organisms. In this work, we used an EcoRI-NlaIII based GBS protocol to teleost large yellow croaker, an important commercial fish in China and East-Asia, and reported the first whole-genome SNP development for the species. 69,845 high quality SNP markers that evenly distributed along genome were detected in at least 80% of 500 individuals. Nearly 95% randomly selected genotypes were successfully validated by Sequenom MassARRAY assay. The association studies with the muscle eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content discovered 39 significant SNP markers, contributing as high up to ∼63% genetic variance that explained by all markers. Functional genes that involved in fat digestion and absorption pathway were identified, such as APOB, CRAT and OSBPL10. Notably, PPT2 Gene, previously identified in the association study of the plasma n-3 and n-6 polyunsaturated fatty acid level in human, was re-discovered in large yellow croaker. Our study verified that EcoRI-NlaIII based GBS could produce quality SNP markers in a cost-efficient manner in teleost genome. The developed SNP markers and the EPA and DHA associated SNP loci provided invaluable resources for the population structure, conservation genetics and genomic selection of large yellow croaker and other fish organisms. PMID:28028455

  19. SNIT: SNP identification for strain typing

    PubMed Central

    2011-01-01

    With ever-increasing numbers of microbial genomes being sequenced, efficient tools are needed to perform strain-level identification of any newly sequenced genome. Here, we present the SNP identification for strain typing (SNIT) pipeline, a fast and accurate software system that compares a newly sequenced bacterial genome with other genomes of the same species to identify single nucleotide polymorphisms (SNPs) and small insertions/deletions (indels). Based on this information, the pipeline analyzes the polymorphic loci present in all input genomes to identify the genome that has the fewest differences with the newly sequenced genome. Similarly, for each of the other genomes, SNIT identifies the input genome with the fewest differences. Results from five bacterial species show that the SNIT pipeline identifies the correct closest neighbor with 75% to 100% accuracy. The SNIT pipeline is available for download at http://www.bhsai.org/snit.html PMID:21902825

  20. Atomic Force Microscopy for DNA SNP Identification

    NASA Astrophysics Data System (ADS)

    Valbusa, Ugo; Ierardi, Vincenzo

    The knowledge of the effects of single-nucleotide polymorphisms (SNPs) in the human genome greatly contributes to better comprehension of the relation between genetic factors and diseases. Sequence analysis of genomic DNA in different individuals reveals positions where variations that involve individual base substitutions can occur. Single-nucleotide polymorphisms are highly abundant and can have different consequences at phenotypic level. Several attempts were made to apply atomic force microscopy (AFM) to detect and map SNP sites in DNA strands. The most promising approach is the study of DNA mutations producing heteroduplex DNA strands and identifying the mismatches by means of a protein that labels the mismatches. MutS is a protein that is part of a well-known complex of mismatch repair, which initiates the process of repairing when the MutS binds to the mismatched DNA filament. The position of MutS on the DNA filament can be easily recorded by means of AFM imaging.

  1. Light whole genome sequence for SNP discovery across domestic cat breeds

    PubMed Central

    2010-01-01

    Background The domestic cat has offered enormous genomic potential in the veterinary description of over 250 hereditary disease models as well as the occurrence of several deadly feline viruses (feline leukemia virus -- FeLV, feline coronavirus -- FECV, feline immunodeficiency virus - FIV) that are homologues to human scourges (cancer, SARS, and AIDS respectively). However, to realize this bio-medical potential, a high density single nucleotide polymorphism (SNP) map is required in order to accomplish disease and phenotype association discovery. Description To remedy this, we generated 3,178,297 paired fosmid-end Sanger sequence reads from seven cats, and combined these data with the publicly available 2X cat whole genome sequence. All sequence reads were assembled together to form a 3X whole genome assembly allowing the discovery of over three million SNPs. To reduce potential false positive SNPs due to the low coverage assembly, a low upper-limit was placed on sequence coverage and a high lower-limit on the quality of the discrepant bases at a potential variant site. In all domestic cats of different breeds: female Abyssinian, female American shorthair, male Cornish Rex, female European Burmese, female Persian, female Siamese, a male Ragdoll and a female African wildcat were sequenced lightly. We report a total of 964 k common SNPs suitable for a domestic cat SNP genotyping array and an additional 900 k SNPs detected between African wildcat and domestic cats breeds. An empirical sampling of 94 discovered SNPs were tested in the sequenced cats resulting in a SNP validation rate of 99%. Conclusions These data provide a large collection of mapped feline SNPs across the cat genome that will allow for the development of SNP genotyping platforms for mapping feline diseases. PMID:20576142

  2. Combining fMRI and SNP Data to Investigate Connections Between Brain Function and Genetics Using Parallel ICA

    PubMed Central

    Liu, Jingyu; Pearlson, Godfrey; Windemuth, Andreas; Ruano, Gualberto; Perrone-Bizzozero, Nora I.; Calhoun, Vince

    2009-01-01

    There is current interest in understanding genetic influences on both healthy and disordered brain function. We assessed brain function with functional magnetic resonance imaging (fMRI) data collected during an auditory oddball task—detecting an infrequent sound within a series of frequent sounds. Then, task-related imaging findings were utilized as potential intermediate phenotypes (endophenotypes) to investigate genomic factors derived from a single nucleotide polymorphism (SNP) array. Our target is the linkage of these genomic factors to normal/abnormal brain functionality. We explored parallel independent component analysis (paraICA) as a new method for analyzing multimodal data. The method was aimed to identify simultaneously independent components of each modality and the relationships between them. When 43 healthy controls and 20 schizophrenia patients, all Caucasian, were studied, we found a correlation of 0.38 between one fMRI component and one SNP component. This fMRI component consisted mainly of parietal lobe activations. The relevant SNP component was contributed to significantly by 10 SNPs located in genes, including those coding for the nicotinic α-7cholinergic receptor, aromatic amino acid decarboxylase, disrupted in schizophrenia 1, among others. Both fMRI and SNP components showed significant differences in loading parameters between the schizophrenia and control groups (P = 0.0006 for the fMRI component; P = 0.001 for the SNP component). In summary, we constructed a framework to identify interactions between brain functional and genetic information; our findings provide a proof-of-concept that genomic SNP factors can be investigated by using endophenotypic imaging findings in a multivariate format. PMID:18072279

  3. Exercise improves adiponectin concentrations irrespective of the adiponectin gene polymorphisms SNP45 and the SNP276 in obese Korean women.

    PubMed

    Lee, Kyoung-Young; Kang, Hyun-Sik; Shin, Yun-A

    2013-03-10

    The effects of exercise on adiponectin levels have been reported to be variable and may be attributable to an interaction between environmental and genetic factors. The single nucleotide polymorphisms (SNP) 45 (T>G) and SNP276 (G>T) of the adiponectin gene are associated with metabolic risk factors including adiponectin levels. We examined whether SNP45 and SNP276 would differentially influence the effect of exercise training in middle-aged women with uncomplicated obesity. We conducted a prospective study in the general community that included 90 Korean women (age 47.0±5.1 years) with uncomplicated obesity. The intervention was aerobic exercise training for 3 months. Body composition, adiponectin levels, and other metabolic risk factors were measured. Prior to exercise training, only body weight differed among the SNP276 genotypes. Exercise training improved body composition, systolic blood pressure, maximal oxygen consumption, high-density lipoprotein cholesterol, and leptin levels. In addition, exercise improved adiponectin levels irrespective of weight gain or loss. However, after adjustments for age, BMI, body fat (%), and waist circumference, no differences were found in obesity-related characteristics (e.g., adiponectin) following exercise training among the SNP45 and the 276 genotypes. Our findings suggest that aerobic exercise affects adiponectin levels regardless of weight loss and this effect would not be influenced by SNP45 and SNP276 in the adiponectin gene.

  4. High-density SNP assay development for genetic analysis in maritime pine (Pinus pinaster).

    PubMed

    Plomion, C; Bartholomé, J; Lesur, I; Boury, C; Rodríguez-Quilón, I; Lagraulet, H; Ehrenmann, F; Bouffier, L; Gion, J M; Grivet, D; de Miguel, M; de María, N; Cervera, M T; Bagnoli, F; Isik, F; Vendramin, G G; González-Martínez, S C

    2016-03-01

    Maritime pine provides essential ecosystem services in the south-western Mediterranean basin, where it covers around 4 million ha. Its scattered distribution over a range of environmental conditions makes it an ideal forest tree species for studies of local adaptation and evolutionary responses to climatic change. Highly multiplexed single nucleotide polymorphism (SNP) genotyping arrays are increasingly used to study genetic variation in living organisms and for practical applications in plant and animal breeding and genetic resource conservation. We developed a 9k Illumina Infinium SNP array and genotyped maritime pine trees from (i) a three-generation inbred (F2) pedigree, (ii) the French breeding population and (iii) natural populations from Portugal and the French Atlantic coast. A large proportion of the exploitable SNPs (2052/8410, i.e. 24.4%) segregated in the mapping population and could be mapped, providing the densest ever gene-based linkage map for this species. Based on 5016 SNPs, natural and breeding populations from the French gene pool exhibited similar level of genetic diversity. Population genetics and structure analyses based on 3981 SNP markers common to the Portuguese and French gene pools revealed high levels of differentiation, leading to the identification of a set of highly differentiated SNPs that could be used for seed provenance certification. Finally, we discuss how the validated SNPs could facilitate the identification of ecologically and economically relevant genes in this species, improving our understanding of the demography and selective forces shaping its natural genetic diversity, and providing support for new breeding strategies.

  5. Heritability of Recurrent Exertional Rhabdomyolysis in Standardbred and Thoroughbred Racehorses Derived From SNP Genotyping Data.

    PubMed

    Norton, Elaine M; Mickelson, James R; Binns, Matthew M; Blott, Sarah C; Caputo, Paul; Isgren, Cajsa M; McCoy, Annette M; Moore, Alison; Piercy, Richard J; Swinburne, June E; Vaudin, Mark; McCue, Molly E

    2016-11-01

    Recurrent exertional rhabdomyolysis (RER) in Thoroughbred and Standardbred racehorses is characterized by episodes of muscle rigidity and cell damage that often recur upon strenuous exercise. The objective was to evaluate the importance of genetic factors in RER by obtaining an unbiased estimate of heritability in cohorts of unrelated Thoroughbred and Standardbred racehorses. Four hundred ninety-one Thoroughbred and 196 Standardbred racehorses were genotyped with the 54K or 74K SNP genotyping arrays. Heritability was calculated from genome-wide SNP data with a mixed linear and Bayesian model, utilizing the standard genetic relationship matrix (GRM). Both the mixed linear and Bayesian models estimated heritability of RER in Thoroughbreds to be approximately 0.34 and in Standardbred racehorses to be approximately 0.45 after adjusting for disease prevalence and sex. To account for potential differences in the genetic architecture of the underlying causal variants, heritability estimates were adjusted based on linkage disequilibrium weighted kinship matrix, minor allele frequency and variant effect size, yielding heritability estimates that ranged between 0.41-0.46 (Thoroughbreds) and 0.39-0.49 (Standardbreds). In conclusion, between 34-46% and 39-49% of the variance in RER susceptibility in Thoroughbred and Standardbred racehorses, respectively, can be explained by the SNPs present on these 2 genotyping arrays, indicating that RER is moderately heritable. These data provide further rationale for the investigation of genetic mutations associated with RER susceptibility.

  6. Haplotype assembly from aligned weighted SNP fragments.

    PubMed

    Zhao, Yu-Ying; Wu, Ling-Yun; Zhang, Ji-Hong; Wang, Rui-Sheng; Zhang, Xiang-Sun

    2005-08-01

    Given an assembled genome of a diploid organism the haplotype assembly problem can be formulated as retrieval of a pair of haplotypes from a set of aligned weighted SNP fragments. Known computational formulations (models) of this problem are minimum letter flips (MLF) and the weighted minimum letter flips (WMLF; Greenberg et al. (INFORMS J. Comput. 2004, 14, 211-213)). In this paper we show that the general WMLF model is NP-hard even for the gapless case. However the algorithmic solutions for selected variants of WMFL can exist and we propose a heuristic algorithm based on a dynamic clustering technique. We also introduce a new formulation of the haplotype assembly problem that we call COMPLETE WMLF (CWMLF). This model and algorithms for its implementation take into account a simultaneous presence of multiple kinds of data errors. Extensive computational experiments indicate that the algorithmic implementations of the CWMLF model achieve higher accuracy of haplotype reconstruction than the WMLF-based algorithms, which in turn appear to be more accurate than those based on MLF.

  7. SNP-SNP interaction analysis of NF-κB signaling pathway on breast cancer survival

    PubMed Central

    Jamshidi, Maral; Fagerholm, Rainer; Khan, Sofia; Aittomäki, Kristiina; Czene, Kamila; Darabi, Hatef; Li, Jingmei; Andrulis, Irene L.; Chang-Claude, Jenny; Devilee, Peter; Fasching, Peter A.; Michailidou, Kyriaki; Bolla, Manjeet K.; Dennis, Joe; Wang, Qin; Guo, Qi; Rhenius, Valerie; Cornelissen, Sten; Rudolph, Anja; Knight, Julia A.; Loehberg, Christian R.; Burwinkel, Barbara; Marme, Frederik; Hopper, John L.; Southey, Melissa C.; Bojesen, Stig E.; Flyger, Henrik; Brenner, Hermann; Holleczek, Bernd; Margolin, Sara; Mannermaa, Arto; Kosma, Veli-Matti; Dyck, Laurien Van; Nevelsteen, Ines; Couch, Fergus J.; Olson, Janet E.; Giles, Graham G.; McLean, Catriona; Haiman, Christopher A.; Henderson, Brian E.; Winqvist, Robert; Pylkäs, Katri; Tollenaar, Rob A.E.M.; García-Closas, Montserrat; Figueroa, Jonine; Hooning, Maartje J.; Martens, John W.M.; Cox, Angela; Cross, Simon S.; Simard, Jacques; Dunning, Alison M.; Easton, Douglas F.; Pharoah, Paul D.P.; Hall, Per; Blomqvist, Carl; Schmidt, Marjanka K.; Nevanlinna, Heli

    2015-01-01

    In breast cancer, constitutive activation of NF-κB has been reported, however, the impact of genetic variation of the pathway on patient prognosis has been little studied. Furthermore, a combination of genetic variants, rather than single polymorphisms, may affect disease prognosis. Here, in an extensive dataset (n = 30,431) from the Breast Cancer Association Consortium, we investigated the association of 917 SNPs in 75 genes in the NF-κB pathway with breast cancer prognosis. We explored SNP-SNP interactions on survival using the likelihood-ratio test comparing multivariate Cox’ regression models of SNP pairs without and with an interaction term. We found two interacting pairs associating with prognosis: patients simultaneously homozygous for the rare alleles of rs5996080 and rs7973914 had worse survival (HRinteraction 6.98, 95% CI=3.3-14.4, P = 1.42E-07), and patients carrying at least one rare allele for rs17243893 and rs57890595 had better survival (HRinteraction 0.51, 95% CI=0.3-0.6, P = 2.19E-05). Based on in silico functional analyses and literature, we speculate that the rs5996080 and rs7973914 loci may affect the BAFFR and TNFR1/TNFR3 receptors and breast cancer survival, possibly by disturbing both the canonical and non-canonical NF-κB pathways or their dynamics, whereas, rs17243893-rs57890595 interaction on survival may be mediated through TRAF2-TRAIL-R4 interplay. These results warrant further validation and functional analyses. PMID:26317411

  8. Magnetic arrays

    SciTech Connect

    Trumper, David L.; Kim, Won-jong; Williams, Mark E.

    1997-05-20

    Electromagnet arrays which can provide selected field patterns in either two or three dimensions, and in particular, which can provide single-sided field patterns in two or three dimensions. These features are achieved by providing arrays which have current densities that vary in the windings both parallel to the array and in the direction of array thickness.

  9. Magnetic arrays

    DOEpatents

    Trumper, D.L.; Kim, W.; Williams, M.E.

    1997-05-20

    Electromagnet arrays are disclosed which can provide selected field patterns in either two or three dimensions, and in particular, which can provide single-sided field patterns in two or three dimensions. These features are achieved by providing arrays which have current densities that vary in the windings both parallel to the array and in the direction of array thickness. 12 figs.

  10. A scan statistic for identifying chromosomal patterns of SNP association.

    PubMed

    Sun, Yan V; Levin, Albert M; Boerwinkle, Eric; Robertson, Henry; Kardia, Sharon L R

    2006-11-01

    We have developed a single nucleotide polymorphism (SNP) association scan statistic that takes into account the complex distribution of the human genome variation in the identification of chromosomal regions with significant SNP associations. This scan statistic has wide applicability for genetic analysis, whether to identify important chromosomal regions associated with common diseases based on whole-genome SNP association studies or to identify disease susceptibility genes based on dense SNP positional candidate studies. To illustrate this method, we analyzed patterns of SNP associations on chromosome 19 in a large cohort study. Among 2,944 SNPs, we found seven regions that contained clusters of significantly associated SNPs. The average width of these regions was 35 kb with a range of 10-72 kb. We compared the scan statistic results to Fisher's product method using a sliding window approach, and detected 22 regions with significant clusters of SNP associations. The average width of these regions was 131 kb with a range of 10.1-615 kb. Given that the distances between SNPs are not taken into consideration in the sliding window approach, it is likely that a large fraction of these regions represents false positives. However, all seven regions detected by the scan statistic were also detected by the sliding window approach. The linkage disequilibrium (LD) patterns within the seven regions were highly variable indicating that the clusters of SNP associations were not due to LD alone. The scan statistic developed here can be used to make gene-based or region-based SNP inferences about disease association.

  11. A Novel Test for Detecting SNP-SNP Interactions in Case-Only Trio Studies.

    PubMed

    Balliu, Brunilda; Zaitlen, Noah

    2016-04-01

    Epistasis plays a significant role in the genetic architecture of many complex phenotypes in model organisms. To date, there have been very few interactions replicated in human studies due in part to the multiple-hypothesis burden implicit in genome-wide tests of epistasis. Therefore, it is of paramount importance to develop the most powerful tests possible for detecting interactions. In this work we develop a new SNP-SNP interaction test for use in case-only trio studies called the trio correlation (TC) test. The TC test computes the expected joint distribution of marker pairs in offspring conditional on parental genotypes. This distribution is then incorporated into a standard 1 d.f. correlation test of interaction. We show via extensive simulations under a variety of disease models that our test substantially outperforms existing tests of interaction in case-only trio studies. We also demonstrate a bias in a previous case-only trio interaction test and identify its origin. Finally, we show that a previously proposed permutation scheme in trio studies mitigates the known biases of case-only tests in the presence of population stratification. We conclude that the TC test shows improved power to identify interactions in existing, as well as emerging, trio association studies. The method is publicly available at www.github.com/BrunildaBalliu/TrioEpi.

  12. Single Nucleotide Polymorphism Array Genotyping is Equivalent to Metaphase Cytogenetics for Diagnosis of Turner Syndrome

    PubMed Central

    Prakash, Siddharth; Guo, Dongchuan; Maslen, Cheryl L.; Silberbach, Michael; Investigators, GenTAC; Milewicz, Dianna; Bondy, Carolyn A.

    2013-01-01

    Background Turner syndrome (TS) is a developmental disorder caused by partial or complete monosomy for the X chromosome in 1:2500 females. We hypothesized that single nucleotide polymorphism (SNP) array genotyping can provide superior resolution in comparison to metaphase karyotype analysis to facilitate genotype-phenotype correlations. Methods We genotyped 187 TS patients with 733,000 SNP marker arrays. All cases met diagnostic criteria for TS based on karyotypes (60%) or characteristic physical features. SNP array results confirmed the diagnosis of TS in 100% of cases. Results We identified a single X chromosome (45,X) in 113 cases. In 58 additional cases (31%), other mosaic cell lines were present including isochromosomes (16%), rings (5%) and Xp deletions (8%). The remaining cases were mosaic for monosomy X and normal male or female cell lines. Array-based models of X chromosome structure were compatible with karyotypes in 104 of 116 comparable cases (90%). We found that SNP array data did not detect X;autosome translocations (3 cases), but did identify 2 derivative Y chromosomes and 13 large copy number variants that were not detected by karyotyping. Conclusions Our data is the first systematic comparison between the two methods and supports the utility of SNP array genotyping to address clinical and research questions in TS. PMID:23743550

  13. Cardiovascular pharmacogenetics in the SNP era.

    PubMed

    Mooser, V; Waterworth, D M; Isenhour, T; Middleton, L

    2003-07-01

    In the past pharmacological agents have contributed to a significant reduction in age-adjusted incidence of cardiovascular events. However, not all patients treated with these agents respond favorably, and some individuals may develop side-effects. With aging of the population and the growing prevalence of cardiovascular risk factors worldwide, it is expected that the demand for cardiovascular drugs will increase in the future. Accordingly, there is a growing need to identify the 'good' responders as well as the persons at risk for developing adverse events. Evidence is accumulating to indicate that responses to drugs are at least partly under genetic control. As such, pharmacogenetics - the study of variability in drug responses attributed to hereditary factors in different populations - may significantly assist in providing answers toward meeting this challenge. Pharmacogenetics mostly relies on associations between a specific genetic marker like single nucleotide polymorphisms (SNPs), either alone or arranged in a specific linear order on a certain chromosomal region (haplotypes), and a particular response to drugs. Numerous associations have been reported between selected genotypes and specific responses to cardiovascular drugs. Recently, for instance, associations have been reported between specific alleles of the apoE gene and the lipid-lowering response to statins, or the lipid-elevating effect of isotretinoin. Thus far, these types of studies have been mostly limited to a priori selected candidate genes due to restricted genotyping and analytical capacities. Thanks to the large number of SNPs now available in the public domain through the SNP Consortium and the newly developed technologies (high throughput genotyping, bioinformatics software), it is now possible to interrogate more than 200,000 SNPs distributed over the entire human genome. One pharmacogenetic study using this approach has been launched by GlaxoSmithKline to identify the approximately 4% of

  14. An EST-derived SNP and SSR genetic linkage map of cassava (Manihot esculenta Crantz).

    PubMed

    Rabbi, Ismail Yusuf; Kulembeka, Heneriko Philbert; Masumba, Esther; Marri, Pradeep Reddy; Ferguson, Morag

    2012-07-01

    Cassava (Manihot esculenta Crantz) is one of the most important food security crops in the tropics and increasingly being adopted for agro-industrial processing. Genetic improvement of cassava can be enhanced through marker-assisted breeding. For this, appropriate genomic tools are required to dissect the genetic architecture of economically important traits. Here, a genome-wide SNP-based genetic map of cassava anchored in SSRs is presented. An outbreeder full-sib (F1) family was genotyped on two independent SNP assay platforms: an array of 1,536 SNPs on Illumina's GoldenGate platform was used to genotype a first batch of 60 F1. Of the 1,358 successfully converted SNPs, 600 which were polymorphic in at least one of the parents and was subsequently converted to KBiosciences' KASPar assay platform for genotyping 70 additional F1. High-precision genotyping of 163 informative SSRs using capillary electrophoresis was also carried out. Linkage analysis resulted in a final linkage map of 1,837 centi-Morgans (cM) containing 568 markers (434 SNPs and 134 SSRs) distributed across 19 linkage groups. The average distance between adjacent markers was 3.4 cM. About 94.2% of the mapped SNPs and SSRs have also been localized on scaffolds of version 4.1 assembly of the cassava draft genome sequence. This more saturated genetic linkage map of cassava that combines SSR and SNP markers should find several applications in the improvement of cassava including aligning scaffolds of the cassava genome sequence, genetic analyses of important agro-morphological traits, studying the linkage disequilibrium landscape and comparative genomics.

  15. Hybrid Propulsion Demonstration Program 250K Hybrid Motor

    NASA Technical Reports Server (NTRS)

    Story, George; Zoladz, Tom; Arves, Joe; Kearney, Darren; Abel, Terry; Park, O.

    2003-01-01

    The Hybrid Propulsion Demonstration Program (HPDP) program was formed to mature hybrid propulsion technology to a readiness level sufficient to enable commercialization for various space launch applications. The goal of the HPDP was to develop and test a 250,000 pound vacuum thrust hybrid booster in order to demonstrate hybrid propulsion technology and enable manufacturing of large hybrid boosters for current and future space launch vehicles. The HPDP has successfully conducted four tests of the 250,000 pound thrust hybrid rocket motor at NASA's Stennis Space Center. This paper documents the test series.

  16. Kokkos Array

    SciTech Connect

    Edwards Daniel Sunderland, Harold Carter

    2012-09-12

    The Kokkos Array library implements shared-memory array data structures and parallel task dispatch interfaces for data-parallel computational kernels that are performance-portable to multicore-CPU and manycore-accelerator (e.g., GPGPU) devices.

  17. Whole-Genome Analysis of Diversity and SNP-Major Gene Association in Peach Germplasm

    PubMed Central

    Micheletti, Diego; Dettori, Maria Teresa; Micali, Sabrina; Aramini, Valeria; Pacheco, Igor; Da Silva Linge, Cassia; Foschi, Stefano; Banchi, Elisa; Barreneche, Teresa; Quilot-Turion, Bénédicte; Lambert, Patrick; Pascal, Thierry; Iglesias, Ignasi; Carbó, Joaquim; Wang, Li-rong; Ma, Rui-juan; Li, Xiong-wei; Gao, Zhong-shan; Nazzicari, Nelson; Troggio, Michela; Bassi, Daniele; Rossini, Laura; Verde, Ignazio; Laurens, François; Arús, Pere; Aranzana, Maria José

    2015-01-01

    Peach was domesticated in China more than four millennia ago and from there it spread world-wide. Since the middle of the last century, peach breeding programs have been very dynamic generating hundreds of new commercial varieties, however, in most cases such varieties derive from a limited collection of parental lines (founders). This is one reason for the observed low levels of variability of the commercial gene pool, implying that knowledge of the extent and distribution of genetic variability in peach is critical to allow the choice of adequate parents to confer enhanced productivity, adaptation and quality to improved varieties. With this aim we genotyped 1,580 peach accessions (including a few closely related Prunus species) maintained and phenotyped in five germplasm collections (four European and one Chinese) with the International Peach SNP Consortium 9K SNP peach array. The study of population structure revealed the subdivision of the panel in three main populations, one mainly made up of Occidental varieties from breeding programs (POP1OCB), one of Occidental landraces (POP2OCT) and the third of Oriental accessions (POP3OR). Analysis of linkage disequilibrium (LD) identified differential patterns of genome-wide LD blocks in each of the populations. Phenotypic data for seven monogenic traits were integrated in a genome-wide association study (GWAS). The significantly associated SNPs were always in the regions predicted by linkage analysis, forming haplotypes of markers. These diagnostic haplotypes could be used for marker-assisted selection (MAS) in modern breeding programs. PMID:26352671

  18. Analysis of a claimed distant relationship in a deficient pedigree using high density SNP data.

    PubMed

    Lareu, M V; García-Magariños, M; Phillips, C; Quintela, I; Carracedo, A; Salas, A

    2012-05-01

    DNA markers are routinely used to reveal both simple and complex family relationships. Likelihood based approaches have been traditionally used to estimate relationships using relatively few unlinked markers. However it is widely recognized that when using such limited numbers of loci distant relationships between two individuals cannot be distinguished from the average level of allele sharing found in random pairwise comparisons in the same population. As a real example, we demonstrate the usefulness of genome-wide SNP genotyping to analyze a claimed second cousin relationship that could not be resolved using standard forensic markers, confirming theoretical expectations for very distant relationships. Genome profiles derived from Affymetrix 6.0 SNP arrays obtained from the claimed second cousins were compared to profiles obtained from unrelated individuals and simulated data. Significance of the high estimated probabilities in favor of the second cousin relationship hypothesis was proved from the results obtained with both real and simulated unrelated pairs. As a final cautionary note, it is important to consider that successful identification of the claimed distant relationship reported here is largely due to a well-founded hypothesis being compared to the alternative hypothesis of the claimants being unrelated, but where there are several possible alternative hypotheses, the approach we outline here can yield false indications of unfounded alternative relationships.

  19. Nanocylinder arrays

    DOEpatents

    Tuominen, Mark; Schotter, Joerg; Thurn-Albrecht, Thomas; Russell, Thomas P.

    2007-03-13

    Pathways to rapid and reliable fabrication of nanocylinder arrays are provided. Simple methods are described for the production of well-ordered arrays of nanopores, nanowires, and other materials. This is accomplished by orienting copolymer films and removing a component from the film to produce nanopores, that in turn, can be filled with materials to produce the arrays. The resulting arrays can be used to produce nanoscale media, devices, and systems.

  20. Nanocylinder arrays

    DOEpatents

    Tuominen, Mark; Schotter, Joerg; Thurn-Albrecht, Thomas; Russell, Thomas P.

    2009-08-11

    Pathways to rapid and reliable fabrication of nanocylinder arrays are provided. Simple methods are described for the production of well-ordered arrays of nanopores, nanowires, and other materials. This is accomplished by orienting copolymer films and removing a component from the film to produce nanopores, that in turn, can be filled with materials to produce the arrays. The resulting arrays can be used to produce nanoscale media, devices, and systems.

  1. RASSF1A and the rs2073498 Cancer Associated SNP

    PubMed Central

    Donninger, Howard; Barnoud, Thibaut; Nelson, Nick; Kassler, Suzanna; Clark, Jennifer; Cummins, Timothy D.; Powell, David W.; Nyante, Sarah; Millikan, Robert C.; Clark, Geoffrey J.

    2011-01-01

    RASSF1A is one of the most frequently inactivated tumor suppressors yet identified in human cancer. It is pro-apoptotic and appears to function as a scaffolding protein that interacts with a variety of other tumor suppressors to modulate their function. It can also complex with the Ras oncoprotein and may serve to integrate pro-growth and pro-death signaling pathways. A SNP has been identified that is present in approximately 29% of European populations [rs2073498, A(133)S]. Several studies have now presented evidence that this SNP is associated with an enhanced risk of developing breast cancer. We have used a proteomics based approach to identify multiple differences in the pattern of protein/protein interactions mediated by the wild type compared to the SNP variant protein. We have also identified a significant difference in biological activity between wild type and SNP variant protein. However, we have found only a very modest association of the SNP with breast cancer predisposition. PMID:22649770

  2. DoGSD: the dog and wolf genome SNP database.

    PubMed

    Bai, Bing; Zhao, Wen-Ming; Tang, Bi-Xia; Wang, Yan-Qing; Wang, Lu; Zhang, Zhang; Yang, He-Chuan; Liu, Yan-Hu; Zhu, Jun-Wei; Irwin, David M; Wang, Guo-Dong; Zhang, Ya-Ping

    2015-01-01

    The rapid advancement of next-generation sequencing technology has generated a deluge of genomic data from domesticated dogs and their wild ancestor, grey wolves, which have simultaneously broadened our understanding of domestication and diseases that are shared by humans and dogs. To address the scarcity of single nucleotide polymorphism (SNP) data provided by authorized databases and to make SNP data more easily/friendly usable and available, we propose DoGSD (http://dogsd.big.ac.cn), the first canidae-specific database which focuses on whole genome SNP data from domesticated dogs and grey wolves. The DoGSD is a web-based, open-access resource comprising ∼ 19 million high-quality whole-genome SNPs. In addition to the dbSNP data set (build 139), DoGSD incorporates a comprehensive collection of SNPs from two newly sequenced samples (1 wolf and 1 dog) and collected SNPs from three latest dog/wolf genetic studies (7 wolves and 68 dogs), which were taken together for analysis with the population genetic statistics, Fst. In addition, DoGSD integrates some closely related information including SNP annotation, summary lists of SNPs located in genes, synonymous and non-synonymous SNPs, sampling location and breed information. All these features make DoGSD a useful resource for in-depth analysis in dog-/wolf-related studies.

  3. HaploSNP affinities and linkage map positions illuminate subgenome composition in the octoploid, cultivated strawberry (Fragaria×ananassa).

    PubMed

    Sargent, D J; Yang, Y; Šurbanovski, N; Bianco, L; Buti, M; Velasco, R; Giongo, L; Davis, T M

    2016-01-01

    The cultivated strawberry, Fragaria×ananassa possesses a genetically complex allo-octoploid genome. Advances in genomics research in Fragaria, including the release of a genome sequence for F. vesca, have permitted the development of a high throughput whole genome genotyping array for strawberry, which promises to facilitate genetics and genomics research. In this investigation, we used the Axiom® IStraw90®)array for linkage map development, and produced a linkage map containing 8,407 SNP markers spanning 1,820cM. Whilst the linkage map provides good coverage of the genome of both parental genotypes, the map of 'Monterey' contained significantly fewer mapped markers than did that of 'Darselect'. The array contains a novel marker class known as haploSNPs, which exploit homoeologous sequence variants as probe destabilization sites to effectively reduce marker ploidy. We examined these sites as potential indicators of subgenomic identities by using comparisons to allele states in two ancestral diploids. On this basis, haploSNP loci could be inferred to be derived from F. vesca, F. iinumae, or from an unknown source. When the identity classifications of haploSNPs were considered in conjunction with their respective linkage map positions, it was possible to define two discrete subgenomes, while the remaining homoeologues of each chromosome could not be partitioned into two discrete subgenomic groupings. These findings suggested a novel hypothesis regarding octoploid strawberry subgenome structure and evolutionary origins.

  4. PanSNPdb: the Pan-Asian SNP genotyping database.

    PubMed

    Ngamphiw, Chumpol; Assawamakin, Anunchai; Xu, Shuhua; Shaw, Philip J; Yang, Jin Ok; Ghang, Ho; Bhak, Jong; Liu, Edison; Tongsima, Sissades

    2011-01-01

    The HUGO Pan-Asian SNP consortium conducted the largest survey to date of human genetic diversity among Asians by sampling 1,719 unrelated individuals among 71 populations from China, India, Indonesia, Japan, Malaysia, the Philippines, Singapore, South Korea, Taiwan, and Thailand. We have constructed a database (PanSNPdb), which contains these data and various new analyses of them. PanSNPdb is a research resource in the analysis of the population structure of Asian peoples, including linkage disequilibrium patterns, haplotype distributions, and copy number variations. Furthermore, PanSNPdb provides an interactive comparison with other SNP and CNV databases, including HapMap3, JSNP, dbSNP and DGV and thus provides a comprehensive resource of human genetic diversity. The information is accessible via a widely accepted graphical interface used in many genetic variation databases. Unrestricted access to PanSNPdb and any associated files is available at: http://www4a.biotec.or.th/PASNP.

  5. Forensic SNP Genotyping using Nanopore MinION Sequencing

    PubMed Central

    Cornelis, Senne; Gansemans, Yannick; Deleye, Lieselot; Deforce, Dieter; Van Nieuwerburgh, Filip

    2017-01-01

    One of the latest developments in next generation sequencing is the Oxford Nanopore Technologies’ (ONT) MinION nanopore sequencer. We studied the applicability of this system to perform forensic genotyping of the forensic female DNA standard 9947 A using the 52 SNP-plex assay developed by the SNPforID consortium. All but one of the loci were correctly genotyped. Several SNP loci were identified as problematic for correct and robust genotyping using nanopore sequencing. All these loci contained homopolymers in the sequence flanking the forensic SNP and most of them were already reported as problematic in studies using other sequencing technologies. When these problematic loci are avoided, correct forensic genotyping using nanopore sequencing is technically feasible. PMID:28155888

  6. Forensic SNP Genotyping using Nanopore MinION Sequencing.

    PubMed

    Cornelis, Senne; Gansemans, Yannick; Deleye, Lieselot; Deforce, Dieter; Van Nieuwerburgh, Filip

    2017-02-03

    One of the latest developments in next generation sequencing is the Oxford Nanopore Technologies' (ONT) MinION nanopore sequencer. We studied the applicability of this system to perform forensic genotyping of the forensic female DNA standard 9947 A using the 52 SNP-plex assay developed by the SNPforID consortium. All but one of the loci were correctly genotyped. Several SNP loci were identified as problematic for correct and robust genotyping using nanopore sequencing. All these loci contained homopolymers in the sequence flanking the forensic SNP and most of them were already reported as problematic in studies using other sequencing technologies. When these problematic loci are avoided, correct forensic genotyping using nanopore sequencing is technically feasible.

  7. Population distribution and ancestry of the cancer protective MDM2 SNP285 (rs117039649).

    PubMed

    Knappskog, Stian; Gansmo, Liv B; Dibirova, Khadizha; Metspalu, Andres; Cybulski, Cezary; Peterlongo, Paolo; Aaltonen, Lauri; Vatten, Lars; Romundstad, Pål; Hveem, Kristian; Devilee, Peter; Evans, Gareth D; Lin, Dongxin; Van Camp, Guy; Manolopoulos, Vangelis G; Osorio, Ana; Milani, Lili; Ozcelik, Tayfun; Zalloua, Pierre; Mouzaya, Francis; Bliznetz, Elena; Balanovska, Elena; Pocheshkova, Elvira; Kučinskas, Vaidutis; Atramentova, Lubov; Nymadawa, Pagbajabyn; Titov, Konstantin; Lavryashina, Maria; Yusupov, Yuldash; Bogdanova, Natalia; Koshel, Sergey; Zamora, Jorge; Wedge, David C; Charlesworth, Deborah; Dörk, Thilo; Balanovsky, Oleg; Lønning, Per E

    2014-09-30

    The MDM2 promoter SNP285C is located on the SNP309G allele. While SNP309G enhances Sp1 transcription factor binding and MDM2 transcription, SNP285C antagonizes Sp1 binding and reduces the risk of breast-, ovary- and endometrial cancer. Assessing SNP285 and 309 genotypes across 25 different ethnic populations (>10.000 individuals), the incidence of SNP285C was 6-8% across European populations except for Finns (1.2%) and Saami (0.3%). The incidence decreased towards the Middle-East and Eastern Russia, and SNP285C was absent among Han Chinese, Mongolians and African Americans. Interhaplotype variation analyses estimated SNP285C to have originated about 14,700 years ago (95% CI: 8,300 - 33,300). Both this estimate and the geographical distribution suggest SNP285C to have arisen after the separation between Caucasians and modern day East Asians (17,000 - 40,000 years ago). We observed a strong inverse correlation (r = -0.805; p < 0.001) between the percentage of SNP309G alleles harboring SNP285C and the MAF for SNP309G itself across different populations suggesting selection and environmental adaptation with respect to MDM2 expression in recent human evolution. In conclusion, we found SNP285C to be a pan-Caucasian variant. Ethnic variation regarding distribution of SNP285C needs to be taken into account when assessing the impact of MDM2 SNPs on cancer risk.

  8. Temple syndrome: A patient with maternal hetero-UPD14, mixed iso- and hetero-disomy detected by SNP microarray typing of patient-father duos.

    PubMed

    Shin, Eun-Hye; Cho, Eunhae; Lee, Cha Gon

    2016-08-01

    Temple syndrome (TS, MIM 616222) is an imprinting disorder involving genes within the imprinted region of chromosome 14q32. TS is a genetically complex disorder, which is associated with maternal uniparental disomy of chromosome 14 (UPD14), paternal deletions on chromosome 14, or loss of methylation at the intergenic differentially methylated region (IG-DMR). Here, we describe the case of a patient with maternal hetero-UPD14, mixed iso-/hetero-disomy mechanism identified by a single nucleotide polymorphism (SNP) array analysis of patient-father duos study. The phenotype of our case is similarities to Prader-Willi syndrome (PWS) during infancy and to Russell-Silver syndrome (RSS) during childhood. This SNP array appears to be an effective initial screening tool for patients with nonspecific clinical features suggestive of chromosomal disorders.

  9. A 48 SNP set for grapevine cultivar identification

    PubMed Central

    2011-01-01

    Background Rapid and consistent genotyping is an important requirement for cultivar identification in many crop species. Among them grapevine cultivars have been the subject of multiple studies given the large number of synonyms and homonyms generated during many centuries of vegetative multiplication and exchange. Simple sequence repeat (SSR) markers have been preferred until now because of their high level of polymorphism, their codominant nature and their high profile repeatability. However, the rapid application of partial or complete genome sequencing approaches is identifying thousands of single nucleotide polymorphisms (SNP) that can be very useful for such purposes. Although SNP markers are bi-allelic, and therefore not as polymorphic as microsatellites, the high number of loci that can be multiplexed and the possibilities of automation as well as their highly repeatable results under any analytical procedure make them the future markers of choice for any type of genetic identification. Results We analyzed over 300 SNP in the genome of grapevine using a re-sequencing strategy in a selection of 11 genotypes. Among the identified polymorphisms, we selected 48 SNP spread across all grapevine chromosomes with allele frequencies balanced enough as to provide sufficient information content for genetic identification in grapevine allowing for good genotyping success rate. Marker stability was tested in repeated analyses of a selected group of cultivars obtained worldwide to demonstrate their usefulness in genetic identification. Conclusions We have selected a set of 48 stable SNP markers with a high discrimination power and a uniform genome distribution (2-3 markers/chromosome), which is proposed as a standard set for grapevine (Vitis vinifera L.) genotyping. Any previous problems derived from microsatellite allele confusion between labs or the need to run reference cultivars to identify allele sizes disappear using this type of marker. Furthermore, because SNP

  10. Sniper: improved SNP discovery by multiply mapping deep sequenced reads.

    PubMed

    Simola, Daniel F; Kim, Junhyong

    2011-06-20

    SNP (single nucleotide polymorphism) discovery using next-generation sequencing data remains difficult primarily because of redundant genomic regions, such as interspersed repetitive elements and paralogous genes, present in all eukaryotic genomes. To address this problem, we developed Sniper, a novel multi-locus Bayesian probabilistic model and a computationally efficient algorithm that explicitly incorporates sequence reads that map to multiple genomic loci. Our model fully accounts for sequencing error, template bias, and multi-locus SNP combinations, maintaining high sensitivity and specificity under a broad range of conditions. An implementation of Sniper is freely available at http://kim.bio.upenn.edu/software/sniper.shtml.

  11. Evidence for SNP-SNP interaction identified through targeted sequencing of cleft case-parent trios.

    PubMed

    Xiao, Yanzi; Taub, Margaret A; Ruczinski, Ingo; Begum, Ferdouse; Hetmanski, Jacqueline B; Schwender, Holger; Leslie, Elizabeth J; Koboldt, Daniel C; Murray, Jeffrey C; Marazita, Mary L; Beaty, Terri H

    2017-04-01

    Nonsyndromic cleft lip with or without cleft palate (NSCL/P) is the most common craniofacial birth defect in humans, affecting 1 in 700 live births. This malformation has a complex etiology where multiple genes and several environmental factors influence risk. At least a dozen different genes have been confirmed to be associated with risk of NSCL/P in previous studies. However, all the known genetic risk factors cannot fully explain the observed heritability of NSCL/P, and several authors have suggested gene-gene (G × G) interaction may be important in the etiology of this complex and heterogeneous malformation. We tested for G × G interactions using common single nucleotide polymorphic (SNP) markers from targeted sequencing in 13 regions identified by previous studies spanning 6.3 Mb of the genome in a study of 1,498 NSCL/P case-parent trios. We used the R-package trio to assess interactions between polymorphic markers in different genes, using a 1 degree of freedom (1df) test for screening, and a 4 degree of freedom (4df) test to assess statistical significance of epistatic interactions. To adjust for multiple comparisons, we performed permutation tests. The most significant interaction was observed between rs6029315 in MAFB and rs6681355 in IRF6 (4df P = 3.8 × 10(-8) ) in case-parent trios of European ancestry, which remained significant after correcting for multiple comparisons. However, no significant interaction was detected in trios of Asian ancestry.

  12. A high-density, multi-parental SNP genetic map on apple validates a new mapping approach for outcrossing species

    PubMed Central

    Di Pierro, Erica A; Gianfranceschi, Luca; Di Guardo, Mario; Koehorst-van Putten, Herma JJ; Kruisselbrink, Johannes W; Longhi, Sara; Troggio, Michela; Bianco, Luca; Muranty, Hélène; Pagliarani, Giulia; Tartarini, Stefano; Letschka, Thomas; Lozano Luis, Lidia; Garkava-Gustavsson, Larisa; Micheletti, Diego; Bink, Marco CAM; Voorrips, Roeland E; Aziz, Ebrahimi; Velasco, Riccardo; Laurens, François; van de Weg, W Eric

    2016-01-01

    Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple (Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 are reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species. PMID:27917289

  13. Target SNP selection in complex disease association studies

    PubMed Central

    Wjst, Matthias

    2004-01-01

    Background The massive amount of SNP data stored at public internet sites provides unprecedented access to human genetic variation. Selecting target SNP for disease-gene association studies is currently done more or less randomly as decision rules for the selection of functional relevant SNPs are not available. Results We implemented a computational pipeline that retrieves the genomic sequence of target genes, collects information about sequence variation and selects functional motifs containing SNPs. Motifs being considered are gene promoter, exon-intron structure, AU-rich mRNA elements, transcription factor binding motifs, cryptic and enhancer splice sites together with expression in target tissue. As a case study, 396 genes on chromosome 6p21 in the extended HLA region were selected that contributed nearly 20,000 SNPs. By computer annotation ~2,500 SNPs in functional motifs could be identified. Most of these SNPs are disrupting transcription factor binding sites but only those introducing new sites had a significant depressing effect on SNP allele frequency. Other decision rules concern position within motifs, the validity of SNP database entries, the unique occurrence in the genome and conserved sequence context in other mammalian genomes. Conclusion Only 10% of all gene-based SNPs have sequence-predicted functional relevance making them a primary target for genotyping in association studies. PMID:15248903

  14. Do you really know where this SNP goes?

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The release of build 10.2 of the swine genome was a marked improvement over previous builds and has proven extremely useful. However, as most know, there are regions of the genome that this particular build does not accurately represent. For instance, nearly 25% of the 62,162 SNP on the Illumina Por...

  15. Genetic mapping in grapevine using a SNP microarray: intensity values

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genotyping microarrays are widely used for genome wide association studies, but in high-diversity organisms, the quality of SNP calls can be diminished by genetic variation near the assayed nucleotide. To address this limitation in grapevine, we developed a simple heuristic that uses hybridization i...

  16. High throughput SNP detection system based on magnetic nanoparticles separation.

    PubMed

    Liu, Bin; Jia, Yingying; Ma, Man; Li, Zhiyang; Liu, Hongna; Li, Song; Deng, Yan; Zhang, Liming; Lu, Zhuoxuan; Wang, Wei; He, Nongyue

    2013-02-01

    Single-nucleotide polymorphism (SNP) was one-base variations in DNA sequence that can often be helpful to find genes associations for hereditary disease, communicable disease and so on. We developed a high throughput SNP detection system based on magnetic nanoparticles (MNPs) separation and dual-color hybridization or single base extension. This system includes a magnetic separation unit for sample separation, three high precision robot arms for pipetting and microtiter plate transferring respectively, an accurate temperature control unit for PCR and DNA hybridization and a high accurate and sensitive optical signal detection unit for fluorescence detection. The cyclooxygenase-2 gene promoter region--65G > C polymorphism locus SNP genotyping experiment for 48 samples from the northern Jiangsu area has been done to verify that if this system can simplify manual operation of the researchers, save time and improve efficiency in SNP genotyping experiments. It can realize sample preparation, target sequence amplification, signal detection and data analysis automatically and can be used in clinical molecule diagnosis and high throughput fluorescence immunological detection and so on.

  17. Weighted SNP set analysis in genome-wide association study.

    PubMed

    Dai, Hui; Zhao, Yang; Qian, Cheng; Cai, Min; Zhang, Ruyang; Chu, Minjie; Dai, Juncheng; Hu, Zhibin; Shen, Hongbing; Chen, Feng

    2013-01-01

    Genome-wide association studies (GWAS) are popular for identifying genetic variants which are associated with disease risk. Many approaches have been proposed to test multiple single nucleotide polymorphisms (SNPs) in a region simultaneously which considering disadvantages of methods in single locus association analysis. Kernel machine based SNP set analysis is more powerful than single locus analysis, which borrows information from SNPs correlated with causal or tag SNPs. Four types of kernel machine functions and principal component based approach (PCA) were also compared. However, given the loss of power caused by low minor allele frequencies (MAF), we conducted an extension work on PCA and used a new method called weighted PCA (wPCA). Comparative analysis was performed for weighted principal component analysis (wPCA), logistic kernel machine based test (LKM) and principal component analysis (PCA) based on SNP set in the case of different minor allele frequencies (MAF) and linkage disequilibrium (LD) structures. We also applied the three methods to analyze two SNP sets extracted from a real GWAS dataset of non-small cell lung cancer in Han Chinese population. Simulation results show that when the MAF of the causal SNP is low, weighted principal component and weighted IBS are more powerful than PCA and other kernel machine functions at different LD structures and different numbers of causal SNPs. Application of the three methods to a real GWAS dataset indicates that wPCA and wIBS have better performance than the linear kernel, IBS kernel and PCA.

  18. The development and characterization of a 57K single nucleotide polymorphism array for rainbow trout.

    PubMed

    Palti, Y; Gao, G; Liu, S; Kent, M P; Lien, S; Miller, M R; Rexroad, C E; Moen, T

    2015-05-01

    In this study, we describe the development and characterization of the first high-density single nucleotide polymorphism (SNP) genotyping array for rainbow trout. The SNP array is publically available from a commercial vendor (Affymetrix). The SNP genotyping quality was high, and validation rate was close to 90%. This is comparable to other farm animals and is much higher than previous smaller scale SNP validation studies in rainbow trout. High quality and integrity of the genotypes are evident from sample reproducibility and from nearly 100% agreement in genotyping results from other methods. The array is very useful for rainbow trout aquaculture populations with more than 40 900 polymorphic markers per population. For wild populations that were confounded by a smaller sample size, the number of polymorphic markers was between 10 577 and 24 330. Comparison between genotypes from individual populations suggests good potential for identifying candidate markers for populations' traceability. Linkage analysis and mapping of the SNPs to the reference genome assembly provide strong evidence for a wide distribution throughout the genome with good representation in all 29 chromosomes. A total of 68% of the genome scaffolds and contigs were anchored through linkage analysis using the SNP array genotypes, including ~20% of the genome assembly that has not been previously anchored to chromosomes.

  19. Large-Scale SNP Discovery through RNA Sequencing and SNP Genotyping by Targeted Enrichment Sequencing in Cassava (Manihot esculenta Crantz)

    PubMed Central

    Pootakham, Wirulda; Shearman, Jeremy R.; Ruang-areerate, Panthita; Sonthirod, Chutima; Sangsrakru, Duangjai; Jomchai, Nukoon; Yoocha, Thippawan; Triwitayakorn, Kanokporn; Tragoonrung, Somvong; Tangphatsornruang, Sithichoke

    2014-01-01

    Cassava (Manihot esculenta Crantz) is one of the most important crop species being the main source of dietary energy in several countries. Marker-assisted selection has become an essential tool in plant breeding. Single nucleotide polymorphism (SNP) discovery via transcriptome sequencing is an attractive strategy for genome complexity reduction in organisms with large genomes. We sequenced the transcriptome of 16 cassava accessions using the Illumina HiSeq platform and identified 675,559 EST-derived SNP markers. A subset of those markers was subsequently genotyped by capture-based targeted enrichment sequencing in 100 F1 progeny segregating for starch viscosity phenotypes. A total of 2,110 non-redundant SNP markers were used to construct a genetic map. This map encompasses 1,785 cM and consists of 19 linkage groups. A major quantitative trait locus (QTL) controlling starch pasting properties was identified and shown to coincide with the QTL previously reported for this trait. With a high-density SNP-based linkage map presented here, we also uncovered a novel QTL associated with starch pasting time on LG 10. PMID:25551642

  20. Large-scale SNP discovery through RNA sequencing and SNP genotyping by targeted enrichment sequencing in cassava (Manihot esculenta Crantz).

    PubMed

    Pootakham, Wirulda; Shearman, Jeremy R; Ruang-Areerate, Panthita; Sonthirod, Chutima; Sangsrakru, Duangjai; Jomchai, Nukoon; Yoocha, Thippawan; Triwitayakorn, Kanokporn; Tragoonrung, Somvong; Tangphatsornruang, Sithichoke

    2014-01-01

    Cassava (Manihot esculenta Crantz) is one of the most important crop species being the main source of dietary energy in several countries. Marker-assisted selection has become an essential tool in plant breeding. Single nucleotide polymorphism (SNP) discovery via transcriptome sequencing is an attractive strategy for genome complexity reduction in organisms with large genomes. We sequenced the transcriptome of 16 cassava accessions using the Illumina HiSeq platform and identified 675,559 EST-derived SNP markers. A subset of those markers was subsequently genotyped by capture-based targeted enrichment sequencing in 100 F1 progeny segregating for starch viscosity phenotypes. A total of 2,110 non-redundant SNP markers were used to construct a genetic map. This map encompasses 1,785 cM and consists of 19 linkage groups. A major quantitative trait locus (QTL) controlling starch pasting properties was identified and shown to coincide with the QTL previously reported for this trait. With a high-density SNP-based linkage map presented here, we also uncovered a novel QTL associated with starch pasting time on LG 10.

  1. High-throughput SNP genotyping for breeding applications in rice using the BeadXpress platform

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Multiplexed single nucleotide polymorphism (SNP) markers have the potential to increase the speed and cost-effectiveness of genotyping, provided that an optimal SNP density is used for each application. To test the efficiency of multiplexed SNP genotyping for diversity, mapping and breeding applicat...

  2. Development of Single Nucleotide Polymorphism (SNP) Markers for Use in Commercial Maize (Zea Mays L.) Germplasm

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The development of single nucleotide polymorphism (SNP) markers in maize offer the opportunity to utilize DNA markers in many new areas of population genetics, gene discovery, plant breeding, and germplasm identification. However, the steps from sequencing and SNP discovery to SNP marker design and ...

  3. Development of new SNP derived cleaved amplified polymorphic sequence marker set and its successful utilization in the genetic analysis of seed color variation in barley.

    PubMed

    Bungartz, Annemarie; Klaus, Marius; Mathew, Boby; Léon, Jens; Naz, Ali Ahmad

    2016-03-01

    The aim of the present study was to develop a new cost effective PCR based CAPS marker set using advantages of high-throughput SNP genotyping. Initially, SNP survey was made using 20 diverse barley genotypes via 9k iSelect array genotyping that resulted in 6334 polymorphic SNP markers. Principle component analysis using this marker data showed fine differentiation of barley diverse gene pool. Till this end, we developed 200 SNP derived CAPS markers distributed across the genome covering around 991cM with an average marker density of 5.09cM. Further, we genotyped 68 CAPS markers in an F2 population (Cheri×ICB181160) segregating for seed color variation in barley. Genetic mapping of seed color revealed putative linkage of single nuclear gene on chromosome 1H. These findings showed the proof of concept for the development and utility of a newer cost effective genomic tool kit to analyze broader genetic resources of barley worldwide.

  4. Vitis phylogenomics: hybridization intensities from a SNP array outperform genotype calls

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Understanding relationships among species is a fundamental goal of evolutionary biology. Single nucleotide polymorphisms (SNPs) identified through next generation sequencing and related technologies enable phylogeny reconstruction by providing unprecedented numbers of characters for analysis. One ...

  5. Review of the initial validation and characterization of a 3K chicken SNP array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The year 2004 was a historic one for biologists and especially the chicken research community as the first draft of the chicken genome was published (International Chicken Genome Sequencing Consortium, 2004). The 6.6X coverage of a UCD001 female Red Jungle Fowl (RJF) genome was the first complete d...

  6. Analysis of East Asia Genetic Substructure Using Genome-Wide SNP Arrays

    PubMed Central

    Tian, Chao; Kosoy, Roman; Lee, Annette; Ransom, Michael; Belmont, John W.; Gregersen, Peter K.; Seldin, Michael F.

    2008-01-01

    Accounting for population genetic substructure is important in reducing type 1 errors in genetic studies of complex disease. As efforts to understand complex genetic disease are expanded to different continental populations the understanding of genetic substructure within these continents will be useful in design and execution of association tests. In this study, population differentiation (Fst) and Principal Components Analyses (PCA) are examined using >200 K genotypes from multiple populations of East Asian ancestry. The population groups included those from the Human Genome Diversity Panel [Cambodian, Yi, Daur, Mongolian, Lahu, Dai, Hezhen, Miaozu, Naxi, Oroqen, She, Tu, Tujia, Naxi, Xibo, and Yakut], HapMap [ Han Chinese (CHB) and Japanese (JPT)], and East Asian or East Asian American subjects of Vietnamese, Korean, Filipino and Chinese ancestry. Paired Fst (Wei and Cockerham) showed close relationships between CHB and several large East Asian population groups (CHB/Korean, 0.0019; CHB/JPT, 00651; CHB/Vietnamese, 0.0065) with larger separation with Filipino (CHB/Filipino, 0.014). Low levels of differentiation were also observed between Dai and Vietnamese (0.0045) and between Vietnamese and Cambodian (0.0062). Similarly, small Fst's were observed among different presumed Han Chinese populations originating in different regions of mainland of China and Taiwan (Fst's <0.0025 with CHB). For PCA, the first two PC's showed a pattern of relationships that closely followed the geographic distribution of the different East Asian populations. PCA showed substructure both between different East Asian groups and within the Han Chinese population. These studies have also identified a subset of East Asian substructure ancestry informative markers (EASTASAIMS) that may be useful for future complex genetic disease association studies in reducing type 1 errors and in identifying homogeneous groups that may increase the power of such studies. PMID:19057645

  7. Immunochip SNP array identifies novel genetic variants conferring susceptibility to candidaemia.

    PubMed

    Kumar, Vinod; Cheng, Shih-Chin; Johnson, Melissa D; Smeekens, Sanne P; Wojtowicz, Agnieszka; Giamarellos-Bourboulis, Evangelos; Karjalainen, Juha; Franke, Lude; Withoff, Sebo; Plantinga, Theo S; van de Veerdonk, Frank L; van der Meer, Jos W M; Joosten, Leo A B; Sokol, Harry; Bauer, Hermann; Herrmann, Bernhard G; Bochud, Pierre-Yves; Marchetti, Oscar; Perfect, John R; Xavier, Ramnik J; Kullberg, Bart Jan; Wijmenga, Cisca; Netea, Mihai G

    2014-09-08

    Candidaemia is the fourth most common cause of bloodstream infection, with a high mortality rate of up to 40%. Identification of host genetic factors that confer susceptibility to candidaemia may aid in designing adjunctive immunotherapeutic strategies. Here we hypothesize that variation in immune genes may predispose to candidaemia. We analyse 118,989 single-nucleotide polymorphisms (SNPs) across 186 loci known to be associated with immune-mediated diseases in the largest candidaemia cohort to date of 217 patients of European ancestry and a group of 11,920 controls. We validate the significant associations by comparison with a disease-matched control group. We observe significant association between candidaemia and SNPs in the CD58 (P = 1.97 × 10(-11); odds ratio (OR) = 4.68), LCE4A-C1orf68 (P = 1.98 × 10(-10); OR = 4.25) and TAGAP (P = 1.84 × 10(-8); OR = 2.96) loci. Individuals carrying two or more risk alleles have an increased risk for candidaemia of 19.4-fold compared with individuals carrying no risk allele. We identify three novel genetic risk factors for candidaemia, which we subsequently validate for their role in antifungal host defence.

  8. Diversity analysis of cotton (Gossypium hirsutum L.) germplasm using the CottonSNP63K Array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Cotton germplasm resources contain beneficial alleles that can be exploited to develop germplasm adapting to emerging environmental and climate conditions, and this germplasm has commonly been characterized based on phenotypes. However, phenotypic profiles are limited by what can be observed and me...

  9. Candidate loci involved in domestication and improvement detected by a published 90K wheat SNP array

    PubMed Central

    Gao, Lifeng; Zhao, Guangyao; Huang, Dawei; Jia, Jizeng

    2017-01-01

    Selection is one of the most important forces in crop evolution. Common wheat is a major world food crop and a typical allopolyploid with a huge and complex genome. We applied four approaches to detect loci selected in wheat during domestication and improvement. A total of 7,984 candidate loci were detected, accounting for 23.3% of all 34,317 SNPs analysed, a much higher proportion than estimated in previous reports. We constructed a first generation wheat selection map which revealed the following new insights on genome-wide selection: (1) diversifying selection acted by increasing, decreasing or not affecting gene frequencies; (2) the number of loci under selection during domestication was much higher than that during improvement; (3) the contribution to wheat improvement by the D sub-genome was relatively small due to the bottleneck of hexaploidisation and diversity can be expanded by using synthetic wheat and introgression lines; and (4) clustered selection regions occur throughout the wheat genome, including the centromere regions. This study will not only help future wheat breeding and evolutionary studies, but will also accelerate study of other crops, especially polyploids. PMID:28327671

  10. SNP genotyping using single-tube fluorescent bidirectional PCR.

    PubMed

    Waterfall, Christy M; Cobb, Benjamin D

    2002-07-01

    SNP genotyping is a well-populatedfield with a large number of assay formats offering accurate allelic discrimination. However, there remains a discord between the ultimate goal of rapid, inexpensive assays that do not require complex design considerations and involved optimization strategies. We describe the first integration of bidirectional allele-specific amplification, SYBR Green I, and rapid-cycle PCR to provide a homogeneous SNP-typing assay. Wild-type, mutant, and heterozygous alleles were easily discriminated in a single tube using melt curve profiling of PCR products alone. We demonstrate the effectiveness and reliability of this assay with a blinded trial using clinical samples from individuals with sickle cell anemia, sickle cell trait, or unaffected individuals. The tests were completed in less than 30 min without expensive fluorogenic probes, prohibiting design rules, or lengthy downstream processing for product analysis.

  11. Pyrobayes: an improved base caller for SNP discovery in pyrosequences.

    PubMed

    Quinlan, Aaron R; Stewart, Donald A; Strömberg, Michael P; Marth, Gábor T

    2008-02-01

    Previously reported applications of the 454 Life Sciences pyrosequencing technology have relied on deep sequence coverage for accurate polymorphism discovery because of frequent insertion and deletion sequence errors. Here we report a new base calling program, Pyrobayes, for pyrosequencing reads. Pyrobayes permits accurate single-nucleotide polymorphism (SNP) calling in resequencing applications, even in shallow read coverage, primarily because it produces more confident base calls than the native base calling program.

  12. Enthalpy arrays

    PubMed Central

    Torres, Francisco E.; Kuhn, Peter; De Bruyker, Dirk; Bell, Alan G.; Wolkin, Michal V.; Peeters, Eric; Williamson, James R.; Anderson, Gregory B.; Schmitz, Gregory P.; Recht, Michael I.; Schweizer, Sandra; Scott, Lincoln G.; Ho, Jackson H.; Elrod, Scott A.; Schultz, Peter G.; Lerner, Richard A.; Bruce, Richard H.

    2004-01-01

    We report the fabrication of enthalpy arrays and their use to detect molecular interactions, including protein–ligand binding, enzymatic turnover, and mitochondrial respiration. Enthalpy arrays provide a universal assay methodology with no need for specific assay development such as fluorescent labeling or immobilization of reagents, which can adversely affect the interaction. Microscale technology enables the fabrication of 96-detector enthalpy arrays on large substrates. The reduction in scale results in large decreases in both the sample quantity and the measurement time compared with conventional microcalorimetry. We demonstrate the utility of the enthalpy arrays by showing measurements for two protein–ligand binding interactions (RNase A + cytidine 2′-monophosphate and streptavidin + biotin), phosphorylation of glucose by hexokinase, and respiration of mitochondria in the presence of 2,4-dinitrophenol uncoupler. PMID:15210951

  13. Array tomography: imaging stained arrays.

    PubMed

    Micheva, Kristina D; O'Rourke, Nancy; Busse, Brad; Smith, Stephen J

    2010-11-01

    Array tomography is a volumetric microscopy method based on physical serial sectioning. Ultrathin sections of a plastic-embedded tissue are cut using an ultramicrotome, bonded in an ordered array to a glass coverslip, stained as desired, and imaged. The resulting two-dimensional image tiles can then be reconstructed computationally into three-dimensional volume images for visualization and quantitative analysis. The minimal thickness of individual sections permits high-quality rapid staining and imaging, whereas the array format allows reliable and convenient section handling, staining, and automated imaging. Also, the physical stability of the arrays permits images to be acquired and registered from repeated cycles of staining, imaging, and stain elution, as well as from imaging using multiple modalities (e.g., fluorescence and electron microscopy). Array tomography makes it possible to visualize and quantify previously inaccessible features of tissue structure and molecular architecture. However, careful preparation of the tissue is essential for successful array tomography; these steps can be time-consuming and require some practice to perfect. In this protocol, tissue arrays are imaged using conventional wide-field fluorescence microscopy. Images can be captured manually or, with the appropriate software and hardware, the process can be automated.

  14. Array tomography: production of arrays.

    PubMed

    Micheva, Kristina D; O'Rourke, Nancy; Busse, Brad; Smith, Stephen J

    2010-11-01

    Array tomography is a volumetric microscopy method based on physical serial sectioning. Ultrathin sections of a plastic-embedded tissue are cut using an ultramicrotome, bonded in an ordered array to a glass coverslip, stained as desired, and imaged. The resulting two-dimensional image tiles can then be reconstructed computationally into three-dimensional volume images for visualization and quantitative analysis. The minimal thickness of individual sections permits high-quality rapid staining and imaging, whereas the array format allows reliable and convenient section handling, staining, and automated imaging. Also, the physical stability of the arrays permits images to be acquired and registered from repeated cycles of staining, imaging, and stain elution, as well as from imaging using multiple modalities (e.g., fluorescence and electron microscopy). Array tomography makes it possible to visualize and quantify previously inaccessible features of tissue structure and molecular architecture. However, careful preparation of the tissue is essential for successful array tomography; these steps can be time consuming and require some practice to perfect. This protocol describes the sectioning of embedded tissues and the mounting of the serial arrays. The procedures require some familiarity with the techniques used for ultramicrotome sectioning for electron microscopy.

  15. Infrared Arrays

    NASA Astrophysics Data System (ADS)

    McLean, I.; Murdin, P.

    2000-11-01

    Infrared arrays are small electronic imaging devices subdivided into a grid or `array' of picture elements, or pixels, each of which is made of a material sensitive to photons (ELECTROMAGNETIC RADIATION) with wavelengths much longer than normal visible light. Typical dimensions of currently available devices are about 27-36 mm square, and formats now range from 2048×2048 pixels for the near-infra...

  16. Development of a forensic identity SNP panel for Indonesia.

    PubMed

    Augustinus, Daniel; Gahan, Michelle E; McNevin, Dennis

    2015-07-01

    Genetic markers included in forensic identity panels must exhibit Hardy-Weinberg and linkage equilibrium (HWE and LE). "Universal" panels designed for global use can fail these tests in regional jurisdictions exhibiting high levels of genetic differentiation such as the Indonesian archipelago. This is especially the case where a single DNA database is required for allele frequency estimates to calculate random match probabilities (RMPs) and associated likelihood ratios (LRs). A panel of 65 single nucleotide polymorphisms (SNPs) and a reduced set of 52 SNPs have been selected from 15 Indonesian subpopulations in the HUGO Pan Asian SNP database using a SNP selection strategy that could be applied to any panel of forensic identity markers. The strategy consists of four screening steps: (1) application of a G test for HWE; (2) ranking for high heterozygosity; (3) selection for LE; and (4) selection for low inbreeding depression. SNPs in our Indonesian panel perform well in comparison to some other universal SNP and short tandem repeat (STR) panels as measured by Fisher's exact test for HWE and LE and Wright's F statistics.

  17. Population distribution and ancestry of the cancer protective MDM2 SNP285 (rs117039649)

    PubMed Central

    Knappskog, Stian; Gansmo, Liv B.; Dibirova, Khadizha; Metspalu, Andres; Cybulski, Cezary; Peterlongo, Paolo; Aaltonen, Lauri; Vatten, Lars; Romundstad, Pål; Hveem, Kristian; Devilee, Peter; Evans, Gareth D.; Lin, Dongxin; Camp, Guy Van; Manolopoulos, Vangelis G.; Osorio, Ana; Milani, Lili; Ozcelik, Tayfun; Zalloua, Pierre; Mouzaya, Francis; Bliznetz, Elena; Balanovska, Elena; Pocheshkova, Elvira; Kučinskas, Vaidutis; Atramentova, Lubov; Nymadawa, Pagbajabyn; Titov, Konstantin; Lavryashina, Maria; Yusupov, Yuldash; Bogdanova, Natalia; Koshel, Sergey; Zamora, Jorge; Wedge, David C.; Charlesworth, Deborah; Dörk, Thilo; Balanovsky, Oleg; Lønning, Per E.

    2014-01-01

    The MDM2 promoter SNP285C is located on the SNP309G allele. While SNP309G enhances Sp1 transcription factor binding and MDM2 transcription, SNP285C antagonizes Sp1 binding and reduces the risk of breast-, ovary- and endometrial cancer. Assessing SNP285 and 309 genotypes across 25 different ethnic populations (>10.000 individuals), the incidence of SNP285C was 6-8% across European populations except for Finns (1.2%) and Saami (0.3%). The incidence decreased towards the Middle-East and Eastern Russia, and SNP285C was absent among Han Chinese, Mongolians and African Americans. Interhaplotype variation analyses estimated SNP285C to have originated about 14,700 years ago (95% CI: 8,300 – 33,300). Both this estimate and the geographical distribution suggest SNP285C to have arisen after the separation between Caucasians and modern day East Asians (17,000 - 40,000 years ago). We observed a strong inverse correlation (r = -0.805; p < 0.001) between the percentage of SNP309G alleles harboring SNP285C and the MAF for SNP309G itself across different populations suggesting selection and environmental adaptation with respect to MDM2 expression in recent human evolution. In conclusion, we found SNP285C to be a pan-Caucasian variant. Ethnic variation regarding distribution of SNP285C needs to be taken into account when assessing the impact of MDM2 SNPs on cancer risk. PMID:25327560

  18. Forensic SNP genotyping with SNaPshot: Technical considerations for the development and optimization of multiplexed SNP assays.

    PubMed

    Fondevila, M; Børsting, C; Phillips, C; de la Puente, M; Consortium, Euroforen-NoE; Carracedo, A; Morling, N; Lareu, M V

    2017-01-01

    This review explores the key factors that influence the optimization, routine use, and profile interpretation of the SNaPshot single-base extension (SBE) system applied to forensic single-nucleotide polymorphism (SNP) genotyping. Despite being a mainly complimentary DNA genotyping technique to routine STR profiling, use of SNaPshot is an important part of the development of SNP sets for a wide range of forensic applications with these markers, from genotyping highly degraded DNA with very short amplicons to the introduction of SNPs to ascertain the ancestry and physical characteristics of an unidentified contact trace donor. However, this technology, as resourceful as it is, displays several features that depart from the usual STR genotyping far enough to demand a certain degree of expertise from the forensic analyst before tackling the complex casework on which SNaPshot application provides an advantage. In order to provide the basis for developing such expertise, we cover in this paper the most challenging aspects of the SNaPshot technology, focusing on the steps taken to design primer sets, optimize the PCR and single-base extension chemistries, and the important features of the peak patterns observed in typical forensic SNP profiles using SNaPshot. With that purpose in mind, we provide guidelines and troubleshooting for multiplex-SNaPshot-oriented primer design and the resulting capillary electrophoresis (CE) profile interpretation (covering the most commonly observed artifacts and expected departures from the ideal conditions).

  19. Microlens arrays

    NASA Astrophysics Data System (ADS)

    Hutley, Michael C.; Stevens, Richard F.; Daly, Daniel J.

    1992-04-01

    Microlenses have been with us for a long time as indeed the very word lens reminds us. Many early lenses,including those made by Hooke and Leeuwenhoek in the 17th century were small and resembled lentils. Many languages use the same word for both (French tilentillelt and German "Linse") and the connection is only obscure in English because we use the French word for the vegetable and the German for the optic. Many of the applications for arrays of inicrolenses are also well established. Lippmann's work on integral photography at the turn of the century required lens arrays and stimulated an interest that is very much alive today. At one stage, lens arrays played an important part in high speed photography and various schemes have been put forward to take advantage of the compact imaging properties of combinations of lens arrays. The fact that many of these ingenious schemes have not been developed to their full potential has to a large degree been due to the absence of lens arrays of a suitable quality and cost.

  20. A novel TCF7L2 type 2 diabetes SNP identified from fine mapping in African American women

    PubMed Central

    Haddad, Stephen A.; Palmer, Julie R.; Lunetta, Kathryn L.; Ng, Maggie C. Y.; Ruiz-Narváez, Edward A.

    2017-01-01

    SNP rs7903146 in the Wnt pathway’s TCF7L2 gene is the variant most significantly associated with type 2 diabetes to date, with associations observed across diverse populations. We sought to determine whether variants in other Wnt pathway genes are also associated with this disease. We evaluated 69 genes involved in the Wnt pathway, including TCF7L2, for associations with type 2 diabetes in 2632 African American cases and 2596 controls from the Black Women’s Health Study. Tag SNPs for each gene region were genotyped on a custom Affymetrix Axiom Array, and imputation was performed to 1000 Genomes Phase 3 data. Gene-based analyses were conducted using the adaptive rank truncated product (ARTP) statistic. The PSMD2 gene was significantly associated with type 2 diabetes after correction for multiple testing (corrected p = 0.016), based on the nine most significant single variants in the +/- 20 kb region surrounding the gene, which includes nearby genes EIF4G1, ECE2, and EIF2B5. Association data on four of the nine variants were available from an independent sample of 8284 African American cases and 15,543 controls; associations were in the same direction, but weak and not statistically significant. TCF7L2 was the only other gene associated with type 2 diabetes at nominal p <0.01 in our data. One of the three variants in the best gene-based model for TCF7L2, rs114770437, was not correlated with the GWAS index SNP rs7903146 and may represent an independent association signal seen only in African ancestry populations. Data on this SNP were not available in the replication sample. PMID:28253288

  1. Exploration of SNP variants affecting hair colour prediction in Europeans.

    PubMed

    Söchtig, Jens; Phillips, Chris; Maroñas, Olalla; Gómez-Tato, Antonio; Cruz, Raquel; Alvarez-Dios, Jose; de Cal, María-Ángeles Casares; Ruiz, Yarimar; Reich, Kristian; Fondevila, Manuel; Carracedo, Ángel; Lareu, María V

    2015-09-01

    DNA profiling is a key tool for forensic analysis; however, current methods identify a suspect either by direct comparison or from DNA database searches. In cases with unidentified suspects, prediction of visible physical traits e.g. pigmentation or hair distribution of the DNA donors can provide important probative information. This study aimed to explore single nucleotide polymorphism (SNP) variants for their effect on hair colour prediction. A discovery panel of 63 SNPs consisting of already established hair colour markers from the HIrisPlex hair colour phenotyping assay as well as additional markers for which associations to human pigmentation traits were previously identified was used to develop multiplex assays based on SNaPshot single-base extension technology. A genotyping study was performed on a range of European populations (n = 605). Hair colour phenotyping was accomplished by matching donor's hair to a graded colour category system of reference shades and photography. Since multiple SNPs in combination contribute in varying degrees to hair colour predictability in Europeans, we aimed to compile a compact marker set that could provide a reliable hair colour inference from the fewest SNPs. The predictive approach developed uses a naïve Bayes classifier to provide hair colour assignment probabilities for the SNP profiles of the key SNPs and was embedded into the Snipper online SNP classifier ( http://mathgene.usc.es/snipper/ ). Results indicate that red, blond, brown and black hair colours are predictable with informative probabilities in a high proportion of cases. Our study resulted in the identification of 12 most strongly associated SNPs to hair pigmentation variation in six genes.

  2. Etiological yield of SNP microarrays in idiopathic intellectual disability.

    PubMed

    Utine, G Eda; Haliloğlu, Göknur; Volkan-Salancı, Bilge; Çetinkaya, Arda; Kiper, Pelin Ö; Alanay, Yasemin; Aktaş, Dilek; Anlar, Banu; Topçu, Meral; Boduroğlu, Koray; Alikaşifoğlu, Mehmet

    2014-05-01

    Intellectual disability (ID) has a prevalence of 3% and is classified according to its severity. An underlying etiology cannot be determined in 75-80% in mild ID, and in 20-50% of severe ID. After it has been shown that copy number variations involving short DNA segments may cause ID, genome-wide SNP microarrays are being used as a tool for detecting submicroscopic copy number changes and uniparental disomy. This study was performed to investigate the presence of copy number changes in patients with ID of unidentified etiology. Affymetrix(®) 6.0 SNP microarray platform was used for analysis of 100 patients and their healthy parents, and data were evaluated using various databases and literature. Etiological diagnoses were made in 12 patients (12%). Homozygous deletion in NRXN1 gene and duplication in IL1RAPL1 gene were detected for the first time. Two separate patients had deletions in FOXP2 and UBE2A genes, respectively, for which only few patients have recently been reported. Interstitial and subtelomeric copy number changes were described in 6 patients, in whom routine cytogenetic tools revealed normal results. In one patient uniparental disomy type of Angelman syndrome was diagnosed. SNP microarrays constitute a screening test able to detect very small genomic changes, with a high etiological yield even in patients already evaluated using traditional cytogenetic tools, offer analysis for uniparental disomy and homozygosity, and thereby are helpful in finding novel disease-causing genes: for these reasons they should be considered as a first-tier genetic screening test in the evaluation of patients with ID and autism.

  3. A new diagnostic workflow for patients with mental retardation and/or multiple congenital abnormalities: test arrays first.

    PubMed

    Gijsbers, Antoinet C J; Lew, Janet Y K; Bosch, Cathy A J; Schuurs-Hoeijmakers, Janneke H M; van Haeringen, Arie; den Hollander, Nicolette S; Kant, Sarina G; Bijlsma, Emilia K; Breuning, Martijn H; Bakker, Egbert; Ruivenkamp, Claudia A L

    2009-11-01

    High-density single-nucleotide polymorphism (SNP) genotyping technology enables extensive genotyping as well as the detection of increasingly smaller chromosomal aberrations. In this study, we assess molecular karyotyping as first-round analysis of patients with mental retardation and/or multiple congenital abnormalities (MR/MCA). We used different commercially available SNP array platforms, the Affymetrix GeneChip 262K NspI, the Genechip 238K StyI, the Illumina HumanHap 300 and HumanCNV 370 BeadChip, to detect copy number variants (CNVs) in 318 patients with unexplained MR/MCA. We found abnormalities in 22.6% of the patients, including six CNVs that overlap known microdeletion/duplication syndromes, eight CNVs that overlap recently described syndromes, 63 potentially pathogenic CNVs (in 52 patients), four large segments of homozygosity and two mosaic trisomies for an entire chromosome. This study shows that high-density SNP array analysis reveals a much higher diagnostic yield as that of conventional karyotyping. SNP arrays have the potential to detect CNVs, mosaics, uniparental disomies and loss of heterozygosity in one experiment. We, therefore, propose a novel diagnostic approach to all MR/MCA patients by first analyzing every patient with an SNP array instead of conventional karyotyping.

  4. Computational tradeoffs in multiplex PCR assay design for SNP genotyping

    PubMed Central

    Rachlin, John; Ding, Chunming; Cantor, Charles; Kasif, Simon

    2005-01-01

    Background Multiplex PCR is a key technology for detecting infectious microorganisms, whole-genome sequencing, forensic analysis, and for enabling flexible yet low-cost genotyping. However, the design of a multiplex PCR assays requires the consideration of multiple competing objectives and physical constraints, and extensive computational analysis must be performed in order to identify the possible formation of primer-dimers that can negatively impact product yield. Results This paper examines the computational design limits of multiplex PCR in the context of SNP genotyping and examines tradeoffs associated with several key design factors including multiplexing level (the number of primer pairs per tube), coverage (the % of SNP whose associated primers are actually assigned to one of several available tube), and tube-size uniformity. We also examine how design performance depends on the total number of available SNPs from which to choose, and primer stringency criterial. We show that finding high-multiplexing/high-coverage designs is subject to a computational phase transition, becoming dramatically more difficult when the probability of primer pair interaction exceeds a critical threshold. The precise location of this critical transition point depends on the number of available SNPs and the level of multiplexing required. We also demonstrate how coverage performance is impacted by the number of available snps, primer selection criteria, and target multiplexing levels. Conclusion The presence of a phase transition suggests limits to scaling Multiplex PCR performance for high-throughput genomics applications. Achieving broad SNP coverage rapidly transitions from being very easy to very hard as the target multiplexing level (# of primer pairs per tube) increases. The onset of a phase transition can be "delayed" by having a larger pool of SNPs, or loosening primer selection constraints so as to increase the number of candidate primer pairs per SNP, though the latter

  5. Pacific Array

    NASA Astrophysics Data System (ADS)

    Kawakatsu, H.; Takeo, A.; Isse, T.; Nishida, K.; Shiobara, H.; Suetsugu, D.

    2014-12-01

    Based on our recent results on broadband ocean bottom seismometry, we propose a next generation large-scale array experiment in the ocean. Recent advances in ocean bottom broadband seismometry (e.g., Suetsugu & Shiobara, 2014, Annual Review EPS), together with advances in the seismic analysis methodology, have now enabled us to resolve the regional 1-D structure of the entire lithosphere/asthenosphere system, including seismic anisotropy (both radial and azimuthal), with deployments of ~10-15 broadband ocean bottom seismometers (BBOBSs) (namely "ocean-bottom broadband dispersion survey"; Takeo et al., 2013, JGR; Kawakatsu et al., 2013, AGU; Takeo, 2014, Ph.D. Thesis; Takeo et al., 2014, JpGU). Having ~15 BBOBSs as an array unit for 2-year deployment, and repeating such deployments in a leap-frog way (an array of arrays) for a decade or so would enable us to cover a large portion of the Pacific basin. Such efforts, not only by giving regional constraints on the 1-D structure, but also by sharing waveform data for global scale waveform tomography, would drastically increase our knowledge of how plate tectonics works on this planet, as well as how it worked for the past 150 million years. International collaborations might be sought.

  6. Compact wire array sources: power scaling and implosion physics.

    SciTech Connect

    Serrano, Jason Dimitri; Chuvatin, Alexander S.; Jones, M. C.; Vesey, Roger Alan; Waisman, Eduardo M.; Ivanov, V. V.; Esaulov, Andrey A.; Ampleford, David J.; Cuneo, Michael Edward; Kantsyrev, Victor Leonidovich; Coverdale, Christine Anne; Rudakov, L. I.; Jones, Brent Manley; Safronova, Alla S.; Vigil, Marcelino Patricio

    2008-09-01

    A series of ten shots were performed on the Saturn generator in short pulse mode in order to study planar and small-diameter cylindrical tungsten wire arrays at {approx}5 MA current levels and 50-60 ns implosion times as candidates for compact z-pinch radiation sources. A new vacuum hohlraum configuration has been proposed in which multiple z pinches are driven in parallel by a pulsed power generator. Each pinch resides in a separate return current cage, serving also as a primary hohlraum. A collection of such radiation sources surround a compact secondary hohlraum, which may potentially provide an attractive Planckian radiation source or house an inertial confinement fusion fuel capsule. Prior to studying this concept experimentally or numerically, advanced compact wire array loads must be developed and their scaling behavior understood. The 2008 Saturn planar array experiments extend the data set presented in Ref. [1], which studied planar arrays at {approx}3 MA, 100 ns in Saturn long pulse mode. Planar wire array power and yield scaling studies now include current levels directly applicable to multi-pinch experiments that could be performed on the 25 MA Z machine. A maximum total x-ray power of 15 TW (250 kJ in the main pulse, 330 kJ total yield) was observed with a 12-mm-wide planar array at 5.3 MA, 52 ns. The full data set indicates power scaling that is sub-quadratic with load current, while total and main pulse yields are closer to quadratic; these trends are similar to observations of compact cylindrical tungsten arrays on Z. We continue the investigation of energy coupling in these short pulse Saturn experiments using zero-dimensional-type implosion modeling and pinhole imaging, indicating 16 cm/?s implosion velocity in a 12-mm-wide array. The same phenomena of significant trailing mass and evidence for resistive heating are observed at 5 MA as at 3 MA. 17 kJ of Al K-shell radiation was obtained in one Al planar array fielded at 5.5 MA, 57 ns and we

  7. Identification of null alleles and deletions from SNP genotypes for an intercross between domestic and wild chickens.

    PubMed

    Crooks, Lucy; Carlborg, Örjan; Marklund, Stefan; Johansson, Anna M

    2013-08-07

    We analyzed genotypes from ~10K single-nucleotide polymorphisms (SNPs) in two families of an F2 intercross between Red Junglefowl and White Leghorn chickens. Possible null alleles were found by patterns of incompatible and missing genotypes. We estimated that 2.6% of SNPs had null alleles compared with 2.3% with genotyping errors and that 40% of SNPs in which a parent and offspring were genotyped as different homozygotes had null alleles. Putative deletions were identified by null alleles at adjacent markers. We found two candidate deletions that were supported by fluorescence intensity data from a 60K SNP chip. One of the candidate deletions was from the Red Junglefowl, and one was present in both the Red Junglefowl and White Leghorn. Both candidate deletions spanned protein-coding regions and were close to a previously detected quantitative trait locus affecting body weight in this population. This study demonstrates that the ~50K SNP genotyping arrays now available for several agricultural species can be used to identify null alleles and deletions in data from large families. We suggest that our approach could be a useful complement to linkage analysis in experimental crosses.

  8. Single-copy gene based 50 K SNP chip for genetic studies and molecular breeding in rice

    PubMed Central

    Singh, Nisha; Jayaswal, Pawan Kumar; Panda, Kabita; Mandal, Paritra; Kumar, Vinod; Singh, Balwant; Mishra, Shefali; Singh, Yashi; Singh, Renu; Rai, Vandna; Gupta, Anita; Raj Sharma, Tilak; Singh, Nagendra Kumar

    2015-01-01

    Single nucleotide polymorphism (SNP) is the most abundant DNA sequence variation present in plant genomes. Here, we report the design and validation of a unique genic-SNP genotyping chip for genetic and evolutionary studies as well as molecular breeding applications in rice. The chip incorporates 50,051 SNPs from 18,980 different genes spanning 12 rice chromosomes, including 3,710 single-copy (SC) genes conserved between wheat and rice, 14,959 SC genes unique to rice, 194 agronomically important cloned rice genes and 117 multi-copy rice genes. Assays with this chip showed high success rate and reproducibility because of the SC gene based array with no sequence redundancy and cross-hybridisation problems. The usefulness of the chip in genetic diversity and phylogenetic studies of cultivated and wild rice germplasm was demonstrated. Furthermore, its efficacy was validated for analysing background recovery in improved mega rice varieties with submergence tolerance developed through marker-assisted backcross breeding. PMID:26111882

  9. Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms

    PubMed Central

    2012-01-01

    Background A genome-wide set of single nucleotide polymorphisms (SNPs) is a valuable resource in genetic research and breeding and is usually developed by re-sequencing a genome. If a genome sequence is not available, an alternative strategy must be used. We previously reported the development of a pipeline (AGSNP) for genome-wide SNP discovery in coding sequences and other single-copy DNA without a complete genome sequence in self-pollinating (autogamous) plants. Here we updated this pipeline for SNP discovery in outcrossing (allogamous) species and demonstrated its efficacy in SNP discovery in walnut (Juglans regia L.). Results The first step in the original implementation of the AGSNP pipeline was the construction of a reference sequence and the identification of single-copy sequences in it. To identify single-copy sequences, multiple genome equivalents of short SOLiD reads of another individual were mapped to shallow genome coverage of long Sanger or Roche 454 reads making up the reference sequence. The relative depth of SOLiD reads was used to filter out repeated sequences from single-copy sequences in the reference sequence. The second step was a search for SNPs between SOLiD reads and the reference sequence. Polymorphism within the mapped SOLiD reads would have precluded SNP discovery; hence both individuals had to be homozygous. The AGSNP pipeline was updated here for using SOLiD or other type of short reads of a heterozygous individual for these two principal steps. A total of 32.6X walnut genome equivalents of SOLiD reads of vegetatively propagated walnut scion cultivar ‘Chandler’ were mapped to 48,661 ‘Chandler’ bacterial artificial chromosome (BAC) end sequences (BESs) produced by Sanger sequencing during the construction of a walnut physical map. A total of 22,799 putative SNPs were initially identified. A total of 6,000 Infinium II type SNPs evenly distributed along the walnut physical map were selected for the construction of an Infinium Bead

  10. The Impact of a Common MDM2 SNP on the Sensitivity of Breast Cancer to Treatment

    DTIC Science & Technology

    2012-06-01

    could decrease the effectiveness of treatment. These outcomes are likely due to the increased expression of mdm2 protein in SNP309 individuals, which...expression at the protein level occur in the mdm2 SNP309 cell line. There was no association between the mdm2 SNP309 and clinical outcome of breast cancer...with chemotherapy, hormonal therapy and radiation therapy. 1S. SUBJECT TERMS mdm2, breast cancer, polymorphisms 16. SECURITY CLASSIFICATION OF: 17

  11. A genome-wide search for common SNP x SNP interactions on the risk of venous thrombosis

    PubMed Central

    2013-01-01

    Background Venous Thrombosis (VT) is a common multifactorial disease with an estimated heritability between 35% and 60%. Known genetic polymorphisms identified so far only explain ~5% of the genetic variance of the disease. This study was aimed to investigate whether pair-wise interactions between common single nucleotide polymorphisms (SNPs) could exist and modulate the risk of VT. Methods A genome-wide SNP x SNP interaction analysis on VT risk was conducted in a French case–control study and the most significant findings were tested for replication in a second independent French case–control sample. The results obtained in the two studies totaling 1,953 cases and 2,338 healthy subjects were combined into a meta-analysis. Results The smallest observed p-value for interaction was p = 6.00 10-11 but it did not pass the Bonferroni significance threshold of 1.69 10-12 correcting for the number of investigated interactions that was 2.96 1010. Among the 37 suggestive pair-wise interactions with p-value less than 10-8, one was further shown to involve two SNPs, rs9804128 (IGFS21 locus) and rs4784379 (IRX3 locus) that demonstrated significant interactive effects (p = 4.83 10-5) on the variability of plasma Factor VIII levels, a quantitative biomarker of VT risk, in a sample of 1,091 VT patients. Conclusion This study, the first genome-wide SNP interaction analysis conducted so far on VT risk, suggests that common SNPs are unlikely exerting strong interactive effects on the risk of disease. PMID:23509962

  12. Review of alignment and SNP calling algorithms for next-generation sequencing data.

    PubMed

    Mielczarek, M; Szyda, J

    2016-02-01

    Application of the massive parallel sequencing technology has become one of the most important issues in life sciences. Therefore, it was crucial to develop bioinformatics tools for next-generation sequencing (NGS) data processing. Currently, two of the most significant tasks include alignment to a reference genome and detection of single nucleotide polymorphisms (SNPs). In many types of genomic analyses, great numbers of reads need to be mapped to the reference genome; therefore, selection of the aligner is an essential step in NGS pipelines. Two main algorithms-suffix tries and hash tables-have been introduced for this purpose. Suffix array-based aligners are memory-efficient and work faster than hash-based aligners, but they are less accurate. In contrast, hash table algorithms tend to be slower, but more sensitive. SNP and genotype callers may also be divided into two main different approaches: heuristic and probabilistic methods. A variety of software has been subsequently developed over the past several years. In this paper, we briefly review the current development of NGS data processing algorithms and present the available software.

  13. Prospective diagnostic analysis of copy number variants using SNP microarrays in individuals with autism spectrum disorders

    PubMed Central

    Nava, Caroline; Keren, Boris; Mignot, Cyril; Rastetter, Agnès; Chantot-Bastaraud, Sandra; Faudet, Anne; Fonteneau, Eric; Amiet, Claire; Laurent, Claudine; Jacquette, Aurélia; Whalen, Sandra; Afenjar, Alexandra; Périsse, Didier; Doummar, Diane; Dorison, Nathalie; Leboyer, Marion; Siffroi, Jean-Pierre; Cohen, David; Brice, Alexis; Héron, Delphine; Depienne, Christel

    2014-01-01

    Copy number variants (CNVs) have repeatedly been found to cause or predispose to autism spectrum disorders (ASDs). For diagnostic purposes, we screened 194 individuals with ASDs for CNVs using Illumina SNP arrays. In several probands, we also analyzed candidate genes located in inherited deletions to unmask autosomal recessive variants. Three CNVs, a de novo triplication of chromosome 15q11–q12 of paternal origin, a deletion on chromosome 9p24 and a de novo 3q29 deletion, were identified as the cause of the disorder in one individual each. An autosomal recessive cause was considered possible in two patients: a homozygous 1p31.1 deletion encompassing PTGER3 and a deletion of the entire DOCK10 gene associated with a rare hemizygous missense variant. We also identified multiple private or recurrent CNVs, the majority of which were inherited from asymptomatic parents. Although highly penetrant CNVs or variants inherited in an autosomal recessive manner were detected in rare cases, our results mainly support the hypothesis that most CNVs contribute to ASDs in association with other CNVs or point variants located elsewhere in the genome. Identification of these genetic interactions in individuals with ASDs constitutes a formidable challenge. PMID:23632794

  14. Prospective diagnostic analysis of copy number variants using SNP microarrays in individuals with autism spectrum disorders.

    PubMed

    Nava, Caroline; Keren, Boris; Mignot, Cyril; Rastetter, Agnès; Chantot-Bastaraud, Sandra; Faudet, Anne; Fonteneau, Eric; Amiet, Claire; Laurent, Claudine; Jacquette, Aurélia; Whalen, Sandra; Afenjar, Alexandra; Périsse, Didier; Doummar, Diane; Dorison, Nathalie; Leboyer, Marion; Siffroi, Jean-Pierre; Cohen, David; Brice, Alexis; Héron, Delphine; Depienne, Christel

    2014-01-01

    Copy number variants (CNVs) have repeatedly been found to cause or predispose to autism spectrum disorders (ASDs). For diagnostic purposes, we screened 194 individuals with ASDs for CNVs using Illumina SNP arrays. In several probands, we also analyzed candidate genes located in inherited deletions to unmask autosomal recessive variants. Three CNVs, a de novo triplication of chromosome 15q11-q12 of paternal origin, a deletion on chromosome 9p24 and a de novo 3q29 deletion, were identified as the cause of the disorder in one individual each. An autosomal recessive cause was considered possible in two patients: a homozygous 1p31.1 deletion encompassing PTGER3 and a deletion of the entire DOCK10 gene associated with a rare hemizygous missense variant. We also identified multiple private or recurrent CNVs, the majority of which were inherited from asymptomatic parents. Although highly penetrant CNVs or variants inherited in an autosomal recessive manner were detected in rare cases, our results mainly support the hypothesis that most CNVs contribute to ASDs in association with other CNVs or point variants located elsewhere in the genome. Identification of these genetic interactions in individuals with ASDs constitutes a formidable challenge.

  15. SNPs Array Karyotyping in Non-Hodgkin Lymphoma

    PubMed Central

    Etebari, Maryam; Navari, Mohsen; Piccaluga, Pier Paolo

    2015-01-01

    The traditional methods for detection of chromosomal aberrations, which included cytogenetic or gene candidate solutions, suffered from low sensitivity or the need for previous knowledge of the target regions of the genome. With the advent of single nucleotide polymorphism (SNP) arrays, genome screening at global level in order to find chromosomal aberrations like copy number variants, DNA amplifications, deletions, and also loss of heterozygosity became feasible. In this review, we present an update of the knowledge, gained by SNPs arrays, of the genomic complexity of the most important subtypes of non-Hodgkin lymphomas. PMID:27600240

  16. Accuracy of direct genomic values in Holstein bulls and cows using subsets of SNP markers

    PubMed Central

    2010-01-01

    Background At the current price, the use of high-density single nucleotide polymorphisms (SNP) genotyping assays in genomic selection of dairy cattle is limited to applications involving elite sires and dams. The objective of this study was to evaluate the use of low-density assays to predict direct genomic value (DGV) on five milk production traits, an overall conformation trait, a survival index, and two profit index traits (APR, ASI). Methods Dense SNP genotypes were available for 42,576 SNP for 2,114 Holstein bulls and 510 cows. A subset of 1,847 bulls born between 1955 and 2004 was used as a training set to fit models with various sets of pre-selected SNP. A group of 297 bulls born between 2001 and 2004 and all cows born between 1992 and 2004 were used to evaluate the accuracy of DGV prediction. Ridge regression (RR) and partial least squares regression (PLSR) were used to derive prediction equations and to rank SNP based on the absolute value of the regression coefficients. Four alternative strategies were applied to select subset of SNP, namely: subsets of the highest ranked SNP for each individual trait, or a single subset of evenly spaced SNP, where SNP were selected based on their rank for ASI, APR or minor allele frequency within intervals of approximately equal length. Results RR and PLSR performed very similarly to predict DGV, with PLSR performing better for low-density assays and RR for higher-density SNP sets. When using all SNP, DGV predictions for production traits, which have a higher heritability, were more accurate (0.52-0.64) than for survival (0.19-0.20), which has a low heritability. The gain in accuracy using subsets that included the highest ranked SNP for each trait was marginal (5-6%) over a common set of evenly spaced SNP when at least 3,000 SNP were used. Subsets containing 3,000 SNP provided more than 90% of the accuracy that could be achieved with a high-density assay for cows, and 80% of the high-density assay for young bulls

  17. PCR amplification of SNP loci from crude DNA for large-scale genotyping of oomycetes.

    PubMed

    Hu, Jian; Lyon, Rebecca; Zhou, Yuxin; Lamour, Kurt

    2014-01-01

    Similar to other eukaryotes, single nucleotide polymorphism (SNP) markers are abundant in many oomycete plant pathogen genomes. High resolution DNA melting analysis (HR-DMA) is a cost-effective method for SNP genotyping, but like many SNP marker technologies, is limited by the amount and quality of template DNA. We describe PCR preamplification of Phytophthora and Peronospora SNP loci from crude DNA extracted from a small amount of mycelium and/or infected plant tissue to produce sufficient template to genotype at least 10 000 SNPs. The approach is fast, inexpensive, requires minimal biological material and should be useful for many organisms in a variety of contexts.

  18. SNP Markers and Their Impact on Plant Breeding

    PubMed Central

    Mammadov, Jafar; Aggarwal, Rajat; Buyyarapu, Ramesh; Kumpatla, Siva

    2012-01-01

    The use of molecular markers has revolutionized the pace and precision of plant genetic analysis which in turn facilitated the implementation of molecular breeding of crops. The last three decades have seen tremendous advances in the evolution of marker systems and the respective detection platforms. Markers based on single nucleotide polymorphisms (SNPs) have rapidly gained the center stage of molecular genetics during the recent years due to their abundance in the genomes and their amenability for high-throughput detection formats and platforms. Computational approaches dominate SNP discovery methods due to the ever-increasing sequence information in public databases; however, complex genomes pose special challenges in the identification of informative SNPs warranting alternative strategies in those crops. Many genotyping platforms and chemistries have become available making the use of SNPs even more attractive and efficient. This paper provides a review of historical and current efforts in the development, validation, and application of SNP markers in QTL/gene discovery and plant breeding by discussing key experimental strategies and cases exemplifying their impact. PMID:23316221

  19. Eigenanalysis of SNP data with an identity by descent interpretation.

    PubMed

    Zheng, Xiuwen; Weir, Bruce S

    2016-02-01

    Principal component analysis (PCA) is widely used in genome-wide association studies (GWAS), and the principal component axes often represent perpendicular gradients in geographic space. The explanation of PCA results is of major interest for geneticists to understand fundamental demographic parameters. Here, we provide an interpretation of PCA based on relatedness measures, which are described by the probability that sets of genes are identical-by-descent (IBD). An approximately linear transformation between ancestral proportions (AP) of individuals with multiple ancestries and their projections onto the principal components is found. In addition, a new method of eigenanalysis "EIGMIX" is proposed to estimate individual ancestries. EIGMIX is a method of moments with computational efficiency suitable for millions of SNP data, and it is not subject to the assumption of linkage equilibrium. With the assumptions of multiple ancestries and their surrogate ancestral samples, EIGMIX is able to infer ancestral proportions (APs) of individuals. The methods were applied to the SNP data from the HapMap Phase 3 project and the Human Genome Diversity Panel. The APs of individuals inferred by EIGMIX are consistent with the findings of the program ADMIXTURE. In conclusion, EIGMIX can be used to detect population structure and estimate genome-wide ancestral proportions with a relatively high accuracy.

  20. Structural Architecture of SNP Effects on Complex Traits

    PubMed Central

    Gamazon, Eric R.; Cox, Nancy J.; Davis, Lea K.

    2014-01-01

    Despite the discovery of copy-number variation (CNV) across the genome nearly 10 years ago, current SNP-based analysis methodologies continue to collapse the homozygous (i.e., A/A), hemizygous (i.e., A/0), and duplicative (i.e., A/A/A) genotype states, treating the genotype variable as irreducible or unaltered by other colocalizing forms of genetic (e.g., structural) variation. Our understanding of common, genome-wide CNVs suggests that the canonical genotype construct might belie the enormous complexity of the genome. Here we present multiple analyses of several phenotypes and provide methods supporting a conceptual shift that embraces the structural dimension of genotype. We comprehensively investigate the impact of the structural dimension of genotype on (1) GWAS methods, (2) interpretation of rare LOF variants, (3) characterization of genomic architecture, and (4) implications for mapping loci involved in complex disease. Taken together, these results argue for the inclusion of a structural dimension and suggest that some portion of the “missing” heritability might be recovered through integration of the structural dimension of SNP effects on complex traits. PMID:25307299

  1. A Genome-Wide Association Study for Agronomic Traits in Soybean Using SNP Markers and SNP-Based Haplotype Analysis

    PubMed Central

    de Oliveira, Marco Antônio Rott; Higashi, Wilson; Scapim, Carlos Alberto; Schuster, Ivan

    2017-01-01

    Mapping quantitative trait loci through the use of linkage disequilibrium (LD) in populations of unrelated individuals provides a valuable approach for dissecting the genetic basis of complex traits in soybean (Glycine max). The haplotype-based genome-wide association study (GWAS) has now been proposed as a complementary approach to intensify benefits from LD, which enable to assess the genetic determinants of agronomic traits. In this study a GWAS was undertaken to identify genomic regions that control 100-seed weight (SW), plant height (PH) and seed yield (SY) in a soybean association mapping panel using single nucleotide polymorphism (SNP) markers and haplotype information. The soybean cultivars (N = 169) were field-evaluated across four locations of southern Brazil. The genome-wide haplotype association analysis (941 haplotypes) identified eleven, seventeen and fifty-nine SNP-based haplotypes significantly associated with SY, SW and PH, respectively. Although most marker-trait associations were environment and trait specific, stable haplotype associations were identified for SY and SW across environments (i.e., haplotypes Gm12_Hap12). The haplotype block 42 on Chr19 (Gm19_Hap42) was confirmed to be associated with PH in two environments. These findings enable us to refine the breeding strategy for tropical soybean, which confirm that haplotype-based GWAS can provide new insights on the genetic determinants that are not captured by the single-marker approach. PMID:28152092

  2. SNP Discovery for mapping alien introgressions in wheat

    PubMed Central

    2014-01-01

    Background Monitoring alien introgressions in crop plants is difficult due to the lack of genetic and molecular mapping information on the wild crop relatives. The tertiary gene pool of wheat is a very important source of genetic variability for wheat improvement against biotic and abiotic stresses. By exploring the 5Mg short arm (5MgS) of Aegilops geniculata, we can apply chromosome genomics for the discovery of SNP markers and their use for monitoring alien introgressions in wheat (Triticum aestivum L). Results The short arm of chromosome 5Mg of Ae. geniculata Roth (syn. Ae. ovata L.; 2n = 4x = 28, UgUgMgMg) was flow-sorted from a wheat line in which it is maintained as a telocentric chromosome. DNA of the sorted arm was amplified and sequenced using an Illumina Hiseq 2000 with ~45x coverage. The sequence data was used for SNP discovery against wheat homoeologous group-5 assemblies. A total of 2,178 unique, 5MgS-specific SNPs were discovered. Randomly selected samples of 59 5MgS-specific SNPs were tested (44 by KASPar assay and 15 by Sanger sequencing) and 84% were validated. Of the selected SNPs, 97% mapped to a chromosome 5Mg addition to wheat (the source of t5MgS), and 94% to 5Mg introgressed from a different accession of Ae. geniculata substituting for chromosome 5D of wheat. The validated SNPs also identified chromosome segments of 5MgS origin in a set of T5D-5Mg translocation lines; eight SNPs (25%) mapped to TA5601 [T5DL · 5DS-5MgS(0.75)] and three (8%) to TA5602 [T5DL · 5DS-5MgS (0.95)]. SNPs (gsnp_5ms83 and gsnp_5ms94), tagging chromosome T5DL · 5DS-5MgS(0.95) with the smallest introgression carrying resistance to leaf rust (Lr57) and stripe rust (Yr40), were validated in two released germplasm lines with Lr57 and Yr40 genes. Conclusion This approach should be widely applicable for the identification of species/genome-specific SNPs. The development of a large number of SNP markers will facilitate the precise introgression and

  3. Linear reduction method for predictive and informative tag SNP selection.

    PubMed

    He, Jingwu; Westbrooks, Kelly; Zelikovsky, Alexander

    2005-01-01

    Constructing a complete human haplotype map is helpful when associating complex diseases with their related SNPs. Unfortunately, the number of SNPs is very large and it is costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNPs that should be sequenced to a small number of informative representatives called tag SNPs. In this paper, we propose a new linear algebra-based method for selecting and using tag SNPs. We measure the quality of our tag SNP selection algorithm by comparing actual SNPs with SNPs predicted from selected linearly independent tag SNPs. Our experiments show that for sufficiently long haplotypes, knowing only 0.4% of all SNPs the proposed linear reduction method predicts an unknown haplotype with the error rate below 2% based on 10% of the population.

  4. Grouping preprocess for haplotype inference from SNP and CNV data

    NASA Astrophysics Data System (ADS)

    Shindo, Hiroyuki; Chigira, Hiroshi; Nagaoka, Tomoyo; Kamatani, Naoyuki; Inoue, Masato

    2009-12-01

    The method of statistical haplotype inference is an indispensable technique in the field of medical science. The authors previously reported Hardy-Weinberg equilibrium-based haplotype inference that could manage single nucleotide polymorphism (SNP) data. We recently extended the method to cover copy number variation (CNV) data. Haplotype inference from mixed data is important because SNPs and CNVs are occasionally in linkage disequilibrium. The idea underlying the proposed method is simple, but the algorithm for it needs to be quite elaborate to reduce the calculation cost. Consequently, we have focused on the details on the algorithm in this study. Although the main advantage of the method is accuracy, in that it does not use any approximation, its main disadvantage is still the calculation cost, which is sometimes intractable for large data sets with missing values.

  5. Authentication of medicinal plants by SNP-based multiplex PCR.

    PubMed

    Lee, Ok Ran; Kim, Min-Kyeoung; Yang, Deok-Chun

    2012-01-01

    Highly variable intergenic spacer and intron regions from nuclear and cytoplasmic DNA have been used for species identification. Noncoding internal transcribed spacers (ITSs) located in 18S-5.8S-26S, and 5S ribosomal RNA genes (rDNAs) represent suitable region for medicinal plant authentication. Noncoding regions from two cytoplasmic DNA, chloroplast DNA (trnT-F intergenic spacer region), and mitochondrial DNA (fourth intron region of nad7 gene) are also successfully applied for the proper identification of medicinal plants. Single-nucleotide polymorphism (SNP) sites obtained from the amplification of intergenic spacer and intron regions are properly utilized for the verification of medicinal plants in species level using multiplex PCR. Multiplex PCR as a variant of PCR technique used to amplify more than two loci simultaneously.

  6. Methods for the design, implementation, and analysis of illumina infinium™ SNP assays in plants.

    PubMed

    Chagné, David; Bianco, Luca; Lawley, Cindy; Micheletti, Diego; Jacobs, Jeanne M E

    2015-01-01

    The advent of Next-Generation sequencing-by-synthesis technologies has fuelled SNP discovery, genotyping, and screening of populations in myriad ways for many species, including various plant species. One technique widely applied to screening a large number of SNP markers over a large number of samples is the Illumina Infinium™ assay.

  7. A genome-wide SNP panel for genetic diversity, mapping and breeding studies in rice

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide SNP resource was developed for rice using the GoldenGate assay and used to genotype 400 landrace accessions of O. sativa. SNPs were originally discovered using Perlegen re-sequencing technology in 20 diverse landraces of O. sativa as part of OryzaSNP project (http://irfgc.irri.org). An...

  8. A Coordinated Approach to Peach SNP Discovery in RosBREED

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In the USDA-funded multi-institutional and trans-disciplinary project, “RosBREED”, crop-specific SNP genome scan platforms are being developed for peach, apple, strawberry, and cherry at a resolution of at least one polymorphic SNP marker every 5 cM in any random cross, for use in Pedigree-Based Ana...

  9. Global Arrays

    SciTech Connect

    Krishnamoorthy, Sriram; Daily, Jeffrey A.; Vishnu, Abhinav; Palmer, Bruce J.

    2015-11-01

    Global Arrays (GA) is a distributed-memory programming model that allows for shared-memory-style programming combined with one-sided communication, to create a set of tools that combine high performance with ease-of-use. GA exposes a relatively straightforward programming abstraction, while supporting fully-distributed data structures, locality of reference, and high-performance communication. GA was originally formulated in the early 1990’s to provide a communication layer for the Northwest Chemistry (NWChem) suite of chemistry modeling codes that was being developed concurrently.

  10. TNF-alpha SNP haplotype frequencies in equidae.

    PubMed

    Brown, J J; Ollier, W E R; Thomson, W; Matthews, J B; Carter, S D; Binns, M; Pinchbeck, G; Clegg, P D

    2006-05-01

    Tumour necrosis factor alpha (TNF-alpha) is a pro-inflammatory cytokine that plays a crucial role in the regulation of inflammatory and immune responses. In all vertebrate species the genes encoding TNF-alpha are located within the major histocompatability complex. In the horse TNF-alpha has been ascribed a role in a variety of important disease processes. Previously two single nucleotide polymorphisms (SNPs) have been reported within the 5' un-translated region of the equine TNF-alpha gene. We have examined the equine TNF-alpha promoter region further for additional SNPs by analysing DNA from 131 horses (Equus caballus), 19 donkeys (E. asinus), 2 Grant's zebras (E. burchellii boehmi) and one onager (E. hemionus). Two further SNPs were identified at nucleotide positions 24 (T/G) and 452 (T/C) relative to the first nucleotide of the 522 bp polymerase chain reaction product. A sequence variant at position 51 was observed between equidae. SNaPSHOT genotyping assays for these and the two previously reported SNPs were performed on 457 horses comprising seven different breeds and 23 donkeys to determine the gene frequencies. SNP frequencies varied considerably between different horse breeds and also between the equine species. In total, nine different TNF-alpha promoter SNP haplotypes and their frequencies were established amongst the various equidae examined, with some haplotypes being found only in horses and others only in donkeys or zebras. The haplotype frequencies observed varied greatly between different horse breeds. Such haplotypes may relate to levels of TNF-alpha production and disease susceptibility and further investigation is required to identify associations between particular haplotypes and altered risk of disease.

  11. Design and characterization of a 52K SNP chip for goats.

    PubMed

    Tosser-Klopp, Gwenola; Bardou, Philippe; Bouchez, Olivier; Cabau, Cédric; Crooijmans, Richard; Dong, Yang; Donnadieu-Tonon, Cécile; Eggen, André; Heuven, Henri C M; Jamli, Saadiah; Jiken, Abdullah Johari; Klopp, Christophe; Lawley, Cynthia T; McEwan, John; Martin, Patrice; Moreno, Carole R; Mulsant, Philippe; Nabihoudine, Ibouniyamine; Pailhoux, Eric; Palhière, Isabelle; Rupp, Rachel; Sarry, Julien; Sayre, Brian L; Tircazes, Aurélie; Jun Wang; Wang, Wen; Zhang, Wenguang

    2014-01-01

    The success of Genome Wide Association Studies in the discovery of sequence variation linked to complex traits in humans has increased interest in high throughput SNP genotyping assays in livestock species. Primary goals are QTL detection and genomic selection. The purpose here was design of a 50-60,000 SNP chip for goats. The success of a moderate density SNP assay depends on reliable bioinformatic SNP detection procedures, the technological success rate of the SNP design, even spacing of SNPs on the genome and selection of Minor Allele Frequencies (MAF) suitable to use in diverse breeds. Through the federation of three SNP discovery projects consolidated as the International Goat Genome Consortium, we have identified approximately twelve million high quality SNP variants in the goat genome stored in a database together with their biological and technical characteristics. These SNPs were identified within and between six breeds (meat, milk and mixed): Alpine, Boer, Creole, Katjang, Saanen and Savanna, comprising a total of 97 animals. Whole genome and Reduced Representation Library sequences were aligned on >10 kb scaffolds of the de novo goat genome assembly. The 60,000 selected SNPs, evenly spaced on the goat genome, were submitted for oligo manufacturing (Illumina, Inc) and published in dbSNP along with flanking sequences and map position on goat assemblies (i.e. scaffolds and pseudo-chromosomes), sheep genome V2 and cattle UMD3.1 assembly. Ten breeds were then used to validate the SNP content and 52,295 loci could be successfully genotyped and used to generate a final cluster file. The combined strategy of using mainly whole genome Next Generation Sequencing and mapping on a contig genome assembly, complemented with Illumina design tools proved to be efficient in producing this GoatSNP50 chip. Advances in use of molecular markers are expected to accelerate goat genomic studies in coming years.

  12. Evaluating the repair of DNA derived from formalin-fixed paraffin-embedded tissues prior to genomic profiling by SNP-CGH analysis.

    PubMed

    Hosein, Abdel Nasser; Song, Sarah; McCart Reed, Amy E; Jayanthan, Janani; Reid, Lynne E; Kutasovic, Jamie R; Cummings, Margaret C; Waddell, Nic; Lakhani, Sunil R; Chenevix-Trench, Georgia; Simpson, Peter T

    2013-06-01

    Pathology archives contain vast resources of clinical material in the form of formalin-fixed paraffin-embedded (FFPE) tissue samples. Owing to the methods of tissue fixation and storage, the integrity of DNA and RNA available from FFPE tissue is compromized, which means obtaining informative data regarding epigenetic, genomic, and expression alterations can be challenging. Here, we have investigated the utility of repairing damaged DNA derived from FFPE tumors prior to single-nucleotide polymorphism (SNP) arrays for whole-genome DNA copy number analysis. DNA was extracted from FFPE samples spanning five decades, involving tumor material obtained from surgical specimens and postmortems. Various aspects of the protocol were assessed, including the method of DNA extraction, the role of Quality Control quantitative PCR (qPCR) in predicting sample success, and the effect of DNA restoration on assay performance, data quality, and the prediction of copy number aberrations (CNAs). DNA that had undergone the repair process yielded higher SNP call rates, reduced log R ratio variance, and improved calling of CNAs compared with matched FFPE DNA not subjected to repair. Reproducible mapping of genomic break points and detection of focal CNAs representing high-level gains and homozygous deletions (HD) were possible, even on autopsy material obtained in 1974. For example, DNA amplifications at the ERBB2 and EGFR gene loci and a HD mapping to 13q14.2 were validated using immunohistochemistry, in situ hybridization, and qPCR. The power of SNP arrays lies in the detection of allele-specific aberrations; however, this aspect of the analysis remains challenging, particularly in the distinction between loss of heterozygosity (LOH) and copy neutral LOH. In summary, attempting to repair DNA that is damaged during fixation and storage may be a useful pretreatment step for genomic studies of large archival FFPE cohorts with long-term follow-up or for understanding rare cancer types, where

  13. Rice SNP-seek database update: new SNPs, indels, and queries

    PubMed Central

    Mansueto, Locedie; Fuentes, Roven Rommel; Borja, Frances Nikki; Detras, Jeffery; Abriol-Santos, Juan Miguel; Chebotarov, Dmytro; Sanciangco, Millicent; Palis, Kevin; Copetti, Dario; Poliakov, Alexandre; Dubchak, Inna; Solovyev, Victor; Wing, Rod A.; Hamilton, Ruaraidh Sackville; Mauleon, Ramil; McNally, Kenneth L.; Alexandrov, Nickolai

    2017-01-01

    We describe updates to the Rice SNP-Seek Database since its first release. We ran a new SNP-calling pipeline followed by filtering that resulted in complete, base, filtered and core SNP datasets. Besides the Nipponbare reference genome, the pipeline was run on genome assemblies of IR 64, 93-11, DJ 123 and Kasalath. New genotype query and display features are added for reference assemblies, SNP datasets and indels. JBrowse now displays BAM, VCF and other annotation tracks, the additional genome assemblies and an embedded VISTA genome comparison viewer. Middleware is redesigned for improved performance by using a hybrid of HDF5 and RDMS for genotype storage. Query modules for genotypes, varieties and genes are improved to handle various constraints. An integrated list manager allows the user to pass query parameters for further analysis. The SNP Annotator adds traits, ontology terms, effects and interactions to markers in a list. Web-service calls were implemented to access most data. These features enable seamless querying of SNP-Seek across various biological entities, a step toward semi-automated gene-trait association discovery. URL: http://snp-seek.irri.org. PMID:27899667

  14. Rice SNP-seek database update: new SNPs, indels, and queries.

    PubMed

    Mansueto, Locedie; Fuentes, Roven Rommel; Borja, Frances Nikki; Detras, Jeffery; Abriol-Santos, Juan Miguel; Chebotarov, Dmytro; Sanciangco, Millicent; Palis, Kevin; Copetti, Dario; Poliakov, Alexandre; Dubchak, Inna; Solovyev, Victor; Wing, Rod A; Hamilton, Ruaraidh Sackville; Mauleon, Ramil; McNally, Kenneth L; Alexandrov, Nickolai

    2017-01-04

    We describe updates to the Rice SNP-Seek Database since its first release. We ran a new SNP-calling pipeline followed by filtering that resulted in complete, base, filtered and core SNP datasets. Besides the Nipponbare reference genome, the pipeline was run on genome assemblies of IR 64, 93-11, DJ 123 and Kasalath. New genotype query and display features are added for reference assemblies, SNP datasets and indels. JBrowse now displays BAM, VCF and other annotation tracks, the additional genome assemblies and an embedded VISTA genome comparison viewer. Middleware is redesigned for improved performance by using a hybrid of HDF5 and RDMS for genotype storage. Query modules for genotypes, varieties and genes are improved to handle various constraints. An integrated list manager allows the user to pass query parameters for further analysis. The SNP Annotator adds traits, ontology terms, effects and interactions to markers in a list. Web-service calls were implemented to access most data. These features enable seamless querying of SNP-Seek across various biological entities, a step toward semi-automated gene-trait association discovery. URL: http://snp-seek.irri.org.

  15. Identification, validation and survey of a single nucleotide polymorphism (SNP) associated with pungency in Capsicum spp.

    PubMed

    Garcés-Claver, Ana; Fellman, Shanna Moore; Gil-Ortega, Ramiro; Jahn, Molly; Arnedo-Andrés, María S

    2007-11-01

    A single nucleotide polymorphism (SNP) associated with pungency was detected within an expressed sequence tag (EST) of 307 bp. This fragment was identified after expression analysis of the EST clone SB2-66 in placenta tissue of Capsicum fruits. Sequence alignments corresponding to this new fragment allowed us to identify an SNP between pungent and non-pungent accessions. Two methods were chosen for the development of the SNP marker linked to pungency: tetra-primer amplification refractory mutation system-PCR (tetra-primer ARMS-PCR) and cleaved amplified polymorphic sequence. Results showed that both methods were successful in distinguishing genotypes. Nevertheless, tetra-primer ARMS-PCR was chosen for SNP genotyping because it was more rapid, reliable and less cost-effective. The utility of this SNP marker for pungency was demonstrated by the ability to distinguish between 29 pungent and non-pungent cultivars of Capsicum annuum. In addition, the SNP was also associated with phenotypic pungent character in the tested genotypes of C. chinense, C. baccatum, C. frutescens, C. galapagoense, C. eximium, C. tovarii and C. cardenasi. This SNP marker is a faster, cheaper and more reproducible method for identifying pungent peppers than other techniques such as panel tasting, and allows rapid screening of the trait in early growth stages.

  16. QuickSNP: an automated web server for selection of tagSNPs

    PubMed Central

    Grover, Deepak; Woodfield, Alonzo S.; Verma, Ranjana; Zandi, Peter P.; Levinson, Douglas F.; Potash, James B.

    2007-01-01

    Although large-scale genetic association studies involving hundreds to thousands of SNPs have become feasible, the associated cost is substantial. Even with the increased efficiency introduced by the use of tagSNPs, researchers are often seeking ways to maximize resource utilization given a set of SNP-based gene-mapping goals. We have developed a web server named QuickSNP in order to provide cost-effective selection of SNPs, and to fill in some of the gaps in existing SNP selection tools. One useful feature of QuickSNP is the option to select only gene-centric SNPs from a chromosomal region in an automated fashion. Other useful features include automated selection of coding non-synonymous SNPs, SNP filtering based on inter-SNP distances and information regarding the availability of genotyping assays for SNPs and whether they are present on whole genome chips. The program produces user-friendly summary tables and results, and a link to a UCSC Genome Browser track illustrating the position of the selected tagSNPs in relation to genes and other genomic features. We hope the unique combination of features of this server will be useful for researchers aiming to select markers for their genotyping studies. The server is freely available and can be accessed at the URL http://bioinformoodics.jhmi.edu/quickSNP.pl. PMID:17517769

  17. Exploring of new Y-chromosome SNP loci using Pyrosequencing and the SNaPshot methods.

    PubMed

    Wei, Wei; Luo, Hai-Bo; Yan, Jing; Hou, Yi-Ping

    2012-11-01

    The single nucleotide polymorphisms on the Y chromosome (Y-SNP) have been considered to be important in forensic casework. However, Y-SNP loci were mostly population specific and lacked biallelic polymorphisms in the Asian population. In this study, we developed a strategy for seeking and genotyping new Y-SNP markers based on both Pyrosequencing and the SNaPshot methods. As results, 34 new biallelic markers were observed to be polymorphic in the Chinese Han population by estimation of allele frequencies of 103 candidate's Y-SNP loci in DNA pools using Pyrosequencing technology. Then, a multiplex system with 20 Y-SNP loci was genotyped using the SNaPshot™ multiplex kit. Twenty Y-SNP loci defined 56 different haplotypes, and the haplotype diversity was estimated to be 0.9539. Our result demonstrated that the strategy could be used as an efficient tool to search and genotype biallelic markers from a large amount of candidate loci. In addition, 20 Y-SNP loci constructed a multiplex system, which could provide supplementary information for forensic identification.

  18. Novel SNP Discovery in African Buffalo, Syncerus caffer, using high-throughput Sequencing.

    PubMed

    le Roex, Nikki; Noyes, Harry; Brass, Andrew; Bradley, Daniel G; Kemp, Steven J; Kay, Suzanne; van Helden, Paul D; Hoal, Eileen G

    2012-01-01

    The African buffalo, Syncerus caffer, is one of the most abundant and ecologically important species of megafauna in the savannah ecosystem. It is an important prey species, as well as a host for a vast array of nematodes, pathogens and infectious diseases, such as bovine tuberculosis and corridor disease. Large-scale SNP discovery in this species would greatly facilitate further research into the area of host genetics and disease susceptibility, as well as provide a wealth of sequence information for other conservation and genomics studies. We sequenced pools of Cape buffalo DNA from a total of 9 animals, on an ABI SOLiD4 sequencer. The resulting short reads were mapped to the UMD3.1 Bos taurus genome assembly using both BWA and Bowtie software packages. A mean depth of 2.7× coverage over the mapped regions was obtained. Btau4 gene annotation was added to all SNPs identified within gene regions. Bowtie and BWA identified a maximum of 2,222,665 and 276,847 SNPs within the buffalo respectively, depending on analysis method. A panel of 173 SNPs was validated by fluorescent genotyping in 87 individuals. 27 SNPs failed to amplify, and of the remaining 146 SNPs, 43-54% of the Bowtie SNPs and 57-58% of the BWA SNPs were confirmed as polymorphic. dN/dS ratios found no evidence of positive selection, and although there were genes that appeared to be under negative selection, these were more likely to be slowly evolving house-keeping genes.

  19. SNP-based association mapping of the polled gene in divergent cattle breeds.

    PubMed

    Seichter, D; Russ, I; Rothammer, S; Eder, J; Förster, M; Medugorac, I

    2012-10-01

    Naturally, hornless cattle are called polled. Although the POLL locus could be assigned to a c. 1.36-Mb interval in the centromeric region of BTA1, the underlying genetic basis for the polled trait is still unknown. Here, an association mapping design was set up to refine the candidate region of the polled trait for subsequent high-throughput sequencing. The case group comprised 101 homozygous polled animals from nine divergent cattle breeds, the majority represented by Galloway, Angus, Fleckvieh and Holstein Friesian. Additionally, this group included some polled individuals of Blonde d'Aquitaine, Charolais, Hereford, Jersey and Limousin breeds. The control group comprised horned Belgian Blue, Fleckvieh, Holstein Friesian and Illyrian Buša cattle. A genome-wide scan using 49,163 SNPs was performed, which revealed one shared homozygous haplotype block consisting of nine neighbouring SNPs in all polled animals. This segment defines a 381-kb interval on BTA1 that we consider to be the most likely location of the POLL mutation. Our results further demonstrate that the polled-associated haplotype is also frequent in horned animals included in this study, and thus the haplotype as such cannot be used for population-wide genetic testing. The actual trait-associated haplotype may be revealed by using higher-density SNP arrays. For the final identification of the causal mutation, we suggest high-throughput sequencing of the entire candidate region, because the identification of functional candidate genes is difficult owing to the lack of a comparable model.

  20. A system for exact and approximate genetic linkage analysis of SNP data in large pedigrees

    PubMed Central

    Silberstein, Mark; Weissbrod, Omer; Otten, Lars; Tzemach, Anna; Anisenia, Andrei; Shtark, Oren; Tuberg, Dvir; Galfrin, Eddie; Gannon, Irena; Shalata, Adel; Borochowitz, Zvi U.; Dechter, Rina; Thompson, Elizabeth; Geiger, Dan

    2013-01-01

    Motivation: The use of dense single nucleotide polymorphism (SNP) data in genetic linkage analysis of large pedigrees is impeded by significant technical, methodological and computational challenges. Here we describe Superlink-Online SNP, a new powerful online system that streamlines the linkage analysis of SNP data. It features a fully integrated flexible processing workflow comprising both well-known and novel data analysis tools, including SNP clustering, erroneous data filtering, exact and approximate LOD calculations and maximum-likelihood haplotyping. The system draws its power from thousands of CPUs, performing data analysis tasks orders of magnitude faster than a single computer. By providing an intuitive interface to sophisticated state-of-the-art analysis tools coupled with high computing capacity, Superlink-Online SNP helps geneticists unleash the potential of SNP data for detecting disease genes. Results: Computations performed by Superlink-Online SNP are automatically parallelized using novel paradigms, and executed on unlimited number of private or public CPUs. One novel service is large-scale approximate Markov Chain–Monte Carlo (MCMC) analysis. The accuracy of the results is reliably estimated by running the same computation on multiple CPUs and evaluating the Gelman–Rubin Score to set aside unreliable results. Another service within the workflow is a novel parallelized exact algorithm for inferring maximum-likelihood haplotyping. The reported system enables genetic analyses that were previously infeasible. We demonstrate the system capabilities through a study of a large complex pedigree affected with metabolic syndrome. Availability: Superlink-Online SNP is freely available for researchers at http://cbl-hap.cs.technion.ac.il/superlink-snp. The system source code can also be downloaded from the system website. Contact: omerw@cs.technion.ac.il Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23162081

  1. Characterization of polyploid wheat genomic diversity using a high-density 90 000 single nucleotide polymorphism array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    High-density single nucleotide polymorphism (SNP) genotyping chips are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships among individuals in populations and studying marker-trait associations in mapping experiments. We developed a genotyping array includ...

  2. Sequential sentinel SNP Regional Association Plots (SSS-RAP): an approach for testing independence of SNP association signals using meta-analysis data.

    PubMed

    Zheng, Jie; Gaunt, Tom R; Day, Ian N M

    2013-01-01

    Genome-Wide Association Studies (GWAS) frequently incorporate meta-analysis within their framework. However, conditional analysis of individual-level data, which is an established approach for fine mapping of causal sites, is often precluded where only group-level summary data are available for analysis. Here, we present a numerical and graphical approach, "sequential sentinel SNP regional association plot" (SSS-RAP), which estimates regression coefficients (beta) with their standard errors using the meta-analysis summary results directly. Under an additive model, typical for genes with small effect, the effect for a sentinel SNP can be transformed to the predicted effect for a possibly dependent SNP through a 2×2 2-SNP haplotypes table. The approach assumes Hardy-Weinberg equilibrium for test SNPs. SSS-RAP is available as a Web-tool (http://apps.biocompute.org.uk/sssrap/sssrap.cgi). To develop and illustrate SSS-RAP we analyzed lipid and ECG traits data from the British Women's Heart and Health Study (BWHHS), evaluated a meta-analysis for ECG trait and presented several simulations. We compared results with existing approaches such as model selection methods and conditional analysis. Generally findings were consistent. SSS-RAP represents a tool for testing independence of SNP association signals using meta-analysis data, and is also a convenient approach based on biological principles for fine mapping in group level summary data.

  3. SNP-SNP interactions between WNT4 and WNT5A were associated with obesity related traits in Han Chinese Population

    PubMed Central

    Dong, Shan-Shan; Hu, Wei-Xin; Yang, Tie-Lin; Chen, Xiao-Feng; Yan, Han; Chen, Xiang-Ding; Tan, Li-Jun; Tian, Qing; Deng, Hong-Wen; Guo, Yan

    2017-01-01

    Considering the biological roles of WNT4 and WNT5A involved in adipogenesis, we aimed to investigate whether SNPs in WNT4 and WNT5A contribute to obesity related traits in Han Chinese population. Targeted genomic sequence for WNT4 and WNT5A was determined in 100 Han Chinese subjects and tag SNPs were selected. Both single SNP and SNP × SNP interaction association analyses with body mass index (BMI) were evaluated in the 100 subjects and another independent sample of 1,627 Han Chinese subjects. Meta-analyses were performed and multiple testing corrections were carried out using the Bonferroni method. Consistent with the Genetic Investigation of ANthropometric Traits (GIANT) dataset results, we didn’t detect significant association signals in single SNP association analyses. However, the interaction between rs2072920 and rs11918967, was associated with BMI after multiple testing corrections (combined P = 2.20 × 10−4). The signal was also significant in each contributing data set. SNP rs2072920 is located in the 3′-UTR of WNT4 and SNP rs11918967 is located in the intron of WNT5A. Functional annotation results revealed that both SNPs might be involved in transcriptional regulation of gene expression. Our results suggest that a combined effect of SNPs via WNT4-WNT5A interaction may affect the variation of BMI in Han Chinese population. PMID:28272483

  4. Porcine colonization of the Americas: a 60k SNP story.

    PubMed

    Burgos-Paz, W; Souza, C A; Megens, H J; Ramayo-Caldas, Y; Melo, M; Lemús-Flores, C; Caal, E; Soto, H W; Martínez, R; Alvarez, L A; Aguirre, L; Iñiguez, V; Revidatti, M A; Martínez-López, O R; Llambi, S; Esteve-Codina, A; Rodríguez, M C; Crooijmans, R P M A; Paiva, S R; Schook, L B; Groenen, M A M; Pérez-Enciso, M

    2013-04-01

    The pig, Sus scrofa, is a foreign species to the American continent. Although pigs originally introduced in the Americas should be related to those from the Iberian Peninsula and Canary islands, the phylogeny of current creole pigs that now populate the continent is likely to be very complex. Because of the extreme climates that America harbors, these populations also provide a unique example of a fast evolutionary phenomenon of adaptation. Here, we provide a genome wide study of these issues by genotyping, with a 60k SNP chip, 206 village pigs sampled across 14 countries and 183 pigs from outgroup breeds that are potential founders of the American populations, including wild boar, Iberian, international and Chinese breeds. Results show that American village pigs are primarily of European ancestry, although the observed genetic landscape is that of a complex conglomerate. There was no correlation between genetic and geographical distances, neither continent wide nor when analyzing specific areas. Most populations showed a clear admixed structure where the Iberian pig was not necessarily the main component, illustrating how international breeds, but also Chinese pigs, have contributed to extant genetic composition of American village pigs. We also observe that many genes related to the cardiovascular system show an increased differentiation between altiplano and genetically related pigs living near sea level.

  5. Porcine colonization of the Americas: a 60k SNP story

    PubMed Central

    Burgos-Paz, W; Souza, C A; Megens, H J; Ramayo-Caldas, Y; Melo, M; Lemús-Flores, C; Caal, E; Soto, H W; Martínez, R; Álvarez, L A; Aguirre, L; Iñiguez, V; Revidatti, M A; Martínez-López, O R; Llambi, S; Esteve-Codina, A; Rodríguez, M C; Crooijmans, R P M A; Paiva, S R; Schook, L B; Groenen, M A M; Pérez-Enciso, M

    2013-01-01

    The pig, Sus scrofa, is a foreign species to the American continent. Although pigs originally introduced in the Americas should be related to those from the Iberian Peninsula and Canary islands, the phylogeny of current creole pigs that now populate the continent is likely to be very complex. Because of the extreme climates that America harbors, these populations also provide a unique example of a fast evolutionary phenomenon of adaptation. Here, we provide a genome wide study of these issues by genotyping, with a 60k SNP chip, 206 village pigs sampled across 14 countries and 183 pigs from outgroup breeds that are potential founders of the American populations, including wild boar, Iberian, international and Chinese breeds. Results show that American village pigs are primarily of European ancestry, although the observed genetic landscape is that of a complex conglomerate. There was no correlation between genetic and geographical distances, neither continent wide nor when analyzing specific areas. Most populations showed a clear admixed structure where the Iberian pig was not necessarily the main component, illustrating how international breeds, but also Chinese pigs, have contributed to extant genetic composition of American village pigs. We also observe that many genes related to the cardiovascular system show an increased differentiation between altiplano and genetically related pigs living near sea level. PMID:23250008

  6. Rapid SNP Detection and Genotyping of Bacterial Pathogens by Pyrosequencing.

    PubMed

    Amoako, Kingsley K; Thomas, Matthew C; Janzen, Timothy W; Goji, Noriko

    2017-01-01

    Bacterial identification and typing are fixtures of microbiology laboratories and are vital aspects of our response mechanisms in the event of foodborne outbreaks and bioterrorist events. Whole genome sequencing (WGS) is leading the way in terms of expanding our ability to identify and characterize bacteria through the identification of subtle differences between genomes (e.g. single nucleotide polymorphisms (SNPs) and insertions/deletions). Modern high-throughput technologies such as pyrosequencing can facilitate the typing of bacteria by generating short-read sequence data of informative regions identified by WGS analyses, at a fraction of the cost of WGS. Thus, pyrosequencing systems remain a valuable asset in the laboratory today. Presented in this chapter are two methods developed in the Amoako laboratory that detail the identification and genotyping of bacterial pathogens. The first targets canonical single nucleotide polymorphisms (canSNPs) of evolutionary importance in Bacillus anthracis, the causative agent of Anthrax. The second assay detects Shiga-toxin (stx) genes, which are associated with virulence in Escherichia coli and Shigella spp., and differentiates the subtypes of stx-1 and stx-2 based on SNP loci. These rapid methods provide end users with important information regarding virulence traits as well as the evolutionary and biogeographic origin of isolates.

  7. SNP Markers as Additional Information to Resolve Complex Kinship Cases

    PubMed Central

    Pontes, M. Lurdes; Fondevila, Manuel; Laréu, Maria Victoria; Medeiros, Rui

    2015-01-01

    Summary Background DNA profiling with sets of highly polymorphic autosomal short tandem repeat (STR) markers has been applied in various aspects of human identification in forensic casework for nearly 20 years. However, in some cases of complex kinship investigation, the information provided by the conventionally used STR markers is not enough, often resulting in low likelihood ratio (LR) calculations. In these cases, it becomes necessary to increment the number of loci under analysis to reach adequate LRs. Recently, it has been proposed that single nucleotide polymorphisms (SNPs) could be used as a supportive tool to STR typing, eventually even replacing the methods/markers now employed. Methods In this work, we describe the results obtained in 7 revised complex paternity cases when applying a battery of STRs, as well as 52 human identification SNPs (SNPforID 52plex identification panel) using a SNaPshot methodology followed by capillary electrophoresis. Results Our results show that the analysis of SNPs, as complement to STR typing in forensic casework applications, would at least increase by a factor of 4 total PI values and correspondent Essen-Möller's W value. Conclusions We demonstrated that SNP genotyping could be a key complement to STR information in challenging casework of disputed paternity, such as close relative individualization or complex pedigrees subject to endogamous relations. PMID:26733770

  8. Using Mendelian inheritance to improve high-throughput SNP discovery.

    PubMed

    Chen, Nancy; Van Hout, Cristopher V; Gottipati, Srikanth; Clark, Andrew G

    2014-11-01

    Restriction site-associated DNA sequencing or genotyping-by-sequencing (GBS) approaches allow for rapid and cost-effective discovery and genotyping of thousands of single-nucleotide polymorphisms (SNPs) in multiple individuals. However, rigorous quality control practices are needed to avoid high levels of error and bias with these reduced representation methods. We developed a formal statistical framework for filtering spurious loci, using Mendelian inheritance patterns in nuclear families, that accommodates variable-quality genotype calls and missing data--both rampant issues with GBS data--and for identifying sex-linked SNPs. Simulations predict excellent performance of both the Mendelian filter and the sex-linkage assignment under a variety of conditions. We further evaluate our method by applying it to real GBS data and validating a subset of high-quality SNPs. These results demonstrate that our metric of Mendelian inheritance is a powerful quality filter for GBS loci that is complementary to standard coverage and Hardy-Weinberg filters. The described method, implemented in the software MendelChecker, will improve quality control during SNP discovery in nonmodel as well as model organisms.

  9. Imputation of KIR Types from SNP Variation Data

    PubMed Central

    Vukcevic, Damjan; Traherne, James A.; Næss, Sigrid; Ellinghaus, Eva; Kamatani, Yoichiro; Dilthey, Alexander; Lathrop, Mark; Karlsen, Tom H.; Franke, Andre; Moffatt, Miriam; Cookson, William; Trowsdale, John; McVean, Gil; Sawcer, Stephen; Leslie, Stephen

    2015-01-01

    Large population studies of immune system genes are essential for characterizing their role in diseases, including autoimmune conditions. Of key interest are a group of genes encoding the killer cell immunoglobulin-like receptors (KIRs), which have known and hypothesized roles in autoimmune diseases, resistance to viruses, reproductive conditions, and cancer. These genes are highly polymorphic, which makes typing expensive and time consuming. Consequently, despite their importance, KIRs have been little studied in large cohorts. Statistical imputation methods developed for other complex loci (e.g., human leukocyte antigen [HLA]) on the basis of SNP data provide an inexpensive high-throughput alternative to direct laboratory typing of these loci and have enabled important findings and insights for many diseases. We present KIR∗IMP, a method for imputation of KIR copy number. We show that KIR∗IMP is highly accurate and thus allows the study of KIRs in large cohorts and enables detailed investigation of the role of KIRs in human disease. PMID:26430804

  10. Use of molecular variation in the NCBI dbSNP database.

    PubMed

    Sherry, S T; Ward, M; Sirotkin, K

    2000-01-01

    While high quality information regarding variation in genes is currently available in locus-specific or specialized mutation databases, the need remains for a general catalog of genome variation to address the large-scale sampling designs required by association studies, gene mapping, and evolutionary biology. In response to this need, the National Center for Biotechnology Information (NCBI) has established the dbSNP database http://ncbi. nlm.nih.gov/SNP/ to serve as a generalized, central variation database. Submissions to dbSNP will be integrated with other sources of information at NCBI such as GenBank, PubMed, LocusLink, and the Human Genome Project data, and the complete contents of dbSNP are available to the public via anonymous FTP. Hum Mutat 15:68-75, 2000. Published 2000 Wiley-Liss, Inc.

  11. Set up of cutoff thresholds for kinship determination using SNP loci.

    PubMed

    Cho, Sohee; Shin, Eun Soon; Yu, Hyung Jin; Lee, Ji Hyun; Seo, Hee Jin; Kim, Moon Young; Lee, Soong Deok

    2017-03-08

    The usefulness of single nucleotide polymorphism (SNP) loci for kinship testing has been demonstrated in many case works, and suggested as a promising marker for relationship identification. For interpreting results based on the calculation of the likelihood ratio (LR) in kinship testing, it is important to prepare cutoffs for respective relatives which are dependent on genetic relatedness. For this, analysis using true pedigree data is significant and reliable as it reflects the actual frequencies of markers in the population. In this study, the kinship index was explored through 1209 parent-child pairs, 1373 full sibling pairs, and 247 uncle-nephew pairs using 136 SNP loci. The cutoffs for LR were set up using different numbers of SNP loci with accuracy, sensitivity, and specificity. It is expected that this study can support the application of SNP loci-based kinship testing for various relationships.

  12. SNP discovery and genotyping using Genotyping-by-Sequencing in Pekin ducks

    PubMed Central

    Zhu, Feng; Cui, Qian-Qian; Hou, Zhuo-Cheng

    2016-01-01

    Genomic selection and genome-wide association studies need thousands to millions of SNPs. However, many non-model species do not have reference chips for detecting variation. Our goal was to develop and validate an inexpensive but effective method for detecting SNP variation. Genotyping by sequencing (GBS) can be a highly efficient strategy for genome-wide SNP detection, as an alternative to microarray chips. Here, we developed a GBS protocol for ducks and tested it to genotype 49 Pekin ducks. A total of 169,209 SNPs were identified from all animals, with a mean of 55,920 SNPs per individual. The average SNP density reached 1156 SNPs/MB. In this study, the first application of GBS to ducks, we demonstrate the power and simplicity of this method. GBS can be used for genetic studies in to provide an effective method for genome-wide SNP discovery. PMID:27845353

  13. Gene-Environment Interaction in the Etiology of Mathematical Ability Using SNP Sets

    PubMed Central

    Kovas, Yulia; Plomin, Robert

    2010-01-01

    Mathematics ability and disability is as heritable as other cognitive abilities and disabilities, however its genetic etiology has received relatively little attention. In our recent genome-wide association study of mathematical ability in 10-year-old children, 10 SNP associations were nominated from scans of pooled DNA and validated in an individually genotyped sample. In this paper, we use a ‘SNP set’ composite of these 10 SNPs to investigate gene-environment (GE) interaction, examining whether the association between the 10-SNP set and mathematical ability differs as a function of ten environmental measures in the home and school in a sample of 1888 children with complete data. We found two significant GE interactions for environmental measures in the home and the school both in the direction of the diathesis-stress type of GE interaction: The 10-SNP set was more strongly associated with mathematical ability in chaotic homes and when parents are negative. PMID:20978832

  14. Diode Laser Arrays

    NASA Astrophysics Data System (ADS)

    Botez, Dan; Scifres, Don R.

    2005-11-01

    Contributors; 1. Monolithic phase-locked semiconductor laser arrays D. Botez; 2. High power coherent, semiconductor laser master oscillator power amplifiers and amplifier arrays D. F. Welch and D. G. Mehuys; 3. Microoptical components applied to incoherent and coherent laser arrays J. R. Leger; 4. Modeling of diode laser arrays G. R. Hadley; 5. Dynamics of coherent semiconductor laser arrays H. G. Winfuland and R. K. Defreez; 6. High average power semiconductor laser arrays and laser array packaging with an emphasis for pumping solid state lasers R. Solarz; 7. High power diode laser arrays and their reliability D. R. Scifres and H. H. Kung; 8. Strained layer quantum well heterostructure laser arrays J. J. Coleman; 9. Vertical cavity surface emitting laser arrays C. J. Chang-Hasnain; 10. Individually addressed arrays of diode lasers D. Carlin.

  15. Evaluation of breast cancer susceptibility using improved genetic algorithms to generate genotype SNP barcodes.

    PubMed

    Yang, Cheng-Hong; Lin, Yu-Da; Chuang, Li-Yeh; Chang, Hsueh-Wei

    2013-01-01

    Genetic association is a challenging task for the identification and characterization of genes that increase the susceptibility to common complex multifactorial diseases. To fully execute genetic studies of complex diseases, modern geneticists face the challenge of detecting interactions between loci. A genetic algorithm (GA) is developed to detect the association of genotype frequencies of cancer cases and noncancer cases based on statistical analysis. An improved genetic algorithm (IGA) is proposed to improve the reliability of the GA method for high-dimensional SNP-SNP interactions. The strategy offers the top five results to the random population process, in which they guide the GA toward a significant search course. The IGA increases the likelihood of quickly detecting the maximum ratio difference between cancer cases and noncancer cases. The study systematically evaluates the joint effect of 23 SNP combinations of six steroid hormone metabolisms, and signaling-related genes involved in breast carcinogenesis pathways were systematically evaluated, with IGA successfully detecting significant ratio differences between breast cancer cases and noncancer cases. The possible breast cancer risks were subsequently analyzed by odds-ratio (OR) and risk-ratio analysis. The estimated OR of the best SNP barcode is significantly higher than 1 (between 1.15 and 7.01) for specific combinations of two to 13 SNPs. Analysis results support that the IGA provides higher ratio difference values than the GA between breast cancer cases and noncancer cases over 3-SNP to 13-SNP interactions. A more specific SNP-SNP interaction profile for the risk of breast cancer is also provided.

  16. Prim-SNPing: a primer designer for cost-effective SNP genotyping.

    PubMed

    Chang, Hsueh-Wei; Chuang, Li-Yeh; Cheng, Yu-Huei; Hung, Yu-Chen; Wen, Cheng-Hao; Gu, De-Leung; Yang, Cheng-Hong

    2009-05-01

    Many kinds of primer design (PD) software tools have been developed, but most of them lack a single nucleotide polymorphism (SNP) genotyping service. Here, we introduce the web-based freeware "Prim-SNPing," which, in addition to general PD, provides three kinds of primer design functions for cost-effective SNP genotyping: natural PD, mutagenic PD, and confronting two-pair primers (CTPP) PD. The natural PD and mutagenic PD provide primers and restriction enzyme mining for polymerase chain reaction-restriction fragment of length polymorphism (PCR-RFLP), while CTPP PD provides primers for restriction enzyme-free SNP genotyping. The PCR specificity and efficiency of the designed primers are improved by BLAST searching and evaluating secondary structure (such as GC clamps, dimers, and hairpins), respectively. The length pattern of PCR-RFLP using natural PD is user-adjustable, and the restriction sites of the RFLP enzymes provided by Prim-SNPing are confirmed to be absent within the generated PCR product. In CTPP PD, the need for a separate digestion step in RFLP is eliminated, thus making it faster and cheaper. The output of Prim-SNPing includes the primer list, melting temperature (Tm) value, GC percentage, and amplicon size with enzyme digestion information. The reference SNP (refSNP, or rs) clusters from the Single Nucleotide Polymorphism database (dbSNP) at the National Center for Biotechnology Information (NCBI), and multiple other formats of human, mouse, and rat SNP sequences are acceptable input. In summary, Prim-SNPing provides interactive, user-friendly and cost-effective primer design for SNP genotyping. It is freely available at http://bio.kuas.edu.tw/prim-snping.

  17. Evaluation of 16 loci to examine the cross-species utility of single nucleotide polymorphism arrays.

    PubMed

    Sechi, T; Coltman, D W; Kijas, J W

    2010-04-01

    Large collections of single nucleotide polymorphisms (SNPs) have recently been identified from a number of livestock genomes. This raises the possibility that SNP arrays might be useful for analysis in related species for which few genetic markers are currently available. To address the likely success of such an approach, the aim of this study was to examine the threshold number and position of flanking mutations which act to prevent genotype calls being produced. Sequence diversity was measured across 16 loci containing SNPs known either to work successfully between species or fail between species. In pairwise comparisons between domestic and wild sheep, sequence divergence surrounding working SNP assays was significantly lower than that surrounding non-functional assays. In addition, the location of flanking mismatches tended to be closer to the target SNP in loci that failed to generate genotype calls across species. The magnitude of sequence divergence observed for both working and non-functional assays was compared with the divergence separating domestic sheep from European Mouflon, African Barbary, goat and cattle. The results suggest that the utility of SNP arrays for analysis of shared polymorphism will be restricted to closely related pairs of species. Analysis across more divergent species will, however, be successful for other objectives, such as the identification of the ancestral state of SNPs.

  18. Electrochemical Li Topotactic Reaction in Layered SnP3 for Superior Li-Ion Batteries

    PubMed Central

    Park, Jae-Wan; Park, Cheol-Min

    2016-01-01

    The development of new anode materials having high electrochemical performances and interesting reaction mechanisms is highly required to satisfy the need for long-lasting mobile electronic devices and electric vehicles. Here, we report a layer crystalline structured SnP3 and its unique electrochemical behaviors with Li. The SnP3 was simply synthesized through modification of Sn crystallography by combination with P and its potential as an anode material for LIBs was investigated. During Li insertion reaction, the SnP3 anode showed an interesting two-step electrochemical reaction mechanism comprised of a topotactic transition (0.7–2.0 V) and a conversion (0.0–2.0 V) reaction. When the SnP3-based composite electrode was tested within the topotactic reaction region (0.7–2.0 V) between SnP3 and LixSnP3 (x ≤ 4), it showed excellent electrochemical properties, such as a high volumetric capacity (1st discharge/charge capacity was 840/663 mA h cm−3) with a high initial coulombic efficiency, stable cycle behavior (636 mA h cm−3 over 100 cycles), and fast rate capability (550 mA h cm−3 at 3C). This layered SnP3 anode will be applicable to a new anode material for rechargeable LIBs. PMID:27775090

  19. Electrochemical Li Topotactic Reaction in Layered SnP3 for Superior Li-Ion Batteries

    NASA Astrophysics Data System (ADS)

    Park, Jae-Wan; Park, Cheol-Min

    2016-10-01

    The development of new anode materials having high electrochemical performances and interesting reaction mechanisms is highly required to satisfy the need for long-lasting mobile electronic devices and electric vehicles. Here, we report a layer crystalline structured SnP3 and its unique electrochemical behaviors with Li. The SnP3 was simply synthesized through modification of Sn crystallography by combination with P and its potential as an anode material for LIBs was investigated. During Li insertion reaction, the SnP3 anode showed an interesting two-step electrochemical reaction mechanism comprised of a topotactic transition (0.7–2.0 V) and a conversion (0.0–2.0 V) reaction. When the SnP3-based composite electrode was tested within the topotactic reaction region (0.7–2.0 V) between SnP3 and LixSnP3 (x ≤ 4), it showed excellent electrochemical properties, such as a high volumetric capacity (1st discharge/charge capacity was 840/663 mA h cm‑3) with a high initial coulombic efficiency, stable cycle behavior (636 mA h cm‑3 over 100 cycles), and fast rate capability (550 mA h cm‑3 at 3C). This layered SnP3 anode will be applicable to a new anode material for rechargeable LIBs.

  20. SNP-based prediction of the human germ cell methylation landscape.

    PubMed

    Xie, Hehuang; Wang, Min; Bischof, Jared; Bonaldo, Maria de Fatima; Soares, Marcelo Bento

    2009-05-01

    Base substitution occurs at a high rate at CpG dinucleotides due to the frequent methylation of CpG and the deamination of methylated cytosine to thymine. If these substitutions occur in germ cells, they constitute a heritable mutation that may eventually rise to polymorphic frequencies, hence resulting in a SNP that is methylation associated. In this study, we sought to identify clusters of methylation associated SNPs as a basis for prediction of methylation landscapes of germ cell genomes. Genomic regions enriched with methylation associated SNPs, namely "methylation associated SNP clusters", were identified with an agglomerative hierarchical clustering algorithm. Repetitive elements, segmental duplications, and syntenic tandem DNA repeats were enriched in methylation associated SNP clusters. The frequency of methylation associated SNPs in Alu Y/S elements exhibited a gradient pattern suggestive of linear spreading, being higher in proximity to methylation associated SNP clusters and lower closer to CpG islands. Interestingly, methylation associated SNP clusters were over-represented near the transcriptional initiation sites of immune response genes. We propose a de novo DNA methylation model during germ cell development whereby a pattern is established by long-range chromatic interactions through syntenic repeats combined with regional methylation spreading from methylation associated SNP clusters.

  1. SNP2TFBS – a database of regulatory SNPs affecting predicted transcription factor binding site affinity

    PubMed Central

    Kumar, Sunil; Ambrosini, Giovanna; Bucher, Philipp

    2017-01-01

    SNP2TFBS is a computational resource intended to support researchers investigating the molecular mechanisms underlying regulatory variation in the human genome. The database essentially consists of a collection of text files providing specific annotations for human single nucleotide polymorphisms (SNPs), namely whether they are predicted to abolish, create or change the affinity of one or several transcription factor (TF) binding sites. A SNP's effect on TF binding is estimated based on a position weight matrix (PWM) model for the binding specificity of the corresponding factor. These data files are regenerated at regular intervals by an automatic procedure that takes as input a reference genome, a comprehensive SNP catalogue and a collection of PWMs. SNP2TFBS is also accessible over a web interface, enabling users to view the information provided for an individual SNP, to extract SNPs based on various search criteria, to annotate uploaded sets of SNPs or to display statistics about the frequencies of binding sites affected by selected SNPs. Homepage: http://ccg.vital-it.ch/snp2tfbs/. PMID:27899579

  2. SNP and mutation data on the web - hidden treasures for uncovering.

    PubMed

    Barnes, Michael R

    2002-01-01

    SNP data has grown exponentially over the last two years, SNP database evolution has matched this growth, as initial development of several independent SNP databases has given way to one central SNP database, dbSNP. Other SNP databases have instead evolved to complement this central database by providing gene specific focus and an increased level of curation and analysis on subsets of data, derived from the central data set. By contrast, human mutation data, which has been collected over many years, is still stored in disparate sources, although moves are afoot to move to a similar central database. These developments are timely, human mutation and polymorphism data both hold complementary keys to a better understanding of how genes function and malfunction in disease. The impending availability of a complete human genome presents us with an ideal framework to integrate both these forms of data, as our understanding of the mechanisms of disease increase, the full genomic context of variation may become increasingly significant.

  3. SNP2TFBS - a database of regulatory SNPs affecting predicted transcription factor binding site affinity.

    PubMed

    Kumar, Sunil; Ambrosini, Giovanna; Bucher, Philipp

    2017-01-04

    SNP2TFBS is a computational resource intended to support researchers investigating the molecular mechanisms underlying regulatory variation in the human genome. The database essentially consists of a collection of text files providing specific annotations for human single nucleotide polymorphisms (SNPs), namely whether they are predicted to abolish, create or change the affinity of one or several transcription factor (TF) binding sites. A SNP's effect on TF binding is estimated based on a position weight matrix (PWM) model for the binding specificity of the corresponding factor. These data files are regenerated at regular intervals by an automatic procedure that takes as input a reference genome, a comprehensive SNP catalogue and a collection of PWMs. SNP2TFBS is also accessible over a web interface, enabling users to view the information provided for an individual SNP, to extract SNPs based on various search criteria, to annotate uploaded sets of SNPs or to display statistics about the frequencies of binding sites affected by selected SNPs. Homepage: http://ccg.vital-it.ch/snp2tfbs/.

  4. Single nucleotide polymorphism array-based karyotyping in acute myeloid leukemia or myelodysplastic syndrome with trisomy 8 as the sole chromosomal abnormality.

    PubMed

    Hahm, Chorong; Mun, Yeung Chul; Seong, Chu Myong; Han, Sung-Hee; Chung, Wha Soon; Huh, Jungwon

    2013-01-01

    The clinical heterogeneity of patients with acute myeloid leukemia (AML) or myelodysplastic syndrome (MDS) with trisomy 8 as the sole abnormality may result from cytogenetically undetectable genetic changes. The purpose of this study was to identify hidden genomic aberrations not detected by metaphase cytogenetics (MC) using high-resolution single nucleotide polymorphism array (SNP-A)-based karyotyping in AML/MDS patients with a sole trisomy 8. The study group included 8 patients (3 AML and 5 MDS) and array-based karyotyping was done using whole-genome SNP-A (SNP 6.0 and SNP 2.7M). By SNP-A, additional genomic aberrations not detected by MC were identified in 2 patients: 1 AML patient exhibited a copy-neutral loss of heterozygosity (CN-LOH) of 3q21.1-q29 and 11q13.1-q25 and the other patient with MDS (refractory cytopenia with unilineage dysplasia) had CN-LOH of 2p25.3-p15. In particular, the latter patient progressed to AML 18 months after the diagnosis. In 3 patients, aberrations in addition to trisomy 8 were not identified by SNP-A. In the remaining 3 patients, SNP-A could not detect trisomy 8, while trisomy 8 was found in 25-67% of metaphase cells by MC. This study suggests that additional genomic aberrations may in fact be present even in cases of trisomy 8 as sole abnormality by MC, and SNP-A could be a useful karyotyping tool to identify hidden aberrations such as CN-LOH.

  5. Identification of Laying-Related SNP Markers in Geese Using RAD Sequencing.

    PubMed

    Yu, ShiGang; Chu, WeiWei; Zhang, LiFan; Han, HouMing; Zhao, RongXue; Wu, Wei; Zhu, JiangNing; Dodson, Michael V; Wei, Wei; Liu, HongLin; Chen, Jie

    2015-01-01

    Laying performance is an important economical trait of goose production. As laying performance is of low heritability, it is of significance to develop a marker-assisted selection (MAS) strategy for this trait. Definition of sequence variation related to the target trait is a prerequisite of quantitating MAS, but little is presently known about the goose genome, which greatly hinders the identification of genetic markers for the laying traits of geese. Recently developed restriction site-associated DNA (RAD) sequencing is a possible approach for discerning large-scale single nucleotide polymorphism (SNP) and reducing the complexity of a genome without having reference genomic information available. In the present study, we developed a pooled RAD sequencing strategy for detecting geese laying-related SNP. Two DNA pools were constructed, each consisting of equal amounts of genomic DNA from 10 individuals with either high estimated breeding value (HEBV) or low estimated breeding value (LEBV). A total of 139,013 SNP were obtained from 42,291,356 sequences, of which 18,771,943 were for LEBV and 23,519,413 were for HEBV cohorts. Fifty-five SNP which had different allelic frequencies in the two DNA pools were further validated by individual-based AS-PCR genotyping in the LEBV and HEBV cohorts. Ten out of 55 SNP exhibited distinct allele distributions in these two cohorts. These 10 SNP were further genotyped in a goose population of 492 geese to verify the association with egg numbers. The result showed that 8 of 10 SNP were associated with egg numbers. Additionally, liner regression analysis revealed that SNP Record-111407, 106975 and 112359 were involved in a multiplegene network affecting laying performance. We used IPCR to extend the unknown regions flanking the candidate RAD tags. The obtained sequences were subjected to BLAST to retrieve the orthologous genes in either ducks or chickens. Five novel genes were cloned for geese which harbored the candidate laying

  6. Identification of Laying-Related SNP Markers in Geese Using RAD Sequencing

    PubMed Central

    Yu, ShiGang; Chu, WeiWei; Zhang, LiFan; Han, HouMing; Zhao, RongXue; Wu, Wei; Zhu, JiangNing; Dodson, Michael V.; Wei, Wei; Liu, HongLin; Chen, Jie

    2015-01-01

    Laying performance is an important economical trait of goose production. As laying performance is of low heritability, it is of significance to develop a marker-assisted selection (MAS) strategy for this trait. Definition of sequence variation related to the target trait is a prerequisite of quantitating MAS, but little is presently known about the goose genome, which greatly hinders the identification of genetic markers for the laying traits of geese. Recently developed restriction site-associated DNA (RAD) sequencing is a possible approach for discerning large-scale single nucleotide polymorphism (SNP) and reducing the complexity of a genome without having reference genomic information available. In the present study, we developed a pooled RAD sequencing strategy for detecting geese laying-related SNP. Two DNA pools were constructed, each consisting of equal amounts of genomic DNA from 10 individuals with either high estimated breeding value (HEBV) or low estimated breeding value (LEBV). A total of 139,013 SNP were obtained from 42,291,356 sequences, of which 18,771,943 were for LEBV and 23,519,413 were for HEBV cohorts. Fifty-five SNP which had different allelic frequencies in the two DNA pools were further validated by individual-based AS-PCR genotyping in the LEBV and HEBV cohorts. Ten out of 55 SNP exhibited distinct allele distributions in these two cohorts. These 10 SNP were further genotyped in a goose population of 492 geese to verify the association with egg numbers. The result showed that 8 of 10 SNP were associated with egg numbers. Additionally, liner regression analysis revealed that SNP Record-111407, 106975 and 112359 were involved in a multiplegene network affecting laying performance. We used IPCR to extend the unknown regions flanking the candidate RAD tags. The obtained sequences were subjected to BLAST to retrieve the orthologous genes in either ducks or chickens. Five novel genes were cloned for geese which harbored the candidate laying

  7. Generation of SNP datasets for orangutan population genomics using improved reduced-representation sequencing and direct comparisons of SNP calling algorithms

    PubMed Central

    2014-01-01

    Background High-throughput sequencing has opened up exciting possibilities in population and conservation genetics by enabling the assessment of genetic variation at genome-wide scales. One approach to reduce genome complexity, i.e. investigating only parts of the genome, is reduced-representation library (RRL) sequencing. Like similar approaches, RRL sequencing reduces ascertainment bias due to simultaneous discovery and genotyping of single-nucleotide polymorphisms (SNPs) and does not require reference genomes. Yet, generating such datasets remains challenging due to laboratory and bioinformatical issues. In the laboratory, current protocols require improvements with regards to sequencing homologous fragments to reduce the number of missing genotypes. From the bioinformatical perspective, the reliance of most studies on a single SNP caller disregards the possibility that different algorithms may produce disparate SNP datasets. Results We present an improved RRL (iRRL) protocol that maximizes the generation of homologous DNA sequences, thus achieving improved genotyping-by-sequencing efficiency. Our modifications facilitate generation of single-sample libraries, enabling individual genotype assignments instead of pooled-sample analysis. We sequenced ~1% of the orangutan genome with 41-fold median coverage in 31 wild-born individuals from two populations. SNPs and genotypes were called using three different algorithms. We obtained substantially different SNP datasets depending on the SNP caller. Genotype validations revealed that the Unified Genotyper of the Genome Analysis Toolkit and SAMtools performed significantly better than a caller from CLC Genomics Workbench (CLC). Of all conflicting genotype calls, CLC was only correct in 17% of the cases. Furthermore, conflicting genotypes between two algorithms showed a systematic bias in that one caller almost exclusively assigned heterozygotes, while the other one almost exclusively assigned homozygotes. Conclusions

  8. The iSelect 9 K SNP analysis revealed polyploidization induced revolutionary changes and intense human selection causing strong haplotype blocks in wheat

    PubMed Central

    Hao, Chenyang; Wang, Yuquan; Chao, Shiaoman; Li, Tian; Liu, Hongxia; Wang, Lanfen; Zhang, Xueyong

    2017-01-01

    A Chinese wheat mini core collection was genotyped using the wheat 9 K iSelect SNP array. Total 2420 and 2396 polymorphic SNPs were detected on the A and the B genome chromosomes, which formed 878 haplotype blocks. There were more blocks in the B genome, but the average block size was significantly (P < 0.05) smaller than those in the A genome. Intense selection (domestication and breeding) had a stronger effect on the A than on the B genome chromosomes. Based on the genetic pedigrees, many blocks can be traced back to a well-known Strampelli cross, which was made one century ago. Furthermore, polyploidization of wheat (both tetraploidization and hexaploidization) induced revolutionary changes in both the A and the B genomes, with a greater increase of gene diversity compared to their diploid ancestors. Modern breeding has dramatically increased diversity in the gene coding regions, though obvious blocks were formed on most of the chromosomes in both tetraploid and hexaploid wheats. Tag-SNP markers identified in this study can be used for marker assisted selection using haplotype blocks as a wheat breeding strategy. This strategy can also be employed to facilitate genome selection in other self-pollinating crop species. PMID:28134278

  9. High-throughput SNP genotyping in Cucurbita pepo for map construction and quantitative trait loci mapping

    PubMed Central

    2012-01-01

    Background Cucurbita pepo is a member of the Cucurbitaceae family, the second- most important horticultural family in terms of economic importance after Solanaceae. The "summer squash" types, including Zucchini and Scallop, rank among the highest-valued vegetables worldwide. There are few genomic tools available for this species. The first Cucurbita transcriptome, along with a large collection of Single Nucleotide Polymorphisms (SNP), was recently generated using massive sequencing. A set of 384 SNP was selected to generate an Illumina GoldenGate assay in order to construct the first SNP-based genetic map of Cucurbita and map quantitative trait loci (QTL). Results We herein present the construction of the first SNP-based genetic map of Cucurbita pepo using a population derived from the cross of two varieties with contrasting phenotypes, representing the main cultivar groups of the species' two subspecies: Zucchini (subsp. pepo) × Scallop (subsp. ovifera). The mapping population was genotyped with 384 SNP, a set of selected EST-SNP identified in silico after massive sequencing of the transcriptomes of both parents, using the Illumina GoldenGate platform. The global success rate of the assay was higher than 85%. In total, 304 SNP were mapped, along with 11 SSR from a previous map, giving a map density of 5.56 cM/marker. This map was used to infer syntenic relationships between C. pepo and cucumber and to successfully map QTL that control plant, flowering and fruit traits that are of benefit to squash breeding. The QTL effects were validated in backcross populations. Conclusion Our results show that massive sequencing in different genotypes is an excellent tool for SNP discovery, and that the Illumina GoldenGate platform can be successfully applied to constructing genetic maps and performing QTL analysis in Cucurbita. This is the first SNP-based genetic map in the Cucurbita genus and is an invaluable new tool for biological research, especially considering that most

  10. SnpFilt: A pipeline for reference-free assembly-based identification of SNPs in bacterial genomes.

    PubMed

    Chan, Carmen H S; Octavia, Sophie; Sintchenko, Vitali; Lan, Ruiting

    2016-12-01

    De novo assembly of bacterial genomes from next-generation sequencing (NGS) data allows a reference-free discovery of single nucleotide polymorphisms (SNP). However, substantial rates of errors in genomes assembled by this approach remain a major barrier for the reference-free analysis of genome variations in medically important bacteria. The aim of this report was to improve the quality of SNP identification in bacterial genomes without closely related references. We developed a bioinformatics pipeline (SnpFilt) that constructs an assembly using SPAdes and then removes unreliable regions based on the quality and coverage of re-aligned reads at neighbouring regions. The performance of the pipeline was compared against reference-based SNP calling for Illumina HiSeq, MiSeq and NextSeq reads from a range of bacterial pathogens including Salmonella, which is one of the most common causes of food-borne disease. The SnpFilt pipeline removed all false SNP in all test NGS datasets consisting of paired-end Illumina reads. We also showed that for reliable and complete SNP calls, at least 40-fold coverage is required. Analysis of bacterial isolates associated with epidemiologically confirmed outbreaks using the SnpFilt pipeline produced results consistent with previously published findings. The SnpFilt pipeline improves the quality of de-novo assembly and precision of SNP calling in bacterial genomes by removal of regions of the assembly that may potentially contain assembly errors. SnpFilt is available from https://github.com/LanLab/SnpFilt.

  11. Explaining the disease phenotype of intergenic SNP through predicted long range regulation

    PubMed Central

    Chen, Jingqi; Tian, Weidong

    2016-01-01

    Thousands of disease-associated SNPs (daSNPs) are located in intergenic regions (IGR), making it difficult to understand their association with disease phenotypes. Recent analysis found that non-coding daSNPs were frequently located in or approximate to regulatory elements, inspiring us to try to explain the disease phenotypes of IGR daSNPs through nearby regulatory sequences. Hence, after locating the nearest distal regulatory element (DRE) to a given IGR daSNP, we applied a computational method named INTREPID to predict the target genes regulated by the DRE, and then investigated their functional relevance to the IGR daSNP's disease phenotypes. 36.8% of all IGR daSNP-disease phenotype associations investigated were possibly explainable through the predicted target genes, which were enriched with, were functionally relevant to, or consisted of the corresponding disease genes. This proportion could be further increased to 60.5% if the LD SNPs of daSNPs were also considered. Furthermore, the predicted SNP-target gene pairs were enriched with known eQTL/mQTL SNP-gene relationships. Overall, it's likely that IGR daSNPs may contribute to disease phenotypes by interfering with the regulatory function of their nearby DREs and causing abnormal expression of disease genes. PMID:27280978

  12. Objective evaluation measures of genetic marker selection in large-scale SNP genotyping.

    PubMed

    Kaminuma, Eli; Masuya, Hiroshi; Miura, Ikuo; Motegi, Hiromi; Takahasi, Kenzi R; Nakazawa, Miki; Matsui, Minami; Gondo, Yoichi; Noda, Tetsuo; Shiroishi, Toshihiko; Wakana, Shigeharu; Toyoda, Tetsuro

    2008-10-01

    High-throughput single nucleotide polymorphism (SNP) genotyping systems provide two kinds of fluorescent signals detected from different alleles. In current technologies, the process of genotype discrimination requires subjective judgments by expert operators, even when using clustering algorithms. Here, we propose two evaluation measures to manage fluorescent scatter data with nonclear plot aggregation. The first is the marker ranking measure, which provides a ranking system for the SNP markers based on the distance between the scatter plot distribution and a user-defined ideal distribution. The second measure, called individual genotype membership, uses the membership probability of each genotype related to an individual plot in the scatter data. In verification experiments, the marker ranking measure determined the ranking of SNP markers correlated with the subjective order of SNP markers judged by an expert operator. The experiment using the individual genotype membership measure clarified that the total number of unclassified individuals was remarkably reduced compared to that of manually unclassified ones. These two evaluation measures were implemented as the GTAssist software. GTAssist provides objective standards and avoids subjective biases in SNP genotyping workflows.

  13. Assessment of high resolution melting analysis as a potential SNP genotyping technique in forensic casework.

    PubMed

    Venables, Samantha J; Mehta, Bhavik; Daniel, Runa; Walsh, Simon J; van Oorschot, Roland A H; McNevin, Dennis

    2014-11-01

    High resolution melting (HRM) analysis is a simple, cost effective, closed tube SNP genotyping technique with high throughput potential. The effectiveness of HRM for forensic SNP genotyping was assessed with five commercially available HRM kits evaluated on the ViiA™ 7 Real Time PCR instrument. Four kits performed satisfactorily against forensically relevant criteria. One was further assessed to determine the sensitivity, reproducibility, and accuracy of HRM SNP genotyping. The manufacturer's protocol using 0.5 ng input DNA and 45 PCR cycles produced accurate and reproducible results for 17 of the 19 SNPs examined. Problematic SNPs had GC rich flanking regions which introduced additional melting domains into the melting curve (rs1800407) or included homozygotes that were difficult to distinguish reliably (rs16891982; a G to C SNP). A proof of concept multiplexing experiment revealed that multiplexing a small number of SNPs may be possible after further investigation. HRM enables genotyping of a number of SNPs in a large number of samples without extensive optimization. However, it requires more genomic DNA as template in comparison to SNaPshot®. Furthermore, suitably modifying pre-existing forensic intelligence SNP panels for HRM analysis may pose difficulties due to the properties of some SNPs.

  14. Highly specific SNP detection using 2D graphene electronics and DNA strand displacement.

    PubMed

    Hwang, Michael T; Landon, Preston B; Lee, Joon; Choi, Duyoung; Mo, Alexander H; Glinsky, Gennadi; Lal, Ratnesh

    2016-06-28

    Single-nucleotide polymorphisms (SNPs) in a gene sequence are markers for a variety of human diseases. Detection of SNPs with high specificity and sensitivity is essential for effective practical implementation of personalized medicine. Current DNA sequencing, including SNP detection, primarily uses enzyme-based methods or fluorophore-labeled assays that are time-consuming, need laboratory-scale settings, and are expensive. Previously reported electrical charge-based SNP detectors have insufficient specificity and accuracy, limiting their effectiveness. Here, we demonstrate the use of a DNA strand displacement-based probe on a graphene field effect transistor (FET) for high-specificity, single-nucleotide mismatch detection. The single mismatch was detected by measuring strand displacement-induced resistance (and hence current) change and Dirac point shift in a graphene FET. SNP detection in large double-helix DNA strands (e.g., 47 nt) minimize false-positive results. Our electrical sensor-based SNP detection technology, without labeling and without apparent cross-hybridization artifacts, would allow fast, sensitive, and portable SNP detection with single-nucleotide resolution. The technology will have a wide range of applications in digital and implantable biosensors and high-throughput DNA genotyping, with transformative implications for personalized medicine.

  15. snpGeneSets: An R Package for Genome-Wide Study Annotation

    PubMed Central

    Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian

    2016-01-01

    Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/. PMID:27807048

  16. snpGeneSets: An R Package for Genome-Wide Study Annotation.

    PubMed

    Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian

    2016-12-07

    Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/.

  17. Supervised learning-based tagSNP selection for genome-wide disease classifications

    PubMed Central

    Liu, Qingzhong; Yang, Jack; Chen, Zhongxue; Yang, Mary Qu; Sung, Andrew H; Huang, Xudong

    2008-01-01

    Background Comprehensive evaluation of common genetic variations through association of single nucleotide polymorphisms (SNPs) with complex human diseases on the genome-wide scale is an active area in human genome research. One of the fundamental questions in a SNP-disease association study is to find an optimal subset of SNPs with predicting power for disease status. To find that subset while reducing study burden in terms of time and costs, one can potentially reconcile information redundancy from associations between SNP markers. Results We have developed a feature selection method named Supervised Recursive Feature Addition (SRFA). This method combines supervised learning and statistical measures for the chosen candidate features/SNPs to reconcile the redundancy information and, in doing so, improve the classification performance in association studies. Additionally, we have proposed a Support Vector based Recursive Feature Addition (SVRFA) scheme in SNP-disease association analysis. Conclusions We have proposed using SRFA with different statistical learning classifiers and SVRFA for both SNP selection and disease classification and then applying them to two complex disease data sets. In general, our approaches outperform the well-known feature selection method of Support Vector Machine Recursive Feature Elimination and logic regression-based SNP selection for disease classification in genetic association studies. Our study further indicates that both genetic and environmental variables should be taken into account when doing disease predictions and classifications for the most complex human diseases that have gene-environment interactions. PMID:18366619

  18. Highly specific SNP detection using 2D graphene electronics and DNA strand displacement

    PubMed Central

    Hwang, Michael T.; Landon, Preston B.; Lee, Joon; Choi, Duyoung; Mo, Alexander H.; Glinsky, Gennadi; Lal, Ratnesh

    2016-01-01

    Single-nucleotide polymorphisms (SNPs) in a gene sequence are markers for a variety of human diseases. Detection of SNPs with high specificity and sensitivity is essential for effective practical implementation of personalized medicine. Current DNA sequencing, including SNP detection, primarily uses enzyme-based methods or fluorophore-labeled assays that are time-consuming, need laboratory-scale settings, and are expensive. Previously reported electrical charge-based SNP detectors have insufficient specificity and accuracy, limiting their effectiveness. Here, we demonstrate the use of a DNA strand displacement-based probe on a graphene field effect transistor (FET) for high-specificity, single-nucleotide mismatch detection. The single mismatch was detected by measuring strand displacement-induced resistance (and hence current) change and Dirac point shift in a graphene FET. SNP detection in large double-helix DNA strands (e.g., 47 nt) minimize false-positive results. Our electrical sensor-based SNP detection technology, without labeling and without apparent cross-hybridization artifacts, would allow fast, sensitive, and portable SNP detection with single-nucleotide resolution. The technology will have a wide range of applications in digital and implantable biosensors and high-throughput DNA genotyping, with transformative implications for personalized medicine. PMID:27298347

  19. Developing a new nonbinary SNP fluorescent multiplex detection system for forensic application in China.

    PubMed

    Liu, Yanfang; Liao, Huidan; Liu, Ying; Guo, Juanjuan; Sun, Yi; Fu, Xiaoliang; Xiao, Ding; Cai, Jifeng; Lan, Lingmei; Xie, Pingli; Zha, Lagabaiyila

    2017-02-06

    Nonbinary single-nucleotide polymorphisms (SNPs) are potential forensic genetic markers because their discrimination power is greater than that of normal binary SNPs, and that they can detect highly degraded samples. We previously developed a nonbinary SNP multiplex typing assay. In this study, we selected additional 20 nonbinary SNPs from the NCBI SNP database and verified them through pyrosequencing. These 20 nonbinary SNPs were analyzed using the fluorescent-labeled SNaPshot multiplex SNP typing method. The allele frequencies and genetic parameters of these 20 nonbinary SNPs were determined among 314 unrelated individuals from Han populations from China. The total power of discrimination was 0.9999999999994, and the cumulative probability of exclusion was 0.9986. Moreover, the result of the combination of this 20 nonbinary SNP assay with the 20 nonbinary SNP assay we previously developed demonstrated that the cumulative probability of exclusion of the 40 nonbinary SNPs was 0.999991 and that no significant linkage disequilibrium was observed in all 40 nonbinary SNPs. Thus, we concluded that this new system consisting of new 20 nonbinary SNPs could provide highly informative polymorphic data which would be further used in forensic application and would serve as a potentially valuable supplement to forensic DNA analysis.

  20. MDM2 SNP309 polymorphism is associated with colorectal cancer risk

    PubMed Central

    Wang, Weizhi; Du, Mulong; Gu, Dongying; Zhu, Lingjun; Chu, Haiyan; Tong, Na; Zhang, Zhengdong; Xu, Zekuan; Wang, Meilin

    2014-01-01

    The human murine double minute 2 (MDM2) is known as an oncoprotein through inhibiting P53 transcriptional activity and mediating P53 ubiquitination. Therefore, the amplification of MDM2 may attenuate the P53 pathway and promote tumorigenesis. The SNP309 T>G polymorphism (rs2279744), which is located in the intronic promoter of MDM2 gene, was reported to contribute to the increased level of MDM2 protein. In this hospital-based case-control study, which consisted of 573 cases and 588 controls, we evaluated the association between MDM2 SNP309 and the risk of colorectal cancer (CRC) in a Chinese population by using the TaqMan method to genotype the polymorphism. We found that the MDM2 SNP309 polymorphism was significantly associated with CRC risk. In addition, in our meta-analysis, we found a significant association between MDM2 SNP309 and CRC risk among Asians, which was consistent with our results. In conclusion, we demonstrated that the MDM2 SNP309 polymorphism increased the susceptibility of CRC in Asian populations. PMID:24797837

  1. High-density fiber optic biosensor arrays

    NASA Astrophysics Data System (ADS)

    Epstein, Jason R.; Walt, David R.

    2002-02-01

    Novel approaches are required to coordinate the immense amounts of information derived from diverse genomes. This concept has influenced the expanded role of high-throughput DNA detection and analysis in the biological sciences. A high-density fiber optic DNA biosensor was developed consisting of oligonucleotide-functionalized, 3.1 mm diameter microspheres deposited into the etched wells on the distal face of a 500 micrometers imaging fiber bundle. Imaging fiber bundles containing thousands of optical fibers, each associated with a unique oligonucleotide probe sequence, were the foundation for an optically connected, individually addressable DNA detection platform. Different oligonucleotide-functionalized microspheres were combined in a stock solution, and randomly dispersed into the etched wells. Microsphere positions were registered from optical dyes incorporated onto the microspheres. The distribution process provided an inherent redundancy that increases the signal-to-noise ratio as the square root of the number of sensors examined. The representative amount of each probe-type in the array was dependent on their initial stock solution concentration, and as other sequences of interest arise, new microsphere elements can be added to arrays without altering the existing detection capabilities. The oligonucleotide probe sequences hybridize to fluorescently-labeled, complementary DNA target solutions. Fiber optic DNA microarray research has included DNA-protein interaction profiles, microbial strain differentiation, non-labeled target interrogation with molecular beacons, and single cell-based assays. This biosensor array is proficient in DNA detection linked to specific disease states, single nucleotide polymorphism (SNP's) discrimination, and gene expression analysis. This array platform permits multiple detection formats, provides smaller feature sizes, and enables sensor design flexibility. High-density fiber optic microarray biosensors provide a fast

  2. SNP Discovery by Illumina-Based Transcriptome Sequencing of the Olive and the Genetic Characterization of Turkish Olive Genotypes Revealed by AFLP, SSR and SNP Markers

    PubMed Central

    Kaya, Hilal Betul; Cetin, Oznur; Kaya, Hulya; Sahin, Mustafa; Sefer, Filiz; Kahraman, Abdullah; Tanyolac, Bahattin

    2013-01-01

    Background The olive tree (Olea europaea L.) is a diploid (2n = 2x = 46) outcrossing species mainly grown in the Mediterranean area, where it is the most important oil-producing crop. Because of its economic, cultural and ecological importance, various DNA markers have been used in the olive to characterize and elucidate homonyms, synonyms and unknown accessions. However, a comprehensive characterization and a full sequence of its transcriptome are unavailable, leading to the importance of an efficient large-scale single nucleotide polymorphism (SNP) discovery in olive. The objectives of this study were (1) to discover olive SNPs using next-generation sequencing and to identify SNP primers for cultivar identification and (2) to characterize 96 olive genotypes originating from different regions of Turkey. Methodology/Principal Findings Next-generation sequencing technology was used with five distinct olive genotypes and generated cDNA, producing 126,542,413 reads using an Illumina Genome Analyzer IIx. Following quality and size trimming, the high-quality reads were assembled into 22,052 contigs with an average length of 1,321 bases and 45 singletons. The SNPs were filtered and 2,987 high-quality putative SNP primers were identified. The assembled sequences and singletons were subjected to BLAST similarity searches and annotated with a Gene Ontology identifier. To identify the 96 olive genotypes, these SNP primers were applied to the genotypes in combination with amplified fragment length polymorphism (AFLP) and simple sequence repeats (SSR) markers. Conclusions/Significance This study marks the highest number of SNP markers discovered to date from olive genotypes using transcriptome sequencing. The developed SNP markers will provide a useful source for molecular genetic studies, such as genetic diversity and characterization, high density quantitative trait locus (QTL) analysis, association mapping and map-based gene cloning in the olive. High levels of

  3. Breast cancer-associated high-order SNP-SNP interaction of CXCL12/CXCR4-related genes by an improved multifactor dimensionality reduction (MDR-ER).

    PubMed

    Fu, Ou-Yang; Chang, Hsueh-Wei; Lin, Yu-Da; Chuang, Li-Yeh; Hou, Ming-Feng; Yang, Cheng-Hong

    2016-09-01

    In association studies, the combined effects of single nucleotide polymorphism (SNP)-SNP interactions and the problem of imbalanced data between cases and controls are frequently ignored. In the present study, we used an improved multifactor dimensionality reduction (MDR) approach namely MDR-ER to detect the high order SNP‑SNP interaction in an imbalanced breast cancer data set containing seven SNPs of chemokine CXCL12/CXCR4 pathway genes. Most individual SNPs were not significantly associated with breast cancer. After MDR‑ER analysis, six significant SNP‑SNP interaction models with seven genes (highest cross‑validation consistency, 10; classification error rates, 41.3‑21.0; and prediction error rates, 47.4‑55.3) were identified. CD4 and VEGFA genes were associated in a 2‑loci interaction model (classification error rate, 41.3; prediction error rate, 47.5; odds ratio (OR), 2.069; 95% bootstrap CI, 1.40‑2.90; P=1.71E‑04) and it also appeared in all the best 2‑7‑loci models. When the loci number increased, the classification error rates and P‑values decreased. The powers in 2‑7‑loci in all models were >0.9. The minimum classification error rate of the MDR‑ER‑generated model was shown with the 7‑loci interaction model (classification error rate, 21.0; OR=15.282; 95% bootstrap CI, 9.54‑23.87; P=4.03E‑31). In the epistasis network analysis, the overall effect with breast cancer susceptibility was identified and the SNP order of impact on breast cancer was identified as follows: CD4 = VEGFA > KITLG > CXCL12 > CCR7 = MMP2 > CXCR4. In conclusion, the MDR‑ER can effectively and correctly identify the best SNP‑SNP interaction models in an imbalanced data set for breast cancer cases.

  4. Transcriptome sequencing for SNP discovery across Cucumis melo

    PubMed Central

    2012-01-01

    from India and Africa as compared to commercial cultivars, cultigens and landraces from Eastern Europe, Western Asia and the Mediterranean basin is consistent with the evolutionary history proposed for the species. Group-specific SNVs that will be useful in introgression programs were also detected. In a sample of 143 selected putative SNPs, we verified 93% of the polymorphisms in a panel of 78 genotypes. Conclusions This study provides the first comprehensive resequencing data for wild, exotic, and cultivated (landraces and commercial) melon transcriptomes, yielding the largest melon SNP collection available to date and representing a notable sample of the species diversity. This data provides a valuable resource for creating a catalog of allelic variants of melon genes and it will aid in future in-depth studies of population genetics, marker-assisted breeding, and gene identification aimed at developing improved varieties. PMID:22726804

  5. Observation of perturbed 3snp double photoexcited Ryberg series of beryllium atoms

    SciTech Connect

    Yoshida, Fumiko; Matsuoka, Leo; Osaki, Hiroyuki; Kikkawa, Satoshi; Fukushima, Yu; Hasegawa, Shuichi; Nagata, Tetsuo; Azuma, Yoshiro; Obara, Satoshi

    2006-04-15

    We observed the 3snp autoionizing Rydberg series of the Be atom in order to investigate the double-photoexcitation processes in two-s-electron systems. We employed synchrotron radiation to photoexcite the Be atoms and measured the generated Be{sup +} photoions by the time-of-flight method. The 3snp (n=3-9) photoexcitation resonance peaks with interloper state of 3p4s that converges to Be{sup +}(3p) threshold were observed. We derived the resonance parameters of 3snp series from a fitting procedure and obtained the Fano parameter q, energy position E{sub 0}, and resonance width {gamma}. These parameters are in good agreement with theoretical values. In the vicinity of the 3s5p state these experimental results clearly revealed the influence of the interloper 3p4s state, and the comparison with the numerical calculations indicates that more detailed calculations might be required to fully explain this phenomenon.

  6. Multi-marker-LD based genetic algorithm for tag SNP selection.

    PubMed

    Mouawad, Amer E; Mansour, Nashat

    2014-12-01

    Despite the advances in genotyping technologies which have led to large reduction in genotyping cost, the Tag SNP Selection problem remains an important problem for computational biologists and geneticists. Selecting the smallest subset of tag SNPs that can predict the other SNPs would considerably minimize the complexity of genome-wide or block-based SNP-disease association studies. These studies would lead to better diagnosis and treatment of diseases. In this work, we propose three variations of a genetic algorithm based on two-marker linkage disequilibrium, multi-marker linkage disequilibrium, and a third measure that we denote by prediction power. The performance of the three algorithms are compared with those of a recognized tag SNP selection algorithm using three different real data sets from the HapMap project. The results indicate that the multi-marker linkage disequilibrium based genetic algorithm yields better prediction accuracy.

  7. Developing single nucleotide polymorphism (SNP) markers from transcriptome sequences for identification of longan (Dimocarpus longan) germplasm

    PubMed Central

    Wang, Boyi; Tan, Hua-Wei; Fang, Wanping; Meinhardt, Lyndel W; Mischke, Sue; Matsumoto, Tracie; Zhang, Dapeng

    2015-01-01

    Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in 50 longan germplasm accessions, including cultivated varieties and wild germplasm; and designated 25 SNP markers that unambiguously identified all tested longan varieties with high statistical rigor (P<0.0001). Multiple trees from the same clone were verified and off-type trees were identified. Diversity analysis revealed genetic relationships among analyzed accessions. Cultivated varieties differed significantly from wild populations (Fst=0.300; P<0.001), demonstrating untapped genetic diversity for germplasm conservation and utilization. Within cultivated varieties, apparent differences between varieties from China and those from Thailand and Hawaii indicated geographic patterns of genetic differentiation. These SNP markers provide a powerful tool to manage longan genetic resources and breeding, with accurate and efficient genotype identification. PMID:26504559

  8. SNP discrimination through proofreading and OFF-switch of exo+ polymerase.

    PubMed

    Zhang, Jia; Li, Kai; Pardinas, Jose R; Liao, Duan F; Li, Hong J; Zhang, Xu

    2004-05-01

    Single nucleotide polymorphisms (SNPs) are useful physical markers for genetic studies as well as the cause of some genetic diseases. To develop more reliable SNP assays, we examined the underlying molecular mechanisms by which deoxyribonucleic acid (DNA) polymerases with 3' exonuclease activity maintain the high fidelity of DNA replication. In addition to mismatch removal by proofreading, we have discovered a premature termination of polymerization mediated by a novel OFF-switch mechanism. Two SNP assays were developed, one based on proofreading using 3' end-labeled primer extension and the other based on the newly identified OFF-switch, respectively. These two new assays are well suited for conventional techniques, such as electrophoresis and microplates detection systems as well as the sophisticated microchips. Application of these reliable SNP assays will greatly facilitate genetic and biomedical studies in the postgenome era.

  9. Copy Number Variation Analysis by Array Analysis of Single Cells Following Whole Genome Amplification.

    PubMed

    Dimitriadou, Eftychia; Zamani Esteki, Masoud; Vermeesch, Joris Robert

    2015-01-01

    Whole genome amplification is required to ensure the availability of sufficient material for copy number variation analysis of a genome deriving from an individual cell. Here, we describe the protocols we use for copy number variation analysis of non-fixed single cells by array-based approaches following single-cell isolation and whole genome amplification. We are focusing on two alternative protocols, an isothermal and a PCR-based whole genome amplification method, followed by either comparative genome hybridization (aCGH) or SNP array analysis, respectively.

  10. Development of highly reliable in silico SNP resource and genotyping assay from exome capture and sequencing: an example from black spruce (Picea mariana).

    PubMed

    Pavy, Nathalie; Gagnon, France; Deschênes, Astrid; Boyle, Brian; Beaulieu, Jean; Bousquet, Jean

    2016-03-01

    Picea mariana is a widely distributed boreal conifer across Canada and the subject of advanced breeding programmes for which population genomics and genomic selection approaches are being developed. Targeted sequencing was achieved after capturing P. mariana exome with probes designed from the sequenced transcriptome of Picea glauca, a distant relative. A high capture efficiency of 75.9% was reached although spruce has a complex and large genome including gene sequences interspersed by some long introns. The results confirmed the relevance of using probes from congeneric species to perform successfully interspecific exome capture in the genus Picea. A bioinformatics pipeline was developed including stringent criteria that helped detect a set of 97,075 highly reliable in silico SNPs. These SNPs were distributed across 14,909 genes. Part of an Infinium iSelect array was used to estimate the rate of true positives by validating 4267 of the predicted in silico SNPs by genotyping trees from P. mariana populations. The true positive rate was 96.2% for in silico SNPs, compared to a genotyping success rate of 96.7% for a set 1115 P. mariana control SNPs recycled from previous genotyping arrays. These results indicate the high success rate of the genotyping array and the relevance of the selection criteria used to delineate the new P. mariana in silico SNP resource. Furthermore, in silico SNPs were generally of medium to high frequency in natural populations, thus providing high informative value for future population genomics applications.

  11. k-merSNP discovery: Software for alignment-and reference-free scalable SNP discovery, phylogenetics, and annotation for hundreds of microbial genomes

    SciTech Connect

    2014-11-18

    With the flood of whole genome finished and draft microbial sequences, we need faster, more scalable bioinformatics tools for sequence comparison. An algorithm is described to find single nucleotide polymorphisms (SNPs) in whole genome data. It scales to hundreds of bacterial or viral genomes, and can be used for finished and/or draft genomes available as unassembled contigs or raw, unassembled reads. The method is fast to compute, finding SNPs and building a SNP phylogeny in minutes to hours, depending on the size and diversity of the input sequences. The SNP-based trees that result are consistent with known taxonomy and trees determined in other studies. The approach we describe can handle many gigabases of sequence in a single run. The algorithm is based on k-mer analysis.

  12. Identification of the varietal origin of loose leaf tea based on analysis of a single leaf by SNP nanofluidic array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Tea [Camellia sinensis (L.) O Kuntze] is an economically important crop cultivated in more than 50 countries. Production and marketing of premium specialty tea products provides opportunities for tea growers, the tea industry and consumers. Rapid market segmentation in the tea industry has resulted ...

  13. Development of genotyping by sequencing (GBS) and array derived SNP markers for stem rust resistance gene Sr42

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The stem rust fungus, particularly race TTKSK (Ug99), poses a serious threat to world wheat production. Gene Sr42 or SrCad (which could be the same gene or an allele of Sr42) is effective against race TTKSK. However, known genetic markers for Sr42 are mostly SSR markers which are generally labor i...

  14. Varietal identification of tea (Camellia sinensis [L.] Kuntze) using nanofluidic array of Single Nucleotide Polymorphism (SNP) markers

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Apart from water, tea is the world’s most widely consumed beverage. Tea is produced in more than 50 countries with an annual production of approximately 4.7 million tons. The market segment for specialty tea has been expanding rapidly owing to increased demand, resulting in higher revenues and profi...

  15. An integrated SNP mining and utilization (ISMU) pipeline for next generation sequencing data.

    PubMed

    Azam, Sarwar; Rathore, Abhishek; Shah, Trushar M; Telluri, Mohan; Amindala, BhanuPrakash; Ruperao, Pradeep; Katta, Mohan A V S K; Varshney, Rajeev K

    2014-01-01

    Open source single nucleotide polymorphism (SNP) discovery pipelines for next generation sequencing data commonly requires working knowledge of command line interface, massive computational resources and expertise which is a daunting task for biologists. Further, the SNP information generated may not be readily used for downstream processes such as genotyping. Hence, a comprehensive pipeline has been developed by integrating several open source next generation sequencing (NGS) tools along with a graphical user interface called Integrated SNP Mining and Utilization (ISMU) for SNP discovery and their utilization by developing genotyping assays. The pipeline features functionalities such as pre-processing of raw data, integration of open source alignment tools (Bowtie2, BWA, Maq, NovoAlign and SOAP2), SNP prediction (SAMtools/SOAPsnp/CNS2snp and CbCC) methods and interfaces for developing genotyping assays. The pipeline outputs a list of high quality SNPs between all pairwise combinations of genotypes analyzed, in addition to the reference genome/sequence. Visualization tools (Tablet and Flapjack) integrated into the pipeline enable inspection of the alignment and errors, if any. The pipeline also provides a confidence score or polymorphism information content value with flanking sequences for identified SNPs in standard format required for developing marker genotyping (KASP and Golden Gate) assays. The pipeline enables users to process a range of NGS datasets such as whole genome re-sequencing, restriction site associated DNA sequencing and transcriptome sequencing data at a fast speed. The pipeline is very useful for plant genetics and breeding community with no computational expertise in order to discover SNPs and utilize in genomics, genetics and breeding studies. The pipeline has been parallelized to process huge datasets of next generation sequencing. It has been developed in Java language and is available at http://hpc.icrisat.cgiar.org/ISMU as a standalone

  16. Using Hamming Distance as Information for SNP-Sets Clustering and Testing in Disease Association Studies.

    PubMed

    Wang, Charlotte; Kao, Wen-Hsin; Hsiao, Chuhsing Kate

    2015-01-01

    The availability of high-throughput genomic data has led to several challenges in recent genetic association studies, including the large number of genetic variants that must be considered and the computational complexity in statistical analyses. Tackling these problems with a marker-set study such as SNP-set analysis can be an efficient solution. To construct SNP-sets, we first propose a clustering algorithm, which employs Hamming distance to measure the similarity between strings of SNP genotypes and evaluates whether the given SNPs or SNP-sets should be clustered. A dendrogram can then be constructed based on such distance measure, and the number of clusters can be determined. With the resulting SNP-sets, we next develop an association test HDAT to examine susceptibility to the disease of interest. This proposed test assesses, based on Hamming distance, whether the similarity between a diseased and a normal individual differs from the similarity between two individuals of the same disease status. In our proposed methodology, only genotype information is needed. No inference of haplotypes is required, and SNPs under consideration do not need to locate in nearby regions. The proposed clustering algorithm and association test are illustrated with applications and simulation studies. As compared with other existing methods, the clustering algorithm is faster and better at identifying sets containing SNPs exerting a similar effect. In addition, the simulation studies demonstrated that the proposed test works well for SNP-sets containing a large proportion of neutral SNPs. Furthermore, employing the clustering algorithm before testing a large set of data improves the knowledge in confining the genetic regions for susceptible genetic markers.

  17. Viability of in-house datamarting approaches for population genetics analysis of SNP genotypes

    PubMed Central

    Amigo, Jorge; Phillips, Christopher; Salas, Antonio; Carracedo, Ángel

    2009-01-01

    Background Databases containing very large amounts of SNP (Single Nucleotide Polymorphism) data are now freely available for researchers interested in medical and/or population genetics applications. While many of these SNP repositories have implemented data retrieval tools for general-purpose mining, these alone cannot cover the broad spectrum of needs of most medical and population genetics studies. Results To address this limitation, we have built in-house customized data marts from the raw data provided by the largest public databases. In particular, for population genetics analysis based on genotypes we have built a set of data processing scripts that deal with raw data coming from the major SNP variation databases (e.g. HapMap, Perlegen), stripping them into single genotypes and then grouping them into populations, then merged with additional complementary descriptive information extracted from dbSNP. This allows not only in-house standardization and normalization of the genotyping data retrieved from different repositories, but also the calculation of statistical indices from simple allele frequency estimates to more elaborate genetic differentiation tests within populations, together with the ability to combine population samples from different databases. Conclusion The present study demonstrates the viability of implementing scripts for handling extensive datasets of SNP genotypes with low computational costs, dealing with certain complex issues that arise from the divergent nature and configuration of the most popular SNP repositories. The information contained in these databases can also be enriched with additional information obtained from other complementary databases, in order to build a dedicated data mart. Updating the data structure is straightforward, as well as permitting easy implementation of new external data and the computation of supplementary statistical indices of interest. PMID:19344481

  18. An Integrated SNP Mining and Utilization (ISMU) Pipeline for Next Generation Sequencing Data

    PubMed Central

    Azam, Sarwar; Rathore, Abhishek; Shah, Trushar M.; Telluri, Mohan; Amindala, BhanuPrakash; Ruperao, Pradeep; Katta, Mohan A. V. S. K.; Varshney, Rajeev K.

    2014-01-01

    Open source single nucleotide polymorphism (SNP) discovery pipelines for next generation sequencing data commonly requires working knowledge of command line interface, massive computational resources and expertise which is a daunting task for biologists. Further, the SNP information generated may not be readily used for downstream processes such as genotyping. Hence, a comprehensive pipeline has been developed by integrating several open source next generation sequencing (NGS) tools along with a graphical user interface called Integrated SNP Mining and Utilization (ISMU) for SNP discovery and their utilization by developing genotyping assays. The pipeline features functionalities such as pre-processing of raw data, integration of open source alignment tools (Bowtie2, BWA, Maq, NovoAlign and SOAP2), SNP prediction (SAMtools/SOAPsnp/CNS2snp and CbCC) methods and interfaces for developing genotyping assays. The pipeline outputs a list of high quality SNPs between all pairwise combinations of genotypes analyzed, in addition to the reference genome/sequence. Visualization tools (Tablet and Flapjack) integrated into the pipeline enable inspection of the alignment and errors, if any. The pipeline also provides a confidence score or polymorphism information content value with flanking sequences for identified SNPs in standard format required for developing marker genotyping (KASP and Golden Gate) assays. The pipeline enables users to process a range of NGS datasets such as whole genome re-sequencing, restriction site associated DNA sequencing and transcriptome sequencing data at a fast speed. The pipeline is very useful for plant genetics and breeding community with no computational expertise in order to discover SNPs and utilize in genomics, genetics and breeding studies. The pipeline has been parallelized to process huge datasets of next generation sequencing. It has been developed in Java language and is available at http://hpc.icrisat.cgiar.org/ISMU as a standalone

  19. Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography

    PubMed Central

    Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi

    2013-01-01

    New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO defined ‘elimination’ status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools were applied for a group of 201 multibacillary patients including six multi-case families from eleven departments. The location (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR-was developed for rapid and inexpensive strain typing of M. leprae based on copy numbers of two VNTR minisatellite loci 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54] predominantly distributed in the Atlantic departments and SNP7614 (T)/27-5(4)/12-5(5) [T45] associated with the Andean departments. A novel genotype SNP7614 (C)/27-5(6)/12-5(4) [C64] was detected in cities along the Magdalena river which separates the Andean from Atlantic departments; a subset was further characterized showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. PMID:23291420

  20. Axiom turkey genotyping array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The Axiom®Turkey Genotyping Array interrogates 643,845 probesets on the array, covering 643,845 SNPs. The array development was led by Dr. Julie Long of the USDA-ARS Beltsville Agricultural Research Center under a public-private partnership with Hendrix Genetics, Aviagen, and Affymetrix. The Turk...

  1. Oligonucleotide-arrayed TFT photosensor applicable for DNA chip technology.

    PubMed

    Tanaka, Tsuyoshi; Hatakeyama, Keiichi; Sawaguchi, Masahiro; Iwadate, Akihito; Mizutani, Yasushi; Sasaki, Kazuhiro; Tateishi, Naofumi; Takeyama, Haruko; Matsunaga, Tadashi

    2006-09-05

    A thin film transistor (TFT) photosensor fabricated by semiconductor integrated circuit (IC) technology was applied to DNA chip technology. The surface of the TFT photosensor was coated with TiO2 using a vapor deposition technique for the fabrication of optical filters. The immobilization of thiolated oligonucleotide probes onto a TiO2-coated TFT photosensor using gamma-aminopropyltriethoxysilane (APTES) and N-(gamma-maleimidobutyloxy) sulfosuccinimide ester (GMBS) was optimized. The coverage value of immobilized oligonucleotides reached a plateau at 33.7 pmol/cm2, which was similar to a previous analysis using radioisotope-labeled oligonucleotides. The lowest detection limits were 0.05 pmol/cm2 for quantum dot and 2.1 pmol/cm2 for Alexa Fluor 350. Furthermore, single nucleotide polymorphism (SNP) detection was examined using the oligonucleotide-arrayed TFT photosensor. A SNP present in the aldehyde dehydrogenase 2 (ALDH2) gene was used as a target. The SNPs in ALDH2*1 and ALDH2*2 target DNA were detected successfully using the TFT photosensor. DNA hybridization in the presence of both ALDH2*1 and ALDH2*2 target DNA was observed using both ALDH2*1 and ALDH2*2 detection oligonucleotides-arrayed TFT photosensor. Use of the TFT photosensor will allow the development of a disposable photodetecting device for DNA chip systems.

  2. High-Density SNP Map Construction and QTL Identification for the Apetalous Character in Brassica napus L.

    PubMed Central

    Wang, Xiaodong; Yu, Kunjiang; Li, Hongge; Peng, Qi; Chen, Feng; Zhang, Wei; Chen, Song; Hu, Maolong; Zhang, Jiefu

    2015-01-01

    The apetalous genotype is a morphological ideotype for increasing seed yield and should be of considerable agricultural use; however, only a few studies have focused on the genetic control of this trait in Brassica napus. In the present study, a recombinant inbred line, the AH population, containing 189 individuals was derived from a cross between an apetalous line ‘APL01’ and a normally petalled variety ‘Holly’. The Brassica 60 K Infinium BeadChip Array harboring 52,157 single nucleotide polymorphism (SNP) markers was used to genotype the AH individuals. A high-density genetic linkage map was constructed based on 2,755 bins involving 11,458 SNPs and 57 simple sequence repeats, and was used to identify loci associated with petalous degree (PDgr). The linkage map covered 2,027.53 cM, with an average marker interval of 0.72 cM. The AH map had good collinearity with the B. napus reference genome, indicating its high quality and accuracy. After phenotypic analyses across five different experiments, a total of 19 identified quantitative trait loci (QTLs) distributed across chromosomes A3, A5, A6, A9 and C8 were obtained, and these QTLs were further integrated into nine consensus QTLs by a meta-analysis. Interestingly, the major QTL qPD.C8-2 was consistently detected in all five experiments, and qPD.A9-2 and qPD.C8-3 were stably expressed in four experiments. Comparative mapping between the AH map and the B. napus reference genome suggested that there were 328 genes underlying the confidence intervals of the three steady QTLs. Based on the Gene Ontology assignments of 52 genes to the regulation of floral development in published studies, 146 genes were considered as potential candidate genes for PDgr. The current study carried out a QTL analysis for PDgr using a high-density SNP map in B. napus, providing novel targets for improving seed yield. These results advanced our understanding of the genetic control of PDgr regulation in B. napus. PMID:26779193

  3. SNPranker 2.0: a gene-centric data mining tool for diseases associated SNP prioritization in GWAS

    PubMed Central

    2013-01-01

    Background The capability of correlating specific genotypes with human diseases is a complex issue in spite of all advantages arisen from high-throughput technologies, such as Genome Wide Association Studies (GWAS). New tools for genetic variants interpretation and for Single Nucleotide Polymorphisms (SNPs) prioritization are actually needed. Given a list of the most relevant SNPs statistically associated to a specific pathology as result of a genotype study, a critical issue is the identification of genes that are effectively related to the disease by re-scoring the importance of the identified genetic variations. Vice versa, given a list of genes, it can be of great importance to predict which SNPs can be involved in the onset of a particular disease, in order to focus the research on their effects. Results We propose a new bioinformatics approach to support biological data mining in the analysis and interpretation of SNPs associated to pathologies. This system can be employed to design custom genotyping chips for disease-oriented studies and to re-score GWAS results. The proposed method relies (1) on the data integration of public resources using a gene-centric database design, (2) on the evaluation of a set of static biomolecular annotations, defined as features, and (3) on the SNP scoring function, which computes SNP scores using parameters and weights set by users. We employed a machine learning classifier to set default feature weights and an ontological annotation layer to enable the enrichment of the input gene set. We implemented our method as a web tool called SNPranker 2.0 (http://www.itb.cnr.it/snpranker), improving our first published release of this system. A user-friendly interface allows the input of a list of genes, SNPs or a biological process, and to customize the features set with relative weights. As result, SNPranker 2.0 returns a list of SNPs, localized within input and ontologically enriched genes, combined with their prioritization scores

  4. Clinical significance of previously cryptic copy number alterations and loss of heterozygosity in pediatric acute myeloid leukemia and myelodysplastic syndrome determined using combined array comparative genomic hybridization plus single-nucleotide polymorphism microarray analyses.

    PubMed

    Koh, Kyung-Nam; Lee, Jin Ok; Seo, Eul Ju; Lee, Seong Wook; Suh, Jin Kyung; Im, Ho Joon; Seo, Jong Jin

    2014-07-01

    The combined array comparative genomic hybridization plus single-nucleotide polymorphism microarray (CGH+SNP microarray) platform can simultaneously detect copy number alterations (CNA) and copy-neutral loss of heterozygosity (LOH). Eighteen children with acute myeloid leukemia (AML) (n=15) or myelodysplastic syndrome (MDS) (n=3) were studied using CGH+SNP microarray to evaluate the clinical significance of submicroscopic chromosomal aberrations. CGH+SNP microarray revealed CNAs at 14 regions in 9 patients, while metaphase cytogenetic (MC) analysis detected CNAs in 11 regions in 8 patients. Using CGH+SNP microarray, LOHs>10 Mb involving terminal regions or the whole chromosome were detected in 3 of 18 patients (17%). CGH+SNP microarray revealed cryptic LOHs with or without CNAs in 3 of 5 patients with normal karyotypes. CGH+SNP microarray detected additional cryptic CNAs (n=2) and LOHs (n=5) in 6 of 13 patients with abnormal MC. In total, 9 patients demonstrated additional aberrations, including CNAs (n=3) and/or LOHs (n=8). Three of 15 patients with AML and terminal LOH>10 Mb demonstrated a significantly inferior relapse-free survival rate (P=0.041). This study demonstrates that CGH+SNP microarray can simultaneously detect previously cryptic CNAs and LOH, which may demonstrate prognostic implications.

  5. MDM2 promoter SNP55 (rs2870820) affects risk of colon cancer but not breast-, lung-, or prostate cancer

    PubMed Central

    Helwa, Reham; Gansmo, Liv B.; Romundstad, Pål; Hveem, Kristian; Vatten, Lars; Ryan, Bríd M.; Harris, Curtis C.; Lønning, Per E.; Knappskog, Stian

    2016-01-01

    Two functional SNPs (SNP285G > C; rs117039649 and SNP309T > G; rs2279744) have previously been reported to modulate Sp1 transcription factor binding to the promoter of the proto-oncogene MDM2, and to influence cancer risk. Recently, a third SNP (SNP55C > T; rs2870820) was also reported to affect Sp1 binding and MDM2 transcription. In this large population based case-control study, we genotyped MDM2 SNP55 in 10,779 Caucasian individuals, previously genotyped for SNP309 and SNP285, including cases of colon (n = 1,524), lung (n = 1,323), breast (n = 1,709) and prostate cancer (n = 2,488) and 3,735 non-cancer controls, as well as 299 healthy African-Americans. Applying the dominant model, we found an elevated risk of colon cancer among individuals harbouring SNP55TT/CT genotypes compared to the SNP55CC genotype (OR = 1.15; 95% CI = 1.01–1.30). The risk was found to be highest for left-sided colon cancer (OR = 1.21; 95% CI = 1.00–1.45) and among females (OR = 1.32; 95% CI = 1.01–1.74). Assessing combined genotypes, we found the highest risk of colon cancer among individuals harbouring the SNP55TT or CT together with the SNP309TG genotype (OR = 1.21; 95% CI = 1.00–1.46). Supporting the conclusions from the risk estimates, we found colon cancer cases carrying the SNP55TT/CT genotypes to be diagnosed at younger age as compared to SNP55CC (p = 0.053), in particular among patients carrying the SNP309TG/TT genotypes (p = 0.009). PMID:27624283

  6. Priming of seeds with nitric oxide donor sodium nitroprusside (SNP) alleviates the inhibition on wheat seed germination by salt stress.

    PubMed

    Duan, Pei; Ding, Feng; Wang, Fang; Wang, Bao-Shan

    2007-06-01

    The effect of SNP, an NO donor, on seed germination of wheat (Triticum aestivum L. cv. 'DK961') under salt stress was studied. The results showed that priming of seeds with 0.06 mmol/L SNP for 24 h markedly alleviated the decrease of the germination percentage, germination index, vigor index and imbibition rate of wheat seeds under salt stress. SNP significantly alleviated the decrease of the beta-amylase activity but almost did not affect the alpha-amylase activity of wheat seeds under salt stress. SNP slightly increased the alpha-amylase isoenzymes (especially isoenzyme 3) and significantly increased the beta-amylase isoenzymes (especially isoenzyme d, e, f and g). SNP pretreatment decreased Na(+) content, but increased the K(+) content, resulting in a mark increase of K(+)/Na(+) ratio of wheat seedlings under salt stress. These results suggested that NO is involved in promoting wheat seed germination under salt stress by increasing the beta-amylase activity.

  7. Changes in variance explained by top SNP windows over generations for three traits in broiler chicken

    PubMed Central

    Fragomeni, Breno de Oliveira; Misztal, Ignacy; Lourenco, Daniela Lino; Aguilar, Ignacio; Okimoto, Ronald; Muir, William M.

    2014-01-01

    The purpose of this study was to determine if the set of genomic regions inferred as accounting for the majority of genetic variation in quantitative traits remain stable over multiple generations of selection. The data set contained phenotypes for five generations of broiler chicken for body weight, breast meat, and leg score. The population consisted of 294,632 animals over five generations and also included genotypes of 41,036 single nucleotide polymorphism (SNP) for 4,866 animals, after quality control. The SNP effects were calculated by a GWAS type analysis using single step genomic BLUP approach for generations 1–3, 2–4, 3–5, and 1–5. Variances were calculated for windows of 20 SNP. The top ten windows for each trait that explained the largest fraction of the genetic variance across generations were examined. Across generations, the top 10 windows explained more than 0.5% but less than 1% of the total variance. Also, the pattern of the windows was not consistent across generations. The windows that explained the greatest variance changed greatly among the combinations of generations, with a few exceptions. In many cases, a window identified as top for one combination, explained less than 0.1% for the other combinations. We conclude that identification of top SNP windows for a population may have little predictive power for genetic selection in the following generations for the traits here evaluated. PMID:25324857

  8. SNP discovery in complex allotetraploid genomes (Gossypium spp., Malvaceae) using genotyping by sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Dramatic decreases in the cost of DNA sequencing have enabled the development of very large numbers of markers based on single nucleotide polymorphism (SNP) for phylogenetic studies, population genetics, linkage mapping, marker-assisted breeding and other applications. Using Illumina next-generatio...

  9. Multiplexed SNP genotyping using the Qbead™ system: a quantum dot-encoded microsphere-based assay

    PubMed Central

    Xu, Hongxia; Sha, Michael Y.; Wong, Edith Y.; Uphoff, Janet; Xu, Yanzhang; Treadway, Joseph A.; Truong, Anh; O’Brien, Eamonn; Asquith, Steven; Stubbins, Michael; Spurr, Nigel K.; Lai, Eric H.; Mahoney, Walt

    2003-01-01

    We have developed a new method using the Qbead™ system for high-throughput genotyping of single nucleotide polymorphisms (SNPs). The Qbead system employs fluorescent Qdot™ semiconductor nanocrystals, also known as quantum dots, to encode microspheres that subsequently can be used as a platform for multiplexed assays. By combining mixtures of quantum dots with distinct emission wavelengths and intensities, unique spectral ‘barcodes’ are created that enable the high levels of multiplexing required for complex genetic analyses. Here, we applied the Qbead system to SNP genotyping by encoding microspheres conjugated to allele-specific oligonucleotides. After hybridization of oligonucleotides to amplicons produced by multiplexed PCR of genomic DNA, individual microspheres are analyzed by flow cytometry and each SNP is distinguished by its unique spectral barcode. Using 10 model SNPs, we validated the Qbead system as an accurate and reliable technique for multiplexed SNP genotyping. By modifying the types of probes conjugated to microspheres, the Qbead system can easily be adapted to other assay chemistries for SNP genotyping as well as to other applications such as analysis of gene expression and protein–protein interactions. With its capability for high-throughput automation, the Qbead system has the potential to be a robust and cost-effective platform for a number of applications. PMID:12682378

  10. Identification of a SNP marker associated with WB242 nematode resistance in sugar beet

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The beet-cyst nematode (Heterodera schachtii Schmidt) is one of the major diseases of sugar beet. The identification of molecular markers associated to the nematode resistance would be helpful for developing resistant varieties. The aim of this study was the identification of SNP (Single Nucleotide ...

  11. Utilization of a whole genome SNP panel for efficient genetic mapping in the mouse

    PubMed Central

    Moran, Jennifer L.; Bolton, Andrew D.; Tran, Pamela V.; Brown, Alison; Dwyer, Noelle D.; Manning, Danielle K.; Bjork, Bryan C.; Li, Cheng; Montgomery, Kate; Siepka, Sandra M.; Vitaterna, Martha Hotz; Takahashi, Joseph S.; Wiltshire, Tim; Kwiatkowski, David J.; Kucherlapati, Raju; Beier, David R.

    2006-01-01

    Phenotype-driven genetics can be used to create mouse models of human disease and birth defects. However, the utility of these mutant models is limited without identification of the causal gene. To facilitate genetic mapping, we developed a fixed single nucleotide polymorphism (SNP) panel of 394 SNPs as an alternative to analyses using simple sequence length polymorphism (SSLP) marker mapping. With the SNP panel, chromosomal locations for 22 monogenic mutants were identified. The average number of affected progeny genotyped for mapped monogenic mutations is nine. Map locations for several mutants have been obtained with as few as four affected progeny. The average size of genetic intervals obtained for these mutants is 43 Mb, with a range of 17–83 Mb. Thus, our SNP panel allows for identification of moderate resolution map position with small numbers of mice in a high-throughput manner. Importantly, the panel is suitable for mapping crosses from many inbred and wild-derived inbred strain combinations. The chromosomal localizations obtained with the SNP panel allow one to quickly distinguish between potentially novel loci or remutations in known genes, and facilitates fine mapping and positional cloning. By using this approach, we identified DNA sequence changes in two ethylnitrosourea-induced mutants. PMID:16461637

  12. Verification of genetic identity of introduced cacao germplasm in Ghana using single nucleotide polymorphism (SNP) markers

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Accurate identification of individual genotypes is important for cacao (Theobroma cacao L.) breeding, germplasm conservation and seed propagation. The development of single nucleotide polymorphism (SNP) markers in cacao offers an effective way to use a high-throughput genotyping system for cacao gen...

  13. Applying SNP marker technology in the cacao breeding program at the Cocoa Research Institute of Ghana

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In this investigation 45 parental cacao plants and five progeny derived from the parental stock studied were genotyped using six SNP markers to determine off-types or mislabeled clones and to authenticate crosses made in the Cocoa Research Institute of Ghana (CRIG) breeding program. Investigation wa...

  14. A web-based genome browser for 'SNP-aware' assay design

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Human and animal genomes contain an abundance of single nucleotide polymorphisms (SNPs) that are useful for genetic testing. However, the relatively large number of SNPs present in diverse populations can pose serious problems when designing assays. It is important to “mask” some SNP positions so ...

  15. SNP-based genotyping in lentil: linking sequence information with phenotypes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Lentil (Lens culinaris) has been late to enter the world of high throughput molecular analysis due to a general lack of genomic resources. Using a 454 sequencing-based approach, SNPs have been identified in genes across the lentil genome. Several hundred have been turned into single SNP KASP assay...

  16. High-throughput RAD-SNP genotyping for characterization of sugar beet genotypes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    High-throughput SNP genotyping provides a rapid way of developing resourceful set of markers for delineating the genetic architecture and for effective species discrimination. In the presented research, we demonstrate a set of 192 SNPs for effective genotyping in sugar beet using high-throughput mar...

  17. The use of SNP data for the monitoring of genetic diversity in cattle breeds

    Technology Transfer Automated Retrieval System (TEKTRAN)

    LD between SNPs contains information about effective population size. In this study, we investigate the use of genome-wide SNP data for marker based estimation of effective population size for two taurine cattle breeds of Africa and two local cattle breeds of Switzerland. Estimated recombination rat...

  18. Microsatellite Imputation for parental verification from SNP across multiple Bos taurus and indicus breeds

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Microsatellite markers (MS) have traditionally been used for parental verification and are still the international standard in spite of their higher cost, error rate, and turnaround time compared with Single Nucleotide Polymorphisms (SNP)-based assays. Despite domestic and international demands fro...

  19. An improved consensus linkage map of barley based on flow-sorted chromosomes and SNP markers

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Recent advances in high-throughput genotyping have made it easier to combine information from different mapping populations into consensus genetic maps, which provide increased marker density and genome coverage compared to individual maps. Previously, a SNP-based genotyping platform was developed a...

  20. Mining for SNPs and SSRs using SNPServer, dbSNP and SSR taxonomy tree.

    PubMed

    Batley, Jacqueline; Edwards, David

    2009-01-01

    Molecular genetic markers represent one of the most powerful tools for the analysis of genomes and the association of heritable traits with underlying genetic variation. The development of high-throughput methods for the detection of single nucleotide polymorphisms (SNPs) and simple sequence repeats (SSRs) has led to a revolution in their use as molecular markers. The availability of large sequence data sets permits mining for these molecular markers, which may then be used for applications such as genetic trait mapping, diversity analysis and marker assisted selection in agriculture. Here we describe web-based automated methods for the discovery of SSRs using SSR taxonomy tree, the discovery of SNPs from sequence data using SNPServer and the identification of validated SNPs from within the dbSNP database. SSR taxonomy tree identifies pre-determined SSR amplification primers for virtually all species represented within the GenBank database. SNPServer uses a redundancy based approach to identify SNPs within DNA sequences. Following submission of a sequence of interest, SNPServer uses BLAST to identify similar sequences, CAP3 to cluster and assemble these sequences and then the SNP discovery software autoSNP to detect SNPs and insertion/deletion (indel) polymorphisms. The NCBI dbSNP database is a catalogue of molecular variation, hosting validated SNPs for several species within a public-domain archive.

  1. The impact of SNP fingerprinting and parentage analysis on the effectiveness of variety recommendations in cacao

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Evidence for the impact of mislabeling and/or pollen contamination on consistency of field performance has been lacking to reinforce the need for strict adherence to quality control protocols in cacao seed garden and germplasm plot management. The present study used SNP fingerprinting at 64 loci to ...

  2. Association mapping of resistance to leaf rust in emmer wheat using high throughput SNP markers

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Emmer wheat (Triticum turgidum L. subsp. dicoccum) is known to be a useful source of genes for many desirable characters for improvement of modern cultivated wheat. Recently, a panel of 181 emmer wheat accessions has been genotyped with wheat 9K SNP (single nucleotide polymorphism) markers and exte...

  3. EvoSNP-DB: A database of genetic diversity in East Asian populations

    PubMed Central

    Kim, Young Uk; Kim, Young Jin; Lee, Jong-Young; Park, Kiejung

    2013-01-01

    Genome-wide association studies (GWAS) have become popular as an approach for the identification of large numbers of phenotype-associated variants. However, differences in genetic architecture and environmental factors mean that the effect of variants can vary across populations. Understanding population genetic diversity is valuable for the investigation of possible population specific and independent effects of variants. EvoSNP-DB aims to provide information regarding genetic diversity among East Asian populations, including Chinese, Japanese, and Korean. Non-redundant SNPs (1.6 million) were genotyped in 54 Korean trios (162 samples) and were compared with 4 million SNPs from HapMap phase II populations. EvoSNP-DB provides two user interfaces for data query and visualization, and integrates scores of genetic diversity (Fst and VarLD) at the level of SNPs, genes, and chromosome regions. EvoSNP-DB is a web-based application that allows users to navigate and visualize measurements of population genetic differences in an interactive manner, and is available online at [http://biomi.cdc.go.kr/EvoSNP/]. [BMB Reports 2013; 46(8): 416-421] PMID:23977990

  4. Longevity and Plasticity of CFTR Provide an Argument for Noncanonical SNP Organization in Hominid DNA

    PubMed Central

    Hill, Aubrey E.; Plyler, Zackery E.; Tiwari, Hemant; Patki, Amit; Tully, Joel P.; McAtee, Christopher W.; Moseley, Leah A.; Sorscher, Eric J.

    2014-01-01

    Like many other ancient genes, the cystic fibrosis transmembrane conductance regulator (CFTR) has survived for hundreds of millions of years. In this report, we consider whether such prodigious longevity of an individual gene – as opposed to an entire genome or species – should be considered surprising in the face of eons of relentless DNA replication errors, mutagenesis, and other causes of sequence polymorphism. The conventions that modern human SNP patterns result either from purifying selection or random (neutral) drift were not well supported, since extant models account rather poorly for the known plasticity and function (or the established SNP distributions) found in a multitude of genes such as CFTR. Instead, our analysis can be taken as a polemic indicating that SNPs in CFTR and many other mammalian genes may have been generated—and continue to accrue—in a fundamentally more organized manner than would otherwise have been expected. The resulting viewpoint contradicts earlier claims of ‘directional’ or ‘intelligent design-type’ SNP formation, and has important implications regarding the pace of DNA adaptation, the genesis of conserved non-coding DNA, and the extent to which eukaryotic SNP formation should be viewed as adaptive. PMID:25350658

  5. SNP-microarrays can accurately identify the presence of an individual in complex forensic DNA mixtures.

    PubMed

    Voskoboinik, Lev; Ayers, Sheri B; LeFebvre, Aaron K; Darvasi, Ariel

    2015-05-01

    Common forensic and mass disaster scenarios present DNA evidence that comprises a mixture of several contributors. Identifying the presence of an individual in such mixtures has proven difficult. In the current study, we evaluate the practical usefulness of currently available "off-the-shelf" SNP microarrays for such purposes. We found that a set of 3000 SNPs specifically selected for this purpose can accurately identify the presence of an individual in complex DNA mixtures of various compositions. For example, individuals contributing as little as 5% to a complex DNA mixture can be robustly identified even if the starting DNA amount was as little as 5.0ng and had undergone whole-genome amplification (WGA) prior to SNP analysis. The work presented in this study represents proof-of-principle that our previously proposed approach, can work with real "forensic-type" samples. Furthermore, in the absence of a low-density focused forensic SNP microarray, the use of standard, currently available high-density SNP microarrays can be similarly used and even increase statistical power due to the larger amount of available information.

  6. The identification of SNPs with indeterminate positions using the Equine SNP50 BeadChip.

    PubMed

    Corbin, L J; Blott, S C; Swinburne, J E; Vaudin, M; Bishop, S C; Woolliams, J A

    2012-06-01

    We have used linkage disequilibrium (LD) to identify single nucleotide polymorphisms (SNPs) on the Illumina Equine SNP50 BeadChip, which may be incorrectly positioned on the genome map. A total of 1201 Thoroughbred horses were genotyped using the Illumina Equine SNP50 BeadChip. LD was evaluated in a pairwise fashion between all autosomal SNPs, both within and across chromosomes. Filters were then applied to the data, firstly to identify SNPs that may have been mapped to the wrong chromosome and secondly to identify SNPs that may have been incorrectly positioned within chromosomes. We identified a single SNP on ECA28, which showed low LD with neighbouring SNPs but considerable LD with a group of SNPs on ECA10. Furthermore, a cluster of SNPs on ECA5 showed unusually low LD with surrounding SNPs. A total of 39 SNPs met the criteria for unusual within-chromosome LD. The results of this study indicate that some SNPs may be misplaced. This finding is significant, as misplaced SNPs may lead to difficulties in the application of genomic methods, such as homozygosity mapping, for which SNP order is important.

  7. Analysis of gene-derived SNP marker polymorphism in wheat (Triticum aestivum L.)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In this study, we analyzed 359 single nucleotide polymorphisms (SNPs) previously discovered in intron sequences of wheat genes to evaluate SNP marker polymorphism in common wheat (Triticum aestivum L.). These SNPs showed an average polymorphism information content (PIC) of 0.181 among 20 US wheat c...

  8. SNP-based high density genetic map and mapping of btwd1 dwarfing gene in barley

    PubMed Central

    Ren, Xifeng; Wang, Jibin; Liu, Lipan; Sun, Genlou; Li, Chengdao; Luo, Hong; Sun, Dongfa

    2016-01-01

    A high-density linkage map is a valuable tool for functional genomics and breeding. A newly developed sequence-based marker technology, restriction site associated DNA (RAD) sequencing, has been proven to be powerful for the rapid discovery and genotyping of genome-wide single nucleotide polymorphism (SNP) markers and for the high-density genetic map construction. The objective of this research was to construct a high-density genetic map of barley using RAD sequencing. 1894 high-quality SNP markers were developed and mapped onto all seven chromosomes together with 68 SSR markers. These 1962 markers constituted a total genetic length of 1375.8 cM and an average of 0.7 cM between adjacent loci. The number of markers within each linkage group ranged from 209 to 396. The new recessive dwarfing gene btwd1 in Huaai 11 was mapped onto the high density linkage maps. The result showed that the btwd1 is positioned between SNP marks 7HL_6335336 and 7_249275418 with a genetic distance of 0.9 cM and 0.7 cM on chromosome 7H, respectively. The SNP-based high-density genetic map developed and the dwarfing gene btwd1 mapped in this study provide critical information for position cloning of the btwd1 gene and molecular breeding of barley. PMID:27530597

  9. Changes in variance explained by top SNP windows over generations for three traits in broiler chicken.

    PubMed

    Fragomeni, Breno de Oliveira; Misztal, Ignacy; Lourenco, Daniela Lino; Aguilar, Ignacio; Okimoto, Ronald; Muir, William M

    2014-01-01

    The purpose of this study was to determine if the set of genomic regions inferred as accounting for the majority of genetic variation in quantitative traits remain stable over multiple generations of selection. The data set contained phenotypes for five generations of broiler chicken for body weight, breast meat, and leg score. The population consisted of 294,632 animals over five generations and also included genotypes of 41,036 single nucleotide polymorphism (SNP) for 4,866 animals, after quality control. The SNP effects were calculated by a GWAS type analysis using single step genomic BLUP approach for generations 1-3, 2-4, 3-5, and 1-5. Variances were calculated for windows of 20 SNP. The top ten windows for each trait that explained the largest fraction of the genetic variance across generations were examined. Across generations, the top 10 windows explained more than 0.5% but less than 1% of the total variance. Also, the pattern of the windows was not consistent across generations. The windows that explained the greatest variance changed greatly among the combinations of generations, with a few exceptions. In many cases, a window identified as top for one combination, explained less than 0.1% for the other combinations. We conclude that identification of top SNP windows for a population may have little predictive power for genetic selection in the following generations for the traits here evaluated.

  10. Longevity and plasticity of CFTR provide an argument for noncanonical SNP organization in hominid DNA.

    PubMed

    Hill, Aubrey E; Plyler, Zackery E; Tiwari, Hemant; Patki, Amit; Tully, Joel P; McAtee, Christopher W; Moseley, Leah A; Sorscher, Eric J

    2014-01-01

    Like many other ancient genes, the cystic fibrosis transmembrane conductance regulator (CFTR) has survived for hundreds of millions of years. In this report, we consider whether such prodigious longevity of an individual gene--as opposed to an entire genome or species--should be considered surprising in the face of eons of relentless DNA replication errors, mutagenesis, and other causes of sequence polymorphism. The conventions that modern human SNP patterns result either from purifying selection or random (neutral) drift were not well supported, since extant models account rather poorly for the known plasticity and function (or the established SNP distributions) found in a multitude of genes such as CFTR. Instead, our analysis can be taken as a polemic indicating that SNPs in CFTR and many other mammalian genes may have been generated--and continue to accrue--in a fundamentally more organized manner than would otherwise have been expected. The resulting viewpoint contradicts earlier claims of 'directional' or 'intelligent design-type' SNP formation, and has important implications regarding the pace of DNA adaptation, the genesis of conserved non-coding DNA, and the extent to which eukaryotic SNP formation should be viewed as adaptive.

  11. MAFsnp: A Multi-Sample Accurate and Flexible SNP Caller Using Next-Generation Sequencing Data.

    PubMed

    Hu, Jiyuan; Li, Tengfei; Xiu, Zidi; Zhang, Hong

    2015-01-01

    Most existing statistical methods developed for calling single nucleotide polymorphisms (SNPs) using next-generation sequencing (NGS) data are based on Bayesian frameworks, and there does not exist any SNP caller that produces p-values for calling SNPs in a frequentist framework. To fill in this gap, we develop a new method MAFsnp, a Multiple-sample based Accurate and Flexible algorithm for calling SNPs with NGS data. MAFsnp is based on an estimated likelihood ratio test (eLRT) statistic. In practical situation, the involved parameter is very close to the boundary of the parametric space, so the standard large sample property is not suitable to evaluate the finite-sample distribution of the eLRT statistic. Observing that the distribution of the test statistic is a mixture of zero and a continuous part, we propose to model the test statistic with a novel two-parameter mixture distribution. Once the parameters in the mixture distribution are estimated, p-values can be easily calculated for detecting SNPs, and the multiple-testing corrected p-values can be used to control false discovery rate (FDR) at any pre-specified level. With simulated data, MAFsnp is shown to have much better control of FDR than the existing SNP callers. Through the application to two real datasets, MAFsnp is also shown to outperform the existing SNP callers in terms of calling accuracy. An R package "MAFsnp" implementing the new SNP caller is freely available at http://homepage.fudan.edu.cn/zhangh/softwares/.

  12. Association of Agronomic Traits with SNP Markers in Durum Wheat (Triticum turgidum L. durum (Desf.))

    PubMed Central

    Hu, Xin; Ren, Jing; Ren, Xifeng; Huang, Sisi; Sabiel, Salih A. I.; Luo, Mingcheng; Nevo, Eviatar; Fu, Chunjie; Peng, Junhua; Sun, Dongfa

    2015-01-01

    Association mapping is a powerful approach to detect associations between traits of interest and genetic markers based on linkage disequilibrium (LD) in molecular plant breeding. In this study, 150 accessions of worldwide originated durum wheat germplasm (Triticum turgidum spp. durum) were genotyped using 1,366 SNP markers. The extent of LD on each chromosome was evaluated. Association of single nucleotide polymorphisms (SNP) markers with ten agronomic traits measured in four consecutive years was analyzed under a mix linear model (MLM). Two hundred and one significant association pairs were detected in the four years. Several markers were associated with one trait, and also some markers were associated with multiple traits. Some of the associated markers were in agreement with previous quantitative trait loci (QTL) analyses. The function and homology analyses of the corresponding ESTs of some SNP markers could explain many of the associations for plant height, length of main spike, number of spikelets on main spike, grain number per plant, and 1000-grain weight, etc. The SNP associations for the observed traits are generally clustered in specific chromosome regions of the wheat genome, mainly in 2A, 5A, 6A, 7A, 1B, and 6B chromosomes. This study demonstrates that association mapping can complement and enhance previous QTL analyses and provide additional information for marker-assisted selection. PMID:26110423

  13. Superconducting Bolometer Array Architectures

    NASA Technical Reports Server (NTRS)

    Benford, Dominic; Chervenak, Jay; Irwin, Kent; Moseley, S. Harvey; Shafer, Rick; Staguhn, Johannes; Wollack, Ed; Oegerle, William (Technical Monitor)

    2002-01-01

    The next generation of far-infrared and submillimeter instruments require large arrays of detectors containing thousands of elements. These arrays will necessarily be multiplexed, and superconducting bolometer arrays are the most promising present prospect for these detectors. We discuss our current research into superconducting bolometer array technologies, which has recently resulted in the first multiplexed detections of submillimeter light and the first multiplexed astronomical observations. Prototype arrays containing 512 pixels are in production using the Pop-Up Detector (PUD) architecture, which can be extended easily to 1000 pixel arrays. Planar arrays of close-packed bolometers are being developed for the GBT (Green Bank Telescope) and for future space missions. For certain applications, such as a slewed far-infrared sky survey, feedhorncoupling of a large sparsely-filled array of bolometers is desirable, and is being developed using photolithographic feedhorn arrays. Individual detectors have achieved a Noise Equivalent Power (NEP) of -10(exp 17) W/square root of Hz at 300mK, but several orders of magnitude improvement are required and can be reached with existing technology. The testing of such ultralow-background detectors will prove difficult, as this requires optical loading of below IfW. Antenna-coupled bolometer designs have advantages for large format array designs at low powers due to their mode selectivity.

  14. Electronic Switch Arrays for Managing Microbattery Arrays

    NASA Technical Reports Server (NTRS)

    Mojarradi, Mohammad; Alahmad, Mahmoud; Sukumar, Vinesh; Zghoul, Fadi; Buck, Kevin; Hess, Herbert; Li, Harry; Cox, David

    2008-01-01

    Integrated circuits have been invented for managing the charging and discharging of such advanced miniature energy-storage devices as planar arrays of microscopic energy-storage elements [typically, microscopic electrochemical cells (microbatteries) or microcapacitors]. The architecture of these circuits enables implementation of the following energy-management options: dynamic configuration of the elements of an array into a series or parallel combination of banks (subarrarys), each array comprising a series of parallel combination of elements; direct addressing of individual banks for charging/or discharging; and, disconnection of defective elements and corresponding reconfiguration of the rest of the array to utilize the remaining functional elements to obtain the desited voltage and current performance. An integrated circuit according to the invention consists partly of a planar array of field-effect transistors that function as switches for routing electric power among the energy-storage elements, the power source, and the load. To connect the energy-storage elements to the power source for charging, a specific subset of switches is closed; to connect the energy-storage elements to the load for discharging, a different specific set of switches is closed. Also included in the integrated circuit is circuitry for monitoring and controlling charging and discharging. The control and monitoring circuitry, the switching transistors, and interconnecting metal lines are laid out on the integrated-circuit chip in a pattern that registers with the array of energy-storage elements. There is a design option to either (1) fabricate the energy-storage elements in the corresponding locations on, and as an integral part of, this integrated circuit; or (2) following a flip-chip approach, fabricate the array of energy-storage elements on a separate integrated-circuit chip and then align and bond the two chips together.

  15. Detecting SNP combinations discriminating human populations from HapMap data.

    PubMed

    Ding, XiaoJun; Li, Min; Gu, HaiHua; Peng, XiaoQing; Zhang, Zhen; Wu, FangXiang

    2015-03-01

    The genomes of different human beings are similar. There are only a relatively small number of genetic differences between people. The genetic differences between people are very worthy of study. Researchers have proposed the fixation index FST measurement to find the single nucleotide polymorphisms (SNPs) which can reflect human population differences. However, most SNPs have interactions and they work together, which leads to the differences among human populations. The number of all possible m-locus combinations chosen from n SNPs grows exponentially. Most methods concern on 2-locus interactions. In this paper, we propose a novel method to find a new coordinate system under which the energy distributions of different populations are quite different. We select out candidate SNPs from n SNPs by using the information of the axes in the coordinate system. The number of candidate SNPs is small, thus SNP-SNP interactions can be searched efficiently. The method can also find interactions of more than two loci. These interactions should be able to reflect the evolution of human populations from another way. The numbers of SNP-SNP interactions are regarded as the differences between pairwise populations and a hierarchical clustering algorithm is used to construct the evolutionary tree. In the experiments, we apply the method to SNP data of four chromosomes separately and the trees constructed on these four chromosomes are highly consistent. Furthermore, the trees are also consistent with previous studies, which indicates that evolutionary information is well mined. The method provides a new insight to analyze the human population differences.

  16. MDM2 SNP309 polymorphism contributes to endometrial cancer susceptibility: evidence from a meta-analysis

    PubMed Central

    2013-01-01

    Objective The SNP309 polymorphism (T-G) in the promoter of MDM2 gene has been reported to be associated with enhanced MDM2 expression and tumor development. Studies investigating the association between MDM2 SNP309 polymorphism and endometrial cancer risk reported conflicting results. We performed a meta-analysis of all available studies to explore this association. Methods All studies published up to August 2013 on the association between MDM2 SNP309 polymorphism and endometrial cancer risk were identified by searching electronic databases PubMed, Web of Science, EMBASE, and Chinese Biomedical Literature database (CBM). The association between the MDM2 SNP309 polymorphism and endometrial cancer risk was assessed by odds ratios (ORs) together with their 95% confidence intervals (CIs). Results Eight case–control studies with 2069 endometrial cancer cases and 4546 controls were identified. Overall, significant increase of endometrial cancer risk was found when all studies were pooled in the meta-analysis (GG vs. TT: OR = 1.464, 95% CI 1.246–1.721, P < 0.001; GG vs. TG + TT: OR = 1.726, 95% CI 1.251–2.380, P = 0.001; GG + TG vs. TT: OR = 1.169, 95% CI 1.048–1.304, P = 0.005). In subgroup analysis by ethnicity and HWE in controls, significant increase of endometrial cancer risks were observed in Caucasians and studies consistent with HWE. In subgroup analysis according to study quality, significant associations were observed in both high quality studies and low quality studies. Conclusions This meta-analysis suggests that MDM2 SNP309 polymorphism contributes to endometrial cancer susceptibility, especially in Caucasian populations. Further large and well-designed studies are needed to confirm this association. PMID:24423195

  17. SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate

    USGS Publications Warehouse

    Roffler, Gretchen H.; Amish, Stephen J.; Smith, Seth; Cosart, Ted F.; Kardos, Marty; Schwartz, Michael K.; Luikart, Gordon

    2016-01-01

    Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding and nearby 5′ and 3′ untranslated regions of chosen candidate genes. Targeted sequences were taken from bighorn sheep (Ovis canadensis) exon capture data and directly from the domestic sheep genome (Ovis aries v. 3; oviAri3). The bighorn sheep sequences used in the Dall's sheep (Ovis dalli dalli) exon capture aligned to 2350 genes on the oviAri3 genome with an average of 2 exons each. We developed a microfluidic qPCR-based SNP chip to genotype 476 Dall's sheep from locations across their range and test for patterns of selection. Using multiple corroborating approaches (lositan and bayescan), we detected 28 SNP loci potentially under selection. We additionally identified candidate loci significantly associated with latitude, longitude, precipitation and temperature, suggesting local environmental adaptation. The three methods demonstrated consistent support for natural selection on nine genes with immune and disease-regulating functions (e.g. Ovar-DRA, APC, BATF2, MAGEB18), cell regulation signalling pathways (e.g. KRIT1, PI3K, ORRC3), and respiratory health (CYSLTR1). Characterizing adaptive allele distributions from novel genetic techniques will facilitate investigation of the influence of environmental variation on local adaptation of a northern alpine ungulate throughout its range. This research demonstrated the utility of exon capture for gene-targeted SNP discovery and subsequent SNP chip genotyping using low-quality samples in a nonmodel species.

  18. SNP discovery in the transcriptome of white Pacific shrimp Litopenaeus vannamei by next generation sequencing.

    PubMed

    Yu, Yang; Wei, Jiankai; Zhang, Xiaojun; Liu, Jingwen; Liu, Chengzhang; Li, Fuhua; Xiang, Jianhai

    2014-01-01

    The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP) discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei) generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp) and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies.

  19. Gradient Boosting as a SNP Filter: an Evaluation Using Simulated and Hair Morphology Data

    PubMed Central

    Lubke, GH; Laurin, C; Walters, R; Eriksson, N; Hysi, P; Spector, TD; Montgomery, GW; Martin, NG; Medland, SE; Boomsma, DI

    2013-01-01

    Typically, genome-wide association studies consist of regressing the phenotype on each SNP separately using an additive genetic model. Although statistical models for recessive, dominant, SNP-SNP, or SNP-environment interactions exist, the testing burden makes an evaluation of all possible effects impractical for genome-wide data. We advocate a two-step approach where the first step consists of a filter that is sensitive to different types of SNP main and interactions effects. The aim is to substantially reduce the number of SNPs such that more specific modeling becomes feasible in a second step. We provide an evaluation of a statistical learning method called “gradient boosting machine” (GBM) that can be used as a filter. GBM does not require an a priori specification of a genetic model, and permits inclusion of large numbers of covariates. GBM can therefore be used to explore multiple GxE interactions, which would not be feasible within the parametric framework used in GWAS. We show in a simulation that GBM performs well even under conditions favorable to the standard additive regression model commonly used in GWAS, and is sensitive to the detection of interaction effects even if one of the interacting variables has a zero main effect. The latter would not be detected in GWAS. Our evaluation is accompanied by an analysis of empirical data concerning hair morphology. We estimate the phenotypic variance explained by increasing numbers of highest ranked SNPs, and show that it is sufficient to select 10K-20K SNPs in the first step of a two-step approach. PMID:24404405

  20. Carbon nanotube nanoelectrode arrays

    DOEpatents

    Ren, Zhifeng; Lin, Yuehe; Yantasee, Wassana; Liu, Guodong; Lu, Fang; Tu, Yi

    2008-11-18

    The present invention relates to microelectode arrays (MEAs), and more particularly to carbon nanotube nanoelectrode arrays (CNT-NEAs) for chemical and biological sensing, and methods of use. A nanoelectrode array includes a carbon nanotube material comprising an array of substantially linear carbon nanotubes each having a proximal end and a distal end, the proximal end of the carbon nanotubes are attached to a catalyst substrate material so as to form the array with a pre-determined site density, wherein the carbon nanotubes are aligned with respect to one another within the array; an electrically insulating layer on the surface of the carbon nanotube material, whereby the distal end of the carbon nanotubes extend beyond the electrically insulating layer; a second adhesive electrically insulating layer on the surface of the electrically insulating layer, whereby the distal end of the carbon nanotubes extend beyond the second adhesive electrically insulating layer; and a metal wire attached to the catalyst substrate material.

  1. Pacific Array (Transportable Broadband Ocean Floor Array)

    NASA Astrophysics Data System (ADS)

    Kawakatsu, Hitoshi; Ekstrom, Goran; Evans, Rob; Forsyth, Don; Gaherty, Jim; Kennett, Brian; Montagner, Jean-Paul; Utada, Hisashi

    2016-04-01

    Based on recent developments on broadband ocean bottom seismometry, we propose a next generation large-scale array experiment in the ocean. Recent advances in ocean bottom broadband seismometry1, together with advances in the seismic analysis methodology, have enabled us to resolve the regional 1-D structure of the entire lithosphere/asthenosphere system, including seismic anisotropy (azimuthal, and hopefully radial), with deployments of ~15 broadband ocean bottom seismometers (BBOBSs). Having ~15 BBOBSs as an array unit for a 2-year deployment, and repeating such deployments in a leap-frog way or concurrently (an array of arrays) for a decade or so would enable us to cover a large portion of the Pacific basin. Such efforts, not only by giving regional constraints on the 1-D structure beneath Pacific ocean, but also by sharing waveform data for global scale waveform tomography, would drastically increase our knowledge of how plate tectonics works on this planet, as well as how it worked for the past 150 million years. International collaborations is essential: if three countries/institutions participate this endeavor together, Pacific Array may be accomplished within five-or-so years.

  2. Active aperture phased arrays

    NASA Astrophysics Data System (ADS)

    Shenoy, R. P.

    1989-04-01

    Developments towards the realization of active aperture phased arrays are reviewed. The technology and cost aspects of the power amplifier and phase shifter subsystems are discussed. Consideration is given to research concerning T/R modules, MESFETs, side lobe control, beam steering, optical control techniques, and printed circuit antennas. Methods for configuring the array are examined, focusing on the tile and brick configurations. It is found that there is no technological impediment for introducing active aperture phased arrays.

  3. A photovoltaic array simulator

    NASA Astrophysics Data System (ADS)

    Vachtsevanos, G. J.; Grimbas, E. J.

    A system simulating the output voltage-current characteristics of a photovoltaic array is described. The simulator may be used to test the performance of PV arrays and associated power conditioning equipment necessary for the autonomous or interconnected operation of photovoltaic energy sources. The simulator's main features include simplicity of construction, wide parametric variability and low cost. It is capable of reproducing the output characteristics of commercially available arrays at varying solar irradiation levels with sufficient accuracy. The design ensures the lowest possible power dissipation and minimal thermal drift. It is estimated that the cost of the simulator is an insignificant fraction of the actual array cost in the kilowatt power range.

  4. Interferometric array generation

    NASA Astrophysics Data System (ADS)

    Patra, A. S.; Khare, Alika

    2006-02-01

    We report the formation of square, rectangular as well as hexagonal arrays of small light spots in one single setup using Michelson and Mach-Zehnder interferometers in tandem. The geometry of arrays can be altered easily online, by changing the relative orientations of the mirrors. The arrays could be scanned over large longitudinal distances and could be compressed to give large spot density. The expression for the resultant intensity distribution for the arrays has been worked out and the computed pattern is compared with the experimental data.

  5. Integrated avalanche photodiode arrays

    DOEpatents

    Harmon, Eric S.

    2015-07-07

    The present disclosure includes devices for detecting photons, including avalanche photon detectors, arrays of such detectors, and circuits including such arrays. In some aspects, the detectors and arrays include a virtual beveled edge mesa structure surrounded by resistive material damaged by ion implantation and having side wall profiles that taper inwardly towards the top of the mesa structures, or towards the direction from which the ion implantation occurred. Other aspects are directed to masking and multiple implantation and/or annealing steps. Furthermore, methods for fabricating and using such devices, circuits and arrays are disclosed.

  6. Identification of differently expressed genes with specific SNP Loci for breast cancer by the integration of SNP and gene expression profiling analyses.

    PubMed

    Yuan, Pengfei; Liu, Dechun; Deng, Miao; Liu, Jiangbo; Wang, Jianguang; Zhang, Like; Liu, Qipeng; Zhang, Ting; Chen, Yanbin; Jin, Gaoyuan

    2015-04-01

    This study aims to explore the relationship between gene polymorphism and breast cancer, and to screen DEGs (differentially expressed genes) with SNPs (single nucleotide polymorphisms) related to breast cancer. The SNPs of 17 patients and the preprocessed SNP profiling GSE 32258 (38 cases of normal breast cells) were combined to identify their correlation with breast cancer using chi-square test. The gene expression profiling batch8_9 (38 cases of patients and 8 cases of normal tissue) was preprocessed with limma package, and the DEGs were filtered out. Then fisher's method was applied to integrate DEGs and SNPs associated with breast cancer. With NetBox software, TRED (Transcriptional Regulatory Element Database) and UCSC (University of California Santa Cruz) database, genes-associated network and transcriptional regulatory network were constructed using cytoscape software. Further, GO (Gene Ontology) and KEGG analyses were performed for genes in the networks by using siggenes. In total, 332 DEGs were identified. There were 160 breast cancer-related SNPs related to 106 genes of gene expression profiling (19 were significant DEGs). Finally, 11co-correlated DEGs were selected. In genes-associated network, 9 significant DEGs were correlated to 23 LINKER genes while, in transcriptional regulatory network, E2F1 had regulatory relationships with 7 DEGs including MTUS1, CD44, CCNB1 and CCND2. KRAS with SNP locus of rs1137282 was involved in 35 KEGG pathways. The genes of MTUS1, CD44, CCNB1, CCND2 and KRAS with specific SNP loci may be used as biomarkers for diagnosis of breast cancer. Besides, E2F1 was recognized as the transcription factor of 7 DEGs including MTUS1, CD44, CCNB1 and CCND2.

  7. When whole-genome alignments just won't work: kSNP v2 software for alignment-free SNP discovery and phylogenetics of hundreds of microbial genomes.

    PubMed

    Gardner, Shea N; Hall, Barry G

    2013-01-01

    Effective use of rapid and inexpensive whole genome sequencing for microbes requires fast, memory efficient bioinformatics tools for sequence comparison. The kSNP v2 software finds single nucleotide polymorphisms (SNPs) in whole genome data. kSNP v2 has numerous improvements over kSNP v1 including SNP gene annotation; better scaling for draft genomes available as assembled contigs or raw, unassembled reads; a tool to identify the optimal value of k; distribution of packages of executables for Linux and Mac OS X for ease of installation and user-friendly use; and a detailed User Guide. SNP discovery is based on k-mer analysis, and requires no multiple sequence alignment or the selection of a single reference genome. Most target sets with hundreds of genomes complete in minutes to hours. SNP phylogenies are built by maximum likelihood, parsimony, and distance, based on all SNPs, only core SNPs, or SNPs present in some intermediate user-specified fraction of targets. The SNP-based trees that result are consistent with known taxonomy. kSNP v2 can handle many gigabases of sequence in a single run, and if one or more annotated genomes are included in the target set, SNPs are annotated with protein coding and other information (UTRs, etc.) from Genbank file(s). We demonstrate application of kSNP v2 on sets of viral and bacterial genomes, and discuss in detail analysis of a set of 68 finished E. coli and Shigella genomes and a set of the same genomes to which have been added 47 assemblies and four "raw read" genomes of H104:H4 strains from the recent European E. coli outbreak that resulted in both bloody diarrhea and hemolytic uremic syndrome (HUS), and caused at least 50 deaths.

  8. Experiences with array-based sequence capture; toward clinical applications

    PubMed Central

    Almomani, Rowida; van der Heijden, Jaap; Ariyurek, Yavuz; Lai, Yuching; Bakker, Egbert; van Galen, Michiel; Breuning, Martijn H; den Dunnen, Johan T

    2011-01-01

    Although sequencing of a human genome gradually becomes an option, zooming in on the region of interest remains attractive and cost saving. We performed array-based sequence capture using 385K Roche NimbleGen, Inc. arrays to zoom in on the protein-coding and immediate intron-flanking sequences of 112 genes, potentially involved in mental retardation and congenital malformation. Captured material was sequenced using Illumina technology. A data analysis pipeline was built that detects sequence variants, positions them in relation to the gene, checks for presence in databases (eg, db single-nucleotide polymorphism (SNP)) and predicts the potential consequences at the level of RNA splicing and protein translation. In the samples analyzed, all known variants were reliably detected, including pathogenic variants from control cases and SNPs derived from array experiments. Although overall coverage varied considerably, it was reproducible per region and facilitated the detection of large deletions and duplications (copy number variations), including a partial deletion in the B3GALTL gene from a patient sample. For ultimate diagnostic application, overall results need to be improved. Future arrays should contain probes from both DNA strands, and to obtain a more even coverage, one could add fewer probes from densely and more probes from sparsely covered regions. PMID:21102627

  9. Focal plane array with modular pixel array components for scalability

    SciTech Connect

    Kay, Randolph R; Campbell, David V; Shinde, Subhash L; Rienstra, Jeffrey L; Serkland, Darwin K; Holmes, Michael L

    2014-12-09

    A modular, scalable focal plane array is provided as an array of integrated circuit dice, wherein each die includes a given amount of modular pixel array circuitry. The array of dice effectively multiplies the amount of modular pixel array circuitry to produce a larger pixel array without increasing die size. Desired pixel pitch across the enlarged pixel array is preserved by forming die stacks with each pixel array circuitry die stacked on a separate die that contains the corresponding signal processing circuitry. Techniques for die stack interconnections and die stack placement are implemented to ensure that the desired pixel pitch is preserved across the enlarged pixel array.

  10. Solar array stepping to minimize array excitation

    NASA Technical Reports Server (NTRS)

    Bhat, Mahabaleshwar K. P. (Inventor); Liu, Tung Y. (Inventor); Plescia, Carl T. (Inventor)

    1989-01-01

    Mechanical oscillations of a mechanism containing a stepper motor, such as a solar-array powered spacecraft, are reduced and minimized by the execution of step movements in pairs of steps, the period between steps being equal to one-half of the period of torsional oscillation of the mechanism. Each pair of steps is repeated at needed intervals to maintain desired continuous movement of the portion of elements to be moved, such as the solar array of a spacecraft. In order to account for uncertainty as well as slow change in the period of torsional oscillation, a command unit may be provided for varying the interval between steps in a pair.

  11. Antares prototype 300-kJ, 250-kA Marx generator. Final report

    SciTech Connect

    Riepe, K.B.; Barrone, L.L.; Bickford, K.J.; Livermore, G.H.

    1981-01-01

    A high-energy, low-inductance, low prefire rate, low trigger jitter, high-voltage, pulsed-power supply was needed to drive the gas discharge in the Antares laser power amplifier. This report describes the design and testing of a Marx generator that meets these requirements, the development and testing of a high-capacity spark gap, and the selection of suitable capacitors and resistors.

  12. DISCOVERY OF A ∼250 K BROWN DWARF AT 2 pc FROM THE SUN

    SciTech Connect

    Luhman, K. L.

    2014-05-10

    Through a previous analysis of multi-epoch astrometry from the Wide-field Infrared Survey Explorer (WISE), I identified WISE J085510.83–071442.5 as a new high proper motion object. By combining astrometry from WISE and the Spitzer Space Telescope, I have measured a proper motion of 8.1 ± 0.1'' yr{sup –1} and a parallax of 0.454 ± 0.045'' (2.20{sub −0.20}{sup +0.24} pc) for WISE J085510.83–071442.5, giving it the third highest proper motion and the fourth largest parallax of any known star or brown dwarf. It is also the coldest known brown dwarf based on its absolute magnitude at 4.5 μm and its color in [3.6]-[4.5]. By comparing M {sub 4.5} with the values predicted by theoretical evolutionary models, I estimate an effective temperature of 225-260 K and a mass of 3-10 M {sub Jup} for the age range of 1-10 Gyr that encompasses most nearby stars.

  13. Operating experience with a 250 kW el molten carbonate fuel cell (MCFC) power plant

    NASA Astrophysics Data System (ADS)

    Bischoff, Manfred; Huppmann, Gerhard

    The MTU MCFC program is carried out by a European consortium comprising the German companies MTU Friedrichshafen GmbH, Ruhrgas AG and RWE Energie AG as well as the Danish company Energi E2 S/A. MTU acts as consortium leader. The company shares a license and technology exchange agreement with Fuel Cell Energy Inc., Danbury, CT, USA (formerly Energy Research Corp., ERC). The program was started in 1990 and covers a period of about 10 years. The highlights of this program to date are: Considerable improvements regarding component stability have been demonstrated on laboratory scale. Manufacturing technology has been developed to a point which enables the consortium to fabricate the porous components on a 250 cm 2 scale. Several large area stacks with 5000-7660 cm 2 cell area and a power range of 3-10 kW have been tested at the facilities in Munich (Germany) and Kyndby (Denmark). These stacks have been supplied by FCE. As far as the system design is concerned it was soon realized that conventional systems do not hold the promise for competitive power plants. A system analysis led to the conclusion that a new innovative design approach is required. As a result the "Hot Module" system was developed by the consortium. A Hot Module combines all the components of a MCFC system operating at the similar temperatures and pressures into a common thermally insulated vessel. In August 1997 the consortium started its first full size Hot Module MCFC test plant at the facilities of Ruhrgas AG in Dorsten, Germany. The stack was assembled in Munich using 292 cell packages purchased from FCE. The plant is based on the consortium's unique and proprietary "Hot Module" concept. It operates on pipeline natural gas and was grid connected on 16 August 1997. After a total of 1500 h of operation, the plant was intentionally shut down in a controlled manner in April 1998 for post-test analysis. The Hot Module system concept has demonstrated its functionality. The safety concept has been convincingly proven, though in part unintentionally. The electrical power level of 155 kW (ca. 60% of maximum power) achieved allows validation of the concept with reasonable degree of confidence. Horizontal stack operation—an essential innovation of the Hot Module concept—is feasible. The fuel processing subsystem worked reliably as expected. After initial problems in the inverter control software, the electrical and control subsystem operated to full satisfaction. Stable automatic operation not only under various load conditions, but also in idle mode, hot parking mode, and grid-independent mode has been demonstrated. Together with progress achieved by FCE in the qualification of large direct fuel cell (DFC) stacks the basis was laid for the next test unit of similar design, which will be operated in Bielefeld, Germany. The pre-tests of the stack took place already in July 1999 with good results. Additionally, projects for the test of the DFC Hot Module operating on biogas and other opportunity fuels are under preparation.

  14. Strong effect of SNP rs4988300 of the LRP5 gene on bone phenotype of Caucasian postmenopausal women.

    PubMed

    Horváth, Péter; Balla, Bernadett; Kósa, János P; Tóbiás, Bálint; Szili, Balázs; Kirschner, Gyöngyi; Győri, Gabriella; Kató, Karina; Lakatos, Péter; Takács, István

    2016-01-01

    The purpose of this study was to identify relationships between single nucleotide polymorphisms (SNPs) in the genes of the Wnt pathway and bone mineral density (BMD) of postmenopausal women. We chose this pathway due to its importance in bone metabolism that was underlined in several studies. DNA samples of 932 Hungarian postmenopausal women were studied. First, their BMD values at different sites (spine, total hip) were measured, using a Lunar Prodigy DXA scanner. Thereafter, T-score values and the patients' body mass indices (BMIs) were calculated, while information about the fracture history of the sample population was also collected. We genotyped nine SNPs of the following three genes: LRP5, GPR177, and SP7, using a Sequenom MassARRAY Analyzer 4 instrument. The genomic DNA samples used for genotyping were extracted from the buccal mucosa of the subjects. Statistical analyses were carried out using the SPSS 21 and R package. The results of this analysis showed a significant association between SNP rs4988300 of the LRP5 gene and total hip BMD values. We could not reveal any associations between the markers of GPR177, SP7, and bone phenotypes. We found no effect of these genotypes on fracture risk. We could demonstrate a significant gene-gene interaction between two SNPs of LRP5 (rs4988300 and rs634008, p = 0.009) which was lost after Bonferroni correction. We could firmly demonstrate a significant association between rs4988300 of the LRP5 gene and bone density of the hip on the largest homogeneous postmenopausal study group analyzed to date. Our finding corroborates the relationship between LRP5 genotype and bone phenotype in postmenopausal women, however, the complete mechanism of this relationship requires further investigations.

  15. Detection of selective sweeps in cattle using genome-wide SNP data

    PubMed Central

    2013-01-01

    Background The domestication and subsequent selection by humans to create breeds and biological types of cattle undoubtedly altered the patterning of variation within their genomes. Strong selection to fix advantageous large-effect mutations underlying domesticability, breed characteristics or productivity created selective sweeps in which variation was lost in the chromosomal region flanking the selected allele. Selective sweeps have now been identified in the genomes of many animal species including humans, dogs, horses, and chickens. Here, we attempt to identify and characterise regions of the bovine genome that have been subjected to selective sweeps. Results Two datasets were used for the discovery and validation of selective sweeps via the fixation of alleles at a series of contiguous SNP loci. BovineSNP50 data were used to identify 28 putative sweep regions among 14 diverse cattle breeds. Affymetrix BOS 1 prescreening assay data for five breeds were used to identify 85 regions and validate 5 regions identified using the BovineSNP50 data. Many genes are located within these regions and the lack of sequence data for the analysed breeds precludes the nomination of selected genes or variants and limits the prediction of the selected phenotypes. However, phenotypes that we predict to have historically been under strong selection include horned-polled, coat colour, stature, ear morphology, and behaviour. Conclusions The bias towards common SNPs in the design of the BovineSNP50 assay led to the identification of recent selective sweeps associated with breed formation and common to only a small number of breeds rather than ancient events associated with domestication which could potentially be common to all European taurines. The limited SNP density, or marker resolution, of the BovineSNP50 assay significantly impacted the rate of false discovery of selective sweeps, however, we found sweeps in common between breeds which were confirmed using an ultra

  16. Identification of Mendelian inconsistencies between SNP and pedigree information of sibs

    PubMed Central

    2011-01-01

    Background Using SNP genotypes to apply genomic selection in breeding programs is becoming common practice. Tools to edit and check the quality of genotype data are required. Checking for Mendelian inconsistencies makes it possible to identify animals for which pedigree information and genotype information are not in agreement. Methods Straightforward tests to detect Mendelian inconsistencies exist that count the number of opposing homozygous marker (e.g. SNP) genotypes between parent and offspring (PAR-OFF). Here, we develop two tests to identify Mendelian inconsistencies between sibs. The first test counts SNP with opposing homozygous genotypes between sib pairs (SIBCOUNT). The second test compares pedigree and SNP-based relationships (SIBREL). All tests iteratively remove animals based on decreasing numbers of inconsistent parents and offspring or sibs. The PAR-OFF test, followed by either SIB test, was applied to a dataset comprising 2,078 genotyped cows and 211 genotyped sires. Theoretical expectations for distributions of test statistics of all three tests were calculated and compared to empirically derived values. Type I and II error rates were calculated after applying the tests to the edited data, while Mendelian inconsistencies were introduced by permuting pedigree against genotype data for various proportions of animals. Results Both SIB tests identified animal pairs for which pedigree and genomic relationships could be considered as inconsistent by visual inspection of a scatter plot of pairwise pedigree and SNP-based relationships. After removal of 235 animals with the PAR-OFF test, SIBCOUNT (SIBREL) identified 18 (22) additional inconsistent animals. Seventeen animals were identified by both methods. The numbers of incorrectly deleted animals (Type I error), were equally low for both methods, while the numbers of incorrectly non-deleted animals (Type II error), were considerably higher for SIBREL compared to SIBCOUNT. Conclusions Tests to remove

  17. Development and characterization of a microheater array device for real-time DNA mutation detection

    NASA Astrophysics Data System (ADS)

    Williams, Layne; Okandan, Murat; Chagovetz, Alex; Blair, Steve

    2008-02-01

    DNA analysis, specifically single nucleotide polymorphism (SNP) detection, is becoming increasingly important in rapid diagnostics and disease detection. Temperature is often controlled to help speed reaction rates and perform melting of hybridized oligonucleotides. The difference in melting temperatures, Tm, between wild-type and SNP sequences, respectively, to a given probe oligonucleotide, is indicative of the specificity of the reaction. We have characterized Tm's in solution and on a solid substrate of three sequences from known mutations associated with Cystic Fibrosis. Taking advantage of Tm differences, a microheater array device was designed to enable individual temperature control of up to 18 specific hybridization events. The device was fabricated at Sandia National Laboratories using surface micromachining techniques. The microheaters have been characterized using an IR camera at Sandia and show individual temperature control with minimal thermal cross talk. Development of the device as a real-time DNA detection platform, including surface chemistry and associated microfluidics, is described.

  18. Development and characterization of a microheater array device for real-time DNA mutation detection

    NASA Astrophysics Data System (ADS)

    Williams, Layne; Okandan, Murat; Chagovetz, Alex; Blair, Steve

    2008-04-01

    DNA analysis, specifically single nucleotide polymorphism (SNP) detection, is becoming increasingly important in rapid diagnostics and disease detection. Temperature is often controlled to help speed reaction rates and perform melting of hybridized oligonucleotides. The difference in melting temperatures, Tm, between wild-type and SNP sequences, respectively, to a given probe oligonucleotide, is indicative of the specificity of the reaction. We have characterized Tm's in solution and on a solid substrate of three sequences from known mutations associated with Cystic Fibrosis. Taking advantage of Tm differences, a microheater array device was designed to enable individual temperature control of up to 18 specific hybridization events. The device was fabricated at Sandia National Laboratories using surface micromachining techniques. The microheaters have been characterized using an IR camera at Sandia and show individual temperature control with minimal thermal cross talk. Development of the device as a real-time DNA detection platform, including surface chemistry and associated microfluidics, is described.

  19. ISS Solar Array Management

    NASA Technical Reports Server (NTRS)

    Williams, James P.; Martin, Keith D.; Thomas, Justin R.; Caro, Samuel

    2010-01-01

    The International Space Station (ISS) Solar Array Management (SAM) software toolset provides the capabilities necessary to operate a spacecraft with complex solar array constraints. It monitors spacecraft telemetry and provides interpretations of solar array constraint data in an intuitive manner. The toolset provides extensive situational awareness to ensure mission success by analyzing power generation needs, array motion constraints, and structural loading situations. The software suite consists of several components including samCS (constraint set selector), samShadyTimers (array shadowing timers), samWin (visualization GUI), samLock (array motion constraint computation), and samJet (attitude control system configuration selector). It provides high availability and uptime for extended and continuous mission support. It is able to support two-degrees-of-freedom (DOF) array positioning and supports up to ten simultaneous constraints with intuitive 1D and 2D decision support visualizations of constraint data. Display synchronization is enabled across a networked control center and multiple methods for constraint data interpolation are supported. Use of this software toolset increases flight safety, reduces mission support effort, optimizes solar array operation for achieving mission goals, and has run for weeks at a time without issues. The SAM toolset is currently used in ISS real-time mission operations.

  20. Array for detecting microbes

    DOEpatents

    Andersen, Gary L.; DeSantis, Todd D.

    2014-07-08

    The present embodiments relate to an array system for detecting and identifying biomolecules and organisms. More specifically, the present embodiments relate to an array system comprising a microarray configured to simultaneously detect a plurality of organisms in a sample at a high confidence level.

  1. Solar array deployment mechanism

    NASA Technical Reports Server (NTRS)

    Calassa, Mark C.; Kackley, Russell

    1995-01-01

    This paper describes a Solar Array Deployment Mechanism (SADM) used to deploy a rigid solar array panel on a commercial spacecraft. The application required a deployment mechanism design that was not only lightweight, but also could be produced and installed at the lowest possible cost. This paper covers design, test, and analysis of a mechanism that meets these requirements.

  2. Ecotoxicological assessment of PAHs and their dead-end metabolites after degradation by Mycobacterium sp. strain SNP11.

    PubMed

    Pagnout, Christophe; Rast, Claudine; Veber, Anne-Marie; Poupin, Pascal; Férard, Jean-François

    2006-10-01

    Mycobacterium sp. SNP11 has a high PAH biodegradation potential. In this paper, the toxicity of pyrene, fluoranthene, phenanthrene, and their dead-end metabolites, accumulated in the media after biodegradation by Mycobacterium sp. SNP11, were evaluated by a screening battery of acute, chronic, and genotoxic tests. According to the bioassays, performed on bacteria (Vibrio fischeri, Salmonella typhimurium strains TA1535/pSK1002, TA97a, TA98, TA100), algae (Pseudokirchneriella subcapitata), and crustaceans (Daphnia magna, Ceriodaphnia dubia), total disappearance or a very significant reduction of the (geno)toxic potential was observed after PAH degradation by Mycobacterium sp. SNP11.

  3. High density pixel array

    NASA Technical Reports Server (NTRS)

    Wiener-Avnear, Eliezer (Inventor); McFall, James Earl (Inventor)

    2004-01-01

    A pixel array device is fabricated by a laser micro-milling method under strict process control conditions. The device has an array of pixels bonded together with an adhesive filling the grooves between adjacent pixels. The array is fabricated by moving a substrate relative to a laser beam of predetermined intensity at a controlled, constant velocity along a predetermined path defining a set of grooves between adjacent pixels so that a predetermined laser flux per unit area is applied to the material, and repeating the movement for a plurality of passes of the laser beam until the grooves are ablated to a desired depth. The substrate is of an ultrasonic transducer material in one example for fabrication of a 2D ultrasonic phase array transducer. A substrate of phosphor material is used to fabricate an X-ray focal plane array detector.

  4. Micromachined electrode array

    DOEpatents

    Okandan, Murat; Wessendorf, Kurt O.

    2007-12-11

    An electrode array is disclosed which has applications for neural stimulation and sensing. The electrode array, in certain embodiments, can include a plurality of electrodes each of which is flexibly attached to a common substrate using a plurality of springs to allow the electrodes to move independently. In other embodiments of the electrode array, the electrodes can be fixed to the substrate. The electrode array can be formed from a combination of bulk and surface micromachining, and can include electrode tips having an electroplated metal (e.g. platinum, iridium, gold or titanium) or a metal oxide (e.g. iridium oxide) for biocompatibility. The electrode array can be used to form a part of a neural prosthesis, and is particularly well adapted for use in an implantable retinal prosthesis.

  5. Photovoltaic array loss mechanisms

    NASA Astrophysics Data System (ADS)

    Gonzalez, Charles

    1986-10-01

    Loss mechanisms which come into play when solar cell modules are mounted in arrays are identified. Losses can occur either from a reduction in the array electrical performance or with nonoptimal extraction of power from the array. Electrical performance degradation is caused by electrical mismatch, transmission losses from cell surface soiling and steep angle of reflectance, and electrical losses from field wiring resistance and the voltage drop across blocking diodes. The second type of loss, concerned with the operating points of the array, can involve nonoptimal load impedance and limiting the operating envelope of the array to specific ranges of voltage and current. Each of the loss mechanisms are discussed and average energy losses expected from soiling, steep reflectance angles and circuit losses are calculated.

  6. Multibeam Phased Array Antennas

    NASA Technical Reports Server (NTRS)

    Popovic, Zoya; Romisch, Stefania; Rondineau, Sebastien

    2004-01-01

    In this study, a new architecture for Ka-band multi-beam arrays was developed and demonstrated experimentally. The goal of the investigation was to demonstrate a new architecture that has the potential of reducing the cost as compared to standard expensive phased array technology. The goals of this specific part of the project, as stated in the yearly statement of work in the original proposal are: 1. Investigate bounds on performance of multi-beam lens arrays in terms of beamwidths, volume (size), isolation between beams, number of simultaneous beams, etc. 2. Design a small-scale array to demonstrate the principle. The array will be designed for operation around 3OGHz (Ka-band), with two 10-degree beamwidth beams. 3. Investigate most appropriate way to accomplish fine-tuning of the beam pointing within 5 degrees around the main beam pointing angle.

  7. Microfabricated ion trap array

    DOEpatents

    Blain, Matthew G.; Fleming, James G.

    2006-12-26

    A microfabricated ion trap array, comprising a plurality of ion traps having an inner radius of order one micron, can be fabricated using surface micromachining techniques and materials known to the integrated circuits manufacturing and microelectromechanical systems industries. Micromachining methods enable batch fabrication, reduced manufacturing costs, dimensional and positional precision, and monolithic integration of massive arrays of ion traps with microscale ion generation and detection devices. Massive arraying enables the microscale ion traps to retain the resolution, sensitivity, and mass range advantages necessary for high chemical selectivity. The reduced electrode voltage enables integration of the microfabricated ion trap array with on-chip circuit-based rf operation and detection electronics (i.e., cell phone electronics). Therefore, the full performance advantages of the microfabricated ion trap array can be realized in truly field portable, handheld microanalysis systems.

  8. A genome-wide SNP scan reveals novel loci for egg production and quality traits in white leghorn and brown-egg dwarf layers.

    PubMed

    Liu, Wenbo; Li, Dongfeng; Liu, Jianfeng; Chen, Sirui; Qu, Lujiang; Zheng, Jiangxia; Xu, Guiyun; Yang, Ning

    2011-01-01

    Availability of the complete genome sequence as well as high-density SNP genotyping platforms allows genome-wide association studies (GWAS) in chickens. A high-density SNP array containing 57,636 markers was employed herein to identify associated variants underlying egg production and quality traits within two lines of chickens, i.e., White Leghorn and brown-egg dwarf layers. For each individual, age at first egg (AFE), first egg weight (FEW), and number of eggs (EN) from 21 to 56 weeks of age were recorded, and egg quality traits including egg weight (EW), eggshell weight (ESW), yolk weight (YW), eggshell thickness (EST), eggshell strength (ESS), albumen height(AH) and Haugh unit(HU) were measured at 40 and 60 weeks of age. A total of 385 White Leghorn females and 361 brown-egg dwarf dams were selected to be genotyped. The genome-wide scan revealed 8 SNPs showing genome-wise significant (P<1.51E-06, Bonferroni correction) association with egg production and quality traits under the Fisher's combined probability method. Some significant SNPs are located in known genes including GRB14 and GALNT1 that can impact development and function of ovary, but more are located in genes with unclear functions in layers, and need to be studied further. Many chromosome-wise significant SNPs were also detected in this study and some of them are located in previously reported QTL regions. Most of loci detected in this study are novel and the follow-up replication studies may be needed to further confirm the functional significance for these newly identified SNPs.

  9. Spatial Structure and Climatic Adaptation in African Maize Revealed by Surveying SNP Diversity in Relation to Global Breeding and Landrace Panels

    PubMed Central

    Westengen, Ola T.; Berg, Paul R.; Kent, Matthew P.; Brysting, Anne K.

    2012-01-01

    Background Climate change threatens maize productivity in sub-Saharan Africa. To ensure food security, access to locally adapted genetic resources and varieties is an important adaptation measure. Most of the maize grown in Africa is a genetic mix of varieties introduced at different historic times following the birth of the trans-Atlantic economy, and knowledge about geographic structure and local adaptations is limited. Methodology A panel of 48 accessions of maize representing various introduction routes and sources of historic and recent germplasm introductions in Africa was genotyped with the MaizeSNP50 array. Spatial genetic structure and genetic relationships in the African panel were analysed separately and in the context of a panel of 265 inbred lines representing global breeding material (based on 26,900 SNPs) and a panel of 1127 landraces from the Americas (270 SNPs). Environmental association analysis was used to detect SNPs associated with three climatic variables based on the full 43,963 SNP dataset. Conclusions The genetic structure is consistent between subsets of the data and the markers are well suited for resolving relationships and admixture among the accessions. The African accessions are structured in three clusters reflecting historical and current patterns of gene flow from the New World and within Africa. The Sahelian cluster reflects original introductions of Meso-American landraces via Europe and a modern introduction of temperate breeding material. The Western cluster reflects introduction of Coastal Brazilian landraces, as well as a Northeast-West spread of maize through Arabic trade routes across the continent. The Eastern cluster most strongly reflects gene flow from modern introduced tropical varieties. Controlling for population history in a linear model, we identify 79 SNPs associated with maximum temperature during the growing season. The associations located in genes of known importance for abiotic stress tolerance are

  10. Genome-Wide SNP Identification and Characterization in Two Soybean Cultivars with Contrasting Mungbean Yellow Mosaic India Virus Disease Resistance Traits

    PubMed Central

    Muthamilarasan, Mehanathan; Mukherjee, Minakshi; Khan, Yusuf; Rathi, Pushpendra; Prasad, Manoj

    2015-01-01

    Mungbean yellow mosaic India virus (MYMIV) is a bipartite Geminivirus, which causes severe yield loss in soybean (Glycine max). Considering this, the present study was conducted to develop large-scale genome-wide single nucleotide polymorphism (SNP) markers and identify potential markers linked with known disease resistance loci for their effective use in genomics-assisted breeding to impart durable MYMIV tolerance. The whole-genome re-sequencing of MYMIV resistant cultivar ‘UPSM-534’ and susceptible Indian cultivar ‘JS-335’ was performed to identify high-quality SNPs and InDels (insertion and deletions). Approximately 234 and 255 million of 100-bp paired-end reads were generated from UPSM-534 and JS-335, respectively, which provided ~98% coverage of reference soybean genome. A total of 3083987 SNPs (1559556 in UPSM-534 and 1524431 in JS-335) and 562858 InDels (281958 in UPSM-534 and 280900 in JS-335) were identified. Of these, 1514 SNPs were found to be present in 564 candidate disease resistance genes. Among these, 829 non-synonymous and 671 synonymous SNPs were detected in 266 and 286 defence-related genes, respectively. Noteworthy, a non-synonymous SNP (in chromosome 18, named 18-1861613) at the 149th base-pair of LEUCINE-RICH REPEAT RECEPTOR-LIKE PROTEIN KINASE gene responsible for a G/C transversion [proline (CCC) to alanine(GCC)] was identified and validated in a set of 12 soybean cultivars. Taken together, the present study generated a large-scale genomic resource such as, SNPs and InDels at a genome-wide scale that will facilitate the dissection of various complex traits through construction of high-density linkage maps and fine mapping. In the present scenario, these markers can be effectively used to design high-density SNP arrays for their large-scale validation and high-throughput genotyping in diverse natural and mapping populations, which could accelerate genomics-assisted MYMIV disease resistance breeding in soybean. PMID:25875830

  11. Genome-wide SNP identification and characterization in two soybean cultivars with contrasting Mungbean Yellow Mosaic India Virus disease resistance traits.

    PubMed

    Yadav, Chandra Bhan; Bhareti, Priyanka; Muthamilarasan, Mehanathan; Mukherjee, Minakshi; Khan, Yusuf; Rathi, Pushpendra; Prasad, Manoj

    2015-01-01

    Mungbean yellow mosaic India virus (MYMIV) is a bipartite Geminivirus, which causes severe yield loss in soybean (Glycine max). Considering this, the present study was conducted to develop large-scale genome-wide single nucleotide polymorphism (SNP) markers and identify potential markers linked with known disease resistance loci for their effective use in genomics-assisted breeding to impart durable MYMIV tolerance. The whole-genome re-sequencing of MYMIV resistant cultivar 'UPSM-534' and susceptible Indian cultivar 'JS-335' was performed to identify high-quality SNPs and InDels (insertion and deletions). Approximately 234 and 255 million of 100-bp paired-end reads were generated from UPSM-534 and JS-335, respectively, which provided ~98% coverage of reference soybean genome. A total of 3083987 SNPs (1559556 in UPSM-534 and 1524431 in JS-335) and 562858 InDels (281958 in UPSM-534 and 280900 in JS-335) were identified. Of these, 1514 SNPs were found to be present in 564 candidate disease resistance genes. Among these, 829 non-synonymous and 671 synonymous SNPs were detected in 266 and 286 defence-related genes, respectively. Noteworthy, a non-synonymous SNP (in chromosome 18, named 18-1861613) at the 149th base-pair of LEUCINE-RICH REPEAT RECEPTOR-LIKE PROTEIN KINASE gene responsible for a G/C transversion [proline (CCC) to alanine(GCC)] was identified and validated in a set of 12 soybean cultivars. Taken together, the present study generated a large-scale genomic resource such as, SNPs and InDels at a genome-wide scale that will facilitate the dissection of various complex traits through construction of high-density linkage maps and fine mapping. In the present scenario, these markers can be effectively used to design high-density SNP arrays for their large-scale validation and high-throughput genotyping in diverse natural and mapping populations, which could accelerate genomics-assisted MYMIV disease resistance breeding in soybean.

  12. Using false discovery rates to benchmark SNP-callers in next-generation sequencing projects.

    PubMed

    Farrer, Rhys A; Henk, Daniel A; MacLean, Dan; Studholme, David J; Fisher, Matthew C

    2013-01-01

    Sequence alignments form the basis for many comparative and population genomic studies. Alignment tools provide a range of accuracies dependent on the divergence between the sequences and the alignment methods. Despite widespread use, there is no standard method for assessing the accuracy of a dataset and alignment strategy after resequencing. We present a framework and tool for determining the overall accuracies of an input read dataset, alignment and SNP-calling method providing an isolate in that dataset has a corresponding, or closely related reference sequence available. In addition to this tool for comparing False Discovery Rates (FDR), we include a method for determining homozygous and heterozygous positions from an alignment using binomial probabilities for an expected error rate. We benchmark this method against other SNP callers using our FDR method with three fungal genomes, finding that it was able achieve a high level of accuracy. These tools are available at http://cfdr.sourceforge.net/.

  13. Using False Discovery Rates to Benchmark SNP-callers in next-generation sequencing projects

    PubMed Central

    Farrer, Rhys A.; Henk, Daniel A.; MacLean, Dan; Studholme, David J.; Fisher, Matthew C.

    2013-01-01

    Sequence alignments form the basis for many comparative and population genomic studies. Alignment tools provide a range of accuracies dependent on the divergence between the sequences and the alignment methods. Despite widespread use, there is no standard method for assessing the accuracy of a dataset and alignment strategy after resequencing. We present a framework and tool for determining the overall accuracies of an input read dataset, alignment and SNP-calling method providing an isolate in that dataset has a corresponding, or closely related reference sequence available. In addition to this tool for comparing False Discovery Rates (FDR), we include a method for determining homozygous and heterozygous positions from an alignment using binomial probabilities for an expected error rate. We benchmark this method against other SNP callers using our FDR method with three fungal genomes, finding that it was able achieve a high level of accuracy. These tools are available at http://cfdr.sourceforge.net/. PMID:23518929

  14. Genome-wide SNP analysis of the Systemic Capillary Leak Syndrome (Clarkson disease)

    PubMed Central

    Xie, Zhihui; Nagarajan, Vijayaraj; Sturdevant, Daniel E; Iwaki, Shoko; Chan, Eunice; Wisch, Laura; Young, Michael; Nelson, Celeste M; Porcella, Stephen F; Druey, Kirk M

    2013-01-01

    The Systemic Capillary Leak Syndrome (SCLS) is an extremely rare, orphan disease that resembles, and is frequently erroneously diagnosed as, systemic anaphylaxis. The disorder is characterized by repeated, transient, and seemingly unprovoked episodes of hypotensive shock and peripheral edema due to transient endothelial hyperpermeability. SCLS is often accompanied by a monoclonal gammopathy of unknown significance (MGUS). Using Affymetrix Single Nucleotide Polymorphism (SNP) microarrays, we performed the first genome-wide SNP analysis of SCLS in a cohort of 12 disease subjects and 18 controls. Exome capture sequencing was performed on genomic DNA from nine of these patients as validation for the SNP-chip discoveries and de novo data generation. We identified candidate susceptibility loci for SCLS, which included a region flanking CAV3 (3p25.3) as well as SNP clusters in PON1 (7q21.3), PSORS1C1 (6p21.3), and CHCHD3 (7q33). Among the most highly ranked discoveries were gene-associated SNPs in the uncharacterized LOC100130480 gene (rs6417039, rs2004296). Top case-associated SNPs were observed in BTRC (rs12355803, 3rs4436485), ARHGEF18 (rs11668246), CDH13 (rs4782779), and EDG2 (rs12552348), which encode proteins with known or suspected roles in B cell function and/or vascular integrity. 61 SNPs that were significantly associated with SCLS by microarray analysis were also detected and validated by exome deep sequencing. Functional annotation of highly ranked SNPs revealed enrichment of cell projections, cell junctions and adhesion, and molecules containing pleckstrin homology, Ras/Rho regulatory, and immunoglobulin Ig-like C2/fibronectin type III domains, all of which involve mechanistic functions that correlate with the SCLS phenotype. These results highlight SNPs with potential relevance to SCLS. PMID:24808988

  15. Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT

    PubMed Central

    Neigenfind, Jost; Gyetvai, Gabor; Basekow, Rico; Diehl, Svenja; Achenbach, Ute; Gebhardt, Christiane; Selbig, Joachim; Kersten, Birgit

    2008-01-01

    Background Haplotype inference based on unphased SNP markers is an important task in population genetics. Although there are different approaches to the inference of haplotypes in diploid species, the existing software is not suitable for inferring haplotypes from unphased SNP data in polyploid species, such as the cultivated potato (Solanum tuberosum). Potato species are tetraploid and highly heterozygous. Results Here we present the software SATlotyper which is able to handle polyploid and polyallelic data. SATlo-typer uses the Boolean satisfiability problem to formulate Haplotype Inference by Pure Parsimony. The software excludes existing haplotype inferences, thus allowing for calculation of alternative inferences. As it is not known which of the multiple haplotype inferences are best supported by the given unphased data set, we use a bootstrapping procedure that allows for scoring of alternative inferences. Finally, by means of the bootstrapping scores, it is possible to optimise the phased genotypes belonging to a given haplotype inference. The program is evaluated with simulated and experimental SNP data generated for heterozygous tetraploid populations of potato. We show that, instead of taking the first haplotype inference reported by the program, we can significantly improve the quality of the final result by applying additional methods that include scoring of the alternative haplotype inferences and genotype optimisation. For a sub-population of nineteen individuals, the predicted results computed by SATlotyper were directly compared with results obtained by experimental haplotype inference via sequencing of cloned amplicons. Prediction and experiment gave similar results regarding the inferred haplotypes and phased genotypes. Conclusion Our results suggest that Haplotype Inference by Pure Parsimony can be solved efficiently by the SAT approach, even for data sets of unphased SNP from heterozygous polyploids. SATlotyper is freeware and is distributed as

  16. Evaluation of TP53 Pro72Arg and MDM2 SNP285-SNP309 polymorphisms in an Italian cohort of LFS suggestive patients lacking identifiable TP53 germline mutations.

    PubMed

    Ponti, Francesca; Corsini, Serena; Gnoli, Maria; Pedrini, Elena; Mordenti, Marina; Sangiorgi, Luca

    2016-10-01

    Li-Fraumeni syndrome (LFS) is a rare genetic cancer predisposition disease, partly determined by the presence of a TP53 germline mutation; lacking thereof, in presence of a typical LFS phenotype, defines a wide group of 'LFS Suggestive' patients. Alternative LFS susceptibility genes have been investigated without promising results, thus suggesting other genetic determinants involvement in cancer predisposition. Hence, this study explores the single and combined effects of cancer risk, age of onset and cancer type of three single nucleotide polymorphisms (SNPs)-TP53 Pro72Arg, MDM2 SNP285 and SNP309-already described as modifiers on TP53 mutation carriers but not properly investigated in LFS Suggestive patients. This case-control study examines 34 Italian LFS Suggestive lacking of germline TP53 mutations and 95 tumour-free subjects. A significant prevalence of homozygous MDM2 SNP309 G in the LFS Suggestive group (p < 0.0005) confirms its contribute to cancer susceptibility, also highlighted in LFS TP53 positive families. Conversely its anticipating role on tumour onset has not been confirmed, as in our results it was associated with the SNP309 T allele. A strong combined outcome with a 'dosage' effect has also been reported for TP53 P72 and MDM2 SNP309 G allele on cancer susceptibility (p < 0.0005). Whereas the MDM2 SNP285 C allele neutralizing effect on MDM2 SNP309 G variant is not evident in our population. Although it needs further evaluations, obtained results strengthen the role of MDM2 SNP309 as a genetic factor in hereditary predisposition to cancer, so improving LFS Suggestive patients management.

  17. Developing an Inflatable Solar Array

    NASA Technical Reports Server (NTRS)

    Malone, Patrick K.; Jankowski, Francis J.; Williams, Geoffery T.; Vendura, George J., Jr.

    1992-01-01

    Viewgraphs describing the development of an inflatable solar array as part of the Inflatable Torus Solar Array Technology (ITSAT) program are presented. Program phases, overall and subsystem designs, and array deployment are addressed.

  18. High-throughput SNP-genotyping analysis of the relationships among Ponto-Caspian sturgeon species

    PubMed Central

    Rastorguev, Sergey M; Nedoluzhko, Artem V; Mazur, Alexander M; Gruzdeva, Natalia M; Volkov, Alexander A; Barmintseva, Anna E; Mugue, Nikolai S; Prokhortchouk, Egor B

    2013-01-01

    Abstract Legally certified sturgeon fisheries require population protection and conservation methods, including DNA tests to identify the source of valuable sturgeon roe. However, the available genetic data are insufficient to distinguish between different sturgeon populations, and are even unable to distinguish between some species. We performed high-throughput single-nucleotide polymorphism (SNP)-genotyping analysis on different populations of Russian (Acipenser gueldenstaedtii), Persian (A. persicus), and Siberian (A. baerii) sturgeon species from the Caspian Sea region (Volga and Ural Rivers), the Azov Sea, and two Siberian rivers. We found that Russian sturgeons from the Volga and Ural Rivers were essentially indistinguishable, but they differed from Russian sturgeons in the Azov Sea, and from Persian and Siberian sturgeons. We identified eight SNPs that were sufficient to distinguish these sturgeon populations with 80% confidence, and allowed the development of markers to distinguish sturgeon species. Finally, on the basis of our SNP data, we propose that the A. baerii-like mitochondrial DNA found in some Russian sturgeons from the Caspian Sea arose via an introgression event during the Pleistocene glaciation. In the present study, the high-throughput genotyping analysis of several sturgeon populations was performed. SNP markers for species identification were defined. The possible explanation of the baerii-like mitotype presence in some Russian sturgeons in the Caspian Sea was suggested. PMID:24567827

  19. Demographic Trends in Korean Native Cattle Explained Using Bovine SNP50 Beadchip

    PubMed Central

    Sharma, Aditi; Lim, Dajeong; Chai, Han-Ha; Choi, Bong-Hwan; Cho, Yongmin

    2016-01-01

    Linkage disequilibrium (LD) is the non-random association between the loci and it could give us a preliminary insight into the genetic history of the population. In the present study LD patterns and effective population size (Ne) of three Korean cattle breeds along with Chinese, Japanese and Mongolian cattle were compared using the bovine Illumina SNP50 panel. The effective population size (Ne) is the number of breeding individuals in a population and is particularly important as it determines the rate at which genetic variation is lost. The genotype data in our study comprised a total of 129 samples, varying from 4 to 39 samples. After quality control there were ~29,000 single nucleotide polymorphisms (SNPs) for which r2 value was calculated. Average distance between SNP pairs was 1.14 Mb across all breeds. Average r2 between adjacent SNP pairs ranged between was 0.1 for Yanbian to 0.3 for Qinchuan. Effective population size of the breeds based on r2 varied from 16 in Hainan to 226 in Yanbian. Amongst the Korean native breeds effective population size of Brindle Hanwoo was the least with Ne = 59 and Brown Hanwoo was the highest with Ne = 83. The effective population size of the Korean cattle breeds has been decreasing alarmingly over the past generations. We suggest appropriate measures to be taken to prevent these local breeds in their native tracts. PMID:28154516

  20. SNP-based markers for discriminating olive (Olea europaea L.) cultivars.

    PubMed

    Reale, S; Doveri, S; Díaz, A; Angiolillo, A; Lucentini, L; Pilla, F; Martín, A; Donini, P; Lee, D

    2006-09-01

    A set of 11 polymorphic markers (1 cleaved amplified polymorphic sequence (CAPS), 2 sequence-characterized amplified regions (SCARs), and 8 single-nucleotide polymorphism (SNP)-derived markers) was obtained for olive cultivar identification by comparing DNA sequences from different accessions. Marker development was more efficient, using sequences from the database rather than cloning arbitrary DNA fragments. Analyses of the sequences of 3 genes from 11 diverse cultivars revealed an SNP frequency of 1 per 190 base pairs in exons and 1 per 149 base pairs in introns. Most mutations were silent or had little perceptible effect on the polypeptide encoded. The higher incidence of transversions (55%) suggests that methylation is not the major driving force for DNA base changes. Evidence of linkage disequilibrium in 2 pairs of markers has been detected. The set of predominantly SNP-based markers was used to genotype 65 olive samples obtained from Europe and Australia, and was able clearly to discriminate 77% of the cultivars. Samples, putatively of the same cultivar but derived from different sources, were revealed as identical, demonstrating the utility of these markers as tools for resolving nomenclature issues. Genotyping data were used for constructing a dendrogram by UPGMA cluster analysis using the simple matching similarity coefficient. Relationships between cultivars are discussed in relation to the route of olive's spread.

  1. High-throughput SNP-genotyping analysis of the relationships among Ponto-Caspian sturgeon species.

    PubMed

    Rastorguev, Sergey M; Nedoluzhko, Artem V; Mazur, Alexander M; Gruzdeva, Natalia M; Volkov, Alexander A; Barmintseva, Anna E; Mugue, Nikolai S; Prokhortchouk, Egor B

    2013-08-01

    Legally certified sturgeon fisheries require population protection and conservation methods, including DNA tests to identify the source of valuable sturgeon roe. However, the available genetic data are insufficient to distinguish between different sturgeon populations, and are even unable to distinguish between some species. We performed high-throughput single-nucleotide polymorphism (SNP)-genotyping analysis on different populations of Russian (Acipenser gueldenstaedtii), Persian (A. persicus), and Siberian (A. baerii) sturgeon species from the Caspian Sea region (Volga and Ural Rivers), the Azov Sea, and two Siberian rivers. We found that Russian sturgeons from the Volga and Ural Rivers were essentially indistinguishable, but they differed from Russian sturgeons in the Azov Sea, and from Persian and Siberian sturgeons. We identified eight SNPs that were sufficient to distinguish these sturgeon populations with 80% confidence, and allowed the development of markers to distinguish sturgeon species. Finally, on the basis of our SNP data, we propose that the A. baerii-like mitochondrial DNA found in some Russian sturgeons from the Caspian Sea arose via an introgression event during the Pleistocene glaciation. In the present study, the high-throughput genotyping analysis of several sturgeon populations was performed. SNP markers for species identification were defined. The possible explanation of the baerii-like mitotype presence in some Russian sturgeons in the Caspian Sea was suggested.

  2. SNP Marker Discovery in Pima Cotton (Gossypium barbadense L.) Leaf Transcriptomes

    PubMed Central

    Kottapalli, Pratibha; Ulloa, Mauricio; Kottapalli, Kameswara Rao; Payton, Paxton; Burke, John

    2016-01-01

    The objective of this study was to explore the known narrow genetic diversity and discover single-nucleotide polymorphic (SNP) markers for marker-assisted breeding within Pima cotton (Gossypium barbadense L.) leaf transcriptomes. cDNA from 25-day plants of three diverse cotton genotypes [Pima S6 (PS6), Pima S7 (PS7), and Pima 3-79 (P3-79)] was sequenced on Illumina sequencing platform. A total of 28.9 million reads (average read length of 138 bp) were generated by sequencing cDNA libraries of these three genotypes. The de novo assembly of reads generated transcriptome sets of 26,369 contigs for PS6, 25,870 contigs for PS7, and 24,796 contigs for P3-79. A Pima leaf reference transcriptome was generated consisting of 42,695 contigs. More than 10,000 single-nucleotide polymorphisms (SNPs) were identified between the genotypes, with 100% SNP frequency and a minimum of eight sequencing reads. The most prevalent SNP substitutions were C—T and A—G in these cotton genotypes. The putative SNPs identified can be utilized for characterizing genetic diversity, genotyping, and eventually in Pima cotton breeding through marker-assisted selection. PMID:27721653

  3. Triallelic SNP-mediated genotyping of regenerated protoplasts of the heterokaryotic fungus Rhizoctonia solani.

    PubMed

    Thomas, Elizabeth; Pakala, Suman; Fedorova, Natalie D; Nierman, William C; Cubeta, Marc A

    2012-04-15

    The aneuploid and heterokaryotic nuclear condition of the soil fungus Rhizoctonia solani have provided challenges in obtaining a complete genome sequence. To better aid in the assembly and annotation process, a protoplast and single nucleotide polymorphism (SNP)-based method was developed to identify regenerated protoplasts with a reduced nuclear genome. Protocol optimization experiments showed that enzymatic digestion of mycelium from a 24 h culture of R. solani increased the proportion of protoplasts with a diameter of ≤7.5 μm and 1-4 nuclei. To determine whether strains regenerated from protoplasts with a reduced number of nuclei were genetically different from the parental strain, triallelic SNPs identified from variance records of the genomic DNA sequence reads of R. solani were used in PCR-based genotyping assays. Results from 16 of the 24 SNP-based PCR assays provided evidence that one of the three alleles was missing in the 11 regenerated protoplast strains, suggesting that these strains represent a reduced genomic complement of the parental strain. The protoplast and triallelic SNP-based method used in this study may be useful in strain development and analysis of other basidiomycete fungi with complex nuclear genomes.

  4. SNP typing reveals similarity in Mycobacterium tuberculosis genetic diversity between Portugal and Northeast Brazil.

    PubMed

    Lopes, Joao S; Marques, Isabel; Soares, Patricia; Nebenzahl-Guimaraes, Hanna; Costa, Joao; Miranda, Anabela; Duarte, Raquel; Alves, Adriana; Macedo, Rita; Duarte, Tonya A; Barbosa, Theolis; Oliveira, Martha; Nery, Joilda S; Boechat, Neio; Pereira, Susan M; Barreto, Mauricio L; Pereira-Leal, Jose; Gomes, Maria Gabriela Miranda; Penha-Goncalves, Carlos

    2013-08-01

    Human tuberculosis is an infectious disease caused by bacteria from the Mycobacterium tuberculosis complex (MTBC). Although spoligotyping and MIRU-VNTR are standard methodologies in MTBC genetic epidemiology, recent studies suggest that Single Nucleotide Polymorphisms (SNP) are advantageous in phylogenetics and strain group/lineages identification. In this work we use a set of 79 SNPs to characterize 1987 MTBC isolates from Portugal and 141 from Northeast Brazil. All Brazilian samples were further characterized using spolygotyping. Phylogenetic analysis against a reference set revealed that about 95% of the isolates in both populations are singly attributed to bacterial lineage 4. Within this lineage, the most frequent strain groups in both Portugal and Brazil are LAM, followed by Haarlem and X. Contrary to these groups, strain group T showed a very different prevalence between Portugal (10%) and Brazil (1.5%). Spoligotype identification shows about 10% of mis-matches compared to the use of SNPs and a little more than 1% of strains unidentifiability. The mis-matches are observed in the most represented groups of our sample set (i.e., LAM and Haarlem) in almost the same proportion. Besides being more accurate in identifying strain groups/lineages, SNP-typing can also provide phylogenetic relationships between strain groups/lineages and, thus, indicate cases showing phylogenetic incongruence. Overall, the use of SNP-typing revealed striking similarities between MTBC populations from Portugal and Brazil.

  5. Quadruplex-single nucleotide polymorphisms (Quad-SNP) influence gene expression difference among individuals.

    PubMed

    Baral, Aradhita; Kumar, Pankaj; Halder, Rashi; Mani, Prithvi; Yadav, Vinod Kumar; Singh, Ankita; Das, Swapan K; Chowdhury, Shantanu

    2012-05-01

    Non-canonical guanine quadruplex structures are not only predominant but also conserved among bacterial and mammalian promoters. Moreover recent findings directly implicate quadruplex structures in transcription. These argue for an intrinsic role of the structural motif and thereby posit that single nucleotide polymorphisms (SNP) that compromise the quadruplex architecture could influence function. To test this, we analysed SNPs within quadruplex motifs (Quad-SNP) and gene expression in 270 individuals across four populations (HapMap) representing more than 14,500 genotypes. Findings reveal significant association between quadruplex-SNPs and expression of the corresponding gene in individuals (P < 0.0001). Furthermore, analysis of Quad-SNPs obtained from population-scale sequencing of 1000 human genomes showed relative selection bias against alteration of the structural motif. To directly test the quadruplex-SNP-transcription connection, we constructed a reporter system using the RPS3 promoter-remarkable difference in promoter activity in the 'quadruplex-destabilized' versus 'quadruplex-intact' promoter was noticed. As a further test, we incorporated a quadruplex motif or its disrupted counterpart within a synthetic promoter reporter construct. The quadruplex motif, and not the disrupted-motif, enhanced transcription in human cell lines of different origin. Together, these findings build direct support for quadruplex-mediated transcription and suggest quadruplex-SNPs may play significant role in mechanistically understanding variations in gene expression among individuals.

  6. Demographic Trends in Korean Native Cattle Explained Using Bovine SNP50 Beadchip.

    PubMed

    Sharma, Aditi; Lim, Dajeong; Chai, Han-Ha; Choi, Bong-Hwan; Cho, Yongmin

    2016-12-01

    Linkage disequilibrium (LD) is the non-random association between the loci and it could give us a preliminary insight into the genetic history of the population. In the present study LD patterns and effective population size (Ne) of three Korean cattle breeds along with Chinese, Japanese and Mongolian cattle were compared using the bovine Illumina SNP50 panel. The effective population size (Ne) is the number of breeding individuals in a population and is particularly important as it determines the rate at which genetic variation is lost. The genotype data in our study comprised a total of 129 samples, varying from 4 to 39 samples. After quality control there were ~29,000 single nucleotide polymorphisms (SNPs) for which r(2) value was calculated. Average distance between SNP pairs was 1.14 Mb across all breeds. Average r(2) between adjacent SNP pairs ranged between was 0.1 for Yanbian to 0.3 for Qinchuan. Effective population size of the breeds based on r(2) varied from 16 in Hainan to 226 in Yanbian. Amongst the Korean native breeds effective population size of Brindle Hanwoo was the least with Ne = 59 and Brown Hanwoo was the highest with Ne = 83. The effective population size of the Korean cattle breeds has been decreasing alarmingly over the past generations. We suggest appropriate measures to be taken to prevent these local breeds in their native tracts.

  7. FASTSNP: an always up-to-date and extendable service for SNP function analysis and prioritization

    PubMed Central

    Yuan, Hsiang-Yu; Chiou, Jen-Jie; Tseng, Wen-Hsien; Liu, Chia-Hung; Liu, Chuan-Kun; Lin, Yi-Jung; Wang, Hui-Hung; Yao, Adam; Chen, Yuan-Tsong; Hsu, Chun-Nan

    2006-01-01

    Single nucleotide polymorphism (SNP) prioritization based on the phenotypic risk is essential for association studies. Assessment of the risk requires access to a variety of heterogeneous biological databases and analytical tools. FASTSNP (function analysis and selection tool for single nucleotide polymorphisms) is a web server that allows users to efficiently identify and prioritize high-risk SNPs according to their phenotypic risks and putative functional effects. A unique feature of FASTSNP is that the functional effect information used for SNP prioritization is always up-to-date, because FASTSNP extracts the information from 11 external web servers at query time using a team of web wrapper agents. Moreover, FASTSNP is extendable by simply deploying more Web wrapper agents. To validate the results of our prioritization, we analyzed 1569 SNPs from the SNP500Cancer database. The results show that SNPs with a high predicted risk exhibit low allele frequencies for the minor alleles, consistent with a well-known finding that a strong selective pressure exists for functional polymorphisms. We have been using FASTSNP for 2 years and FASTSNP enables us to discover a novel promoter polymorphism. FASTSNP is available at . PMID:16845089

  8. Rapid Detection of Rare Deleterious Variants by Next Generation Sequencing with Optional Microarray SNP Genotype Data.

    PubMed

    Watson, Christopher M; Crinnion, Laura A; Gurgel-Gianetti, Juliana; Harrison, Sally M; Daly, Catherine; Antanavicuite, Agne; Lascelles, Carolina; Markham, Alexander F; Pena, Sergio D J; Bonthron, David T; Carr, Ian M

    2015-09-01

    Autozygosity mapping is a powerful technique for the identification of rare, autosomal recessive, disease-causing genes. The ease with which this category of disease gene can be identified has greatly increased through the availability of genome-wide SNP genotyping microarrays and subsequently of exome sequencing. Although these methods have simplified the generation of experimental data, its analysis, particularly when disparate data types must be integrated, remains time consuming. Moreover, the huge volume of sequence variant data generated from next generation sequencing experiments opens up the possibility of using these data instead of microarray genotype data to identify disease loci. To allow these two types of data to be used in an integrated fashion, we have developed AgileVCFMapper, a program that performs both the mapping of disease loci by SNP genotyping and the analysis of potentially deleterious variants using exome sequence variant data, in a single step. This method does not require microarray SNP genotype data, although analysis with a combination of microarray and exome genotype data enables more precise delineation of disease loci, due to superior marker density and distribution.

  9. Efficient SNP Discovery by Combining Microarray and Lab-on-a-Chip Data for Animal Breeding and Selection

    PubMed Central

    Huang, Chao-Wei; Lin, Yu-Tsung; Ding, Shih-Torng; Lo, Ling-Ling; Wang, Pei-Hwa; Lin, En-Chung; Liu, Fang-Wei; Lu, Yen-Wen

    2015-01-01

    The genetic markers associated with economic traits have been widely explored for animal breeding. Among these markers, single-nucleotide polymorphism (SNPs) are gradually becoming a prevalent and effective evaluation tool. Since SNPs only focus on the genetic sequences of interest, it thereby reduces the evaluation time and cost. Compared to traditional approaches, SNP genotyping techniques incorporate informative genetic background, improve the breeding prediction accuracy and acquiesce breeding quality on the farm. This article therefore reviews the typical procedures of animal breeding using SNPs and the current status of related techniques. The associated SNP information and genotyping techniques, including microarray and Lab-on-a-Chip based platforms, along with their potential are highlighted. Examples in pig and poultry with different SNP loci linked to high economic trait values are given. The recommendations for utilizing SNP genotyping in nimal breeding are summarized. PMID:27600241

  10. Citrus (Rutaceae) SNP markers based on Competitive Allele-Specific PCR; transferability across the Aurantioideae subfamily1

    PubMed Central

    Garcia-Lor, Andres; Ancillo, Gema; Navarro, Luis; Ollitrault, Patrick

    2013-01-01

    • Premise of the study: Single nucleotide polymorphism (SNP) markers based on Competitive Allele-Specific PCR (KASPar) were developed from sequences of three Citrus species. Their transferability was tested in 63 Citrus genotypes and 19 relative genera of the subfamily Aurantioideae to estimate the potential of SNP markers, selected from a limited intrageneric discovery panel, for ongoing broader diversity analysis at the intra- and intergeneric levels and systematic germplasm bank characterization. • Methods and Results: Forty-two SNP markers were developed using KASPar technology. Forty-one were successfully genotyped in all of the Citrus germplasm, where intra- and interspecific polymorphisms were observed. The transferability and diversity decreased with increasing taxonomic distance. • Conclusions: SNP markers based on the KASPar method developed from sequence data of a limited intrageneric discovery panel provide a valuable molecular resource for genetic diversity analysis of germplasm within a genus and should be useful for germplasm fingerprinting at a much broader diversity level. PMID:25202535

  11. Developing Single Nucleotide Polymorphism (SNP) markers from transcriptome sequences for the identification of longan (Dimocarpus longan) germplasm

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in...

  12. [Novel mechanism of 3' exonuclease of polymerase in maintenance of DNA replication fidelity and its application in SNP assay].

    PubMed

    Chen, Lin-Ling; Zhang, Jia; Peng, Cui-Ying; Liao, Duan-Fang; Li, Hong-Jian; Gao, Han-Lin; Li, Kai

    2005-03-01

    Polymerase with 3' to 5'exonulcease plays an important role in the maintenance of in vivo DNA replication fidelity. In order to develop more reliable SNP assays, we revisit the underlying molecular mechanisms by which DNA polymerases with 3' exonucleases maintain high fidelity of DNA replication. In addition to mismatch removal by proofreading, we recently discovered a premature termination of polymerization by a new mechanism of OFF-switch. This novel ON/OFF switch turns off DNA polymerization from mismatched primers and turns on DNA polymerization from matched primers. Two SNP assays were developed based on the proofreading and the newly identified OFF-switch respectively: terminal labeled primer extension and the ON/OFF switch operated SNP assay. These two new methods are well adapted to conventional techniques such as electrophoresis, real time PCR, microplates, and microarray. Application of these reliable SNP assays will greatly facilitate genetic and biomedical studies in the post-genome era.

  13. Dense array expressions

    NASA Astrophysics Data System (ADS)

    Wilson, Joseph N.; Chen, LiangMing

    1999-10-01

    Various researchers have realized the value of implementing loop fusion to evaluate dense (pointwise) array expressions. Recently, the method of template metaprogramming in C++ has been used to significantly speed-up the evaluation of array expressions, allowing C++ programs to achieve performance comparable to or better than FORTRAN for numerical analysis applications. Unfortunately, the template metaprogramming technique suffers from several limitations in applicability, portability, and potential performance. We present a framework for evaluating dense array expressions in object-oriented programming languages. We demonstrate how this technique supports both common subexpression elimination and threaded implementation and compare its performance to object-library and hand-generated code.

  14. Arrays vs. single telescopes

    NASA Astrophysics Data System (ADS)

    Johnson, H. L.

    The question of the relative efficiencies of telescope arrays versus an equivalent mirror-area very large telescope is re-examined and summarized. Four separate investigations by Bowen, Johnson and Richards, Code, and Disney all came to the same conclusion: that an array of telescopes is superior, both scientifically and economically, to a single very large telescope. The costs of recently completed telescopes are compared. The costs of arrays of telescopes are shown to be significantly lower than that of a single, very large telescope, with the further advantage that because existing, proven, designs can be used, no engineering 'break-throughs' are needed.

  15. Characterization of polyploid wheat genomic diversity using a high-density 90,000 single nucleotide polymorphism array.

    PubMed

    Wang, Shichen; Wong, Debbie; Forrest, Kerrie; Allen, Alexandra; Chao, Shiaoman; Huang, Bevan E; Maccaferri, Marco; Salvi, Silvio; Milner, Sara G; Cattivelli, Luigi; Mastrangelo, Anna M; Whan, Alex; Stephen, Stuart; Barker, Gary; Wieseke, Ralf; Plieske, Joerg; Lillemo, Morten; Mather, Diane; Appels, Rudi; Dolferus, Rudy; Brown-Guedira, Gina; Korol, Abraham; Akhunova, Alina R; Feuillet, Catherine; Salse, Jerome; Morgante, Michele; Pozniak, Curtis; Luo, Ming-Cheng; Dvorak, Jan; Morell, Matthew; Dubcovsky, Jorge; Ganal, Martin; Tuberosa, Roberto; Lawley, Cindy; Mikoulitch, Ivan; Cavanagh, Colin; Edwards, Keith J; Hayden, Matthew; Akhunov, Eduard

    2014-08-01

    High-density single nucleotide polymorphism (SNP) genotyping arrays are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships between individuals in populations and studying marker-trait associations in mapping experiments. We developed a genotyping array including about 90,000 gene-associated SNPs and used it to characterize genetic variation in allohexaploid and allotetraploid wheat populations. The array includes a significant fraction of common genome-wide distributed SNPs that are represented in populations of diverse geographical origin. We used density-based spatial clustering algorithms to enable high-throughput genotype calling in complex data sets obtained for polyploid wheat. We show that these model-free clustering algorithms provide accurate genotype calling in the presence of multiple clusters including clusters with low signal intensity resulting from significant sequence divergence at the target SNP site or gene deletions. Assays that detect low-intensity clusters can provide insight into the distribution of presence-absence variation (PAV) in wheat populations. A total of 46 977 SNPs from the wheat 90K array were genetically mapped using a combination of eight mapping populations. The developed array and cluster identification algorithms provide an opportunity to infer detailed haplotype structure in polyploid wheat and will serve as an invaluable resource for diversity studies and investigating the genetic basis of trait variation in wheat.

  16. Characterization of polyploid wheat genomic diversity using a high-density 90 000 single nucleotide polymorphism array

    PubMed Central

    Wang, Shichen; Wong, Debbie; Forrest, Kerrie; Allen, Alexandra; Chao, Shiaoman; Huang, Bevan E; Maccaferri, Marco; Salvi, Silvio; Milner, Sara G; Cattivelli, Luigi; Mastrangelo, Anna M; Whan, Alex; Stephen, Stuart; Barker, Gary; Wieseke, Ralf; Plieske, Joerg; International Wheat Genome Sequencing Consortium; Lillemo, Morten; Mather, Diane; Appels, Rudi; Dolferus, Rudy; Brown-Guedira, Gina; Korol, Abraham; Akhunova, Alina R; Feuillet, Catherine; Salse, Jerome; Morgante, Michele; Pozniak, Curtis; Luo, Ming-Cheng; Dvorak, Jan; Morell, Matthew; Dubcovsky, Jorge; Ganal, Martin; Tuberosa, Roberto; Lawley, Cindy; Mikoulitch, Ivan; Cavanagh, Colin; Edwards, Keith J; Hayden, Matthew; Akhunov, Eduard

    2014-01-01

    High-density single nucleotide polymorphism (SNP) genotyping arrays are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships between individuals in populations and studying marker–trait associations in mapping experiments. We developed a genotyping array including about 90 000 gene-associated SNPs and used it to characterize genetic variation in allohexaploid and allotetraploid wheat populations. The array includes a significant fraction of common genome-wide distributed SNPs that are represented in populations of diverse geographical origin. We used density-based spatial clustering algorithms to enable high-throughput genotype calling in complex data sets obtained for polyploid wheat. We show that these model-free clustering algorithms provide accurate genotype calling in the presence of multiple clusters including clusters with low signal intensity resulting from significant sequence divergence at the target SNP site or gene deletions. Assays that detect low-intensity clusters can provide insight into the distribution of presence–absence variation (PAV) in wheat populations. A total of 46 977 SNPs from the wheat 90K array were genetically mapped using a combination of eight mapping populations. The developed array and cluster identification algorithms provide an opportunity to infer detailed haplotype structure in polyploid wheat and will serve as an invaluable resource for diversity studies and investigating the genetic basis of trait variation in wheat. PMID:24646323

  17. A Multiple-SNP Approach for Genome-Wide Association Study of Milk Production Traits in Chinese Holstein Cattle

    PubMed Central

    Fang, Ming; Fu, Weixuan; Jiang, Dan; Zhang, Qin; Sun, Dongxiao; Ding, Xiangdong; Liu, Jianfeng

    2014-01-01

    The multiple-SNP analysis has been studied by many researchers, in which the effects of multiple SNPs are simultaneously estimated and tested in a multiple linear regression. The multiple-SNP association analysis usually has higher power and lower false-positive rate for detecting causative SNP(s) than single marker analysis (SMA). Several methods have been proposed to simultaneously estimate and test multiple SNP effects. In this research, a fast method called MEML (Mixed model based Expectation-Maximization Lasso algorithm) was developed for simultaneously estimate of multiple SNP effects. An improved Lasso prior was assigned to SNP effects which were estimated by searching the maximum joint posterior mode. The residual polygenic effect was included in the model to absorb many tiny SNP effects, which is treated as missing data in our EM algorithm. A series of simulation experiments were conducted to validate the proposed method, and the results showed that compared with SMMA, the new method can dramatically decrease the false-positive rate. The new method was also applied to the 50k SNP-panel dataset for genome-wide association study of milk production traits in Chinese Holstein cattle. Totally, 39 significant SNPs and their nearby 25 genes were found. The number of significant SNPs is remarkably fewer than that by SMMA which found 105 significant SNPs. Among 39 significant SNPs, 8 were also found by SMMA and several well-known QTLs or genes were confirmed again; furthermore, we also got some positional candidate gene with potential function of effecting milk production traits. These novel findings in our research should be valuable for further investigation. PMID:25148050

  18. TheSNPpit—A High Performance Database System for Managing Large Scale SNP Data

    PubMed Central

    Groeneveld, Eildert; Lichtenberg, Helmut

    2016-01-01

    The fast development of high throughput genotyping has opened up new possibilities in genetics while at the same time producing considerable data handling issues. TheSNPpit is a database system for managing large amounts of multi panel SNP genotype data from any genotyping platform. With an increasing rate of genotyping in areas like animal and plant breeding as well as human genetics, already now hundreds of thousand of individuals need to be managed. While the common database design with one row per SNP can manage hundreds of samples this approach becomes progressively slower as the size of the data sets increase until it finally fails completely once tens or even hundreds of thousands of individuals need to be managed. TheSNPpit has implemented three ideas to also accomodate such large scale experiments: highly compressed vector storage in a relational database, set based data manipulation, and a very fast export written in C with Perl as the base for the framework and PostgreSQL as the database backend. Its novel subset system allows the creation of named subsets based on the filtering of SNP (based on major allele frequency, no-calls, and chromosomes) and manually applied sample and SNP lists at negligible storage costs, thus avoiding the issue of proliferating file copies. The named subsets are exported for down stream analysis. PLINK ped and map files are processed as in- and outputs. TheSNPpit allows management of different panel sizes in the same population of individuals when higher density panels replace previous lower density versions as it occurs in animal and plant breeding programs. A completely generalized procedure allows storage of phenotypes. TheSNPpit only occupies 2 bits for storing a single SNP implying a capacity of 4 mio SNPs per 1MB of disk storage. To investigate performance scaling, a database with more than 18.5 mio samples has been created with 3.4 trillion SNPs from 12 panels ranging from 1000 through 20 mio SNPs resulting in a

  19. TheSNPpit-A High Performance Database System for Managing Large Scale SNP Data.

    PubMed

    Groeneveld, Eildert; Lichtenberg, Helmut

    2016-01-01

    The fast development of high throughput genotyping has opened up new possibilities in genetics while at the same time producing considerable data handling issues. TheSNPpit is a database system for managing large amounts of multi panel SNP genotype data from any genotyping platform. With an increasing rate of genotyping in areas like animal and plant breeding as well as human genetics, already now hundreds of thousand of individuals need to be managed. While the common database design with one row per SNP can manage hundreds of samples this approach becomes progressively slower as the size of the data sets increase until it finally fails completely once tens or even hundreds of thousands of individuals need to be managed. TheSNPpit has implemented three ideas to also accomodate such large scale experiments: highly compressed vector storage in a relational database, set based data manipulation, and a very fast export written in C with Perl as the base for the framework and PostgreSQL as the database backend. Its novel subset system allows the creation of named subsets based on the filtering of SNP (based on major allele frequency, no-calls, and chromosomes) and manually applied sample and SNP lists at negligible storage costs, thus avoiding the issue of proliferating file copies. The named subsets are exported for down stream analysis. PLINK ped and map files are processed as in- and outputs. TheSNPpit allows management of different panel sizes in the same population of individuals when higher density panels replace previous lower density versions as it occurs in animal and plant breeding programs. A completely generalized procedure allows storage of phenotypes. TheSNPpit only occupies 2 bits for storing a single SNP implying a capacity of 4 mio SNPs per 1MB of disk storage. To investigate performance scaling, a database with more than 18.5 mio samples has been created with 3.4 trillion SNPs from 12 panels ranging from 1000 through 20 mio SNPs resulting in a

  20. SNP genotypes of olfactory receptor genes associated with olfactory ability in German Shepherd dogs.

    PubMed

    Yang, M; Geng, G-J; Zhang, W; Cui, L; Zhang, H-X; Zheng, J-L

    2016-04-01

    To find out the relationship between SNP genotypes of canine olfactory receptor genes and olfactory ability, 28 males and 20 females from German Shepherd dogs in police service were scored by odor detection tests and analyzed using the Beckman GenomeLab SNPstream. The representative 22 SNP loci from the exonic regions of 12 olfactory receptor genes were investigated, and three kinds of odor (human, ice drug and trinitrotoluene) were detected. The results showed that the SNP genotypes at the OR10H1-like:c.632C>T, OR10H1-like:c.770A>T, OR2K2-like:c.518G>A, OR4C11-like:c.511T>G and OR4C11-like:c.692G>A loci had a statistically significant effect on the scenting abilities (P < 0.001). The kind of odor influenced the performances of the dogs (P < 0.001). In addition, there were interactions between genotype and the kind of odor at the following loci: OR10H1-like:c.632C>T, OR10H1-like:c.770A>T, OR4C11-like:c.511T>G and OR4C11-like:c.692G>A (P < 0.001). The dogs with genotype CC at the OR10H1-like:c.632C>T, genotype AA at the OR10H1-like:c.770A>T, genotype TT at the OR4C11-like:c.511T>G and genotype GG at the OR4C11-like:c.692G>A loci did better at detecting the ice drug. We concluded that there was linkage between certain SNP genotypes and the olfactory ability of dogs and that SNP genotypes might be useful in determining dogs' scenting potential.

  1. Identification of SNP and SSR Markers in Finger Millet Using Next Generation Sequencing Technologies

    PubMed Central

    Gimode, Davis; Odeny, Damaris A.; de Villiers, Etienne P.; Wanyonyi, Solomon; Dida, Mathews M.; Mneney, Emmarold E.; Muchugi, Alice; Machuka, Jesse; de Villiers, Santie M.

    2016-01-01

    Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS) technologies to develop both Simple Sequence Repeat (SSR) and Single Nucleotide Polymorphism (SNP) markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC) was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet. Our results also reveal an unexploited finger millet genetic resource that can be included in the regional

  2. Development and Evaluation of a High Density Genotyping ‘Axiom_Arachis’ Array with 58 K SNPs for Accelerating Genetics and Breeding in Groundnut

    PubMed Central

    Pandey, Manish K.; Agarwal, Gaurav; Kale, Sandip M.; Clevenger, Josh; Nayak, Spurthi N.; Sriswathi, Manda; Chitikineni, Annapurna; Chavarro, Carolina; Chen, Xiaoping; Upadhyaya, Hari D.; Vishwakarma, Manish K.; Leal-Bertioli, Soraya; Liang, Xuanqiang; Bertioli, David J.; Guo, Baozhu; Jackson, Scott A.; Ozias-Akins, Peggy; Varshney, Rajeev K.

    2017-01-01

    Single nucleotide polymorphisms (SNPs) are the most abundant DNA sequence variation in the genomes which can be used to associate genotypic variation to the phenotype. Therefore, availability of a high-density SNP array with uniform genome coverage can advance genetic studies and breeding applications. Here we report the development of a high-density SNP array ‘Axiom_Arachis’ with 58 K SNPs and its utility in groundnut genetic diversity study. In this context, from a total of 163,782 SNPs derived from DNA resequencing and RNA-sequencing of 41 groundnut accessions and wild diploid ancestors, a total of 58,233 unique and informative SNPs were selected for developing the array. In addition to cultivated groundnuts (Arachis hypogaea), fair representation was kept for other diploids (A. duranensis, A. stenosperma, A. cardenasii, A. magna and A. batizocoi). Genotyping of the groundnut ‘Reference Set’ containing 300 genotypes identified 44,424 polymorphic SNPs and genetic diversity analysis provided in-depth insights into the genetic architecture of this material. The availability of the high-density SNP array ‘Axiom_Arachis’ with 58 K SNPs will accelerate the process of high resolution trait genetics and molecular breeding in cultivated groundnut. PMID:28091575

  3. Supplementing High-Density SNP Microarrays for Additional Coverage of Disease-Related Genes: Addiction as a Paradigm

    PubMed Central

    Saccone, Scott F.; Bierut, Laura J.; Chesler, Elissa J.; Kalivas, Peter W.; Lerman, Caryn; Saccone, Nancy L.; Uhl, George R.; Li, Chuan-Yun; Philip, Vivek M.; Edenberg, Howard J.; Sherry, Stephen T.; Feolo, Michael; Moyzis, Robert K.; Rutter, Joni L.

    2009-01-01

    Commercial SNP microarrays now provide comprehensive and affordable coverage of the human genome. However, some diseases have biologically relevant genomic regions that may require additional coverage. Addiction, for example, is thought to be influenced by complex interactions among many relevant genes and pathways. We have assembled a list of 486 biologically relevant genes nominated by a panel of experts on addiction. We then added 424 genes that showed evidence of association with addiction phenotypes through mouse QTL mappings and gene co-expression analysis. We demonstrate that there are a substantial number of SNPs in these genes that are not well represented by commercial SNP platforms. We address this problem by introducing a publicly available SNP database for addiction. The database is annotated using numeric prioritization scores indicating the extent of biological relevance. The scores incorporate a number of factors such as SNP/gene functional properties (including synonymy and promoter regions), data from mouse systems genetics and measures of human/mouse evolutionary conservation. We then used HapMap genotyping data to determine if a SNP is tagged by a commercial microarray through linkage disequilibrium. This combination of biological prioritization scores and LD tagging annotation will enable addiction researchers to supplement commercial SNP microarrays to ensure comprehensive coverage of biologically relevant regions. PMID:19381300

  4. Supplementing High-Density SNP Microarrays for Additional Coverage of Disease-Related Genes: Addiction as a Paradigm

    SciTech Connect

    SacconePhD, Scott F; Chesler, Elissa J; Bierut, Laura J; Kalivas, Peter J; Lerman, Caryn; Saccone, Nancy L; Uhl, George R; Li, Chuan-Yun; Philip, Vivek M; Edenberg, Howard; Sherry, Steven; Feolo, Michael; Moyzis, Robert K; Rutter, Joni L

    2009-01-01

    Commercial SNP microarrays now provide comprehensive and affordable coverage of the human genome. However, some diseases have biologically relevant genomic regions that may require additional coverage. Addiction, for example, is thought to be influenced by complex interactions among many relevant genes and pathways. We have assembled a list of 486 biologically relevant genes nominated by a panel of experts on addiction. We then added 424 genes that showed evidence of association with addiction phenotypes through mouse QTL mappings and gene co-expression analysis. We demonstrate that there are a substantial number of SNPs in these genes that are not well represented by commercial SNP platforms. We address this problem by introducing a publicly available SNP database for addiction. The database is annotated using numeric prioritization scores indicating the extent of biological relevance. The scores incorporate a number of factors such as SNP/gene functional properties (including synonymy and promoter regions), data from mouse systems genetics and measures of human/mouse evolutionary conservation. We then used HapMap genotyping data to determine if a SNP is tagged by a commercial microarray through linkage disequilibrium. This combination of biological prioritization scores and LD tagging annotation will enable addiction researchers to supplement commercial SNP microarrays to ensure comprehensive coverage of biologically relevant regions.

  5. Allele frequencies for 40 autosomal SNP loci typed for US population samples using electrospray ionization mass spectrometry

    PubMed Central

    Kiesler, Kevin M.; Vallone, Peter M.

    2013-01-01

    Aim To type a set of 194 US African American, Caucasian, and Hispanic samples (self-declared ancestry) for 40 autosomal single nucleotide polymorphism (SNP) markers intended for human identification purposes. Methods Genotyping was performed on an automated commercial electrospray ionization time-of-flight mass spectrometer, the PLEX-ID. The 40 SNP markers were amplified in eight unique 5plex PCRs, desalted, and resolved based on amplicon mass. For each of the three US sample groups statistical analyses were performed on the resulting genotypes. Results The assay was found to be robust and capable of genotyping the 40 SNP markers consuming approximately 4 nanograms of template per sample. The combined random match probabilities for the 40 SNP assay ranged from 10−16 to 10−21. Conclusion The multiplex PLEX-ID SNP-40 assay is the first fully automated genotyping method capable of typing a panel of 40 forensically relevant autosomal SNP markers on a mass spectrometry platform. The data produced provided the first allele frequencies estimates for these 40 SNPs in a National Institute of Standards and Technology US population sample set. No population bias was detected although one locus deviated from its expected level of heterozygosity. PMID:23771752

  6. Virtual center arraying

    NASA Technical Reports Server (NTRS)

    Deutsch, L. J.; Lipes, R. G.; Miller, R. L.

    1981-01-01

    Methods to increase the amount of data that can be received from outer planet missions are described with emphasis on antenna arraying systems designed to increase the total effective aperture of the receiving system. One such method is virtual center arraying (VCA). In VCA, a combined carrier reference is derived at a point that is, conceptually, the geometric center of the array. This point need not coincide with any of the actual antennas of the array. A noise analysis of the VCA system is given along with formulas for the phase jitter as a function of loop bandwidths and the amount of loop damping. If the ratio of the loop bandwidths of the center loop to the vertex loops is greater than 100, then the jitter is very nearly equal to that expected for ideal combined carrier referencing.

  7. Flexible retinal electrode array

    DOEpatents

    Okandan, Murat; Wessendorf, Kurt O.; Christenson, Todd R.

    2006-10-24

    An electrode array which has applications for neural stimulation and sensing. The electrode array can include a large number of electrodes each of which is flexibly attached to a common substrate using a plurality of springs to allow the electrodes to move independently. The electrode array can be formed from a combination of bulk and surface micromachining, with electrode tips that can include an electroplated metal (e.g. platinum, iridium, gold or titanium) or a metal oxide (e.g. iridium oxide) for biocompatibility. The electrode array can be used to form a part of a neural prosthesis, and is particularly well adapted for use in an implantable retinal prosthesis where the electrodes can be tailored to provide a uniform gentle contact pressure with optional sensing of this contact pressure at one or more of the electrodes.

  8. Expandable LED array interconnect

    DOEpatents

    Yuan, Thomas Cheng-Hsin; Keller, Bernd

    2011-03-01

    A light emitting device that can function as an array element in an expandable array of such devices. The light emitting device comprises a substrate that has a top surface and a plurality of edges. Input and output terminals are mounted to the top surface of the substrate. Both terminals comprise a plurality of contact pads disposed proximate to the edges of the substrate, allowing for easy access to both terminals from multiple edges of the substrate. A lighting element is mounted to the top surface of the substrate. The lighting element is connected between the input and output terminals. The contact pads provide multiple access points to the terminals which allow for greater flexibility in design when the devices are used as array elements in an expandable array.

  9. Economical custom LSI arrays

    NASA Technical Reports Server (NTRS)

    Feller, A.; Smith, A.; Ramondetta, P.; Noto, R.; Lombardi, T.

    1976-01-01

    Automatic design technique uses standard circuit cells for producing large-scale integrated arrays. Computerized fabrication process provides individual cells of high density and efficiency, quick turnaround time, low cost, and ease of corrections for changes and errors.

  10. Conformal array antenna subsystem

    NASA Astrophysics Data System (ADS)

    1985-04-01

    An antenna subsystem to communicate between Ariane 4 and a data relay satellite was studied, concluding that the original ideas on ring antennas should be corrected due to the wide margin of coverage required in elevation for such antennas, which implies the need of splitting the coverage. Nevertheless, the study of cylindrical and conical conformal arrays was continued in view of their intrinsic interest. Needed coverages with specified gain can be obtained with a set of microstrip circular patch antennas. For the lower stage, a single patch is enough. For geostationary missions, one horizontal array is used, and for heliosynchronous missions two horizontal arrays and a vertical one. The numerical study carried out on omniazimuthal ring antennas shows that a tendency to omnidirectional pattern exists in spite of the directivity of the elementary radiators. A small pointing improvement of the meridian pattern can be obtained by means of conical arrays instead of the cylindrical ones.

  11. Multi Sensor Array

    NASA Technical Reports Server (NTRS)

    Immer, Christopher; Voska, Ned (Technical Monitor)

    2002-01-01

    This paper presents viewgraphs on the Multi Sensor Array. The topics include: 1) MSA Algorithm; 2) Types of Sensors for the MSA; 3) How to test the MSA; 4) Monte Carlo Simulation; and 5) Accelerated Life Tests.

  12. Glory Solar Array Deployment

    NASA Video Gallery

    The Glory spacecraft uses Orbital Sciences Corporation Space Systems Group's LEOStar-1 bus design, with deployable, four-panel solar arrays. This conceptual animation reveals Glory's unique solar a...

  13. Random array grid collimator

    DOEpatents

    Fenimore, E.E.

    1980-08-22

    A hexagonally shaped quasi-random no-two-holes touching grid collimator. The quasi-random array grid collimator eliminates contamination from small angle off-axis rays by using a no-two-holes-touching pattern which simultaneously provides for a self-supporting array increasng throughput by elimination of a substrate. The presentation invention also provides maximum throughput using hexagonally shaped holes in a hexagonal lattice pattern for diffraction limited applications. Mosaicking is also disclosed for reducing fabrication effort.

  14. Blood pressure loci identified with a gene-centric array.

    PubMed

    Johnson, Toby; Gaunt, Tom R; Newhouse, Stephen J; Padmanabhan, Sandosh; Tomaszewski, Maciej; Kumari, Meena; Morris, Richard W; Tzoulaki, Ioanna; O'Brien, Eoin T; Poulter, Neil R; Sever, Peter; Shields, Denis C; Thom, Simon; Wannamethee, Sasiwarang G; Whincup, Peter H; Brown, Morris J; Connell, John M; Dobson, Richard J; Howard, Philip J; Mein, Charles A; Onipinla, Abiodun; Shaw-Hawkins, Sue; Zhang, Yun; Davey Smith, George; Day, Ian N M; Lawlor, Debbie A; Goodall, Alison H; Fowkes, F Gerald; Abecasis, Gonçalo R; Elliott, Paul; Gateva, Vesela; Braund, Peter S; Burton, Paul R; Nelson, Christopher P; Tobin, Martin D; van der Harst, Pim; Glorioso, Nicola; Neuvrith, Hani; Salvi, Erika; Staessen, Jan A; Stucchi, Andrea; Devos, Nabila; Jeunemaitre, Xavier; Plouin, Pierre-François; Tichet, Jean; Juhanson, Peeter; Org, Elin; Putku, Margus; Sõber, Siim; Veldre, Gudrun; Viigimaa, Margus; Levinsson, Anna; Rosengren, Annika; Thelle, Dag S; Hastie, Claire E; Hedner, Thomas; Lee, Wai K; Melander, Olle; Wahlstrand, Björn; Hardy, Rebecca; Wong, Andrew; Cooper, Jackie A; Palmen, Jutta; Chen, Li; Stewart, Alexandre F R; Wells, George A; Westra, Harm-Jan; Wolfs, Marcel G M; Clarke, Robert; Franzosi, Maria Grazia; Goel, Anuj; Hamsten, Anders; Lathrop, Mark; Peden, John F; Seedorf, Udo; Watkins, Hugh; Ouwehand, Willem H; Sambrook, Jennifer; Stephens, Jonathan; Casas, Juan-Pablo; Drenos, Fotios; Holmes, Michael V; Kivimaki, Mika; Shah, Sonia; Shah, Tina; Talmud, Philippa J; Whittaker, John; Wallace, Chris; Delles, Christian; Laan, Maris; Kuh, Diana; Humphries, Steve E; Nyberg, Fredrik; Cusi, Daniele; Roberts, Robert; Newton-Cheh, Christopher; Franke, Lude; Stanton, Alice V; Dominiczak, Anna F; Farrall, Martin; Hingorani, Aroon D; Samani, Nilesh J; Caulfield, Mark J; Munroe, Patricia B

    2011-12-09

    Raised blood pressure (BP) is a major risk factor for cardiovascular disease. Previous studies have identified 47 distinct genetic variants robustly associated with BP, but collectively these explain only a few percent of the heritability for BP phenotypes. To find additional BP loci, we used a bespoke gene-centric array to genotype an independent discovery sample of 25,118 individuals that combined hypertensive case-control and general population samples. We followed up four SNPs associated with BP at our p < 8.56 × 10(-7) study-specific significance threshold and six suggestively associated SNPs in a further 59,349 individuals. We identified and replicated a SNP at LSP1/TNNT3, a SNP at MTHFR-NPPB independent (r(2) = 0.33) of previous reports, and replicated SNPs at AGT and ATP2B1 reported previously. An analysis of combined discovery and follow-up data identified SNPs significantly associated with BP at p < 8.56 × 10(-7) at four further loci (NPR3, HFE, NOS3, and SOX6). The high number of discoveries made with modest genotyping effort can be attributed to using a large-scale yet targeted genotyping array and to the development of a weighting scheme that maximized power when meta-analyzing results from samples ascertained with extreme phenotypes, in combination with results from nonascertained or population samples. Chromatin immunoprecipitation and transcript expression data highlight potential gene regulatory mechanisms at the MTHFR and NOS3 loci. These results provide candidates for further study to help dissect mechanisms affecting BP and highlight the utility of studying SNPs and samples that are independent of those studied previously even when the sample size is smaller than that in previous studies.

  15. Sampled Longest Common Prefix Array

    NASA Astrophysics Data System (ADS)

    Sirén, Jouni

    When augmented with the longest common prefix (LCP) array and some other structures, the suffix array can solve many string processing problems in optimal time and space. A compressed representation of the LCP array is also one of the main building blocks in many compressed suffix tree proposals. In this paper, we describe a new compressed LCP representation: the sampled LCP array. We show that when used with a compressed suffix array (CSA), the sampled LCP array often offers better time/space trade-offs than the existing alternatives. We also show how to construct the compressed representations of the LCP array directly from a CSA.

  16. Imputation of microsatellite alleles from dense SNP genotypes for parentage verification across multiple Bos taurus and Bos indicus breeds

    PubMed Central

    McClure, Matthew C.; Sonstegard, Tad S.; Wiggans, George R.; Van Eenennaam, Alison L.; Weber, Kristina L.; Penedo, Cecilia T.; Berry, Donagh P.; Flynn, John; Garcia, Jose F.; Carmo, Adriana S.; Regitano, Luciana C. A.; Albuquerque, Milla; Silva, Marcos V. G. B.; Machado, Marco A.; Coffey, Mike; Moore, Kirsty; Boscher, Marie-Yvonne; Genestout, Lucie; Mazza, Raffaele; Taylor, Jeremy F.; Schnabel, Robert D.; Simpson, Barry; Marques, Elisa; McEwan, John C.; Cromie, Andrew; Coutinho, Luiz L.; Kuehn, Larry A.; Keele, John W.; Piper, Emily K.; Cook, Jim; Williams, Robert; Van Tassell, Curtis P.

    2013-01-01

    To assist cattle producers transition from microsatellite (MS) to single nucleotide polymorphism (SNP) genotyping for parental verification we previously devised an effective and inexpensive method to impute MS alleles from SNP haplotypes. While the reported method was verified with only a limited data set (N = 479) from Brown Swiss, Guernsey, Holstein, and Jersey cattle, some of the MS-SNP haplotype associations were concordant across these phylogenetically diverse breeds. This implied that some haplotypes predate modern breed formation and remain in strong linkage disequilibrium. To expand the utility of MS allele imputation across breeds, MS and SNP data from more than 8000 animals representing 39 breeds (Bos taurus and B. indicus) were used to predict 9410 SNP haplotypes, incorporating an average of 73 SNPs per haplotype, for which alleles from 12 MS markers could be accurately be imputed. Approximately 25% of the MS-SNP haplotypes were present in multiple breeds (N = 2 to 36 breeds). These shared haplotypes allowed for MS imputation in breeds that were not represented in the reference population with only a small increase in Mendelian inheritance inconsistancies. Our reported reference haplotypes can be used for any cattle breed and the reported methods can be applied to any species to aid the transition from MS to SNP genetic markers. While ~91% of the animals with imputed alleles for 12 MS markers had ≤1 Mendelian inheritance conflicts with their parents' reported MS genotypes, this figure was 96% for our reference animals, indicating potential errors in the reported MS genotypes. The workflow we suggest autocorrects for genotyping errors and rare haplotypes, by MS genotyping animals whose imputed MS alleles fail parentage verification, and then incorporating those animals into the reference dataset. PMID:24065982

  17. Somatic Mutation of the SNP rs11614913 and Its Association with Increased MIR 196A2 Expression in Breast Cancer.

    PubMed

    Zhao, Huanhuan; Xu, Jingman; Zhao, Dan; Geng, Meijuan; Ge, Haize; Fu, Li; Zhu, Zhengmao

    2016-02-01

    Common genetic variants (single-nucleotide polymorphisms [SNPs]) in microRNA genes may alter their maturation or expression, resulting in varied functional consequences. Several studies have evaluated the association between the SNP rs11614913 and cancer risk in diverse populations and in a range of cancers, with contradictory outcomes. In this study, we examined 114 paired samples (tumor and normal tissues) from breast cancer patients to study the genotype distribution and somatic mutation of the SNP in MIR 196A2 (rs11614913 C-T). In addition, we evaluated their influence on the mature MIR 196A2 expression. We found that 14% (16/114) of tumors underwent somatic mutation of the SNP rs11614913. Moreover, the CT heterozygous and the CC homozygous states of SNP rs11614913 were more prone to mutation, while the TT homozygous state appeared to be resistant. We further detected a significant increase (p = 0.002) in mature MIR 196A2 expression in breast cancer. In particular, we found a significant association between the occurrence of SNP rs11614913 mutation and high expression (p = 0.0002). In addition, the mature MIR 196A2 expression level was significantly associated with the higher tumor grade (p = 0.004). Taken together, our results seem to demonstrate that somatic mutation of SNP rs11614913 in MIR 196A2 can have an influence on its expression. In addition, it indicated that an unknown mechanism is responsible for both the mutation of SNP rs11614913 and the dysregulation of mature MIR 196A2 expression.

  18. Deriving Gene Networks from SNP Associated with Triacylglycerol and Phospholipid Fatty Acid Fractions from Ribeyes of Angus Cattle

    PubMed Central

    Buchanan, Justin W.; Reecy, James M.; Garrick, Dorian J.; Duan, Qing; Beitz, Don C.; Koltes, James E.; Saatchi, Mahdi; Koesterke, Lars; Mateescu, Raluca G.

    2016-01-01

    The fatty acid profile of beef is a complex trait that can benefit from gene-interaction network analysis to understand relationships among loci that contribute to phenotypic variation. Phenotypic measures of fatty acid profile from triacylglycerol and phospholipid fractions of longissimus muscle, pedigree information, and Illumina 54 k bovine SNP genotypes were utilized to derive an annotated gene network associated with fatty acid composition in 1,833 Angus beef cattle. The Bayes-B statistical model was utilized to perform a genome wide association study to estimate associations between 54 k SNP genotypes and 39 individual fatty acid phenotypes within each fraction. Posterior means of the effects were estimated for each of the 54 k SNP and for the collective effects of all the SNP in every 1-Mb genomic window in terms of the proportion of genetic variance explained by the window. Windows that explained the largest proportions of genetic variance for individual lipids were found in the triacylglycerol fraction. There was almost no overlap in the genomic regions explaining variance between the triacylglycerol and phospholipid fractions. Partial correlations were used to identify correlated regions of the genome for the set of largest 1 Mb windows that explained up to 35% genetic variation in either fatty acid fraction. SNP were allocated to windows based on the bovine UMD3.1 assembly. Gene network clusters were generated utilizing a partial correlation and information theory algorithm. Results were used in conjunction with network scoring and visualization software to analyze correlated SNP across 39 fatty acid phenotypes to identify SNP of significance. Significant pathways implicated in fatty acid metabolism through GO term enrichment analysis included homeostasis of number of cells, homeostatic process, coenzyme/cofactor activity, and immunoglobulin. These results suggest different metabolic pathways regulate the development of different types of lipids found in

  19. Evaporating metal nanocrystal arrays.

    PubMed

    Zhang, Xue; Joy, James C; Zhao, Chenwei; Kim, Jin Ho; Fernandes, Gustavo; Xu, J M; Valles, James M

    2017-03-10

    Anodic aluminum oxide (AAO) substrates with a self-ordered triangular array of nanopores provide the means to fabricate multiple forms of nano materials, such as nanowires and nanoparticles. This study focuses on nanostructures that emerge in thin films of metals thermally evaporated onto the surface of AAO. Previous work showed that films of different evaporated metals assume dramatically different structures, e.g. an ordered triangular array of nearly monodisperse nanoparticles forms for lead (Pb) while a polycrystalline nanohoneycomb structure forms for silver (Ag). Here, we present investigations of the effects of substrate temperature and deposition angle that reveal the processes controlling the nano particle array formation. Our findings indicate that arrays form provided the grain nucleation density exceeds the pore density and the atomic mobility is high enough to promote grain coalescence. They introduce a method for producing films with anisotropic grain array structure. The results provide insight into the influence of substrate nano-morphology on thin film growth energetics and kinetics that can be harnessed for creating films with other novel nano-structures.

  20. Evaporating metal nanocrystal arrays

    NASA Astrophysics Data System (ADS)

    Zhang, Xue; Joy, James C.; Zhao, Chenwei; Kim, Jin Ho; Fernandes, Gustavo; Xu, J. M.; Valles, James M., Jr.

    2017-03-01

    Anodic aluminum oxide (AAO) substrates with a self-ordered triangular array of nanopores provide the means to fabricate multiple forms of nano materials, such as nanowires and nanoparticles. This study focuses on nanostructures that emerge in thin films of metals thermally evaporated onto the surface of AAO. Previous work showed that films of different evaporated metals assume dramatically different structures, e.g. an ordered triangular array of nearly monodisperse nanoparticles forms for lead (Pb) while a polycrystalline nanohoneycomb structure forms for silver (Ag). Here, we present investigations of the effects of substrate temperature and deposition angle that reveal the processes controlling the nano particle array formation. Our findings indicate that arrays form provided the grain nucleation density exceeds the pore density and the atomic mobility is high enough to promote grain coalescence. They introduce a method for producing films with anisotropic grain array structure. The results provide insight into the influence of substrate nano-morphology on thin film growth energetics and kinetics that can be harnessed for creating films with other novel nano-structures.

  1. Folding of Nucleosome Arrays

    NASA Astrophysics Data System (ADS)

    Howell, Steven; Jimenez-Useche, Isabel; Andresen, Kurt; Yuan, Chongli; Qiu, Xiangyun

    2014-03-01

    Chromatin conformation and dynamics is central to gene functions including packaging, regulation, and repair. At the molecular level, the basic building block of chromatin is a nucleosome core particle (NCP) made of 147 base pairs (bp) of dsDNA wrapped around an octamer of histone proteins. These NCPs are connected by short 10-90 bps of linker DNA as beads on a string. Key factors determining the packaging of NCP arrays to form chromatin include ionic condition, linker DNA length, and epigenetic modifications, especially of the histone tails. We have investigated how the conformations of model tetra-NCP arrays are modulated by these factors using small angle x-ray scattering (SAXS). Here we present recent studies of the effects of ion (KCl and MgCl2), linker length, and histone modification (tail deletions) on NCP arrays. Our SAXS measurement makes it possible to learn about both the global compaction of NCP arrays and local inter-NCP spatial correlations within the same array.

  2. Striped tertiary storage arrays

    NASA Technical Reports Server (NTRS)

    Drapeau, Ann L.

    1993-01-01

    Data stripping is a technique for increasing the throughput and reducing the response time of large access to a storage system. In striped magnetic or optical disk arrays, a single file is striped or interleaved across several disks; in a striped tape system, files are interleaved across tape cartridges. Because a striped file can be accessed by several disk drives or tape recorders in parallel, the sustained bandwidth to the file is greater than in non-striped systems, where access to the file are restricted to a single device. It is argued that applying striping to tertiary storage systems will provide needed performance and reliability benefits. The performance benefits of striping for applications using large tertiary storage systems is discussed. It will introduce commonly available tape drives and libraries, and discuss their performance limitations, especially focusing on the long latency of tape accesses. This section will also describe an event-driven tertiary storage array simulator that is being used to understand the best ways of configuring these storage arrays. The reliability problems of magnetic tape devices are discussed, and plans for modeling the overall reliability of striped tertiary storage arrays to identify the amount of error correction required are described. Finally, work being done by other members of the Sequoia group to address latency of accesses, optimizing tertiary storage arrays that perform mostly writes, and compression is discussed.

  3. Adaptive multibeam antenna array

    NASA Astrophysics Data System (ADS)

    Novikov, V. I.

    1984-01-01

    An adaptive multibeam antenna array is considered which will enhance the advantages of a plain one. By providing simultaneous reception of signals from different directions and their sequential processing. The optimization of the array control for maximum interference suppression in the radiation pattern is emphasized. The optimum control is sought with respect to the signal-to-interference power ratio as a genaralized criterion. Sampled useful signals and transmission coefficients are found to be complex-conjugate quantities, assuming compatible formation of beams, so that synphasal equiamplitude addition of signals from all array element is attainable by unique settings of the weight factors. Calculations are simplified by letting the useful signal power in the 1-th beam be approximately equal to the k-th weight factor, before optimizing the weight vector for maximum signal-to-interference ratio. A narrowband interference described by power P and vector V of signal distribution over the array is considered as an example, to demonstrate the algorithm of synthesis. The algorithm, using the Butler matrix, was executed experimentally on a computer for a linear equidistant antenna array of 32 elements with compatible formation of beams.

  4. Solar array flight experiment

    NASA Technical Reports Server (NTRS)

    1986-01-01

    Emerging satellite designs require increasing amounts of electrical power to operate spacecraft instruments and to provide environments suitable for human habitation. In the past, electrical power was generated by covering rigid honeycomb panels with solar cells. This technology results in unacceptable weight and volume penalties when large amounts of power are required. To fill the need for large-area, lightweight solar arrays, a fabrication technique in which solar cells are attached to a copper printed circuit laminated to a plastic sheet was developed. The result is a flexible solar array with one-tenth the stowed volume and one-third the weight of comparably sized rigid arrays. An automated welding process developed to attack the cells to the printed circuit guarantees repeatable welds that are more tolerant of severe environments than conventional soldered connections. To demonstrate the flight readiness of this technology, the Solar Array Flight Experiment (SAFE) was developed and flown on the space shuttle Discovery in September 1984. The tests showed the modes and frequencies of the array to be very close to preflight predictions. Structural damping, however, was higher than anticipated. Electrical performance of the active solar panel was also tested. The flight performance and postflight data evaluation are described.

  5. Imaging antenna arrays

    NASA Technical Reports Server (NTRS)

    Rutledge, D. B.; Muha, M. S.

    1982-01-01

    Many millimeter and far-infrared imaging systems are limited in sensitivity and speed because they depend on a single scanned element. Because of recent advances in planar detectors such as Schottky diodes, superconducting tunnel junctions, and microbolometers, an attractive approach to this problem is a planar antenna array with integrated detectors. A planar line antenna array and optical system for imaging has been developed. The significant advances are a 'reverse-microscope' optical configuration and a modified bow-tie antenna design. In the 'reverse-microscope' configuration, a lens is attached to the bottom of the substrate containing the antennas. Imaging is done through the substrate. This configuration eliminates the troublesome effects of substrate surface waves. The substrate lens has only a single refracting surface, making possible a virtually aplanatic system, with little spherical aberration or coma. The array is characterized by an optical transfer function that is easily measured. An array with 19 dB crosstalk levels between adjacent antennas has been tested and it was found that the array captured 50 percent of the available power. This imaging system was diffraction limited.

  6. Plastid DNA sequencing and nuclear SNP genotyping help resolve the puzzle of central American Platanus

    PubMed Central

    De Castro, Olga; Di Maio, Antonietta; Lozada García, José Armando; Piacenti, Danilo; Vázquez-Torres, Mario; De Luca, Paolo

    2013-01-01

    Background and Aims Recent research on the history of Platanus reveals that hybridization phenomena occurred in the central American species. This study has two goals: to help resolve the evolutive puzzle of central American Platanus, and to test the potential of real-time polymerase chain reaction (PCR) for detecting ancient hybridization. Methods Sequencing of a uniparental plastid DNA marker [psbA-trnH(GUG) intergenic spacer] and qualitative and quantitative single nucleotide polymorphism (SNP) genotyping of biparental nuclear ribosomal DNA (nrDNA) markers [LEAFY intron 2 (LFY-i2) and internal transcribed spacer 2 (ITS2)] were used. Key Results Based on the SNP genotyping results, several Platanus accessions show the presence of hybridization/introgression, including some accessions of P. rzedowskii and of P. mexicana var. interior and one of P. mexicana var. mexicana from Oaxaca (= P. oaxacana). Based on haplotype analyses of the psbA-trnH spacer, five haplotypes were detected. The most common of these is present in taxa belonging to P. orientalis, P. racemosa sensu lato, some accessions of P. occidentalis sensu stricto (s.s.) from Texas, P. occidentalis var. palmeri, P. mexicana s.s. and P. rzedowskii. This is highly relevant to genetic relationships with the haplotypes present in P. occidentalis s.s. and P. mexicana var. interior. Conclusions Hybridization and introgression events between lineages ancestral to modern central and eastern North American Platanus species occurred. Plastid haplotypes and qualitative and quantitative SNP genotyping provide information critical for understanding the complex history of Mexican Platanus. Compared with the usual molecular techniques of sub-cloning, sequencing and genotyping, real-time PCR assay is a quick and sensitive technique for analysing complex evolutionary patterns. PMID:23798602

  7. SNP identification and SNAP marker development for a GmNARK gene controlling supernodulation in soybean.

    PubMed

    Kim, M Y; Van, K; Lestari, P; Moon, J-K; Lee, S-H

    2005-04-01

    Supernodulation in soybean (Glycine max L. Merr.) is an important source of nitrogen supply to subterranean ecological systems. Single nucleotide-amplified polymorphism (SNAP) markers for supernodulation should allow rapid screening of the trait in early growth stages, without the need for inoculation and phenotyping. The gene GmNARK (Glycine max nodule autoregulation receptor kinase), controlling autoregulation of nodulation, was found to have a single nucleotide polymorphism (SNP) between the wild-type cultivar Sinpaldalkong 2 and its supernodulating mutant, SS2-2. Transversion of A to T at the 959-bp position of the GmNARK sequence results in a change of lysine (AAG) to a stop codon (TAG), thus terminating its translation in SS2-2. Based on the identified SNP in GmNARK, five primer pairs specific to each allele were designed using the WebSnaper program to develop a SNAP marker for supernodulation. One A-specific primer pair produced a band present in only Sinpaldalkong 2, while two T-specific pairs showed a band in only SS2-2. Both complementary PCRs, using each allele-specific primer pair were performed to genotype supernodulation against F2 progeny of Sinpaldalkong 2 x SS2-2. Among 28 individuals with the normal phenotype, eight individuals having only the A-allele-specific band were homozygous and normal, while 20 individuals were found to be heterozygous at the SNP having both A and T bands. Twelve supernodulating individuals showed only the band specific to the T allele. This SNAP marker for supernodulation could easily be analyzed through simple PCR and agarose gel electrophoresis. Therefore, use of this SNAP marker might be faster, cheaper, and more reproducible than using other genotyping methods, such as a cleaved amplified polymorphic sequence marker, which demand of restriction enzymes.

  8. Transcriptome-based SNP discovery by GBS and the construction of a genetic map for olive.

    PubMed

    İpek, Ahmet; İpek, Meryem; Ercişli, Sezai; Tangu, Nesrin Aktepe

    2017-02-18

    Molecular markers located in the genic regions of plants are valuable tools for the identification of candidate genes of economically important traits and consequent use in marker-assisted selection (MAS). In the past, simple sequence repeat markers (SSRs) and single-nucleotide polymorphisms (SNPs) located in expressed sequence tags (ESTs) were developed by sequencing RNA derived from different plant tissues, which involves laborious RNA extraction, mRNA isolation, and cDNA synthesis. In order to develop SNP markers located in olive transcriptomes, we used the recently developed genotyping-by-sequencing (GBS) technique. An analysis was done for 125 olive DNA samples (123 DNA samples from a cross-pollinated F1 mapping population, and two samples from parents). From 45 to 66% of Illumina reads from GBS analysis were aligned to the olive transcriptome. A total of 22,033 transcriptome-based SNP markers were identified, and 3384 of these were mapped in the olive genome. The genetic linkage map constructed in this study consists of 1 cleaved amplified polymorphic sequence (CAPS), 19 SSR, and 3384 transcriptome-based SNP markers. The map covers 3340.8 cM of the olive genome in 23 linkage groups, with the length of the linkage groups ranging from 55.6 to 248.7 cM. Average map distance between flanking markers was 0.98 cM. This genetic linkage map is a saturated genetic map and will be a useful tool for the localization of quantitative trait loci (QTLs) and gene(s) of interest and for the identification of candidate genes for economically important traits.

  9. SNP variation in ADRB3 gene reflects the breed difference of sheep populations.

    PubMed

    Wu, Jianliang; Qiao, Liying; Liu, Jianhua; Yuan, Yanan; Liu, Wenzhong

    2012-08-01

    The β3-adrenergic receptor (ADRB3), a G-protein coupled receptor, plays a major role in energy metabolism and regulation of lipolysis and homeostasis. We detect the single nucleotide polymorphism (SNP) variation in full-length sequence of ovine ADRB3 gene in 12 domestic sheep populations within four types by polymerase chain reaction-single strand conformation polymorphism and sequencing to reveal the breed difference. Twenty-two SNPs, 12 of which in the exon 1 and ten in the intron, were detected, and 12 new exonic and four new intronic SNPs were found. Most SNPs presented in Shanxi Dam Line and least ones in Dorset. The average SNP number in both meat and dual purpose for meat and wool breeds was significantly higher than general and dual purpose breeds for wool and meat. Frequency of each SNP in studied breeds or types was different. The 18C Del and 1617T Ins majorly existed in dual purpose breeds for wool and meat. The 25A Del, 119C>G and 130C>T were mostly found in the meat and dual purpose for meat and wool breeds. The 1764C>A more frequently presented in meat than in other types. The majority of variations came from within the populations as suggested by analysis of molecular variance. Close relationship presented among the Chinese and western breeds, respectively. In conclusion, SNPs of ovine ADRB3 gene can reflect the breed difference and within- and between-population variations, and to a great extent, the breed relationship.

  10. Validation of a Cost-Efficient Multi-Purpose SNP Panel for Disease Based Research

    PubMed Central

    Hou, Liping; Phillips, Christopher; Azaro, Marco; Brzustowicz, Linda M.; Bartlett, Christopher W.

    2011-01-01

    Background Here we present convergent methodologies using theoretical calculations, empirical assessment on in-house and publicly available datasets as well as in silico simulations, that validate a panel of SNPs for a variety of necessary tasks in human genetics disease research before resources are committed to larger-scale genotyping studies on those samples. While large-scale well-funded human genetic studies routinely have up to a million SNP genotypes, samples in a human genetics laboratory that are not yet part of such studies may be productively utilized in pilot projects or as part of targeted follow-up work though such smaller scale applications require at least some genome-wide genotype data for quality control purposes such as DNA “barcoding” to detect swaps or contamination issues, determining familial relationships between samples and correcting biases due to population effects such as population stratification in pilot studies. Principal Findings Empirical performance in classification of relative types for any two given DNA samples (e.g., full siblings, parental, etc) indicated that for outbred populations the panel performs sufficiently to classify relationship in extended families and therefore also for smaller structures such as trios and for twin zygosity testing. Additionally, familial relationships do not significantly diminish the (mean match) probability of sharing SNP genotypes in pedigrees, further indicating the uniqueness of the “barcode.” Simulation using these SNPs for an African American case-control disease association study demonstrated that population stratification, even in complex admixed samples, can be adequately corrected under a range of disease models using the SNP panel. Conclusion The panel has been validated for use in a variety of human disease genetics research tasks including sample barcoding, relationship verification, population substructure detection and statistical correction. Given the ease of genotyping

  11. SNP genotyping of animal and human derived isolates of Mycobacterium avium subsp. paratuberculosis.

    PubMed

    Wynne, James W; Beller, Christie; Boyd, Victoria; Francis, Barry; Gwoźdź, Jacek; Carajias, Marios; Heine, Hans G; Wagner, Josef; Kirkwood, Carl D; Michalski, Wojtek P

    2014-08-27

    Mycobacterium avium subsp. paratuberculosis (MAP) is the aetiological agent of Johne's disease (JD), a chronic granulomatous enteritis that affects ruminants worldwide. While the ability of MAP to cause disease in animals is clear, the role of this bacterium in human inflammatory bowel diseases remains unresolved. Previous whole genome sequencing of MAP isolates derived from human and three animal hosts showed that human isolates were genetically similar and showed a close phylogenetic relationship to one bovine isolate. In contrast, other animal derived isolates were more genetically diverse. The present study aimed to investigate the frequency of this human strain across 52 wild-type MAP isolates, collected predominantly from Australia. A Luminex based SNP genotyping approach was utilised to genotype SNPs that had previously been shown to be specific to the human, bovine or ovine isolate types. Fourteen SNPs were initially evaluated across a reference panel of isolates with known genotypes. A subset of seven SNPs was chosen for analysis within the wild-type collection. Of the seven SNPs, three were found to be unique to paediatric human isolates. No wild-type isolates contain these SNP alleles. Interestingly, and in contrast to the paediatric isolates, three additional adult human isolates (derived from adult Crohn's disease patients) also did not contain these SNP alleles. Furthermore we identified two SNPs, which demonstrate extensive polymorphism within the animal-derived MAP isolates. One of which appears unique to ovine and a single camel isolate. From this study we suggest the existence of genetic heterogeneity between human derived MAP isolates, some of which are highly similar to those derived from bovine hosts, but others of which are more divergent.

  12. A Whole Methylome CpG-SNP Association Study of Psychosis in Blood and Brain Tissue.

    PubMed

    van den Oord, Edwin J C G; Clark, Shaunna L; Xie, Lin Ying; Shabalin, Andrey A; Dozmorov, Mikhail G; Kumar, Gaurav; Vladimirov, Vladimir I; Magnusson, Patrik K E; Aberg, Karolina A

    2016-07-01

    Mutated CpG sites (CpG-SNPs) are potential hotspots for human diseases because in addition to the sequence variation they may show individual differences in DNA methylation. We performed methylome-wide association studies (MWAS) to test whether methylation differences at those sites were associated with schizophrenia. We assayed all common CpG-SNPs with methyl-CpG binding domain protein-enriched genome sequencing (MBD-seq) using DNA extracted from 1408 blood samples and 66 postmortem brain samples (BA10) of schizophrenia cases and controls. Seven CpG-SNPs passed our FDR threshold of 0.1 in the blood MWAS. Of the CpG-SNPs methylated in brain, 94% were also methylated in blood. This significantly exceeded the 46.2% overlap expected by chance (P-value < 1.0×10(-8)) and justified replicating findings from blood in brain tissue. CpG-SNP rs3796293 in IL1RAP replicated (P-value = .003) with the same direction of effects. This site was further validated through targeted bisulfite pyrosequencing in 736 independent case-control blood samples (P-value < 9.5×10(-4)). Our top result in the brain MWAS (P-value = 8.8×10(-7)) was CpG-SNP rs16872141 located in the potential promoter of ENC1. Overall, our results suggested that CpG-SNP methylation may reflect effects of environmental insults and can provide biomarkers in blood that could potentially improve disease management.

  13. Solar Radiation-Associated Adaptive SNP Genetic Differentiation in Wild Emmer Wheat, Triticum dicoccoides

    PubMed Central

    Ren, Jing; Chen, Liang; Jin, Xiaoli; Zhang, Miaomiao; You, Frank M.; Wang, Jirui; Frenkel, Vladimir; Yin, Xuegui; Nevo, Eviatar; Sun, Dongfa; Luo, Ming-Cheng; Peng, Junhua

    2017-01-01

    Whole-genome scans with large number of genetic markers provide the opportunity to investigate local adaptation in natural populations and identify candidate genes under positive selection. In the present study, adaptation genetic differentiation associated with solar radiation was investigated using 695 polymorphic SNP markers in wild emmer wheat originated in a micro-site at Yehudiyya, Israel. The test involved two solar radiation niches: (1) sun, in-between trees; and (2) shade, under tree canopy, separated apart by a distance of 2–4 m. Analysis of molecular variance showed a small (0.53%) but significant portion of overall variation between the sun and shade micro-niches, indicating a non-ignorable genetic differentiation between sun and shade habitats. Fifty SNP markers showed a medium (0.05 ≤ FST ≤ 0.15) or high genetic differentiation (FST > 0.15). A total of 21 outlier loci under positive selection were identified by using four different FST-outlier testing algorithms. The markers and genome locations under positive selection are consistent with the known patterns of selection. These results suggested that genetic differentiation between sun and shade habitats is substantial, radiation-associated, and therefore ecologically determined. Hence, the results of this study reflected effects of natural selection through solar radiation on EST-related SNP genetic diversity, resulting presumably in different adaptive complexes at a micro-scale divergence. The present work highlights the evolutionary theory and application significance of solar radiation-driven natural selection in wheat improvement. PMID:28352272

  14. Estimation of effective population size using single-nucleotide polymorphism (SNP) data in Jeju horse.

    PubMed

    Do, Kyoung-Tag; Lee, Joon-Ho; Lee, Hak-Kyo; Kim, Jun; Park, Kyung-Do

    2014-01-01

    This study was conducted to estimate the effective population size using SNPs data of 240 Jeju horses that had raced at the Jeju racing park. Of the total 61,746 genotyped autosomal SNPs, 17,320 (28.1%) SNPs (missing genotype rate of >10%, minor allele frequency of <0.05 and Hardy-Weinberg equilibrium test P-value of <10(-6)) were excluded after quality control processes. SNPs on the X and Y chromosomes and genotyped individuals with missing genotype rate over 10% were also excluded, and finally, 44,426 (71.9%) SNPs were selected and used for the analysis. The measures of the LD, square of correlation coefficient (r(2)) between SNP pairs, were calculated for each allele and the effective population size was determined based on r(2) measures. The polymorphism information contents (PIC) and expected heterozygosity (HE) were 0.27 and 0.34, respectively. In LD, the most rapid decline was observed over the first 1 Mb. But r(2) decreased more slowly with increasing distance and was constant after 2 Mb of distance and the decline was almost linear with log-transformed distance. The average r(2) between adjacent SNP pairs ranged from 0.20 to 0.31 in each chromosome and whole average was 0.26, while the whole average r(2) between all SNP pairs was 0.02. We observed an initial pattern of decreasing Ne and estimated values were closer to 41 at 1 ~ 5 generations ago. The effective population size (41 heads) estimated in this study seems to be large considering Jeju horse's population size (about 2,000 heads), but it should be interpreted with caution because of the technical limitations of the methods and sample size.

  15. High-Throughput SNP Allele-Frequency Determination in Pooled DNA Samples by Kinetic PCR

    PubMed Central

    Germer, Søren; Holland, Michael J.; Higuchi, Russell

    2000-01-01

    We have developed an accurate, yet inexpensive and high-throughput, method for determining the allele frequency of biallelic polymorphisms in pools of DNA samples. The assay combines kinetic (real-time quantitative) PCR with allele-specific amplification and requires no post-PCR processing. The relative amounts of each allele in a sample are quantified. This is performed by dividing equal aliquots of the pooled DNA between two separate PCR reactions, each of which contains a primer pair specific to one or the other allelic SNP variant. For pools with equal amounts of the two alleles, the two amplifications should reach a detectable level of fluorescence at the same cycle number. For pools that contain unequal ratios of the two alleles, the difference in cycle number between the two amplification reactions can be used to calculate the relative allele amounts. We demonstrate the accuracy and reliability of the assay on samples with known predetermined SNP allele frequencies from 5% to 95%, including pools of both human and mouse DNAs using eight different SNPs altogether. The accuracy of measuring known allele frequencies is very high, with the strength of correlation between measured and known frequencies having an r2 = 0.997. The loss of sensitivity as a result of measurement error is typically minimal, compared with that due to sampling error alone, for population samples up to 1000. We believe that by providing a means for SNP genotyping up to thousands of samples simultaneously, inexpensively, and reproducibly, this method is a powerful strategy for detecting meaningful polymorphic differences in candidate gene association studies and genome-wide linkage disequilibrium scans. PMID:10673283

  16. CLUSTAG & WCLUSTAG: Hierarchical Clustering Algorithms for Efficient Tag-SNP Selection

    NASA Astrophysics Data System (ADS)

    Ao, Sio-Iong

    More than 6 million single nucleotide polymorphisms (SNPs) in the human genome have been genotyped by the HapMap project. Although only a pro portion of these SNPs are functional, all can be considered as candidate markers for indirect association studies to detect disease-related genetic variants. The complete screening of a gene or a chromosomal region is nevertheless an expensive undertak ing for association studies. A key strategy for improving the efficiency of association studies is to select a subset of informative SNPs, called tag SNPs, for analysis. In the chapter, hierarchical clustering algorithms have been proposed for efficient tag SNP selection.

  17. mrsFAST-Ultra: a compact, SNP-aware mapper for high performance sequencing applications.

    PubMed

    Hach, Faraz; Sarrafi, Iman; Hormozdiari, Farhad; Alkan, Can; Eichler, Evan E; Sahinalp, S Cenk

    2014-07-01

    High throughput sequencing (HTS) platforms generate unprecedented amounts of data that introduce challenges for processing and downstream analysis. While tools that report the 'best' mapping location of each read provide a fast way to process HTS data, they are not suitable for many types of downstream analysis such as structural variation detection, where it is important to report multiple mapping loci for each read. For this purpose we introduce mrsFAST-Ultra, a fast, cache oblivious, SNP-aware aligner that can handle the multi-mapping of HTS reads very efficiently. mrsFAST-Ultra improves mrsFAST, our first cache oblivious read aligner capable of handling multi-mapping reads, through new and compact index structures that reduce not only the overall memory usage but also the number of CPU operations per alignment. In fact the size of the index generated by mrsFAST-Ultra is 10 times smaller than that of mrsFAST. As importantly, mrsFAST-Ultra introduces new features such as being able to (i) obtain the best mapping loci for each read, and (ii) return all reads that have at most n mapping loci (within an error threshold), together with these loci, for any user specified n. Furthermore, mrsFAST-Ultra is SNP-aware, i.e. it can map reads to reference genome while discounting the mismatches that occur at common SNP locations provided by db-SNP; this significantly increases the number of reads that can be mapped to the reference genome. Notice that all of the above features are implemented within the index structure and are not simple post-processing steps and thus are performed highly efficiently. Finally, mrsFAST-Ultra utilizes multiple available cores and processors and can be tuned for various memory settings. Our results show that mrsFAST-Ultra is roughly five times faster than its predecessor mrsFAST. In comparison to newly enhanced popular tools such as Bowtie2, it is more sensitive (it can report 10 times or more mappings per read) and much faster (six times or

  18. Electrostatically clean solar array

    NASA Technical Reports Server (NTRS)

    Stern, Theodore Garry (Inventor); Krumweide, Duane Eric (Inventor)

    2004-01-01

    Provided are methods of manufacturing an electrostatically clean solar array panel and the products resulting from the practice of these methods. The preferred method uses an array of solar cells, each with a coverglass where the method includes machining apertures into a flat, electrically conductive sheet so that each aperture is aligned with and undersized with respect to its matched coverglass sheet and thereby fashion a front side shield with apertures (FSA). The undersized portion about each aperture of the bottom side of the FSA shield is bonded to the topside portions nearest the edges of each aperture's matched coverglass. Edge clips are attached to the front side aperture shield edges with the edge clips electrically and mechanically connecting the tops of the coverglasses to the solar panel substrate. The FSA shield, edge clips and substrate edges are bonded so as to produce a conductively grounded electrostatically clean solar array panel.

  19. Interagency Array Study Report

    NASA Technical Reports Server (NTRS)

    Wayland, J. W.; Ruskin, A. M.; Bathker, D. A.; Rydgig, R. C.; Brown, D. W.; Madsen, B. D.; Clauss, R. C.; Levy, G. S.; Kerridge, S. J.; Klein, M. J.

    1983-01-01

    The interagency array study that was convened in early 1982 to determine which of the world's large radio reception facilities might be feasibly and beneficially enlisted to help support the Voyager encounters at Uranus (1986) and Neptune (1989), and also to examine the future for such similar events and options as might appear is discussed. A similar but more specific study of the Parkes Radio Telescope at Uranus Encounter was just then being completed with a strong positive recommendation, and formed the foundation of the broader study. The approach, driving considerations, and outcome of the interagency array study are discussed. The recommendations of the study team concentrated upon the Voyager Encounters are: specifically to develop Parkes for the Uranus Encounter, while pursuing related Advanced Systems development work with the Owens Valley Radio Observatory, and to seek support for the Neptune Encounter from Parkes, the Very Large Array near Socorro, Mexico, and the Japanese institute of Space and Astronautical Sciences 64 meter station.

  20. Array signal processing

    SciTech Connect

    Haykin, S.; Justice, J.H.; Owsley, N.L.; Yen, J.L.; Kak, A.C.

    1985-01-01

    This is the first book to be devoted completely to array signal processing, a subject that has become increasingly important in recent years. The book consists of six chapters. Chapter 1, which is introductory, reviews some basic concepts in wave propagation. The remaining five chapters deal with the theory and applications of array signal processing in (a) exploration seismology, (b) passive sonar, (c) radar, (d) radio astronomy, and (e) tomographic imaging. The various chapters of the book are self-contained. The book is written by a team of five active researchers, who are specialists in the individual fields covered by the pertinent chapters.

  1. Photovoltaic array performance model.

    SciTech Connect

    Kratochvil, Jay A.; Boyson, William Earl; King, David L.

    2004-08-01

    This document summarizes the equations and applications associated with the photovoltaic array performance model developed at Sandia National Laboratories over the last twelve years. Electrical, thermal, and optical characteristics for photovoltaic modules are included in the model, and the model is designed to use hourly solar resource and meteorological data. The versatility and accuracy of the model has been validated for flat-plate modules (all technologies) and for concentrator modules, as well as for large arrays of modules. Applications include system design and sizing, 'translation' of field performance measurements to standard reporting conditions, system performance optimization, and real-time comparison of measured versus expected system performance.

  2. Solar array welding developement

    NASA Technical Reports Server (NTRS)

    Elms, R. V., Jr.

    1974-01-01

    The present work describes parallel gap welding as used for joining solar cells to the cell interconnect system. Sample preparation, weldable cell parameter evaluation, bond scheduling, bond strength evaluation, and bonding and thermal shock tests are described. A range of weld schedule parameters - voltage, time, and force - can be identified for various cell/interconnect designs that will provide adequate bond strengths and acceptably small electrical degradation. Automation of solar array welding operations to a significant degree has been achieved in Europe and will be receiving increased attention in the U.S. to reduce solar array fabrication costs.

  3. Wire Array Photovoltaics

    NASA Astrophysics Data System (ADS)

    Turner-Evans, Dan

    Over the past five years, the cost of solar panels has dropped drastically and, in concert, the number of installed modules has risen exponentially. However, solar electricity is still more than twice as expensive as electricity from a natural gas plant. Fortunately, wire array solar cells have emerged as a promising technology for further lowering the cost of solar. Si wire array solar cells are formed with a unique, low cost growth method and use 100 times less material than conventional Si cells. The wires can be embedded in a transparent, flexible polymer to create a free-standing array that can be rolled up for easy installation in a variety of form factors. Furthermore, by incorporating multijunctions into the wire morphology, higher efficiencies can be achieved while taking advantage of the unique defect relaxation pathways afforded by the 3D wire geometry. The work in this thesis shepherded Si wires from undoped arrays to flexible, functional large area devices and laid the groundwork for multijunction wire array cells. Fabrication techniques were developed to turn intrinsic Si wires into full p-n junctions and the wires were passivated with a-Si:H and a-SiNx:H. Single wire devices yielded open circuit voltages of 600 mV and efficiencies of 9%. The arrays were then embedded in a polymer and contacted with a transparent, flexible, Ni nanoparticle and Ag nanowire top contact. The contact connected >99% of the wires in parallel and yielded flexible, substrate free solar cells featuring hundreds of thousands of wires. Building on the success of the Si wire arrays, GaP was epitaxially grown on the material to create heterostructures for photoelectrochemistry. These cells were limited by low absorption in the GaP due to its indirect bandgap, and poor current collection due to a diffusion length of only 80 nm. However, GaAsP on SiGe offers a superior combination of materials, and wire architectures based on these semiconductors were investigated for multijunction

  4. ALMA Extended Array

    NASA Astrophysics Data System (ADS)

    Kameno, S.; Nakai, N.; Honma, M.

    2013-10-01

    We propose to append five 12-m antennas within 300-km from ALMA to realize high angular resolution of < 1 mas and sensitivity to detect Tb < 1000 K. This ALMA extended array offers a new parameter space of "Thermal universe with VLBI resolution". Proposed science case includes black-hole formation in sub-mm galaxies, mass accretion processes onto protostars, imaging stellar photospheres, distance measurements of stars, and so on. The array also functions as a part of sub-mm VLBI that targets black-hole imaging.

  5. Automated array assembly

    NASA Technical Reports Server (NTRS)

    Williams, B. F.

    1976-01-01

    Manufacturing techniques are evaluated using expenses based on experience and studying basic cost factors for each step to evaluate expenses from a first-principles point of view. A formal cost accounting procedure is developed which is used throughout the study for cost comparisons. The first test of this procedure is a comparison of its predicted costs for array module manufacturing with costs from a study which is based on experience factors. A manufacturing cost estimate for array modules of $10/W is based on present-day manufacturing techniques, expenses, and materials costs.

  6. Single-cell SNP analyses and interpretations based on RNA-Seq data for colon cancer research.

    PubMed

    Chen, Jiahuan; Zhou, Qian; Wang, Yangfan; Ning, Kang

    2016-09-28

    Single-cell sequencing is useful for illustrating the cellular heterogeneities inherent in many intricate biological systems, particularly in human cancer. However, owing to the difficulties in acquiring, amplifying and analyzing single-cell genetic material, obstacles remain for single-cell diversity assessments such as single nucleotide polymorphism (SNP) analyses, rendering biological interpretations of single-cell omics data elusive. We used RNA-Seq data from single-cell and bulk colon cancer samples to analyze the SNP profiles for both structural and functional comparisons. Colon cancer-related pathways with single-cell level SNP enrichment, including the TGF-β and p53 signaling pathways, were also investigated based on both their SNP enrichment patterns and gene expression. We also detected a certain number of fusion transcripts, which may promote tumorigenesis, at the single-cell level. Based on these results, single-cell analyses not only recapitulated the SNP analysis results from the bulk samples but also detected cell-to-cell and cell-to-bulk variations, thereby aiding in early diagnosis and in identifying the precise mechanisms underlying cancers at the single-cell level.

  7. A chromatin-associated and transcriptionally inactive p53-Mdm2 complex occurs in mdm2 SNP309 homozygous cells.

    PubMed

    Arva, Nicoleta C; Gopen, Tamara R; Talbott, Kathryn E; Campbell, Latoya E; Chicas, Agustin; White, David E; Bond, Gareth L; Levine, Arnold J; Bargonetti, Jill

    2005-07-22

    In cancer cells, the function of the tumor suppressor protein p53 is usually blocked. Impairment of the p53 pathway results in tumor cells with endogenous overexpression of Mdm2 via a naturally occurring single nucleotide polymorphism (SNP) in the mdm2 gene at position 309. Here we report that in mdm2 SNP309 cells, inactivation of p53 results in a chromatin-associated Mdm2-p53 complex without clearance of p53 by protein degradation. Nuclear accumulation of p53 protein in mdm2 SNP309 cells results after 6 h of camptothecin, etoposide, or mitomycin C treatment, with the p53 protein phosphorylated at Ser15. Chromatin immunoprecipitation demonstrated p53 and Mdm2 bound to p53 responsive elements. Interestingly, although the p53 protein was able to bind to DNA, quantitative PCR showed compromised transcription of endogenous target genes. Additionally, exogenously introduced p53 was incapable of activating transcription from p53 responsive elements in SNP309 cells, confirming the trans-acting nature of the inhibitor. Inhibition of Mdm2 by siRNA resulted in transcriptional activation of these p53 targets. Our data suggest that overproduction of Mdm2, resulting from a naturally occurring SNP, inhibits chromatin-bound p53 from activating the transcription of its target genes.

  8. IL-32 promoter SNP rs4786370 predisposes to modified lipoprotein profiles in patients with rheumatoid arthritis

    PubMed Central

    Damen, Michelle S. M. A.; Agca, Rabia; Holewijn, Suzanne; de Graaf, Jacqueline; Dos Santos, Jéssica C.; van Riel, Piet L.; Fransen, Jaap; Coenen, Marieke J. H.; Nurmohamed, Mike T.; Netea, Mihai G.; Dinarello, Charles A.; Joosten, Leo A. B.; Heinhuis, Bas; Popa, Calin D.

    2017-01-01

    Patients with rheumatoid arthritis (RA) are at higher risk of developing cardiovascular diseases (CVD). Interleukin (IL)-32 has previously been shown to be involved in the pathogenesis of RA and might be linked to the development of atherosclerosis. However, the exact mechanism linking IL-32 to CVD still needs to be elucidated. The influence of a functional genetic variant of IL-32 on lipid profiles and CVD risk was therefore studied in whole blood from individuals from the NBS cohort and RA patients from 2 independent cohorts. Lipid profiles were matched to the specific IL-32 genotypes. Allelic distribution was similar in all three groups. Interestingly, significantly higher levels of high density lipoprotein cholesterol (HDLc) were observed in individuals from the NBS cohort and RA patients from the Nijmegen cohort homozygous for the C allele (p = 0.0141 and p = 0.0314 respectively). In contrast, the CC-genotype was associated with elevated low density lipoprotein cholesterol (LDLc) and total cholesterol (TC) in individuals at higher risk for CVD (plaque positive) (p = 0.0396; p = 0.0363 respectively). Our study shows a functional effect of a promoter single-nucleotide polymorphism (SNP) in IL32 on lipid profiles in RA patients and individuals, suggesting a possible protective role of this SNP against CVD. PMID:28134327

  9. PrimerMapper: high throughput primer design and graphical assembly for PCR and SNP detection.

    PubMed

    O'Halloran, Damien M

    2016-02-08

    Primer design represents a widely employed gambit in diverse molecular applications including PCR, sequencing, and probe hybridization. Variations of PCR, including primer walking, allele-specific PCR, and nested PCR provide specialized validation and detection protocols for molecular analyses that often require screening large numbers of DNA fragments. In these cases, automated sequence retrieval and processing become important features, and furthermore, a graphic that provides the user with a visual guide to the distribution of designed primers across targets is most helpful in quickly ascertaining primer coverage. To this end, I describe here, PrimerMapper, which provides a comprehensive graphical user interface that designs robust primers from any number of inputted sequences while providing the user with both, graphical maps of primer distribution for each inputted sequence, and also a global assembled map of all inputted sequences with designed primers. PrimerMapper also enables the visualization of graphical maps within a browser and allows the user to draw new primers directly onto the webpage. Other features of PrimerMapper include allele-specific design features for SNP genotyping, a remote BLAST window to NCBI databases, and remote sequence retrieval from GenBank and dbSNP. PrimerMapper is hosted at GitHub and freely available without restriction.

  10. Informatics Enhanced SNP Microarray Analysis of 30 Miscarriage Samples Compared to Routine Cytogenetics

    PubMed Central

    Lathi, Ruth B.; Loring, Megan; Massie, Jamie A. M.; Demko, Zachary P.; Johnson, David; Sigurjonsson, Styrmir; Gemelos, George; Rabinowitz, Matthew

    2012-01-01

    Purpose The metaphase karyotype is often used as a diagnostic tool in the setting of early miscarriage; however this technique has several limitations. We evaluate a new technique for karyotyping that uses single nucleotide polymorphism microarrays (SNP). This technique was compared in a blinded, prospective fashion, to the traditional metaphase karyotype. Methods Patients undergoing dilation and curettage for first trimester miscarriage between February and August 2010 were enrolled. Samples of chorionic villi were equally divided and sent for microarray testing in parallel with routine cytogenetic testing. Results Thirty samples were analyzed, with only four discordant results. Discordant results occurred when the entire genome was duplicated or when a balanced rearrangement was present. Cytogenetic karyotyping took an average of 29 days while microarray-based karytoyping took an average of 12 days. Conclusions Molecular karyotyping of POC after missed abortion using SNP microarray analysis allows for the ability to detect maternal cell contamination and provides rapid results with good concordance to standard cytogenetic analysis. PMID:22403611

  11. Use of Sequenom sample ID Plus® SNP genotyping in identification of FFPE tumor samples.

    PubMed

    Miller, Jessica K; Buchner, Nicholas; Timms, Lee; Tam, Shirley; Luo, Xuemei; Brown, Andrew M K; Pasternack, Danielle; Bristow, Robert G; Fraser, Michael; Boutros, Paul C; McPherson, John D

    2014-01-01

    Short tandem repeat (STR) analysis, such as the AmpFlSTR® Identifiler® Plus kit, is a standard, PCR-based human genotyping method used in the field of forensics. Misidentification of cell line and tissue DNA can be costly if not detected early; therefore it is necessary to have quality control measures such as STR profiling in place. A major issue in large-scale research studies involving archival formalin-fixed paraffin embedded (FFPE) tissues is that varying levels of DNA degradation can result in failure to correctly identify samples using STR genotyping. PCR amplification of STRs of several hundred base pairs is not always possible when DNA is degraded. The Sample ID Plus® panel from Sequenom allows for human DNA identification and authentication using SNP genotyping. In comparison to lengthy STR amplicons, this multiplexing PCR assay requires amplification of only 76-139 base pairs, and utilizes 47 SNPs to discriminate between individual samples. In this study, we evaluated both STR and SNP genotyping methods of sample identification, with a focus on paired FFPE tumor/normal DNA samples intended for next-generation sequencing (NGS). The ability to successfully validate the identity of FFPE samples can enable cost savings by reducing rework.

  12. The use of SNP markers for linkage mapping in diploid and tetraploid peanuts.

    PubMed

    Bertioli, David J; Ozias-Akins, Peggy; Chu, Ye; Dantas, Karinne M; Santos, Silvio P; Gouvea, Ediene; Guimarães, Patricia M; Leal-Bertioli, Soraya C M; Knapp, Steven J; Moretzsohn, Marcio C

    2014-01-10

    Single nucleotide polymorphic markers (SNPs) are attractive for use in genetic mapping and marker-assisted breeding because they can be scored in parallel assays at favorable costs. However, scoring SNP markers in polyploid plants like the peanut is problematic because of interfering signal generated from the DNA bases that are homeologous to those being assayed. The present study used a previously constructed 1536 GoldenGate SNP assay developed using SNPs identified between two A. duranensis accessions. In this study, the performance of this assay was tested on two RIL mapping populations, one diploid (A. duranensis × A. stenosperma) and one tetraploid [A. hypogaea cv. Runner IAC 886 × synthetic tetraploid (A. ipaënsis × A. duranensis)(4×)]. The scoring was performed using the software GenomeStudio version 2011.1. For the diploid, polymorphic markers provided excellent genotyping scores with default software parameters. In the tetraploid, as expected, most of the polymorphic markers provided signal intensity plots that were distorted compared to diploid patterns and that were incorrectly scored using default parameters. However, these scorings were easily corrected using the GenomeStudio software. The degree of distortion was highly variable. Of the polymorphic markers, approximately 10% showed no distortion at all behaving as expected for single-dose markers, and another 30% showed low distortion and could be considered high-quality. The genotyped markers were incorporated into diploid and tetraploid genetic maps of Arachis and, in the latter case, were located almost entirely on A genome linkage groups.

  13. A high-density SNP genome-wide linkage scan in a large autism extended pedigree.

    PubMed

    Allen-Brady, K; Miller, J; Matsunami, N; Stevens, J; Block, H; Farley, M; Krasny, L; Pingree, C; Lainhart, J; Leppert, M; McMahon, W M; Coon, H

    2009-06-01

    We performed a high-density, single nucleotide polymorphism (SNP), genome-wide scan on a six-generation pedigree from Utah with seven affected males, diagnosed with autism spectrum disorder. Using a two-stage linkage design, we first performed a nonparametric analysis on the entire genome using a 10K SNP chip to identify potential regions of interest. To confirm potentially interesting regions, we eliminated SNPs in high linkage disequilibrium (LD) using a principal components analysis (PCA) method and repeated the linkage results. Three regions met genome-wide significance criteria after controlling for LD: 3q13.2-q13.31 (nonparametric linkage (NPL), 5.58), 3q26.31-q27.3 (NPL, 4.85) and 20q11.21-q13.12 (NPL, 5.56). Two regions met suggestive criteria for significance 7p14.1-p11.22 (NPL, 3.18) and 9p24.3 (NPL, 3.44). All five chromosomal regions are consistent with other published findings. Haplotype sharing results showed that five of the affected subjects shared more than a single chromosomal region of interest with other affected subjects. Although no common autism susceptibility genes were found for all seven autism cases, these results suggest that multiple genetic loci within these regions may contribute to the autism phenotype in this family, and further follow-up of these chromosomal regions is warranted.

  14. Measuring inbreeding and inbreeding depression on pig growth from pedigree or SNP-derived metrics.

    PubMed

    Silió, L; Rodríguez, M C; Fernández, A; Barragán, C; Benítez, R; Óvilo, C; Fernández, A I

    2013-10-01

    Multilocus homozygosity, measured as the proportion of the autosomal genome in homozygous genotypes or in runs of homozygosity, was compared with the respective pedigree inbreeding coefficients in 64 Iberian pigs genotyped using the Porcine SNP60 Beadchip. Pigs were sampled from a set of experimental animals with a large inbreeding variation born in a closed strain with a completely recorded multi-generation genealogy. Individual inbreeding coefficients calculated from pedigree were strongly correlated with the different SNP-derived metrics of homozygosity (r = 0.814-0.919). However, unequal correlations between molecular and pedigree inbreeding were observed at chromosomal level being mainly dependent on the number of SNPs and on the correlation between heterozygosities measured across different loci. A panel of 192 SNPs of intermediate frequencies was selected for genotyping 322 piglets to test inbreeding depression on postweaning growth performance (daily gain and weight at 90 days). The negative effects on these traits of homozygosities calculated from the genotypes of 168 quality-checked SNPs were similar to those of inbreeding coefficients. The results support that few hundreds of SNPs may be useful for measuring inbreeding and inbreeding depression, when the population structure or the mating system causes a large variance of inbreeding.

  15. Predicting Alzheimer's Disease Using Combined Imaging-Whole Genome SNP Data.

    PubMed

    Kong, Dehan; Giovanello, Kelly S; Wang, Yalin; Lin, Weili; Lee, Eunjee; Fan, Yong; Murali Doraiswamy, P; Zhu, Hongtu

    2015-01-01

    The growing public threat of Alzheimer's disease (AD) has raised the urgency to discover and validate prognostic biomarkers in order to predicting time to onset of AD. It is anticipated that both whole genome single nucleotide polymorphism (SNP) data and high dimensional whole brain imaging data offer predictive values to identify subjects at risk for progressing to AD. The aim of this paper is to test whether both whole genome SNP data and whole brain imaging data offer predictive values to identify subjects at risk for progressing to AD. In 343 subjects with mild cognitive impairment (MCI) enrolled in the Alzheimer's Disease Neuroimaging Initiative (ADNI-1), we extracted high dimensional MR imaging (volumetric data on 93 brain regions plus a surface fluid registration based hippocampal subregion and surface data), and whole genome data (504,095 SNPs from GWAS), as well as routine neurocognitive and clinical data at baseline. MCI patients were then followed over 48 months, with 150 participants progressing to AD. Combining information from whole brain MR imaging and whole genome data was substantially superior to the standard model for predicting time to onset of AD in a 48-month national study of subjects at risk. Our findings demonstrate the promise of combined imaging-whole genome prognostic markers in people with mild memory impairment.

  16. A Pipeline for Classifying Relationships Using Dense SNP/SNV Data and Putative Pedigree Information.

    PubMed

    Zeng, Zhen; Weeks, Daniel E; Chen, Wei; Mukhopadhyay, Nandita; Feingold, Eleanor

    2016-02-01

    When genome-wide association studies (GWAS) or sequencing studies are performed on family-based datasets, the genotype data can be used to check the structure of putative pedigrees. Even in datasets of putatively unrelated people, close relationships can often be detected using dense single-nucleotide polymorphism/variant (SNP/SNV) data. A number of methods for finding relationships using dense genetic data exist, but they all have certain limitations, including that they typically use average genetic sharing, which is only a subset of the available information. Here, we present a set of approaches for classifying relationships in GWAS datasets or large-scale sequencing datasets. We first propose an empirical method for detecting identity by descent segments in close relative pairs using un-phased dense SNP data and demonstrate how that information can assist in building a relationship classifier. We then develop a strategy to take advantage of putative pedigree information to enhance classification accuracy. Our methods are tested and illustrated with two datasets from two distinct populations. Finally, we propose classification pipelines for checking and identifying relationships in datasets containing a large number of small pedigrees.

  17. Performance of different SNP panels for parentage testing in two East Asian cattle breeds.

    PubMed

    Strucken, E M; Gudex, B; Ferdosi, M H; Lee, H K; Song, K D; Gibson, J P; Kelly, M; Piper, E K; Porto-Neto, L R; Lee, S H; Gondro, C

    2014-08-01

    The International Society for Animal Genetics (ISAG) proposed a panel of single nucleotide polymorphisms (SNPs) for parentage testing in cattle (a core panel of 100 SNPs and an additional list of 100 SNPs). However, markers specific to East Asian taurine cattle breeds were not included, and no information is available as to whether the ISAG panel performs adequately for these breeds. We tested ISAG's core (100 SNP) and full (200 SNP) panels on two East Asian taurine breeds: the Korean Hanwoo and the Japanese Wagyu, the latter from the Australian herd. Even though the power of exclusion was high at 0.99 for both ISAG panels, the core panel performed poorly with 3.01% false-positive assignments in the Hanwoo population and 3.57% in the Wagyu. The full ISAG panel identified all sire-offspring relations correctly in both populations with 0.02% of relations wrongly excluded in the Hanwoo population. Based on these results, we created and tested two population-specific marker panels: one for the Wagyu population, which showed no false-positive assignments with either 100 or 200 SNPs, and a second panel for the Hanwoo, which still had some false-positive assignments with 100 SNPs but no false positives using 200 SNPs. In conclusion, for parentage assignment in East Asian cattle breeds, only the full ISAG panel is adequate for parentage testing. If fewer markers should be used, it is advisable to use population-specific markers rather than the ISAG panel.

  18. Transcriptome sequencing to produce SNP-based genetic maps of onion.

    PubMed

    Duangjit, J; Bohanec, B; Chan, A P; Town, C D; Havey, M J

    2013-08-01

    We used the Roche-454 platform to sequence from normalized cDNA libraries from each of two inbred lines of onion (OH1 and 5225). From approximately 1.6 million reads from each inbred, 27,065 and 33,254 cDNA contigs were assembled from OH1 and 5225, respectively. In total, 3,364 well supported single nucleotide polymorphisms (SNPs) on 1,716 cDNA contigs were identified between these two inbreds. One SNP on each of 1,256 contigs was randomly selected for genotyping. OH1 and 5225 were crossed and 182 gynogenic haploids extracted from hybrid plants were used for SNP mapping. A total of 597 SNPs segregated in the OH1 × 5225 haploid family and a genetic map of ten linkage groups (LOD ≥8) was constructed. Three hundred and thirty-nine of the newly identified SNPs were also mapped using a previously developed segregating family from BYG15-23 × AC43, and 223 common SNPs were used to join the two maps. Because these new SNPs are in expressed regions of the genome and commonly occur among onion germplasms, they will be useful for genetic mapping, gene tagging, marker-aided selection, quality control of seed lots, and fingerprinting of cultivars.

  19. Human Y-chromosome SNP characterization by multiplex amplified product-length polymorphism analysis.

    PubMed

    Medina, Laura Smeldy Jurado; Muzzio, Marina; Schwab, Marisol; Costantino, María Leticia Bravi; Barreto, Guillermo; Bailliet, Graciela

    2014-09-01

    We designed an allele-specific amplification protocol to optimize Y-chromosome SNP typing, which is an unavoidable step for defining the phylogenetic status of paternal lineages. It allows the simultaneous highly specific definition of up to six mutations in a single reaction by amplification fragment length polymorphism (AFLP) without the need of specialized equipment, at a considerably lower cost than that based on single-base primer extension (SNaPshot™) technology or PCR-RFLP systems, requiring as little as 0.5 ng DNA and compatible with the small fragments characteristic of low-quality DNA. By designation of two primers recognizing the derived and ancestral state for each SNP, wh