Science.gov

Sample records for highly-multiplexed snp arrays

  1. Development and implementation of a highly-multiplexed SNP array for genetic mapping in maritime pine and comparative mapping with loblolly pine

    PubMed Central

    2011-01-01

    Background Single nucleotide polymorphisms (SNPs) are the most abundant source of genetic variation among individuals of a species. New genotyping technologies allow examining hundreds to thousands of SNPs in a single reaction for a wide range of applications such as genetic diversity analysis, linkage mapping, fine QTL mapping, association studies, marker-assisted or genome-wide selection. In this paper, we evaluated the potential of highly-multiplexed SNP genotyping for genetic mapping in maritime pine (Pinus pinaster Ait.), the main conifer used for commercial plantation in southwestern Europe. Results We designed a custom GoldenGate assay for 1,536 SNPs detected through the resequencing of gene fragments (707 in vitro SNPs/Indels) and from Sanger-derived Expressed Sequenced Tags assembled into a unigene set (829 in silico SNPs/Indels). Offspring from three-generation outbred (G2) and inbred (F2) pedigrees were genotyped. The success rate of the assay was 63.6% and 74.8% for in silico and in vitro SNPs, respectively. A genotyping error rate of 0.4% was further estimated from segregating data of SNPs belonging to the same gene. Overall, 394 SNPs were available for mapping. A total of 287 SNPs were integrated with previously mapped markers in the G2 parental maps, while 179 SNPs were localized on the map generated from the analysis of the F2 progeny. Based on 98 markers segregating in both pedigrees, we were able to generate a consensus map comprising 357 SNPs from 292 different loci. Finally, the analysis of sequence homology between mapped markers and their orthologs in a Pinus taeda linkage map, made it possible to align the 12 linkage groups of both species. Conclusions Our results show that the GoldenGate assay can be used successfully for high-throughput SNP genotyping in maritime pine, a conifer species that has a genome seven times the size of the human genome. This SNP-array will be extended thanks to recent sequencing effort using new generation

  2. Demonstration of Array-Based Analysis for Highly Multiplexed PCR Assays

    PubMed Central

    Spence, Janice M.; Rothberg, Paul G.; Wang, Nancy; Burack, W. Richard

    2011-01-01

    We demonstrate an approach that allowed rapid development of a robust assay for the detection of chromosomal translocations. The method includes highly multiplexed PCR with analysis of the PCR products performed by array detection. As proof of principle, we applied this approach to the detection of IGH@-BCL2 translocations in DNA prepared from FFPE specimens. This translocation and specimen type were chosen because of the known difficulties associated with PCR-based detection of this lesion and the additional loss of sensitivity associated with FFPE samples. The multiplex PCR with array detection method detected the IGH@-BCL2 translocation in 26 of 36 FFPE follicular lymphoma specimens, whereas the BIOMED-2 assay detected 13 of 36 specimens. This increased sensitivity was the result of both the increased density of BCL2 primers and identification of PCR products by low-density array. The method was specific and allowed mapping of the BCL2 break point in all cases. The method detected the IGH@-BCL2 lesion when the tumor DNA was diluted more than 1:20 in normal DNA but not when it was diluted more than 1:100. This sensitivity allows detection of diagnostically relevant levels of IGH@-BCL2 but will not detect the rare cells with IGH@-BCL2 translocations in healthy individuals. PMID:21497287

  3. SNP Array in Hematopoietic Neoplasms: A Review

    PubMed Central

    Song, Jinming; Shao, Haipeng

    2015-01-01

    Cytogenetic analysis is essential for the diagnosis and prognosis of hematopoietic neoplasms in current clinical practice. Many hematopoietic malignancies are characterized by structural chromosomal abnormalities such as specific translocations, inversions, deletions and/or numerical abnormalities that can be identified by karyotype analysis or fluorescence in situ hybridization (FISH) studies. Single nucleotide polymorphism (SNP) arrays offer high-resolution identification of copy number variants (CNVs) and acquired copy-neutral loss of heterozygosity (LOH)/uniparental disomy (UPD) that are usually not identifiable by conventional cytogenetic analysis and FISH studies. As a result, SNP arrays have been increasingly applied to hematopoietic neoplasms to search for clinically-significant genetic abnormalities. A large numbers of CNVs and UPDs have been identified in a variety of hematopoietic neoplasms. CNVs detected by SNP array in some hematopoietic neoplasms are of prognostic significance. A few specific genes in the affected regions have been implicated in the pathogenesis and may be the targets for specific therapeutic agents in the future. In this review, we summarize the current findings of application of SNP arrays in a variety of hematopoietic malignancies with an emphasis on the clinically significant genetic variants. PMID:27600067

  4. SNPConvert: SNP Array Standardization and Integration in Livestock Species

    PubMed Central

    Nicolazzi, Ezequiel Luis; Marras, Gabriele; Stella, Alessandra

    2016-01-01

    One of the main advantages of single nucleotide polymorphism (SNP) array technology is providing genotype calls for a specific number of SNP markers at a relatively low cost. Since its first application in animal genetics, the number of available SNP arrays for each species has been constantly increasing. However, conversely to that observed in whole genome sequence data analysis, SNP array data does not have a common set of file formats or coding conventions for allele calling. Therefore, the standardization and integration of SNP array data from multiple sources have become an obstacle, especially for users with basic or no programming skills. Here, we describe the difficulties related to handling SNP array data, focusing on file formats, SNP allele coding, and mapping. We also present SNPConvert suite, a multi-platform, open-source, and user-friendly set of tools to overcome these issues. This tool, which can be integrated with open-source and open-access tools already available, is a first step towards an integrated system to standardize and integrate any type of raw SNP array data. The tool is available at: https://github. com/nicolazzie/SNPConvert.git. PMID:27600083

  5. SNPConvert: SNP Array Standardization and Integration in Livestock Species.

    PubMed

    Nicolazzi, Ezequiel Luis; Marras, Gabriele; Stella, Alessandra

    2016-01-01

    One of the main advantages of single nucleotide polymorphism (SNP) array technology is providing genotype calls for a specific number of SNP markers at a relatively low cost. Since its first application in animal genetics, the number of available SNP arrays for each species has been constantly increasing. However, conversely to that observed in whole genome sequence data analysis, SNP array data does not have a common set of file formats or coding conventions for allele calling. Therefore, the standardization and integration of SNP array data from multiple sources have become an obstacle, especially for users with basic or no programming skills. Here, we describe the difficulties related to handling SNP array data, focusing on file formats, SNP allele coding, and mapping. We also present SNPConvert suite, a multi-platform, open-source, and user-friendly set of tools to overcome these issues. This tool, which can be integrated with open-source and open-access tools already available, is a first step towards an integrated system to standardize and integrate any type of raw SNP array data. The tool is available at: https://github. com/nicolazzie/SNPConvert.git. PMID:27600083

  6. SNP Arrays for Species Identification in Salmonids.

    PubMed

    Wenne, Roman; Drywa, Agata; Kent, Matthew; Sundsaasen, Kristil Kindem; Lien, Sigbjørn

    2016-01-01

    The use of SNP genotyping microarrays, developed in one species to analyze a closely related species for which genomic sequence information is scarce, enables the rapid development of a genomic resource (SNP information) without the need to develop new species-specific markers. Using large numbers of microarray SNPs offers the best chance to detect informative markers in nontarget species, markers that can very often be assayed using a lower throughput platform as is described in this paper. PMID:27460372

  7. A SNP genotyping array for hexaploid oat

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Recognizing a need in cultivated hexaploid oat (Avena sativa L.) for a reliable set of reference SNPs, we have developed a 6K BeadChip design containing 257 Infinium I and 5,486 Infinium II designs corresponding to 5,743 SNPs. Of those, 4,975 SNPs yielded successful assays after array manufacturing...

  8. Software solutions for the livestock genomics SNP array revolution.

    PubMed

    Nicolazzi, E L; Biffani, S; Biscarini, F; Orozco Ter Wengel, P; Caprera, A; Nazzicari, N; Stella, A

    2015-08-01

    Since the beginning of the genomic era, the number of available single nucleotide polymorphism (SNP) arrays has grown considerably. In the bovine species alone, 11 SNP chips not completely covered by intellectual property are currently available, and the number is growing. Genomic/genotype data are not standardized, and this hampers its exchange and integration. In addition, software used for the analyses of these data usually requires not standard (i.e. case specific) input files which, considering the large amount of data to be handled, require at least some programming skills in their production. In this work, we describe a software toolkit for SNP array data management, imputation, genome-wide association studies, population genetics and genomic selection. However, this toolkit does not solve the critical need for standardization of the genotypic data and software input files. It only highlights the chaotic situation each researcher has to face on a daily basis and gives some helpful advice on the currently available tools in order to navigate the SNP array data complexity. PMID:25907889

  9. Genetic heterogeneity in rhabdomyosarcoma revealed by SNP array analysis.

    PubMed

    Walther, Charles; Mayrhofer, Markus; Nilsson, Jenny; Hofvander, Jakob; Jonson, Tord; Mandahl, Nils; Øra, Ingrid; Gisselsson, David; Mertens, Fredrik

    2016-01-01

    Rhabdomyosarcoma (RMS) is the most common soft tissue sarcoma in children and adolescents. Alveolar (ARMS) and embryonal (ERMS) histologies predominate, but rare cases are classified as spindle cell/sclerosing (SRMS). For treatment stratification, RMS is further subclassified as fusion-positive (FP-RMS) or fusion-negative (FN-RMS), depending on whether a gene fusion involving PAX3 or PAX7 is present or not. We investigated 19 cases of pediatric RMS using high resolution single-nucleotide polymorphism (SNP) array. FP-ARMS displayed, on average, more structural rearrangements than ERMS; the single FN-ARMS had a genomic profile similar to ERMS. Apart from previously known amplification (e.g., MYCN, CDK4, and MIR17HG) and deletion (e.g., NF1, CDKN2A, and CDKN2B) targets, amplification of ERBB2 and homozygous loss of ASCC3 or ODZ3 were seen. Combining SNP array with cytogenetic data revealed that most cases were polyploid, with at least one case having started as a near-haploid tumor. Further bioinformatic analysis of the SNP array data disclosed genetic heterogeneity, in the form of subclonal chromosomal imbalances, in five tumors. The outcome was worse for patients with FP-ARMS than ERMS or FN-ARMS (6/8 vs. 1/9 dead of disease), and the only children with ERMS showing intratumor diversity or with MYOD1 mutation-positive SRMS also died of disease. High resolution SNP array can be useful in evaluating genomic imbalances in pediatric RMS. PMID:26482321

  10. The development and characterization of a 57K SNP array for rainbow trout

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In this paper we describe the development and characterization of the first high density single nucleotide polymorphism (SNP) genotyping array for rainbow trout. The SNP array is publically available from a commercial vendor. The SNP genotyping quality was high and validation rate was close to 90%...

  11. Development and validation of the Axiom(®) Apple480K SNP genotyping array.

    PubMed

    Bianco, Luca; Cestaro, Alessandro; Linsmith, Gareth; Muranty, Hélène; Denancé, Caroline; Théron, Anthony; Poncet, Charles; Micheletti, Diego; Kerschbamer, Emanuela; Di Pierro, Erica A; Larger, Simone; Pindo, Massimo; Van de Weg, Eric; Davassi, Alessandro; Laurens, François; Velasco, Riccardo; Durel, Charles-Eric; Troggio, Michela

    2016-04-01

    Cultivated apple (Malus × domestica Borkh.) is one of the most important fruit crops in temperate regions, and has great economic and cultural value. The apple genome is highly heterozygous and has undergone a recent duplication which, combined with a rapid linkage disequilibrium decay, makes it difficult to perform genome-wide association (GWA) studies. Single nucleotide polymorphism arrays offer highly multiplexed assays at a relatively low cost per data point and can be a valid tool for the identification of the markers associated with traits of interest. Here, we describe the development and validation of a 487K SNP Affymetrix Axiom(®) genotyping array for apple and discuss its potential applications. The array has been built from the high-depth resequencing of 63 different cultivars covering most of the genetic diversity in cultivated apple. The SNPs were chosen by applying a focal points approach to enrich genic regions, but also to reach a uniform coverage of non-genic regions. A total of 1324 apple accessions, including the 92 progenies of two mapping populations, have been genotyped with the Axiom(®) Apple480K to assess the effectiveness of the array. A large majority of SNPs (359 994 or 74%) fell in the stringent class of poly high resolution polymorphisms. We also devised a filtering procedure to identify a subset of 275K very robust markers that can be safely used for germplasm surveys in apple. The Axiom(®) Apple480K has now been commercially released both for public and proprietary use and will likely be a reference tool for GWA studies in apple. PMID:26919684

  12. Genome-wide SNP detection, validation, and development of an 8K SNP array for apple

    Technology Transfer Automated Retrieval System (TEKTRAN)

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide...

  13. High-resolution SNP arrays in mental retardation diagnostics: how much do we gain?

    PubMed Central

    Bernardini, Laura; Alesi, Viola; Loddo, Sara; Novelli, Antonio; Bottillo, Irene; Battaglia, Agatino; Digilio, Maria Cristina; Zampino, Giuseppe; Ertel, Adam; Fortina, Paolo; Surrey, Saul; Dallapiccola, Bruno

    2010-01-01

    We used Affymetrix 6.0 GeneChip SNP arrays to characterize copy number variations (CNVs) in a cohort of 70 patients previously characterized on lower-density oligonucleotide arrays affected by idiopathic mental retardation and dysmorphic features. The SNP array platform includes ∼900 000 SNP probes and 900 000 non-SNP oligonucleotide probes at an average distance of 0.7 Kb, which facilitates coverage of the whole genome, including coding and noncoding regions. The high density of probes is critical for detecting small CNVs, but it can lead to data interpretation problems. To reduce the number of false positives, parameters were set to consider only imbalances >75 Kb encompassing at least 80 probe sets. The higher resolution of the SNP array platform confirmed the increased ability to detect small CNVs, although more than 80% of these CNVs overlapped to copy number ‘neutral' polymorphism regions and 4.4% of them did not contain known genes. In our cohort of 70 patients, of the 51 previously evaluated as ‘normal' on the Agilent 44K array, the SNP array platform disclosed six additional CNV changes, including three in three patients, which may be pathogenic. This suggests that about 6% of individuals classified as ‘normal' using the lower-density oligonucleotide array could be found to be affected by a genomic disorder when evaluated with the higher-density microarray platforms. PMID:19809473

  14. Construction of a versatile SNP array for pyramiding useful genes of rice.

    PubMed

    Kurokawa, Yusuke; Noda, Tomonori; Yamagata, Yoshiyuki; Angeles-Shim, Rosalyn; Sunohara, Hidehiko; Uehara, Kanako; Furuta, Tomoyuki; Nagai, Keisuke; Jena, Kshirod Kumar; Yasui, Hideshi; Yoshimura, Atsushi; Ashikari, Motoyuki; Doi, Kazuyuki

    2016-01-01

    DNA marker-assisted selection (MAS) has become an indispensable component of breeding. Single nucleotide polymorphisms (SNP) are the most frequent polymorphism in the rice genome. However, SNP markers are not readily employed in MAS because of limitations in genotyping platforms. Here the authors report a Golden Gate SNP array that targets specific genes controlling yield-related traits and biotic stress resistance in rice. As a first step, the SNP genotypes were surveyed in 31 parental varieties using the Affymetrix Rice 44K SNP microarray. The haplotype information for 16 target genes was then converted to the Golden Gate platform with 143-plex markers. Haplotypes for the 14 useful allele are unique and can discriminate among all other varieties. The genotyping consistency between the Affymetrix microarray and the Golden Gate array was 92.8%, and the accuracy of the Golden Gate array was confirmed in 3 F2 segregating populations. The concept of the haplotype-based selection by using the constructed SNP array was proofed. PMID:26566831

  15. Development of the catfish 250K SNP array for genome-wide association studies

    PubMed Central

    2014-01-01

    Background Quantitative traits, such as disease resistance, are most often controlled by a set of genes involving a complex array of regulation. The dissection of genetic basis of quantitative traits requires large numbers of genetic markers with good genome coverage. The application of next-generation sequencing technologies has allowed discovery of over eight million SNPs in catfish, but the challenge remains as to how to efficiently and economically use such SNP resources for genetic analysis. Results In this work, we developed a catfish 250K SNP array using Affymetrix Axiom genotyping technology. The SNPs were obtained from multiple sources including gene-associated SNPs, anonymous genomic SNPs, and inter-specific SNPs. A set of 640K high-quality SNPs obtained following specific requirements of array design were submitted. A panel of 250,113 SNPs was finalized for inclusion on the array. The performance evaluated by genotyping individuals from wild populations and backcross families suggested the good utility of the catfish 250K SNP array. Conclusions This is the first high-density SNP array for catfish. The array should be a valuable resource for genome-wide association studies (GWAS), fine QTL mapping, high-density linkage map construction, haplotype analysis, and whole genome-based selection. PMID:24618043

  16. Single Nucleotide Polymorphism (SNP) Arrays and Unexpected Consanguinity: Considerations for Clinicians When Returning Results to Families

    PubMed Central

    Delgado, Fernanda; Tabor, Holly K.; Chow, Penny M.; Conta, Jessie H.; Feldman, Kenneth W.; Tsuchiya, Karen D.; Beck, Anita E.

    2014-01-01

    Purpose The broad use of SNP microarrays has increased identification of unexpected consanguinity. Therefore, guidelines to address reporting of consanguinity have been published for clinical laboratories. Because no such guidelines exist for clinicians, we describe a case and present recommendations for clinicians to disclose unexpected consanguinity to families. Methods In a boy with multiple endocrine abnormalities and structural birth defects, SNP array analysis revealed ~23% autosomal homozygosity suggestive of a 1st-degree parental relationship. We assembled an interdisciplinary healthcare team, planned the most appropriate way to discuss results of the SNP array with the adult mother including the possibility of multiple autosomal recessive disorders in her child, and finally met with her as a team. Results From these discussions, we developed four major considerations for clinicians returning results of unexpected consanguinity, all guided by the child’s best interests: 1) ethical and legal obligations for reporting possible abuse, 2) preservation of the clinical relationship, 3) attention to justice and psychosocial challenges, and 4) utilization of the SNP array results to guide further testing. Conclusion As SNP arrays become a common clinical diagnostic tool, clinicians can use this framework to return results of unexpected consanguinity to families in a supportive and productive manner. PMID:25232848

  17. Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation

    PubMed Central

    Wan, Lin; Sun, Kelian; Ding, Qi; Cui, Yuehua; Li, Ming; Wen, Yalu; Elston, Robert C.; Qian, Minping; Fu, Wenjiang J

    2009-01-01

    Affymetrix SNP arrays have been widely used for single-nucleotide polymorphism (SNP) genotype calling and DNA copy number variation inference. Although numerous methods have achieved high accuracy in these fields, most studies have paid little attention to the modeling of hybridization of probes to off-target allele sequences, which can affect the accuracy greatly. In this study, we address this issue and demonstrate that hybridization with mismatch nucleotides (HWMMN) occurs in all SNP probe-sets and has a critical effect on the estimation of allelic concentrations (ACs). We study sequence binding through binding free energy and then binding affinity, and develop a probe intensity composite representation (PICR) model. The PICR model allows the estimation of ACs at a given SNP through statistical regression. Furthermore, we demonstrate with cell-line data of known true copy numbers that the PICR model can achieve reasonable accuracy in copy number estimation at a single SNP locus, by using the ratio of the estimated AC of each sample to that of the reference sample, and can reveal subtle genotype structure of SNPs at abnormal loci. We also demonstrate with HapMap data that the PICR model yields accurate SNP genotype calls consistently across samples, laboratories and even across array platforms. PMID:19586935

  18. Vitis Phylogenomics: Hybridization Intensities from a SNP Array Outperform Genotype Calls

    PubMed Central

    Miller, Allison J.; Matasci, Naim; Schwaninger, Heidi; Aradhya, Mallikarjuna K.; Prins, Bernard; Zhong, Gan-Yuan; Simon, Charles; Buckler, Edward S.; Myles, Sean

    2013-01-01

    Understanding relationships among species is a fundamental goal of evolutionary biology. Single nucleotide polymorphisms (SNPs) identified through next generation sequencing and related technologies enable phylogeny reconstruction by providing unprecedented numbers of characters for analysis. One approach to SNP-based phylogeny reconstruction is to identify SNPs in a subset of individuals, and then to compile SNPs on an array that can be used to genotype additional samples at hundreds or thousands of sites simultaneously. Although powerful and efficient, this method is subject to ascertainment bias because applying variation discovered in a representative subset to a larger sample favors identification of SNPs with high minor allele frequencies and introduces bias against rare alleles. Here, we demonstrate that the use of hybridization intensity data, rather than genotype calls, reduces the effects of ascertainment bias. Whereas traditional SNP calls assess known variants based on diversity housed in the discovery panel, hybridization intensity data survey variation in the broader sample pool, regardless of whether those variants are present in the initial SNP discovery process. We apply SNP genotype and hybridization intensity data derived from the Vitis9kSNP array developed for grape to show the effects of ascertainment bias and to reconstruct evolutionary relationships among Vitis species. We demonstrate that phylogenies constructed using hybridization intensities suffer less from the distorting effects of ascertainment bias, and are thus more accurate than phylogenies based on genotype calls. Moreover, we reconstruct the phylogeny of the genus Vitis using hybridization data, show that North American subgenus Vitis species are monophyletic, and resolve several previously poorly known relationships among North American species. This study builds on earlier work that applied the Vitis9kSNP array to evolutionary questions within Vitis vinifera and has general

  19. Measuring diversity in Gossypium hirsutum using the CottonSNP63K Array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A CottonSNP63K array and accompanying cluster file has been developed and includes 45,104 intra-specific SNPs and 17,954 inter-specific SNPs for automated genotyping of cotton (Gossypium spp.) samples. Development of the cluster file included genotyping of 1,156 samples, a subset of which were iden...

  20. Fine mapping of copy number variations on two cattle genome assemblies using high density SNP array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Btau_4.0 and UMD3.1 are two distinct cattle reference genome assemblies. In our previous study using the low density BovineSNP50 array, we reported a copy number variation (CNV) analysis on Btau_4.0 with 521 animals of 21 cattle breeds, yielding 682 CNV regions with a total length of 139.8 megabases...

  1. Optimal design of low-density SNP arrays for genomic prediction: algorithm and applications

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for their optimal design. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optim...

  2. Development and Validation of a High-Density SNP Genotyping Array for African Oil Palm.

    PubMed

    Kwong, Qi Bin; Teh, Chee Keng; Ong, Ai Ling; Heng, Huey Ying; Lee, Heng Leng; Mohamed, Mohaimi; Low, Joel Zi-Bin; Apparow, Sukganah; Chew, Fook Tim; Mayes, Sean; Kulaveerasingam, Harikrishna; Tammi, Martti; Appleton, David Ross

    2016-08-01

    High-density single nucleotide polymorphism (SNP) genotyping arrays are powerful tools that can measure the level of genetic polymorphism within a population. To develop a whole-genome SNP array for oil palms, SNP discovery was performed using deep resequencing of eight libraries derived from 132 Elaeis guineensis and Elaeis oleifera palms belonging to 59 origins, resulting in the discovery of >3 million putative SNPs. After SNP filtering, the Illumina OP200K custom array was built with 170 860 successful probes. Phenetic clustering analysis revealed that the array could distinguish between palms of different origins in a way consistent with pedigree records. Genome-wide linkage disequilibrium declined more slowly for the commercial populations (ranging from 120 kb at r(2) = 0.43 to 146 kb at r(2) = 0.50) when compared with the semi-wild populations (19.5 kb at r(2) = 0.22). Genetic fixation mapping comparing the semi-wild and commercial population identified 321 selective sweeps. A genome-wide association study (GWAS) detected a significant peak on chromosome 2 associated with the polygenic component of the shell thickness trait (based on the trait shell-to-fruit; S/F %) in tenera palms. Testing of a genomic selection model on the same trait resulted in good prediction accuracy (r = 0.65) with 42% of the S/F % variation explained. The first high-density SNP genotyping array for oil palm has been developed and shown to be robust for use in genetic studies and with potential for developing early trait prediction to shorten the oil palm breeding cycle. PMID:27112659

  3. Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications.

    PubMed

    Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R; Taylor, Jeremy F; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart

    2016-01-01

    Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. The

  4. A Customized Pigmentation SNP Array Identifies a Novel SNP Associated with Melanoma Predisposition in the SLC45A2 Gene

    PubMed Central

    Alonso, Santos; Boyano, M. Dolores; Peña-Chilet, Maria; Pita, Guillermo; Aviles, Jose A.; Mayor, Matias; Gomez-Fernandez, Cristina; Casado, Beatriz; Martin-Gonzalez, Manuel; Izagirre, Neskuts; De la Rua, Concepcion; Asumendi, Aintzane; Perez-Yarza, Gorka; Arroyo-Berdugo, Yoana; Boldo, Enrique; Lozoya, Rafael; Torrijos-Aguilar, Arantxa; Pitarch, Ana; Pitarch, Gerard; Sanchez-Motilla, Jose M.; Valcuende-Cavero, Francisca; Tomas-Cabedo, Gloria; Perez-Pastor, Gemma; Diaz-Perez, Jose L.; Gardeazabal, Jesus; de Lizarduy, Iñigo Martinez; Sanchez-Diez, Ana; Valdes, Carlos; Pizarro, Angel; Casado, Mariano; Carretero, Gregorio; Botella-Estrada, Rafael; Nagore, Eduardo; Lazaro, Pablo; Lluch, Ana; Benitez, Javier; Martinez-Cadenas, Conrado; Ribas, Gloria

    2011-01-01

    As the incidence of Malignant Melanoma (MM) reflects an interaction between skin colour and UV exposure, variations in genes implicated in pigmentation and tanning response to UV may be associated with susceptibility to MM. In this study, 363 SNPs in 65 gene regions belonging to the pigmentation pathway have been successfully genotyped using a SNP array. Five hundred and ninety MM cases and 507 controls were analyzed in a discovery phase I. Ten candidate SNPs based on a p-value threshold of 0.01 were identified. Two of them, rs35414 (SLC45A2) and rs2069398 (SILV/CKD2), were statistically significant after conservative Bonferroni correction. The best six SNPs were further tested in an independent Spanish series (624 MM cases and 789 controls). A novel SNP located on the SLC45A2 gene (rs35414) was found to be significantly associated with melanoma in both phase I and phase II (P<0.0001). None of the other five SNPs were replicated in this second phase of the study. However, three SNPs in TYR, SILV/CDK2 and ADAMTS20 genes (rs17793678, rs2069398 and rs1510521 respectively) had an overall p-value<0.05 when considering the whole DNA collection (1214 MM cases and 1296 controls). Both the SLC45A2 and the SILV/CDK2 variants behave as protective alleles, while the TYR and ADAMTS20 variants seem to function as risk alleles. Cumulative effects were detected when these four variants were considered together. Furthermore, individuals carrying two or more mutations in MC1R, a well-known low penetrance melanoma-predisposing gene, had a decreased MM risk if concurrently bearing the SLC45A2 protective variant. To our knowledge, this is the largest study on Spanish sporadic MM cases to date. PMID:21559390

  5. Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao

    PubMed Central

    Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos

    2015-01-01

    Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity. PMID:26070980

  6. Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao.

    PubMed

    Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos

    2015-08-01

    Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity. PMID:26070980

  7. An integrative segmentation method for detecting germline copy number variations in SNP arrays.

    PubMed

    Shi, Jianxin; Li, Peng

    2012-05-01

    Germline copy number variations (CNVs) are a major source of genetic variation in humans. In large-scale studies of complex diseases, CNVs are usually detected from data generated by single nucleotide polymorphism (SNP) genotyping arrays. In this paper, we develop an integrative segmentation method, SegCNV, for detecting CNVs integrating both log R ratio (LRR) and B allele frequency (BAF). Based on simulation studies, SegCNV had modestly better power to detect deletions and substantially better power to detect duplications compared with circular binary segmentation (CBS) that relies purely on LRRs; and it had better power to detect deletions and a comparable performance to detect duplications compared with PennCNV and QuantiSNP. In two Hapmap subjects with deep sequence data available as a gold standard, SegCNV detected more true short deletions than PennCNV and QuantiSNP. For 21 short duplications validated experimentally in the AGRE dataset, SegCNV, QuantiSNP, and PennCNV detected all of them while CBS detected only three. SegCNV is much faster than the HMM-based (where HMM is hidden Markov model) methods, taking only several seconds to analyze genome-wide data for one subject. PMID:22539397

  8. Prenatal SNP array testing in 1000 fetuses with ultrasound anomalies: causative, unexpected and susceptibility CNVs.

    PubMed

    Srebniak, Malgorzata I; Diderich, Karin Em; Joosten, Marieke; Govaerts, Lutgarde Cp; Knijnenburg, Jeroen; de Vries, Femke At; Boter, Marjan; Lont, Debora; Knapen, Maarten Fcm; de Wit, Merel C; Go, Attie Tji; Galjaard, Robert-Jan H; Van Opstal, Diane

    2016-05-01

    To evaluate the diagnostic value of single-nucleotide polymorphism (SNP) array testing in 1033 fetuses with ultrasound anomalies we investigated the prevalence and genetic nature of pathogenic findings. We reclassified all pathogenic findings into three categories: causative findings; unexpected diagnoses (UD); and susceptibility loci (SL) for neurodevelopmental disorders. After exclusion of trisomy 13, 18, 21, sex-chromosomal aneuploidy and triploidies, in 76/1033 (7.4%) fetuses a pathogenic chromosome abnormality was detected by genomic SNP array: in 19/1033 cases (1.8%) a microscopically detectable abnormality was found and in 57/1033 (5.5%) fetuses a pathogenic submicroscopic chromosome abnormality was detected. 58% (n=44) of all these pathogenic chromosome abnormalities involved a causative finding, 35% (n=27) a SL for neurodevelopmental disorder, and 6% (n=5) a UD of an early-onset untreatable disease. In 0.3% of parental samples an incidental pathogenic finding was encountered. Our results confirm that a genomic array should be the preferred first-tier technique in fetuses with ultrasound anomalies. All UDs involved early-onset diseases, which is beneficial for the patients to know. It also seems that UDs occur at a comparable frequency among microscopic and submicroscopic pathogenic findings. SL were more often detected than in pregnancies without ultrasound anomalies. PMID:26328504

  9. Allele-Specific Amplification in Cancer Revealed by SNP Array Analysis

    PubMed Central

    2005-01-01

    Amplification, deletion, and loss of heterozygosity of genomic DNA are hallmarks of cancer. In recent years a variety of studies have emerged measuring total chromosomal copy number at increasingly high resolution. Similarly, loss-of-heterozygosity events have been finely mapped using high-throughput genotyping technologies. We have developed a probe-level allele-specific quantitation procedure that extracts both copy number and allelotype information from single nucleotide polymorphism (SNP) array data to arrive at allele-specific copy number across the genome. Our approach applies an expectation-maximization algorithm to a model derived from a novel classification of SNP array probes. This method is the first to our knowledge that is able to (a) determine the generalized genotype of aberrant samples at each SNP site (e.g., CCCCT at an amplified site), and (b) infer the copy number of each parental chromosome across the genome. With this method, we are able to determine not just where amplifications and deletions occur, but also the haplotype of the region being amplified or deleted. The merit of our model and general approach is demonstrated by very precise genotyping of normal samples, and our allele-specific copy number inferences are validated using PCR experiments. Applying our method to a collection of lung cancer samples, we are able to conclude that amplification is essentially monoallelic, as would be expected under the mechanisms currently believed responsible for gene amplification. This suggests that a specific parental chromosome may be targeted for amplification, whether because of germ line or somatic variation. An R software package containing the methods described in this paper is freely available at http://genome.dfci.harvard.edu/~tlaframb/PLASQ. PMID:16322765

  10. SNP array mapping of chromosome 20p deletions: genotypes, phenotypes, and copy number variation.

    PubMed

    Kamath, Binita M; Thiel, Brian D; Gai, Xiaowu; Conlin, Laura K; Munoz, Pedro S; Glessner, Joseph; Clark, Dinah; Warthen, Daniel M; Shaikh, Tamim H; Mihci, Ercan; Piccoli, David A; Grant, Struan F A; Hakonarson, Hakon; Krantz, Ian D; Spinner, Nancy B

    2009-03-01

    The use of array technology to define chromosome deletions and duplications is bringing us closer to establishing a genotype/phenotype map of genomic copy number alterations. We studied 21 patients and five relatives with deletions of the short arm of chromosome 20 using the Illumina HumanHap550 SNP array to: 1) more accurately determine the deletion sizes; 2) identify and compare breakpoints; 3) establish genotype/phenotype correlations; and 4) investigate the use of the HumanHap550 platform for analysis of chromosome deletions. Deletions ranged from 95 kb to 14.62 Mb, and all of the breakpoints were unique. Eleven patients had deletions between 95 kb and 4 Mb and these individuals had normal development, with no anomalies outside of those associated with Alagille syndrome (AGS). The proximal and distal boundaries of these 11 deletions constitute a 5.4-Mb region, and we propose that haploinsufficiency for only 1 of the 12 genes in this region causes phenotypic abnormalities. This defines the JAG1-associated critical region, in which deletions do not confer findings other than those associated with AGS. The other 10 patients had deletions between 3.28 Mb and 14.62 Mb, which extended outside the critical region, and, notably, all of these patients had developmental delay. This group had other findings such as autism, scoliosis, and bifid uvula. We identified 47 additional polymorphic genome-wide copy number variants (>20 SNPs), with 0 to 5 variants called per patient. Deletions of the short arm of chromosome 20 are associated with relatively mild and limited clinical anomalies. The use of SNP arrays provides accurate high-resolution definition of genomic abnormalities. PMID:19058200

  11. Three clinical experiences with SNP array results consistent with parental incest: a narrative with lessons learned.

    PubMed

    Helm, Benjamin M; Langley, Katherine; Spangler, Brooke; Vergano, Samantha

    2014-08-01

    Single nucleotide polymorphism microarrays have the ability to reveal parental consanguinity which may or may not be known to healthcare providers. Consanguinity can have significant implications for the health of patients and for individual and family psychosocial well-being. These results often present ethical and legal dilemmas that can have important ramifications. Unexpected consanguinity can be confounding to healthcare professionals who may be unprepared to handle these results or to communicate them to families or other appropriate representatives. There are few published accounts of experiences with consanguinity and SNP arrays. In this paper we discuss three cases where molecular evidence of parental incest was identified by SNP microarray. We hope to further highlight consanguinity as a potential incidental finding, how the cases were handled by the clinical team, and what resources were found to be most helpful. This paper aims to contribute further to professional discourse on incidental findings with genomic technology and how they were addressed clinically. These experiences may provide some guidance on how others can prepare for these findings and help improve practice. As genetic and genomic testing is utilized more by non-genetics providers, we also hope to inform about the importance of engaging with geneticists and genetic counselors when addressing these findings. PMID:24222483

  12. Use of SNP-arrays for ChIP assays: computational aspects.

    PubMed

    Muro, Enrique M; McCann, Jennifer A; Rudnicki, Michael A; Andrade-Navarro, Miguel A

    2009-01-01

    The simultaneous genotyping of thousands of single nucleotide polymorphisms (SNPs) in a genome using SNP-Arrays is a very important tool that is revolutionizing genetics and molecular biology. We expanded the utility of this technique by using it following chromatin immunoprecipitation (ChIP) to assess the multiple genomic locations protected by a protein complex recognized by an antibody. The power of this technique is illustrated through an analysis of the changes in histone H4 acetylation, a marker of open chromatin and transcriptionally active genomic regions, which occur during differentiation of human myoblasts into myotubes. The findings have been validated by the observation of a significant correlation between the detected histone modifications and the expression of the nearby genes, as measured by DNA expression microarrays. This chapter focuses on the computational analysis of the data. PMID:19588091

  13. Genomic relationships computed from either next-generation sequence or array SNP data.

    PubMed

    Pérez-Enciso, M

    2014-04-01

    The use of sequence data in genomic prediction models is a topic of high interest, given the decreasing prices of current 'next'-generation sequencing technologies (NGS) and the theoretical possibility of directly interrogating the genomes for all causal mutations. Here, we compare by simulation how well genetic relationships (G) could be estimated using either NGS or ascertained SNP arrays. DNA sequences were simulated using the coalescence according to two scenarios: a 'cattle' scenario that consisted of a bottleneck followed by a split in two breeds without migration, and a 'pig' model where Chinese introgression into international pig breeds was simulated. We found that introgression results in a large amount of variability across the genome and between individuals, both in differentiation and in diversity. In general, NGS data allowed the most accurate estimates of G, provided enough sequencing depth was available, because shallow NGS (4×) may result in highly distorted estimates of G elements, especially if not standardized by allele frequency. However, high-density genotyping can also result in accurate estimates of G. Given that genotyping is much less noisy than NGS data, it is suggested that specific high-density arrays (~3M SNPs) that minimize the effects of ascertainment could be developed in the population of interest by sequencing the most influential animals and rely on those arrays for implementing genomic selection. PMID:24397314

  14. Mapping of Genetic Abnormalities of Primary Tumours from Metastatic CRC by High-Resolution SNP Arrays

    PubMed Central

    Sayagués, José María; Fontanillo, Celia; Abad, María del Mar; González-González, María; Sarasquete, María Eugenia; del Carmen Chillon, Maria; Garcia, Eva; Bengoechea, Oscar; Fonseca, Emilio; Gonzalez-Diaz, Marcos; De Las Rivas, Javier

    2010-01-01

    Background For years, the genetics of metastatic colorectal cancer (CRC) have been studied using a variety of techniques. However, most of the approaches employed so far have a relatively limited resolution which hampers detailed characterization of the common recurrent chromosomal breakpoints as well as the identification of small regions carrying genetic changes and the genes involved in them. Methodology/Principal Findings Here we applied 500K SNP arrays to map the most common chromosomal lesions present at diagnosis in a series of 23 primary tumours from sporadic CRC patients who had developed liver metastasis. Overall our results confirm that the genetic profile of metastatic CRC is defined by imbalanced gains of chromosomes 7, 8q, 11q, 13q, 20q and X together with losses of the 1p, 8p, 17p and 18q chromosome regions. In addition, SNP-array studies allowed the identification of small (<1.3 Mb) and extensive/large (>1.5 Mb) altered DNA sequences, many of which contain cancer genes known to be involved in CRC and the metastatic process. Detailed characterization of the breakpoint regions for the altered chromosomes showed four recurrent breakpoints at chromosomes 1p12, 8p12, 17p11.2 and 20p12.1; interestingly, the most frequently observed recurrent chromosomal breakpoint was localized at 17p11.2 and systematically targeted the FAM27L gene, whose role in CRC deserves further investigations. Conclusions/Significance In summary, in the present study we provide a detailed map of the genetic abnormalities of primary tumours from metastatic CRC patients, which confirm and extend on previous observations as regards the identification of genes potentially involved in development of CRC and the metastatic process. PMID:21060790

  15. A large maize (Zea Mays L.) SNP genotyping array: development and germplasm genotyping, and genetic mapping to compare with the B73 reference genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection for accelerated breeding. We report the establishment of a large SNP array for maize and i...

  16. Development and evaluation of a genome-wide 6K SNP array for diploid sweet cherry and tetraploid sour cherry

    Technology Transfer Automated Retrieval System (TEKTRAN)

    High-throughput genome scans are important tools for genetic studies and breeding applications. Here, a 6K SNP array for use with the Illumina Infinium® system was developed for diploid sweet cherry (Prunus avium) and allotetraploid sour cherry (P. cerasus). This effort was led by RosBREED, a commun...

  17. Development and application of a novel genome-wide SNP array reveals domestication history in soybean

    PubMed Central

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-01-01

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884

  18. Development and application of a novel genome-wide SNP array reveals domestication history in soybean.

    PubMed

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-01-01

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884

  19. Pharmacogenomics: accessing important alleles by imputation from commercial genome-wide SNP arrays.

    PubMed

    Liboredo, R; Pena, S D J

    2014-01-01

    Personalized medicine is becoming a medical reality, as important genotype-phenotype relationships are being unraveled. The availability of pharmacogenomic data is a key element of individualized care. In this study, we explored genotype imputation as a means to infer important pharmacogenomic alleles from a regular commercially available genome-wide SNP array. Using these arrays as a starting point can reduce testing costs, increasing access to these pharmacogenomic data and still retain a larger amount of genome-wide information. IMPUTE2 and MaCH-Admix were used to perform genotype imputation with a dense reference panel from 1000 Genomes data. We were able to correctly infer genotypes for the warfarin-related loci VKORC1 and CYP2C9 alleles 2, 3, 5, and 11 and also clopidogrel-related CYP2C19 alleles 2 and 17 for a small sample of Brazilian individuals, as well as for HapMap samples. The success of an imputation approach in admixed samples using publicly available reference panels can encourage further imputation initiatives in those populations. PMID:25117329

  20. Software comparison for evaluating genomic copy number variation for Affymetrix 6.0 SNP array platform

    PubMed Central

    2011-01-01

    Background Copy number data are routinely being extracted from genome-wide association study chips using a variety of software. We empirically evaluated and compared four freely-available software packages designed for Affymetrix SNP chips to estimate copy number: Affymetrix Power Tools (APT), Aroma.Affymetrix, PennCNV and CRLMM. Our evaluation used 1,418 GENOA samples that were genotyped on the Affymetrix Genome-Wide Human SNP Array 6.0. We compared bias and variance in the locus-level copy number data, the concordance amongst regions of copy number gains/deletions and the false-positive rate amongst deleted segments. Results APT had median locus-level copy numbers closest to a value of two, whereas PennCNV and Aroma.Affymetrix had the smallest variability associated with the median copy number. Of those evaluated, only PennCNV provides copy number specific quality-control metrics and identified 136 poor CNV samples. Regions of copy number variation (CNV) were detected using the hidden Markov models provided within PennCNV and CRLMM/VanillaIce. PennCNV detected more CNVs than CRLMM/VanillaIce; the median number of CNVs detected per sample was 39 and 30, respectively. PennCNV detected most of the regions that CRLMM/VanillaIce did as well as additional CNV regions. The median concordance between PennCNV and CRLMM/VanillaIce was 47.9% for duplications and 51.5% for deletions. The estimated false-positive rate associated with deletions was similar for PennCNV and CRLMM/VanillaIce. Conclusions If the objective is to perform statistical tests on the locus-level copy number data, our empirical results suggest that PennCNV or Aroma.Affymetrix is optimal. If the objective is to perform statistical tests on the summarized segmented data then PennCNV would be preferred over CRLMM/VanillaIce. Specifically, PennCNV allows the analyst to estimate locus-level copy number, perform segmentation and evaluate CNV-specific quality-control metrics within a single software package

  1. A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer

    PubMed Central

    Li, Ming; Wen, Yalu; Fu, Wenjiang

    2014-01-01

    Cumulative evidence has shown that structural variations, due to insertions, deletions, and inversions of DNA, may contribute considerably to the development of complex human diseases, such as breast cancer. High-throughput genotyping technologies, such as Affymetrix high density single-nucleotide polymorphism (SNP) arrays, have produced large amounts of genetic data for genome-wide SNP genotype calling and copy number estimation. Meanwhile, there is a great need for accurate and efficient statistical methods to detect copy number variants. In this article, we introduce a hidden-Markov-model (HMM)-based method, referred to as the PICR-CNV, for copy number inference. The proposed method first estimates copy number abundance for each single SNP on a single array based on the raw fluorescence values, and then standardizes the estimated copy number abundance to achieve equal footing among multiple arrays. This method requires no between-array normalization, and thus, maintains data integrity and independence of samples among individual subjects. In addition to our efforts to apply new statistical technology to raw fluorescence values, the HMM has been applied to the standardized copy number abundance in order to reduce experimental noise. Through simulations, we show our refined method is able to infer copy number variants accurately. Application of the proposed method to a breast cancer dataset helps to identify genomic regions significantly associated with the disease. PMID:26279618

  2. Finding Markers That Make a Difference: DNA Pooling and SNP-Arrays Identify Population Informative Markers for Genetic Stock Identification

    PubMed Central

    Ozerov, Mikhail; Vasemägi, Anti; Wennevik, Vidar; Diaz-Fernandez, Rogelio; Kent, Matthew; Gilbey, John; Prusov, Sergey; Niemelä, Eero; Vähä, Juha-Pekka

    2013-01-01

    Genetic stock identification (GSI) using molecular markers is an important tool for management of migratory species. Here, we tested a cost-effective alternative to individual genotyping, known as allelotyping, for identification of highly informative SNPs for accurate genetic stock identification. We estimated allele frequencies of 2880 SNPs from DNA pools of 23 Atlantic salmon populations using Illumina SNP-chip. We evaluated the performance of four common strategies (global FST, pairwise FST, Delta and outlier approach) for selection of the most informative set of SNPs and tested their effectiveness for GSI compared to random sets of SNP and microsatellite markers. For the majority of cases, SNPs selected using the outlier approach performed best followed by pairwise FST and Delta methods. Overall, the selection procedure reduced the number of SNPs required for accurate GSI by up to 53% compared with randomly chosen SNPs. However, GSI accuracy was more affected by populations in the ascertainment group rather than the ranking method itself. We demonstrated for the first time the compatibility of different large-scale SNP datasets by compiling the largest population genetic dataset for Atlantic salmon to date. Finally, we showed an excellent performance of our top SNPs on an independent set of populations covering the main European distribution range of Atlantic salmon. Taken together, we demonstrate how combination of DNA pooling and SNP arrays can be applied for conservation and management of salmonids as well as other species. PMID:24358184

  3. Finding markers that make a difference: DNA pooling and SNP-arrays identify population informative markers for genetic stock identification.

    PubMed

    Ozerov, Mikhail; Vasemägi, Anti; Wennevik, Vidar; Diaz-Fernandez, Rogelio; Kent, Matthew; Gilbey, John; Prusov, Sergey; Niemelä, Eero; Vähä, Juha-Pekka

    2013-01-01

    Genetic stock identification (GSI) using molecular markers is an important tool for management of migratory species. Here, we tested a cost-effective alternative to individual genotyping, known as allelotyping, for identification of highly informative SNPs for accurate genetic stock identification. We estimated allele frequencies of 2880 SNPs from DNA pools of 23 Atlantic salmon populations using Illumina SNP-chip. We evaluated the performance of four common strategies (global F ST, pairwise F ST, Delta and outlier approach) for selection of the most informative set of SNPs and tested their effectiveness for GSI compared to random sets of SNP and microsatellite markers. For the majority of cases, SNPs selected using the outlier approach performed best followed by pairwise F ST and Delta methods. Overall, the selection procedure reduced the number of SNPs required for accurate GSI by up to 53% compared with randomly chosen SNPs. However, GSI accuracy was more affected by populations in the ascertainment group rather than the ranking method itself. We demonstrated for the first time the compatibility of different large-scale SNP datasets by compiling the largest population genetic dataset for Atlantic salmon to date. Finally, we showed an excellent performance of our top SNPs on an independent set of populations covering the main European distribution range of Atlantic salmon. Taken together, we demonstrate how combination of DNA pooling and SNP arrays can be applied for conservation and management of salmonids as well as other species. PMID:24358184

  4. Interim report on updated microarray probes for the LLNL Burkholderia pseudomallei SNP array

    SciTech Connect

    Gardner, S; Jaing, C

    2012-03-27

    The overall goal of this project is to forensically characterize 100 unknown Burkholderia isolates in the US-Australia collaboration. We will identify genome-wide single nucleotide polymorphisms (SNPs) from B. pseudomallei and near neighbor species including B. mallei, B. thailandensis and B. oklahomensis. We will design microarray probes to detect these SNP markers and analyze 100 Burkholderia genomic DNAs extracted from environmental, clinical and near neighbor isolates from Australian collaborators on the Burkholderia SNP microarray. We will analyze the microarray genotyping results to characterize the genetic diversity of these new isolates and triage the samples for whole genome sequencing. In this interim report, we described the SNP analysis and the microarray probe design for the Burkholderia SNP microarray.

  5. Genome-wide linkage analysis of a Parkinsonian-pyramidal syndrome pedigree by 500 K SNP arrays.

    PubMed

    Shojaee, Seyedmehdi; Sina, Farzad; Banihosseini, Setareh Sadat; Kazemi, Mohammad Hossein; Kalhor, Reza; Shahidi, Gholam-Ali; Fakhrai-Rad, Hossein; Ronaghi, Mostafa; Elahi, Elahe

    2008-06-01

    Robust SNP genotyping technologies and data analysis programs have encouraged researchers in recent years to use SNPs for linkage studies. Platforms used to date have been 10 K chip arrays, but the possible value of interrogating SNPs at higher densities has been considered. Here, we present a genome-wide linkage analysis by means of a 500 K SNP platform. The analysis was done on a large pedigree affected with Parkinsonian-pyramidal syndrome (PPS), and the results showed linkage to chromosome 22. Sequencing of candidate genes revealed a disease-associated homozygous variation (R378G) in FBXO7. FBXO7 codes for a member of the F-box family of proteins, all of which may have a role in the ubiquitin-proteosome protein-degradation pathway. This pathway has been implicated in various neurodegenerative diseases, and identification of FBXO7 as the causative gene of PPS is expected to shed new light on its role. The performance of the array was assessed and systematic analysis of effects of SNP density reduction was performed with the real experimental data. Our results suggest that linkage in our pedigree may have been missed had we used chips containing less than 100,000 SNPs across the genome. PMID:18513678

  6. Detection of Chromosomal Structural Alterations in Single Cells by SNP Arrays: A Systematic Survey of Amplification Bias and Optimized Workflow

    PubMed Central

    Iwamoto, Kazuya; Bundo, Miki; Ueda, Junko; Nakano, Yoko; Ukai, Wataru; Hashimoto, Eri; Saito, Toshikazu; Kato, Tadafumi

    2007-01-01

    Background In single-cell human genome analysis using whole-genome amplified product, a strong amplification bias involving allele dropout and preferential amplification hampers the quality of results. Using an oligonucleotide single nucleotide polymorphism (SNP) array, we systematically examined the nature of this amplification bias, including frequency, degree, and preference for genomic location, and we assessed the effects of this amplification bias on subsequent genotype and chromosomal copy number analyses. Methodology/Principal Findings We found a large variability in amplification bias among the amplified products obtained by multiple displacement amplification (MDA), and this bias had a severe effect on the genotype and chromosomal copy number analyses. We established optimal experimental conditions for pre-screening for high-quality amplified products, processing array data, and analyzing chromosomal structural alterations. Using this optimized protocol, we successfully detected previously unidentified chromosomal structural alterations in single cells from a lymphoblastoid cell line. These alterations were subsequently confirmed by karyotype analysis. In addition, we successfully obtained reproducible chromosomal copy number profiles of single cells from the cell line with a complex karyotype, indicating the applicability and potential of our optimized workflow. Conclusions/Significance Our results suggest that the quality of amplification products should be critically assessed before using them for genomic analyses. The method of MDA-based whole-genome amplification followed by SNP array analysis described here will be useful for exploring chromosomal alterations in single cells. PMID:18074030

  7. Bivariate segmentation of SNP-array data for allele-specific copy number analysis in tumour samples

    PubMed Central

    2013-01-01

    Background SNP arrays output two signals that reflect the total genomic copy number (LRR) and the allelic ratio (BAF), which in combination allow the characterisation of allele-specific copy numbers (ASCNs). While methods based on hidden Markov models (HMMs) have been extended from array comparative genomic hybridisation (aCGH) to jointly handle the two signals, only one method based on change-point detection, ASCAT, performs bivariate segmentation. Results In the present work, we introduce a generic framework for bivariate segmentation of SNP array data for ASCN analysis. For the matter, we discuss the characteristics of the typically applied BAF transformation and how they affect segmentation, introduce concepts of multivariate time series analysis that are of concern in this field and discuss the appropriate formulation of the problem. The framework is implemented in a method named CnaStruct, the bivariate form of the structural change model (SCM), which has been successfully applied to transcriptome mapping and aCGH. Conclusions On a comprehensive synthetic dataset, we show that CnaStruct outperforms the segmentation of existing ASCN analysis methods. Furthermore, CnaStruct can be integrated into the workflows of several ASCN analysis tools in order to improve their performance, specially on tumour samples highly contaminated by normal cells. PMID:23497144

  8. A HapMap leads to a Capsicum annuum SNP infinium array: a new tool for pepper breeding

    PubMed Central

    Hulse-Kemp, Amanda M; Ashrafi, Hamid; Plieske, Joerg; Lemm, Jana; Stoffel, Kevin; Hill, Theresa; Luerssen, Hartmut; Pethiyagoda, Charit L; Lawley, Cindy T; Ganal, Martin W; Van Deynze, Allen

    2016-01-01

    The Capsicum genus (Pepper) is a part of the Solanacae family. It has been important in many cultures worldwide for its key nutritional components and uses as spices, medicines, ornamentals and vegetables. Worldwide population growth is associated with demand for more nutritionally valuable vegetables while contending with decreasing resources and available land. These conditions require increased efficiency in pepper breeding to deal with these imminent challenges. Through resequencing of inbred lines we have completed a valuable haplotype map (HapMap) for the pepper genome based on single-nucleotide polymorphisms (SNP). The identified SNPs were annotated and classified based on their gene annotation in the pepper draft genome sequence and phenotype of the sequenced inbred lines. A selection of one marker per gene model was utilized to create the PepperSNP16K array, which simultaneously genotyped 16 405 SNPs, of which 90.7% were found to be informative. A set of 84 inbred and hybrid lines and a mapping population of 90 interspecific F2 individuals were utilized to validate the array. Diversity analysis of the inbred lines shows a distinct separation of bell versus chile/hot pepper types and separates them into five distinct germplasm groups. The interspecific population created between Tabasco (C. frutescens chile type) and P4 (C. annuum blocky type) produced a linkage map with 5546 markers separated into 1361 bins on twelve 12 linkage groups representing 1392.3 cM. This publically available genotyping platform can be used to rapidly assess a large number of markers in a reproducible high-throughput manner for pepper. As a standardized tool for genetic analyses, the PepperSNP16K can be used worldwide to share findings and analyze QTLs for important traits leading to continued improvement of pepper for consumers. Data and information on the array are available through the Solanaceae Genomics Network. PMID:27602231

  9. A HapMap leads to a Capsicum annuum SNP infinium array: a new tool for pepper breeding.

    PubMed

    Hulse-Kemp, Amanda M; Ashrafi, Hamid; Plieske, Joerg; Lemm, Jana; Stoffel, Kevin; Hill, Theresa; Luerssen, Hartmut; Pethiyagoda, Charit L; Lawley, Cindy T; Ganal, Martin W; Van Deynze, Allen

    2016-01-01

    The Capsicum genus (Pepper) is a part of the Solanacae family. It has been important in many cultures worldwide for its key nutritional components and uses as spices, medicines, ornamentals and vegetables. Worldwide population growth is associated with demand for more nutritionally valuable vegetables while contending with decreasing resources and available land. These conditions require increased efficiency in pepper breeding to deal with these imminent challenges. Through resequencing of inbred lines we have completed a valuable haplotype map (HapMap) for the pepper genome based on single-nucleotide polymorphisms (SNP). The identified SNPs were annotated and classified based on their gene annotation in the pepper draft genome sequence and phenotype of the sequenced inbred lines. A selection of one marker per gene model was utilized to create the PepperSNP16K array, which simultaneously genotyped 16 405 SNPs, of which 90.7% were found to be informative. A set of 84 inbred and hybrid lines and a mapping population of 90 interspecific F2 individuals were utilized to validate the array. Diversity analysis of the inbred lines shows a distinct separation of bell versus chile/hot pepper types and separates them into five distinct germplasm groups. The interspecific population created between Tabasco (C. frutescens chile type) and P4 (C. annuum blocky type) produced a linkage map with 5546 markers separated into 1361 bins on twelve 12 linkage groups representing 1392.3 cM. This publically available genotyping platform can be used to rapidly assess a large number of markers in a reproducible high-throughput manner for pepper. As a standardized tool for genetic analyses, the PepperSNP16K can be used worldwide to share findings and analyze QTLs for important traits leading to continued improvement of pepper for consumers. Data and information on the array are available through the Solanaceae Genomics Network. PMID:27602231

  10. Evaluating the Influence of Quality Control Decisions and Software Algorithms on SNP Calling for the Affymetrix 6.0 SNP Array Platform

    PubMed Central

    de Andrade, Mariza; Atkinson, Elizabeth J.; Bamlet, William R.; Matsumoto, Martha E.; Maharjan, Sooraj; Slager, Susan L.; Vachon, Celine M.; Cunningham, Julie M.; Kardia, Sharon L.R.

    2011-01-01

    Objective Our goal was to evaluate the influence of quality control (QC) decisions using two genotype calling algorithms, CRLMM and Birdseed, designed for the Affymetrix SNP Array 6.0. Methods Various QC options were tried using the two algorithms and comparisons were made on subject and call rate and on association results using two data sets. Results For Birdseed, we recommend using the contrast QC instead of QC call rate for sample QC. For CRLMM, we recommend using the signal-to-noise rate ≥4 for sample QC and a posterior probability of 90% for genotype accuracy. For both algorithms, we recommend calling the genotype separately for each plate, and dropping SNPs with a lower call rate (<95%) before evaluating samples with lower call rates. To investigate whether the genotype calls from the two algorithms impacted the genome-wide association results, we performed association analysis using data from the GENOA cohort; we observed that the number of significant SNPs were similar using either CRLMM or Birdseed. Conclusions Using our suggested workflow both algorithms performed similarly; however, fewer samples were removed and CRLMM took half the time to run our 854 study samples (4.2 h) compared to Birdseed (8.4 h). PMID:21734406

  11. Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao.

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ~4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification pr...

  12. High-throughput SNP scoring with GAMMArrays: genomic analysis using multiplexed microsphere arrays

    NASA Astrophysics Data System (ADS)

    Green, Lance D.; Cai, Hong; Torney, David C.; Wood, Diane J.; Uribe-Romeo, Francisco J.; Kaderali, Lars; Nolan, John P.; White, P. S.

    2002-06-01

    We have developed a SNP scoring platform, yielding high throughput, inexpensive assays. The basic platform uses fluorescently labeled DNA fragments bound to microspheres, which are analyzed using flow cytometry. SNP scoring is performed using minisequencing primers and fluorescently labeled dideoxynucleotides. Furthermore, multiplexed microspheres make it possible to score hundreds of SNPs simultaneously. Multiplexing, coupled with high throughput rates allow inexpensive scoring of several million SNPs/day. GAMMArrays use universal tags that consist of computer designed, unique DNA tails. These are incorporated into each primer, and the reverse-component is attached to a discrete population of microspheres in a multiplexed set. This enables simultaneous minisequencing of many SNPs in solution, followed by capture onto the appropriate microsphere for multiplexed analysis by flow cytometry. We present results from multiplexed SNP analyses of bacterial pathogens, and human mtDNA variation. Analytes are performed on PCR amplicons, each containing numerous SNPs scored simultaneously. In addition, these assays easily integrate into conventional liquid handling automation, and require no unique instrumentation for setup and analysis. Very high signal-to-noise ratios, ease of setup, flexibility in format and scale, and low cost make these assays extremely versatile and valuable tools for a wide variety of SNP scoring applications.

  13. A novel hemizygous SACS mutation identified by whole exome sequencing and SNP array analysis in a Chinese ARSACS patient.

    PubMed

    Liu, L; Li, X B; Zi, X H; Shen, L; Hu, Zh M; Huang, Sh X; Yu, D L; Li, H B; Xia, K; Tang, B S; Zhang, R X

    2016-03-15

    The array of autosomal recessive spastic ataxia of Charlevoix-Saguenay (ARSACS) has expanded worldwide after the first description in the Charlevoix-Saguenay region of Québec. Here, we report a Chinese ARSACS patient presenting progressive peripheral neuropathy (CMTNS2=15) with horizontal gaze nystagmus and mild spastic gait. Genetic studies including whole exome sequencing (WES), Sanger sequencing and single nucleotide polymorphism (SNP) array analysis revealed a novel hemizygous nonsense mutation (c.11803C>T, p.Gln3935X) of SACS and a 1.33Mb deletion involved in SACS on chromosome 13q12.12 in the patient. Our findings highlight the necessity of SACS mutation screening in the gene panel of inherited peripheral neuropathies, and stress the need of testing copy number variation (CNV) in SACS mutation screening. PMID:26944128

  14. Next generation genome-wide association tool: Design and coverage of a high-throughput European-optimized SNP array

    PubMed Central

    Hoffmann, Thomas J.; Kvale, Mark N.; Hesselson, Stephanie E.; Zhan, Yiping; Aquino, Christine; Cao, Yang; Cawley, Simon; Chung, Elaine; Connell, Sheryl; Eshragh, Jasmin; Ewing, Marcia; Gollub, Jeremy; Henderson, Mary; Hubbell, Earl; Iribarren, Carlos; Kaufman, Jay; Lao, Richard Z.; Lu, Yontao; Ludwig, Dana; Mathauda, Gurpreet K.; McGuire, William; Mei, Gangwu; Miles, Sunita; Purdy, Matthew M.; Quesenberry, Charles; Ranatunga, Dilrini; Rowell, Sarah; Sadler, Marianne; Shapero, Michael H.; Shen, Ling; Shenoy, Tanushree R.; Smethurst, David; Van den Eeden, Stephen K.; Walter, Larry; Wan, Eunice; Wearley, Reid; Webster, Teresa; Wen, Christopher C.; Weng, Li; Whitmer, Rachel A.; Williams, Alan; Wong, Simon C.; Zau, Chia; Finn, Andrea; Schaefer, Catherine; Kwok, Pui-Yan; Risch, Neil

    2011-01-01

    The success of genome-wide association studies has paralleled the development of efficient genotyping technologies. We describe the development of a next-generation microarray based on the new highly-efficient Affymetrix Axiom genotyping technology that we are using to genotype individuals of European ancestry from the Kaiser Permanente Research Program on Genes, Environment and Health (RPGEH). The array contains 674,517 SNPs, and provides excellent genome-wide as well as gene-based and candidate-SNP coverage. Coverage was calculated using an approach based on imputation and cross validation. Preliminary results for the first 80,301 saliva-derived DNA samples from the RPGEH demonstrate very high quality genotypes, with sample success rates above 94% and over 98% of successful samples having SNP call rates exceeding 98%. At steady state, we have produced 462 million genotypes per week for each Axiom system. The new array provides a valuable addition to the repertoire of tools for large scale genome-wide association studies. PMID:21565264

  15. A high-throughput SNP array in the amphidiploid species Brassica napus shows diversity in resistance genes.

    PubMed

    Dalton-Morgan, Jessica; Hayward, Alice; Alamery, Salman; Tollenaere, Reece; Mason, Annaliese S; Campbell, Emma; Patel, Dhwani; Lorenc, Michał T; Yi, Bin; Long, Yan; Meng, Jinling; Raman, Rosy; Raman, Harsh; Lawley, Cindy; Edwards, David; Batley, Jacqueline

    2014-12-01

    Single-nucleotide polymorphisms (SNPs)are molecular markers based on nucleotide variation and can be used for genotyping assays across populations and to track genomic inheritance. SNPs offer a comprehensive genotyping alternative to whole-genome sequencing for both agricultural and research purposes including molecular breeding and diagnostics, genome evolution and genetic diversity analyses, genetic mapping, and trait association studies. Here genomic SNPs were discovered between four cultivars of the important amphidiploid oilseed species Brassica napus and used to develop a B. napus Infinium™ array containing 5,306 SNPs randomly dispersed across the genome. Assay success was high, with >94 % of these producing a reproducible, polymorphic genotype in the 1,070 samples screened. Although the assay was designed to B. napus, successful SNP amplification was achieved in the B. napus progenitor species, Brassica rapa and Brassica oleracea, and to a lesser extent in the related species Brassica nigra. Phylogenetic analysis was consistent with the expected relationships between B. napus individuals. This study presents an efficient custom SNP assay development pipeline in the complex polyploid Brassica genome and demonstrates the utility of the array for high-throughput genotyping in a number of related Brassica species. It also demonstrates the utility of this assay in genotyping resistance genes on chromosome A7, which segregate amongst the 1,070 samples. PMID:25147024

  16. Integrated analysis of copy number and loss of heterozygosity in primary breast carcinomas using high-density SNP array.

    PubMed

    Ching, Ho Ching; Naidu, Rakesh; Seong, Mun Kein; Har, Yip Cheng; Taib, Nur Aishah Mohd

    2011-09-01

    Breast cancer is a heterogeneous disease, marked by extensive chromosomal aberrations. In this study, we aimed to explicate the underlying chromosomal copy number (CN) alterations and loss of heterozygosity (LOH) implicated in a cohort of Malaysian hospital-based primary breast carcinoma samples using a single nucleotide polymorphism (SNP) array platform. The analysis was conducted by hybridizing the extracted DNA of 70 primary breast carcinomas and 37 normal peripheral blood samples to the Affymetrix 250K Sty SNP arrays. Locus-specific CN aberrations and LOH were statistically summarized using the binary segmentation algorithm and hidden Markov model. Selected genes from the SNP array analysis were also validated using quantitative real-time PCR. The merging of CN and LOH data fabricated distinctive integrated alteration profiles, which were comprised of finely demarcated minimal sites of aberrations. The most prevalent gains (≥ 30%) were detected at the 8q arm: 8q23.1, 8q23.3, 8q24.11, 8q24.13, 8q24.21, 8q24.22, 8q24.23 and 8q24.3, whilst the most ubiquitous losses (≥ 20%) were noted at the 8p12, 8p21.1, 8p21.2, 8p21.1-p21.2, 8p21.3, 8p22, 8p23.1, 8p23.1‑p23.2, 8p23.3, 17p11.2, 17p12, 17p11.2-p12, 17p13.1 and 17p13.2 regions. Copy-neutral LOH was characterized as the most prevailing LOH event, in which the most frequent distributions (≥ 30%) were revealed at 3p21.31, 5q33.2, 12q24.12, 12q24.12‑q24.13 and 14q23.1. These findings offer compre-hensive genome-wide views on breast cancer genomic changes, where the most recurrent gain, loss and copy-neutral LOH events were harboured within the 8q24.21, 8p21.1 and 14q23.1 loci, respectively. This will facilitate the uncovering of true driver genes pertinent to breast cancer biology and the develop-ment of prospective therapeutics. PMID:21687935

  17. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

    PubMed Central

    Troggio, Michela; Šurbanovski, Nada; Bianco, Luca; Moretto, Marco; Giongo, Lara; Banchi, Elisa; Viola, Roberto; Fernández, Felicdad Fernández; Costa, Fabrizio; Velasco, Riccardo; Cestaro, Alessandro; Sargent, Daniel James

    2013-01-01

    High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the ‘Golden Delicious’ genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies. PMID:23826289

  18. Varietal identification of tea (Camellia sinensis) using nanofluidic array of single nucleotide polymorphism (SNP) markers

    PubMed Central

    Fang, Wan-Ping; Meinhardt, Lyndel W; Tan, Hua-Wei; Zhou, Lin; Mischke, Sue; Zhang, Dapeng

    2014-01-01

    Apart from water, tea is the world’s most widely consumed beverage. Tea is produced in more than 50 countries with an annual production of approximately 4.7 million tons. The market segment for specialty tea has been expanding rapidly owing to increased demand, resulting in higher revenues and profits for tea growers and the industry. Accurate varietal identification is critically important to ensure traceability and authentication of premium tea products, which in turn contribute to on-farm conservation of tea genetic diversity. Using a set of single nucleotide polymorphism (SNP) markers developed from the expressed sequence tag (EST) database of Camilla senensis, we genotyped deoxyribonucleic acid (DNA) samples extracted from a diverse group of tea varieties, including both fresh and processed commercial loose-leaf teas. The validation led to the designation of 60 SNPs that unambiguously identified all 40 tested tea varieties with high statistical rigor (p<0.0001). Varietal authenticity and genetic relationships among the analyzed cultivars were further characterized by ordination and Bayesian clustering analysis. These SNP markers, in combination with a high-throughput genotyping protocol, effectively established and verified specific DNA fingerprints for all tested tea varieties. This method provides a powerful tool for variety authentication and quality control for the tea industry. It is also highly useful for the management of tea genetic resources and breeding, where accurate and efficient genotype identification is essential. PMID:26504544

  19. Design And Performance Of 44,100 SNP Genotyping Array For Rice

    Technology Transfer Automated Retrieval System (TEKTRAN)

    To document genome-wide allelic variation within and between the different subpopulations of both O. sativa and O. rufipogon, we developed an Affymetrix custom genotyping array containing 44,100 SNPs well distributed across the 400Mb rice genome. The SNPs on this array were selected from the MBML-in...

  20. Development of a 63K SNP Array for Cotton and High-Density Mapping of Intraspecific and Interspecific Populations of Gossypium spp.

    PubMed Central

    Hulse-Kemp, Amanda M.; Lemm, Jana; Plieske, Joerg; Ashrafi, Hamid; Buyyarapu, Ramesh; Fang, David D.; Frelichowski, James; Giband, Marc; Hague, Steve; Hinze, Lori L.; Kochan, Kelli J.; Riggs, Penny K.; Scheffler, Jodi A.; Udall, Joshua A.; Ulloa, Mauricio; Wang, Shirley S.; Zhu, Qian-Hao; Bag, Sumit K.; Bhardwaj, Archana; Burke, John J.; Byers, Robert L.; Claverie, Michel; Gore, Michael A.; Harker, David B.; Islam, Md S.; Jenkins, Johnie N.; Jones, Don C.; Lacape, Jean-Marc; Llewellyn, Danny J.; Percy, Richard G.; Pepper, Alan E.; Poland, Jesse A.; Mohan Rai, Krishan; Sawant, Samir V.; Singh, Sunil Kumar; Spriggs, Andrew; Taylor, Jen M.; Wang, Fei; Yourstone, Scott M.; Zheng, Xiuting; Lawley, Cindy T.; Ganal, Martin W.; Van Deynze, Allen; Wilson, Iain W.; Stelly, David M.

    2015-01-01

    High-throughput genotyping arrays provide a standardized resource for plant breeding communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal × Seemann, G. mustelinum Miers × Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community. PMID:25908569

  1. Development of a 63K SNP Array for Cotton and High-Density Mapping of Intraspecific and Interspecific Populations of Gossypium spp.

    PubMed

    Hulse-Kemp, Amanda M; Lemm, Jana; Plieske, Joerg; Ashrafi, Hamid; Buyyarapu, Ramesh; Fang, David D; Frelichowski, James; Giband, Marc; Hague, Steve; Hinze, Lori L; Kochan, Kelli J; Riggs, Penny K; Scheffler, Jodi A; Udall, Joshua A; Ulloa, Mauricio; Wang, Shirley S; Zhu, Qian-Hao; Bag, Sumit K; Bhardwaj, Archana; Burke, John J; Byers, Robert L; Claverie, Michel; Gore, Michael A; Harker, David B; Islam, Md S; Jenkins, Johnie N; Jones, Don C; Lacape, Jean-Marc; Llewellyn, Danny J; Percy, Richard G; Pepper, Alan E; Poland, Jesse A; Mohan Rai, Krishan; Sawant, Samir V; Singh, Sunil Kumar; Spriggs, Andrew; Taylor, Jen M; Wang, Fei; Yourstone, Scott M; Zheng, Xiuting; Lawley, Cindy T; Ganal, Martin W; Van Deynze, Allen; Wilson, Iain W; Stelly, David M

    2015-06-01

    High-throughput genotyping arrays provide a standardized resource for plant breeding communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal × Seemann, G. mustelinum Miers × Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community. PMID:25908569

  2. Development of a SNP array and its application to genetic mapping and diversity assessment in pepper (Capsicum spp.).

    PubMed

    Cheng, Jiaowen; Qin, Cheng; Tang, Xin; Zhou, Huangkai; Hu, Yafei; Zhao, Zicheng; Cui, Junjie; Li, Bo; Wu, Zhiming; Yu, Jiping; Hu, Kailin

    2016-01-01

    The development and application of single nucleotide polymorphisms (SNPs) is in its infancy for pepper. Here, a set of 15,000 SNPs were chosen from the resequencing data to develop an array for pepper with 12,720 loci being ultimately synthesized. Of these, 8,199 (~64.46%) SNPs were found to be scorable and covered ~81.18% of the whole genome. With this array, a high-density interspecific genetic map with 5,569 SNPs was constructed using 297 F2 individuals, and genetic diversity of a panel of 399 pepper elite/landrace lines was successfully characterized. Based on the genetic map, one major QTL, named Up12.1, was detected for the fruit orientation trait. A total of 65 protein-coding genes were predicted within this QTL region based on the current annotation of the Zunla-1 genome. In summary, the thousands of well-validated SNP markers, high-density genetic map and genetic diversity information will be useful for molecular genetics and innovative breeding in pepper. Furthermore, the mapping results lay foundation for isolating the genes underlying variation in fruit orientation of Capsicum. PMID:27623541

  3. Regions of homozygosity identified by oligonucleotide SNP arrays: evaluating the incidence and clinical utility.

    PubMed

    Wang, Jia-Chi; Ross, Leslie; Mahon, Loretta W; Owen, Renius; Hemmat, Morteza; Wang, Boris T; El Naggar, Mohammed; Kopita, Kimberly A; Randolph, Linda M; Chase, John M; Matas Aguilera, Maria J; Siles, Juan López; Church, Joseph A; Hauser, Natalie; Shen, Joseph J; Jones, Marilyn C; Wierenga, Klaas J; Jiang, Zhijie; Haddadin, Mary; Boyar, Fatih Z; Anguiano, Arturo; Strom, Charles M; Sahoo, Trilochan

    2015-05-01

    Copy neutral segments with allelic homozygosity, also known as regions of homozygosity (ROHs), are frequently identified in cases interrogated by oligonucleotide single-nucleotide polymorphism (oligo-SNP) microarrays. Presence of ROHs may be because of parental relatedness, chromosomal recombination or rearrangements and provides important clues regarding ancestral homozygosity, consanguinity or uniparental disomy. In this study of 14 574 consecutive cases, 832 (6%) were found to harbor one or more ROHs over 10 Mb, of which 651 cases (78%) had multiple ROHs, likely because of identity by descent (IBD), and 181 cases (22%) with ROHs involving a single chromosome. Parental relatedness was predicted to be first degree or closer in 5%, second in 9% and third in 19%. Of the 181 cases, 19 had ROHs for a whole chromosome revealing uniparental isodisomy (isoUPD). In all, 25 cases had significant ROHs involving a single chromosome; 5 cases were molecularly confirmed to have a mixed iso- and heteroUPD15 and 1 case each with segmental UPD9pat and segmental UPD22mat; 17 cases were suspected to have a mixed iso- and heteroUPD including 2 cases with small supernumerary marker and 2 cases with mosaic trisomy. For chromosome 15, 12 (92%) of 13 molecularly studied cases had either Prader-Willi or Angelman syndrome. Autosomal recessive disorders were confirmed in seven of nine cases from eight families because of the finding of suspected gene within a ROH. This study demonstrates that ROHs are much more frequent than previously recognized and often reflect parental relatedness, ascertain autosomal recessive diseases or unravel UPD in many cases. PMID:25118026

  4. Regions of homozygosity identified by oligonucleotide SNP arrays: evaluating the incidence and clinical utility

    PubMed Central

    Wang, Jia-Chi; Ross, Leslie; Mahon, Loretta W; Owen, Renius; Hemmat, Morteza; Wang, Boris T; El Naggar, Mohammed; Kopita, Kimberly A; Randolph, Linda M; Chase, John M; Matas Aguilera, Maria J; Siles, Juan López; Church, Joseph A; Hauser, Natalie; Shen, Joseph J; Jones, Marilyn C; Wierenga, Klaas J; Jiang, Zhijie; Haddadin, Mary; Boyar, Fatih Z; Anguiano, Arturo; Strom, Charles M; Sahoo, Trilochan

    2015-01-01

    Copy neutral segments with allelic homozygosity, also known as regions of homozygosity (ROHs), are frequently identified in cases interrogated by oligonucleotide single-nucleotide polymorphism (oligo-SNP) microarrays. Presence of ROHs may be because of parental relatedness, chromosomal recombination or rearrangements and provides important clues regarding ancestral homozygosity, consanguinity or uniparental disomy. In this study of 14 574 consecutive cases, 832 (6%) were found to harbor one or more ROHs over 10 Mb, of which 651 cases (78%) had multiple ROHs, likely because of identity by descent (IBD), and 181 cases (22%) with ROHs involving a single chromosome. Parental relatedness was predicted to be first degree or closer in 5%, second in 9% and third in 19%. Of the 181 cases, 19 had ROHs for a whole chromosome revealing uniparental isodisomy (isoUPD). In all, 25 cases had significant ROHs involving a single chromosome; 5 cases were molecularly confirmed to have a mixed iso- and heteroUPD15 and 1 case each with segmental UPD9pat and segmental UPD22mat; 17 cases were suspected to have a mixed iso- and heteroUPD including 2 cases with small supernumerary marker and 2 cases with mosaic trisomy. For chromosome 15, 12 (92%) of 13 molecularly studied cases had either Prader–Willi or Angelman syndrome. Autosomal recessive disorders were confirmed in seven of nine cases from eight families because of the finding of suspected gene within a ROH. This study demonstrates that ROHs are much more frequent than previously recognized and often reflect parental relatedness, ascertain autosomal recessive diseases or unravel UPD in many cases. PMID:25118026

  5. Copy number variation detection using SNP genotyping arrays in three Chinese pig breeds.

    PubMed

    Dong, K; Pu, Y; Yao, N; Shu, G; Liu, X; He, X; Zhao, Q; Guan, W; Ma, Y

    2015-04-01

    We performed genome-wide CNV detection based on SNP genotyping data of 96 Chinese-native Tibetan, Dahe and Wuzhishan pigs. These pigs are particularly interesting because of their excellent adaptation to hypoxia or small body size, which facilitates the use of them as models of different human diseases in addition to valuable agricultural animals. A total of 105 CNV regions (CNVRs) were identified, encompassing 16.71 Mb of the pig genome. Seven of 10 (70%) CNVRs selected randomly were validated by quantitative real-time PCR. Comparison with previous studies revealed 25 (23.81%) novel CNVRs, indicating that CNV coverage of the pig genome is still incomplete and there exists large diversity between pig breeds. Functional analysis of genes located in these CNVRs confirmed the high representation of genes involved in sensory perception, neurological system processes and other basic metabolic processes. In addition, the majority of these CNVRs were detected to span reported pig QTL that affect various traits, which highlighted three biologically interesting genes with copy number changes (i.e., ANKRD34B, FAM110B and ABCG1). These genes may have economic importance in pig breeding and are worth being further investigated. We also obtained some CNVRs harboring genes that had human orthologs involved in human diseases such as cardiovascular disease and Alzheimer's disease. The findings of this study are a significant extension of the coverage of CNVRs in the pig genome and provide valuable resources for follow-up-associated studies of CNVs in pig complex traits as well as important implications of human diseases. PMID:25590996

  6. Homozygosity mapping with SNP arrays identifies TRIM32, an E3 ubiquitin ligase, as a Bardet-Biedl syndrome gene (BBS11).

    PubMed

    Chiang, Annie P; Beck, John S; Yen, Hsan-Jan; Tayeh, Marwan K; Scheetz, Todd E; Swiderski, Ruth E; Nishimura, Darryl Y; Braun, Terry A; Kim, Kwang-Youn A; Huang, Jian; Elbedour, Khalil; Carmi, Rivka; Slusarski, Diane C; Casavant, Thomas L; Stone, Edwin M; Sheffield, Val C

    2006-04-18

    The identification of mutations in genes that cause human diseases has largely been accomplished through the use of positional cloning, which relies on linkage mapping. In studies of rare diseases, the resolution of linkage mapping is limited by the number of available meioses and informative marker density. One recent advance is the development of high-density SNP microarrays for genotyping. The SNP arrays overcome low marker informativity by using a large number of markers to achieve greater coverage at finer resolution. We used SNP microarray genotyping for homozygosity mapping in a small consanguineous Israeli Bedouin family with autosomal recessive Bardet-Biedl syndrome (BBS; obesity, pigmentary retinopathy, polydactyly, hypogonadism, renal and cardiac abnormalities, and cognitive impairment) in which previous linkage studies using short tandem repeat polymorphisms failed to identify a disease locus. SNP genotyping revealed a homozygous candidate region. Mutation analysis in the region of homozygosity identified a conserved homozygous missense mutation in the TRIM32 gene, a gene coding for an E3 ubiquitin ligase. Functional analysis of this gene in zebrafish and expression correlation analyses among other BBS genes in an expression quantitative trait loci data set demonstrate that TRIM32 is a BBS gene. This study shows the value of high-density SNP genotyping for homozygosity mapping and the use of expression correlation data for evaluation of candidate genes and identifies the proteasome degradation pathway as a pathway involved in BBS. PMID:16606853

  7. New resources for genetic studies in Populus nigra: genome-wide SNP discovery and development of a 12k Infinium array.

    PubMed

    Faivre-Rampant, P; Zaina, G; Jorge, V; Giacomello, S; Segura, V; Scalabrin, S; Guérin, V; De Paoli, E; Aluome, C; Viger, M; Cattonaro, F; Payne, A; PaulStephenRaj, P; Le Paslier, M C; Berard, A; Allwright, M R; Villar, M; Taylor, G; Bastien, C; Morgante, M

    2016-07-01

    Whole genome resequencing of 51 Populus nigra (L.) individuals from across Western Europe was performed using Illumina platforms. A total number of 1 878 727 SNPs distributed along the P. nigra reference sequence were identified. The SNP calling accuracy was validated with Sanger sequencing. SNPs were selected within 14 previously identified QTL regions, 2916 expressional candidate genes related to rust resistance, wood properties, water-use efficiency and bud phenology and 1732 genes randomly spread across the genome. Over 10 000 SNPs were selected for the construction of a 12k Infinium Bead-Chip array dedicated to association mapping. The SNP genotyping assay was performed with 888 P. nigra individuals. The genotyping success rate was 91%. Our high success rate was due to the discovery panel design and the stringent parameters applied for SNP calling and selection. In the same set of P. nigra genotypes, linkage disequilibrium throughout the genome decayed on average within 5-7 kb to half of its maximum value. As an application test, ADMIXTURE analysis was performed with a selection of 600 SNPs spread throughout the genome and 706 individuals collected along 12 river basins. The admixture pattern was consistent with genetic diversity revealed by neutral markers and the geographical distribution of the populations. These newly developed SNP resources and genotyping array provide a valuable tool for population genetic studies and identification of QTLs through natural-population based genetic association studies in P. nigra. PMID:26929265

  8. Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.)

    PubMed Central

    Koning-Boucoiran, Carole F. S.; Esselink, G. Danny; Vukosavljev, Mirjana; van 't Westende, Wendy P. C.; Gitonga, Virginia W.; Krens, Frans A.; Voorrips, Roeland E.; van de Weg, W. Eric; Schulz, Dietmar; Debener, Thomas; Maliepaard, Chris; Arens, Paul; Smulders, Marinus J. M.

    2015-01-01

    In order to develop a versatile and large SNP array for rose, we set out to mine ESTs from diverse sets of rose germplasm. For this RNA-Seq libraries containing about 700 million reads were generated from tetraploid cut and garden roses using Illumina paired-end sequencing, and from diploid Rosa multiflora using 454 sequencing. Separate de novo assemblies were performed in order to identify single nucleotide polymorphisms (SNPs) within and between rose varieties. SNPs among tetraploid roses were selected for constructing a genotyping array that can be employed for genetic mapping and marker-trait association discovery in breeding programs based on tetraploid germplasm, both from cut roses and from garden roses. In total 68,893 SNPs were included on the WagRhSNP Axiom array. Next, an orthology-guided assembly was performed for the construction of a non-redundant rose transcriptome database. A total of 21,740 transcripts had significant hits with orthologous genes in the strawberry (Fragaria vesca L.) genome. Of these 13,390 appeared to contain the full-length coding regions. This newly established transcriptome resource adds considerably to the currently available sequence resources for the Rosaceae family in general and the genus Rosa in particular. PMID:25954285

  9. Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.).

    PubMed

    Koning-Boucoiran, Carole F S; Esselink, G Danny; Vukosavljev, Mirjana; van 't Westende, Wendy P C; Gitonga, Virginia W; Krens, Frans A; Voorrips, Roeland E; van de Weg, W Eric; Schulz, Dietmar; Debener, Thomas; Maliepaard, Chris; Arens, Paul; Smulders, Marinus J M

    2015-01-01

    In order to develop a versatile and large SNP array for rose, we set out to mine ESTs from diverse sets of rose germplasm. For this RNA-Seq libraries containing about 700 million reads were generated from tetraploid cut and garden roses using Illumina paired-end sequencing, and from diploid Rosa multiflora using 454 sequencing. Separate de novo assemblies were performed in order to identify single nucleotide polymorphisms (SNPs) within and between rose varieties. SNPs among tetraploid roses were selected for constructing a genotyping array that can be employed for genetic mapping and marker-trait association discovery in breeding programs based on tetraploid germplasm, both from cut roses and from garden roses. In total 68,893 SNPs were included on the WagRhSNP Axiom array. Next, an orthology-guided assembly was performed for the construction of a non-redundant rose transcriptome database. A total of 21,740 transcripts had significant hits with orthologous genes in the strawberry (Fragaria vesca L.) genome. Of these 13,390 appeared to contain the full-length coding regions. This newly established transcriptome resource adds considerably to the currently available sequence resources for the Rosaceae family in general and the genus Rosa in particular. PMID:25954285

  10. Highly multiplexed DNA sequencing by capillary electrophoresis

    SciTech Connect

    Yeung, E.S.; Ueno, K.; Chang, H.T.

    1994-12-31

    It is obvious that irrespective of whichever basic technology is eventually selected to sequence the entire human genome there are substantial gains to be made if a high degree of multiplexing of parallel runs can be implemented. Such multiplexing should not involve expensive instrumentation and should not require additional personnel, or else the main objective of cost reduction will not be satisfied even though the total time for sequencing is reduced. In the last two years, several research groups have shown that capillary electrophoresis (CE) is an attractive alternative for DNA sequencing. Part of the improvement in sequencing speed in CE is counteracted by the inherent ability of slab gels for accommodating multiple lanes in a single run. Recently, the authors have developed several excitation schemes for highly multiplexed capillary electrophoresis. Detection at the pM level was demonstrated. The authors report here the use of a novel excitation geometry to simultaneously monitor 100 capillary tubes during electrophoresis. This represents a truly parallel multiplexing scheme for high-speed DNA sequencing.

  11. A 34K SNP genotyping array for Populus trichocarpa: design, application to the study of natural populations and transferability to other Populus species

    SciTech Connect

    Geraldes, Armando; Hannemann, Jan; Grassa, Chris; Farzaneh, Nima; Porth, Ilga; McKown, Athena; Skyba, Oleksandr; Li, Eryang; Mike, Fujita; Friedmann, Michael; Wasteneys, Geoffrey; Guy, Robert; El-Kassaby, Yousry; Mansfield, Shawn; Cronk, Quentin; Ehlting, Juergen; Douglas, Carl; DiFazio, Stephen P; Slavov, Gancho; Ranjan, Priya; Muchero, Wellington; Gunter, Lee E; Wymore, Ann; Tuskan, Gerald A; Martin, Joel; Schackwitz, Wendy; Pennacchio, Christa; Rokhsar, Daniel

    2013-01-01

    Genetic mapping of quantitative traits requires genotypic data for large numbers of markers in many individuals. Despite the declining costs of genotyping by sequencing, for most studies, the use of large SNP genotyping arrays still offers the most cost-effective solution for large-scale targeted genotyping. Here we report on the design and performance of a SNP genotyping array for Populus trichocarpa (black cottonwood). This genotyping array was designed with SNPs pre-ascertained in 34 wild accessions covering most of the species range. Due to the rapid decay of linkage disequilibrium in P. trichocarpa we adopted a candidate gene approach to the array design that resulted in the selection of 34,131 SNPs, the majority of which are located in, or within 2 kb, of 3,543 candidate genes. A subset of the SNPs (539) was selected based on patterns of variation among the SNP discovery accessions. We show that more than 95% of the loci produce high quality genotypes and that the genotyping error rate for these is likely below 2%, indicating that high-quality data are generated with this array. We demonstrate that even among small numbers of samples (n=10) from local populations over 84% of loci are polymorphic. We also tested the applicability of the array to other species in the genus and found that due to ascertainment bias the number of polymorphic loci decreases rapidly with genetic distance, with the largest numbers detected in other species in section Tacamahaca (P. balsamifera and P. angustifolia). Finally, we provide evidence for the utility of the array for intraspecific studies of genetic differentiation and for species assignment and the detection of natural hybrids.

  12. High-density SNP genotyping array for hexaploid wheat and its secondary and tertiary gene pool.

    PubMed

    Winfield, Mark O; Allen, Alexandra M; Burridge, Amanda J; Barker, Gary L A; Benbow, Harriet R; Wilkinson, Paul A; Coghill, Jane; Waterfall, Christy; Davassi, Alessandro; Scopes, Geoff; Pirani, Ali; Webster, Teresa; Brew, Fiona; Bloor, Claire; King, Julie; West, Claire; Griffiths, Simon; King, Ian; Bentley, Alison R; Edwards, Keith J

    2016-05-01

    In wheat, a lack of genetic diversity between breeding lines has been recognized as a significant block to future yield increases. Species belonging to bread wheat's secondary and tertiary gene pools harbour a much greater level of genetic variability, and are an important source of genes to broaden its genetic base. Introgression of novel genes from progenitors and related species has been widely employed to improve the agronomic characteristics of hexaploid wheat, but this approach has been hampered by a lack of markers that can be used to track introduced chromosome segments. Here, we describe the identification of a large number of single nucleotide polymorphisms that can be used to genotype hexaploid wheat and to identify and track introgressions from a variety of sources. We have validated these markers using an ultra-high-density Axiom(®) genotyping array to characterize a range of diploid, tetraploid and hexaploid wheat accessions and wheat relatives. To facilitate the use of these, both the markers and the associated sequence and genotype information have been made available through an interactive web site. PMID:26466852

  13. SNP Array Analysis Reveals Novel Genomic Abnormalities Including Copy Neutral Loss of Heterozygosity in Anaplastic Oligodendrogliomas

    PubMed Central

    Idbaih, Ahmed; Ducray, François; Dehais, Caroline; Courdy, Célia; Carpentier, Catherine; de Bernard, Simon; Uro-Coste, Emmanuelle; Mokhtari, Karima; Jouvet, Anne; Honnorat, Jérôme; Chinot, Olivier; Ramirez, Carole; Beauchesne, Patrick; Benouaich-Amiel, Alexandra; Godard, Joël; Eimer, Sandrine; Parker, Fabrice; Lechapt-Zalcman, Emmanuelle; Colin, Philippe; Loussouarn, Delphine; Faillot, Thierry; Dam-Hieu, Phong; Elouadhani-Hamdi, Selma; Bauchet, Luc; Langlois, Olivier; Le Guerinel, Caroline; Fontaine, Denys; Vauleon, Elodie; Menei, Philippe; Fotso, Marie Janette Motsuo; Desenclos, Christine; Verelle, Pierre; Ghiringhelli, François; Noel, Georges; Labrousse, François; Carpentier, Antoine; Dhermain, Frédéric; Delattre, Jean-Yves; Figarella-Branger, Dominique

    2012-01-01

    Anaplastic oligodendrogliomas (AOD) are rare glial tumors in adults with relative homogeneous clinical, radiological and histological features at the time of diagnosis but dramatically various clinical courses. Studies have identified several molecular abnormalities with clinical or biological relevance to AOD (e.g. t(1;19)(q10;p10), IDH1, IDH2, CIC and FUBP1 mutations). To better characterize the clinical and biological behavior of this tumor type, the creation of a national multicentric network, named “Prise en charge des OLigodendrogliomes Anaplasiques (POLA),” has been supported by the Institut National du Cancer (InCA). Newly diagnosed and centrally validated AOD patients and their related biological material (tumor and blood samples) were prospectively included in the POLA clinical database and tissue bank, respectively. At the molecular level, we have conducted a high-resolution single nucleotide polymorphism array analysis, which included 83 patients. Despite a careful central pathological review, AOD have been found to exhibit heterogeneous genomic features. A total of 82% of the tumors exhibited a 1p/19q-co-deletion, while 18% harbor a distinct chromosome pattern. Novel focal abnormalities, including homozygously deleted, amplified and disrupted regions, have been identified. Recurring copy neutral losses of heterozygosity (CNLOH) inducing the modulation of gene expression have also been discovered. CNLOH in the CDKN2A locus was associated with protein silencing in 1/3 of the cases. In addition, FUBP1 homozygous deletion was detected in one case suggesting a putative tumor suppressor role of FUBP1 in AOD. Our study showed that the genomic and pathological analyses of AOD are synergistic in detecting relevant clinical and biological subgroups of AOD. PMID:23071531

  14. Fast detection of de novo copy number variants from SNP arrays for case-parent trios

    PubMed Central

    2012-01-01

    Background In studies of case-parent trios, we define copy number variants (CNVs) in the offspring that differ from the parental copy numbers as de novo and of interest for their potential functional role in disease. Among the leading array-based methods for discovery of de novo CNVs in case-parent trios is the joint hidden Markov model (HMM) implemented in the PennCNV software. However, the computational demands of the joint HMM are substantial and the extent to which false positive identifications occur in case-parent trios has not been well described. We evaluate these issues in a study of oral cleft case-parent trios. Results Our analysis of the oral cleft trios reveals that genomic waves represent a substantial source of false positive identifications in the joint HMM, despite a wave-correction implementation in PennCNV. In addition, the noise of low-level summaries of relative copy number (log R ratios) is strongly associated with batch and correlated with the frequency of de novo CNV calls. Exploiting the trio design, we propose a univariate statistic for relative copy number referred to as the minimum distance that can reduce technical variation from probe effects and genomic waves. We use circular binary segmentation to segment the minimum distance and maximum a posteriori estimation to infer de novo CNVs from the segmented genome. Compared to PennCNV on simulated data, MinimumDistance identifies fewer false positives on average and is comparable to PennCNV with respect to false negatives. Genomic waves contribute to discordance of PennCNV and MinimumDistance for high coverage de novo calls, while highly concordant calls on chromosome 22 were validated by quantitative PCR. Computationally, MinimumDistance provides a nearly 8-fold increase in speed relative to the joint HMM in a study of oral cleft trios. Conclusions Our results indicate that batch effects and genomic waves are important considerations for case-parent studies of de novo CNV, and that the

  15. A High Density SNP Array for the Domestic Horse and Extant Perissodactyla: Utility for Association Mapping, Genetic Diversity, and Phylogeny Studies

    PubMed Central

    McCue, Molly E.; Bannasch, Danika L.; Petersen, Jessica L.; Gurr, Jessica; Bailey, Ernie; Binns, Matthew M.; Distl, Ottmar; Guérin, Gérard; Hasegawa, Telhisa; Hill, Emmeline W.; Leeb, Tosso; Lindgren, Gabriella; Penedo, M. Cecilia T.; Røed, Knut H.; Ryder, Oliver A.; Swinburne, June E.; Tozaki, Teruaki; Valberg, Stephanie J.; Vaudin, Mark; Lindblad-Toh, Kerstin

    2012-01-01

    An equine SNP genotyping array was developed and evaluated on a panel of samples representing 14 domestic horse breeds and 18 evolutionarily related species. More than 54,000 polymorphic SNPs provided an average inter-SNP spacing of ∼43 kb. The mean minor allele frequency across domestic horse breeds was 0.23, and the number of polymorphic SNPs within breeds ranged from 43,287 to 52,085. Genome-wide linkage disequilibrium (LD) in most breeds declined rapidly over the first 50–100 kb and reached background levels within 1–2 Mb. The extent of LD and the level of inbreeding were highest in the Thoroughbred and lowest in the Mongolian and Quarter Horse. Multidimensional scaling (MDS) analyses demonstrated the tight grouping of individuals within most breeds, close proximity of related breeds, and less tight grouping in admixed breeds. The close relationship between the Przewalski's Horse and the domestic horse was demonstrated by pair-wise genetic distance and MDS. Genotyping of other Perissodactyla (zebras, asses, tapirs, and rhinoceros) was variably successful, with call rates and the number of polymorphic loci varying across taxa. Parsimony analysis placed the modern horse as sister taxa to Equus przewalski. The utility of the SNP array in genome-wide association was confirmed by mapping the known recessive chestnut coat color locus (MC1R) and defining a conserved haplotype of ∼750 kb across all breeds. These results demonstrate the high quality of this SNP genotyping resource, its usefulness in diverse genome analyses of the horse, and potential use in related species. PMID:22253606

  16. Association between Genetic Subgroups of Pancreatic Ductal Adenocarcinoma Defined by High Density 500 K SNP-Arrays and Tumor Histopathology

    PubMed Central

    Gutiérrez, María Laura; Muñoz-Bellvis, Luís; Abad, María del Mar; Bengoechea, Oscar; González-González, María

    2011-01-01

    The specific genes and genetic pathways associated with pancreatic ductal adenocarcinoma are still largely unknown partially due to the low resolution of the techniques applied so far to their study. Here we used high-density 500 K single nucleotide polymorphism (SNP)-arrays to define those chromosomal regions which most commonly harbour copy number (CN) alterations and loss of heterozygozity (LOH) in a series of 20 PDAC tumors and we correlated the corresponding genetic profiles with the most relevant clinical and histopathological features of the disease. Overall our results showed that primary PDAC frequently display (>70%) extensive gains of chromosomes 1q, 7q, 8q and 20q, together with losses of chromosomes 1p, 9p, 12q, 17p and 18q, such chromosomal regions harboring multiple cancer- and PDAC-associated genes. Interestingly, these alterations clustered into two distinct genetic profiles characterized by gains of the 2q14.2, 3q22.1, 5q32, 10q26.13, 10q26.3, 11q13.1, 11q13.3, 11q13.4, 16q24.1, 16q24.3, 22q13.1, 22q13.31 and 22q13.32 chromosomal regions (group 1; n = 9) versus gains at 1q21.1 and losses of the 1p36.11, 6q25.2, 9p22.1, 9p24.3, 17p13.3 and Xp22.33 chromosomal regions (group 2; n = 11). From the clinical and histopathological point of view, group 1 cases were associated with smaller and well/moderately-differentiated grade I/II PDAC tumors, whereas and group 2 PDAC displayed a larger size and they mainly consisted of poorly-differentiated grade III carcinomas. These findings confirm the cytogenetic complexity and heterogenity of PDAC and provide evidence for the association between tumor cytogenetics and its histopathological features. In addition, we also show that the altered regions identified harbor multiple cancer associate genes that deserve further investigation to determine their relevance in the pathogenesis of PDAC. PMID:21811587

  17. SNP array screening of cryptic genomic imbalances in 450 Japanese subjects with intellectual disability and multiple congenital anomalies previously negative for large rearrangements.

    PubMed

    Uehara, Daniela Tiaki; Hayashi, Shin; Okamoto, Nobuhiko; Mizuno, Seiji; Chinen, Yasutsugu; Kosaki, Rika; Kosho, Tomoki; Kurosawa, Kenji; Matsumoto, Hiroshi; Mitsubuchi, Hiroshi; Numabe, Hironao; Saitoh, Shinji; Makita, Yoshio; Hata, Akira; Imoto, Issei; Inazawa, Johji

    2016-04-01

    Intellectual disability (ID) is a heterogeneous condition affecting 2-3% of the population, often associated with multiple congenital anomalies (MCA). The genetic cause remains largely unexplained for most cases. To investigate the causes of ID/MCA of unknown etiology in the Japanese population, 645 subjects have been recruited for the screening of pathogenic copy-number variants (CNVs). Two screenings using bacterial artificial chromosome (BAC) arrays were previously performed, which identified pathogenic CNVs in 133 cases (20.6%; Hayashi et al., J. Hum. Genet., 2011). Here, we present the findings of the third screening using a single-nucleotide polymorphism (SNP) array, performed in 450 negative cases from our previous report. Pathogenic CNVs were found in 22 subjects (4.9%), in which 19 CNVs were located in regions where clinical significance had been previously established. Among the 22 cases, we identified PPFIA2 as a novel candidate gene for ID. Analysis of copy-neutral loss of heterozygosity (CNLOH) detected one case in which the CNLOH regions seem to be significant. The SNP array detected a modest fraction of small causative CNVs, which is explained by the fact that the majority of causative CNVs have larger sizes, and those had been mostly identified in the two previous screenings. PMID:26740234

  18. Impact of SNP array karyotyping on the diagnosis and the outcome of chronic myelomonocytic leukemia with low risk cytogenetic features or no metaphases.

    PubMed

    Palomo, Laura; Xicoy, Blanca; Garcia, Olga; Mallo, Mar; Ademà, Vera; Cabezón, Marta; Arnan, Montse; Pomares, Helena; José Larrayoz, María; José Calasanz, María; Maciejewski, Jaroslaw P; Huang, Dayong; Shih, Lee-Yung; Ogawa, Seishi; Cervera, Jose; Such, Esperanza; Coll, Rosa; Grau, Javier; Solé, Francesc; Zamora, Lurdes

    2016-02-01

    Chronic myelomonocytic leukemia (CMML) is a clonal hematopoietic disorder with heterogeneous clinical, morphological and genetic characteristics. Clonal cytogenetic abnormalities are found in 20-30% of patients with CMML. Patients with low risk cytogenetic features (normal karyotype and isolated loss of Y chromosome) account for ∼80% of CMML patients and often fall into the low risk categories of CMML prognostic scores. We hypothesized that single nucleotide polymorphism arrays (SNP-A) karyotyping could detect cryptic chromosomal alterations with prognostic impact in these subgroup of patients. SNP-A were performed at diagnosis in 128 CMML patients with low risk karyotypes or uninformative results for conventional G-banding cytogenetics (CC). Copy number alterations (CNAs) and regions of copy number neutral loss of heterozygosity (CNN-LOH) were detected in 67% of patients. Recurrent CNAs included gains in regions 8p12 and 21q22 as well as losses in 10q21.1 and 12p13.2. Interstitial CNN-LOHs were recurrently detected in the following regions: 4q24-4q35, 7q32.1-7q36.3, and 11q13.3-11q25. Statistical analysis showed that some of the alterations detected by SNP-A associated with the patients' outcome. A shortened overall survival (OS) and progression free survival (PFS) was observed in cases where the affected size of the genome (considering CNAs and CNN-LOHs) was >11 Mb. In addition, presence of interstitial CNN-LOH was predictive of poor OS. Presence of CNAs (≥1) associated with poorer OS and PFS in the patients with myeloproliferative CMML. Overall, SNP-A analysis increased the diagnostic yield in patients with low risk cytogenetic features or uninformative CC and added prognostic value to this subset of patients. PMID:26509444

  19. The Use of High-Density SNP Array to Map Homozygosity in Consanguineous Families to Efficiently Identify Candidate Genes: Application to Woodhouse-Sakati Syndrome

    PubMed Central

    Sheridan, Molly B.; Wohler, Elizabeth; Batista, Denise A. S.; Applegate, Carolyn; Hoover-Fong, Julie

    2015-01-01

    Two consanguineous Qatari siblings presented for evaluation: a 17-4/12-year-old male with hypogonadotropic hypogonadism, alopecia, intellectual disability, and microcephaly and his 19-year-old sister with primary amenorrhea, alopecia, and normal cognition. Both required hormone treatment to produce secondary sex characteristics and pubertal development beyond Tanner 1. SNP array analysis of both probands was performed to detect shared regions of homozygosity which may harbor homozygous mutations in a gene causing their common features of abnormal pubertal development, alopecia, and variable cognitive delay. Our patients shared multiple homozygous genomic regions; ten shared regions were >1 Mb in length and constituted 0.99% of the genome. DCAF17, encoding a transmembrane nuclear protein of uncertain function, was the only gene identified in a homozygous region known to cause hypogonadotropic hypogonadism. DCAF17 mutations are associated with Woodhouse-Sakati syndrome, a rare disorder characterized by alopecia, hypogonadotropic hypogonadism, sensorineural hearing loss, diabetes mellitus, and extrapyramidal movements. Sequencing of the coding exons and flanking intronic regions of DCAF17 in the proband revealed homozygosity for a previously described founder mutation (c.436delC). Targeted DCAF17 sequencing of his affected sibling revealed the same homozygous mutation. This family illustrates the utility of SNP array testing in consanguineous families to efficiently and inexpensively identify regions of genomic homozygosity in which genetic candidates for recessive conditions can be identified. PMID:26664771

  20. Design and coverage of high throughput genotyping arrays optimized for individuals of East Asian, African American, and Latino race/ethnicity using imputation and a novel hybrid SNP selection algorithm

    PubMed Central

    Hoffmann, Thomas J.; Zhan, Yiping; Kvale, Mark N.; Hesselson, Stephanie E.; Gollub, Jeremy; Iribarren, Carlos; Lu, Yontao; Mei, Gangwu; Purdy, Matthew M.; Quesenberry, Charles; Rowell, Sarah; Shapero, Michael H.; Smethurst, David; Somkin, Carol P.; Van den Eeden, Stephen K.; Walter, Larry; Webster, Teresa; Whitmer, Rachel A.; Finn, Andrea; Schaefer, Catherine; Kwok, Pui-Yan; Risch, Neil

    2012-01-01

    Four custom Axiom genotyping arrays were designed for a genome-wide association (GWA) study of 100,000 participants from the Kaiser Permanente Research Program on Genes, Environment and Health. The array optimized for individuals of European race/ethnicity was previously described. Here we detail the development of three additional microarrays optimized for individuals of East Asian, African American, and Latino race/ethnicity. For these arrays, we decreased redundancy of high-performing SNPs to increase SNP capacity. The East Asian array was designed using greedy pairwise SNP selection. However, removing SNPs from the target set based on imputation coverage is more efficient than pairwise tagging. Therefore, we developed a novel hybrid SNP selection method for the African American and Latino arrays utilizing rounds of greedy pairwise SNP selection, followed by removal from the target set of SNPs covered by imputation. The arrays provide excellent genome-wide coverage and are valuable additions for large-scale GWA studies. PMID:21903159

  1. Genetic differentiation of brackish water populations of cod Gadus morhua in the southern Baltic, inferred from genotyping using SNP-arrays.

    PubMed

    Poćwierz-Kotus, A; Kijewska, A; Petereit, C; Bernaś, R; Więcaszek, B; Arnyasi, M; Lien, S; Kent, M P; Wenne, R

    2015-02-01

    The Baltic is a semi-enclosed sea characterised by decreasing salinity in the eastern and northern direction with only the deeper parts of the southern Baltic suitable as spawning grounds for marine species like cod. Baltic cod exhibits various adaptations to brackish water conditions, yet the inflow of salty North Sea water near the bottom remains an influence on the spawning success of the Baltic cod. The eastern Baltic population has been very weakly studied in comparison with the western population. The aim of this study is to demonstrate for the first time genetic differentiation by the use of a large number of SNPs between eastern and western Baltic populations existing in differentiated salinity conditions. Two cod samples were collected from the Bay of Gdańsk, Poland and one from the Kiel Bight, Germany. Samples were genotyped using a cod derived SNP-array (Illumina) with 10 913 SNPs. A selection of diagnostic SNPs was performed. A set of 7944 validated SNPs were analysed to assess the differentiation of three samples of cod. Results indicated a clear distinctness of the Kiel Bight from the populations of the eastern Baltic. FST comparison between both eastern samples was non-significant. Clustering analysis, principal coordinates analysis and assignment test clearly indicated that the eastern samples should be considered as one subpopulation, well differentiated from the western subpopulation. With the SNP approach, no differentiation between groups containing 'healthy' and 'non-healthy' cod individuals was observed. PMID:24910372

  2. A girl with incomplete Prader-Willi syndrome and negative MS-PCR, found to have mosaic maternal UPD-15 at SNP array.

    PubMed

    Morandi, Anita; Bonnefond, Amélie; Lobbens, Stéphane; Carotenuto, Marco; Del Giudice, Emanuele Miraglia; Froguel, Philippe; Maffeis, Claudio

    2015-11-01

    The Prader-Willi syndrome (PWS) is caused by lack of expression of paternal allele of the 15q11.2-q13 region, due to deletions at paternal 15q11.2-q13 (<70%), maternal uniparental disomy of chromosome 15 (mat-UPD 15) (30%) or imprinting defects (1%). Hyperphagia, intellectual disabilities/behavioral disorders, neonatal hypotonia, and hypogonadism are cardinal features for PWS. Methylation sensitive PCR (MS-PCR) of the SNRPN locus, which assesses the presence of both the unmethylated (paternal) and the methylated (maternal) allele of 15q11.2-q13, is considered a sensitive reference technique for PWS diagnosis regardless of genetic subtype. We describe a 17-year-old girl with severe obesity, short stature, and intellectual disability, without hypogonadism and history of neonatal hypotonia, who was suspected to have an incomplete PWS. The MS-PCR showed a normal pattern with similar maternal and paternal electrophoretic bands. Afterwards, a SNP array showed the presence of iso-UPD 15, that is, UPD15 with two copies of the same chromosome 15, in about 50% of cells, suggesting a diagnosis of partial PWS due to mosaic maternal iso-UPD15 arisen as rescue of a post-fertilization error. A quantitative methylation analysis confirmed the presence of mosaic UPD15 in about 50% of cells. We propose that complete clinical criteria for PWS and MS-PCR should not be considered sensitive in suspecting and diagnosing partial PWS due to mosaic UPD15. In contrast, clinical suspicion based on less restrictive criteria followed by SNP array is a more powerful approach to diagnose atypical PWS due to UPD15 mosaicism. PMID:26109092

  3. Increased Frequency of De Novo Copy Number Variations in Congenital Heart Disease by Integrative Analysis of SNP Array and Exome Sequence Data

    PubMed Central

    Rodriguez-Murillo, Laura; Fromer, Menachem; Mazaika, Erica; Vardarajan, Badri; Italia, Michael; Leipzig, Jeremy; DePalma, Steven R.; Golhar, Ryan; Sanders, Stephan J.; Yamrom, Boris; Ronemus, Michael; Iossifov, Ivan; Willsey, A. Jeremy; State, Matthew W.; Kaltman, Jonathan R.; White, Peter S.; Shen, Yufeng; Warburton, Dorothy; Brueckner, Martina; Seidman, Christine; Goldmuntz, Elizabeth; Gelb, Bruce D.; Lifton, Richard; Seidman, Jonathan; Hakonarson, Hakon; Chung, Wendy K.

    2014-01-01

    Rationale Congenital heart disease (CHD) is among the most common birth defects. Most cases are of unknown etiology. Objective To determine the contribution of de novo copy number variants (CNVs) in the etiology of sporadic CHD. Methods and Results We studied 538 CHD trios using genome-wide dense single nucleotide polymorphism (SNP) arrays and/or whole exome sequencing (WES). Results were experimentally validated using digital droplet PCR. We compared validated CNVs in CHD cases to CNVs in 1,301 healthy control trios. The two complementary high-resolution technologies identified 63 validated de novo CNVs in 51 CHD cases. A significant increase in CNV burden was observed when comparing CHD trios with healthy trios, using either SNP array (p=7x10−5, Odds Ratio (OR)=4.6) or WES data (p=6x10−4, OR=3.5) and remained after removing 16% of de novo CNV loci previously reported as pathogenic (p=0.02, OR=2.7). We observed recurrent de novo CNVs on 15q11.2 encompassing CYFIP1, NIPA1, and NIPA2 and single de novo CNVs encompassing DUSP1, JUN, JUP, MED15, MED9, PTPRE SREBF1, TOP2A, and ZEB2, genes that interact with established CHD proteins NKX2-5 and GATA4. Integrating de novo variants in WES and CNV data suggests that ETS1 is the pathogenic gene altered by 11q24.2-q25 deletions in Jacobsen syndrome and that CTBP2 is the pathogenic gene in 10q sub-telomeric deletions. Conclusions We demonstrate a significantly increased frequency of rare de novo CNVs in CHD patients compared with healthy controls and suggest several novel genetic loci for CHD. PMID:25205790

  4. GENOME-WIDE MAPPING OF COPY NUMBER VARIATIONS IN COMMERCIAL HYBRID PIGS USING A HIGH-DENSITY SNP GENOTYPING ARRAY.

    PubMed

    Zhou, L S; Li, J; Yang, J; Liu, C L; Xie, X H; He, Y N; Liu, X X; Xin, W S; Zhang, W C; Ren, J; Ma, J W; Huang, L S

    2016-01-01

    Copy number variations (CNVs) are important forms of structural variation in human and animals and can be considered as a major genetic component of phenotypic diversity. Here we used the Illumina PorcineSNP60 BeadChip V2 and a DLY [Duroc x (Large White x Landrace)] commercial hybrid population to identify 272 CNVs belonging to 165 CNV regions (CNVRs), of which 66 are new. As CNVRs are specific to origin of population, our DLY-specific data is an important complementary to the existing CNV map in the pig genome. Eight CNVRs were selected. for validation by quantitative real-time PCR (qRT-PCR) and the accurate rate was high (87.25%). Gene function analysis suggested that a common CNVR may play an important role in multiple traits, including growth rate and carcass quality. PMID:27183798

  5. A High-Resolution SNP Array-Based Linkage Map Anchors a New Domestic Cat Draft Genome Assembly and Provides Detailed Patterns of Recombination.

    PubMed

    Li, Gang; Hillier, LaDeana W; Grahn, Robert A; Zimin, Aleksey V; David, Victor A; Menotti-Raymond, Marilyn; Middleton, Rondo; Hannah, Steven; Hendrickson, Sher; Makunin, Alex; O'Brien, Stephen J; Minx, Pat; Wilson, Richard K; Lyons, Leslie A; Warren, Wesley C; Murphy, William J

    2016-01-01

    High-resolution genetic and physical maps are invaluable tools for building accurate genome assemblies, and interpreting results of genome-wide association studies (GWAS). Previous genetic and physical maps anchored good quality draft assemblies of the domestic cat genome, enabling the discovery of numerous genes underlying hereditary disease and phenotypes of interest to the biomedical science and breeding communities. However, these maps lacked sufficient marker density to order thousands of shorter scaffolds in earlier assemblies, which instead relied heavily on comparative mapping with related species. A high-resolution map would aid in validating and ordering chromosome scaffolds from existing and new genome assemblies. Here, we describe a high-resolution genetic linkage map of the domestic cat genome based on genotyping 453 domestic cats from several multi-generational pedigrees on the Illumina 63K SNP array. The final maps include 58,055 SNP markers placed relative to 6637 markers with unique positions, distributed across all autosomes and the X chromosome. Our final sex-averaged maps span a total autosomal length of 4464 cM, the longest described linkage map for any mammal, confirming length estimates from a previous microsatellite-based map. The linkage map was used to order and orient the scaffolds from a substantially more contiguous domestic cat genome assembly (Felis catus v8.0), which incorporated ∼20 × coverage of Illumina fragment reads. The new genome assembly shows substantial improvements in contiguity, with a nearly fourfold increase in N50 scaffold size to 18 Mb. We use this map to report probable structural errors in previous maps and assemblies, and to describe features of the recombination landscape, including a massive (∼50 Mb) recombination desert (of virtually zero recombination) on the X chromosome that parallels a similar desert on the porcine X chromosome in both size and physical location. PMID:27172201

  6. A High-Resolution SNP Array-Based Linkage Map Anchors a New Domestic Cat Draft Genome Assembly and Provides Detailed Patterns of Recombination

    PubMed Central

    Li, Gang; Hillier, LaDeana W.; Grahn, Robert A.; Zimin, Aleksey V.; David, Victor A.; Menotti-Raymond, Marilyn; Middleton, Rondo; Hannah, Steven; Hendrickson, Sher; Makunin, Alex; O’Brien, Stephen J.; Minx, Pat; Wilson, Richard K.; Lyons, Leslie A.; Warren, Wesley C.; Murphy, William J.

    2016-01-01

    High-resolution genetic and physical maps are invaluable tools for building accurate genome assemblies, and interpreting results of genome-wide association studies (GWAS). Previous genetic and physical maps anchored good quality draft assemblies of the domestic cat genome, enabling the discovery of numerous genes underlying hereditary disease and phenotypes of interest to the biomedical science and breeding communities. However, these maps lacked sufficient marker density to order thousands of shorter scaffolds in earlier assemblies, which instead relied heavily on comparative mapping with related species. A high-resolution map would aid in validating and ordering chromosome scaffolds from existing and new genome assemblies. Here, we describe a high-resolution genetic linkage map of the domestic cat genome based on genotyping 453 domestic cats from several multi-generational pedigrees on the Illumina 63K SNP array. The final maps include 58,055 SNP markers placed relative to 6637 markers with unique positions, distributed across all autosomes and the X chromosome. Our final sex-averaged maps span a total autosomal length of 4464 cM, the longest described linkage map for any mammal, confirming length estimates from a previous microsatellite-based map. The linkage map was used to order and orient the scaffolds from a substantially more contiguous domestic cat genome assembly (Felis catus v8.0), which incorporated ∼20 × coverage of Illumina fragment reads. The new genome assembly shows substantial improvements in contiguity, with a nearly fourfold increase in N50 scaffold size to 18 Mb. We use this map to report probable structural errors in previous maps and assemblies, and to describe features of the recombination landscape, including a massive (∼50 Mb) recombination desert (of virtually zero recombination) on the X chromosome that parallels a similar desert on the porcine X chromosome in both size and physical location. PMID:27172201

  7. SNP mistyping in genotyping arrays--an important cause of spurious association in case-control studies.

    PubMed

    Mitry, D; Campbell, H; Charteris, D G; Fleck, B W; Tenesa, A; Dunlop, M G; Hayward, C; Wright, A F; Vitart, V

    2011-07-01

    Using genome-wide association studies to identify genetic variants contributing to disease has been highly successful with many novel genetic predispositions identified and biological pathways revealed. Several pitfalls for spurious association or non-replication have been highlighted: from population structure, automated genotype scoring for cases and controls, to age-varying association. We describe an important yet unreported source of bias in case-control studies due to variations in chip technology between different commercial array releases. As cases are commonly genotyped with newer arrays and freely available control resources are frequently used for comparison, there exists an important potential for false associations which are robust to standard quality control and replication design. PMID:21254221

  8. Genome-wide detection of CNVs in Chinese indigenous sheep with different types of tails using ovine high-density 600K SNP arrays

    PubMed Central

    Zhu, Caiye; Fan, Hongying; Yuan, Zehu; Hu, Shijin; Ma, Xiaomeng; Xuan, Junli; Wang, Hongwei; Zhang, Li; Wei, Caihong; Zhang, Qin; Zhao, Fuping; Du, Lixin

    2016-01-01

    Chinese indigenous sheep can be classified into three types based on tail morphology: fat-tailed, fat-rumped, and thin-tailed sheep, of which the typical breeds are large-tailed Han sheep, Altay sheep, and Tibetan sheep, respectively. To unravel the genetic mechanisms underlying the phenotypic differences among Chinese indigenous sheep with tails of three different types, we used ovine high-density 600K SNP arrays to detect genome-wide copy number variation (CNV). In large-tailed Han sheep, Altay sheep, and Tibetan sheep, 371, 301, and 66 CNV regions (CNVRs) with lengths of 71.35 Mb, 51.65 Mb, and 10.56 Mb, respectively, were identified on autosomal chromosomes. Ten CNVRs were randomly chosen for confirmation, of which eight were successfully validated. The detected CNVRs harboured 3130 genes, including genes associated with fat deposition, such as PPARA, RXRA, KLF11, ADD1, FASN, PPP1CA, PDGFA, and PEX6. Moreover, multilevel bioinformatics analyses of the detected candidate genes were significantly enriched for involvement in fat deposition, GTPase regulator, and peptide receptor activities. This is the first high-resolution sheep CNV map for Chinese indigenous sheep breeds with three types of tails. Our results provide valuable information that will support investigations of genomic structural variation underlying traits of interest in sheep. PMID:27282145

  9. Genome-wide detection of CNVs in Chinese indigenous sheep with different types of tails using ovine high-density 600K SNP arrays.

    PubMed

    Zhu, Caiye; Fan, Hongying; Yuan, Zehu; Hu, Shijin; Ma, Xiaomeng; Xuan, Junli; Wang, Hongwei; Zhang, Li; Wei, Caihong; Zhang, Qin; Zhao, Fuping; Du, Lixin

    2016-01-01

    Chinese indigenous sheep can be classified into three types based on tail morphology: fat-tailed, fat-rumped, and thin-tailed sheep, of which the typical breeds are large-tailed Han sheep, Altay sheep, and Tibetan sheep, respectively. To unravel the genetic mechanisms underlying the phenotypic differences among Chinese indigenous sheep with tails of three different types, we used ovine high-density 600K SNP arrays to detect genome-wide copy number variation (CNV). In large-tailed Han sheep, Altay sheep, and Tibetan sheep, 371, 301, and 66 CNV regions (CNVRs) with lengths of 71.35 Mb, 51.65 Mb, and 10.56 Mb, respectively, were identified on autosomal chromosomes. Ten CNVRs were randomly chosen for confirmation, of which eight were successfully validated. The detected CNVRs harboured 3130 genes, including genes associated with fat deposition, such as PPARA, RXRA, KLF11, ADD1, FASN, PPP1CA, PDGFA, and PEX6. Moreover, multilevel bioinformatics analyses of the detected candidate genes were significantly enriched for involvement in fat deposition, GTPase regulator, and peptide receptor activities. This is the first high-resolution sheep CNV map for Chinese indigenous sheep breeds with three types of tails. Our results provide valuable information that will support investigations of genomic structural variation underlying traits of interest in sheep. PMID:27282145

  10. SNP array and phenotype correlation shows that FLI1 deletion per se is not responsible for thrombocytopenia development in Jacobsen syndrome.

    PubMed

    Trkova, Marie; Becvarova, Vera; Hynek, Martin; Hnykova, Lenka; Hlavova, Eva; Kreckova, Gabriela; Kulovany, Eduard; Cutka, David; Zatloukalova, Jitka; Markova, Kristyna; Sukova, Martina; Horacek, Jiri; Stejskal, David

    2012-10-01

    Jacobsen syndrome (JBS) is a rare chromosomal disorder caused by terminal deletion of the long arm of chromosome 11. We report on four prenatally diagnosed patients with JBS with variable prenatal and postnatal phenotypes and 11q deletions of varying sizes. Precise characterization of the deleted region in three patients was performed by SNP arrays. The severity of both the prenatal and postnatal phenotypes did not correlate with the size of the haploinsufficient region. Despite the large difference in the deletion size (nearly 6 Mb), both of the live-born patients had similar phenotypes corresponding to JBS. However, one of the most prominent features of JBS, thrombocytopenia, was only present in the live-born boy. The girl, who had a significantly longer deletion spanning all four genes suspected of being causative of JBS-related thrombocytopenia (FLI1, ETS1, NFRKB, and JAM3), did not manifest a platelet phenotype. Therefore, our findings do not support the traditional view of deletion size correlation in JBS or the causative role of FLI1, ETS1, NFRKB, and JAM3 deletion per se for the development of disease-related thrombocytopenia. PMID:22887642

  11. Comprehensive Genomic Characterization of Cutaneous Malignant Melanoma Cell Lines Derived from Metastatic Lesions by Whole-Exome Sequencing and SNP Array Profiling

    PubMed Central

    Cifola, Ingrid; Pietrelli, Alessandro; Consolandi, Clarissa; Severgnini, Marco; Mangano, Eleonora; Russo, Vincenzo; De Bellis, Gianluca; Battaglia, Cristina

    2013-01-01

    Cutaneous malignant melanoma is the most fatal skin cancer and although improved comprehension of its pathogenic pathways allowed to realize some effective molecular targeted therapies, novel targets and drugs are still needed. Aiming to add genetic information potentially useful for novel targets discovery, we performed an extensive genomic characterization by whole-exome sequencing and SNP array profiling of six cutaneous melanoma cell lines derived from metastatic patients. We obtained a total of 3,325 novel coding single nucleotide variants, including 2,172 non-synonymous variants. We catalogued the coding mutations according to Sanger COSMIC database and to a manually curated list including genes involved in melanoma pathways identified by mining recent literature. Besides confirming the presence of known melanoma driver mutations (BRAFV600E, NRASQ61R), we identified novel mutated genes involved in signalling pathways crucial for melanoma pathogenesis and already addressed by current targeted therapies (such as MAPK and glutamate pathways). We also identified mutations in four genes (MUC19, PAICS, RBMXL1, KIF23) never reported in melanoma, which might deserve further investigations. All data are available to the entire research community in our Melanoma Exome Database (at https://155.253.6.64/MExDB/). In summary, these cell lines are valuable biological tools to improve the genetic comprehension of this complex cancer disease and to study functional relevance of individual mutational events, and these findings could provide insights potentially useful for identification of novel therapeutic targets for cutaneous malignant melanoma. PMID:23704925

  12. High-density SNP assay development for genetic analysis in maritime pine (Pinus pinaster).

    PubMed

    Plomion, C; Bartholomé, J; Lesur, I; Boury, C; Rodríguez-Quilón, I; Lagraulet, H; Ehrenmann, F; Bouffier, L; Gion, J M; Grivet, D; de Miguel, M; de María, N; Cervera, M T; Bagnoli, F; Isik, F; Vendramin, G G; González-Martínez, S C

    2016-03-01

    Maritime pine provides essential ecosystem services in the south-western Mediterranean basin, where it covers around 4 million ha. Its scattered distribution over a range of environmental conditions makes it an ideal forest tree species for studies of local adaptation and evolutionary responses to climatic change. Highly multiplexed single nucleotide polymorphism (SNP) genotyping arrays are increasingly used to study genetic variation in living organisms and for practical applications in plant and animal breeding and genetic resource conservation. We developed a 9k Illumina Infinium SNP array and genotyped maritime pine trees from (i) a three-generation inbred (F2) pedigree, (ii) the French breeding population and (iii) natural populations from Portugal and the French Atlantic coast. A large proportion of the exploitable SNPs (2052/8410, i.e. 24.4%) segregated in the mapping population and could be mapped, providing the densest ever gene-based linkage map for this species. Based on 5016 SNPs, natural and breeding populations from the French gene pool exhibited similar level of genetic diversity. Population genetics and structure analyses based on 3981 SNP markers common to the Portuguese and French gene pools revealed high levels of differentiation, leading to the identification of a set of highly differentiated SNPs that could be used for seed provenance certification. Finally, we discuss how the validated SNPs could facilitate the identification of ecologically and economically relevant genes in this species, improving our understanding of the demography and selective forces shaping its natural genetic diversity, and providing support for new breeding strategies. PMID:26358548

  13. SNP array analysis of tyrosine kinase inhibitor-resistant chronic myeloid leukemia identifies heterogeneous secondary genomic alterations

    PubMed Central

    Müschen, Markus; Kato, Motohiro; Kawamata, Norihiko; Meixel, Antonie; Nowak, Verena; Kim, Han S.; Kang, Sharon; Paquette, Ronald; Chang, Mi-Sook; Thoenissen, Nils H.; Mossner, Max; Hofmann, Wolf-Karsten; Kohlmann, Alexander; Weiss, Tamara; Haferlach, Torsten; Haferlach, Claudia; Koeffler, H. Phillip

    2010-01-01

    To elucidate whether tyrosine kinase inhibitor (TKI) resistance in chronic myeloid leukemia is associated with characteristic genomic alterations, we analyzed DNA samples from 45 TKI-resistant chronic myeloid leukemia patients with 250K single nucleotide polymorphism arrays. From 20 patients, matched serial samples of pretreatment and TKI resistance time points were available. Eleven of the 45 TKI-resistant patients had mutations of BCR-ABL1, including 2 T315I mutations. Besides known TKI resistance-associated genomic lesions, such as duplication of the BCR-ABL1 gene (n = 8) and trisomy 8 (n = 3), recurrent submicroscopic alterations, including acquired uniparental disomy, were detectable on chromosomes 1, 8, 9, 17, 19, and 22. On chromosome 22, newly acquired and recurrent deletions of the IGLC1 locus were detected in 3 patients, who had previously presented with lymphoid or myeloid blast crisis. This may support a hypothesis of TKI-induced selection of subclones differentiating into immature B-cell progenitors as a mechanism of disease progression and evasion of TKI sensitivity. PMID:19965645

  14. Use of SNP array analysis to identify a novel TRIM32 mutation in limb-girdle muscular dystrophy type 2H.

    PubMed

    Cossée, Mireille; Lagier-Tourenne, Clotilde; Seguela, Claire; Mohr, Michel; Leturcq, France; Gundesli, Hulya; Chelly, Jamel; Tranchant, Christine; Koenig, Michel; Mandel, Jean-Louis

    2009-04-01

    Molecular diagnosis of monogenic diseases with high genetic heterogeneity is usually challenging. In the case of limb-girdle muscular dystrophy, multiplex Western blot analysis is a very useful initial step, but that often fails to identify the primarily affected protein. We report how homozygosity analysis using a genome-wide SNP array allowed us to solve the diagnostic enigma in a patient with a moderate form of LGMD, born from consanguineous parents. The genome-wide scan performed on the patient's DNA revealed several regions of homozygosity, that were compared to the location of known LGMD genes. One such region indeed contained the TRIM32 gene. This gene was previously found mutated in families with limb-girdle muscular dystrophy type 2H (LGMD2H), a mild autosomal recessive myopathy described in Hutterite populations and in 4 patients with a diagnosis of sarcotubular myopathy. A single missense mutation was found in all these patients, located in a conserved domain of the C-terminal part of the protein. Another missense mutation affecting the N-terminal part of TRIM32, observed in a single consanguineous Bedouin family, was reported to cause the phenotypically unrelated and genetically heterogeneous Bardet-Biedl syndrome, defining the BBS11 locus. Sequencing of TRIM32 in our patient revealed a distal frameshift mutation, c.1753_1766dup14 (p.Ile590Leu fsX38). Together with two recently reported mutations, this novel mutation confirms that integrity of the C-terminal domain of TRIM32 is necessary for muscle maintenance. PMID:19303295

  15. 77 FR 67379 - Draft Guidance for Industry and Food and Drug Administration Staff; Highly Multiplexed...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-11-09

    ...The Food and Drug Administration (FDA) is announcing the availability of the draft guidance entitled ``Highly Multiplexed Microbiological/Medical Countermeasure In Vitro Nucleic Acid Based Diagnostic Devices.'' This draft guidance is to provide industry and Agency staff with recommendations for studies to establish the analytical and clinical performance of highly multiplexed......

  16. Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality.

    PubMed

    Ali, Shahin S; Shao, Jonathan; Strem, Mary D; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W; Bailey, Bryan A

    2015-01-01

    Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri. PMID:26379633

  17. Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality

    PubMed Central

    Ali, Shahin S.; Shao, Jonathan; Strem, Mary D.; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W.; Bailey, Bryan A.

    2015-01-01

    Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri. PMID:26379633

  18. 76 FR 48169 - Advancing Regulatory Science for Highly Multiplexed Microbiology/Medical Countermeasure Devices...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-08-08

    ... Microbiology/ Medical Countermeasure Devices; Public Meeting AGENCY: Food and Drug Administration, HHS. ACTION... following public meeting: ``Advancing Regulatory Science for Highly Multiplexed Microbiology/Medical... multiplexed microbiology/medical countermeasure (MCM) devices, their clinical application and public...

  19. Identifying chromosomal selection-sweep regions in facial eczema selection-line animals using an ovine 50K-SNP array.

    PubMed

    Phua, S H; Brauning, R; Baird, H J; Dodds, K G

    2014-04-01

    Facial eczema (FE) is a hepato-mycotoxicosis found mainly in New Zealand sheep and cattle. When genetics was found to be a factor in FE susceptibility, resistant and susceptible selection lines of Romney sheep were established to enable further investigations of this disease trait. Using the Illumina OvineSNP50 BeadChip, we conducted a selection-sweep experiment on these FE genetic lines. Two analytical methods were used to detect selection signals, namely the Peddrift test (Dodds & McEwan, 1997) and fixation index FST (Weir & Hill, 2002). Of 50 975 single nucleotide polymorphism (SNP) markers tested, there were three that showed highly significant allele frequency differences between the resistant and susceptible animals (Peddrift nominal P < 0.000001). These SNP loci are located on chromosomes OAR1, OAR11 and OAR12 that coincide precisely with the three highest genomic FST peaks. In addition, there are nine less significant Peddrift SNPs (nominal P ≤ 0.000009) on OAR6 (n = 2), OAR9 (n = 2), OAR12, OAR19 (n = 2), OAR24 and OAR26. In smoothed FST (five-SNP moving average) plots, the five most prominent peaks are on OAR1, OAR6, OAR7, OAR13 and OAR19. Although these smoothed FST peaks do not coincide with the three most significant Peddrift SNP loci, two (on OAR6 and OAR19) overlap with the set of less significant Peddrift SNPs above. Of these 12 Peddrift SNPs and five smoothed FST regions, none is close to the FE candidate genes catalase and ABCG2; however, two on OAR1 and one on OAR13 fall within suggestive quantitative trait locus regions identified in a previous genome screen experiment. The present studies indicated that there are at least eight genomic regions that underwent a selection sweep in the FE lines. PMID:24521158

  20. Ploidy status and copy number aberrations in primary glioblastomas defined by integrated analysis of allelic ratios, signal ratios and loss of heterozygosity using 500K SNP Mapping Arrays

    PubMed Central

    Gardina, Paul J; Lo, Ken C; Lee, Walter; Cowell, John K; Turpaz, Yaron

    2008-01-01

    Background Genomic hybridization platforms, including BAC-CGH and genotyping arrays, have been used to estimate chromosome copy number (CN) in tumor samples by detecting the relative strength of genomic signal. The methods rely on the assumption that the predominant chromosomal background of the samples is diploid, an assumption that is frequently incorrect for tumor samples. In addition to generally greater resolution, an advantage of genotyping arrays over CGH arrays is the ability to detect signals from individual alleles, allowing estimation of loss-of-heterozygosity (LOH) and allelic ratios to enhance the interpretation of copy number alterations. Copy number events associated with LOH potentially have the same genetic consequences as deletions. Results We have utilized allelic ratios to detect patterns that are indicative of higher ploidy levels. An integrated analysis using allelic ratios, total signal and LOH indicates that many or most of the chromosomes from 24 glioblastoma tumors are in fact aneuploid. Some putative whole-chromosome losses actually represent trisomy, and many apparent sub-chromosomal losses are in fact relative losses against a triploid or tetraploid background. Conclusion These results suggest a re-interpretation of previous findings based only on total signal ratios. One interesting observation is that many single or multiple-copy deletions occur at common putative tumor suppressor sites subsequent to chromosomal duplication; these losses do not necessarily result in LOH, but nonetheless occur in conspicuous patterns. The 500 K Mapping array was also capable of detecting many sub-mega base losses and gains that were overlooked by CGH-BAC arrays, and was superior to CGH-BAC arrays in resolving regions of complex CN variation. PMID:18928532

  1. SNP-VISTA

    SciTech Connect

    Shah, Nameeta; Teplitsky, Michael; Minovitsky, Simon; Dubchak, Inna

    2005-11-07

    SNP-VISTA aids in analyses of the following types of data: A. Large-scale re-sequence data of disease-related genes for discovery of associated and/or causative alleles (GeneSNP-VISTA). B. Massive amounts of ecogenomics data for studying homologous recombination in microbial populations (EcoSNP-VISTA). The main features and capabilities of SNP-VISTA are: 1) Mapping of SNPs to gene structure; 2) classification of SNPs, based on their location in the gene, frequency of occurrence in samples and allele composition; 3) clustering, based on user-defined subsets of SNPs, highlighting haplotypes as well as recombinant sequences; 4) integration of protein conservation visualization; and 5) display of automatically calculated recombination points that are user-editable. The main strength of SNP-VISTA is its graphical interface and use of visual representations, which support interactive exploration and hence better understanding of large-scale SNPs data.

  2. Development of a 63K SNP array for Gossypium and high-density mapping of intra- and inter-specific populations of cotton (G. hirsutum L.)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    High-throughput genotyping arrays provide a standardized resource for crop research communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), candidate marker and quantitative trait loci (QTL) ide...

  3. Lipids, obesity and gallbladder disease in women: insights from genetic studies using the cardiovascular gene-centric 50K SNP array

    PubMed Central

    Rodriguez, Santiago; Gaunt, Tom R; Guo, Yiran; Zheng, Jie; Barnes, Michael R; Tang, Weihang; Danish, Fazal; Johnson, Andrew; Castillo, Berta A; Li, Yun R; Hakonarson, Hakon; Buxbaum, Sarah G; Palmer, Tom; Tsai, Michael Y; Lange, Leslie A; Ebrahim, Shah; Davey Smith, George; Lawlor, Debbie A; Folsom, Aaron R; Hoogeveen, Ron; Reiner, Alex; Keating, Brendan; Day, Ian NM

    2016-01-01

    Gallbladder disease (GBD) has an overall prevalence of 10–40% depending on factors such as age, gender, population, obesity and diabetes, and represents a major economic burden. Although gallstones are composed of cholesterol by-products and are associated with obesity, presumed causal pathways remain unproven, although BMI reduction is typically recommended. We performed genetic studies to discover candidate genes and define pathways involved in GBD. We genotyped 15 241 women of European ancestry from three cohorts, including 3216 with GBD, using the Human cardiovascular disease (HumanCVD) BeadChip containing up to ~53 000 single-nucleotide polymorphisms (SNPs). Effect sizes with P-values for development of GBD were generated. We identify two new loci associated with GBD, GCKR rs1260326:T>C (P=5.88 × 10−7, ß=−0.146) and TTC39B rs686030:C>A (P=6.95x10−7, ß=0.271) and detect four independent SNP effects in ABCG8 rs4953023:G>A (P=7.41 × 10−47, ß=0.734), ABCG8 rs4299376:G>T (P=2.40 × 10−18, ß=0.278), ABCG5 rs6544718:T>C (P=2.08 × 10−14, ß=0.044) and ABCG5 rs6720173:G>C (P=3.81 × 10−12, ß=0.262) in conditional analyses taking genotypes of rs4953023:G>A as a covariate. We also delineate the risk effects among many genotypes known to influence lipids. These data, from the largest GBD genetic study to date, show that specific, mainly hepatocyte-centred, components of lipid metabolism are important to GBD risk in women. We discuss the potential pharmaceutical implications of our findings. PMID:25920552

  4. SNP-VISTA

    Energy Science and Technology Software Center (ESTSC)

    2005-11-07

    SNP-VISTA aids in analyses of the following types of data: A. Large-scale re-sequence data of disease-related genes for discovery of associated and/or causative alleles (GeneSNP-VISTA). B. Massive amounts of ecogenomics data for studying homologous recombination in microbial populations (EcoSNP-VISTA). The main features and capabilities of SNP-VISTA are: 1) Mapping of SNPs to gene structure; 2) classification of SNPs, based on their location in the gene, frequency of occurrence in samples and allele composition; 3) clustering,more » based on user-defined subsets of SNPs, highlighting haplotypes as well as recombinant sequences; 4) integration of protein conservation visualization; and 5) display of automatically calculated recombination points that are user-editable. The main strength of SNP-VISTA is its graphical interface and use of visual representations, which support interactive exploration and hence better understanding of large-scale SNPs data.« less

  5. High resolution SNP array genomic profiling of peripheral T cell lymphomas, not otherwise specified, identifies a subgroup with chromosomal aberrations affecting the REL locus.

    PubMed

    Hartmann, Sylvia; Gesk, Stefan; Scholtysik, René; Kreuz, Markus; Bug, Stefanie; Vater, Inga; Döring, Claudia; Cogliatti, Sergio; Parrens, Marie; Merlio, Jean-Philippe; Kwiecinska, Anna; Porwit, Anna; Piccaluga, Pier Paolo; Pileri, Stefano; Hoefler, Gerald; Küppers, Ralf; Siebert, Reiner; Hansmann, Martin-Leo

    2010-02-01

    Little is known about genomic aberrations in peripheral T cell lymphoma, not otherwise specified (PTCL NOS). We studied 47 PTCL NOS by 250k GeneChip single nucleotide polymorphism arrays and detected genomic imbalances in 22 of the cases. Recurrent gains and losses were identified, including gains of chromosome regions 1q32-43, 2p15-16, 7, 8q24, 11q14-25, 17q11-21 and 21q11-21 (> or = 5 cases each) as well as losses of chromosome regions 1p35-36, 5q33, 6p22, 6q16, 6q21-22, 8p21-23, 9p21, 10p11-12, 10q11-22, 10q25-26, 13q14, 15q24, 16q22, 16q24, 17p11, 17p13 and Xp22 (> or = 4 cases each). Genomic imbalances affected several regions containing members of nuclear factor-kappaB signalling and genes involved in cell cycle control. Gains of 2p15-16 were confirmed in each of three cases analysed by fluorescence in situ hybridization (FISH) and were associated with breakpoints at the REL locus in two of these cases. Three additional cases with gains of the REL locus were detected by FISH among 18 further PTCL NOS. Five of 27 PTCL NOS investigated showed nuclear expression of the REL protein by immunohistochemistry, partly associated with genomic gains of the REL locus. Therefore, in a subgroup of PTCL NOS gains/rearrangements of REL and expression of REL protein may be of pathogenetic relevance. PMID:19863542

  6. New aQTL SNPs for the CYP2D6 Identified by a Novel Mediation Analysis of Genome-Wide SNP Arrays, Gene Expression Arrays, and CYP2D6 Activity

    PubMed Central

    Wang, Zhiping; Boustani, Malaz; Liu, Yunlong; Skaar, Todd; Li, Lang

    2013-01-01

    Background. The genome-wide association studies (GWAS) have been successful during the last few years. A key challenge is that the interpretation of the results is not straightforward, especially for transacting SNPs. Integration of transcriptome data into GWAS may provide clues elucidating the mechanisms by which a genetic variant leads to a disease. Methods. Here, we developed a novel mediation analysis approach to identify new expression quantitative trait loci (eQTL) driving CYP2D6 activity by combining genotype, gene expression, and enzyme activity data. Results. 389,573 and 1,214,416 SNP-transcript-CYP2D6 activity trios are found strongly associated (P < 10−5, FDR = 16.6% and 11.7%) for two different genotype platforms, namely, Affymetrix and Illumina, respectively. The majority of eQTLs are trans-SNPs. A single polymorphism leads to widespread downstream changes in the expression of distant genes by affecting major regulators or transcription factors (TFs), which would be visible as an eQTL hotspot and can lead to large and consistent biological effects. Overlapped eQTL hotspots with the mediators lead to the discovery of 64 TFs. Conclusions. Our mediation analysis is a powerful approach in identifying the trans-QTL-phenotype associations. It improves our understanding of the functional genetic variations for the liver metabolism mechanisms. PMID:24232670

  7. Development of a novel depth of interaction PET detector using highly multiplexed G-APD cross-strip encoding

    SciTech Connect

    Kolb, A. Parl, C.; Liu, C. C.; Pichler, B. J.; Mantlik, F.; Lorenz, E.; Renker, D.

    2014-08-15

    0.15 mm. Conclusions: The novel DoI PET detector, which is based on strip G-APD arrays, yielded a DoI resolution of 2.9 mm and excellent timing and energy resolution. Its high multiplexing factor reduces the number of electronic channels. Thus, this cross-strip approach enables low-cost, high-performance PET detectors for dedicated small animal PET and PET/MRI and potentially clinical PET/MRI systems.

  8. Delineation of a de novo 7q21.3q31.1 Deletion by CGH-SNP Arrays in a Girl with Multiple Congenital Anomalies Including Severe Glaucoma.

    PubMed

    Martínez-Jacobo, L; Córdova-Fletes, C; Ortiz-López, R; Rivas, F; Saucedo-Carrasco, C; Rojas-Martínez, A

    2013-09-01

    In this study, we present a female patient with a constitutional de novo deletion in 7q21.3q31.1 as determined by G-banding and CGH-SNP arrays. She exhibited, among other features, psychomotor retardation, congenital severe bilateral glaucoma, a cleft palate, and heart defect. Microarray assay disclosed a deleted 12.5-Mb region roughly 88 kb downstream the ectrodactyly critical region; thus, the patient's final karyotype was 46,XX.arr 7q21.3q31.1(96,742,140-109,246,085)×1 dn. This girl represents the fourth patient described so far with congenital glaucoma and a deletion encompassing or overlapping the 7q21.3q31.1 region, and confirms the presence of a locus or loci related to such a clinical feature. According to our results, the proneness to ocular defects secondary to 7q intermediate deletions could be caused by co-deletion of TAC1, HBP1, and a small cluster of cytochrome P450 genes (subfamily 3A). This conclusion is supported by their functional roles and expression locations as well as because TAC1 is related to the functional pathway of the MYOC gene whose mutations are linked to glaucoma. Moreover, given that this girl is clinically reminiscent of several phenotypes related to diverse deletions within 7q21q32, our results and observations offer a general overview of the gene content of deletions/phenotypes overlapping 7q21.3q31.1 and confirm that loci distal to DLX genes including the CUX1 gene and potential regulatory elements downstream from DLX5 are unrelated to ectrodactyly. PMID:24167464

  9. Delineation of a de novo 7q21.3q31.1 Deletion by CGH-SNP Arrays in a Girl with Multiple Congenital Anomalies Including Severe Glaucoma

    PubMed Central

    Martínez-Jacobo, L.; Córdova-Fletes, C.; Ortiz-López, R.; Rivas, F.; Saucedo-Carrasco, C.; Rojas-Martínez, A.

    2013-01-01

    In this study, we present a female patient with a constitutional de novo deletion in 7q21.3q31.1 as determined by G-banding and CGH-SNP arrays. She exhibited, among other features, psychomotor retardation, congenital severe bilateral glaucoma, a cleft palate, and heart defect. Microarray assay disclosed a deleted 12.5-Mb region roughly 88 kb downstream the ectrodactyly critical region; thus, the patient's final karyotype was 46,XX.arr 7q21.3q31.1(96,742,140-109,246,085)×1 dn. This girl represents the fourth patient described so far with congenital glaucoma and a deletion encompassing or overlapping the 7q21.3q31.1 region, and confirms the presence of a locus or loci related to such a clinical feature. According to our results, the proneness to ocular defects secondary to 7q intermediate deletions could be caused by co-deletion of TAC1, HBP1, and a small cluster of cytochrome P450 genes (subfamily 3A). This conclusion is supported by their functional roles and expression locations as well as because TAC1 is related to the functional pathway of the MYOC gene whose mutations are linked to glaucoma. Moreover, given that this girl is clinically reminiscent of several phenotypes related to diverse deletions within 7q21q32, our results and observations offer a general overview of the gene content of deletions/phenotypes overlapping 7q21.3q31.1 and confirm that loci distal to DLX genes including the CUX1 gene and potential regulatory elements downstream from DLX5 are unrelated to ectrodactyly. PMID:24167464

  10. SNP panels/Imputation

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Participants from thirteen countries discussed services that Interbull can perform or recommendations that Interbull can make to promote harmonization and assist member countries in improving their genomic evaluations in regard to SNP panels and imputation. The panel recommended: A mechanism to shar...

  11. Nested Patch PCR enables highly multiplexed mutation discovery in candidate genes

    PubMed Central

    Varley, Katherine Elena; Mitra, Robi David

    2008-01-01

    Medical resequencing of candidate genes in individual patient samples is becoming increasingly important in the clinic and in clinical research. Medical resequencing requires the amplification and sequencing of many candidate genes in many patient samples. Here we introduce Nested Patch PCR, a novel method for highly multiplexed PCR that is very specific, can sensitively detect SNPs and mutations, and is easy to implement. This is the first method that couples multiplex PCR with sample-specific DNA barcodes and next-generation sequencing to enable highly multiplex mutation discovery in candidate genes for multiple samples in parallel. In our pilot study, we amplified exons from colon cancer and matched normal human genomic DNA. From each sample, we successfully amplified 96% (90 of 94) targeted exons from across the genome, totaling 21.6 kbp of sequence. Ninety percent of all sequencing reads were from targeted exons, demonstrating that Nested Patch PCR is highly specific. We found that the abundance of reads per exon was reproducible across samples. We reliably detected germline SNPs and discovered a colon tumor specific nonsense mutation in APC, a gene causally implicated in colorectal cancer. With Nested Patch PCR, candidate gene mutation discovery across multiple individual patient samples can now utilize the power of second-generation sequencing. PMID:18849522

  12. KinSNP software for homozygosity mapping of disease genes using SNP microarrays.

    PubMed

    Amir, El-Ad David; Bartal, Ofer; Morad, Efrat; Nagar, Tal; Sheynin, Jony; Parvari, Ruti; Chalifa-Caspi, Vered

    2010-08-01

    Consanguineous families affected with a recessive genetic disease caused by homozygotisation of a mutation offer a unique advantage for positional cloning of rare diseases. Homozygosity mapping of patient genotypes is a powerful technique for the identification of the genomic locus harbouring the causing mutation. This strategy relies on the observation that in these patients a large region spanning the disease locus is also homozygous with high probability. The high marker density in single nucleotide polymorphism (SNP) arrays is extremely advantageous for homozygosity mapping. We present KinSNP, a user-friendly software tool for homozygosity mapping using SNP arrays. The software searches for stretches of SNPs which are homozygous to the same allele in all ascertained sick individuals. User-specified parameters control the number of allowed genotyping 'errors' within homozygous blocks. Candidate disease regions are then reported in a detailed, coloured Excel file, along with genotypes of family members and healthy controls. An interactive genome browser has been included which shows homozygous blocks, individual genotypes, genes and further annotations along the chromosomes, with zooming and scrolling capabilities. The software has been used to identify the location of a mutated gene causing insensitivity to pain in a large Bedouin family. KinSNP is freely available from. PMID:20846928

  13. Highly multiplexed simultaneous detection of RNAs and proteins in single cells

    PubMed Central

    Frei, Andreas P; Bava, Felice Alessio; Zunder, Eli R; Hsieh, Elena WY; Chen, Shih-Yu; Nolan, Garry P; Gherardini, Pier Federico

    2016-01-01

    Precise gene expression measurement has been fundamental to developing an advanced understanding of the roles of biological networks in health and disease. To enable detection of expression signatures specific to individual cells we developed PLAYR (Proximity Ligation Assay for RNA). PLAYR enables highly multiplexed quantification of transcripts in single cells by flow- and mass-cytometry and is compatible with standard antibody staining of proteins. With mass cytometry, this currently enables simultaneous quantification of more than 40 different mRNAs and proteins. The technology was demonstrated in primary cells to be capable of quantifying multiple gene expression transcripts while the identity and the functional state of each analyzed cell was defined based on the expression of other transcripts or proteins. PLAYR now enables high throughput deep phenotyping of cells to readily expand beyond protein epitopes to include RNA expression, thereby opening a new venue on the characterization of cellular metabolism. PMID:26808670

  14. Evaluation of Two Highly-Multiplexed Custom Panels for Massively Parallel Semiconductor Sequencing on Paraffin DNA

    PubMed Central

    Kotoula, Vassiliki; Lyberopoulou, Aggeliki; Papadopoulou, Kyriaki; Charalambous, Elpida; Alexopoulou, Zoi; Gakou, Chryssa; Lakis, Sotiris; Tsolaki, Eleftheria; Lilakos, Konstantinos; Fountzilas, George

    2015-01-01

    Background—Aim Massively parallel sequencing (MPS) holds promise for expanding cancer translational research and diagnostics. As yet, it has been applied on paraffin DNA (FFPE) with commercially available highly multiplexed gene panels (100s of DNA targets), while custom panels of low multiplexing are used for re-sequencing. Here, we evaluated the performance of two highly multiplexed custom panels on FFPE DNA. Methods Two custom multiplex amplification panels (B, 373 amplicons; T, 286 amplicons) were coupled with semiconductor sequencing on DNA samples from FFPE breast tumors and matched peripheral blood samples (n samples: 316; n libraries: 332). The two panels shared 37% DNA targets (common or shifted amplicons). Panel performance was evaluated in paired sample groups and quartets of libraries, where possible. Results Amplicon read ratios yielded similar patterns per gene with the same panel in FFPE and blood samples; however, performance of common amplicons differed between panels (p<0.001). FFPE genotypes were compared for 1267 coding and non-coding variant replicates, 999 out of which (78.8%) were concordant in different paired sample combinations. Variant frequency was highly reproducible (Spearman’s rho 0.959). Repeatedly discordant variants were of high coverage / low frequency (p<0.001). Genotype concordance was (a) high, for intra-run duplicates with the same panel (mean±SD: 97.2±4.7, 95%CI: 94.8–99.7, p<0.001); (b) modest, when the same DNA was analyzed with different panels (mean±SD: 81.1±20.3, 95%CI: 66.1–95.1, p = 0.004); and (c) low, when different DNA samples from the same tumor were compared with the same panel (mean±SD: 59.9±24.0; 95%CI: 43.3–76.5; p = 0.282). Low coverage / low frequency variants were validated with Sanger sequencing even in samples with unfavourable DNA quality. Conclusions Custom MPS may yield novel information on genomic alterations, provided that data evaluation is adjusted to tumor tissue FFPE DNA. To this

  15. Analysis of genetic diversity using SNP markers in oat

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A large-scale single nucleotide polymorphism (SNP) discovery was carried out in cultivated oat using Roche 454 sequencing methods. DNA sequences were generated from cDNAs originating from a panel of 20 diverse oat cultivars, and from Diversity Array Technology (DArT) genomic complexity reductions fr...

  16. Ultrahigh-density linkage map for cultivated cucumber (Cucumis sativus L.) using a single-nucleotide polymorphism genotyping array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    With the low cost of single nucleotide polymorphism (SNP) discovery, use of SNP markers for SNP array development is becoming more affordable. The SNP array is a very useful tool for high throughput genotyping and has a number of applications such as genome-wide association studies (GWAS). Since the...

  17. High throughput SNP discovery and validation in the pig: towards the development of a high density swine SNP chip

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Recent developments in sequencing technology have allowed the generation of millions of short read sequences in a fast and inexpensive way. This enables the cost effective large scale identification of hundreds of thousands of SNPs needed for the development of high density SNP arrays. Currently, a ...

  18. Highly multiplexed imaging of single cells using a high-throughput cyclic immunofluorescence method.

    PubMed

    Lin, Jia-Ren; Fallahi-Sichani, Mohammad; Sorger, Peter K

    2015-01-01

    Single-cell analysis reveals aspects of cellular physiology not evident from population-based studies, particularly in the case of highly multiplexed methods such as mass cytometry (CyTOF) able to correlate the levels of multiple signalling, differentiation and cell fate markers. Immunofluorescence (IF) microscopy adds information on cell morphology and the microenvironment that are not obtained using flow-based techniques, but the multiplicity of conventional IF is limited. This has motivated development of imaging methods that require specialized instrumentation, exotic reagents or proprietary protocols that are difficult to reproduce in most laboratories. Here we report a public-domain method for achieving high multiplicity single-cell IF using cyclic immunofluorescence (CycIF), a simple and versatile procedure in which four-colour staining alternates with chemical inactivation of fluorophores to progressively build a multichannel image. Because CycIF uses standard reagents and instrumentation and is no more expensive than conventional IF, it is suitable for high-throughput assays and screening applications. PMID:26399630

  19. Highly multiplexed imaging of single cells using a high-throughput cyclic immunofluorescence method

    PubMed Central

    Lin, Jia-Ren; Fallahi-Sichani, Mohammad; Sorger, Peter K.

    2015-01-01

    Single-cell analysis reveals aspects of cellular physiology not evident from population-based studies, particularly in the case of highly multiplexed methods such as mass cytometry (CyTOF) able to correlate the levels of multiple signalling, differentiation and cell fate markers. Immunofluorescence (IF) microscopy adds information on cell morphology and the microenvironment that are not obtained using flow-based techniques, but the multiplicity of conventional IF is limited. This has motivated development of imaging methods that require specialized instrumentation, exotic reagents or proprietary protocols that are difficult to reproduce in most laboratories. Here we report a public-domain method for achieving high multiplicity single-cell IF using cyclic immunofluorescence (CycIF), a simple and versatile procedure in which four-colour staining alternates with chemical inactivation of fluorophores to progressively build a multichannel image. Because CycIF uses standard reagents and instrumentation and is no more expensive than conventional IF, it is suitable for high-throughput assays and screening applications. PMID:26399630

  20. Optimizing sparse sequencing of single cells for highly multiplex copy number profiling

    PubMed Central

    Baslan, Timour; Kendall, Jude; Ward, Brian; Cox, Hilary; Leotta, Anthony; Rodgers, Linda; Riggs, Michael; D'Italia, Sean; Sun, Guoli; Yong, Mao; Miskimen, Kristy; Gilmore, Hannah; Saborowski, Michael; Dimitrova, Nevenka; Krasnitz, Alexander; Harris, Lyndsay; Wigler, Michael; Hicks, James

    2015-01-01

    Genome-wide analysis at the level of single cells has recently emerged as a powerful tool to dissect genome heterogeneity in cancer, neurobiology, and development. To be truly transformative, single-cell approaches must affordably accommodate large numbers of single cells. This is feasible in the case of copy number variation (CNV), because CNV determination requires only sparse sequence coverage. We have used a combination of bioinformatic and molecular approaches to optimize single-cell DNA amplification and library preparation for highly multiplexed sequencing, yielding a method that can produce genome-wide CNV profiles of up to a hundred individual cells on a single lane of an Illumina HiSeq instrument. We apply the method to human cancer cell lines and biopsied cancer tissue, thereby illustrating its efficiency, reproducibility, and power to reveal underlying genetic heterogeneity and clonal phylogeny. The capacity of the method to facilitate the rapid profiling of hundreds to thousands of single-cell genomes represents a key step in making single-cell profiling an easily accessible tool for studying cell lineage. PMID:25858951

  1. Highly-multiplexed barcode sequencing: an efficient method for parallel analysis of pooled samples

    PubMed Central

    Smith, Andrew M.; Heisler, Lawrence E.; St.Onge, Robert P.; Farias-Hesson, Eveline; Wallace, Iain M.; Bodeau, John; Harris, Adam N.; Perry, Kathleen M.; Giaever, Guri; Pourmand, Nader; Nislow, Corey

    2010-01-01

    Next-generation sequencing has proven an extremely effective technology for molecular counting applications where the number of sequence reads provides a digital readout for RNA-seq, ChIP-seq, Tn-seq and other applications. The extremely large number of sequence reads that can be obtained per run permits the analysis of increasingly complex samples. For lower complexity samples, however, a point of diminishing returns is reached when the number of counts per sequence results in oversampling with no increase in data quality. A solution to making next-generation sequencing as efficient and affordable as possible involves assaying multiple samples in a single run. Here, we report the successful 96-plexing of complex pools of DNA barcoded yeast mutants and show that such ‘Bar-seq’ assessment of these samples is comparable with data provided by barcode microarrays, the current benchmark for this application. The cost reduction and increased throughput permitted by highly multiplexed sequencing will greatly expand the scope of chemogenomics assays and, equally importantly, the approach is suitable for other sequence counting applications that could benefit from massive parallelization. PMID:20460461

  2. Lanthanide-Containing Polymer Microspheres by Multiple-Stage Dispersion Polymerization for Highly Multiplexed Bioassays

    PubMed Central

    Abdelrahman, Ahmed I.; Dai, Sheng; Thickett, Stuart C.; Ornatsky, Olga; Bandura, Dmitry; Baranov, Vladimir; Winnik, Mitchell A.

    2009-01-01

    We describe the synthesis and characterization of metal-encoded polystyrene microspheres by multiple-stage dispersion polymerization with diameters on the order of 2 µm and a very narrow size distribution. Different lanthanides were loaded into these microspheres through the addition of a mixture of LnCl3 salts and excess acrylic acid or acetoacetylethyl methacrylate (AAEM) dissolved in ethanol to the reaction after about 10% conversion of styrene, i.e., well after the particle nucleation stage was complete. Individual microspheres contain ca. 106 – 108 chelated lanthanide ions, of either a single element or a mixture of elements. These microspheres were characterized one-by-one utilizing a novel mass cytometer with an inductively coupled plasma (ICP) ionization source and time-of-flight (TOF) mass spectrometry detection. Microspheres containing a range of different metals at different levels of concentration were synthesized to meet the requirements of binary encoding and enumeration encoding protocols. With four different metals at five levels of concentration, we could achieve a variability of 624, and the strategy we report should allow one to obtain much larger variability. To demonstrate the usefulness of element-encoded beads for highly multiplexed immunoassays, we carried out a proof-of-principle model bioassay involving conjugation of mouse IgG to the surface of La and Tm containing particles, and its detection by an anti-mouse IgG bearing a metal-chelating polymer with Pr. PMID:19807075

  3. QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data.

    PubMed

    Colella, Stefano; Yau, Christopher; Taylor, Jennifer M; Mirza, Ghazala; Butler, Helen; Clouston, Penny; Bassett, Anne S; Seller, Anneke; Holmes, Christopher C; Ragoussis, Jiannis

    2007-01-01

    Array-based technologies have been used to detect chromosomal copy number changes (aneuploidies) in the human genome. Recent studies identified numerous copy number variants (CNV) and some are common polymorphisms that may contribute to disease susceptibility. We developed, and experimentally validated, a novel computational framework (QuantiSNP) for detecting regions of copy number variation from BeadArray SNP genotyping data using an Objective Bayes Hidden-Markov Model (OB-HMM). Objective Bayes measures are used to set certain hyperparameters in the priors using a novel re-sampling framework to calibrate the model to a fixed Type I (false positive) error rate. Other parameters are set via maximum marginal likelihood to prior training data of known structure. QuantiSNP provides probabilistic quantification of state classifications and significantly improves the accuracy of segmental aneuploidy identification and mapping, relative to existing analytical tools (Beadstudio, Illumina), as demonstrated by validation of breakpoint boundaries. QuantiSNP identified both novel and validated CNVs. QuantiSNP was developed using BeadArray SNP data but it can be adapted to other platforms and we believe that the OB-HMM framework has widespread applicability in genomic research. In conclusion, QuantiSNP is a novel algorithm for high-resolution CNV/aneuploidy detection with application to clinical genetics, cancer and disease association studies. PMID:17341461

  4. HIV drug resistance testing by high-multiplex "wide" sequencing on the MiSeq instrument.

    PubMed

    Lapointe, H R; Dong, W; Lee, G Q; Bangsberg, D R; Martin, J N; Mocello, A R; Boum, Y; Karakas, A; Kirkby, D; Poon, A F Y; Harrigan, P R; Brumme, C J

    2015-11-01

    Limited access to HIV drug resistance testing in low- and middle-income countries impedes clinical decision-making at the individual patient level. An efficient protocol to address this issue must be established to minimize negative therapeutic outcomes for HIV-1-infected individuals in such settings. This is an observational study to ascertain the potential of newer genomic sequencing platforms, such as the Illumina MiSeq instrument, to provide accurate HIV drug resistance genotypes for hundreds of samples simultaneously. Plasma samples were collected from Canadian patients during routine drug resistance testing (n = 759) and from a Ugandan study cohort (n = 349). Amplicons spanning HIV reverse transcriptase codons 90 to 234 were sequenced with both MiSeq sequencing and conventional Sanger sequencing methods. Sequences were evaluated for nucleotide concordance between methods, using coverage and mixture parameters for quality control. Consensus sequences were also analyzed for disparities in the identification of drug resistance mutations. Sanger and MiSeq sequencing was successful for 881 samples (80%) and 892 samples (81%), respectively, with 832 samples having results from both methods. Most failures were for samples with viral loads of <3.0 log10 HIV RNA copies/ml. Overall, 99.3% nucleotide concordance between methods was observed. MiSeq sequencing achieved 97.4% sensitivity and 99.3% specificity in detecting resistance mutations identified by Sanger sequencing. Findings suggest that the Illumina MiSeq platform can yield high-quality data with a high-multiplex "wide" sequencing approach. This strategy can be used for multiple HIV subtypes, demonstrating the potential for widespread individual testing and annual population surveillance in resource-limited settings. PMID:26282425

  5. Computational tradeoffs in multiplex PCR assay design for SNP genotyping

    PubMed Central

    Rachlin, John; Ding, Chunming; Cantor, Charles; Kasif, Simon

    2005-01-01

    Background Multiplex PCR is a key technology for detecting infectious microorganisms, whole-genome sequencing, forensic analysis, and for enabling flexible yet low-cost genotyping. However, the design of a multiplex PCR assays requires the consideration of multiple competing objectives and physical constraints, and extensive computational analysis must be performed in order to identify the possible formation of primer-dimers that can negatively impact product yield. Results This paper examines the computational design limits of multiplex PCR in the context of SNP genotyping and examines tradeoffs associated with several key design factors including multiplexing level (the number of primer pairs per tube), coverage (the % of SNP whose associated primers are actually assigned to one of several available tube), and tube-size uniformity. We also examine how design performance depends on the total number of available SNPs from which to choose, and primer stringency criterial. We show that finding high-multiplexing/high-coverage designs is subject to a computational phase transition, becoming dramatically more difficult when the probability of primer pair interaction exceeds a critical threshold. The precise location of this critical transition point depends on the number of available SNPs and the level of multiplexing required. We also demonstrate how coverage performance is impacted by the number of available snps, primer selection criteria, and target multiplexing levels. Conclusion The presence of a phase transition suggests limits to scaling Multiplex PCR performance for high-throughput genomics applications. Achieving broad SNP coverage rapidly transitions from being very easy to very hard as the target multiplexing level (# of primer pairs per tube) increases. The onset of a phase transition can be "delayed" by having a larger pool of SNPs, or loosening primer selection constraints so as to increase the number of candidate primer pairs per SNP, though the latter

  6. Whole genome SNP scanning of snow sheep (Ovis nivicola).

    PubMed

    Deniskova, T E; Okhlopkov, I M; Sermyagin, A A; Gladyr', E A; Bagirov, V A; Sölkner, J; Mamaev, N V; Brem, G; Zinov'eva, N A

    2016-07-01

    This is the first report performing the whole genome SNP scanning of snow sheep (Ovis nivicola). Samples of snow sheep (n = 18) collected in six different regions of the Republic of Sakha (Yakutia) from 64° to 71° N. For SNP genotyping, we applied Ovine 50K SNP BeadChip (Illumina, United States), designed for domestic sheep. The total number of genotyped SNPs (call rate 90%) was 47796 (88.1% of total SNPs), wherein 1006 SNPs were polymorphic (2.1%). Principal component analysis (PCA) showed the clear differentiation within the species O. nivicola: studied individuals were distributed among five distinct arrays corresponding to the geographical locations of sampling points. Our results demonstrate that the DNA chip designed for domestic sheep can be successfully used to study the allele pool and the genetic structure of snow sheep populations. PMID:27599514

  7. Detection of copy number variation by SNP-allelotyping.

    PubMed

    Parker, Brett; Alexander, Ryan; Wu, Xingyao; Feely, Shawna; Shy, Michael; Schnetz-Boutaud, Nathalie; Li, Jun

    2015-03-01

    Charcot-Marie-Tooth disease type 1A (CMT1A) is caused by an abnormal copy number variation (CNV) with a trisomy of chromosome 17p12. The increase of the DNA-segment copy number is expected to alter the allele frequency of single nucleotide polymorphism (SNP) within the duplicated region. We tested whether SNP allele frequency determined by a Sequenom MassArray can be used to detect the CMT1A mutation. Our results revealed distinct patterns of SNP allele frequency distribution, which reliably differentiated CMT1A patients from controls. This finding suggests that this technique may serve as an alternative approach to identifying CNV in certain diseases, including CMT1A. PMID:24830919

  8. SNP-VISTA: An interactive SNP visualization tool

    PubMed Central

    Shah, Nameeta; Teplitsky, Michael V; Minovitsky, Simon; Pennacchio, Len A; Hugenholtz, Philip; Hamann, Bernd; Dubchak, Inna L

    2005-01-01

    Background Recent advances in sequencing technologies promise to provide a better understanding of the genetics of human disease as well as the evolution of microbial populations. Single Nucleotide Polymorphisms (SNPs) are established genetic markers that aid in the identification of loci affecting quantitative traits and/or disease in a wide variety of eukaryotic species. With today's technological capabilities, it has become possible to re-sequence a large set of appropriate candidate genes in individuals with a given disease in an attempt to identify causative mutations. In addition, SNPs have been used extensively in efforts to study the evolution of microbial populations, and the recent application of random shotgun sequencing to environmental samples enables more extensive SNP analysis of co-occurring and co-evolving microbial populations. The program is available at [1]. Results We have developed and present two modifications of an interactive visualization tool, SNP-VISTA, to aid in the analyses of the following types of data: A. Large-scale re-sequence data of disease-related genes for discovery of associated and/or causative alleles (GeneSNP-VISTA). B. Massive amounts of ecogenomics data for studying homologous recombination in microbial populations (EcoSNP-VISTA). The main features and capabilities of SNP-VISTA are: 1) mapping of SNPs to gene structure; 2) classification of SNPs, based on their location in the gene, frequency of occurrence in samples and allele composition; 3) clustering, based on user-defined subsets of SNPs, highlighting haplotypes as well as recombinant sequences; 4) integration of protein evolutionary conservation visualization; and 5) display of automatically calculated recombination points that are user-editable. Conclusion The main strength of SNP-VISTA is its graphical interface and use of visual representations, which support interactive exploration and hence better understanding of large-scale SNP data by the user. PMID

  9. SNP genotyping by heteroduplex analysis.

    PubMed

    Paniego, Norma; Fusari, Corina; Lia, Verónica; Puebla, Andrea

    2015-01-01

    Heteroduplex-based genotyping methods have proven to be technologically effective and economically efficient for low- to medium-range throughput single-nucleotide polymorphism (SNP) determination. In this chapter we describe two protocols that were successfully applied for SNP detection and haplotype analysis of candidate genes in association studies. The protocols involve (1) enzymatic mismatch cleavage with endonuclease CEL1 from celery, associated with fragment separation using capillary electrophoresis (CEL1 cleavage), and (2) differential retention of the homo/heteroduplex DNA molecules under partial denaturing conditions on ion pair reversed-phase liquid chromatography (dHPLC). Both methods are complementary since dHPLC is more versatile than CEL1 cleavage for identifying multiple SNP per target region, and the latter is easily optimized for sequences with fewer SNPs or small insertion/deletion polymorphisms. Besides, CEL1 cleavage is a powerful method to localize the position of the mutation when fragment resolution is done using capillary electrophoresis. PMID:25373754

  10. Highly multiplexible thermal kinetic inductance detectors for x-ray imaging spectroscopy

    SciTech Connect

    Ulbricht, Gerhard Mazin, Benjamin A.; Szypryt, Paul; Walter, Alex B.; Bockstiegel, Clint; Bumble, Bruce

    2015-06-22

    For X-ray imaging spectroscopy, high spatial resolution over a large field of view is often as important as high energy resolution, but current X-ray detectors do not provide both in the same device. Thermal Kinetic Inductance Detectors (TKIDs) are being developed as they offer a feasible way to combine the energy resolution of transition edge sensors with pixel counts approaching CCDs and thus promise significant improvements for many X-ray spectroscopy applications. TKIDs are a variation of Microwave Kinetic Inductance Detectors (MKIDs) and share their multiplexibility: working MKID arrays with 2024 pixels have recently been demonstrated and much bigger arrays are under development. In this work, we present a TKID prototype, which is able to achieve an energy resolution of 75 eV at 5.9 keV, even though its general design still has to be optimized. We further describe TKID fabrication, characterization, multiplexing, and working principle and demonstrate the necessity of a data fitting algorithm in order to extract photon energies. With further design optimizations, we expect to be able to improve our TKID energy resolution to less than 10 eV at 5.9 keV.

  11. Highly multiplexible thermal kinetic inductance detectors for x-ray imaging spectroscopy

    NASA Astrophysics Data System (ADS)

    Ulbricht, Gerhard; Mazin, Benjamin A.; Szypryt, Paul; Walter, Alex B.; Bockstiegel, Clint; Bumble, Bruce

    2015-06-01

    For X-ray imaging spectroscopy, high spatial resolution over a large field of view is often as important as high energy resolution, but current X-ray detectors do not provide both in the same device. Thermal Kinetic Inductance Detectors (TKIDs) are being developed as they offer a feasible way to combine the energy resolution of transition edge sensors with pixel counts approaching CCDs and thus promise significant improvements for many X-ray spectroscopy applications. TKIDs are a variation of Microwave Kinetic Inductance Detectors (MKIDs) and share their multiplexibility: working MKID arrays with 2024 pixels have recently been demonstrated and much bigger arrays are under development. In this work, we present a TKID prototype, which is able to achieve an energy resolution of 75 eV at 5.9 keV, even though its general design still has to be optimized. We further describe TKID fabrication, characterization, multiplexing, and working principle and demonstrate the necessity of a data fitting algorithm in order to extract photon energies. With further design optimizations, we expect to be able to improve our TKID energy resolution to less than 10 eV at 5.9 keV.

  12. ARCONS: a Highly Multiplexed Superconducting UV-to-Near-IR Camera

    NASA Astrophysics Data System (ADS)

    O'Brien, Kieran; Mazin, Ben; McHugh, Sean; Meeker, Seth; Bumble, Bruce

    2012-04-01

    ARCONS, the Array Camera for Optical to Near-infrared Spectrophotometry, was recently commissioned at the coudé focus of the 200-inch Hale Telescope at the Palomar Observatory. At the heart of this unique instrument is a 1024-pixel Microwave Kinetic Inductance Detector (MKID), exploiting the Kinetic Inductance effect to measure the energy of the incoming photon to better than several percent. The ground-breaking instrument is lens-coupled with a pixel scale of 0''.23/pixel, each pixel recording the arrival time (< 2 μ sec) and energy of a photon (~10%) in the optical to near-IR (0.4-1.1 microns) range. The scientific objectives of the instrument include the rapid follow-up and classification of transient phenomena.

  13. SNP Microarray in FISH Negative Clinically Suspected 22q11.2 Microdeletion Syndrome

    PubMed Central

    Jain, Manish; Kalsi, Amanpreet Kaur

    2016-01-01

    The present study evaluated the role of SNP microarray in 101 cases of clinically suspected FISH negative (noninformative/normal) 22q11.2 microdeletion syndrome. SNP microarray was carried out using 300 K HumanCytoSNP-12 BeadChip array or CytoScan 750 K array. SNP microarray identified 8 cases of 22q11.2 microdeletions and/or microduplications in addition to cases of chromosomal abnormalities and other pathogenic/likely pathogenic CNVs. Clinically suspected specific deletions (22q11.2) were detectable in approximately 8% of cases by SNP microarray, mostly from FISH noninformative cases. This study also identified several LOH/AOH loci with known and well-defined UPD (uniparental disomy) disorders. In conclusion, this study suggests more strict clinical criteria for FISH analysis. However, if clinical criteria are few or doubtful, in particular newborn/neonate in intensive care, SNP microarray should be the first screening test to be ordered. FISH is ideal test for detecting mosaicism, screening family members, and prenatal diagnosis in proven families. PMID:27051557

  14. SNP Microarray in FISH Negative Clinically Suspected 22q11.2 Microdeletion Syndrome.

    PubMed

    Halder, Ashutosh; Jain, Manish; Kalsi, Amanpreet Kaur

    2016-01-01

    The present study evaluated the role of SNP microarray in 101 cases of clinically suspected FISH negative (noninformative/normal) 22q11.2 microdeletion syndrome. SNP microarray was carried out using 300 K HumanCytoSNP-12 BeadChip array or CytoScan 750 K array. SNP microarray identified 8 cases of 22q11.2 microdeletions and/or microduplications in addition to cases of chromosomal abnormalities and other pathogenic/likely pathogenic CNVs. Clinically suspected specific deletions (22q11.2) were detectable in approximately 8% of cases by SNP microarray, mostly from FISH noninformative cases. This study also identified several LOH/AOH loci with known and well-defined UPD (uniparental disomy) disorders. In conclusion, this study suggests more strict clinical criteria for FISH analysis. However, if clinical criteria are few or doubtful, in particular newborn/neonate in intensive care, SNP microarray should be the first screening test to be ordered. FISH is ideal test for detecting mosaicism, screening family members, and prenatal diagnosis in proven families. PMID:27051557

  15. SNP-SNP interactions in breast cancer susceptibility

    PubMed Central

    Onay, Venüs Ümmiye; Briollais, Laurent; Knight, Julia A; Shi, Ellen; Wang, Yuanyuan; Wells, Sean; Li, Hong; Rajendram, Isaac; Andrulis, Irene L; Ozcelik, Hilmi

    2006-01-01

    Background Breast cancer predisposition genes identified to date (e.g., BRCA1 and BRCA2) are responsible for less than 5% of all breast cancer cases. Many studies have shown that the cancer risks associated with individual commonly occurring single nucleotide polymorphisms (SNPs) are incremental. However, polygenic models suggest that multiple commonly occurring low to modestly penetrant SNPs of cancer related genes might have a greater effect on a disease when considered in combination. Methods In an attempt to identify the breast cancer risk conferred by SNP interactions, we have studied 19 SNPs from genes involved in major cancer related pathways. All SNPs were genotyped by TaqMan 5'nuclease assay. The association between the case-control status and each individual SNP, measured by the odds ratio and its corresponding 95% confidence interval, was estimated using unconditional logistic regression models. At the second stage, two-way interactions were investigated using multivariate logistic models. The robustness of the interactions, which were observed among SNPs with stronger functional evidence, was assessed using a bootstrap approach, and correction for multiple testing based on the false discovery rate (FDR) principle. Results None of these SNPs contributed to breast cancer risk individually. However, we have demonstrated evidence for gene-gene (SNP-SNP) interaction among these SNPs, which were associated with increased breast cancer risk. Our study suggests cross talk between the SNPs of the DNA repair and immune system (XPD-[Lys751Gln] and IL10-[G(-1082)A]), cell cycle and estrogen metabolism (CCND1-[Pro241Pro] and COMT-[Met108/158Val]), cell cycle and DNA repair (BARD1-[Pro24Ser] and XPD-[Lys751Gln]), and within carcinogen metabolism (GSTP1-[Ile105Val] and COMT-[Met108/158Val]) pathways. Conclusion The importance of these pathways and their communication in breast cancer predisposition has been emphasized previously, but their biological interactions

  16. Genome-wide association study using a high-density SNP-array and case-control design identifies a novel essential hypertension susceptibility locus in the promoter region of eNOS

    PubMed Central

    Salvi, Erika; Kutalik, Zoltán; Glorioso, Nicola; Benaglio, Paola; Frau, Francesca; Kuznetsova, Tatiana; Arima, Hisatomi; Hoggart, Clive; Tichet, Jean; Nikitin, Yury P.; Conti, Costanza; Seidlerova, Jitka; Tikhonoff, Valérie; Stolarz-Skrzypek, Katarzyna; Johnson, Toby; Devos, Nabila; Zagato, Laura; Guarrera, Simonetta; Zaninello, Roberta; Calabria, Andrea; Stancanelli, Benedetta; Troffa, Chiara; Thijs, Lutgarde; Rizzi, Federica; Simonova, Galina; Lupoli, Sara; Argiolas, Giuseppe; Braga, Daniele; D’Alessio, Maria C.; Ortu, Maria F.; Ricceri, Fulvio; Mercurio, Maurizio; Descombes, Patrick; Marconi, Maurizio; Chalmers, John; Harrap, Stephen; Filipovsky, Jan; Bochud, Murielle; Iacoviello, Licia; Ellis, Justine; Stanton, Alice V.; Laan, Maris; Padmanabhan, Sandosh; Dominiczak, Anna F.; Samani, Nilesh J.; Melander, Olle; Jeunemaitre, Xavier; Manunta, Paolo; Shabo, Amnon; Vineis, Paolo; Cappuccio, Francesco P.; Caulfield, Mark J.; Matullo, Giuseppe; Rivolta, Carlo; Munroe, Patricia B.; Barlassina, Cristina; Staessen, Jan A; Beckmann, Jacques S.; Cusi, Daniele

    2012-01-01

    Essential hypertension is a multi-factorial disorder and is the main risk factor for renal and cardiovascular complications. The research on the genetics of hypertension has been frustrated by the small predictive value of the discovered genetic variants. The HYPERGENES Project investigated associations between genetic variants and essential hypertension pursuing a two-stage study by recruiting cases and controls from extensively characterized cohorts recruited over many years in different European regions. The discovery phase consisted of 1,865 cases and 1,750 controls genotyped with 1M Illumina array. Best hits were followed up in a validation panel of 1,385 cases and 1,246 controls that were genotyped with a custom array of 14,055 markers. We identified a new hypertension susceptibility locus (rs3918226) in the promoter region of the endothelial nitric oxide synthase (eNOS) gene (odds ratio 1.54; 95% CI 1.37-1.73; combined p=2.58·10−13). A meta-analysis, using other in-silico/de novo genotyping data for a total of 21714 subjects, resulted in an overall odds ratio of 1.34 (95% CI 1.25-1.44, p=1.032·10−14). The quantitative analysis on a population-based sample revealed an effect size of 1.91 (95% CI 0.16-3.66) for systolic and 1.40 (95% CI 0.25-2.55) for diastolic blood pressure. We identified in-silico a potential binding site for ETS transcription-factors directly next to rs3918226, suggesting a potential modulation of eNOS expression. Biological evidence links eNOS with hypertension, as it is a critical mediator of cardiovascular homeostasis and blood pressure control via vascular tone regulation. This finding supports the hypothesis that there may be a causal genetic variation at this locus. PMID:22184326

  17. BM-SNP: A Bayesian Model for SNP Calling Using High Throughput Sequencing Data.

    PubMed

    Xu, Yanxun; Zheng, Xiaofeng; Yuan, Yuan; Estecio, Marcos R; Issa, Jean-Pierre; Qiu, Peng; Ji, Yuan; Liang, Shoudan

    2014-01-01

    A single-nucleotide polymorphism (SNP) is a sole base change in the DNA sequence and is the most common polymorphism. Detection and annotation of SNPs are among the central topics in biomedical research as SNPs are believed to play important roles on the manifestation of phenotypic events, such as disease susceptibility. To take full advantage of the next-generation sequencing (NGS) technology, we propose a Bayesian approach, BM-SNP, to identify SNPs based on the posterior inference using NGS data. In particular, BM-SNP computes the posterior probability of nucleotide variation at each covered genomic position using the contents and frequency of the mapped short reads. The position with a high posterior probability of nucleotide variation is flagged as a potential SNP. We apply BM-SNP to two cell-line NGS data, and the results show a high ratio of overlap ( >95 percent) with the dbSNP database. Compared with MAQ, BM-SNP identifies more SNPs that are in dbSNP, with higher quality. The SNPs that are called only by BM-SNP but not in dbSNP may serve as new discoveries. The proposed BM-SNP method integrates information from multiple aspects of NGS data, and therefore achieves high detection power. BM-SNP is fast, capable of processing whole genome data at 20-fold average coverage in a short amount of time. PMID:26357041

  18. The development and characterization of a 57K SNP Chip for rainbow trout

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In this paper we describe the development and characterization of the first high density SNP chip for rainbow trout. The chip included 57,500 putative SNPs, of which 49,500 (86%) were validated as high quality and polymorphic in our validation panel of 960 rainbow trout samples. This array is compa...

  19. Construction and application of a bovine high-density SNP assay

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Bovine genomics has entered a new era and has been transformed by the availability of the whole genome sequence data. An additional resource currently under development is a 60,000 single nucleotide polymorphism (SNP) array that will soon be made commercially available. Targetted content for this SN...

  20. A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation

    PubMed Central

    2013-01-01

    Background Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. Results We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Conclusions Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array—more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation

  1. Detecting Susceptibility to Breast Cancer with SNP-SNP Interaction Using BPSOHS and Emotional Neural Networks.

    PubMed

    Wang, Xiao; Peng, Qinke; Fan, Yue

    2016-01-01

    Studies for the association between diseases and informative single nucleotide polymorphisms (SNPs) have received great attention. However, most of them just use the whole set of useful SNPs and fail to consider the SNP-SNP interactions, while these interactions have already been proven in biology experiments. In this paper, we use a binary particle swarm optimization with hierarchical structure (BPSOHS) algorithm to improve the effective of PSO for the identification of the SNP-SNP interactions. Furthermore, in order to use these SNP interactions in the susceptibility analysis, we propose an emotional neural network (ENN) to treat SNP interactions as emotional tendency. Different from the normal architecture, just as the emotional brain, this architecture provides a specific path to treat the emotional value, by which the SNP interactions can be considered more quickly and directly. The ENN helps us use the prior knowledge about the SNP interactions and other influence factors together. Finally, the experimental results prove that the proposed BPSOHS_ENN algorithm can detect the informative SNP-SNP interaction and predict the breast cancer risk with a much higher accuracy than existing methods. PMID:27294121

  2. Detecting Susceptibility to Breast Cancer with SNP-SNP Interaction Using BPSOHS and Emotional Neural Networks

    PubMed Central

    Wang, Xiao; Fan, Yue

    2016-01-01

    Studies for the association between diseases and informative single nucleotide polymorphisms (SNPs) have received great attention. However, most of them just use the whole set of useful SNPs and fail to consider the SNP-SNP interactions, while these interactions have already been proven in biology experiments. In this paper, we use a binary particle swarm optimization with hierarchical structure (BPSOHS) algorithm to improve the effective of PSO for the identification of the SNP-SNP interactions. Furthermore, in order to use these SNP interactions in the susceptibility analysis, we propose an emotional neural network (ENN) to treat SNP interactions as emotional tendency. Different from the normal architecture, just as the emotional brain, this architecture provides a specific path to treat the emotional value, by which the SNP interactions can be considered more quickly and directly. The ENN helps us use the prior knowledge about the SNP interactions and other influence factors together. Finally, the experimental results prove that the proposed BPSOHS_ENN algorithm can detect the informative SNP-SNP interaction and predict the breast cancer risk with a much higher accuracy than existing methods. PMID:27294121

  3. HIV Drug Resistance Testing by High-Multiplex “Wide” Sequencing on the MiSeq Instrument

    PubMed Central

    Lapointe, H. R.; Dong, W.; Lee, G. Q.; Bangsberg, D. R.; Martin, J. N.; Mocello, A. R.; Boum, Y.; Karakas, A.; Kirkby, D.; Poon, A. F. Y.

    2015-01-01

    Limited access to HIV drug resistance testing in low- and middle-income countries impedes clinical decision-making at the individual patient level. An efficient protocol to address this issue must be established to minimize negative therapeutic outcomes for HIV-1-infected individuals in such settings. This is an observational study to ascertain the potential of newer genomic sequencing platforms, such as the Illumina MiSeq instrument, to provide accurate HIV drug resistance genotypes for hundreds of samples simultaneously. Plasma samples were collected from Canadian patients during routine drug resistance testing (n = 759) and from a Ugandan study cohort (n = 349). Amplicons spanning HIV reverse transcriptase codons 90 to 234 were sequenced with both MiSeq sequencing and conventional Sanger sequencing methods. Sequences were evaluated for nucleotide concordance between methods, using coverage and mixture parameters for quality control. Consensus sequences were also analyzed for disparities in the identification of drug resistance mutations. Sanger and MiSeq sequencing was successful for 881 samples (80%) and 892 samples (81%), respectively, with 832 samples having results from both methods. Most failures were for samples with viral loads of <3.0 log10 HIV RNA copies/ml. Overall, 99.3% nucleotide concordance between methods was observed. MiSeq sequencing achieved 97.4% sensitivity and 99.3% specificity in detecting resistance mutations identified by Sanger sequencing. Findings suggest that the Illumina MiSeq platform can yield high-quality data with a high-multiplex “wide” sequencing approach. This strategy can be used for multiple HIV subtypes, demonstrating the potential for widespread individual testing and annual population surveillance in resource-limited settings. PMID:26282425

  4. 4MOST - The new wide-field, high-multiplex spectroscopic survey facility for ESO's VISTA telescope

    NASA Astrophysics Data System (ADS)

    de Jong, Roelof S.; Consortium, 4MOST

    2015-08-01

    4MOST is a wide-field, high-multiplex spectroscopic survey facility under development for the VISTA telescope of the European Southern Observatory (ESO), the only facility of its kind planned for the southern sky. Its main science drivers are in the fields of galactic archeology, high-energy physics, galaxy evolution and cosmology. 4MOST will in particular provide the spectroscopic complements to the large area surveys coming from space missions like Gaia, eROSITA, Euclid, and PLATO and from ground-based facilities like VISTA, VST, DECam, LSST and SKA. Surveys are planned to determine the chemo-dynamical structure and evolution of the Milky Way and to characterize Dark Energy as function of time through multiple methods (e.g., BAO, RSD, lensing) and tracers (galaxies, AGN, Ly-α Forest, galaxy clusters, SNe 1a).The 4MOST baseline concept features a 2.7 degree diameter field-of-view with about 2400 fibres in the focal surface that are configured by a fibre positioner based on the tilting spine principle. The fibres feed two types of spectrographs; ˜1600 fibres go to two spectrographs with resolution R>5000 (˜390-930 nm) and ˜800 fibres to a spectrograph with R>18,000 (˜392-437 nm, 515-572 nm and 605-675 nm). Both types of spectrographs are fixed-configuration, three-channel spectrographs.4MOST will have an unique operations concept in which 5 year public surveys from both the consortium and the ESO community will be combined and observed in parallel during each exposure, resulting in more than 25 million spectra of targets spread over a large fraction of the southern sky. The 4MOST Facility Simulator (4FS) was developed to demonstrate the feasibility of this observing concept. 4MOST has been accepted for implementation by ESO and has moved into its construction phase. Operations are expected to start by the early 2021.

  5. SNP genotyping by DNA photoligation: application to SNP detection of genes from food crops

    NASA Astrophysics Data System (ADS)

    Yoshimura, Yoshinaga; Ohtake, Tomoko; Okada, Hajime; Ami, Takehiro; Tsukaguchi, Tadashi; Fujimoto, Kenzo

    2009-06-01

    We describe a simple and inexpensive single-nucleotide polymorphism (SNP) typing method, using DNA photoligation with 5-carboxyvinyl-2'-deoxyuridine and two fluorophores. This SNP-typing method facilitates qualitative determination of genes from indica and japonica rice, and showed a high degree of single nucleotide specificity up to 10 000. This method can be used in the SNP typing of actual genomic DNA samples from food crops.

  6. A SNP-Based Molecular Barcode for Characterization of Common Wheat

    PubMed Central

    Gao, LiFeng; Jia, JiZeng; Kong, XiuYing

    2016-01-01

    Wheat is grown as a staple crop worldwide. It is important to develop an effective genotyping tool for this cereal grain both to identify germplasm diversity and to protect the rights of breeders. Single-nucleotide polymorphism (SNP) genotyping provides a means for developing a practical, rapid, inexpensive and high-throughput assay. Here, we investigated SNPs as robust markers of genetic variation for typing wheat cultivars. We identified SNPs from an array of 9000 across a collection of 429 well-known wheat cultivars grown in China, of which 43 SNP markers with high minor allele frequency and variations discriminated the selected wheat varieties and their wild ancestors. This SNP-based barcode will allow for the rapid and precise identification of wheat germplasm resources and newly released varieties and will further assist in the wheat breeding program. PMID:26985664

  7. A SNP-Based Molecular Barcode for Characterization of Common Wheat.

    PubMed

    Gao, LiFeng; Jia, JiZeng; Kong, XiuYing

    2016-01-01

    Wheat is grown as a staple crop worldwide. It is important to develop an effective genotyping tool for this cereal grain both to identify germplasm diversity and to protect the rights of breeders. Single-nucleotide polymorphism (SNP) genotyping provides a means for developing a practical, rapid, inexpensive and high-throughput assay. Here, we investigated SNPs as robust markers of genetic variation for typing wheat cultivars. We identified SNPs from an array of 9000 across a collection of 429 well-known wheat cultivars grown in China, of which 43 SNP markers with high minor allele frequency and variations discriminated the selected wheat varieties and their wild ancestors. This SNP-based barcode will allow for the rapid and precise identification of wheat germplasm resources and newly released varieties and will further assist in the wheat breeding program. PMID:26985664

  8. Fish scales and SNP chips: SNP genotyping and allele frequency estimation in individual and pooled DNA from historical samples of Atlantic salmon (Salmo salar)

    PubMed Central

    2013-01-01

    Background DNA extracted from historical samples is an important resource for understanding genetic consequences of anthropogenic influences and long-term environmental change. However, such samples generally yield DNA of a lower amount and quality, and the extent to which DNA degradation affects SNP genotyping success and allele frequency estimation is not well understood. We conducted high density SNP genotyping and allele frequency estimation in both individual DNA samples and pooled DNA samples extracted from dried Atlantic salmon (Salmo salar) scales stored at room temperature for up to 35 years, and assessed genotyping success, repeatability and accuracy of allele frequency estimation using a high density SNP genotyping array. Results In individual DNA samples, genotyping success and repeatability was very high (> 0.973 and > 0.998, respectively) in samples stored for up to 35 years; both increased with the proportion of DNA of fragment size > 1000 bp. In pooled DNA samples, allele frequency estimation was highly repeatable (Repeatability = 0.986) and highly correlated with empirical allele frequency measures (Mean Adjusted R2 = 0.991); allele frequency could be accurately estimated in > 95% of pooled DNA samples with a reference group of at least 30 individuals. SNPs located in polyploid regions of the genome were more sensitive to DNA degradation: older samples had lower genotyping success at these loci, and a larger reference panel of individuals was required to accurately estimate allele frequencies. Conclusions SNP genotyping was highly successful in degraded DNA samples, paving the way for the use of degraded samples in SNP genotyping projects. DNA pooling provides the potential for large scale population genetic studies with fewer assays, provided enough reference individuals are also genotyped and DNA quality is properly assessed beforehand. We provide recommendations for future studies intending to conduct high-throughput SNP

  9. SNPMeta: SNP annotation and SNP metadata collection without a reference genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The increase in availability of resequencing data is greatly accelerating SNP discovery and has facilitated the development of SNP genotyping assays. This, in turn, is increasing interest in annotation of individual SNPs. Currently, these data are only available through curation, or comparison to a ...

  10. Recombination mapping using Boolean logic and high-density SNP genotyping for exome sequence filtering

    PubMed Central

    Markello, Thomas C.; Han, Ted; Carlson-Donohoe, Hannah; Ahaghotu, Chidi; Harper, Ursula; Jones, MaryPat; Chandrasekharappa, Settara; Anikster, Yair; Adams, David R.; Gahl, William A.; Boerkoel, Cornelius F.

    2012-01-01

    Whole genome sequence data for small pedigrees has been shown to provide sufficient information to resolve detailed haplotypes in small pedigrees. Using such information, recombinations can be mapped onto chromosomes, compared with the segregation of a disease of interest and used to filter genome sequence variants. We now show that relatively inexpensive SNP array data from small pedigrees can be used in a similar manner to provide a means of identifying regions of interest in exome sequencing projects. We demonstrate that in those situations where one can assume complete penetrance and parental DNA is available, SNP recombination mapping using Boolean logic identifies chromosomal regions identical to those detected by multipoint linkage using microsatellites but with much less computation. We further show that this approach is successful because the probability of a double crossover between informative SNP loci is negligible. Our observations provide a rationale for using SNP arrays and recombination mapping as a rapid and cost-effective means of incorporating chromosome segregation information into exome sequencing projects intended for disease-gene identification. PMID:22264778

  11. SNP Discovery Using Next Generation Transcriptomic Sequencing.

    PubMed

    De Wit, Pierre

    2016-01-01

    In this chapter, I will guide the user through methods to find new SNP markers from expressed sequence (RNA-Seq) data, focusing on the sample preparation and also on the bioinformatic analyses needed to sort through the immense flood of data from high-throughput sequencing machines. The general steps included are as follows: sample preparation, sequencing, quality control of data, assembly, mapping, SNP discovery, filtering, validation. The first few steps are traditional laboratory protocols, whereas steps following the sequencing are of bioinformatic nature. The bioinformatics described herein are by no means exhaustive, rather they serve as one example of a simple way of analyzing high-throughput sequence data to find SNP markers. Ideally, one would like to run through this protocol several times with a new dataset, while varying software parameters slightly, in order to determine the robustness of the results. The final validation step, although not described in much detail here, is also quite critical as that will be the final test of the accuracy of the assumptions made in silico.There is a plethora of downstream applications of a SNP dataset, not covered in this chapter. For an example of a more thorough protocol also including differential gene expression and functional enrichment analyses, BLAST annotation and downstream applications of SNP markers, a good starting point could be the "Simple Fool's Guide to population genomics via RNA-Seq," which is available at http://sfg.stanford.edu . PMID:27460371

  12. SSCP-SNP in pearl millet--a new marker system for comparative genetics.

    PubMed

    Bertin, I; Zhu, J H; Gale, M D

    2005-05-01

    A considerable array of genomic resources are in place in pearl millet, and marker-aided selection is already in use in the public breeding programme at ICRISAT. This paper describes experiments to extend these publicly available resources to a single nucleotide polymorphism (SNP)-based marker system. A new marker system, single-strand conformational polymorphism (SSCP)-SNP, was developed using annotated rice genomic sequences to initially predict the intron-exon borders in millet expressed sequence tags (ESTs) and then to design primers that would amplify across the introns. An adequate supply of millet ESTs was available for us to identify 299 homologues of single-copy rice genes in which the intron positions could be precisely predicted. PCR primers were then designed to amplify approximately 500-bp genomic fragments containing introns. Analysis of these fragments on SSCP gels revealed considerable polymorphism. A detailed DNA sequence analysis of variation at four of the SSCP-SNP loci over a panel of eight inbred genotypes showed complex patterns of variation, with about one SNP or indel (insertion-deletion) every 59 bp in the introns, but considerably fewer in the exons. About two-thirds of the variation was derived from SNPs and one-third from indels. Most haplotypes were detected by SSCP. As a marker system, SSCP-SNP has lower development costs than simple sequence repeats (SSRs), because much of the work is in silico, and similar deployment costs and through-put potential. The rates of polymorphism were lower but useable, with a mean PIC of 0.49 relative to 0.72 for SSRs in our eight inbred genotype panel screen. The major advantage of the system is in comparative applications. Syntenic information can be used to target SSCP-SNP markers to specific chromosomal regions or, conversely, SSCP-SNP markers can be used to unravel detailed syntenic relationships in specific parts of the genome. Finally, a preliminary analysis showed that the millet SSCP-SNP primers

  13. Linkage Analysis and QTL Mapping Using SNP Dosage Data in a Tetraploid Potato Mapping Population

    PubMed Central

    Hackett, Christine A.; McLean, Karen; Bryan, Glenn J.

    2013-01-01

    New sequencing and genotyping technologies have enabled researchers to generate high density SNP genotype data for mapping populations. In polyploid species, SNP data usually contain a new type of information, the allele dosage, which is not used by current methodologies for linkage analysis and QTL mapping. Here we extend existing methodology to use dosage data on SNPs in an autotetraploid mapping population. The SNP dosages are inferred from allele intensity ratios using normal mixture models. The steps of the linkage analysis (testing for distorted segregation, clustering SNPs, calculation of recombination fractions and LOD scores, ordering of SNPs and inference of parental phase) are extended to use the dosage information. For QTL analysis, the probability of each possible offspring genotype is inferred at a grid of locations along the chromosome from the ordered parental genotypes and phases and the offspring dosages. A normal mixture model is then used to relate trait values to the offspring genotypes and to identify the most likely locations for QTLs. These methods are applied to analyse a tetraploid potato mapping population of parents and 190 offspring, genotyped using an Infinium 8300 Potato SNP Array. Linkage maps for each of the 12 chromosomes are constructed. The allele intensity ratios are mapped as quantitative traits to check that their position and phase agrees with that of the corresponding SNP. This analysis confirms most SNP positions, and eliminates some problem SNPs to give high-density maps for each chromosome, with between 74 and 152 SNPs mapped and between 100 and 300 further SNPs allocated to approximate bins. Low numbers of double reduction products were detected. Overall 3839 of the 5378 polymorphic SNPs can be assigned putative genetic locations. This methodology can be applied to construct high-density linkage maps in any autotetraploid species, and could also be extended to higher autopolyploids. PMID:23704960

  14. Reverse Phase Protein Arrays for Compound Profiling.

    PubMed

    Moerke, Nathan; Fallahi-Sichani, Mohammad

    2016-01-01

    Reverse phase protein arrays (RPPAs), also called reverse phase lysate arrays (RPLAs), involve immobilizing cell or tissue lysates, in small spots, onto solid supports which are then probed with primary antibodies specific for proteins or post-translational modifications of interest. RPPA assays are well suited for large-scale, high-throughput measurement of protein and PTM levels in cells and tissues. RPPAs are affordable and highly multiplexable, as a large number of arrays can readily be produced in parallel and then probed separately with distinct primary antibodies. This article describes a procedure for treating cells and preparing cell lysates, as well as a procedure for generating RPPAs using these lysates. A method for probing, imaging, and analyzing RPPAs is also described. These procedures are readily adaptable to a wide range of studies of cell signaling in response to drugs and other perturbations. © 2016 by John Wiley & Sons, Inc. PMID:27622568

  15. The SNP Consortium website: past, present and future.

    PubMed

    Thorisson, Gudmundur A; Stein, Lincoln D

    2003-01-01

    The SNP Consortium website (http://snp.cshl.org) has undergone many changes since its initial conception three years ago. The database back end has been changed from the venerable ACeDB to the more scalable MySQL engine. Users can access the data via gene or single nucleotide polymorphism (SNP) keyword searches and browse or dump SNP data to textfiles. A graphical genome browsing interface shows SNPs mapped onto the genome assembly in the context of externally available gene predictions and other features. SNP allele frequency and genotype data are available via FTP-download and on individual SNP report web pages. SNP linkage maps are available for download and for browsing in a comparative map viewer. All software components of the data coordinating center (DCC) website (http://snp.cshl.org) are open source. PMID:12519964

  16. An Improved Opposition-Based Learning Particle Swarm Optimization for the Detection of SNP-SNP Interactions

    PubMed Central

    Shang, Junliang; Sun, Yan; Li, Shengjun; Liu, Jin-Xing; Zheng, Chun-Hou; Zhang, Junying

    2015-01-01

    SNP-SNP interactions have been receiving increasing attention in understanding the mechanism underlying susceptibility to complex diseases. Though many works have been done for the detection of SNP-SNP interactions, the algorithmic development is still ongoing. In this study, an improved opposition-based learning particle swarm optimization (IOBLPSO) is proposed for the detection of SNP-SNP interactions. Highlights of IOBLPSO are the introduction of three strategies, namely, opposition-based learning, dynamic inertia weight, and a postprocedure. Opposition-based learning not only enhances the global explorative ability, but also avoids premature convergence. Dynamic inertia weight allows particles to cover a wider search space when the considered SNP is likely to be a random one and converges on promising regions of the search space while capturing a highly suspected SNP. The postprocedure is used to carry out a deep search in highly suspected SNP sets. Experiments of IOBLPSO are performed on both simulation data sets and a real data set of age-related macular degeneration, results of which demonstrate that IOBLPSO is promising in detecting SNP-SNP interactions. IOBLPSO might be an alternative to existing methods for detecting SNP-SNP interactions. PMID:26236727

  17. A novel approach to analyzing fMRI and SNP data via parallel independent component analysis

    NASA Astrophysics Data System (ADS)

    Liu, Jingyu; Pearlson, Godfrey; Calhoun, Vince; Windemuth, Andreas

    2007-03-01

    There is current interest in understanding genetic influences on brain function in both the healthy and the disordered brain. Parallel independent component analysis, a new method for analyzing multimodal data, is proposed in this paper and applied to functional magnetic resonance imaging (fMRI) and a single nucleotide polymorphism (SNP) array. The method aims to identify the independent components of each modality and the relationship between the two modalities. We analyzed 92 participants, including 29 schizophrenia (SZ) patients, 13 unaffected SZ relatives, and 50 healthy controls. We found a correlation of 0.79 between one fMRI component and one SNP component. The fMRI component consists of activations in cingulate gyrus, multiple frontal gyri, and superior temporal gyrus. The related SNP component is contributed to significantly by 9 SNPs located in sets of genes, including those coding for apolipoprotein A-I, and C-III, malate dehydrogenase 1 and the gamma-aminobutyric acid alpha-2 receptor. A significant difference in the presences of this SNP component is found between the SZ group (SZ patients and their relatives) and the control group. In summary, we constructed a framework to identify the interactions between brain functional and genetic information; our findings provide new insight into understanding genetic influences on brain function in a common mental disorder.

  18. Exhaustive Genome-Wide Search for SNP-SNP Interactions Across 10 Human Diseases.

    PubMed

    Murk, William; DeWan, Andrew T

    2016-01-01

    The identification of statistical SNP-SNP interactions may help explain the genetic etiology of many human diseases, but exhaustive genome-wide searches for these interactions have been difficult, due to a lack of power in most datasets. We aimed to use data from the Resource for Genetic Epidemiology Research on Adult Health and Aging (GERA) study to search for SNP-SNP interactions associated with 10 common diseases. FastEpistasis and BOOST were used to evaluate all pairwise interactions among approximately N = 300,000 single nucleotide polymorphisms (SNPs) with minor allele frequency (MAF) ≥ 0.15, for the dichotomous outcomes of allergic rhinitis, asthma, cardiac disease, depression, dermatophytosis, type 2 diabetes, dyslipidemia, hemorrhoids, hypertensive disease, and osteoarthritis. A total of N = 45,171 subjects were included after quality control steps were applied. These data were divided into discovery and replication subsets; the discovery subset had > 80% power, under selected models, to detect genome-wide significant interactions (P < 10(-12)). Interactions were also evaluated for enrichment in particular SNP features, including functionality, prior disease relevancy, and marginal effects. No interaction in any disease was significant in both the discovery and replication subsets. Enrichment analysis suggested that, for some outcomes, interactions involving SNPs with marginal effects were more likely to be nominally replicated, compared to interactions without marginal effects. If SNP-SNP interactions play a role in the etiology of the studied conditions, they likely have weak effect sizes, involve lower-frequency variants, and/or involve complex models of interaction that are not captured well by the methods that were utilized. PMID:27185397

  19. Exhaustive Genome-Wide Search for SNP-SNP Interactions Across 10 Human Diseases

    PubMed Central

    Murk, William; DeWan, Andrew T.

    2016-01-01

    The identification of statistical SNP-SNP interactions may help explain the genetic etiology of many human diseases, but exhaustive genome-wide searches for these interactions have been difficult, due to a lack of power in most datasets. We aimed to use data from the Resource for Genetic Epidemiology Research on Adult Health and Aging (GERA) study to search for SNP-SNP interactions associated with 10 common diseases. FastEpistasis and BOOST were used to evaluate all pairwise interactions among approximately N = 300,000 single nucleotide polymorphisms (SNPs) with minor allele frequency (MAF) ≥ 0.15, for the dichotomous outcomes of allergic rhinitis, asthma, cardiac disease, depression, dermatophytosis, type 2 diabetes, dyslipidemia, hemorrhoids, hypertensive disease, and osteoarthritis. A total of N = 45,171 subjects were included after quality control steps were applied. These data were divided into discovery and replication subsets; the discovery subset had > 80% power, under selected models, to detect genome-wide significant interactions (P < 10−12). Interactions were also evaluated for enrichment in particular SNP features, including functionality, prior disease relevancy, and marginal effects. No interaction in any disease was significant in both the discovery and replication subsets. Enrichment analysis suggested that, for some outcomes, interactions involving SNPs with marginal effects were more likely to be nominally replicated, compared to interactions without marginal effects. If SNP-SNP interactions play a role in the etiology of the studied conditions, they likely have weak effect sizes, involve lower-frequency variants, and/or involve complex models of interaction that are not captured well by the methods that were utilized. PMID:27185397

  20. A Bayesian Framework for SNP Identification

    SciTech Connect

    Webb-Robertson, Bobbie-Jo M.; Havre, Susan L.; Payne, Deborah A.

    2005-07-01

    Current proteomics techniques, such as mass spectrometry, focus on protein identification, usually ignoring most types of modifications beyond post-translational modifications, with the assumption that only a small number of peptides have to be matched to a protein for a positive identification. However, not all proteins are being identified with current techniques and improved methods to locate points of mutation are becoming a necessity. In the case when single-nucleotide polymorphisms (SNPs) are observed, brute force is the most common method to locate them, quickly becoming computationally unattractive as the size of the database associated with the model organism grows. We have developed a Bayesian model for SNPs, BSNP, incorporating evolutionary information at both the nucleotide and amino acid levels. Formulating SNPs as a Bayesian inference problem allows probabilities of interest to be easily obtained, for example the probability of a specific SNP or specific type of mutation over a gene or entire genome. Three SNP databases were observed in the evaluation of the BSNP model; the first SNP database is a disease specific gene in human, hemoglobin, the second is also a disease specific gene in human, p53, and the third is a more general SNP database for multiple genes in mouse. We validate that the BSNP model assigns higher posterior probabilities to the SNPs defined in all three separate databases than can be attributed to chance under specific evolutionary information, for example the amino acid model described by Majewski and Ott in conjunction with either the four-parameter nucleotide model by Bulmer or seven-parameter nucleotide model by Majewski and Ott.

  1. ASSIsT: an automatic SNP scoring tool for in- and outbreeding species

    PubMed Central

    Di Guardo, Mario; Micheletti, Diego; Bianco, Luca; Koehorst-van Putten, Herma J. J.; Longhi, Sara; Costa, Fabrizio; Aranzana, Maria J.; Velasco, Riccardo; Arús, Pere; Troggio, Michela; van de Weg, Eric W.

    2015-01-01

    ASSIsT (Automatic SNP ScorIng Tool) is a user-friendly customized pipeline for efficient calling and filtering of SNPs from Illumina Infinium arrays, specifically devised for custom genotyping arrays. Illumina has developed an integrated software for SNP data visualization and inspection called GenomeStudio® (GS). ASSIsT builds on GS-derived data and identifies those markers that follow a bi-allelic genetic model and show reliable genotype calls. Moreover, ASSIsT re-edits SNP calls with null alleles or additional SNPs in the probe annealing site. ASSIsT can be employed in the analysis of different population types such as full-sib families and mating schemes used in the plant kingdom (backcross, F1, F2), and unrelated individuals. The final result can be directly exported in the format required by the most common software for genetic mapping and marker–trait association analysis. ASSIsT is developed in Python and runs in Windows and Linux. Availability and implementation: The software, example data sets and tutorials are freely available at http://compbiotoolbox.fmach.it/assist/. Contact: eric.vandeweg@wur.nl PMID:26249809

  2. METU-SNP: an integrated software system for SNP-complex disease association analysis.

    PubMed

    Ustünkar, Gürkan; Aydın Son, Yeşim

    2011-01-01

    Recently, there has been increasing research to discover genomic biomarkers, haplotypes, and potentially other variables that together contribute to the development of diseases. Single Nucleotide Polymorphisms (SNPs) are the most common form of genomic variations and they can represent an individual’s genetic variability in greatest detail. Genome-wide association studies (GWAS) of SNPs, high-dimensional case-control studies, are among the most promising approaches for identifying disease causing variants. METU-SNP software is a Java based integrated desktop application specifically designed for the prioritization of SNP biomarkers and the discovery of genes and pathways related to diseases via analysis of the GWAS case-control data. Outputs of METU-SNP can easily be utilized for the downstream biomarkers research to allow the prediction and the diagnosis of diseases and other personalized medical approaches. Here, we introduce and describe the system functionality and architecture of the METU-SNP. We believe that the METU-SNP will help researchers with the reliable identification of SNPs that are involved in the etiology of complex diseases, ultimately supporting the development of personalized medicine approaches and targeted drug discoveries. PMID:22156365

  3. Genome-wide SNP association-based localization of a dwarfism gene in Friesian dwarf horses.

    PubMed

    Orr, N; Back, W; Gu, J; Leegwater, P; Govindarajan, P; Conroy, J; Ducro, B; Van Arendonk, J A M; MacHugh, D E; Ennis, S; Hill, E W; Brama, P A J

    2010-12-01

    The recent completion of the horse genome and commercial availability of an equine SNP genotyping array has facilitated the mapping of disease genes. We report putative localization of the gene responsible for dwarfism, a trait in Friesian horses that is thought to have a recessive mode of inheritance, to a 2-MB region of chromosome 14 using just 10 affected animals and 10 controls. We successfully genotyped 34,429 SNPs that were tested for association with dwarfism using chi-square tests. The most significant SNP in our study, BIEC2-239376 (P(2df)=4.54 × 10(-5), P(rec)=7.74 × 10(-6)), is located close to a gene implicated in human dwarfism. Fine-mapping and resequencing analyses did not aid in further localization of the causative variant, and replication of our findings in independent sample sets will be necessary to confirm these results. PMID:21070269

  4. eSNPO: An eQTL-based SNP Ontology and SNP functional enrichment analysis platform

    PubMed Central

    Li, Jin; Wang, Limei; Jiang, Tao; Wang, Jizhe; Li, Xue; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Zhang, Ruijie; Lv, Hongchao; Guo, Maozu

    2016-01-01

    Genome-wide association studies (GWASs) have mined many common genetic variants associated with human complex traits like diseases. After that, the functional annotation and enrichment analysis of significant SNPs are important tasks. Classic methods are always based on physical positions of SNPs and genes. Expression quantitative trait loci (eQTLs) are genomic loci that contribute to variation in gene expression levels and have been proven efficient to connect SNPs and genes. In this work, we integrated the eQTL data and Gene Ontology (GO), constructed associations between SNPs and GO terms, then performed functional enrichment analysis. Finally, we constructed an eQTL-based SNP Ontology and SNP functional enrichment analysis platform. Taking Parkinson Disease (PD) as an example, the proposed platform and method are efficient. We believe eSNPO will be a useful resource for SNP functional annotation and enrichment analysis after we have got significant disease related SNPs. PMID:27470167

  5. eSNPO: An eQTL-based SNP Ontology and SNP functional enrichment analysis platform.

    PubMed

    Li, Jin; Wang, Limei; Jiang, Tao; Wang, Jizhe; Li, Xue; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Zhang, Ruijie; Lv, Hongchao; Guo, Maozu

    2016-01-01

    Genome-wide association studies (GWASs) have mined many common genetic variants associated with human complex traits like diseases. After that, the functional annotation and enrichment analysis of significant SNPs are important tasks. Classic methods are always based on physical positions of SNPs and genes. Expression quantitative trait loci (eQTLs) are genomic loci that contribute to variation in gene expression levels and have been proven efficient to connect SNPs and genes. In this work, we integrated the eQTL data and Gene Ontology (GO), constructed associations between SNPs and GO terms, then performed functional enrichment analysis. Finally, we constructed an eQTL-based SNP Ontology and SNP functional enrichment analysis platform. Taking Parkinson Disease (PD) as an example, the proposed platform and method are efficient. We believe eSNPO will be a useful resource for SNP functional annotation and enrichment analysis after we have got significant disease related SNPs. PMID:27470167

  6. SNP sets and reading ability: testing confirmation of a 10-SNP set in a population sample.

    PubMed

    Luciano, Michelle; Montgomery, Grant W; Martin, Nicholas G; Wright, Margaret J; Bates, Timothy C

    2011-06-01

    A set of 10 SNPs associated with reading ability in 7-year-olds was reported based on initial pooled analyses of 100K SNP chip data, with follow-up testing stages using pooling and individual testing. Here we examine this association in an adolescent population sample of Australian twins and siblings (N = 1177) aged 12 to 25 years. One (rs1842129) of the 10 SNPs approached significance (P = .05) but no support was found for the remaining 9 SNPs or the SNP set itself. Results indicate that these SNPs are not associated with reading ability in an Australian population. The results are interpreted as supporting use of much larger SNP sets in common disorders where effects are small. PMID:21623652

  7. Snat: a SNP annotation tool for bovine by integrating various sources of genomic information

    PubMed Central

    2011-01-01

    Background Most recently, with maturing of bovine genome sequencing and high throughput SNP genotyping technologies, a large number of significant SNPs associated with economic important traits can be identified by genome-wide association studies (GWAS). To further determine true association findings in GWAS, the common strategy is to sift out most promising SNPs for follow-up replication studies. Hence it is crucial to explore the functional significance of the candidate SNPs in order to screen and select the potential functional ones. To systematically prioritize these statistically significant SNPs and facilitate follow-up replication studies, we developed a bovine SNP annotation tool (Snat) based on a web interface. Results With Snat, various sources of genomic information are integrated and retrieved from several leading online databases, including SNP information from dbSNP, gene information from Entrez Gene, protein features from UniProt, linkage information from AnimalQTLdb, conserved elements from UCSC Genome Browser Database and gene functions from Gene Ontology (GO), KEGG PATHWAY and Online Mendelian Inheritance in Animals (OMIA). Snat provides two different applications, including a CGI-based web utility and a command-line version, to access the integrated database, target any single nucleotide loci of interest and perform multi-level functional annotations. For further validation of the practical significance of our study, SNPs involved in two commercial bovine SNP chips, i.e., the Affymetrix Bovine 10K chip array and the Illumina 50K chip array, have been annotated by Snat, and the corresponding outputs can be directly downloaded from Snat website. Furthermore, a real dataset involving 20 identified SNPs associated with milk yield in our recent GWAS was employed to demonstrate the practical significance of Snat. Conclusions To our best knowledge, Snat is one of first tools focusing on SNP annotation for livestock. Snat confers researchers with a

  8. Genome Rearrangements Detected by SNP Microarrays in Individuals with Intellectual Disability Referred with Possible Williams Syndrome

    PubMed Central

    Pani, Ariel M.; Hobart, Holly H.; Morris, Colleen A.; Mervis, Carolyn B.; Bray-Ward, Patricia; Kimberley, Kendra W.; Rios, Cecilia M.; Clark, Robin C.; Gulbronson, Maricela D.; Gowans, Gordon C.; Gregg, Ronald G.

    2010-01-01

    Background Intellectual disability (ID) affects 2–3% of the population and may occur with or without multiple congenital anomalies (MCA) or other medical conditions. Established genetic syndromes and visible chromosome abnormalities account for a substantial percentage of ID diagnoses, although for ∼50% the molecular etiology is unknown. Individuals with features suggestive of various syndromes but lacking their associated genetic anomalies pose a formidable clinical challenge. With the advent of microarray techniques, submicroscopic genome alterations not associated with known syndromes are emerging as a significant cause of ID and MCA. Methodology/Principal Findings High-density SNP microarrays were used to determine genome wide copy number in 42 individuals: 7 with confirmed alterations in the WS region but atypical clinical phenotypes, 31 with ID and/or MCA, and 4 controls. One individual from the first group had the most telomeric gene in the WS critical region deleted along with 2 Mb of flanking sequence. A second person had the classic WS deletion and a rearrangement on chromosome 5p within the Cri du Chat syndrome (OMIM:123450) region. Six individuals from the ID/MCA group had large rearrangements (3 deletions, 3 duplications), one of whom had a large inversion associated with a deletion that was not detected by the SNP arrays. Conclusions/Significance Combining SNP microarray analyses and qPCR allowed us to clone and sequence 21 deletion breakpoints in individuals with atypical deletions in the WS region and/or ID or MCA. Comparison of these breakpoints to databases of genomic variation revealed that 52% occurred in regions harboring structural variants in the general population. For two probands the genomic alterations were flanked by segmental duplications, which frequently mediate recurrent genome rearrangements; these may represent new genomic disorders. While SNP arrays and related technologies can identify potentially pathogenic deletions and

  9. Smarter clustering methods for SNP genotype calling

    PubMed Central

    Lin, Yan; Tseng, George C.; Cheong, Soo Yeon; Bean, Lora J. H.; Sherman, Stephanie L.; Feingold, Eleanor

    2008-01-01

    Motivation: Most genotyping technologies for single nucleotide polymorphism (SNP) markers use standard clustering methods to ‘call’ the SNP genotypes. These methods are not always optimal in distinguishing the genotype clusters of a SNP because they do not take advantage of specific features of the genotype calling problem. In particular, when family data are available, pedigree information is ignored. Furthermore, prior information about the distribution of the measurements for each cluster can be used to choose an appropriate model-based clustering method and can significantly improve the genotype calls. One special genotyping problem that has never been discussed in the literature is that of genotyping of trisomic individuals, such as individuals with Down syndrome. Calling trisomic genotypes is a more complicated problem, and the addition of external information becomes very important. Results: In this article, we discuss the impact of incorporating external information into clustering algorithms to call the genotypes for both disomic and trisomic data. We also propose two new methods to call genotypes using family data. One is a modification of the K-means method and uses the pedigree information by updating all members of a family together. The other is a likelihood-based method that combines the Gaussian or beta-mixture model with pedigree information. We compare the performance of these two methods and some other existing methods using simulation studies. We also compare the performance of these methods on a real dataset generated by the Illumina platform (www.illumina.com). Availability: The R code for the family-based genotype calling methods (SNPCaller) is available to be downloaded from the following website: http://watson.hgen.pitt.edu/register. Contact: liny@upmc.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:18826959

  10. Linkage mapping bovine EST-based SNP

    PubMed Central

    Snelling, Warren M; Casas, Eduardo; Stone, Roger T; Keele, John W; Harhay, Gregory P; Bennett, Gary L; Smith, Timothy PL

    2005-01-01

    Background Existing linkage maps of the bovine genome primarily contain anonymous microsatellite markers. These maps have proved valuable for mapping quantitative trait loci (QTL) to broad regions of the genome, but more closely spaced markers are needed to fine-map QTL, and markers associated with genes and annotated sequence are needed to identify genes and sequence variation that may explain QTL. Results Bovine expressed sequence tag (EST) and bacterial artificial chromosome (BAC)sequence data were used to develop 918 single nucleotide polymorphism (SNP) markers to map genes on the bovine linkage map. DNA of sires from the MARC reference population was used to detect SNPs, and progeny and mates of heterozygous sires were genotyped. Chromosome assignments for 861 SNPs were determined by twopoint analysis, and positions for 735 SNPs were established by multipoint analyses. Linkage maps of bovine autosomes with these SNPs represent 4585 markers in 2475 positions spanning 3058 cM . Markers include 3612 microsatellites, 913 SNPs and 60 other markers. Mean separation between marker positions is 1.2 cM. New SNP markers appear in 511 positions, with mean separation of 4.7 cM. Multi-allelic markers, mostly microsatellites, had a mean (maximum) of 216 (366) informative meioses, and a mean 3-lod confidence interval of 3.6 cM Bi-allelic markers, including SNP and other marker types, had a mean (maximum) of 55 (191) informative meioses, and were placed within a mean 8.5 cM 3-lod confidence interval. Homologous human sequences were identified for 1159 markers, including 582 newly developed and mapped SNP. Conclusion Addition of these EST- and BAC-based SNPs to the bovine linkage map not only increases marker density, but provides connections to gene-rich physical maps, including annotated human sequence. The map provides a resource for fine-mapping quantitative trait loci and identification of positional candidate genes, and can be integrated with other data to guide and

  11. pfSNP: An integrated potentially functional SNP resource that facilitates hypotheses generation through knowledge syntheses.

    PubMed

    Wang, Jingbo; Ronaghi, Mostafa; Chong, Samuel S; Lee, Caroline G L

    2011-01-01

    Currently, >14,000,000 single nucleotide polymorphisms (SNPs) are reported. Identifying phenotype-affecting SNPs among these many SNPs pose significant challenges. Although several Web resources are available that can inform about the functionality of SNPs, these resources are mainly annotation databases and are not very comprehensive. In this article, we present a comprehensive, well-annotated, integrated pfSNP (potentially functional SNPs) Web resource (http://pfs.nus.edu.sg/), which is aimed to facilitate better hypothesis generation through knowledge syntheses mediated by better data integration and a user-friendly Web interface. pfSNP integrates >40 different algorithms/resources to interrogate >14,000,000 SNPs from the dbSNP database for SNPs of potential functional significance based on previous published reports, inferred potential functionality from genetic approaches as well as predicted potential functionality from sequence motifs. Its query interface has the user-friendly "auto-complete, prompt-as-you-type" feature and is highly customizable, facilitating different combination of queries using Boolean-logic. Additionally, to facilitate better understanding of the results and aid in hypotheses generation, gene/pathway-level information with text clouds highlighting enriched tissues/pathways as well as detailed-related information are also provided on the results page. Hence, the pfSNP resource will be of great interest to scientists focusing on association studies as well as those interested to experimentally address the functionality of SNPs. PMID:20672376

  12. SNP marker detection and genotyping in tilapia.

    PubMed

    Van Bers, N E M; Crooijmans, R P M A; Groenen, M A M; Dibbits, B W; Komen, J

    2012-09-01

    We have generated a unique resource consisting of nearly 175 000 short contig sequences and 3569 SNP markers from the widely cultured GIFT (Genetically Improved Farmed Tilapia) strain of Nile tilapia (Oreochromis niloticus). In total, 384 SNPs were selected to monitor the wider applicability of the SNPs by genotyping tilapia individuals from different strains and different geographical locations. In all strains and species tested (O. niloticus, O. aureus and O. mossambicus), the genotyping assay was working for a similar number of SNPs (288-305 SNPs). The actual number of polymorphic SNPs was, as expected, highest for individuals from the GIFT population (255 SNPs). In the individuals from an Egyptian strain and in individuals caught in the wild in the basin of the river Volta, 197 and 163 SNPs were polymorphic, respectively. A pairwise calculation of Nei's genetic distance allowed the discrimination of the individual strains and species based on the genotypes determined with the SNP set. We expect that this set will be widely applicable for use in tilapia aquaculture, e.g. for pedigree reconstruction. In addition, this set is currently used for assaying the genetic diversity of native Nile tilapia in areas where tilapia is, or will be, introduced in aquaculture projects. This allows the tracing of escapees from aquaculture and the monitoring of effects of introgression and hybridization. PMID:22524158

  13. Comparative Analysis of CNV Calling Algorithms: Literature Survey and a Case Study Using Bovine High-Density SNP Data

    PubMed Central

    Xu, Lingyang; Hou, Yali; Bickhart, Derek M.; Song, Jiuzhou; Liu, George E.

    2013-01-01

    Copy number variations (CNVs) are gains and losses of genomic sequence between two individuals of a species when compared to a reference genome. The data from single nucleotide polymorphism (SNP) microarrays are now routinely used for genotyping, but they also can be utilized for copy number detection. Substantial progress has been made in array design and CNV calling algorithms and at least 10 comparison studies in humans have been published to assess them. In this review, we first survey the literature on existing microarray platforms and CNV calling algorithms. We then examine a number of CNV calling tools to evaluate their impacts using bovine high-density SNP data. Large incongruities in the results from different CNV calling tools highlight the need for standardizing array data collection, quality assessment and experimental validation. Only after careful experimental design and rigorous data filtering can the impacts of CNVs on both normal phenotypic variability and disease susceptibility be fully revealed.

  14. Large-Scale SNP Marker Development and Genotyping in Oat

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In this study, our goals are to develop genome-wide SNP markers using next generation sequencing technologies and to apply a highly parallel SNP genotyping system developed by Illumina for genetics and breeding applications in oat. The large amount of DNA sequence sources generated from cDNAs and Di...

  15. Accelerating genetic improvement with SNP chips and DNA sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The development of high-density single nucleotide polymorphism (SNP) assays is expected to have a profound impact on genetic progress in the U.S. dairy industry. In the 16 months since its initial availability, the Illumina BovineSNP50 BeadChip has been used to genotype nearly 20,000 Holsteins. Thes...

  16. Atomic Force Microscopy for DNA SNP Identification

    NASA Astrophysics Data System (ADS)

    Valbusa, Ugo; Ierardi, Vincenzo

    The knowledge of the effects of single-nucleotide polymorphisms (SNPs) in the human genome greatly contributes to better comprehension of the relation between genetic factors and diseases. Sequence analysis of genomic DNA in different individuals reveals positions where variations that involve individual base substitutions can occur. Single-nucleotide polymorphisms are highly abundant and can have different consequences at phenotypic level. Several attempts were made to apply atomic force microscopy (AFM) to detect and map SNP sites in DNA strands. The most promising approach is the study of DNA mutations producing heteroduplex DNA strands and identifying the mismatches by means of a protein that labels the mismatches. MutS is a protein that is part of a well-known complex of mismatch repair, which initiates the process of repairing when the MutS binds to the mismatched DNA filament. The position of MutS on the DNA filament can be easily recorded by means of AFM imaging.

  17. SNP genotyping by combination of 192-well MADGE, ARMS and computerized gel image analysis.

    PubMed

    O'Dell, S D; Gaunt, T R; Day, I N

    2000-09-01

    A new modification of the microplate array diagonal gel electrophoresis (MADGE) system accommodates the dual amplification refractory mutation system (ARMS) products of 96 samples on one 192-well gel. Simultaneous electrophoresis of a number of horizontal ARMS-MADGE gels achieves high throughput. Gels are imaged digitally, here using the FluorImager 595 fluorescent scanning system. Customized software by Phoretix enables rapid computerized calling of band patterns in ARMS-MADGE arrays, in which the two wells receiving a pair of allele-specific assays for a single template are juxtaposed to form one virtual track, with genotype data exported directly into Microsoft Excel for statistical analysis. An ARMS assay of the A/T base change at the -23/HphI RFLP in the insulin gene promoter, which initiates from 2.5 ng template DNA, was used here to demonstrate this improved general approach for population SNP analyses. PMID:10997263

  18. Heritability of Recurrent Exertional Rhabdomyolysis in Standardbred and Thoroughbred Racehorses Derived From SNP Genotyping Data.

    PubMed

    Norton, Elaine M; Mickelson, James R; Binns, Matthew M; Blott, Sarah C; Caputo, Paul; Isgren, Cajsa M; McCoy, Annette M; Moore, Alison; Piercy, Richard J; Swinburne, June E; Vaudin, Mark; McCue, Molly E

    2016-11-01

    Recurrent exertional rhabdomyolysis (RER) in Thoroughbred and Standardbred racehorses is characterized by episodes of muscle rigidity and cell damage that often recur upon strenuous exercise. The objective was to evaluate the importance of genetic factors in RER by obtaining an unbiased estimate of heritability in cohorts of unrelated Thoroughbred and Standardbred racehorses. Four hundred ninety-one Thoroughbred and 196 Standardbred racehorses were genotyped with the 54K or 74K SNP genotyping arrays. Heritability was calculated from genome-wide SNP data with a mixed linear and Bayesian model, utilizing the standard genetic relationship matrix (GRM). Both the mixed linear and Bayesian models estimated heritability of RER in Thoroughbreds to be approximately 0.34 and in Standardbred racehorses to be approximately 0.45 after adjusting for disease prevalence and sex. To account for potential differences in the genetic architecture of the underlying causal variants, heritability estimates were adjusted based on linkage disequilibrium weighted kinship matrix, minor allele frequency and variant effect size, yielding heritability estimates that ranged between 0.41-0.46 (Thoroughbreds) and 0.39-0.49 (Standardbreds). In conclusion, between 34-46% and 39-49% of the variance in RER susceptibility in Thoroughbred and Standardbred racehorses, respectively, can be explained by the SNPs present on these 2 genotyping arrays, indicating that RER is moderately heritable. These data provide further rationale for the investigation of genetic mutations associated with RER susceptibility. PMID:27489252

  19. SNP-SNP interaction analysis of NF-κB signaling pathway on breast cancer survival.

    PubMed

    Jamshidi, Maral; Fagerholm, Rainer; Khan, Sofia; Aittomäki, Kristiina; Czene, Kamila; Darabi, Hatef; Li, Jingmei; Andrulis, Irene L; Chang-Claude, Jenny; Devilee, Peter; Fasching, Peter A; Michailidou, Kyriaki; Bolla, Manjeet K; Dennis, Joe; Wang, Qin; Guo, Qi; Rhenius, Valerie; Cornelissen, Sten; Rudolph, Anja; Knight, Julia A; Loehberg, Christian R; Burwinkel, Barbara; Marme, Frederik; Hopper, John L; Southey, Melissa C; Bojesen, Stig E; Flyger, Henrik; Brenner, Hermann; Holleczek, Bernd; Margolin, Sara; Mannermaa, Arto; Kosma, Veli-Matti; Van Dyck, Laurien; Nevelsteen, Ines; Couch, Fergus J; Olson, Janet E; Giles, Graham G; McLean, Catriona; Haiman, Christopher A; Henderson, Brian E; Winqvist, Robert; Pylkäs, Katri; Tollenaar, Rob A E M; García-Closas, Montserrat; Figueroa, Jonine; Hooning, Maartje J; Martens, John W M; Cox, Angela; Cross, Simon S; Simard, Jacques; Dunning, Alison M; Easton, Douglas F; Pharoah, Paul D P; Hall, Per; Blomqvist, Carl; Schmidt, Marjanka K; Nevanlinna, Heli

    2015-11-10

    In breast cancer, constitutive activation of NF-κB has been reported, however, the impact of genetic variation of the pathway on patient prognosis has been little studied. Furthermore, a combination of genetic variants, rather than single polymorphisms, may affect disease prognosis. Here, in an extensive dataset (n = 30,431) from the Breast Cancer Association Consortium, we investigated the association of 917 SNPs in 75 genes in the NF-κB pathway with breast cancer prognosis. We explored SNP-SNP interactions on survival using the likelihood-ratio test comparing multivariate Cox' regression models of SNP pairs without and with an interaction term. We found two interacting pairs associating with prognosis: patients simultaneously homozygous for the rare alleles of rs5996080 and rs7973914 had worse survival (HRinteraction 6.98, 95% CI=3.3-14.4, P=1.42E-07), and patients carrying at least one rare allele for rs17243893 and rs57890595 had better survival (HRinteraction 0.51, 95% CI=0.3-0.6, P = 2.19E-05). Based on in silico functional analyses and literature, we speculate that the rs5996080 and rs7973914 loci may affect the BAFFR and TNFR1/TNFR3 receptors and breast cancer survival, possibly by disturbing both the canonical and non-canonical NF-κB pathways or their dynamics, whereas, rs17243893-rs57890595 interaction on survival may be mediated through TRAF2-TRAIL-R4 interplay. These results warrant further validation and functional analyses. PMID:26317411

  20. SNP-SNP interaction analysis of NF-κB signaling pathway on breast cancer survival

    PubMed Central

    Jamshidi, Maral; Fagerholm, Rainer; Khan, Sofia; Aittomäki, Kristiina; Czene, Kamila; Darabi, Hatef; Li, Jingmei; Andrulis, Irene L.; Chang-Claude, Jenny; Devilee, Peter; Fasching, Peter A.; Michailidou, Kyriaki; Bolla, Manjeet K.; Dennis, Joe; Wang, Qin; Guo, Qi; Rhenius, Valerie; Cornelissen, Sten; Rudolph, Anja; Knight, Julia A.; Loehberg, Christian R.; Burwinkel, Barbara; Marme, Frederik; Hopper, John L.; Southey, Melissa C.; Bojesen, Stig E.; Flyger, Henrik; Brenner, Hermann; Holleczek, Bernd; Margolin, Sara; Mannermaa, Arto; Kosma, Veli-Matti; Dyck, Laurien Van; Nevelsteen, Ines; Couch, Fergus J.; Olson, Janet E.; Giles, Graham G.; McLean, Catriona; Haiman, Christopher A.; Henderson, Brian E.; Winqvist, Robert; Pylkäs, Katri; Tollenaar, Rob A.E.M.; García-Closas, Montserrat; Figueroa, Jonine; Hooning, Maartje J.; Martens, John W.M.; Cox, Angela; Cross, Simon S.; Simard, Jacques; Dunning, Alison M.; Easton, Douglas F.; Pharoah, Paul D.P.; Hall, Per; Blomqvist, Carl; Schmidt, Marjanka K.; Nevanlinna, Heli

    2015-01-01

    In breast cancer, constitutive activation of NF-κB has been reported, however, the impact of genetic variation of the pathway on patient prognosis has been little studied. Furthermore, a combination of genetic variants, rather than single polymorphisms, may affect disease prognosis. Here, in an extensive dataset (n = 30,431) from the Breast Cancer Association Consortium, we investigated the association of 917 SNPs in 75 genes in the NF-κB pathway with breast cancer prognosis. We explored SNP-SNP interactions on survival using the likelihood-ratio test comparing multivariate Cox’ regression models of SNP pairs without and with an interaction term. We found two interacting pairs associating with prognosis: patients simultaneously homozygous for the rare alleles of rs5996080 and rs7973914 had worse survival (HRinteraction 6.98, 95% CI=3.3-14.4, P = 1.42E-07), and patients carrying at least one rare allele for rs17243893 and rs57890595 had better survival (HRinteraction 0.51, 95% CI=0.3-0.6, P = 2.19E-05). Based on in silico functional analyses and literature, we speculate that the rs5996080 and rs7973914 loci may affect the BAFFR and TNFR1/TNFR3 receptors and breast cancer survival, possibly by disturbing both the canonical and non-canonical NF-κB pathways or their dynamics, whereas, rs17243893-rs57890595 interaction on survival may be mediated through TRAF2-TRAIL-R4 interplay. These results warrant further validation and functional analyses. PMID:26317411

  1. Frequent detection of parental consanguinity in children with developmental disorders by a combined CGH and SNP microarray

    PubMed Central

    2013-01-01

    Background Genomic microarrays have been used as the first-tier cytogenetic diagnostic test for patients with developmental delay/intellectual disability, autism spectrum disorders and/or multiple congenital anomalies. The use of SNP arrays has revealed regions of homozygosity in the genome which can lead to identification of uniparental disomy and parental consanguinity in addition to copy number variations. Consanguinity is associated with an increased risk of birth defects and autosomal recessive disorders. However, the frequency of parental consanguinity in children with developmental disabilities is unknown, and consanguineous couples may not be identified during doctor’s visit or genetic counseling without microarray. Results We studied 607 proband pediatric patients referred for developmental disorders using a 4 × 180 K array containing both CGH and SNP probes. Using 720, 360, 180, and 90 Mb as the expected sizes of homozygosity for an estimated coefficient of inbreeding (F) 1/4, 1/8, 1/16, 1/32, parental consanguinity was detected in 21cases (3.46%). Conclusion Parental consanguinity is not uncommon in children with developmental problems in our study population, and can be identified by use of a combined CGH and SNP chromosome microarray. Identification of parental consanguinity in such cases can be important for further diagnostic testing. PMID:24053112

  2. The Perils of SNP Microarray Testing: Uncovering Unexpected Consanguinity

    PubMed Central

    Tarini, Beth A.; Konczal, Laura; Goldenberg, Aaron J.; Goldman, Edward B.; McCandless, Shawn E.

    2013-01-01

    Background While single nucleotide polymorphism (SNP) chromosomal microarrays identify areas of small genetic deletions/duplications, they can also reveal regions of homozygosity indicative of consanguinity. As more non-geneticists order SNP microarrays, they must prepare for the potential ethical, legal and social issues that result from revelation of unanticipated consanguinity. Patient An infant with multiple congenital anomalies underwent SNP microarray testing. Results The results of the SNP microarray revealed several large regions of homozygosity that indicated identity by descent most consistent with a second or third degree relative mating (e.g., uncle/ niece, half brother/sister, first cousins). Mother was not aware of the test's potential to reveal consanguinity. When informed of the test results, she reluctantly admitted to being raped by her half-brother around the time of conception. Conclusions During the pre-testing consent process, providers should inform parents that SNP microarray testing could reveal consanguinity. Providers must also understand the psychological implications, as well as the legal and moral obligations, that accompany SNP microarray results that indicate consanguinity. PMID:23827427

  3. Virtual karyotyping with SNP microarrays reduces uncertainty in the diagnosis of renal epithelial tumors

    PubMed Central

    Hagenkord, Jill M; Parwani, Anil V; Lyons-Weiler, Maureen A; Alvarez, Karla; Amato, Robert; Gatalica, Zoran; Gonzalez-Berjon, Jose M; Peterson, Leif; Dhir, Rajiv; Monzon, Federico A

    2008-01-01

    Background Renal epithelial tumors are morphologically, biologically, and clinically heterogeneous. Different morphologic subtypes require specific management due to markedly different prognosis and response to therapy. Each common subtype has characteristic chromosomal gains and losses, including some with prognostic value. However, copy number information has not been readily accessible for clinical purposes and thus has not been routinely used in the diagnostic evaluation of these tumors. This information can be useful for classification of tumors with complex or challenging morphology. 'Virtual karyotypes' generated using SNP arrays can readily detect characteristic chromosomal lesions in paraffin embedded renal tumors and can be used to correctly categorize the common subtypes with performance characteristics that are amenable for routine clinical use. Methods To investigate the use of virtual karyotypes for diagnostically challenging renal epithelial tumors, we evaluated 25 archived renal neoplasms where sub-classification could not be definitively rendered based on morphology and other ancillary studies. We generated virtual karyotypes with the Affymetrix 10 K 2.0 mapping array platform and identified the presence of genomic lesions across all 22 autosomes. Results In 91% of challenging cases the virtual karyotype unambiguously detected the presence or absence of chromosomal aberrations characteristic of one of the common subtypes of renal epithelial tumors, while immunohistochemistry and fluorescent in situ hybridization had no or limited utility in the diagnosis of these tumors. Conclusion These results show that virtual karyotypes generated by SNP arrays can be used as a practical ancillary study for the classification of renal epithelial tumors with complex or ambiguous morphology. PMID:18990225

  4. GACT: a Genome build and Allele definition Conversion Tool for SNP imputation and meta-analysis in genetic association studies

    PubMed Central

    2014-01-01

    Background Genome-wide association studies (GWAS) have successfully identified genes associated with complex human diseases. Although much of the heritability remains unexplained, combining single nucleotide polymorphism (SNP) genotypes from multiple studies for meta-analysis will increase the statistical power to identify new disease-associated variants. Meta-analysis requires same allele definition (nomenclature) and genome build among individual studies. Similarly, imputation, commonly-used prior to meta-analysis, requires the same consistency. However, the genotypes from various GWAS are generated using different genotyping platforms, arrays or SNP-calling approaches, resulting in use of different genome builds and allele definitions. Incorrect assumptions of identical allele definition among combined GWAS lead to a large portion of discarded genotypes or incorrect association findings. There is no published tool that predicts and converts among all major allele definitions. Results In this study, we have developed a tool, GACT, which stands for Genome build and Allele definition Conversion Tool, that predicts and inter-converts between any of the common SNP allele definitions and between the major genome builds. In addition, we assessed several factors that may affect imputation quality, and our results indicated that inclusion of singletons in the reference had detrimental effects while ambiguous SNPs had no measurable effect. Unexpectedly, exclusion of genotypes with missing rate > 0.001 (40% of study SNPs) showed no significant decrease of imputation quality (even significantly higher when compared to the imputation with singletons in the reference), especially for rare SNPs. Conclusion GACT is a new, powerful, and user-friendly tool with both command-line and interactive online versions that can accurately predict, and convert between any of the common allele definitions and between genome builds for genome-wide meta-analysis and imputation of

  5. Magnetic arrays

    DOEpatents

    Trumper, D.L.; Kim, W.; Williams, M.E.

    1997-05-20

    Electromagnet arrays are disclosed which can provide selected field patterns in either two or three dimensions, and in particular, which can provide single-sided field patterns in two or three dimensions. These features are achieved by providing arrays which have current densities that vary in the windings both parallel to the array and in the direction of array thickness. 12 figs.

  6. Magnetic arrays

    SciTech Connect

    Trumper, David L.; Kim, Won-jong; Williams, Mark E.

    1997-05-20

    Electromagnet arrays which can provide selected field patterns in either two or three dimensions, and in particular, which can provide single-sided field patterns in two or three dimensions. These features are achieved by providing arrays which have current densities that vary in the windings both parallel to the array and in the direction of array thickness.

  7. Whole-Genome Analysis of Diversity and SNP-Major Gene Association in Peach Germplasm

    PubMed Central

    Micheletti, Diego; Dettori, Maria Teresa; Micali, Sabrina; Aramini, Valeria; Pacheco, Igor; Da Silva Linge, Cassia; Foschi, Stefano; Banchi, Elisa; Barreneche, Teresa; Quilot-Turion, Bénédicte; Lambert, Patrick; Pascal, Thierry; Iglesias, Ignasi; Carbó, Joaquim; Wang, Li-rong; Ma, Rui-juan; Li, Xiong-wei; Gao, Zhong-shan; Nazzicari, Nelson; Troggio, Michela; Bassi, Daniele; Rossini, Laura; Verde, Ignazio; Laurens, François; Arús, Pere; Aranzana, Maria José

    2015-01-01

    Peach was domesticated in China more than four millennia ago and from there it spread world-wide. Since the middle of the last century, peach breeding programs have been very dynamic generating hundreds of new commercial varieties, however, in most cases such varieties derive from a limited collection of parental lines (founders). This is one reason for the observed low levels of variability of the commercial gene pool, implying that knowledge of the extent and distribution of genetic variability in peach is critical to allow the choice of adequate parents to confer enhanced productivity, adaptation and quality to improved varieties. With this aim we genotyped 1,580 peach accessions (including a few closely related Prunus species) maintained and phenotyped in five germplasm collections (four European and one Chinese) with the International Peach SNP Consortium 9K SNP peach array. The study of population structure revealed the subdivision of the panel in three main populations, one mainly made up of Occidental varieties from breeding programs (POP1OCB), one of Occidental landraces (POP2OCT) and the third of Oriental accessions (POP3OR). Analysis of linkage disequilibrium (LD) identified differential patterns of genome-wide LD blocks in each of the populations. Phenotypic data for seven monogenic traits were integrated in a genome-wide association study (GWAS). The significantly associated SNPs were always in the regions predicted by linkage analysis, forming haplotypes of markers. These diagnostic haplotypes could be used for marker-assisted selection (MAS) in modern breeding programs. PMID:26352671

  8. DoGSD: the dog and wolf genome SNP database.

    PubMed

    Bai, Bing; Zhao, Wen-Ming; Tang, Bi-Xia; Wang, Yan-Qing; Wang, Lu; Zhang, Zhang; Yang, He-Chuan; Liu, Yan-Hu; Zhu, Jun-Wei; Irwin, David M; Wang, Guo-Dong; Zhang, Ya-Ping

    2015-01-01

    The rapid advancement of next-generation sequencing technology has generated a deluge of genomic data from domesticated dogs and their wild ancestor, grey wolves, which have simultaneously broadened our understanding of domestication and diseases that are shared by humans and dogs. To address the scarcity of single nucleotide polymorphism (SNP) data provided by authorized databases and to make SNP data more easily/friendly usable and available, we propose DoGSD (http://dogsd.big.ac.cn), the first canidae-specific database which focuses on whole genome SNP data from domesticated dogs and grey wolves. The DoGSD is a web-based, open-access resource comprising ∼ 19 million high-quality whole-genome SNPs. In addition to the dbSNP data set (build 139), DoGSD incorporates a comprehensive collection of SNPs from two newly sequenced samples (1 wolf and 1 dog) and collected SNPs from three latest dog/wolf genetic studies (7 wolves and 68 dogs), which were taken together for analysis with the population genetic statistics, Fst. In addition, DoGSD integrates some closely related information including SNP annotation, summary lists of SNPs located in genes, synonymous and non-synonymous SNPs, sampling location and breed information. All these features make DoGSD a useful resource for in-depth analysis in dog-/wolf-related studies. PMID:25404132

  9. DoGSD: the dog and wolf genome SNP database

    PubMed Central

    Bai, Bing; Zhao, Wen-Ming; Tang, Bi-Xia; Wang, Yan-Qing; Wang, Lu; Zhang, Zhang; Yang, He-Chuan; Liu, Yan-Hu; Zhu, Jun-Wei; Irwin, David M.; Wang, Guo-Dong; Zhang, Ya-Ping

    2015-01-01

    The rapid advancement of next-generation sequencing technology has generated a deluge of genomic data from domesticated dogs and their wild ancestor, grey wolves, which have simultaneously broadened our understanding of domestication and diseases that are shared by humans and dogs. To address the scarcity of single nucleotide polymorphism (SNP) data provided by authorized databases and to make SNP data more easily/friendly usable and available, we propose DoGSD (http://dogsd.big.ac.cn), the first canidae-specific database which focuses on whole genome SNP data from domesticated dogs and grey wolves. The DoGSD is a web-based, open-access resource comprising ∼19 million high-quality whole-genome SNPs. In addition to the dbSNP data set (build 139), DoGSD incorporates a comprehensive collection of SNPs from two newly sequenced samples (1 wolf and 1 dog) and collected SNPs from three latest dog/wolf genetic studies (7 wolves and 68 dogs), which were taken together for analysis with the population genetic statistics, Fst. In addition, DoGSD integrates some closely related information including SNP annotation, summary lists of SNPs located in genes, synonymous and non-synonymous SNPs, sampling location and breed information. All these features make DoGSD a useful resource for in-depth analysis in dog-/wolf-related studies. PMID:25404132

  10. Substantial SNP-based heritability estimates for working memory performance

    PubMed Central

    Vogler, C; Gschwind, L; Coynel, D; Freytag, V; Milnik, A; Egli, T; Heck, A; de Quervain, D J-F; Papassotiropoulos, A

    2014-01-01

    Working memory (WM) is an important endophenotype in neuropsychiatric research and its use in genetic association studies is thought to be a promising approach to increase our understanding of psychiatric disease. As for any genetically complex trait, demonstration of sufficient heritability within the specific study context is a prerequisite for conducting genetic studies of that trait. Recently developed methods allow estimating trait heritability using sets of common genetic markers from genome-wide association study (GWAS) data in samples of unrelated individuals. Here we present single-nucleotide polymorphism (SNP)-based heritability estimates (h2SNP) for a WM phenotype. A Caucasian sample comprising a total of N=2298 healthy and young individuals was subjected to an N-back WM task. We calculated the genetic relationship between all individuals on the basis of genome-wide SNP data and performed restricted maximum likelihood analyses for variance component estimation to derive the h2SNP estimates. Heritability estimates for three 2-back derived WM performance measures based on all autosomal chromosomes ranged between 31 and 41%, indicating a substantial SNP-based heritability for WM traits. These results indicate that common genetic factors account for a prominent part of the phenotypic variation in WM performance. Hence, the application of GWAS on WM phenotypes is a valid method to identify the molecular underpinnings of WM. PMID:25203169

  11. Conditions for the validity of SNP-based heritability estimation

    PubMed Central

    2014-01-01

    The heritability of a trait (h2) is the proportion of its population variance caused by genetic differences, and estimates of this parameter are important for interpreting the results of genome-wide association studies (GWAS). In recent years, researchers have adopted a novel method for estimating a lower bound on heritability directly from GWAS data that uses realized genetic similarities between nominally unrelated individuals. The quantity estimated by this method is purported to be the contribution to heritability that could in principle be recovered from association studies employing the given panel of SNPs (hSNP2). Thus far, the validity of this approach has mostly been tested empirically. Here, we provide a mathematical explication and show that the method should remain a robust means of obtaining hSNP2 SNP under circumstances wider than those under which it has so far been derived. PMID:24744256

  12. A robust SNP barcode for typing Mycobacterium tuberculosis complex strains

    PubMed Central

    Coll, Francesc; McNerney, Ruth; Guerra-Assunção, José Afonso; Glynn, Judith R.; Perdigão, João; Viveiros, Miguel; Portugal, Isabel; Pain, Arnab; Martin, Nigel; Clark, Taane G.

    2014-01-01

    Strain-specific genomic diversity in the Mycobacterium tuberculosis complex (MTBC) is an important factor in pathogenesis that may affect virulence, transmissibility, host response and emergence of drug resistance. Several systems have been proposed to classify MTBC strains into distinct lineages and families. Here, we investigate single-nucleotide polymorphisms (SNPs) as robust (stable) markers of genetic variation for phylogenetic analysis. We identify ~92k SNP across a global collection of 1,601 genomes. The SNP-based phylogeny is consistent with the gold-standard regions of difference (RD) classification system. Of the ~7k strain-specific SNPs identified, 62 markers are proposed to discriminate known circulating strains. This SNP-based barcode is the first to cover all main lineages, and classifies a greater number of sublineages than current alternatives. It may be used to classify clinical isolates to evaluate tools to control the disease, including therapeutics and vaccines whose effectiveness may vary by strain type. PMID:25176035

  13. Temple syndrome: A patient with maternal hetero-UPD14, mixed iso- and hetero-disomy detected by SNP microarray typing of patient-father duos.

    PubMed

    Shin, Eun-Hye; Cho, Eunhae; Lee, Cha Gon

    2016-08-01

    Temple syndrome (TS, MIM 616222) is an imprinting disorder involving genes within the imprinted region of chromosome 14q32. TS is a genetically complex disorder, which is associated with maternal uniparental disomy of chromosome 14 (UPD14), paternal deletions on chromosome 14, or loss of methylation at the intergenic differentially methylated region (IG-DMR). Here, we describe the case of a patient with maternal hetero-UPD14, mixed iso-/hetero-disomy mechanism identified by a single nucleotide polymorphism (SNP) array analysis of patient-father duos study. The phenotype of our case is similarities to Prader-Willi syndrome (PWS) during infancy and to Russell-Silver syndrome (RSS) during childhood. This SNP array appears to be an effective initial screening tool for patients with nonspecific clinical features suggestive of chromosomal disorders. PMID:26867509

  14. A 48 SNP set for grapevine cultivar identification

    PubMed Central

    2011-01-01

    Background Rapid and consistent genotyping is an important requirement for cultivar identification in many crop species. Among them grapevine cultivars have been the subject of multiple studies given the large number of synonyms and homonyms generated during many centuries of vegetative multiplication and exchange. Simple sequence repeat (SSR) markers have been preferred until now because of their high level of polymorphism, their codominant nature and their high profile repeatability. However, the rapid application of partial or complete genome sequencing approaches is identifying thousands of single nucleotide polymorphisms (SNP) that can be very useful for such purposes. Although SNP markers are bi-allelic, and therefore not as polymorphic as microsatellites, the high number of loci that can be multiplexed and the possibilities of automation as well as their highly repeatable results under any analytical procedure make them the future markers of choice for any type of genetic identification. Results We analyzed over 300 SNP in the genome of grapevine using a re-sequencing strategy in a selection of 11 genotypes. Among the identified polymorphisms, we selected 48 SNP spread across all grapevine chromosomes with allele frequencies balanced enough as to provide sufficient information content for genetic identification in grapevine allowing for good genotyping success rate. Marker stability was tested in repeated analyses of a selected group of cultivars obtained worldwide to demonstrate their usefulness in genetic identification. Conclusions We have selected a set of 48 stable SNP markers with a high discrimination power and a uniform genome distribution (2-3 markers/chromosome), which is proposed as a standard set for grapevine (Vitis vinifera L.) genotyping. Any previous problems derived from microsatellite allele confusion between labs or the need to run reference cultivars to identify allele sizes disappear using this type of marker. Furthermore, because SNP

  15. Kokkos Array

    Energy Science and Technology Software Center (ESTSC)

    2012-09-12

    The Kokkos Array library implements shared-memory array data structures and parallel task dispatch interfaces for data-parallel computational kernels that are performance-portable to multicore-CPU and manycore-accelerator (e.g., GPGPU) devices.

  16. Systolic arrays

    SciTech Connect

    Moore, W.R.; McCabe, A.P.H.; Vrquhart, R.B.

    1987-01-01

    Selected Contents of this book are: Efficient Systolic Arrays for the Solution of Toeplitz Systems, The Derivation and Utilization of Bit Level Systolic Array Architectures, an Efficient Systolic Array for Distance Computation Required in a Video-Codec Based Motion-Detection, On Realizations of Least-Squares Estimation and Kalman Filtering by Systolic Arrays, and Comparison of Systolic and SIMD Architectures for Computer Vision Computations.

  17. Nanocylinder arrays

    DOEpatents

    Tuominen, Mark; Schotter, Joerg; Thurn-Albrecht, Thomas; Russell, Thomas P.

    2007-03-13

    Pathways to rapid and reliable fabrication of nanocylinder arrays are provided. Simple methods are described for the production of well-ordered arrays of nanopores, nanowires, and other materials. This is accomplished by orienting copolymer films and removing a component from the film to produce nanopores, that in turn, can be filled with materials to produce the arrays. The resulting arrays can be used to produce nanoscale media, devices, and systems.

  18. Nanocylinder arrays

    DOEpatents

    Tuominen, Mark; Schotter, Joerg; Thurn-Albrecht, Thomas; Russell, Thomas P.

    2009-08-11

    Pathways to rapid and reliable fabrication of nanocylinder arrays are provided. Simple methods are described for the production of well-ordered arrays of nanopores, nanowires, and other materials. This is accomplished by orienting copolymer films and removing a component from the film to produce nanopores, that in turn, can be filled with materials to produce the arrays. The resulting arrays can be used to produce nanoscale media, devices, and systems.

  19. Rapid Diagnosis of Imprinting Disorders Involving Copy Number Variation and Uniparental Disomy Using Genome-Wide SNP Microarrays.

    PubMed

    Liu, Weiqiang; Zhang, Rui; Wei, Jun; Zhang, Huimin; Yu, Guojiu; Li, Zhihua; Chen, Min; Sun, Xiaofang

    2015-01-01

    Imprinting disorders, such as Beckwith-Wiedemann syndrome (BWS), Prader-Willi syndrome (PWS) and Angelman syndrome (AS), can be detected via methylation analysis, methylation-specific multiplex ligation-dependent probe amplification (MS-MLPA), or other methods. In this study, we applied single nucleotide polymorphism (SNP)-based chromosomal microarray analysis to detect copy number variations (CNVs) and uniparental disomy (UPD) events in patients with suspected imprinting disorders. Of 4 patients, 2 had a 5.25-Mb microdeletion in the 15q11.2q13.2 region, 1 had a 38.4-Mb mosaic UPD in the 11p15.4 region, and 1 had a 60-Mb detectable UPD between regions 14q13.2 and 14q32.13. Although the 14q32.2 region was classified as normal by SNP array for the 14q13 UPD patient, it turned out to be a heterodisomic UPD by short tandem repeat marker analysis. MS-MLPA analysis was performed to validate the variations. In conclusion, SNP-based microarray is an efficient alternative method for quickly and precisely diagnosing PWS, AS, BWS, and other imprinted gene-associated disorders when considering aberrations due to CNVs and most types of UPD. PMID:26184742

  20. Buffy coat specimens remain viable as a DNA source for highly multiplexed genome-wide genetic tests after long term storage

    PubMed Central

    2011-01-01

    Background Blood specimen collection at an early study visit is often included in observational studies or clinical trials for analysis of secondary outcome biomarkers. A common protocol is to store buffy coat specimens for future DNA isolation and these may remain in frozen storage for many years. It is uncertain if the DNA remains suitable for modern genome wide association (GWA) genotyping. Methods We isolated DNA from 120 Action to Control Cardiovascular Risk in Diabetes (ACCORD) clinical trial buffy coats sampling a range of storage times up to 9 years and other factors that could influence DNA yield. We performed TaqMan SNP and GWA genotyping to test whether the DNA retained integrity for high quality genetic analysis. Results We tested two QIAGEN automated protocols for DNA isolation, preferring the Compromised Blood Protocol despite similar yields. We isolated DNA from all 120 specimens (yield range 1.1-312 ug per 8.5 ml ACD tube of whole blood) with only 3/120 samples yielding < 10 ug DNA. Age of participant at blood draw was negatively associated with yield (mean change -2.1 ug/year). DNA quality was very good based on gel electrophoresis QC, TaqMan genotyping of 6 SNPs (genotyping no-call rate 1.1% in 702 genotypes), and excellent quality GWA genotyping data (maximum per sample genotype missing rate 0.64%). Conclusions When collected as a long term clinical trial or biobank specimen for DNA, buffy coats can be stored for up to 9 years in a -80degC frozen state and still produce high yields of DNA suitable for GWA analysis and other genetic testing. Trial Registration The Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial is registered with ClinicalTrials.gov, number NCT00000620. PMID:21663644

  1. Genetic mapping in grapevine using a SNP microarray: intensity values

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genotyping microarrays are widely used for genome wide association studies, but in high-diversity organisms, the quality of SNP calls can be diminished by genetic variation near the assayed nucleotide. To address this limitation in grapevine, we developed a simple heuristic that uses hybridization i...

  2. SNP marker diversity in common bean (Phaseolus vulgaris L.).

    PubMed

    Cortés, Andrés J; Chavarro, Martha C; Blair, Matthew W

    2011-09-01

    Single nucleotide polymorphism (SNP) markers have become a genetic technology of choice because of their automation and high precision of allele calls. In this study, our goal was to develop 94 SNPs and test them across well-chosen common bean (Phaseolus vulgaris L.) germplasm. We validated and accessed SNP diversity at 84 gene-based and 10 non-genic loci using KASPar technology in a panel of 70 genotypes that have been used as parents of mapping populations and have been previously evaluated for SSRs. SNPs exhibited high levels of genetic diversity, an excess of middle frequency polymorphism, and a within-genepool mismatch distribution as expected for populations affected by sudden demographic expansions after domestication bottlenecks. This set of markers was useful for distinguishing Andean and Mesoamerican genotypes but less useful for distinguishing within each gene pool. In summary, slightly greater polymorphism and race structure was found within the Andean gene pool than within the Mesoamerican gene pool but polymorphism rate between genotypes was consistent with genepool and race identity. Our survey results represent a baseline for the choice of SNP markers for future applications because gene-associated SNPs could themselves be causative SNPs for traits. Finally, we discuss that the ideal genetic marker combination with which to carry out diversity, mapping and association studies in common bean should consider a mix of both SNP and SSR markers. PMID:21785951

  3. SNP Discovery through Next-Generation Sequencing and Its Applications

    PubMed Central

    Kumar, Santosh; Banks, Travis W.; Cloutier, Sylvie

    2012-01-01

    The decreasing cost along with rapid progress in next-generation sequencing and related bioinformatics computing resources has facilitated large-scale discovery of SNPs in various model and nonmodel plant species. Large numbers and genome-wide availability of SNPs make them the marker of choice in partially or completely sequenced genomes. Although excellent reviews have been published on next-generation sequencing, its associated bioinformatics challenges, and the applications of SNPs in genetic studies, a comprehensive review connecting these three intertwined research areas is needed. This paper touches upon various aspects of SNP discovery, highlighting key points in availability and selection of appropriate sequencing platforms, bioinformatics pipelines, SNP filtering criteria, and applications of SNPs in genetic analyses. The use of next-generation sequencing methodologies in many non-model crops leading to discovery and implementation of SNPs in various genetic studies is discussed. Development and improvement of bioinformatics software that are open source and freely available have accelerated the SNP discovery while reducing the associated cost. Key considerations for SNP filtering and associated pipelines are discussed in specific topics. A list of commonly used software and their sources is compiled for easy access and reference. PMID:23227038

  4. Do you really know where this SNP goes?

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The release of build 10.2 of the swine genome was a marked improvement over previous builds and has proven extremely useful. However, as most know, there are regions of the genome that this particular build does not accurately represent. For instance, nearly 25% of the 62,162 SNP on the Illumina Por...

  5. Association of Atherosclerotic Peripheral Arterial Disease with Adiponectin Genes SNP+45 and SNP+276: A Case-Control Study

    PubMed Central

    Gherman, Claudia D.; Bolboacă, Sorana D.

    2013-01-01

    Objectives. We hypothesized that adiponectin gene SNP+45 (rs2241766) and SNP+276 (rs1501299) would be associated with atherosclerotic peripheral arterial disease (PAD). Furthermore, the association between circulating adiponectin levels, fetuin-A, and tumoral necrosis factor-alpha (TNF-α) in patients with atherosclerotic peripheral arterial disease was investigated. Method. Several blood parameters (such as adiponectin, fetuin-A, and TNF-α) were measured in 346 patients, 226 with atherosclerotic peripheral arterial disease (PAD) and 120 without symptomatic PAD (non-PAD). Two common SNPs of the ADIPOQ gene represented by +45T/G 2 and +276G/T were also investigated. Results. Adiponectin concentrations showed lower circulating levels in the PAD patients compared to non-PAD patients (P < 0.001). Decreasing adiponectin concentration was associated with increasing serum levels of fetuin-A in the PAD patients. None of the investigated adiponectin SNPs proved to be associated with the subjects' susceptibility to PAD (P > 0.05). Conclusion. The results of our study demonstrated that neither adiponectin SNP+45 nor SNP+276 is associated with the risk of PAD. PMID:23819115

  6. Large-Scale SNP Discovery through RNA Sequencing and SNP Genotyping by Targeted Enrichment Sequencing in Cassava (Manihot esculenta Crantz)

    PubMed Central

    Pootakham, Wirulda; Shearman, Jeremy R.; Ruang-areerate, Panthita; Sonthirod, Chutima; Sangsrakru, Duangjai; Jomchai, Nukoon; Yoocha, Thippawan; Triwitayakorn, Kanokporn; Tragoonrung, Somvong; Tangphatsornruang, Sithichoke

    2014-01-01

    Cassava (Manihot esculenta Crantz) is one of the most important crop species being the main source of dietary energy in several countries. Marker-assisted selection has become an essential tool in plant breeding. Single nucleotide polymorphism (SNP) discovery via transcriptome sequencing is an attractive strategy for genome complexity reduction in organisms with large genomes. We sequenced the transcriptome of 16 cassava accessions using the Illumina HiSeq platform and identified 675,559 EST-derived SNP markers. A subset of those markers was subsequently genotyped by capture-based targeted enrichment sequencing in 100 F1 progeny segregating for starch viscosity phenotypes. A total of 2,110 non-redundant SNP markers were used to construct a genetic map. This map encompasses 1,785 cM and consists of 19 linkage groups. A major quantitative trait locus (QTL) controlling starch pasting properties was identified and shown to coincide with the QTL previously reported for this trait. With a high-density SNP-based linkage map presented here, we also uncovered a novel QTL associated with starch pasting time on LG 10. PMID:25551642

  7. Multiple SNP Set Analysis for Genome-Wide Association Studies Through Bayesian Latent Variable Selection.

    PubMed

    Lu, Zhao-Hua; Zhu, Hongtu; Knickmeyer, Rebecca C; Sullivan, Patrick F; Williams, Stephanie N; Zou, Fei

    2015-12-01

    The power of genome-wide association studies (GWAS) for mapping complex traits with single-SNP analysis (where SNP is single-nucleotide polymorphism) may be undermined by modest SNP effect sizes, unobserved causal SNPs, correlation among adjacent SNPs, and SNP-SNP interactions. Alternative approaches for testing the association between a single SNP set and individual phenotypes have been shown to be promising for improving the power of GWAS. We propose a Bayesian latent variable selection (BLVS) method to simultaneously model the joint association mapping between a large number of SNP sets and complex traits. Compared with single SNP set analysis, such joint association mapping not only accounts for the correlation among SNP sets but also is capable of detecting causal SNP sets that are marginally uncorrelated with traits. The spike-and-slab prior assigned to the effects of SNP sets can greatly reduce the dimension of effective SNP sets, while speeding up computation. An efficient Markov chain Monte Carlo algorithm is developed. Simulations demonstrate that BLVS outperforms several competing variable selection methods in some important scenarios. PMID:26515609

  8. High-throughput SNP genotyping for breeding applications in rice using the BeadXpress platform

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Multiplexed single nucleotide polymorphism (SNP) markers have the potential to increase the speed and cost-effectiveness of genotyping, provided that an optimal SNP density is used for each application. To test the efficiency of multiplexed SNP genotyping for diversity, mapping and breeding applicat...

  9. Development of Single Nucleotide Polymorphism (SNP) Markers for Use in Commercial Maize (Zea Mays L.) Germplasm

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The development of single nucleotide polymorphism (SNP) markers in maize offer the opportunity to utilize DNA markers in many new areas of population genetics, gene discovery, plant breeding, and germplasm identification. However, the steps from sequencing and SNP discovery to SNP marker design and ...

  10. Construction and Annotation of a High Density SNP Linkage Map of the Atlantic Salmon (Salmo salar) Genome

    PubMed Central

    Tsai, Hsin Y.; Robledo, Diego; Lowe, Natalie R.; Bekaert, Michael; Taggart, John B.; Bron, James E.; Houston, Ross D.

    2016-01-01

    High density linkage maps are useful tools for fine-scale mapping of quantitative trait loci, and characterization of the recombination landscape of a species’ genome. Genomic resources for Atlantic salmon (Salmo salar) include a well-assembled reference genome, and high density single nucleotide polymorphism (SNP) arrays. Our aim was to create a high density linkage map, and to align it with the reference genome assembly. Over 96,000 SNPs were mapped and ordered on the 29 salmon linkage groups using a pedigreed population comprising 622 fish from 60 nuclear families, all genotyped with the ‘ssalar01’ high density SNP array. The number of SNPs per group showed a high positive correlation with physical chromosome length (r = 0.95). While the order of markers on the genetic and physical maps was generally consistent, areas of discrepancy were identified. Approximately 6.5% of the previously unmapped reference genome sequence was assigned to chromosomes using the linkage map. Male recombination rate was lower than females across the vast majority of the genome, but with a notable peak in subtelomeric regions. Finally, using RNA-Seq data to annotate the reference genome, the mapped SNPs were categorized according to their predicted function, including annotation of ∼2500 putative nonsynonymous variants. The highest density SNP linkage map for any salmonid species has been created, annotated, and integrated with the Atlantic salmon reference genome assembly. This map highlights the marked heterochiasmy of salmon, and provides a useful resource for salmonid genetics and genomics research. PMID:27194803

  11. Construction and Annotation of a High Density SNP Linkage Map of the Atlantic Salmon (Salmo salar) Genome.

    PubMed

    Tsai, Hsin Y; Robledo, Diego; Lowe, Natalie R; Bekaert, Michael; Taggart, John B; Bron, James E; Houston, Ross D

    2016-01-01

    High density linkage maps are useful tools for fine-scale mapping of quantitative trait loci, and characterization of the recombination landscape of a species' genome. Genomic resources for Atlantic salmon (Salmo salar) include a well-assembled reference genome, and high density single nucleotide polymorphism (SNP) arrays. Our aim was to create a high density linkage map, and to align it with the reference genome assembly. Over 96,000 SNPs were mapped and ordered on the 29 salmon linkage groups using a pedigreed population comprising 622 fish from 60 nuclear families, all genotyped with the 'ssalar01' high density SNP array. The number of SNPs per group showed a high positive correlation with physical chromosome length (r = 0.95). While the order of markers on the genetic and physical maps was generally consistent, areas of discrepancy were identified. Approximately 6.5% of the previously unmapped reference genome sequence was assigned to chromosomes using the linkage map. Male recombination rate was lower than females across the vast majority of the genome, but with a notable peak in subtelomeric regions. Finally, using RNA-Seq data to annotate the reference genome, the mapped SNPs were categorized according to their predicted function, including annotation of ∼2500 putative nonsynonymous variants. The highest density SNP linkage map for any salmonid species has been created, annotated, and integrated with the Atlantic salmon reference genome assembly. This map highlights the marked heterochiasmy of salmon, and provides a useful resource for salmonid genetics and genomics research. PMID:27194803

  12. Influence of MDM2 SNP309 and SNP285 status on the risk of cancer in the breast, prostate, lung and colon.

    PubMed

    Gansmo, Liv B; Knappskog, Stian; Romundstad, Pål; Hveem, Kristian; Vatten, Lars; Lønning, Per E

    2015-07-01

    MDM2 is a key regulator of the p53 tumor suppressor protein and is overexpressed in many human cancers. Two single nucleotide polymorphisms (SNPs) located in the MDM2 intronic promoter (P2) have been found to exert biological function. The G-allele of SNP309T>G; rs2279744 increases MDM2 transcription and has been linked to increased cancer risk. In contrast, the less frequent SNP285G>C; rs117039649, which is in complete linkage disequilibrium with SNP309 (generating a SNP285C/309G variant haplotype), has been related to reduced MDM2 transcription and to reduced risk of breast, endometrial and ovarian cancer. In this large population-based case-control study, we genotyped SNP309 and SNP285 in 10,830 individuals, including cases with cancer of the breast (n=1,717), colon (n=1,532), lung (n=1,331) and prostate (n=2,501), as well as 3,749 non-cancer controls. We found a slightly reduced risk for lung cancer among individuals harboring the SNP309TG/GG genotypes compared to the SNP309TT genotype (OR= 0.86; CI = 0.67-0.98), but this association was restricted to women (OR = 0.77; CI = 0.63-0.95) and was not present among men (OR = 0.91; CI = 0.77-1.08). Consistent with previous findings, we found a reduced risk for breast cancer among individuals carrying the SNP285GC/309GG genotype versus the SNP285GG/309GG genotype (OR = 0.55; CI = 0.33-0.93). In conclusion, our data support the hypothesis that the effects of both SNP285 and SNP309 status are tissue dependent. PMID:25431177

  13. SNP analysis of follistatin gene associated with polycystic ovarian syndrome

    PubMed Central

    Panneerselvam, Palanisamy; Sivakumari, Kanakarajan; Jayaprakash, Ponmani; Srikanth, Ramanathan

    2010-01-01

    Follistatin has been reported as a candidate gene for polycystic ovarian syndrome (PCOS) based on linkage and association studies. In this study, investigation of polymorphisms in the FST gene was done to determine if genetic variation is associated with susceptibility to PCOS. The nucleotide sequence of human follistatin and the protein sequence of human follistatin were retrieved from the NCBI database using Entrez. The follistatin protein of human was retrieved from the Swiss-Prot database. There are 344 amino acids and the molecular weight is 38,007 Da. The ProtParam analysis shows that the isoelectric point is 5.53 and the aliphatic index is 61.25. The hydropathicity is −0.490. The domains in FST protein are as follows: Pfam-B 5005 domain from 1 to 92; EGF-like subdomain from 93 to 116; Kazal 1 domain, occurred in three places, namely, 118–164, 192–239, and 270–316. There are 31 single-nucleotide polymorphisms (SNPs) for this gene. Some are nonsynonymous, some occur in the intron region, and some in an untranslated region. Two nonsynonymous SNPs, namely, rs11745088 and rs1127760, were taken for analysis. In the SNP rs11745088, the change is E152Q. Likewise, in rs1127760, the change is C239S. SIFT (Sorting Intolerant from Tolerant) showed positions of amino acids and the single letter code of amino acids that can be tolerated or deleterious for each position. There were six SNP results and each result had links to it. The dbSNP id, primary database id, and the type of mutation whether silent and if occurring in coding region are given as phenotype alterations. The FASTA format of protein was given to the nsSNP Analyzer tool, and the variation E152Q and C239S were given as inputs in the SNP data field. E152Q change was neutral and C239S causes disease. Using PANTHER for evolutionary analysis of coding SNPs, the protein sequence was given as input and analyzed for the E152Q and C239S SNPs for deleterious effect on protein function. The genetic association

  14. SNP analysis of follistatin gene associated with polycystic ovarian syndrome.

    PubMed

    Panneerselvam, Palanisamy; Sivakumari, Kanakarajan; Jayaprakash, Ponmani; Srikanth, Ramanathan

    2010-01-01

    Follistatin has been reported as a candidate gene for polycystic ovarian syndrome (PCOS) based on linkage and association studies. In this study, investigation of polymorphisms in the FST gene was done to determine if genetic variation is associated with susceptibility to PCOS. The nucleotide sequence of human follistatin and the protein sequence of human follistatin were retrieved from the NCBI database using Entrez. The follistatin protein of human was retrieved from the Swiss-Prot database. There are 344 amino acids and the molecular weight is 38,007 Da. The ProtParam analysis shows that the isoelectric point is 5.53 and the aliphatic index is 61.25. The hydropathicity is -0.490. The domains in FST protein are as follows: Pfam-B 5005 domain from 1 to 92; EGF-like subdomain from 93 to 116; Kazal 1 domain, occurred in three places, namely, 118-164, 192-239, and 270-316. There are 31 single-nucleotide polymorphisms (SNPs) for this gene. Some are nonsynonymous, some occur in the intron region, and some in an untranslated region. Two nonsynonymous SNPs, namely, rs11745088 and rs1127760, were taken for analysis. In the SNP rs11745088, the change is E152Q. Likewise, in rs1127760, the change is C239S. SIFT (Sorting Intolerant from Tolerant) showed positions of amino acids and the single letter code of amino acids that can be tolerated or deleterious for each position. There were six SNP results and each result had links to it. The dbSNP id, primary database id, and the type of mutation whether silent and if occurring in coding region are given as phenotype alterations. The FASTA format of protein was given to the nsSNP Analyzer tool, and the variation E152Q and C239S were given as inputs in the SNP data field. E152Q change was neutral and C239S causes disease. Using PANTHER for evolutionary analysis of coding SNPs, the protein sequence was given as input and analyzed for the E152Q and C239S SNPs for deleterious effect on protein function. The genetic association

  15. Investigating single nucleotide polymorphism (SNP) density in the human genome and its implications for molecular evolution.

    PubMed

    Zhao, Zhongming; Fu, Yun-Xin; Hewett-Emmett, David; Boerwinkle, Eric

    2003-07-17

    We investigated the single nucleotide polymorphism (SNP) density across the human genome and in different genic categories using two SNP databases: Celera's CgsSNP, which includes SNPs identified by comparing genomic sequences, and Celera's RefSNP, which includes SNPs from a variety of sources and is biased toward disease-associated genes. Based on CgsSNP, the average numbers of SNPs per 10 kb was 8.33, 8.44, and 8.09 in the human genome, in intergenic regions, and in genic regions, respectively. In genic regions, the SNP density in intronic, exonic and adjoining untranslated regions was 8.21, 5.28, and 7.51 SNPs per 10 kb, respectively. The pattern of SNP density based on RefSNP was different from that based on CgsSNP, emphasizing its utility for genotype-phenotype association studies but not for most population genetic studies. The number of SNPs per chromosome was correlated with chromosome length, but the density of SNPs estimated by CgsSNP was not significantly correlated with the GC content of the chromosome. Based on CgsSNP, the ratio of nonsense to missense mutations (0.027), the ratio of missense to silent mutations (1.15), and the ratio of non-synonymous to synonymous mutations (1.18) was less than half of that expected in a human protein coding sequence under the neutral mutation theory, reflecting a role for natural selection, especially purifying selection. PMID:12909357

  16. Multiple SNP-sets Analysis for Genome-wide Association Studies through Bayesian Latent Variable Selection

    PubMed Central

    Lu, Zhaohua; Zhu, Hongtu; Knickmeyer, Rebecca C; Sullivan, Patrick F.; Stephanie, Williams N.; Zou, Fei

    2015-01-01

    The power of genome-wide association studies (GWAS) for mapping complex traits with single SNP analysis may be undermined by modest SNP effect sizes, unobserved causal SNPs, correlation among adjacent SNPs, and SNP-SNP interactions. Alternative approaches for testing the association between a single SNP-set and individual phenotypes have been shown to be promising for improving the power of GWAS. We propose a Bayesian latent variable selection (BLVS) method to simultaneously model the joint association mapping between a large number of SNP-sets and complex traits. Compared to single SNP-set analysis, such joint association mapping not only accounts for the correlation among SNP-sets, but also is capable of detecting causal SNP-sets that are marginally uncorrelated with traits. The spike-slab prior assigned to the effects of SNP-sets can greatly reduce the dimension of effective SNP-sets, while speeding up computation. An efficient MCMC algorithm is developed. Simulations demonstrate that BLVS outperforms several competing variable selection methods in some important scenarios. PMID:26515609

  17. Introgression browser: high-throughput whole-genome SNP visualization.

    PubMed

    Aflitos, Saulo Alves; Sanchez-Perez, Gabino; de Ridder, Dick; Fransz, Paul; Schranz, Michael E; de Jong, Hans; Peters, Sander A

    2015-04-01

    Breeding by introgressive hybridization is a pivotal strategy to broaden the genetic basis of crops. Usually, the desired traits are monitored in consecutive crossing generations by marker-assisted selection, but their analyses fail in chromosome regions where crossover recombinants are rare or not viable. Here, we present the Introgression Browser (iBrowser), a bioinformatics tool aimed at visualizing introgressions at nucleotide or SNP (Single Nucleotide Polymorphisms) accuracy. The software selects homozygous SNPs from Variant Call Format (VCF) information and filters out heterozygous SNPs, multi-nucleotide polymorphisms (MNPs) and insertion-deletions (InDels). For data analysis iBrowser makes use of sliding windows, but if needed it can generate any desired fragmentation pattern through General Feature Format (GFF) information. In an example of tomato (Solanum lycopersicum) accessions we visualize SNP patterns and elucidate both position and boundaries of the introgressions. We also show that our tool is capable of identifying alien DNA in a panel of the closely related S. pimpinellifolium by examining phylogenetic relationships of the introgressed segments in tomato. In a third example, we demonstrate the power of the iBrowser in a panel of 597 Arabidopsis accessions, detecting the boundaries of a SNP-free region around a polymorphic 1.17 Mbp inverted segment on the short arm of chromosome 4. The architecture and functionality of iBrowser makes the software appropriate for a broad set of analyses including SNP mining, genome structure analysis, and pedigree analysis. Its functionality, together with the capability to process large data sets and efficient visualization of sequence variation, makes iBrowser a valuable breeding tool. PMID:25704554

  18. Detection of homologous horizontal gene transfer in SNP data

    Energy Science and Technology Software Center (ESTSC)

    2012-07-23

    We study the detection of mutations, sequencing errors, and homologous horizontal gene transfers (HGT) in a set of closely related microbial genomes. We base the model on single nucleotide polymorphisms (SNP's) and break the genomes into blocks to handle the rearrangement problem. Then we apply a synamic programming algorithm to model whether changes within each block are likely a result of mutations, sequencing errors, or HGT.

  19. Robust Demographic Inference from Genomic and SNP Data

    PubMed Central

    Excoffier, Laurent; Dupanloup, Isabelle; Huerta-Sánchez, Emilia; Sousa, Vitor C.; Foll, Matthieu

    2013-01-01

    We introduce a flexible and robust simulation-based framework to infer demographic parameters from the site frequency spectrum (SFS) computed on large genomic datasets. We show that our composite-likelihood approach allows one to study evolutionary models of arbitrary complexity, which cannot be tackled by other current likelihood-based methods. For simple scenarios, our approach compares favorably in terms of accuracy and speed with , the current reference in the field, while showing better convergence properties for complex models. We first apply our methodology to non-coding genomic SNP data from four human populations. To infer their demographic history, we compare neutral evolutionary models of increasing complexity, including unsampled populations. We further show the versatility of our framework by extending it to the inference of demographic parameters from SNP chips with known ascertainment, such as that recently released by Affymetrix to study human origins. Whereas previous ways of handling ascertained SNPs were either restricted to a single population or only allowed the inference of divergence time between a pair of populations, our framework can correctly infer parameters of more complex models including the divergence of several populations, bottlenecks and migration. We apply this approach to the reconstruction of African demography using two distinct ascertained human SNP panels studied under two evolutionary models. The two SNP panels lead to globally very similar estimates and confidence intervals, and suggest an ancient divergence (>110 Ky) between Yoruba and San populations. Our methodology appears well suited to the study of complex scenarios from large genomic data sets. PMID:24204310

  20. PanSNPdb: The Pan-Asian SNP Genotyping Database

    PubMed Central

    Ngamphiw, Chumpol; Assawamakin, Anunchai; Xu, Shuhua; Shaw, Philip J.; Yang, Jin Ok; Ghang, Ho; Bhak, Jong; Liu, Edison; Tongsima, Sissades

    2011-01-01

    The HUGO Pan-Asian SNP consortium conducted the largest survey to date of human genetic diversity among Asians by sampling 1,719 unrelated individuals among 71 populations from China, India, Indonesia, Japan, Malaysia, the Philippines, Singapore, South Korea, Taiwan, and Thailand. We have constructed a database (PanSNPdb), which contains these data and various new analyses of them. PanSNPdb is a research resource in the analysis of the population structure of Asian peoples, including linkage disequilibrium patterns, haplotype distributions, and copy number variations. Furthermore, PanSNPdb provides an interactive comparison with other SNP and CNV databases, including HapMap3, JSNP, dbSNP and DGV and thus provides a comprehensive resource of human genetic diversity. The information is accessible via a widely accepted graphical interface used in many genetic variation databases. Unrestricted access to PanSNPdb and any associated files is available at: http://www4a.biotec.or.th/PASNP. PMID:21731755

  1. How far from the SNP may the causative genes be?

    PubMed

    Brodie, Aharon; Azaria, Johnathan Roy; Ofran, Yanay

    2016-07-27

    While GWAS identify many disease-associated SNPs, using them to decipher disease mechanisms is hindered by the difficulty in mapping SNPs to genes. Most SNPs are in non-coding regions and it is often hard to identify the genes they implicate. To explore how far the SNP may be from the affected genes we used a pathway-based approach. We found that affected genes are often up to 2 Mbps away from the associated SNP, and are not necessarily the closest genes to the SNP. Existing approaches for mapping SNPs to genes leave many SNPs unmapped to genes and reveal only 86 significant phenotype-pathway associations for all known GWAS hits combined. Using the pathway-based approach we propose here allows mapping of virtually all SNPs to genes and reveals 435 statistically significant phenotype-pathway associations. In search for mechanisms that may explain the relationships between SNPs and distant genes, we found that SNPs that are mapped to distant genes have significantly more large insertions/deletions around them than other SNPs, suggesting that these SNPs may sometimes be markers for large insertions/deletions that may affect large genomic regions. PMID:27269582

  2. Development of a forensic identity SNP panel for Indonesia.

    PubMed

    Augustinus, Daniel; Gahan, Michelle E; McNevin, Dennis

    2015-07-01

    Genetic markers included in forensic identity panels must exhibit Hardy-Weinberg and linkage equilibrium (HWE and LE). "Universal" panels designed for global use can fail these tests in regional jurisdictions exhibiting high levels of genetic differentiation such as the Indonesian archipelago. This is especially the case where a single DNA database is required for allele frequency estimates to calculate random match probabilities (RMPs) and associated likelihood ratios (LRs). A panel of 65 single nucleotide polymorphisms (SNPs) and a reduced set of 52 SNPs have been selected from 15 Indonesian subpopulations in the HUGO Pan Asian SNP database using a SNP selection strategy that could be applied to any panel of forensic identity markers. The strategy consists of four screening steps: (1) application of a G test for HWE; (2) ranking for high heterozygosity; (3) selection for LE; and (4) selection for low inbreeding depression. SNPs in our Indonesian panel perform well in comparison to some other universal SNP and short tandem repeat (STR) panels as measured by Fisher's exact test for HWE and LE and Wright's F statistics. PMID:25104323

  3. Multiplex Detection and SNP Genotyping in a Single Fluorescence Channel

    PubMed Central

    Fu, Guoliang; Miles, Andrea; Alphey, Luke

    2012-01-01

    Probe-based PCR is widely used for SNP (single nucleotide polymorphism) genotyping and pathogen nucleic acid detection due to its simplicity, sensitivity and cost-effectiveness. However, the multiplex capability of hydrolysis probe-based PCR is normally limited to one target (pathogen or allele) per fluorescence channel. Current fluorescence PCR machines typically have 4–6 channels. We present a strategy permitting the multiplex detection of multiple targets in a single detection channel. The technique is named Multiplex Probe Amplification (MPA). Polymorphisms of the CYP2C9 gene (cytochrome P450, family 2, subfamily C, polypeptide 9, CYP2C9*2) and human papillomavirus sequences HPV16, 18, 31, 52 and 59 were chosen as model targets for testing MPA. The allele status of the CYP2C9*2 determined by MPA was entirely concordant with the reference TaqMan® SNP Genotyping Assays. The four HPV strain sequences could be independently detected in a single fluorescence detection channel. The results validate the multiplex capacity, the simplicity and accuracy of MPA for SNP genotyping and multiplex detection using different probes labeled with the same fluorophore. The technique offers a new way to multiplex in a single detection channel of a closed-tube PCR. PMID:22272339

  4. Multiplex detection and SNP genotyping in a single fluorescence channel.

    PubMed

    Fu, Guoliang; Miles, Andrea; Alphey, Luke

    2012-01-01

    Probe-based PCR is widely used for SNP (single nucleotide polymorphism) genotyping and pathogen nucleic acid detection due to its simplicity, sensitivity and cost-effectiveness. However, the multiplex capability of hydrolysis probe-based PCR is normally limited to one target (pathogen or allele) per fluorescence channel. Current fluorescence PCR machines typically have 4-6 channels. We present a strategy permitting the multiplex detection of multiple targets in a single detection channel. The technique is named Multiplex Probe Amplification (MPA). Polymorphisms of the CYP2C9 gene (cytochrome P450, family 2, subfamily C, polypeptide 9, CYP2C9*2) and human papillomavirus sequences HPV16, 18, 31, 52 and 59 were chosen as model targets for testing MPA. The allele status of the CYP2C9*2 determined by MPA was entirely concordant with the reference TaqMan® SNP Genotyping Assays. The four HPV strain sequences could be independently detected in a single fluorescence detection channel. The results validate the multiplex capacity, the simplicity and accuracy of MPA for SNP genotyping and multiplex detection using different probes labeled with the same fluorophore. The technique offers a new way to multiplex in a single detection channel of a closed-tube PCR. PMID:22272339

  5. Population distribution and ancestry of the cancer protective MDM2 SNP285 (rs117039649).

    PubMed

    Knappskog, Stian; Gansmo, Liv B; Dibirova, Khadizha; Metspalu, Andres; Cybulski, Cezary; Peterlongo, Paolo; Aaltonen, Lauri; Vatten, Lars; Romundstad, Pål; Hveem, Kristian; Devilee, Peter; Evans, Gareth D; Lin, Dongxin; Van Camp, Guy; Manolopoulos, Vangelis G; Osorio, Ana; Milani, Lili; Ozcelik, Tayfun; Zalloua, Pierre; Mouzaya, Francis; Bliznetz, Elena; Balanovska, Elena; Pocheshkova, Elvira; Kučinskas, Vaidutis; Atramentova, Lubov; Nymadawa, Pagbajabyn; Titov, Konstantin; Lavryashina, Maria; Yusupov, Yuldash; Bogdanova, Natalia; Koshel, Sergey; Zamora, Jorge; Wedge, David C; Charlesworth, Deborah; Dörk, Thilo; Balanovsky, Oleg; Lønning, Per E

    2014-09-30

    The MDM2 promoter SNP285C is located on the SNP309G allele. While SNP309G enhances Sp1 transcription factor binding and MDM2 transcription, SNP285C antagonizes Sp1 binding and reduces the risk of breast-, ovary- and endometrial cancer. Assessing SNP285 and 309 genotypes across 25 different ethnic populations (>10.000 individuals), the incidence of SNP285C was 6-8% across European populations except for Finns (1.2%) and Saami (0.3%). The incidence decreased towards the Middle-East and Eastern Russia, and SNP285C was absent among Han Chinese, Mongolians and African Americans. Interhaplotype variation analyses estimated SNP285C to have originated about 14,700 years ago (95% CI: 8,300 - 33,300). Both this estimate and the geographical distribution suggest SNP285C to have arisen after the separation between Caucasians and modern day East Asians (17,000 - 40,000 years ago). We observed a strong inverse correlation (r = -0.805; p < 0.001) between the percentage of SNP309G alleles harboring SNP285C and the MAF for SNP309G itself across different populations suggesting selection and environmental adaptation with respect to MDM2 expression in recent human evolution. In conclusion, we found SNP285C to be a pan-Caucasian variant. Ethnic variation regarding distribution of SNP285C needs to be taken into account when assessing the impact of MDM2 SNPs on cancer risk. PMID:25327560

  6. Population distribution and ancestry of the cancer protective MDM2 SNP285 (rs117039649)

    PubMed Central

    Knappskog, Stian; Gansmo, Liv B.; Dibirova, Khadizha; Metspalu, Andres; Cybulski, Cezary; Peterlongo, Paolo; Aaltonen, Lauri; Vatten, Lars; Romundstad, Pål; Hveem, Kristian; Devilee, Peter; Evans, Gareth D.; Lin, Dongxin; Camp, Guy Van; Manolopoulos, Vangelis G.; Osorio, Ana; Milani, Lili; Ozcelik, Tayfun; Zalloua, Pierre; Mouzaya, Francis; Bliznetz, Elena; Balanovska, Elena; Pocheshkova, Elvira; Kučinskas, Vaidutis; Atramentova, Lubov; Nymadawa, Pagbajabyn; Titov, Konstantin; Lavryashina, Maria; Yusupov, Yuldash; Bogdanova, Natalia; Koshel, Sergey; Zamora, Jorge; Wedge, David C.; Charlesworth, Deborah; Dörk, Thilo; Balanovsky, Oleg; Lønning, Per E.

    2014-01-01

    The MDM2 promoter SNP285C is located on the SNP309G allele. While SNP309G enhances Sp1 transcription factor binding and MDM2 transcription, SNP285C antagonizes Sp1 binding and reduces the risk of breast-, ovary- and endometrial cancer. Assessing SNP285 and 309 genotypes across 25 different ethnic populations (>10.000 individuals), the incidence of SNP285C was 6-8% across European populations except for Finns (1.2%) and Saami (0.3%). The incidence decreased towards the Middle-East and Eastern Russia, and SNP285C was absent among Han Chinese, Mongolians and African Americans. Interhaplotype variation analyses estimated SNP285C to have originated about 14,700 years ago (95% CI: 8,300 – 33,300). Both this estimate and the geographical distribution suggest SNP285C to have arisen after the separation between Caucasians and modern day East Asians (17,000 - 40,000 years ago). We observed a strong inverse correlation (r = -0.805; p < 0.001) between the percentage of SNP309G alleles harboring SNP285C and the MAF for SNP309G itself across different populations suggesting selection and environmental adaptation with respect to MDM2 expression in recent human evolution. In conclusion, we found SNP285C to be a pan-Caucasian variant. Ethnic variation regarding distribution of SNP285C needs to be taken into account when assessing the impact of MDM2 SNPs on cancer risk. PMID:25327560

  7. Genome-wide inbreeding estimation within Lebanese communities using SNP arrays.

    PubMed

    Jalkh, Nadine; Sahbatou, Mourad; Chouery, Eliane; Megarbane, André; Leutenegger, Anne-Louise; Serre, Jean-Louis

    2015-10-01

    Consanguineous marriages have been widely practiced in several global communities with varying rates depending on religion, culture, and geography. In consanguineous marriages, parents pass to their children autozygous segments known as homozygous by descent segments. In this study, single-nucleotide polymorphisms were analyzed in 165 unrelated Lebanese people from Greek Orthodox, Maronite, Shiite and Sunni communities. Runs of homozygosity, total inbreeding levels, remote consanguinity, and population admixture and structure were estimated. The inbreeding coefficient value was estimated to be 1.61% in offspring of unrelated parents over three generations and 8.33% in offspring of first cousins. From these values, remote consanguinity values, resulting from genetic drift or recurrent consanguineous unions, were estimated in offspring of unrelated and first-cousin parents to be 0.61 and 1.2%, respectively. This remote consanguinity value suggests that for any unrelated marriages in Lebanon, the mates could be related as third cousins or as second cousins once removed. Under the assumption that 25% of marriages occur between first cousins, the mean inbreeding value of 2.3% may explain the increased incidence of recessive disease in offspring. Our analysis reveals a common ancestral population in the four Lebanese communities we studied. PMID:25424710

  8. A patient with constitutional ring 1 chromosome characterized by SNP array CGH.

    PubMed

    Saliganan, Sheila; Lee, Joanna; Wei, Sainan

    2016-04-01

    We present a male patient with constitutional ring 1 chromosome and subsequent 6 Mb deletion at 1q43q44. The patient displays overlapping clinical features with reported patients with ring 1 chromosome and 1q43q44 microdeletion syndrome. To our knowledge, this is the first patient with ring 1 chromosome characterized by comparative genomic hybridization. PMID:27099748

  9. Vitis phylogenomics: hybridization intensities from a SNP array outperform genotype calls

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Understanding relationships among species is a fundamental goal of evolutionary biology. Single nucleotide polymorphisms (SNPs) identified through next generation sequencing and related technologies enable phylogeny reconstruction by providing unprecedented numbers of characters for analysis. One ...

  10. Correlation Analysis between SNP and Expression Arrays in Gliomas Identify Potentially Relevant Targets Genes1

    PubMed Central

    Kotliarov, Yuri; Kotliarova, Svetlana; Charong, Nurdina; Li, Aiguo; Walling, Jennifer; Aquilanti, Elisa; Ahn, Susie; Steed, Mary Ellen; Su, Qin; Center, Angela; Zenklusen, Jean C; Fine, Howard A.

    2008-01-01

    Primary brain tumors are a major cause of cancer mortality in the United States. Therapy for gliomas, the most common type of primary brain tumors, remains suboptimal. The development of improved therapeutics will require greater knowledge of the biology of gliomas at both the genomic and transcriptional levels. We have previously reported whole genome profiling of chromosome copy number alterations (CNA) in gliomas, and now present our findings on how those changes may affect transcription of genes that may be involved in tumor induction and progression. By calculating correlation values of mRNA expression vs. DNA copy number average in a moving window around a given RNA probeset, biologically relevant information can be gained that is obscured by the analysis of a single data type. Correlation coefficients ranged from −0.6 to 0.7; highly significant when compared to previously studies. Most correlated genes are located on chromosomes 1, 7, 9, 10, 13, 14, 19, 20 and 22, chromosomes known to have genomic alterations in gliomas. Additionally, we were able to identify CNAs whose gene expression correlation suggests possible epigenetic regulation. This analysis revealed a number of interesting candidates such as CXCL12, PTER, LRRN6C, among others. The results have been verified using real-time PCR and methylation sequencing assays. These data will further help differentiate genes involved in the induction and/or maintenance of the tumorigenic process from those that are mere passenger mutations, thereby enriching for a population of potentially new therapeutic molecular targets. PMID:19190341

  11. Review of the initial validation and characterization of a 3K chicken SNP array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The year 2004 was a historic one for biologists and especially the chicken research community as the first draft of the chicken genome was published (International Chicken Genome Sequencing Consortium, 2004). The 6.6X coverage of a UCD001 female Red Jungle Fowl (RJF) genome was the first complete d...

  12. Design of a bovine low-density SNP array optimized for imputation

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The Illumina BovineLD BeadChip was designed to support imputation to higher density genotypes in dairy and beef breeds by including single-nucleotide polymorphisms (SNPs) that had a high minor allele frequency as well as uniform spacing across the genome except at the ends of the chromosome where de...

  13. Interagency arraying

    NASA Astrophysics Data System (ADS)

    Cox, Henry G.

    Activities performed to match ground aperture requirements for the Neptune encounter in August 1989 with the expected capabilities of the JPL Deep Space Network (DSN) are discussed. Ground aperture requirements, DSN capabilities, and the capabilities of other agencies are reviewed. The design and configurations of the receiver subsystem, combiner subsystem, monitor and control subsystem, recording subsystem, and supporting subsystems are described. The implementation of the Very Large Array-Goldstone Telemetry Array is discussed, and the differences involved with the Parkes-Canberra Telemetry Array implementation are highlighted. The operational concept is addressed.

  14. Enthalpy arrays

    NASA Astrophysics Data System (ADS)

    Torres, Francisco E.; Kuhn, Peter; de Bruyker, Dirk; Bell, Alan G.; Wolkin, Michal V.; Peeters, Eric; Williamson, James R.; Anderson, Gregory B.; Schmitz, Gregory P.; Recht, Michael I.; Schweizer, Sandra; Scott, Lincoln G.; Ho, Jackson H.; Elrod, Scott A.; Schultz, Peter G.; Lerner, Richard A.; Bruce, Richard H.

    2004-06-01

    We report the fabrication of enthalpy arrays and their use to detect molecular interactions, including protein-ligand binding, enzymatic turnover, and mitochondrial respiration. Enthalpy arrays provide a universal assay methodology with no need for specific assay development such as fluorescent labeling or immobilization of reagents, which can adversely affect the interaction. Microscale technology enables the fabrication of 96-detector enthalpy arrays on large substrates. The reduction in scale results in large decreases in both the sample quantity and the measurement time compared with conventional microcalorimetry. We demonstrate the utility of the enthalpy arrays by showing measurements for two protein-ligand binding interactions (RNase A + cytidine 2'-monophosphate and streptavidin + biotin), phosphorylation of glucose by hexokinase, and respiration of mitochondria in the presence of 2,4-dinitrophenol uncoupler.

  15. Array tomography: production of arrays.

    PubMed

    Micheva, Kristina D; O'Rourke, Nancy; Busse, Brad; Smith, Stephen J

    2010-11-01

    Array tomography is a volumetric microscopy method based on physical serial sectioning. Ultrathin sections of a plastic-embedded tissue are cut using an ultramicrotome, bonded in an ordered array to a glass coverslip, stained as desired, and imaged. The resulting two-dimensional image tiles can then be reconstructed computationally into three-dimensional volume images for visualization and quantitative analysis. The minimal thickness of individual sections permits high-quality rapid staining and imaging, whereas the array format allows reliable and convenient section handling, staining, and automated imaging. Also, the physical stability of the arrays permits images to be acquired and registered from repeated cycles of staining, imaging, and stain elution, as well as from imaging using multiple modalities (e.g., fluorescence and electron microscopy). Array tomography makes it possible to visualize and quantify previously inaccessible features of tissue structure and molecular architecture. However, careful preparation of the tissue is essential for successful array tomography; these steps can be time consuming and require some practice to perfect. This protocol describes the sectioning of embedded tissues and the mounting of the serial arrays. The procedures require some familiarity with the techniques used for ultramicrotome sectioning for electron microscopy. PMID:21041397

  16. Array tomography: imaging stained arrays.

    PubMed

    Micheva, Kristina D; O'Rourke, Nancy; Busse, Brad; Smith, Stephen J

    2010-11-01

    Array tomography is a volumetric microscopy method based on physical serial sectioning. Ultrathin sections of a plastic-embedded tissue are cut using an ultramicrotome, bonded in an ordered array to a glass coverslip, stained as desired, and imaged. The resulting two-dimensional image tiles can then be reconstructed computationally into three-dimensional volume images for visualization and quantitative analysis. The minimal thickness of individual sections permits high-quality rapid staining and imaging, whereas the array format allows reliable and convenient section handling, staining, and automated imaging. Also, the physical stability of the arrays permits images to be acquired and registered from repeated cycles of staining, imaging, and stain elution, as well as from imaging using multiple modalities (e.g., fluorescence and electron microscopy). Array tomography makes it possible to visualize and quantify previously inaccessible features of tissue structure and molecular architecture. However, careful preparation of the tissue is essential for successful array tomography; these steps can be time-consuming and require some practice to perfect. In this protocol, tissue arrays are imaged using conventional wide-field fluorescence microscopy. Images can be captured manually or, with the appropriate software and hardware, the process can be automated. PMID:21041399

  17. A Genome-Wide Study of Cytogenetic Changes in Colorectal Cancer Using SNP Microarrays: Opportunities for Future Personalized Treatment

    PubMed Central

    Jasmine, Farzana; Rahaman, Ronald; Dodsworth, Charlotte; Roy, Shantanu; Paul, Rupash; Raza, Maruf; Paul-Brutus, Rachelle; Kamal, Mohammed; Ahsan, Habibul; Kibriya, Muhammad G.

    2012-01-01

    In colorectal cancer (CRC), chromosomal instability (CIN) is typically studied using comparative-genomic hybridization (CGH) arrays. We studied paired (tumor and surrounding healthy) fresh frozen tissue from 86 CRC patients using Illumina's Infinium-based SNP array. This method allowed us to study CIN in CRC, with simultaneous analysis of copy number (CN) and B-allele frequency (BAF) - a representation of allelic composition. These data helped us to detect mono-allelic and bi-allelic amplifications/deletion, copy neutral loss of heterozygosity, and levels of mosaicism for mixed cell populations, some of which can not be assessed with other methods that do not measure BAF. We identified associations between CN abnormalities and different CRC phenotypes (histological diagnosis, location, tumor grade, stage, MSI and presence of lymph node metastasis). We showed commonalities between regions of CN change observed in CRC and the regions reported in previous studies of other solid cancers (e.g. amplifications of 20q, 13q, 8q, 5p and deletions of 18q, 17p and 8p). From Therapeutic Target Database, we identified relevant drugs, targeted to the genes located in these regions with CN changes, approved or in trials for other cancers and common diseases. These drugs may be considered for future therapeutic trials in CRC, based on personalized cytogenetic diagnosis. We also found many regions, harboring genes, which are not currently targeted by any relevant drugs that may be considered for future drug discovery studies. Our study shows the application of high density SNP arrays for cytogenetic study in CRC and its potential utility for personalized treatment. PMID:22363777

  18. Automated SNP detection from a large collection of white spruce expressed sequences: contributing factors and approaches for the categorization of SNPs

    PubMed Central

    Pavy, Nathalie; Parsons, Lee S; Paule, Charles; MacKay, John; Bousquet, Jean

    2006-01-01

    Background High-throughput genotyping technologies represent a highly efficient way to accelerate genetic mapping and enable association studies. As a first step toward this goal, we aimed to develop a resource of candidate Single Nucleotide Polymorphisms (SNP) in white spruce (Picea glauca [Moench] Voss), a softwood tree of major economic importance. Results A white spruce SNP resource encompassing 12,264 SNPs was constructed from a set of 6,459 contigs derived from Expressed Sequence Tags (EST) and by using the bayesian-based statistical software PolyBayes. Several parameters influencing the SNP prediction were analysed including the a priori expected polymorphism, the probability score (PSNP), and the contig depth and length. SNP detection in 3' and 5' reads from the same clones revealed a level of inconsistency between overlapping sequences as low as 1%. A subset of 245 predicted SNPs were verified through the independent resequencing of genomic DNA of a genotype also used to prepare cDNA libraries. The validation rate reached a maximum of 85% for SNPs predicted with either PSNP ≥ 0.95 or ≥ 0.99. A total of 9,310 SNPs were detected by using PSNP ≥ 0.95 as a criterion. The SNPs were distributed among 3,590 contigs encompassing an array of broad functional categories, with an overall frequency of 1 SNP per 700 nucleotide sites. Experimental and statistical approaches were used to evaluate the proportion of paralogous SNPs, with estimates in the range of 8 to 12%. The 3,789 coding SNPs identified through coding region annotation and ORF prediction, were distributed into 39% nonsynonymous and 61% synonymous substitutions. Overall, there were 0.9 SNP per 1,000 nonsynonymous sites and 5.2 SNPs per 1,000 synonymous sites, for a genome-wide nonsynonymous to synonymous substitution rate ratio (Ka/Ks) of 0.17. Conclusion We integrated the SNP data in the ForestTreeDB database along with functional annotations to provide a tool facilitating the choice of candidate

  19. A new diagnostic workflow for patients with mental retardation and/or multiple congenital abnormalities: test arrays first

    PubMed Central

    Gijsbers, Antoinet CJ; Lew, Janet YK; Bosch, Cathy AJ; Schuurs-Hoeijmakers, Janneke HM; van Haeringen, Arie; den Hollander, Nicolette S; Kant, Sarina G; Bijlsma, Emilia K; Breuning, Martijn H; Bakker, Egbert; Ruivenkamp, Claudia AL

    2009-01-01

    High-density single-nucleotide polymorphism (SNP) genotyping technology enables extensive genotyping as well as the detection of increasingly smaller chromosomal aberrations. In this study, we assess molecular karyotyping as first-round analysis of patients with mental retardation and/or multiple congenital abnormalities (MR/MCA). We used different commercially available SNP array platforms, the Affymetrix GeneChip 262K NspI, the Genechip 238K StyI, the Illumina HumanHap 300 and HumanCNV 370 BeadChip, to detect copy number variants (CNVs) in 318 patients with unexplained MR/MCA. We found abnormalities in 22.6% of the patients, including six CNVs that overlap known microdeletion/duplication syndromes, eight CNVs that overlap recently described syndromes, 63 potentially pathogenic CNVs (in 52 patients), four large segments of homozygosity and two mosaic trisomies for an entire chromosome. This study shows that high-density SNP array analysis reveals a much higher diagnostic yield as that of conventional karyotyping. SNP arrays have the potential to detect CNVs, mosaics, uniparental disomies and loss of heterozygosity in one experiment. We, therefore, propose a novel diagnostic approach to all MR/MCA patients by first analyzing every patient with an SNP array instead of conventional karyotyping. PMID:19436329

  20. Highly specific DNA detection employing ligation on suspension bead array readout.

    PubMed

    Mezger, Anja; Kühnemund, Malte; Nilsson, Mats; Herthnek, David

    2015-09-25

    We show for the first time that monomerized rolling circle amplification (RCA) products can be directly detected with the Luminex suspension bead array readout without the need of PCR amplification. Furthermore, using monomerized RCA products to guide ligation of the detection oligonucleotide (DO) to barcode sequences on the magnetic Luminex beads, combined with efficient washing and increased measurement temperature, yields a higher signal to noise ratio. As a proof-of-principle, we demonstrate detection of pathogenic DNA sequences with high reproducibility, sensitivity and a dynamic range over four orders of magnitude. Using padlock probes in combination with bead suspension arrays opens up the possibility for highly multiplexed DNA targeting and readout. PMID:25681158

  1. Exploration of SNP variants affecting hair colour prediction in Europeans.

    PubMed

    Söchtig, Jens; Phillips, Chris; Maroñas, Olalla; Gómez-Tato, Antonio; Cruz, Raquel; Alvarez-Dios, Jose; de Cal, María-Ángeles Casares; Ruiz, Yarimar; Reich, Kristian; Fondevila, Manuel; Carracedo, Ángel; Lareu, María V

    2015-09-01

    DNA profiling is a key tool for forensic analysis; however, current methods identify a suspect either by direct comparison or from DNA database searches. In cases with unidentified suspects, prediction of visible physical traits e.g. pigmentation or hair distribution of the DNA donors can provide important probative information. This study aimed to explore single nucleotide polymorphism (SNP) variants for their effect on hair colour prediction. A discovery panel of 63 SNPs consisting of already established hair colour markers from the HIrisPlex hair colour phenotyping assay as well as additional markers for which associations to human pigmentation traits were previously identified was used to develop multiplex assays based on SNaPshot single-base extension technology. A genotyping study was performed on a range of European populations (n = 605). Hair colour phenotyping was accomplished by matching donor's hair to a graded colour category system of reference shades and photography. Since multiple SNPs in combination contribute in varying degrees to hair colour predictability in Europeans, we aimed to compile a compact marker set that could provide a reliable hair colour inference from the fewest SNPs. The predictive approach developed uses a naïve Bayes classifier to provide hair colour assignment probabilities for the SNP profiles of the key SNPs and was embedded into the Snipper online SNP classifier ( http://mathgene.usc.es/snipper/ ). Results indicate that red, blond, brown and black hair colours are predictable with informative probabilities in a high proportion of cases. Our study resulted in the identification of 12 most strongly associated SNPs to hair pigmentation variation in six genes. PMID:26162598

  2. Microlens arrays

    NASA Astrophysics Data System (ADS)

    Hutley, Michael C.; Stevens, Richard F.; Daly, Daniel J.

    1992-04-01

    Microlenses have been with us for a long time as indeed the very word lens reminds us. Many early lenses,including those made by Hooke and Leeuwenhoek in the 17th century were small and resembled lentils. Many languages use the same word for both (French tilentillelt and German "Linse") and the connection is only obscure in English because we use the French word for the vegetable and the German for the optic. Many of the applications for arrays of inicrolenses are also well established. Lippmann's work on integral photography at the turn of the century required lens arrays and stimulated an interest that is very much alive today. At one stage, lens arrays played an important part in high speed photography and various schemes have been put forward to take advantage of the compact imaging properties of combinations of lens arrays. The fact that many of these ingenious schemes have not been developed to their full potential has to a large degree been due to the absence of lens arrays of a suitable quality and cost.

  3. Identification of null alleles and deletions from SNP genotypes for an intercross between domestic and wild chickens.

    PubMed

    Crooks, Lucy; Carlborg, Örjan; Marklund, Stefan; Johansson, Anna M

    2013-08-01

    We analyzed genotypes from ~10K single-nucleotide polymorphisms (SNPs) in two families of an F2 intercross between Red Junglefowl and White Leghorn chickens. Possible null alleles were found by patterns of incompatible and missing genotypes. We estimated that 2.6% of SNPs had null alleles compared with 2.3% with genotyping errors and that 40% of SNPs in which a parent and offspring were genotyped as different homozygotes had null alleles. Putative deletions were identified by null alleles at adjacent markers. We found two candidate deletions that were supported by fluorescence intensity data from a 60K SNP chip. One of the candidate deletions was from the Red Junglefowl, and one was present in both the Red Junglefowl and White Leghorn. Both candidate deletions spanned protein-coding regions and were close to a previously detected quantitative trait locus affecting body weight in this population. This study demonstrates that the ~50K SNP genotyping arrays now available for several agricultural species can be used to identify null alleles and deletions in data from large families. We suggest that our approach could be a useful complement to linkage analysis in experimental crosses. PMID:23708300

  4. Single-copy gene based 50 K SNP chip for genetic studies and molecular breeding in rice

    PubMed Central

    Singh, Nisha; Jayaswal, Pawan Kumar; Panda, Kabita; Mandal, Paritra; Kumar, Vinod; Singh, Balwant; Mishra, Shefali; Singh, Yashi; Singh, Renu; Rai, Vandna; Gupta, Anita; Raj Sharma, Tilak; Singh, Nagendra Kumar

    2015-01-01

    Single nucleotide polymorphism (SNP) is the most abundant DNA sequence variation present in plant genomes. Here, we report the design and validation of a unique genic-SNP genotyping chip for genetic and evolutionary studies as well as molecular breeding applications in rice. The chip incorporates 50,051 SNPs from 18,980 different genes spanning 12 rice chromosomes, including 3,710 single-copy (SC) genes conserved between wheat and rice, 14,959 SC genes unique to rice, 194 agronomically important cloned rice genes and 117 multi-copy rice genes. Assays with this chip showed high success rate and reproducibility because of the SC gene based array with no sequence redundancy and cross-hybridisation problems. The usefulness of the chip in genetic diversity and phylogenetic studies of cultivated and wild rice germplasm was demonstrated. Furthermore, its efficacy was validated for analysing background recovery in improved mega rice varieties with submergence tolerance developed through marker-assisted backcross breeding. PMID:26111882

  5. A Search for Low-Mass Dark Matter with the Cryogenic Dark Matter Search and the Development of Highly Multiplexed Phonon-Mediated Particle Detectors

    NASA Astrophysics Data System (ADS)

    Moore, David Craig

    2012-06-01

    A wide variety of astrophysical observations indicate that approximately 85% of the matter in the universe is nonbaryonic and nonluminous. Understanding the nature of this "dark matter" is one of the most important outstanding questions in cosmology. Weakly Interacting Massive Particles (WIMPs) are a leading candidate for dark matter since they would be thermally produced in the early universe in the correct abundance to account for the observed relic density of dark matter. If WIMPs account for the dark matter, then rare interactions from relic WIMPs should be observable in terrestrial detectors. Recently, unexplained excess events in the DAMA/LIBRA, CoGeNT, and CRESST-II experiments have been interpreted as evidence of scattering from WIMPs with masses ˜10 GeV and spin-independent scattering cross sections of 10--41--10 --40 cm2. The Cryogenic Dark Matter Search (CDMS II) attempts to identify WIMP interactions using an array of cryogenic germanium and silicon particle detectors located at the Soudan Underground Laboratory in northern Minnesota. In this dissertation, data taken by CDMS II are reanalyzed using a 2 keV recoil energy threshold to increase the sensitivity to WIMPs with masses ˜10 GeV. These data disfavor an explanation for the DAMA/LIBRA, CoGeNT, and CRESST-II results in terms of spin-independent elastic scattering of WIMPs with masses ≲ 12 GeV, under standard assumptions. At the time of publication, they provided the strongest constraints on spin-independent elastic scattering from 5--9 GeV, ruling out previously unexplored parameter space. To detect WIMPs or exclude the remaining parameter space favored by the most popular models will ultimately require detectors with target masses ≳ 1 ton, requiring an increase in mass by more than two orders of magnitude over CDMS II. For cryogenic detectors such as CDMS, scaling to such large target masses will require individual detector elements to be fabricated more quickly and cheaply, while maintaining

  6. A search for low-mass dark matter with the cryogenic dark matter search and the development of highly multiplexed phonon-mediated particle detectors

    SciTech Connect

    Moore, David Craig

    2012-01-01

    A wide variety of astrophysical observations indicate that approximately 85% of the matter in the universe is nonbaryonic and nonluminous. Understanding the nature of this "dark matter" is one of the most important outstanding questions in cosmology. Weakly Interacting Massive Particles (WIMPs) are a leading candidate for dark matter since they would be thermally produced in the early universe in the correct abundance to account for the observed relic density of dark matter. If WIMPs account for the dark matter, then rare interactions from relic WIMPs should be observable in terrestrial detectors. Recently, unexplained excess events in the DAMA/LIBRA, CoGeNT, and CRESST-II experiments have been interpreted as evidence of scattering from WIMPs with masses ~10 GeV and spin-independent scattering cross sections of 10-41-10-40 cm2. The Cryogenic Dark Matter Search (CDMS II) attempts to identify WIMP interactions using an array of cryogenic germanium and silicon particle detectors located at the Soudan Underground Laboratory in northern Minnesota. In this dissertation, data taken by CDMS II are reanalyzed using a 2 keV recoil energy threshold to increase the sensitivity to WIMPs with masses ~10 GeV. These data disfavor an explanation for the DAMA/LIBRA, CoGeNT, and CRESST-II results in terms of spin-independent elastic scattering of WIMPs with masses ≲12 GeV, under standard assumptions. At the time of publication, they provided the strongest constraints on spin-independent elastic scattering from 5-9 GeV, ruling out previously unexplored parameter space. To detect WIMPs or exclude the remaining parameter space favored by the most popular models will ultimately require detectors with target masses ≳1 ton, requiring an increase in mass by more than two orders of magnitude over CDMS II. For cryogenic detectors such as CDMS, scaling to such large target masses will require individual detector elements to be fabricated more quickly and cheaply, while

  7. PCR amplification of SNP loci from crude DNA for large-scale genotyping of oomycetes.

    PubMed

    Hu, Jian; Lyon, Rebecca; Zhou, Yuxin; Lamour, Kurt

    2014-01-01

    Similar to other eukaryotes, single nucleotide polymorphism (SNP) markers are abundant in many oomycete plant pathogen genomes. High resolution DNA melting analysis (HR-DMA) is a cost-effective method for SNP genotyping, but like many SNP marker technologies, is limited by the amount and quality of template DNA. We describe PCR preamplification of Phytophthora and Peronospora SNP loci from crude DNA extracted from a small amount of mycelium and/or infected plant tissue to produce sufficient template to genotype at least 10 000 SNPs. The approach is fast, inexpensive, requires minimal biological material and should be useful for many organisms in a variety of contexts. PMID:24871597

  8. Accuracy of direct genomic values in Holstein bulls and cows using subsets of SNP markers

    PubMed Central

    2010-01-01

    Background At the current price, the use of high-density single nucleotide polymorphisms (SNP) genotyping assays in genomic selection of dairy cattle is limited to applications involving elite sires and dams. The objective of this study was to evaluate the use of low-density assays to predict direct genomic value (DGV) on five milk production traits, an overall conformation trait, a survival index, and two profit index traits (APR, ASI). Methods Dense SNP genotypes were available for 42,576 SNP for 2,114 Holstein bulls and 510 cows. A subset of 1,847 bulls born between 1955 and 2004 was used as a training set to fit models with various sets of pre-selected SNP. A group of 297 bulls born between 2001 and 2004 and all cows born between 1992 and 2004 were used to evaluate the accuracy of DGV prediction. Ridge regression (RR) and partial least squares regression (PLSR) were used to derive prediction equations and to rank SNP based on the absolute value of the regression coefficients. Four alternative strategies were applied to select subset of SNP, namely: subsets of the highest ranked SNP for each individual trait, or a single subset of evenly spaced SNP, where SNP were selected based on their rank for ASI, APR or minor allele frequency within intervals of approximately equal length. Results RR and PLSR performed very similarly to predict DGV, with PLSR performing better for low-density assays and RR for higher-density SNP sets. When using all SNP, DGV predictions for production traits, which have a higher heritability, were more accurate (0.52-0.64) than for survival (0.19-0.20), which has a low heritability. The gain in accuracy using subsets that included the highest ranked SNP for each trait was marginal (5-6%) over a common set of evenly spaced SNP when at least 3,000 SNP were used. Subsets containing 3,000 SNP provided more than 90% of the accuracy that could be achieved with a high-density assay for cows, and 80% of the high-density assay for young bulls

  9. Lack of association between MDM2 promoter SNP309 and clinical outcome in patients with neuroblastoma.

    PubMed

    Rihani, Ali; Van Maerken, Tom; De Wilde, Bram; Zeka, Fjoralba; Laureys, Geneviève; Norga, Koen; Tonini, Gian Paolo; Coco, Simona; Versteeg, Rogier; Noguera, Rosa; Schulte, Johannes H; Eggert, Angelika; Stallings, Raymond L; Speleman, Frank; Vandesompele, Jo

    2014-10-01

    While a polymorphism located within the promoter region of the MDM2 proto-oncogene, SNP309 (T > G), has previously been associated with increased risk and aggressiveness of neuroblastoma and other tumor entities, a protective effect has also been reported in certain other cancers. In this study, we evaluated the association of MDM2 SNP309 with outcome in 496 patients with neuroblastoma and its effect on MDM2 expression. No significant difference in overall or event-free survival was observed among patients with neuroblastoma with or without MDM2 SNP309. The presence of SNP309 does not affect MDM2 expression in neuroblastoma. PMID:24391119

  10. Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms

    PubMed Central

    2012-01-01

    Background A genome-wide set of single nucleotide polymorphisms (SNPs) is a valuable resource in genetic research and breeding and is usually developed by re-sequencing a genome. If a genome sequence is not available, an alternative strategy must be used. We previously reported the development of a pipeline (AGSNP) for genome-wide SNP discovery in coding sequences and other single-copy DNA without a complete genome sequence in self-pollinating (autogamous) plants. Here we updated this pipeline for SNP discovery in outcrossing (allogamous) species and demonstrated its efficacy in SNP discovery in walnut (Juglans regia L.). Results The first step in the original implementation of the AGSNP pipeline was the construction of a reference sequence and the identification of single-copy sequences in it. To identify single-copy sequences, multiple genome equivalents of short SOLiD reads of another individual were mapped to shallow genome coverage of long Sanger or Roche 454 reads making up the reference sequence. The relative depth of SOLiD reads was used to filter out repeated sequences from single-copy sequences in the reference sequence. The second step was a search for SNPs between SOLiD reads and the reference sequence. Polymorphism within the mapped SOLiD reads would have precluded SNP discovery; hence both individuals had to be homozygous. The AGSNP pipeline was updated here for using SOLiD or other type of short reads of a heterozygous individual for these two principal steps. A total of 32.6X walnut genome equivalents of SOLiD reads of vegetatively propagated walnut scion cultivar ‘Chandler’ were mapped to 48,661 ‘Chandler’ bacterial artificial chromosome (BAC) end sequences (BESs) produced by Sanger sequencing during the construction of a walnut physical map. A total of 22,799 putative SNPs were initially identified. A total of 6,000 Infinium II type SNPs evenly distributed along the walnut physical map were selected for the construction of an Infinium Bead

  11. Four-copy number intervals in SNP microarray analysis: unique patterns and positions.

    PubMed

    Papenhausen, Peter R; Kelly, Carla A; Zvereff, Val; Schwartz, Stuart

    2014-01-01

    Over the past several years, the utility of microarray technology in delineating copy number changes has become well established. In the past 4 years, we have used the SNP array to detect and analyze allele ratios in 150 cases with 4-copy intervals, confirmed by FISH, offering insight into the underlying mechanisms of formation. These cases may be divided into 5 allele patterns--the first 4 of which involve a single homologue--as detected by the genotyping aspects of the microarray: (1) triplications combining homozygous and heterozygous alleles, with a 3:1 ratio of heterozygotes; (2) triplications with allele patterns combining homozygous and heterozygous alleles, with heterozygote ratios of both 3:1 and 2:2; (3) triplications that have homozygous alleles combined with only 2:2 heterozygous alleles; (4) triplications that are completely homozygous; and (5) homozygous duplications on each homologue with no heterozygous alleles. The implications of copy number variants with diverse allelic segregations are presented in this study. PMID:25401283

  12. On the use of dense SNP marker data for the identification of distant relative pairs.

    PubMed

    Sun, M; Jobling, M A; Taliun, D; Pramstaller, P P; Egeland, T; Sheehan, N A

    2016-02-01

    There has been recent interest in the exploitation of readily available dense genome scan marker data for the identification of relatives. However, there are conflicting findings on how informative these data are in practical situations and, in particular, sets of thinned markers are often used with no concrete justification for the chosen spacing. We explore the potential usefulness of dense single nucleotide polymorphism (SNP) arrays for this application with a focus on inferring distant relative pairs. We distinguish between relationship estimation, as defined by a pedigree connecting the two individuals of interest, and estimation of general relatedness as would be provided by a kinship coefficient or a coefficient of relatedness. Since our primary interest is in the former case, we adopt a pedigree likelihood approach. We consider the effect of additional SNPs and data on an additional typed relative, together with choice of that relative, on relationship inference. We also consider the effect of linkage disequilibrium. When overall relatedness, rather than the specific relationship, would suffice, we propose an approximate approach that is easy to implement and appears to compete well with a popular moment-based estimator and a recent maximum likelihood approach based on chromosomal sharing. We conclude that denser marker data are more informative for distant relatives. However, linkage disequilibrium cannot be ignored and will be the main limiting factor for applications to real data. PMID:26474828

  13. Prospective diagnostic analysis of copy number variants using SNP microarrays in individuals with autism spectrum disorders.

    PubMed

    Nava, Caroline; Keren, Boris; Mignot, Cyril; Rastetter, Agnès; Chantot-Bastaraud, Sandra; Faudet, Anne; Fonteneau, Eric; Amiet, Claire; Laurent, Claudine; Jacquette, Aurélia; Whalen, Sandra; Afenjar, Alexandra; Périsse, Didier; Doummar, Diane; Dorison, Nathalie; Leboyer, Marion; Siffroi, Jean-Pierre; Cohen, David; Brice, Alexis; Héron, Delphine; Depienne, Christel

    2014-01-01

    Copy number variants (CNVs) have repeatedly been found to cause or predispose to autism spectrum disorders (ASDs). For diagnostic purposes, we screened 194 individuals with ASDs for CNVs using Illumina SNP arrays. In several probands, we also analyzed candidate genes located in inherited deletions to unmask autosomal recessive variants. Three CNVs, a de novo triplication of chromosome 15q11-q12 of paternal origin, a deletion on chromosome 9p24 and a de novo 3q29 deletion, were identified as the cause of the disorder in one individual each. An autosomal recessive cause was considered possible in two patients: a homozygous 1p31.1 deletion encompassing PTGER3 and a deletion of the entire DOCK10 gene associated with a rare hemizygous missense variant. We also identified multiple private or recurrent CNVs, the majority of which were inherited from asymptomatic parents. Although highly penetrant CNVs or variants inherited in an autosomal recessive manner were detected in rare cases, our results mainly support the hypothesis that most CNVs contribute to ASDs in association with other CNVs or point variants located elsewhere in the genome. Identification of these genetic interactions in individuals with ASDs constitutes a formidable challenge. PMID:23632794

  14. Prospective diagnostic analysis of copy number variants using SNP microarrays in individuals with autism spectrum disorders

    PubMed Central

    Nava, Caroline; Keren, Boris; Mignot, Cyril; Rastetter, Agnès; Chantot-Bastaraud, Sandra; Faudet, Anne; Fonteneau, Eric; Amiet, Claire; Laurent, Claudine; Jacquette, Aurélia; Whalen, Sandra; Afenjar, Alexandra; Périsse, Didier; Doummar, Diane; Dorison, Nathalie; Leboyer, Marion; Siffroi, Jean-Pierre; Cohen, David; Brice, Alexis; Héron, Delphine; Depienne, Christel

    2014-01-01

    Copy number variants (CNVs) have repeatedly been found to cause or predispose to autism spectrum disorders (ASDs). For diagnostic purposes, we screened 194 individuals with ASDs for CNVs using Illumina SNP arrays. In several probands, we also analyzed candidate genes located in inherited deletions to unmask autosomal recessive variants. Three CNVs, a de novo triplication of chromosome 15q11–q12 of paternal origin, a deletion on chromosome 9p24 and a de novo 3q29 deletion, were identified as the cause of the disorder in one individual each. An autosomal recessive cause was considered possible in two patients: a homozygous 1p31.1 deletion encompassing PTGER3 and a deletion of the entire DOCK10 gene associated with a rare hemizygous missense variant. We also identified multiple private or recurrent CNVs, the majority of which were inherited from asymptomatic parents. Although highly penetrant CNVs or variants inherited in an autosomal recessive manner were detected in rare cases, our results mainly support the hypothesis that most CNVs contribute to ASDs in association with other CNVs or point variants located elsewhere in the genome. Identification of these genetic interactions in individuals with ASDs constitutes a formidable challenge. PMID:23632794

  15. RNASEL and MIR146A SNP-SNP Interaction as a Susceptibility Factor for Non-Melanoma Skin Cancer

    PubMed Central

    Farzan, Shohreh F.; Karagas, Margaret R.; Christensen, Brock C.; Li, Zhongze; Kuriger, Jacquelyn K.; Nelson, Heather H.

    2014-01-01

    Immunity and inflammatory pathways are important in the genesis of non-melanoma skin cancers (NMSC). Functional genetic variation in immune modulators has the potential to affect disease etiology. We investigated associations between common variants in two key regulators, MIR146A and RNASEL, and their relation to NMSCs. Using a large population-based case-control study of basal cell (BCC) and squamous cell carcinoma (SCC), we investigated the impact of MIR146A SNP rs2910164 on cancer risk, and interaction with a SNP in one of its putative targets (RNASEL, rs486907). To examine associations between genotype and BCC and SCC, occurrence odds ratios (OR) and 95% confidence intervals (95%CI) were calculated using unconditional logistic regression, accounting for multiple confounding factors. We did not observe an overall change in the odds ratios for SCC or BCC among individuals carrying either of the RNASEL or MIR146A variants compared with those who were wild type at these loci. However, there was a sex-specific association between BCC and MIR146A in women (ORGC = 0.73, [95%CI = 0.52–1.03]; ORCC = 0.29, [95% CI = 0.14–0.61], p-trend<0.001), and a reduction in risk, albeit not statistically significant, associated with RNASEL and SCC in men (ORAG = 0.88, [95%CI = 0.65–1.19]; ORAA = 0.68, [95%CI = 0.43–1.08], p-trend = 0.10). Most striking was the strong interaction between the two genes. Among individuals carrying variant alleles of both rs2910164 and rs486907, we observed inverse relationships with SCC (ORSCC = 0.56, [95%CI = 0.38–0.81], p-interaction = 0.012) and BCC (ORBCC = 0.57, [95%CI = 0.40–0.80], p-interaction = 0.005). Our results suggest that genetic variation in immune and inflammatory regulators may influence susceptibility to NMSC, and novel SNP-SNP interaction for a microRNA and its target. These data suggest that RNASEL, an enzyme involved in RNA turnover, is controlled by miR-146a

  16. RNASEL and MIR146A SNP-SNP interaction as a susceptibility factor for non-melanoma skin cancer.

    PubMed

    Farzan, Shohreh F; Karagas, Margaret R; Christensen, Brock C; Li, Zhongze; Kuriger, Jacquelyn K; Nelson, Heather H

    2014-01-01

    Immunity and inflammatory pathways are important in the genesis of non-melanoma skin cancers (NMSC). Functional genetic variation in immune modulators has the potential to affect disease etiology. We investigated associations between common variants in two key regulators, MIR146A and RNASEL, and their relation to NMSCs. Using a large population-based case-control study of basal cell (BCC) and squamous cell carcinoma (SCC), we investigated the impact of MIR146A SNP rs2910164 on cancer risk, and interaction with a SNP in one of its putative targets (RNASEL, rs486907). To examine associations between genotype and BCC and SCC, occurrence odds ratios (OR) and 95% confidence intervals (95%CI) were calculated using unconditional logistic regression, accounting for multiple confounding factors. We did not observe an overall change in the odds ratios for SCC or BCC among individuals carrying either of the RNASEL or MIR146A variants compared with those who were wild type at these loci. However, there was a sex-specific association between BCC and MIR146A in women (ORGC = 0.73, [95%CI = 0.52-1.03]; ORCC = 0.29, [95% CI = 0.14-0.61], p-trend<0.001), and a reduction in risk, albeit not statistically significant, associated with RNASEL and SCC in men (ORAG = 0.88, [95%CI = 0.65-1.19]; ORAA = 0.68, [95%CI = 0.43-1.08], p-trend = 0.10). Most striking was the strong interaction between the two genes. Among individuals carrying variant alleles of both rs2910164 and rs486907, we observed inverse relationships with SCC (ORSCC = 0.56, [95%CI = 0.38-0.81], p-interaction = 0.012) and BCC (ORBCC = 0.57, [95%CI = 0.40-0.80], p-interaction = 0.005). Our results suggest that genetic variation in immune and inflammatory regulators may influence susceptibility to NMSC, and novel SNP-SNP interaction for a microRNA and its target. These data suggest that RNASEL, an enzyme involved in RNA turnover, is controlled by miR-146a and may be important in NMSC etiology. PMID:24699816

  17. SNP Markers and Their Impact on Plant Breeding

    PubMed Central

    Mammadov, Jafar; Aggarwal, Rajat; Buyyarapu, Ramesh; Kumpatla, Siva

    2012-01-01

    The use of molecular markers has revolutionized the pace and precision of plant genetic analysis which in turn facilitated the implementation of molecular breeding of crops. The last three decades have seen tremendous advances in the evolution of marker systems and the respective detection platforms. Markers based on single nucleotide polymorphisms (SNPs) have rapidly gained the center stage of molecular genetics during the recent years due to their abundance in the genomes and their amenability for high-throughput detection formats and platforms. Computational approaches dominate SNP discovery methods due to the ever-increasing sequence information in public databases; however, complex genomes pose special challenges in the identification of informative SNPs warranting alternative strategies in those crops. Many genotyping platforms and chemistries have become available making the use of SNPs even more attractive and efficient. This paper provides a review of historical and current efforts in the development, validation, and application of SNP markers in QTL/gene discovery and plant breeding by discussing key experimental strategies and cases exemplifying their impact. PMID:23316221

  18. Eigenanalysis of SNP data with an identity by descent interpretation.

    PubMed

    Zheng, Xiuwen; Weir, Bruce S

    2016-02-01

    Principal component analysis (PCA) is widely used in genome-wide association studies (GWAS), and the principal component axes often represent perpendicular gradients in geographic space. The explanation of PCA results is of major interest for geneticists to understand fundamental demographic parameters. Here, we provide an interpretation of PCA based on relatedness measures, which are described by the probability that sets of genes are identical-by-descent (IBD). An approximately linear transformation between ancestral proportions (AP) of individuals with multiple ancestries and their projections onto the principal components is found. In addition, a new method of eigenanalysis "EIGMIX" is proposed to estimate individual ancestries. EIGMIX is a method of moments with computational efficiency suitable for millions of SNP data, and it is not subject to the assumption of linkage equilibrium. With the assumptions of multiple ancestries and their surrogate ancestral samples, EIGMIX is able to infer ancestral proportions (APs) of individuals. The methods were applied to the SNP data from the HapMap Phase 3 project and the Human Genome Diversity Panel. The APs of individuals inferred by EIGMIX are consistent with the findings of the program ADMIXTURE. In conclusion, EIGMIX can be used to detect population structure and estimate genome-wide ancestral proportions with a relatively high accuracy. PMID:26482676

  19. SNPs Array Karyotyping in Non-Hodgkin Lymphoma

    PubMed Central

    Etebari, Maryam; Navari, Mohsen; Piccaluga, Pier Paolo

    2015-01-01

    The traditional methods for detection of chromosomal aberrations, which included cytogenetic or gene candidate solutions, suffered from low sensitivity or the need for previous knowledge of the target regions of the genome. With the advent of single nucleotide polymorphism (SNP) arrays, genome screening at global level in order to find chromosomal aberrations like copy number variants, DNA amplifications, deletions, and also loss of heterozygosity became feasible. In this review, we present an update of the knowledge, gained by SNPs arrays, of the genomic complexity of the most important subtypes of non-Hodgkin lymphomas.

  20. Global Arrays

    SciTech Connect

    2006-02-23

    The Global Arrays (GA) toolkit provides an efficient and portable “shared-memory” programming interface for distributed-memory computers. Each process in a MIMD parallel program can asynchronously access logical blocks of physically distributed dense multi-dimensional arrays, without need for explicit cooperation by other processes. Unlike other shared-memory environments, the GA model exposes to the programmer the non-uniform memory access (NUMA) characteristics of the high performance computers and acknowledges that access to a remote portion of the shared data is slower than to the local portion. The locality information for the shared data is available, and a direct access to the local portions of shared data is provided. Global Arrays have been designed to complement rather than substitute for the message-passing programming model. The programmer is free to use both the shared-memory and message-passing paradigms in the same program, and to take advantage of existing message-passing software libraries. Global Arrays are compatible with the Message Passing Interface (MPI).

  1. Pacific Array

    NASA Astrophysics Data System (ADS)

    Kawakatsu, H.; Takeo, A.; Isse, T.; Nishida, K.; Shiobara, H.; Suetsugu, D.

    2014-12-01

    Based on our recent results on broadband ocean bottom seismometry, we propose a next generation large-scale array experiment in the ocean. Recent advances in ocean bottom broadband seismometry (e.g., Suetsugu & Shiobara, 2014, Annual Review EPS), together with advances in the seismic analysis methodology, have now enabled us to resolve the regional 1-D structure of the entire lithosphere/asthenosphere system, including seismic anisotropy (both radial and azimuthal), with deployments of ~10-15 broadband ocean bottom seismometers (BBOBSs) (namely "ocean-bottom broadband dispersion survey"; Takeo et al., 2013, JGR; Kawakatsu et al., 2013, AGU; Takeo, 2014, Ph.D. Thesis; Takeo et al., 2014, JpGU). Having ~15 BBOBSs as an array unit for 2-year deployment, and repeating such deployments in a leap-frog way (an array of arrays) for a decade or so would enable us to cover a large portion of the Pacific basin. Such efforts, not only by giving regional constraints on the 1-D structure, but also by sharing waveform data for global scale waveform tomography, would drastically increase our knowledge of how plate tectonics works on this planet, as well as how it worked for the past 150 million years. International collaborations might be sought.

  2. Global Arrays

    Energy Science and Technology Software Center (ESTSC)

    2006-02-23

    The Global Arrays (GA) toolkit provides an efficient and portable “shared-memory” programming interface for distributed-memory computers. Each process in a MIMD parallel program can asynchronously access logical blocks of physically distributed dense multi-dimensional arrays, without need for explicit cooperation by other processes. Unlike other shared-memory environments, the GA model exposes to the programmer the non-uniform memory access (NUMA) characteristics of the high performance computers and acknowledges that access to a remote portion of the sharedmore » data is slower than to the local portion. The locality information for the shared data is available, and a direct access to the local portions of shared data is provided. Global Arrays have been designed to complement rather than substitute for the message-passing programming model. The programmer is free to use both the shared-memory and message-passing paradigms in the same program, and to take advantage of existing message-passing software libraries. Global Arrays are compatible with the Message Passing Interface (MPI).« less

  3. Genome-wide single-nucleotide polymorphism array-based karyotyping in myelodysplastic syndrome and chronic myelomonocytic leukemia and its impact on treatment outcomes following decitabine treatment.

    PubMed

    Yi, Jun Ho; Huh, Jungwon; Kim, Hee-Jin; Kim, Sun-Hee; Kim, Sung Hyun; Kim, Kyoung Ha; Do, Young Rok; Mun, Yeung-Chul; Kim, Hawk; Kim, Min Kyoung; Kim, Hyeoung-Joon; Kim, TaeHyung; Kim, Dennis Dong Hwan

    2013-04-01

    Decitabine is a hypomethylating agent with proven clinical efficacy in myelodysplastic syndrome (MDS). The current study analyzed the role of single nucleotide polymorphism array (SNP-A)-based karyotyping in prediction of clinical outcome in MDS or chronic myelomonocytic leukemia (CMML) patients following decitabine therapy. A total of 61 MDS/CMML patients treated with decitabine were evaluated with Genome-Wide Human SNP 6.0 Array using DNAs derived from marrow samples. The primary endpoint was the best response rate including complete (CR) and partial response (PR) with overall (OS) and event-free survival (EFS) as secondary endpoints. Best response was noted in 14 patients (26.4 %) out of 53 evaluated patients including 12 CR and two PR with median follow-up of 21.6 months. A total of 81 abnormal SNP lesions were found in 25 out of 61 patients (41.0 %). The patients carrying abnormal SNP lesions showed an inferior CR/PR rate (p = 0.002) and showed a trend of worse OS (p = 0.02 in univariate, p = 0.09 in multivariate) compared to those without SNP lesions, but not were associated with inferior EFS. The presence of abnormal SNP lesions in MDS was associated with adverse outcomes following decitabine therapy. Further study is strongly warranted to establish the role of SNP-A karyotyping in MDS. PMID:23262795

  4. FunctSNP: an R package to link SNPs to functional knowledge and dbAutoMaker: a suite of Perl scripts to build SNP databases

    PubMed Central

    2010-01-01

    Background Whole genome association studies using highly dense single nucleotide polymorphisms (SNPs) are a set of methods to identify DNA markers associated with variation in a particular complex trait of interest. One of the main outcomes from these studies is a subset of statistically significant SNPs. Finding the potential biological functions of such SNPs can be an important step towards further use in human and agricultural populations (e.g., for identifying genes related to susceptibility to complex diseases or genes playing key roles in development or performance). The current challenge is that the information holding the clues to SNP functions is distributed across many different databases. Efficient bioinformatics tools are therefore needed to seamlessly integrate up-to-date functional information on SNPs. Many web services have arisen to meet the challenge but most work only within the framework of human medical research. Although we acknowledge the importance of human research, we identify there is a need for SNP annotation tools for other organisms. Description We introduce an R package called FunctSNP, which is the user interface to custom built species-specific databases. The local relational databases contain SNP data together with functional annotations extracted from online resources. FunctSNP provides a unified bioinformatics resource to link SNPs with functional knowledge (e.g., genes, pathways, ontologies). We also introduce dbAutoMaker, a suite of Perl scripts, which can be scheduled to run periodically to automatically create/update the customised SNP databases. We illustrate the use of FunctSNP with a livestock example, but the approach and software tools presented here can be applied also to human and other organisms. Conclusions Finding the potential functional significance of SNPs is important when further using the outcomes from whole genome association studies. FunctSNP is unique in that it is the only R package that links SNPs to

  5. SNP Discovery for mapping alien introgressions in wheat

    PubMed Central

    2014-01-01

    Background Monitoring alien introgressions in crop plants is difficult due to the lack of genetic and molecular mapping information on the wild crop relatives. The tertiary gene pool of wheat is a very important source of genetic variability for wheat improvement against biotic and abiotic stresses. By exploring the 5Mg short arm (5MgS) of Aegilops geniculata, we can apply chromosome genomics for the discovery of SNP markers and their use for monitoring alien introgressions in wheat (Triticum aestivum L). Results The short arm of chromosome 5Mg of Ae. geniculata Roth (syn. Ae. ovata L.; 2n = 4x = 28, UgUgMgMg) was flow-sorted from a wheat line in which it is maintained as a telocentric chromosome. DNA of the sorted arm was amplified and sequenced using an Illumina Hiseq 2000 with ~45x coverage. The sequence data was used for SNP discovery against wheat homoeologous group-5 assemblies. A total of 2,178 unique, 5MgS-specific SNPs were discovered. Randomly selected samples of 59 5MgS-specific SNPs were tested (44 by KASPar assay and 15 by Sanger sequencing) and 84% were validated. Of the selected SNPs, 97% mapped to a chromosome 5Mg addition to wheat (the source of t5MgS), and 94% to 5Mg introgressed from a different accession of Ae. geniculata substituting for chromosome 5D of wheat. The validated SNPs also identified chromosome segments of 5MgS origin in a set of T5D-5Mg translocation lines; eight SNPs (25%) mapped to TA5601 [T5DL · 5DS-5MgS(0.75)] and three (8%) to TA5602 [T5DL · 5DS-5MgS (0.95)]. SNPs (gsnp_5ms83 and gsnp_5ms94), tagging chromosome T5DL · 5DS-5MgS(0.95) with the smallest introgression carrying resistance to leaf rust (Lr57) and stripe rust (Yr40), were validated in two released germplasm lines with Lr57 and Yr40 genes. Conclusion This approach should be widely applicable for the identification of species/genome-specific SNPs. The development of a large number of SNP markers will facilitate the precise introgression and

  6. Identification of SNP barcode biomarkers for genes associated with facial emotion perception using particle swarm optimization algorithm

    PubMed Central

    2014-01-01

    Background Facial emotion perception (FEP) can affect social function. We previously reported that parts of five tested single-nucleotide polymorphisms (SNPs) in the MET and AKT1 genes may individually affect FEP performance. However, the effects of SNP-SNP interactions on FEP performance remain unclear. Methods This study compared patients with high and low FEP performances (n = 89 and 93, respectively). A particle swarm optimization (PSO) algorithm was used to identify the best SNP barcodes (i.e., the SNP combinations and genotypes that revealed the largest differences between the high and low FEP groups). Results The analyses of individual SNPs showed no significant differences between the high and low FEP groups. However, comparisons of multiple SNP-SNP interactions involving different combinations of two to five SNPs showed that the best PSO-generated SNP barcodes were significantly associated with high FEP score. The analyses of the joint effects of the best SNP barcodes for two to five interacting SNPs also showed that the best SNP barcodes had significantly higher odds ratios (2.119 to 3.138; P < 0.05) compared to other SNP barcodes. In conclusion, the proposed PSO algorithm effectively identifies the best SNP barcodes that have the strongest associations with FEP performance. Conclusions This study also proposes a computational methodology for analyzing complex SNP-SNP interactions in social cognition domains such as recognition of facial emotion. PMID:24955105

  7. A new SNP panel for evaluating genetic diversity in a composite cattle breed

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A custom 60K SNP panel, extracted from Bovine HD SNP chip was used to evaluate genotypic frequency changes in Braford (BF, a composite breed) when compared to progenitor breeds: Hereford (HF), Brahman (BR), and Nelore (NE). Samples from both the U. S. and Brazil were used. The new panel differentiat...

  8. A Coordinated Approach to Peach SNP Discovery in RosBREED

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In the USDA-funded multi-institutional and trans-disciplinary project, “RosBREED”, crop-specific SNP genome scan platforms are being developed for peach, apple, strawberry, and cherry at a resolution of at least one polymorphic SNP marker every 5 cM in any random cross, for use in Pedigree-Based Ana...

  9. A genome-wide SNP panel for genetic diversity, mapping and breeding studies in rice

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide SNP resource was developed for rice using the GoldenGate assay and used to genotype 400 landrace accessions of O. sativa. SNPs were originally discovered using Perlegen re-sequencing technology in 20 diverse landraces of O. sativa as part of OryzaSNP project (http://irfgc.irri.org). An...

  10. Characterization of the Cattle HapMap Population using the Illumina Bovine-50K SNP Chip

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A Bovine 50K Illumina™ iSelect SNP chip (51,386 polymorphic SNP markers) was designed using a combination of publicly available SNPs along with highly informative novel SNPs discovered using a reduced representation and next-generation sequencing technology strategy. A total of 576 animals (426 mal...

  11. Strategies to build high-density linkage maps of the porcine 60k SNP chip

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We present here two different strategies to compute high-density linkage maps based on the porcine 60k SNP chip that was genotyped on 4 different pedigrees with a total of 5600 animals. The first strategy uses the draft sequence as a reference order, the SNP being first mapped to it. The second stra...

  12. Development and Applications of a Bovine 50,000 SNP Chip

    Technology Transfer Automated Retrieval System (TEKTRAN)

    To develop an Illumina iSelect high density single nucleotide polymorphism (SNP) assay for cattle, the collaborative iBMC (Illumina, USDA ARS Beltsville, University of Missouri, USDA ARS Clay Center) Consortium first performed a de novo SNP discovery project in which genomic reduced representation l...

  13. The development and characterization of a 60K SNP chip for chicken

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In livestock species like the chicken, high throughput SNP genotyping assays are increasingly being used for whole genome association studies and as a tool in breeding (referred to as genomic selection). We describe the design of a moderate density (60K) Illumina SNP BeadChip in chicken consisting o...

  14. SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome projects routinely produce draft sequences for species from diverse evolutionary clades, but generally do not create single nucleotide polymorphism (SNP) resources. We present an approach for de novo SNP discovery based on short-read sequencing of reduced representation libraries (RRL) to ge...

  15. SNP-VISTA: An Interactive SNPs Visualization Tool

    SciTech Connect

    Shah, Nameeta; Teplitsky, Michael V.; Pennacchio, Len A.; Hugenholtz, Philip; Hamann, Bernd; Dubchak, Inna L.

    2005-07-05

    Recent advances in sequencing technologies promise better diagnostics for many diseases as well as better understanding of evolution of microbial populations. Single Nucleotide Polymorphisms(SNPs) are established genetic markers that aid in the identification of loci affecting quantitative traits and/or disease in a wide variety of eukaryotic species. With today's technological capabilities, it is possible to re-sequence a large set of appropriate candidate genes in individuals with a given disease and then screen for causative mutations.In addition, SNPs have been used extensively in efforts to study the evolution of microbial populations, and the recent application of random shotgun sequencing to environmental samples makes possible more extensive SNP analysis of co-occurring and co-evolving microbial populations. The program is available at http://genome.lbl.gov/vista/snpvista.

  16. SNP and haplotype mapping for genetic analysis in the rat.

    PubMed

    Saar, Kathrin; Beck, Alfred; Bihoreau, Marie-Thérèse; Birney, Ewan; Brocklebank, Denise; Chen, Yuan; Cuppen, Edwin; Demonchy, Stephanie; Dopazo, Joaquin; Flicek, Paul; Foglio, Mario; Fujiyama, Asao; Gut, Ivo G; Gauguier, Dominique; Guigo, Roderic; Guryev, Victor; Heinig, Matthias; Hummel, Oliver; Jahn, Niels; Klages, Sven; Kren, Vladimir; Kube, Michael; Kuhl, Heiner; Kuramoto, Takashi; Kuroki, Yoko; Lechner, Doris; Lee, Young-Ae; Lopez-Bigas, Nuria; Lathrop, G Mark; Mashimo, Tomoji; Medina, Ignacio; Mott, Richard; Patone, Giannino; Perrier-Cornet, Jeanne-Antide; Platzer, Matthias; Pravenec, Michal; Reinhardt, Richard; Sakaki, Yoshiyuki; Schilhabel, Markus; Schulz, Herbert; Serikawa, Tadao; Shikhagaie, Medya; Tatsumoto, Shouji; Taudien, Stefan; Toyoda, Atsushi; Voigt, Birger; Zelenika, Diana; Zimdahl, Heike; Hubner, Norbert

    2008-05-01

    The laboratory rat is one of the most extensively studied model organisms. Inbred laboratory rat strains originated from limited Rattus norvegicus founder populations, and the inherited genetic variation provides an excellent resource for the correlation of genotype to phenotype. Here, we report a survey of genetic variation based on almost 3 million newly identified SNPs. We obtained accurate and complete genotypes for a subset of 20,238 SNPs across 167 distinct inbred rat strains, two rat recombinant inbred panels and an F2 intercross. Using 81% of these SNPs, we constructed high-density genetic maps, creating a large dataset of fully characterized SNPs for disease gene mapping. Our data characterize the population structure and illustrate the degree of linkage disequilibrium. We provide a detailed SNP map and demonstrate its utility for mapping of quantitative trait loci. This community resource is openly available and augments the genetic tools for this workhorse of physiological studies. PMID:18443594

  17. TcSNP: a database of genetic variation in Trypanosoma cruzi

    PubMed Central

    Ackermann, Alejandro A.; Carmona, Santiago J.; Agüero, Fernán

    2009-01-01

    The TcSNP database (http://snps.tcruzi.org) integrates information on genetic variation (polymorphisms and mutations) for different stocks, strains and isolates of Trypanosoma cruzi, the causative agent of Chagas disease. The database incorporates sequences (genes from the T. cruzi reference genome, mRNAs, ESTs and genomic sequences); multiple sequence alignments obtained from these sequences; and single-nucleotide polymorphisms and small indels identified by scanning these multiple sequence alignments. Information in TcSNP can be readily interrogated to arrive at gene sets, or SNP sets of interest based on a number of attributes. Sequence similarity searches using BLAST are also supported. This first release of TcSNP contains nearly 170 000 high-confidence candidate SNPs, derived from the analysis of annotated coding sequences. As new sequence data become available, TcSNP will incorporate these data, mapping new candidate SNPs onto the reference genome sequences. PMID:18974180

  18. Genotyping NAT2 with only two SNPs (rs1041983 and rs1801280) outperforms the tagging SNP rs1495741 and is equivalent to the conventional 7-SNP NAT2 genotype.

    PubMed

    Selinski, Silvia; Blaszkewicz, Meinolf; Lehmann, Marie-Louise; Ovsiannikov, Daniel; Moormann, Oliver; Guballa, Christoph; Kress, Alexander; Truss, Michael C; Gerullis, Holger; Otto, Thomas; Barski, Dimitri; Niegisch, Günter; Albers, Peter; Frees, Sebastian; Brenner, Walburgis; Thüroff, Joachim W; Angeli-Greaves, Miriam; Seidel, Thilo; Roth, Gerhard; Dietrich, Holger; Ebbinghaus, Rainer; Prager, Hans M; Bolt, Hermann M; Falkenstein, Michael; Zimmermann, Anna; Klein, Torsten; Reckwitz, Thomas; Roemer, Hermann C; Löhlein, Dietrich; Weistenhöfer, Wobbeke; Schöps, Wolfgang; Hassan Rizvi, Syed Adibul; Aslam, Muhammad; Bánfi, Gergely; Romics, Imre; Steffens, Michael; Ekici, Arif B; Winterpacht, Andreas; Ickstadt, Katja; Schwender, Holger; Hengstler, Jan G; Golka, Klaus

    2011-10-01

    Genotyping N-acetyltransferase 2 (NAT2) is of high relevance for individualized dosing of antituberculosis drugs and bladder cancer epidemiology. In this study we compared a recently published tagging single nucleotide polymorphism (SNP) (rs1495741) to the conventional 7-SNP genotype (G191A, C282T, T341C, C481T, G590A, A803G and G857A haplotype pairs) and systematically analysed if novel SNP combinations outperform the latter. For this purpose, we studied 3177 individuals by PCR and phenotyped 344 individuals by the caffeine test. Although the tagSNP and the 7-SNP genotype showed a high degree of correlation (R=0.933, P<0.0001) the 7-SNP genotype nevertheless outperformed the tagging SNP with respect to specificity (1.0 vs. 0.9444, P=0.0065). Considering all possible SNP combinations in a receiver operating characteristic analysis we identified a 2-SNP genotype (C282T, T341C) that outperformed the tagging SNP and was equivalent to the 7-SNP genotype. The 2-SNP genotype predicted the correct phenotype with a sensitivity of 0.8643 and a specificity of 1.0. In addition, it predicted the 7-SNP genotype with sensitivity and specificity of 0.9993 and 0.9880, respectively. The prediction of the NAT2 genotype by the 2-SNP genotype performed similar in populations of Caucasian, Venezuelan and Pakistani background. A 2-SNP genotype predicts NAT2 phenotypes with similar sensitivity and specificity as the conventional 7-SNP genotype. This procedure represents a facilitation in individualized dosing of NAT2 substrates without losing sensitivity or specificity. PMID:21750470

  19. Design and Characterization of a 52K SNP Chip for Goats

    PubMed Central

    Tosser-Klopp, Gwenola; Bardou, Philippe; Bouchez, Olivier; Cabau, Cédric; Crooijmans, Richard; Dong, Yang; Donnadieu-Tonon, Cécile; Eggen, André; Heuven, Henri C. M.; Jamli, Saadiah; Jiken, Abdullah Johari; Klopp, Christophe; Lawley, Cynthia T.; McEwan, John; Martin, Patrice; Moreno, Carole R.; Mulsant, Philippe; Nabihoudine, Ibouniyamine; Pailhoux, Eric; Palhière, Isabelle; Rupp, Rachel; Sarry, Julien; Sayre, Brian L.; Tircazes, Aurélie; Jun Wang; Wang, Wen; Zhang, Wenguang

    2014-01-01

    The success of Genome Wide Association Studies in the discovery of sequence variation linked to complex traits in humans has increased interest in high throughput SNP genotyping assays in livestock species. Primary goals are QTL detection and genomic selection. The purpose here was design of a 50–60,000 SNP chip for goats. The success of a moderate density SNP assay depends on reliable bioinformatic SNP detection procedures, the technological success rate of the SNP design, even spacing of SNPs on the genome and selection of Minor Allele Frequencies (MAF) suitable to use in diverse breeds. Through the federation of three SNP discovery projects consolidated as the International Goat Genome Consortium, we have identified approximately twelve million high quality SNP variants in the goat genome stored in a database together with their biological and technical characteristics. These SNPs were identified within and between six breeds (meat, milk and mixed): Alpine, Boer, Creole, Katjang, Saanen and Savanna, comprising a total of 97 animals. Whole genome and Reduced Representation Library sequences were aligned on >10 kb scaffolds of the de novo goat genome assembly. The 60,000 selected SNPs, evenly spaced on the goat genome, were submitted for oligo manufacturing (Illumina, Inc) and published in dbSNP along with flanking sequences and map position on goat assemblies (i.e. scaffolds and pseudo-chromosomes), sheep genome V2 and cattle UMD3.1 assembly. Ten breeds were then used to validate the SNP content and 52,295 loci could be successfully genotyped and used to generate a final cluster file. The combined strategy of using mainly whole genome Next Generation Sequencing and mapping on a contig genome assembly, complemented with Illumina design tools proved to be efficient in producing this GoatSNP50 chip. Advances in use of molecular markers are expected to accelerate goat genomic studies in coming years. PMID:24465974

  20. A conductometric indium oxide semiconducting nanoparticle enzymatic biosensor array.

    PubMed

    Lee, Dongjin; Ondrake, Janet; Cui, Tianhong

    2011-01-01

    We report a conductometric nanoparticle biosensor array to address the significant variation of electrical property in nanomaterial biosensors due to the random network nature of nanoparticle thin-film. Indium oxide and silica nanoparticles (SNP) are assembled selectively on the multi-site channel area of the resistors using layer-by-layer self-assembly. To demonstrate enzymatic biosensing capability, glucose oxidase is immobilized on the SNP layer for glucose detection. The packaged sensor chip onto a ceramic pin grid array is tested using syringe pump driven feed and multi-channel I-V measurement system. It is successfully demonstrated that glucose is detected in many different sensing sites within a chip, leading to concentration dependent currents. The sensitivity has been found to be dependent on the channel length of the resistor, 4-12 nA/mM for channel lengths of 5-20 μm, while the apparent Michaelis-Menten constant is 20 mM. By using sensor array, analytical data could be obtained with a single step of sample solution feeding. This work sheds light on the applicability of the developed nanoparticle microsensor array to multi-analyte sensors, novel bioassay platforms, and sensing components in a lab-on-a-chip. PMID:22163696

  1. A Conductometric Indium Oxide Semiconducting Nanoparticle Enzymatic Biosensor Array

    PubMed Central

    Lee, Dongjin; Ondrake, Janet; Cui, Tianhong

    2011-01-01

    We report a conductometric nanoparticle biosensor array to address the significant variation of electrical property in nanomaterial biosensors due to the random network nature of nanoparticle thin-film. Indium oxide and silica nanoparticles (SNP) are assembled selectively on the multi-site channel area of the resistors using layer-by-layer self-assembly. To demonstrate enzymatic biosensing capability, glucose oxidase is immobilized on the SNP layer for glucose detection. The packaged sensor chip onto a ceramic pin grid array is tested using syringe pump driven feed and multi-channel I–V measurement system. It is successfully demonstrated that glucose is detected in many different sensing sites within a chip, leading to concentration dependent currents. The sensitivity has been found to be dependent on the channel length of the resistor, 4–12 nA/mM for channel lengths of 5–20 μm, while the apparent Michaelis-Menten constant is 20 mM. By using sensor array, analytical data could be obtained with a single step of sample solution feeding. This work sheds light on the applicability of the developed nanoparticle microsensor array to multi-analyte sensors, novel bioassay platforms, and sensing components in a lab-on-a-chip. PMID:22163696

  2. Identification, validation and survey of a single nucleotide polymorphism (SNP) associated with pungency in Capsicum spp.

    PubMed

    Garcés-Claver, Ana; Fellman, Shanna Moore; Gil-Ortega, Ramiro; Jahn, Molly; Arnedo-Andrés, María S

    2007-11-01

    A single nucleotide polymorphism (SNP) associated with pungency was detected within an expressed sequence tag (EST) of 307 bp. This fragment was identified after expression analysis of the EST clone SB2-66 in placenta tissue of Capsicum fruits. Sequence alignments corresponding to this new fragment allowed us to identify an SNP between pungent and non-pungent accessions. Two methods were chosen for the development of the SNP marker linked to pungency: tetra-primer amplification refractory mutation system-PCR (tetra-primer ARMS-PCR) and cleaved amplified polymorphic sequence. Results showed that both methods were successful in distinguishing genotypes. Nevertheless, tetra-primer ARMS-PCR was chosen for SNP genotyping because it was more rapid, reliable and less cost-effective. The utility of this SNP marker for pungency was demonstrated by the ability to distinguish between 29 pungent and non-pungent cultivars of Capsicum annuum. In addition, the SNP was also associated with phenotypic pungent character in the tested genotypes of C. chinense, C. baccatum, C. frutescens, C. galapagoense, C. eximium, C. tovarii and C. cardenasi. This SNP marker is a faster, cheaper and more reproducible method for identifying pungent peppers than other techniques such as panel tasting, and allows rapid screening of the trait in early growth stages. PMID:17882396

  3. A system for exact and approximate genetic linkage analysis of SNP data in large pedigrees

    PubMed Central

    Silberstein, Mark; Weissbrod, Omer; Otten, Lars; Tzemach, Anna; Anisenia, Andrei; Shtark, Oren; Tuberg, Dvir; Galfrin, Eddie; Gannon, Irena; Shalata, Adel; Borochowitz, Zvi U.; Dechter, Rina; Thompson, Elizabeth; Geiger, Dan

    2013-01-01

    Motivation: The use of dense single nucleotide polymorphism (SNP) data in genetic linkage analysis of large pedigrees is impeded by significant technical, methodological and computational challenges. Here we describe Superlink-Online SNP, a new powerful online system that streamlines the linkage analysis of SNP data. It features a fully integrated flexible processing workflow comprising both well-known and novel data analysis tools, including SNP clustering, erroneous data filtering, exact and approximate LOD calculations and maximum-likelihood haplotyping. The system draws its power from thousands of CPUs, performing data analysis tasks orders of magnitude faster than a single computer. By providing an intuitive interface to sophisticated state-of-the-art analysis tools coupled with high computing capacity, Superlink-Online SNP helps geneticists unleash the potential of SNP data for detecting disease genes. Results: Computations performed by Superlink-Online SNP are automatically parallelized using novel paradigms, and executed on unlimited number of private or public CPUs. One novel service is large-scale approximate Markov Chain–Monte Carlo (MCMC) analysis. The accuracy of the results is reliably estimated by running the same computation on multiple CPUs and evaluating the Gelman–Rubin Score to set aside unreliable results. Another service within the workflow is a novel parallelized exact algorithm for inferring maximum-likelihood haplotyping. The reported system enables genetic analyses that were previously infeasible. We demonstrate the system capabilities through a study of a large complex pedigree affected with metabolic syndrome. Availability: Superlink-Online SNP is freely available for researchers at http://cbl-hap.cs.technion.ac.il/superlink-snp. The system source code can also be downloaded from the system website. Contact: omerw@cs.technion.ac.il Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23162081

  4. Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel.

    PubMed

    Delaneau, Olivier; Marchini, Jonathan

    2014-01-01

    A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000 GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants. PMID:25653097

  5. Drug-SNPing: an integrated drug-based, protein interaction-based tagSNP-based pharmacogenomics platform for SNP genotyping.

    PubMed

    Yang, Cheng-Hong; Cheng, Yu-Huei; Chuang, Li-Yeh; Chang, Hsueh-Wei

    2013-03-15

    Many drug or single nucleotide polymorphism (SNP)-related resources and tools have been developed, but connecting and integrating them is still a challenge. Here, we describe a user-friendly web-based software package, named Drug-SNPing, which provides a platform for the integration of drug information (DrugBank and PharmGKB), protein-protein interactions (STRING), tagSNP selection (HapMap) and genotyping information (dbSNP, REBASE and SNP500Cancer). DrugBank-based inputs include the following: (i) common name of the drug, (ii) synonym or drug brand name, (iii) gene name (HUGO) and (iv) keywords. PharmGKB-based inputs include the following: (i) gene name (HUGO), (ii) drug name and (iii) disease-related keywords. The output provides drug-related information, metabolizing enzymes and drug targets, as well as protein-protein interaction data. Importantly, tagSNPs of the selected genes are retrieved for genotyping analyses. All drug-based and protein-protein interaction-based SNP genotyping information are provided with PCR-RFLP (PCR-restriction enzyme length polymorphism) and TaqMan probes. Thus, users can enter any drug keywords/brand names to obtain immediate information that is highly relevant to genotyping for pharmacogenomics research. PMID:23418190

  6. Characterization of polyploid wheat genomic diversity using a high-density 90 000 single nucleotide polymorphism array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    High-density single nucleotide polymorphism (SNP) genotyping chips are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships among individuals in populations and studying marker-trait associations in mapping experiments. We developed a genotyping array includ...

  7. A draft fur seal genome provides insights into factors affecting SNP validation and how to mitigate them.

    PubMed

    Humble, E; Martinez-Barrio, A; Forcada, J; Trathan, P N; Thorne, M A S; Hoffmann, M; Wolf, J B W; Hoffman, J I

    2016-07-01

    Custom genotyping arrays provide a flexible and accurate means of genotyping single nucleotide polymorphisms (SNPs) in a large number of individuals of essentially any organism. However, validation rates, defined as the proportion of putative SNPs that are verified to be polymorphic in a population, are often very low. A number of potential causes of assay failure have been identified, but none have been explored systematically. In particular, as SNPs are often developed from transcriptomes, parameters relating to the genomic context are rarely taken into account. Here, we assembled a draft Antarctic fur seal (Arctocephalus gazella) genome (assembly size: 2.41 Gb; scaffold/contig N50 : 3.1 Mb/27.5 kb). We then used this resource to map the probe sequences of 144 putative SNPs genotyped in 480 individuals. The number of probe-to-genome mappings and alignment length together explained almost a third of the variation in validation success, indicating that sequence uniqueness and proximity to intron-exon boundaries play an important role. The same pattern was found after mapping the probe sequences to the Walrus and Weddell seal genomes, suggesting that the genomes of species divergent by as much as 23 million years can hold information relevant to SNP validation outcomes. Additionally, reanalysis of genotyping data from seven previous studies found the same two variables to be significantly associated with SNP validation success across a variety of taxa. Finally, our study reveals considerable scope for validation rates to be improved, either by simply filtering for SNPs whose flanking sequences align uniquely and completely to a reference genome, or through predictive modelling. PMID:26683564

  8. SNP Markers as Additional Information to Resolve Complex Kinship Cases

    PubMed Central

    Pontes, M. Lurdes; Fondevila, Manuel; Laréu, Maria Victoria; Medeiros, Rui

    2015-01-01

    Summary Background DNA profiling with sets of highly polymorphic autosomal short tandem repeat (STR) markers has been applied in various aspects of human identification in forensic casework for nearly 20 years. However, in some cases of complex kinship investigation, the information provided by the conventionally used STR markers is not enough, often resulting in low likelihood ratio (LR) calculations. In these cases, it becomes necessary to increment the number of loci under analysis to reach adequate LRs. Recently, it has been proposed that single nucleotide polymorphisms (SNPs) could be used as a supportive tool to STR typing, eventually even replacing the methods/markers now employed. Methods In this work, we describe the results obtained in 7 revised complex paternity cases when applying a battery of STRs, as well as 52 human identification SNPs (SNPforID 52plex identification panel) using a SNaPshot methodology followed by capillary electrophoresis. Results Our results show that the analysis of SNPs, as complement to STR typing in forensic casework applications, would at least increase by a factor of 4 total PI values and correspondent Essen-Möller's W value. Conclusions We demonstrated that SNP genotyping could be a key complement to STR information in challenging casework of disputed paternity, such as close relative individualization or complex pedigrees subject to endogamous relations. PMID:26733770

  9. Imputation of KIR Types from SNP Variation Data.

    PubMed

    Vukcevic, Damjan; Traherne, James A; Næss, Sigrid; Ellinghaus, Eva; Kamatani, Yoichiro; Dilthey, Alexander; Lathrop, Mark; Karlsen, Tom H; Franke, Andre; Moffatt, Miriam; Cookson, William; Trowsdale, John; McVean, Gil; Sawcer, Stephen; Leslie, Stephen

    2015-10-01

    Large population studies of immune system genes are essential for characterizing their role in diseases, including autoimmune conditions. Of key interest are a group of genes encoding the killer cell immunoglobulin-like receptors (KIRs), which have known and hypothesized roles in autoimmune diseases, resistance to viruses, reproductive conditions, and cancer. These genes are highly polymorphic, which makes typing expensive and time consuming. Consequently, despite their importance, KIRs have been little studied in large cohorts. Statistical imputation methods developed for other complex loci (e.g., human leukocyte antigen [HLA]) on the basis of SNP data provide an inexpensive high-throughput alternative to direct laboratory typing of these loci and have enabled important findings and insights for many diseases. We present KIR∗IMP, a method for imputation of KIR copy number. We show that KIR∗IMP is highly accurate and thus allows the study of KIRs in large cohorts and enables detailed investigation of the role of KIRs in human disease. PMID:26430804

  10. Imputation of KIR Types from SNP Variation Data

    PubMed Central

    Vukcevic, Damjan; Traherne, James A.; Næss, Sigrid; Ellinghaus, Eva; Kamatani, Yoichiro; Dilthey, Alexander; Lathrop, Mark; Karlsen, Tom H.; Franke, Andre; Moffatt, Miriam; Cookson, William; Trowsdale, John; McVean, Gil; Sawcer, Stephen; Leslie, Stephen

    2015-01-01

    Large population studies of immune system genes are essential for characterizing their role in diseases, including autoimmune conditions. Of key interest are a group of genes encoding the killer cell immunoglobulin-like receptors (KIRs), which have known and hypothesized roles in autoimmune diseases, resistance to viruses, reproductive conditions, and cancer. These genes are highly polymorphic, which makes typing expensive and time consuming. Consequently, despite their importance, KIRs have been little studied in large cohorts. Statistical imputation methods developed for other complex loci (e.g., human leukocyte antigen [HLA]) on the basis of SNP data provide an inexpensive high-throughput alternative to direct laboratory typing of these loci and have enabled important findings and insights for many diseases. We present KIR∗IMP, a method for imputation of KIR copy number. We show that KIR∗IMP is highly accurate and thus allows the study of KIRs in large cohorts and enables detailed investigation of the role of KIRs in human disease. PMID:26430804

  11. Porcine colonization of the Americas: a 60k SNP story

    PubMed Central

    Burgos-Paz, W; Souza, C A; Megens, H J; Ramayo-Caldas, Y; Melo, M; Lemús-Flores, C; Caal, E; Soto, H W; Martínez, R; Álvarez, L A; Aguirre, L; Iñiguez, V; Revidatti, M A; Martínez-López, O R; Llambi, S; Esteve-Codina, A; Rodríguez, M C; Crooijmans, R P M A; Paiva, S R; Schook, L B; Groenen, M A M; Pérez-Enciso, M

    2013-01-01

    The pig, Sus scrofa, is a foreign species to the American continent. Although pigs originally introduced in the Americas should be related to those from the Iberian Peninsula and Canary islands, the phylogeny of current creole pigs that now populate the continent is likely to be very complex. Because of the extreme climates that America harbors, these populations also provide a unique example of a fast evolutionary phenomenon of adaptation. Here, we provide a genome wide study of these issues by genotyping, with a 60k SNP chip, 206 village pigs sampled across 14 countries and 183 pigs from outgroup breeds that are potential founders of the American populations, including wild boar, Iberian, international and Chinese breeds. Results show that American village pigs are primarily of European ancestry, although the observed genetic landscape is that of a complex conglomerate. There was no correlation between genetic and geographical distances, neither continent wide nor when analyzing specific areas. Most populations showed a clear admixed structure where the Iberian pig was not necessarily the main component, illustrating how international breeds, but also Chinese pigs, have contributed to extant genetic composition of American village pigs. We also observe that many genes related to the cardiovascular system show an increased differentiation between altiplano and genetically related pigs living near sea level. PMID:23250008

  12. Sturgeon conservation genomics: SNP discovery and validation using RAD sequencing.

    PubMed

    Ogden, R; Gharbi, K; Mugue, N; Martinsohn, J; Senn, H; Davey, J W; Pourkazemi, M; McEwing, R; Eland, C; Vidotto, M; Sergeev, A; Congiu, L

    2013-06-01

    Caviar-producing sturgeons belonging to the genus Acipenser are considered to be one of the most endangered species groups in the world. Continued overfishing in spite of increasing legislation, zero catch quotas and extensive aquaculture production have led to the collapse of wild stocks across Europe and Asia. The evolutionary relationships among Adriatic, Russian, Persian and Siberian sturgeons are complex because of past introgression events and remain poorly understood. Conservation management, traceability and enforcement suffer a lack of appropriate DNA markers for the genetic identification of sturgeon at the species, population and individual level. This study employed RAD sequencing to discover and characterize single nucleotide polymorphism (SNP) DNA markers for use in sturgeon conservation in these four tetraploid species over three biological levels, using a single sequencing lane. Four population meta-samples and eight individual samples from one family were barcoded separately before sequencing. Analysis of 14.4 Gb of paired-end RAD data focused on the identification of SNPs in the paired-end contig, with subsequent in silico and empirical validation of candidate markers. Thousands of putatively informative markers were identified including, for the first time, SNPs that show population-wide differentiation between Russian and Persian sturgeons, representing an important advance in our ability to manage these cryptic species. The results highlight the challenges of genotyping-by-sequencing in polyploid taxa, while establishing the potential genetic resources for developing a new range of caviar traceability and enforcement tools. PMID:23473098

  13. Genome-Wide Joint Meta-Analysis of SNP and SNP-by-Smoking Interaction Identifies Novel Loci for Pulmonary Function

    PubMed Central

    Imboden, Medea; Koch, Beate; McArdle, Wendy L.; Smith, Albert V.; Smolonska, Joanna; Sood, Akshay; Tang, Wenbo; Wilk, Jemma B.; Zhai, Guangju; Zhao, Jing Hua; Aschard, Hugues; Burkart, Kristin M.; Curjuric, Ivan; Eijgelsheim, Mark; Elliott, Paul; Gu, Xiangjun; Harris, Tamara B.; Janson, Christer; Homuth, Georg; Hysi, Pirro G.; Liu, Jason Z.; Loehr, Laura R.; Lohman, Kurt; Loos, Ruth J. F.; Manning, Alisa K.; Marciante, Kristin D.; Obeidat, Ma'en; Postma, Dirkje S.; Aldrich, Melinda C.; Brusselle, Guy G.; Chen, Ting-hsu; Eiriksdottir, Gudny; Franceschini, Nora; Heinrich, Joachim; Rotter, Jerome I.; Wijmenga, Cisca; Williams, O. Dale; Bentley, Amy R.; Hofman, Albert; Laurie, Cathy C.; Lumley, Thomas; Morrison, Alanna C.; Joubert, Bonnie R.; Rivadeneira, Fernando; Couper, David J.; Kritchevsky, Stephen B.; Liu, Yongmei; Wjst, Matthias; Wain, Louise V.; Vonk, Judith M.; Uitterlinden, André G.; Rochat, Thierry; Rich, Stephen S.; Psaty, Bruce M.; O'Connor, George T.; North, Kari E.; Mirel, Daniel B.; Meibohm, Bernd; Launer, Lenore J.; Khaw, Kay-Tee; Hartikainen, Anna-Liisa; Hammond, Christopher J.; Gläser, Sven; Marchini, Jonathan; Kraft, Peter; Wareham, Nicholas J.; Völzke, Henry; Stricker, Bruno H. C.; Spector, Timothy D.; Probst-Hensch, Nicole M.; Jarvis, Deborah; Jarvelin, Marjo-Riitta; Heckbert, Susan R.; Gudnason, Vilmundur; Boezen, H. Marike; Barr, R. Graham; Cassano, Patricia A.; Strachan, David P.; Fornage, Myriam; Hall, Ian P.; Dupuis, Josée; Tobin, Martin D.; London, Stephanie J.

    2012-01-01

    Genome-wide association studies have identified numerous genetic loci for spirometic measures of pulmonary function, forced expiratory volume in one second (FEV1), and its ratio to forced vital capacity (FEV1/FVC). Given that cigarette smoking adversely affects pulmonary function, we conducted genome-wide joint meta-analyses (JMA) of single nucleotide polymorphism (SNP) and SNP-by-smoking (ever-smoking or pack-years) associations on FEV1 and FEV1/FVC across 19 studies (total N = 50,047). We identified three novel loci not previously associated with pulmonary function. SNPs in or near DNER (smallest PJMA = 5.00×10−11), HLA-DQB1 and HLA-DQA2 (smallest PJMA = 4.35×10−9), and KCNJ2 and SOX9 (smallest PJMA = 1.28×10−8) were associated with FEV1/FVC or FEV1 in meta-analysis models including SNP main effects, smoking main effects, and SNP-by-smoking (ever-smoking or pack-years) interaction. The HLA region has been widely implicated for autoimmune and lung phenotypes, unlike the other novel loci, which have not been widely implicated. We evaluated DNER, KCNJ2, and SOX9 and found them to be expressed in human lung tissue. DNER and SOX9 further showed evidence of differential expression in human airway epithelium in smokers compared to non-smokers. Our findings demonstrated that joint testing of SNP and SNP-by-environment interaction identified novel loci associated with complex traits that are missed when considering only the genetic main effects. PMID:23284291

  14. Identification of SNP Haplotypes and Prospects of Association Mapping in Watermelon

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Watermelon is the fifth most economically important vegetable crop cultivated world-wide. Implementing Single Nucleotide Polymorphism (SNP) marker technology in watermelon breeding and germplasm evaluation programs holds a key to improve horticulturally important traits. Next-generation sequencing...

  15. SNP analysis of AMY2 and CTSL genes in Litopenaeus vannamei and Penaeus monodon shrimp.

    PubMed

    Glenn, K L; Grapes, L; Suwanasopee, T; Harris, D L; Li, Y; Wilson, K; Rothschild, M F

    2005-06-01

    Genetic studies in shrimp have focused on disease, with production traits such as growth left unexamined. Two shrimp species, Litopenaeus vannamei and Penaeus monodon, which represent the majority of US shrimp imports, were selected for single nucleotide polymorphism (SNP) discovery in alpha-amylase (AMY2) and cathepsin-l (CTSL), both candidate genes for growth. In L. vannamei, four SNPs were found in AMY2 and one SNP was found in CTSL. In P. monodon, one SNP was identified in CTSL. The CTSL gene was mapped to linkage group 28 of P. monodon using the female map developed with the Australian P. monodon mapping population. Association analyses for the AMY2 and CTSL genes with body weight (BW) were performed in two L. vannamei populations. While neither gene was found to be significantly associated with BW in these populations, there was a trend in one population towards higher BW for allele G of CTSL SNP C681G. PMID:15932404

  16. SNP discovery through de novo deep sequencing using the next generation of DNA sequencers

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The production of high volumes of DNA sequence data using new technologies has permitted more efficient identification of single nucleotide polymorphisms in vertebrate genomes. This chapter presented practical methodology for production and analysis of DNA sequence data for SNP discovery....

  17. Evaluation of approaches for identifying population informative markers from high density SNP Chips

    PubMed Central

    2011-01-01

    Background Genetic markers can be used to identify and verify the origin of individuals. Motivation for the inference of ancestry ranges from conservation genetics to forensic analysis. High density assays featuring Single Nucleotide Polymorphism (SNP) markers can be exploited to create a reduced panel containing the most informative markers for these purposes. The objectives of this study were to evaluate methods of marker selection and determine the minimum number of markers from the BovineSNP50 BeadChip required to verify the origin of individuals in European cattle breeds. Delta, Wright's FST, Weir & Cockerham's FST and PCA methods for population differentiation were compared. The level of informativeness of each SNP was estimated from the breed specific allele frequencies. Individual assignment analysis was performed using the ranked informative markers. Stringency levels were applied by log-likelihood ratio to assess the confidence of the assignment test. Results A 95% assignment success rate for the 384 individually genotyped animals was achieved with < 80, < 100, < 140 and < 200 SNP markers (with increasing stringency threshold levels) across all the examined methods for marker selection. No further gain in power of assignment was achieved by sampling in excess of 200 SNP markers. The marker selection method that required the lowest number of SNP markers to verify the animal's breed origin was Wright's FST (60 to 140 SNPs depending on the chosen degree of confidence). Certain breeds required fewer markers (< 100) to achieve 100% assignment success. In contrast, closely related breeds require more markers (~200) to achieve > 95% assignment success. The power of assignment success, and therefore the number of SNP markers required, is dependent on the levels of genetic heterogeneity and pool of samples considered. Conclusions While all SNP selection methods produced marker panels capable of breed identification, the power of assignment varied markedly among

  18. Rapid Identification of Ginseng Cultivars (Panax ginseng Meyer) Using Novel SNP-Based Probes

    PubMed Central

    Jo, Ick-Hyun; Bang, Kyong Hwan; Kim, Young-Chang; Lee, Jei-Wan; Seo, A-Yeon; Seong, Bong-Jae; Kim, Hyun-Ho; Kim, Dong-Hwi; Cha, Seon-Woo; Cho, Yong-Gu; Kim, Hong-Sig

    2011-01-01

    In order to develop a novel system for the discrimination of five ginseng cultivars (Panax ginseng Meyer), single nucleotide polymorphism (SNP) genotyping assays with real-time polymerase chain reaction were conducted. Nucleotide substitution in gDNA library clones of P. ginseng cv. Yunpoong was targeted for the SNP genotyping assay. From these SNP sites, a set of modified SNP specific fluorescence probes (PGP74, PGP110, and PGP130) and novel primer sets have been developed to distinguish among five ginseng cultivars. The combination of the SNP type of the five cultivars, Chungpoong, Yunpoong, Gopoong, Kumpoong, and Sunpoong, was identified as ‘ATA’, ‘GCC’, ‘GTA’, ‘GCA’, and ‘ACC’, respectively. This study represents the first report of the identification of ginseng cultivars by fluorescence probes. An SNP genotyping assay using fluorescence probes could prove useful for the identification of ginseng cultivars and ginseng seed management systems and guarantee the purity of ginseng seed. PMID:23717098

  19. Metallic Nanohole Arrays on Fluoropolymer Substrates as Small Label-Free Real-Time Bioprobes

    PubMed Central

    Yang, Jiun-Chan; Ji, Jin; Hogle, James M.; Larson, Dale N.

    2009-01-01

    We describe a nanoplasmonic probing platform that exploits small-dimension (≤ 20 μm2) ordered arrays of subwavelength holes for multiplexed, high spatial resolution, and real-time analysis on biorecognition events. Nanohole arrays are perforated on a super smooth gold surface (roughness RMS < 2.7 Å) attached on a fluoropolymer (FEP) substrate fabricated by a replica technique. The smooth surface of gold provides a superb environment for fabricating nanometer features and uniform immobilization of biomolecules. The refractive index matching between FEP and biological solutions contributes to ∼ 20% improvement on the sensing performance. Spectral studies on a series of small-dimension nanohole arrays from 1 μm2 to 20 μm2 indicate that the plasmonic sensing sensitivity improves as the gold-solution contact area increases. Our results also demonstrate that nanohole arrays with dimension as small as 1 μm2 can be used to effectively detect biomolecular binding events and analyze the binding kinetics. The future scientific opportunities opened by this nanohole platform include highly multiplexed analysis of ligand interactions with membrane proteins on high quality supported lipid bilayers. PMID:18710296

  20. Identification of Laying-Related SNP Markers in Geese Using RAD Sequencing

    PubMed Central

    Yu, ShiGang; Chu, WeiWei; Zhang, LiFan; Han, HouMing; Zhao, RongXue; Wu, Wei; Zhu, JiangNing; Dodson, Michael V.; Wei, Wei; Liu, HongLin; Chen, Jie

    2015-01-01

    Laying performance is an important economical trait of goose production. As laying performance is of low heritability, it is of significance to develop a marker-assisted selection (MAS) strategy for this trait. Definition of sequence variation related to the target trait is a prerequisite of quantitating MAS, but little is presently known about the goose genome, which greatly hinders the identification of genetic markers for the laying traits of geese. Recently developed restriction site-associated DNA (RAD) sequencing is a possible approach for discerning large-scale single nucleotide polymorphism (SNP) and reducing the complexity of a genome without having reference genomic information available. In the present study, we developed a pooled RAD sequencing strategy for detecting geese laying-related SNP. Two DNA pools were constructed, each consisting of equal amounts of genomic DNA from 10 individuals with either high estimated breeding value (HEBV) or low estimated breeding value (LEBV). A total of 139,013 SNP were obtained from 42,291,356 sequences, of which 18,771,943 were for LEBV and 23,519,413 were for HEBV cohorts. Fifty-five SNP which had different allelic frequencies in the two DNA pools were further validated by individual-based AS-PCR genotyping in the LEBV and HEBV cohorts. Ten out of 55 SNP exhibited distinct allele distributions in these two cohorts. These 10 SNP were further genotyped in a goose population of 492 geese to verify the association with egg numbers. The result showed that 8 of 10 SNP were associated with egg numbers. Additionally, liner regression analysis revealed that SNP Record-111407, 106975 and 112359 were involved in a multiplegene network affecting laying performance. We used IPCR to extend the unknown regions flanking the candidate RAD tags. The obtained sequences were subjected to BLAST to retrieve the orthologous genes in either ducks or chickens. Five novel genes were cloned for geese which harbored the candidate laying

  1. Identification of Laying-Related SNP Markers in Geese Using RAD Sequencing.

    PubMed

    Yu, ShiGang; Chu, WeiWei; Zhang, LiFan; Han, HouMing; Zhao, RongXue; Wu, Wei; Zhu, JiangNing; Dodson, Michael V; Wei, Wei; Liu, HongLin; Chen, Jie

    2015-01-01

    Laying performance is an important economical trait of goose production. As laying performance is of low heritability, it is of significance to develop a marker-assisted selection (MAS) strategy for this trait. Definition of sequence variation related to the target trait is a prerequisite of quantitating MAS, but little is presently known about the goose genome, which greatly hinders the identification of genetic markers for the laying traits of geese. Recently developed restriction site-associated DNA (RAD) sequencing is a possible approach for discerning large-scale single nucleotide polymorphism (SNP) and reducing the complexity of a genome without having reference genomic information available. In the present study, we developed a pooled RAD sequencing strategy for detecting geese laying-related SNP. Two DNA pools were constructed, each consisting of equal amounts of genomic DNA from 10 individuals with either high estimated breeding value (HEBV) or low estimated breeding value (LEBV). A total of 139,013 SNP were obtained from 42,291,356 sequences, of which 18,771,943 were for LEBV and 23,519,413 were for HEBV cohorts. Fifty-five SNP which had different allelic frequencies in the two DNA pools were further validated by individual-based AS-PCR genotyping in the LEBV and HEBV cohorts. Ten out of 55 SNP exhibited distinct allele distributions in these two cohorts. These 10 SNP were further genotyped in a goose population of 492 geese to verify the association with egg numbers. The result showed that 8 of 10 SNP were associated with egg numbers. Additionally, liner regression analysis revealed that SNP Record-111407, 106975 and 112359 were involved in a multiplegene network affecting laying performance. We used IPCR to extend the unknown regions flanking the candidate RAD tags. The obtained sequences were subjected to BLAST to retrieve the orthologous genes in either ducks or chickens. Five novel genes were cloned for geese which harbored the candidate laying

  2. Generation of SNP datasets for orangutan population genomics using improved reduced-representation sequencing and direct comparisons of SNP calling algorithms

    PubMed Central

    2014-01-01

    Background High-throughput sequencing has opened up exciting possibilities in population and conservation genetics by enabling the assessment of genetic variation at genome-wide scales. One approach to reduce genome complexity, i.e. investigating only parts of the genome, is reduced-representation library (RRL) sequencing. Like similar approaches, RRL sequencing reduces ascertainment bias due to simultaneous discovery and genotyping of single-nucleotide polymorphisms (SNPs) and does not require reference genomes. Yet, generating such datasets remains challenging due to laboratory and bioinformatical issues. In the laboratory, current protocols require improvements with regards to sequencing homologous fragments to reduce the number of missing genotypes. From the bioinformatical perspective, the reliance of most studies on a single SNP caller disregards the possibility that different algorithms may produce disparate SNP datasets. Results We present an improved RRL (iRRL) protocol that maximizes the generation of homologous DNA sequences, thus achieving improved genotyping-by-sequencing efficiency. Our modifications facilitate generation of single-sample libraries, enabling individual genotype assignments instead of pooled-sample analysis. We sequenced ~1% of the orangutan genome with 41-fold median coverage in 31 wild-born individuals from two populations. SNPs and genotypes were called using three different algorithms. We obtained substantially different SNP datasets depending on the SNP caller. Genotype validations revealed that the Unified Genotyper of the Genome Analysis Toolkit and SAMtools performed significantly better than a caller from CLC Genomics Workbench (CLC). Of all conflicting genotype calls, CLC was only correct in 17% of the cases. Furthermore, conflicting genotypes between two algorithms showed a systematic bias in that one caller almost exclusively assigned heterozygotes, while the other one almost exclusively assigned homozygotes. Conclusions

  3. AMY-tree: an algorithm to use whole genome SNP calling for Y chromosomal phylogenetic applications

    PubMed Central

    2013-01-01

    Background Due to the rapid progress of next-generation sequencing (NGS) facilities, an explosion of human whole genome data will become available in the coming years. These data can be used to optimize and to increase the resolution of the phylogenetic Y chromosomal tree. Moreover, the exponential growth of known Y chromosomal lineages will require an automatic determination of the phylogenetic position of an individual based on whole genome SNP calling data and an up to date Y chromosomal tree. Results We present an automated approach, ‘AMY-tree’, which is able to determine the phylogenetic position of a Y chromosome using a whole genome SNP profile, independently from the NGS platform and SNP calling program, whereby mistakes in the SNP calling or phylogenetic Y chromosomal tree are taken into account. Moreover, AMY-tree indicates ambiguities within the present phylogenetic tree and points out new Y-SNPs which may be phylogenetically relevant. The AMY-tree software package was validated successfully on 118 whole genome SNP profiles of 109 males with different origins. Moreover, support was found for an unknown recurrent mutation, wrong reported mutation conversions and a large amount of new interesting Y-SNPs. Conclusions Therefore, AMY-tree is a useful tool to determine the Y lineage of a sample based on SNP calling, to identify Y-SNPs with yet unknown phylogenetic position and to optimize the Y chromosomal phylogenetic tree in the future. AMY-tree will not add lineages to the existing phylogenetic tree of the Y-chromosome but it is the first step to analyse whole genome SNP profiles in a phylogenetic framework. PMID:23405914

  4. The EAS-1000 array

    SciTech Connect

    Khristiansen, G.B.; Fomin, IU.A.; Chasnikov, I.IA.; Ivanenko, V.M.; Efimov, N.N. )

    1989-01-01

    The requirements for a newly constructed EAS array are summarized, and the EAS-1000 array now under construction is described. The array is depicted, and its accuracy in finding EAS parameters is shown. The expected statistics in observing EAS of different energies are presented for the most important scientific problems the array is supposed to solve.

  5. High-throughput SNP genotyping in Cucurbita pepo for map construction and quantitative trait loci mapping

    PubMed Central

    2012-01-01

    Background Cucurbita pepo is a member of the Cucurbitaceae family, the second- most important horticultural family in terms of economic importance after Solanaceae. The "summer squash" types, including Zucchini and Scallop, rank among the highest-valued vegetables worldwide. There are few genomic tools available for this species. The first Cucurbita transcriptome, along with a large collection of Single Nucleotide Polymorphisms (SNP), was recently generated using massive sequencing. A set of 384 SNP was selected to generate an Illumina GoldenGate assay in order to construct the first SNP-based genetic map of Cucurbita and map quantitative trait loci (QTL). Results We herein present the construction of the first SNP-based genetic map of Cucurbita pepo using a population derived from the cross of two varieties with contrasting phenotypes, representing the main cultivar groups of the species' two subspecies: Zucchini (subsp. pepo) × Scallop (subsp. ovifera). The mapping population was genotyped with 384 SNP, a set of selected EST-SNP identified in silico after massive sequencing of the transcriptomes of both parents, using the Illumina GoldenGate platform. The global success rate of the assay was higher than 85%. In total, 304 SNP were mapped, along with 11 SSR from a previous map, giving a map density of 5.56 cM/marker. This map was used to infer syntenic relationships between C. pepo and cucumber and to successfully map QTL that control plant, flowering and fruit traits that are of benefit to squash breeding. The QTL effects were validated in backcross populations. Conclusion Our results show that massive sequencing in different genotypes is an excellent tool for SNP discovery, and that the Illumina GoldenGate platform can be successfully applied to constructing genetic maps and performing QTL analysis in Cucurbita. This is the first SNP-based genetic map in the Cucurbita genus and is an invaluable new tool for biological research, especially considering that most

  6. Solar array drive system

    NASA Technical Reports Server (NTRS)

    Berkopec, F. D.; Sturman, J. C.; Stanhouse, R. W.

    1976-01-01

    A solar array drive system consisting of a solar array drive mechanism and the corresponding solar array drive electronics is being developed. The principal feature of the solar array drive mechanism is its bidirectional capability which enables its use in mechanical redundancy. The solar array drive system is of a widely applicable design. This configuration will be tested to determine its acceptability for generic mission sets. Foremost of the testing to be performed is the testing for extended duration.

  7. Integrated infrared array technology

    NASA Technical Reports Server (NTRS)

    Goebel, J. H.; Mccreight, C. R.

    1986-01-01

    An overview of integrated infrared (IR) array technology is presented. Although the array pixel formats are smaller, and the readout noise of IR arrays is larger, than the corresponding values achieved with optical charge-coupled-device silicon technology, substantial progress is being made in IR technology. Both existing IR arrays and those being developed are described. Examples of astronomical images are given which illustrate the potential of integrated IR arrays for scientific investigations.

  8. A novel algorithm for simultaneous SNP selection in high-dimensional genome-wide association studies

    PubMed Central

    2012-01-01

    Background Identification of causal SNPs in most genome wide association studies relies on approaches that consider each SNP individually. However, there is a strong correlation structure among SNPs that needs to be taken into account. Hence, increasingly modern computationally expensive regression methods are employed for SNP selection that consider all markers simultaneously and thus incorporate dependencies among SNPs. Results We develop a novel multivariate algorithm for large scale SNP selection using CAR score regression, a promising new approach for prioritizing biomarkers. Specifically, we propose a computationally efficient procedure for shrinkage estimation of CAR scores from high-dimensional data. Subsequently, we conduct a comprehensive comparison study including five advanced regression approaches (boosting, lasso, NEG, MCP, and CAR score) and a univariate approach (marginal correlation) to determine the effectiveness in finding true causal SNPs. Conclusions Simultaneous SNP selection is a challenging task. We demonstrate that our CAR score-based algorithm consistently outperforms all competing approaches, both uni- and multivariate, in terms of correctly recovered causal SNPs and SNP ranking. An R package implementing the approach as well as R code to reproduce the complete study presented here is available from http://strimmerlab.org/software/care/. PMID:23113980

  9. Highly specific SNP detection using 2D graphene electronics and DNA strand displacement

    PubMed Central

    Hwang, Michael T.; Landon, Preston B.; Lee, Joon; Choi, Duyoung; Mo, Alexander H.; Glinsky, Gennadi; Lal, Ratnesh

    2016-01-01

    Single-nucleotide polymorphisms (SNPs) in a gene sequence are markers for a variety of human diseases. Detection of SNPs with high specificity and sensitivity is essential for effective practical implementation of personalized medicine. Current DNA sequencing, including SNP detection, primarily uses enzyme-based methods or fluorophore-labeled assays that are time-consuming, need laboratory-scale settings, and are expensive. Previously reported electrical charge-based SNP detectors have insufficient specificity and accuracy, limiting their effectiveness. Here, we demonstrate the use of a DNA strand displacement-based probe on a graphene field effect transistor (FET) for high-specificity, single-nucleotide mismatch detection. The single mismatch was detected by measuring strand displacement-induced resistance (and hence current) change and Dirac point shift in a graphene FET. SNP detection in large double-helix DNA strands (e.g., 47 nt) minimize false-positive results. Our electrical sensor-based SNP detection technology, without labeling and without apparent cross-hybridization artifacts, would allow fast, sensitive, and portable SNP detection with single-nucleotide resolution. The technology will have a wide range of applications in digital and implantable biosensors and high-throughput DNA genotyping, with transformative implications for personalized medicine. PMID:27298347

  10. Mining and Analysis of SNP in Response to Salinity Stress in Upland Cotton (Gossypium hirsutum L.)

    PubMed Central

    Wang, Xiaoge; Lu, Xuke; Wang, Junjuan; Wang, Delong; Yin, Zujun; Fan, Weili; Wang, Shuai; Ye, Wuwei

    2016-01-01

    Salinity stress is a major abiotic factor that affects crop output, and as a pioneer crop in saline and alkaline land, salt tolerance study of cotton is particularly important. In our experiment, four salt-tolerance varieties with different salt tolerance indexes including CRI35 (65.04%), Kanghuanwei164 (56.19%), Zhong9807 (55.20%) and CRI44 (50.50%), as well as four salt-sensitive cotton varieties including Hengmian3 (48.21%), GK50 (40.20%), Xinyan96-48 (34.90%), ZhongS9612 (24.80%) were used as the materials. These materials were divided into salt-tolerant group (ST) and salt-sensitive group (SS). Illumina Cotton SNP 70K Chip was used to detect SNP in different cotton varieties. SNPv (SNP variation of the same seedling pre- and after- salt stress) in different varieties were screened; polymorphic SNP and SNPr (SNP related to salt tolerance) were obtained. Annotation and analysis of these SNPs showed that (1) the induction efficiency of salinity stress on SNPv of cotton materials with different salt tolerance index was different, in which the induction efficiency on salt-sensitive materials was significantly higher than that on salt-tolerant materials. The induction of salt stress on SNPv was obviously biased. (2) SNPv induced by salt stress may be related to the methylation changes under salt stress. (3) SNPr may influence salt tolerance of plants by affecting the expression of salt-tolerance related genes. PMID:27355327

  11. A SNP discovery method to assess variant allele probability from next-generation resequencing data

    PubMed Central

    Shen, Yufeng; Wan, Zhengzheng; Coarfa, Cristian; Drabek, Rafal; Chen, Lei; Ostrowski, Elizabeth A.; Liu, Yue; Weinstock, George M.; Wheeler, David A.; Gibbs, Richard A.; Yu, Fuli

    2010-01-01

    Accurate identification of genetic variants from next-generation sequencing (NGS) data is essential for immediate large-scale genomic endeavors such as the 1000 Genomes Project, and is crucial for further genetic analysis based on the discoveries. The key challenge in single nucleotide polymorphism (SNP) discovery is to distinguish true individual variants (occurring at a low frequency) from sequencing errors (often occurring at frequencies orders of magnitude higher). Therefore, knowledge of the error probabilities of base calls is essential. We have developed Atlas-SNP2, a computational tool that detects and accounts for systematic sequencing errors caused by context-related variables in a logistic regression model learned from training data sets. Subsequently, it estimates the posterior error probability for each substitution through a Bayesian formula that integrates prior knowledge of the overall sequencing error probability and the estimated SNP rate with the results from the logistic regression model for the given substitutions. The estimated posterior SNP probability can be used to distinguish true SNPs from sequencing errors. Validation results show that Atlas-SNP2 achieves a false-positive rate of lower than 10%, with an ∼5% or lower false-negative rate. PMID:20019143

  12. Different SNP combinations in the GCH1 gene and use of labor analgesia

    PubMed Central

    2010-01-01

    Background The aim of this study was to investigate if there is an association between different SNP combinations in the guanosine triphosphate cyclohydrolase (GCH1) gene and a number of pain behavior related outcomes during labor. A population-based sample of pregnant women (n = 814) was recruited at gestational week 18. A plasma sample was collected from each subject. Genotyping was performed and three single nucleotide polymorphisms (SNP) previously defined as a pain-protective SNP combination of GCH1 were used. Results Homozygous carriers of the pain-protective SNP combination of GCH1 arrived to the delivery ward with a more advanced stage of cervical dilation compared to heterozygous carriers and non-carriers. However, homozygous carriers more often used second line labor analgesia compared to the others. Conclusion The pain-protective SNP combination of GCH1 may be of importance in the limited number of homozygous carriers during the initial dilation of cervix but upon arrival at the delivery unit these women are more inclined to use second line labor analgesia. PMID:20633294

  13. Supervised learning-based tagSNP selection for genome-wide disease classifications

    PubMed Central

    Liu, Qingzhong; Yang, Jack; Chen, Zhongxue; Yang, Mary Qu; Sung, Andrew H; Huang, Xudong

    2008-01-01

    Background Comprehensive evaluation of common genetic variations through association of single nucleotide polymorphisms (SNPs) with complex human diseases on the genome-wide scale is an active area in human genome research. One of the fundamental questions in a SNP-disease association study is to find an optimal subset of SNPs with predicting power for disease status. To find that subset while reducing study burden in terms of time and costs, one can potentially reconcile information redundancy from associations between SNP markers. Results We have developed a feature selection method named Supervised Recursive Feature Addition (SRFA). This method combines supervised learning and statistical measures for the chosen candidate features/SNPs to reconcile the redundancy information and, in doing so, improve the classification performance in association studies. Additionally, we have proposed a Support Vector based Recursive Feature Addition (SVRFA) scheme in SNP-disease association analysis. Conclusions We have proposed using SRFA with different statistical learning classifiers and SVRFA for both SNP selection and disease classification and then applying them to two complex disease data sets. In general, our approaches outperform the well-known feature selection method of Support Vector Machine Recursive Feature Elimination and logic regression-based SNP selection for disease classification in genetic association studies. Our study further indicates that both genetic and environmental variables should be taken into account when doing disease predictions and classifications for the most complex human diseases that have gene-environment interactions. PMID:18366619

  14. Highly specific SNP detection using 2D graphene electronics and DNA strand displacement.

    PubMed

    Hwang, Michael T; Landon, Preston B; Lee, Joon; Choi, Duyoung; Mo, Alexander H; Glinsky, Gennadi; Lal, Ratnesh

    2016-06-28

    Single-nucleotide polymorphisms (SNPs) in a gene sequence are markers for a variety of human diseases. Detection of SNPs with high specificity and sensitivity is essential for effective practical implementation of personalized medicine. Current DNA sequencing, including SNP detection, primarily uses enzyme-based methods or fluorophore-labeled assays that are time-consuming, need laboratory-scale settings, and are expensive. Previously reported electrical charge-based SNP detectors have insufficient specificity and accuracy, limiting their effectiveness. Here, we demonstrate the use of a DNA strand displacement-based probe on a graphene field effect transistor (FET) for high-specificity, single-nucleotide mismatch detection. The single mismatch was detected by measuring strand displacement-induced resistance (and hence current) change and Dirac point shift in a graphene FET. SNP detection in large double-helix DNA strands (e.g., 47 nt) minimize false-positive results. Our electrical sensor-based SNP detection technology, without labeling and without apparent cross-hybridization artifacts, would allow fast, sensitive, and portable SNP detection with single-nucleotide resolution. The technology will have a wide range of applications in digital and implantable biosensors and high-throughput DNA genotyping, with transformative implications for personalized medicine. PMID:27298347

  15. CsSNP: A Web-Based Tool for the Detecting of Comparative Segments SNPs.

    PubMed

    Wang, Yi; Wang, Shuangshuang; Zhou, Dongjie; Yang, Shuai; Xu, Yongchao; Yang, Chao; Yang, Long

    2016-07-01

    SNP (single nucleotide polymorphism) is a popular tool for the study of genetic diversity, evolution, and other areas. Therefore, it is necessary to develop a convenient, utility, robust, rapid, and open source detecting-SNP tool for all researchers. Since the detection of SNPs needs special software and series steps including alignment, detection, analysis and present, the study of SNPs is limited for nonprofessional users. CsSNP (Comparative segments SNP, http://biodb.sdau.edu.cn/cssnp/ ) is a freely available web tool based on the Blat, Blast, and Perl programs to detect comparative segments SNPs and to show the detail information of SNPs. The results are filtered and presented in the statistics figure and a Gbrowse map. This platform contains the reference genomic sequences and coding sequences of 60 plant species, and also provides new opportunities for the users to detect SNPs easily. CsSNP is provided a convenient tool for nonprofessional users to find comparative segments SNPs in their own sequences, and give the users the information and the analysis of SNPs, and display these data in a dynamic map. It provides a new method to detect SNPs and may accelerate related studies. PMID:27347883

  16. A novel three-round multiplex PCR for SNP genotyping with next generation sequencing.

    PubMed

    Chen, Ke; Zhou, Yu-Xun; Li, Kai; Qi, Li-Xin; Zhang, Qi-Fei; Wang, Mao-Chun; Xiao, Jun-Hua

    2016-06-01

    Owing to the high throughput and low cost, next generation sequencing has attracted much attention for SNP genotyping application for researchers. Here, we introduce a new method based on three-round multiplex PCR to precisely genotype SNPs with next generation sequencing. This method can as much as possible consume the equivalent amount of each pair of specific primers to largely eliminate the amplification discrepancy between different loci. After the PCR amplification, the products can be directly subjected to next generation sequencing platform. We simultaneously amplified 37 SNP loci of 757 samples and sequenced all amplicons on ion torrent PGM platform; 90.5 % of the target SNP loci were accurately genotyped (at least 15×) and 90.4 % amplicons had uniform coverage with a variation less than 50-fold. Ligase detection reaction (LDR) was performed to genotype the 19 SNP loci (as part of the 37 SNP loci) with 91 samples randomly selected from the 757 samples, and 99.5 % genotyping data were consistent with the next generation sequencing results. Our results demonstrate that three-round PCR coupled with next generation sequencing is an efficient and economical genotyping approach. Graphical Abstract The schematic diagram of three-round PCR. PMID:27113460

  17. Breast cancer-associated high-order SNP-SNP interaction of CXCL12/CXCR4-related genes by an improved multifactor dimensionality reduction (MDR-ER).

    PubMed

    Fu, Ou-Yang; Chang, Hsueh-Wei; Lin, Yu-Da; Chuang, Li-Yeh; Hou, Ming-Feng; Yang, Cheng-Hong

    2016-09-01

    In association studies, the combined effects of single nucleotide polymorphism (SNP)-SNP interactions and the problem of imbalanced data between cases and controls are frequently ignored. In the present study, we used an improved multifactor dimensionality reduction (MDR) approach namely MDR-ER to detect the high order SNP‑SNP interaction in an imbalanced breast cancer data set containing seven SNPs of chemokine CXCL12/CXCR4 pathway genes. Most individual SNPs were not significantly associated with breast cancer. After MDR‑ER analysis, six significant SNP‑SNP interaction models with seven genes (highest cross‑validation consistency, 10; classification error rates, 41.3‑21.0; and prediction error rates, 47.4‑55.3) were identified. CD4 and VEGFA genes were associated in a 2‑loci interaction model (classification error rate, 41.3; prediction error rate, 47.5; odds ratio (OR), 2.069; 95% bootstrap CI, 1.40‑2.90; P=1.71E‑04) and it also appeared in all the best 2‑7‑loci models. When the loci number increased, the classification error rates and P‑values decreased. The powers in 2‑7‑loci in all models were >0.9. The minimum classification error rate of the MDR‑ER‑generated model was shown with the 7‑loci interaction model (classification error rate, 21.0; OR=15.282; 95% bootstrap CI, 9.54‑23.87; P=4.03E‑31). In the epistasis network analysis, the overall effect with breast cancer susceptibility was identified and the SNP order of impact on breast cancer was identified as follows: CD4 = VEGFA > KITLG > CXCL12 > CCR7 = MMP2 > CXCR4. In conclusion, the MDR‑ER can effectively and correctly identify the best SNP‑SNP interaction models in an imbalanced data set for breast cancer cases. PMID:27461876

  18. SNP Discovery by Illumina-Based Transcriptome Sequencing of the Olive and the Genetic Characterization of Turkish Olive Genotypes Revealed by AFLP, SSR and SNP Markers

    PubMed Central

    Kaya, Hilal Betul; Cetin, Oznur; Kaya, Hulya; Sahin, Mustafa; Sefer, Filiz; Kahraman, Abdullah; Tanyolac, Bahattin

    2013-01-01

    Background The olive tree (Olea europaea L.) is a diploid (2n = 2x = 46) outcrossing species mainly grown in the Mediterranean area, where it is the most important oil-producing crop. Because of its economic, cultural and ecological importance, various DNA markers have been used in the olive to characterize and elucidate homonyms, synonyms and unknown accessions. However, a comprehensive characterization and a full sequence of its transcriptome are unavailable, leading to the importance of an efficient large-scale single nucleotide polymorphism (SNP) discovery in olive. The objectives of this study were (1) to discover olive SNPs using next-generation sequencing and to identify SNP primers for cultivar identification and (2) to characterize 96 olive genotypes originating from different regions of Turkey. Methodology/Principal Findings Next-generation sequencing technology was used with five distinct olive genotypes and generated cDNA, producing 126,542,413 reads using an Illumina Genome Analyzer IIx. Following quality and size trimming, the high-quality reads were assembled into 22,052 contigs with an average length of 1,321 bases and 45 singletons. The SNPs were filtered and 2,987 high-quality putative SNP primers were identified. The assembled sequences and singletons were subjected to BLAST similarity searches and annotated with a Gene Ontology identifier. To identify the 96 olive genotypes, these SNP primers were applied to the genotypes in combination with amplified fragment length polymorphism (AFLP) and simple sequence repeats (SSR) markers. Conclusions/Significance This study marks the highest number of SNP markers discovered to date from olive genotypes using transcriptome sequencing. The developed SNP markers will provide a useful source for molecular genetic studies, such as genetic diversity and characterization, high density quantitative trait locus (QTL) analysis, association mapping and map-based gene cloning in the olive. High levels of

  19. A high-density SNP genome-wide linkage search of 206 families identifies susceptibility loci for chronic lymphocytic leukemia.

    PubMed

    Sellick, Gabrielle S; Goldin, Lynn R; Wild, Ruth W; Slager, Susan L; Ressenti, Laura; Strom, Sara S; Dyer, Martin J S; Mauro, Francesca R; Marti, Gerald E; Fuller, Stephen; Lyttelton, Matthew; Kipps, Thomas J; Keating, Michael J; Call, Timothy G; Catovsky, Daniel; Caporaso, Neil; Houlston, Richard S

    2007-11-01

    Chronic lymphocytic leukemia (CLL) and other B-cell lymphoproliferative disorders display familial aggregation. To identify a susceptibility gene for CLL, we assembled families from the major European (ICLLC) and American (GEC) consortia to conduct a genome-wide linkage analysis of 101 new CLL pedigrees using a high-density single nucleotide polymorphism (SNP) array and combined the results with data from our previously reported analysis of 105 families. Here, we report on the combined analysis of the 206 families. Multipoint linkage analyses were undertaken using both nonparametric (model-free) and parametric (model-based) methods. After the removal of high linkage disequilibrium SNPs, we obtained a maximum nonparametric linkage (NPL) score of 3.02 (P = .001) on chromosome 2q21.2. The same genomic position also yielded the highest multipoint heterogeneity LOD (HLOD) score under a common recessive model of disease susceptibility (HLOD = 3.11; P = 7.7 x 10(-5)), which was significant at the genome-wide level. In addition, 2 other chromosomal positions, 6p22.1 (corresponding to the major histocompatibility locus) and 18q21.1, displayed HLOD scores higher than 2.1 (P < .002). None of the regions coincided with areas of common chromosomal abnormalities frequently observed in CLL. These findings provide direct evidence for Mendelian predisposition to CLL and evidence for the location of disease loci. PMID:17687107

  20. Transcriptome sequencing for SNP discovery across Cucumis melo

    PubMed Central

    2012-01-01

    from India and Africa as compared to commercial cultivars, cultigens and landraces from Eastern Europe, Western Asia and the Mediterranean basin is consistent with the evolutionary history proposed for the species. Group-specific SNVs that will be useful in introgression programs were also detected. In a sample of 143 selected putative SNPs, we verified 93% of the polymorphisms in a panel of 78 genotypes. Conclusions This study provides the first comprehensive resequencing data for wild, exotic, and cultivated (landraces and commercial) melon transcriptomes, yielding the largest melon SNP collection available to date and representing a notable sample of the species diversity. This data provides a valuable resource for creating a catalog of allelic variants of melon genes and it will aid in future in-depth studies of population genetics, marker-assisted breeding, and gene identification aimed at developing improved varieties. PMID:22726804

  1. SNP Formation Bias in the Murine Genome Provides Evidence for Parallel Evolution.

    PubMed

    Plyler, Zackery E; Hill, Aubrey E; McAtee, Christopher W; Cui, Xiangqin; Moseley, Leah A; Sorscher, Eric J

    2015-09-01

    In this study, we show novel DNA motifs that promote single nucleotide polymorphism (SNP) formation and are conserved among exons, introns, and intergenic DNA from mice (Sanger Mouse Genomes Project), human genes (1000 Genomes), and tumor-specific somatic mutations (data from TCGA). We further characterize SNPs likely to be very recent in origin (i.e., formed in otherwise congenic mice) and show enrichment for both synonymous and parallel DNA variants occurring under circumstances not attributable to purifying selection. The findings provide insight regarding SNP contextual bias and eukaryotic codon usage as strategies that favor long-term exonic stability. The study also furnishes new information concerning rates of murine genomic evolution and features of DNA mutagenesis (at the time of SNP formation) that should be viewed as "adaptive." PMID:26253317

  2. Cross-Species Application of SNP Chips is Not Suitable for Identifying Runs of Homozygosity.

    PubMed

    Shafer, Aaron B A; Miller, Joshua M; Kardos, Marty

    2016-03-01

    Cross-species application of single-nucleotide polymorphism (SNP) chips is a valid, relatively cost-effective alternative to the high-throughput sequencing methods generally required to obtain a genome-wide sampling of polymorphisms. Kharzinova et al. (2015) examined the applicability of SNP chips developed in domestic bovids (cattle and sheep) to a semi-wild cervid (reindeer). The ancestors of bovids and cervids diverged between 20 and 30 million years ago (Hassanin and Douzery 2003; Bibi et al. 2013). Empirical work has shown that for a SNP chip developed in a bovid and applied to a cervid species, approximately 50% genotype success with 1% of the loci being polymorphic is expected (Miller et al. 2012). The genotyping of Kharzinova et al. (2015) follows this pattern; however, these data are not appropriate for identifying runs of homozygosity (ROH) and can be problematic for estimating linkage disequilibrium (LD) and we caution readers in this regard. PMID:26774056

  3. SNP-Seek database of SNPs derived from 3000 rice genomes.

    PubMed

    Alexandrov, Nickolai; Tai, Shuaishuai; Wang, Wensheng; Mansueto, Locedie; Palis, Kevin; Fuentes, Roven Rommel; Ulat, Victor Jun; Chebotarov, Dmytro; Zhang, Gengyun; Li, Zhikang; Mauleon, Ramil; Hamilton, Ruaraidh Sackville; McNally, Kenneth L

    2015-01-01

    We have identified about 20 million rice SNPs by aligning reads from the 3000 rice genomes project with the Nipponbare genome. The SNPs and allele information are organized into a SNP-Seek system (http://www.oryzasnp.org/iric-portal/), which consists of Oracle database having a total number of rows with SNP genotypes close to 60 billion (20 M SNPs × 3 K rice lines) and web interface for convenient querying. The database allows quick retrieving of SNP alleles for all varieties in a given genome region, finding different alleles from predefined varieties and querying basic passport and morphological phenotypic information about sequenced rice lines. SNPs can be visualized together with the gene structures in JBrowse genome browser. Evolutionary relationships between rice varieties can be explored using phylogenetic trees or multidimensional scaling plots. PMID:25429973

  4. Developing single nucleotide polymorphism (SNP) markers from transcriptome sequences for identification of longan (Dimocarpus longan) germplasm

    PubMed Central

    Wang, Boyi; Tan, Hua-Wei; Fang, Wanping; Meinhardt, Lyndel W; Mischke, Sue; Matsumoto, Tracie; Zhang, Dapeng

    2015-01-01

    Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in 50 longan germplasm accessions, including cultivated varieties and wild germplasm; and designated 25 SNP markers that unambiguously identified all tested longan varieties with high statistical rigor (P<0.0001). Multiple trees from the same clone were verified and off-type trees were identified. Diversity analysis revealed genetic relationships among analyzed accessions. Cultivated varieties differed significantly from wild populations (Fst=0.300; P<0.001), demonstrating untapped genetic diversity for germplasm conservation and utilization. Within cultivated varieties, apparent differences between varieties from China and those from Thailand and Hawaii indicated geographic patterns of genetic differentiation. These SNP markers provide a powerful tool to manage longan genetic resources and breeding, with accurate and efficient genotype identification. PMID:26504559

  5. Large Scale Association Analysis for Drug Addiction: Results from SNP to Gene

    PubMed Central

    Guo, Xiaobo; Liu, Zhifa; Wang, Xueqin; Zhang, Heping

    2012-01-01

    Many genetic association studies used single nucleotide polymorphisms (SNPs) data to identify genetic variants for complex diseases. Although SNP-based associations are most common in genome-wide association studies (GWAS), gene-based association analysis has received increasing attention in understanding genetic etiologies for complex diseases. While both methods have been used to analyze the same data, few genome-wide association studies compare the results or observe the connection between them. We performed a comprehensive analysis of the data from the Study of Addiction: Genetics and Environment (SAGE) and compared the results from the SNP-based and gene-based analyses. Our results suggest that the gene-based method complements the individual SNP-based analysis, and conceptually they are closely related. In terms of gene findings, our results validate many genes that were either reported from the analysis of the same dataset or based on animal studies for substance dependence. PMID:23365539

  6. High-density fiber optic biosensor arrays

    NASA Astrophysics Data System (ADS)

    Epstein, Jason R.; Walt, David R.

    2002-02-01

    Novel approaches are required to coordinate the immense amounts of information derived from diverse genomes. This concept has influenced the expanded role of high-throughput DNA detection and analysis in the biological sciences. A high-density fiber optic DNA biosensor was developed consisting of oligonucleotide-functionalized, 3.1 mm diameter microspheres deposited into the etched wells on the distal face of a 500 micrometers imaging fiber bundle. Imaging fiber bundles containing thousands of optical fibers, each associated with a unique oligonucleotide probe sequence, were the foundation for an optically connected, individually addressable DNA detection platform. Different oligonucleotide-functionalized microspheres were combined in a stock solution, and randomly dispersed into the etched wells. Microsphere positions were registered from optical dyes incorporated onto the microspheres. The distribution process provided an inherent redundancy that increases the signal-to-noise ratio as the square root of the number of sensors examined. The representative amount of each probe-type in the array was dependent on their initial stock solution concentration, and as other sequences of interest arise, new microsphere elements can be added to arrays without altering the existing detection capabilities. The oligonucleotide probe sequences hybridize to fluorescently-labeled, complementary DNA target solutions. Fiber optic DNA microarray research has included DNA-protein interaction profiles, microbial strain differentiation, non-labeled target interrogation with molecular beacons, and single cell-based assays. This biosensor array is proficient in DNA detection linked to specific disease states, single nucleotide polymorphism (SNP's) discrimination, and gene expression analysis. This array platform permits multiple detection formats, provides smaller feature sizes, and enables sensor design flexibility. High-density fiber optic microarray biosensors provide a fast

  7. Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography

    PubMed Central

    Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi

    2013-01-01

    New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO defined ‘elimination’ status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools were applied for a group of 201 multibacillary patients including six multi-case families from eleven departments. The location (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR-was developed for rapid and inexpensive strain typing of M. leprae based on copy numbers of two VNTR minisatellite loci 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54] predominantly distributed in the Atlantic departments and SNP7614 (T)/27-5(4)/12-5(5) [T45] associated with the Andean departments. A novel genotype SNP7614 (C)/27-5(6)/12-5(4) [C64] was detected in cities along the Magdalena river which separates the Andean from Atlantic departments; a subset was further characterized showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. PMID:23291420

  8. Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography.

    PubMed

    Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi

    2013-03-01

    New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO defined 'elimination' status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools were applied for a group of 201 multibacillary patients including six multi-case families from eleven departments. The location (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR-was developed for rapid and inexpensive strain typing of Mycobacterium leprae based on copy numbers of two VNTR minisatellite loci 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54] predominantly distributed in the Atlantic departments and SNP7614 (T)/27-5(4)/12-5(5) [T45] associated with the Andean departments. A novel genotype SNP7614 (C)/27-5(6)/12-5(4) [C64] was detected in cities along the Magdalena river which separates the Andean from Atlantic departments; a subset was further characterized showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. PMID:23291420

  9. Association of MDM2 SNP309, age of onset, and gender in cutaneous melanoma

    PubMed Central

    Firoz, Elnaz F.; Warycha, Melanie; Zakrzewski, Jan; Pollens, Danuta; Wang, Guimin; Shapiro, Richard; Berman, Russell; Pavlick, Anna; Manga, Prashiela; Ostrer, Harry; Celebi, Julide Tok; Kamino, Hideko; Darvishian, Farbod; Rolnitzky, Linda; Goldberg, Judith D.; Osman, Iman; Polsky, David

    2013-01-01

    Purpose In certain cancers, MDM2 SNP309 has been associated with early tumor onset in women. In melanoma, incidence rates are higher in women than in men among individuals less than age 40; however, among those older than age 50, melanoma is more frequent in men than in women. To investigate this difference, we examined the association between MDM2 SNP309, age at diagnosis, and gender among melanoma patients. Experimental Design Prospectively enrolled melanoma patients (N=227) were evaluated for MDM2 SNP309 and the related polymorphism, p53 Arg72Pro. DNA was isolated from patient blood samples and genotypes were analyzed by PCR-RFLP. Associations between MDM2 SNP309, p53 Arg72Pro, age at diagnosis, and clinicopathologic features of melanoma were analyzed. Results The median age at diagnosis was 13 years earlier among women with a SNP309 GG genotype (46 years) compared to women with TG+TT genotypes (59 years; p=0.19). Analyses using age dichotomized at each decade indicated that women with a GG genotype had significantly higher risks of being diagnosed with melanoma at ages less than 50 compared to women 50 and older, but not 60 and older. At ages less than 50, women with a GG genotype had a 3.89 times greater chance of being diagnosed compared to women with TG+TT genotypes (p=0.01). Similar observations were not seen among men. Conclusions Our data suggest that MDM2 may play an important role in the development of melanoma in women. The MDM2 SNP309 genotype may help identify women at risk for developing melanoma at a young age. PMID:19318491

  10. Using Hamming Distance as Information for SNP-Sets Clustering and Testing in Disease Association Studies

    PubMed Central

    Wang, Charlotte; Kao, Wen-Hsin; Hsiao, Chuhsing Kate

    2015-01-01

    The availability of high-throughput genomic data has led to several challenges in recent genetic association studies, including the large number of genetic variants that must be considered and the computational complexity in statistical analyses. Tackling these problems with a marker-set study such as SNP-set analysis can be an efficient solution. To construct SNP-sets, we first propose a clustering algorithm, which employs Hamming distance to measure the similarity between strings of SNP genotypes and evaluates whether the given SNPs or SNP-sets should be clustered. A dendrogram can then be constructed based on such distance measure, and the number of clusters can be determined. With the resulting SNP-sets, we next develop an association test HDAT to examine susceptibility to the disease of interest. This proposed test assesses, based on Hamming distance, whether the similarity between a diseased and a normal individual differs from the similarity between two individuals of the same disease status. In our proposed methodology, only genotype information is needed. No inference of haplotypes is required, and SNPs under consideration do not need to locate in nearby regions. The proposed clustering algorithm and association test are illustrated with applications and simulation studies. As compared with other existing methods, the clustering algorithm is faster and better at identifying sets containing SNPs exerting a similar effect. In addition, the simulation studies demonstrated that the proposed test works well for SNP-sets containing a large proportion of neutral SNPs. Furthermore, employing the clustering algorithm before testing a large set of data improves the knowledge in confining the genetic regions for susceptible genetic markers. PMID:26302001

  11. An Integrated SNP Mining and Utilization (ISMU) Pipeline for Next Generation Sequencing Data

    PubMed Central

    Azam, Sarwar; Rathore, Abhishek; Shah, Trushar M.; Telluri, Mohan; Amindala, BhanuPrakash; Ruperao, Pradeep; Katta, Mohan A. V. S. K.; Varshney, Rajeev K.

    2014-01-01

    Open source single nucleotide polymorphism (SNP) discovery pipelines for next generation sequencing data commonly requires working knowledge of command line interface, massive computational resources and expertise which is a daunting task for biologists. Further, the SNP information generated may not be readily used for downstream processes such as genotyping. Hence, a comprehensive pipeline has been developed by integrating several open source next generation sequencing (NGS) tools along with a graphical user interface called Integrated SNP Mining and Utilization (ISMU) for SNP discovery and their utilization by developing genotyping assays. The pipeline features functionalities such as pre-processing of raw data, integration of open source alignment tools (Bowtie2, BWA, Maq, NovoAlign and SOAP2), SNP prediction (SAMtools/SOAPsnp/CNS2snp and CbCC) methods and interfaces for developing genotyping assays. The pipeline outputs a list of high quality SNPs between all pairwise combinations of genotypes analyzed, in addition to the reference genome/sequence. Visualization tools (Tablet and Flapjack) integrated into the pipeline enable inspection of the alignment and errors, if any. The pipeline also provides a confidence score or polymorphism information content value with flanking sequences for identified SNPs in standard format required for developing marker genotyping (KASP and Golden Gate) assays. The pipeline enables users to process a range of NGS datasets such as whole genome re-sequencing, restriction site associated DNA sequencing and transcriptome sequencing data at a fast speed. The pipeline is very useful for plant genetics and breeding community with no computational expertise in order to discover SNPs and utilize in genomics, genetics and breeding studies. The pipeline has been parallelized to process huge datasets of next generation sequencing. It has been developed in Java language and is available at http://hpc.icrisat.cgiar.org/ISMU as a standalone

  12. k-merSNP discovery: Software for alignment-and reference-free scalable SNP discovery, phylogenetics, and annotation for hundreds of microbial genomes

    SciTech Connect

    2014-11-18

    With the flood of whole genome finished and draft microbial sequences, we need faster, more scalable bioinformatics tools for sequence comparison. An algorithm is described to find single nucleotide polymorphisms (SNPs) in whole genome data. It scales to hundreds of bacterial or viral genomes, and can be used for finished and/or draft genomes available as unassembled contigs or raw, unassembled reads. The method is fast to compute, finding SNPs and building a SNP phylogeny in minutes to hours, depending on the size and diversity of the input sequences. The SNP-based trees that result are consistent with known taxonomy and trees determined in other studies. The approach we describe can handle many gigabases of sequence in a single run. The algorithm is based on k-mer analysis.

  13. k-merSNP discovery: Software for alignment-and reference-free scalable SNP discovery, phylogenetics, and annotation for hundreds of microbial genomes

    Energy Science and Technology Software Center (ESTSC)

    2014-11-18

    With the flood of whole genome finished and draft microbial sequences, we need faster, more scalable bioinformatics tools for sequence comparison. An algorithm is described to find single nucleotide polymorphisms (SNPs) in whole genome data. It scales to hundreds of bacterial or viral genomes, and can be used for finished and/or draft genomes available as unassembled contigs or raw, unassembled reads. The method is fast to compute, finding SNPs and building a SNP phylogeny inmore » minutes to hours, depending on the size and diversity of the input sequences. The SNP-based trees that result are consistent with known taxonomy and trees determined in other studies. The approach we describe can handle many gigabases of sequence in a single run. The algorithm is based on k-mer analysis.« less

  14. A novel SNP in 3' UTR of INS gene: A case report of neonatal diabetes mellitus.

    PubMed

    Bogari, Neda M; Rayes, Husni H; Mostafa, Fakri; Abdel-Latif, Azza M; Ramadan, Abeer; Al-Allaf, Faisal A; Taher, Mohiuddin M; Fawzy, Ahmed

    2015-09-01

    Neonatal diabetes mellitus (NDM) is a rare condition with a prevalence of 1 in 300,000 live births. We have found 3 known SNPs in 5'UTR and a novel SNP in 3' UTR in the INS gene. These SNPs were present in 9-month-old girl from Saudi Arabia and also present in the father and mother. The novel SNP we found is not present in 1000 Genome project or other databases. Further, the newly identified 3' UTR mutation in the INS gene may abolish the polyadenylation signal and result in severe RNA instability. PMID:26212367

  15. Single nucleotide polymorphism array karyotyping: a diagnostic and prognostic tool in myelodysplastic syndromes with unsuccessful conventional cytogenetic testing.

    PubMed

    Arenillas, Leonor; Mallo, Mar; Ramos, Fernando; Guinta, Kathryn; Barragán, Eva; Lumbreras, Eva; Larráyoz, María-José; De Paz, Raquel; Tormo, Mar; Abáigar, María; Pedro, Carme; Cervera, José; Such, Esperanza; José Calasanz, María; Díez-Campelo, María; Sanz, Guillermo F; Hernández, Jesús María; Luño, Elisa; Saumell, Sílvia; Maciejewski, Jaroslaw; Florensa, Lourdes; Solé, Francesc

    2013-12-01

    Cytogenetic aberrations identified by metaphase cytogenetics (MC) have diagnostic, prognostic, and therapeutic implications in myelodysplastic syndromes (MDS). However, in some MDS patients MC study is unsuccesful. Single nucleotide polymorphism array (SNP-A) based karyotyping could be helpful in these cases. We performed SNP-A in 62 samples from bone marrow or peripheral blood of primary MDS with an unsuccessful MC study. SNP-A analysis enabled the detection of aberrations in 31 (50%) patients. We used the copy number alteration information to apply the International Prognostic Scoring System (IPSS) and we observed differences in survival between the low/intermediate-1 and intermediate-2/high risk patients. We also saw differences in survival between very low/low/intermediate and the high/very high patients when we applied the revised IPSS (IPSS-R). In conclusion, SNP-A can be used successfully in PB samples and the identification of CNA by SNP-A improve the diagnostic and prognostic evaluation of this group of MDS patients. PMID:24123380

  16. Development of genotyping by sequencing (GBS) and array derived SNP markers for stem rust resistance gene Sr42

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The stem rust fungus, particularly race TTKSK (Ug99), poses a serious threat to world wheat production. Gene Sr42 or SrCad (which could be the same gene or an allele of Sr42) is effective against race TTKSK. However, known genetic markers for Sr42 are mostly SSR markers which are generally labor i...

  17. Development and evaluation of SoySNP50K, a high-density genotyping array for soybean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The objective of this research was to identify single nucleotide polymorphisms (SNPs) and to efficiently develop an Infinium iSelect beadchip that contained over 50,000 SNPs from soybean. A total of 498,921,777 reads 35-45bp in length were obtained from DNA sequence analysis of reduced representati...

  18. Identification of the varietal origin of loose leaf tea based on analysis of a single leaf by SNP nanofluidic array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Tea [Camellia sinensis (L.) O Kuntze] is an economically important crop cultivated in more than 50 countries. Production and marketing of premium specialty tea products provides opportunities for tea growers, the tea industry and consumers. Rapid market segmentation in the tea industry has resulted ...

  19. Varietal identification of tea (Camellia sinensis [L.] Kuntze) using nanofluidic array of Single Nucleotide Polymorphism (SNP) markers

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Apart from water, tea is the world’s most widely consumed beverage. Tea is produced in more than 50 countries with an annual production of approximately 4.7 million tons. The market segment for specialty tea has been expanding rapidly owing to increased demand, resulting in higher revenues and profi...

  20. High-Density SNP Map Construction and QTL Identification for the Apetalous Character in Brassica napus L.

    PubMed Central

    Wang, Xiaodong; Yu, Kunjiang; Li, Hongge; Peng, Qi; Chen, Feng; Zhang, Wei; Chen, Song; Hu, Maolong; Zhang, Jiefu

    2015-01-01

    The apetalous genotype is a morphological ideotype for increasing seed yield and should be of considerable agricultural use; however, only a few studies have focused on the genetic control of this trait in Brassica napus. In the present study, a recombinant inbred line, the AH population, containing 189 individuals was derived from a cross between an apetalous line ‘APL01’ and a normally petalled variety ‘Holly’. The Brassica 60 K Infinium BeadChip Array harboring 52,157 single nucleotide polymorphism (SNP) markers was used to genotype the AH individuals. A high-density genetic linkage map was constructed based on 2,755 bins involving 11,458 SNPs and 57 simple sequence repeats, and was used to identify loci associated with petalous degree (PDgr). The linkage map covered 2,027.53 cM, with an average marker interval of 0.72 cM. The AH map had good collinearity with the B. napus reference genome, indicating its high quality and accuracy. After phenotypic analyses across five different experiments, a total of 19 identified quantitative trait loci (QTLs) distributed across chromosomes A3, A5, A6, A9 and C8 were obtained, and these QTLs were further integrated into nine consensus QTLs by a meta-analysis. Interestingly, the major QTL qPD.C8-2 was consistently detected in all five experiments, and qPD.A9-2 and qPD.C8-3 were stably expressed in four experiments. Comparative mapping between the AH map and the B. napus reference genome suggested that there were 328 genes underlying the confidence intervals of the three steady QTLs. Based on the Gene Ontology assignments of 52 genes to the regulation of floral development in published studies, 146 genes were considered as potential candidate genes for PDgr. The current study carried out a QTL analysis for PDgr using a high-density SNP map in B. napus, providing novel targets for improving seed yield. These results advanced our understanding of the genetic control of PDgr regulation in B. napus. PMID:26779193

  1. Meta-Analysis of High-Density SNP Associations for Beef Cattle Production Traits from Three Countries

    Technology Transfer Automated Retrieval System (TEKTRAN)

    About 50,000 SNP were evaluated for associations with growth, carcass, and meat quality traits in three populations of cattle in the United States, Canada, and Australia. Regression coefficients for each SNP were independently estimated within each country. Coefficients for similar traits were stand...

  2. MDM2 promoter SNP55 (rs2870820) affects risk of colon cancer but not breast-, lung-, or prostate cancer.

    PubMed

    Helwa, Reham; Gansmo, Liv B; Romundstad, Pål; Hveem, Kristian; Vatten, Lars; Ryan, Bríd M; Harris, Curtis C; Lønning, Per E; Knappskog, Stian

    2016-01-01

    Two functional SNPs (SNP285G > C; rs117039649 and SNP309T > G; rs2279744) have previously been reported to modulate Sp1 transcription factor binding to the promoter of the proto-oncogene MDM2, and to influence cancer risk. Recently, a third SNP (SNP55C > T; rs2870820) was also reported to affect Sp1 binding and MDM2 transcription. In this large population based case-control study, we genotyped MDM2 SNP55 in 10,779 Caucasian individuals, previously genotyped for SNP309 and SNP285, including cases of colon (n = 1,524), lung (n = 1,323), breast (n = 1,709) and prostate cancer (n = 2,488) and 3,735 non-cancer controls, as well as 299 healthy African-Americans. Applying the dominant model, we found an elevated risk of colon cancer among individuals harbouring SNP55TT/CT genotypes compared to the SNP55CC genotype (OR = 1.15; 95% CI = 1.01-1.30). The risk was found to be highest for left-sided colon cancer (OR = 1.21; 95% CI = 1.00-1.45) and among females (OR = 1.32; 95% CI = 1.01-1.74). Assessing combined genotypes, we found the highest risk of colon cancer among individuals harbouring the SNP55TT or CT together with the SNP309TG genotype (OR = 1.21; 95% CI = 1.00-1.46). Supporting the conclusions from the risk estimates, we found colon cancer cases carrying the SNP55TT/CT genotypes to be diagnosed at younger age as compared to SNP55CC (p = 0.053), in particular among patients carrying the SNP309TG/TT genotypes (p = 0.009). PMID:27624283

  3. Axiom turkey genotyping array

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The Axiom®Turkey Genotyping Array interrogates 643,845 probesets on the array, covering 643,845 SNPs. The array development was led by Dr. Julie Long of the USDA-ARS Beltsville Agricultural Research Center under a public-private partnership with Hendrix Genetics, Aviagen, and Affymetrix. The Turk...

  4. Optofluidic plasmonic onchip nanosensor array for biodetection

    NASA Astrophysics Data System (ADS)

    Huang, Min

    Surface plasmon resonance (SPR) sensing has been demonstrated in the past decade to be the gold standard technique for biochemical interaction analysis, and plays an important role in drug discovery and biomedical research. The technique circumvents the need of fluorescence/radioactive tagging or enzymatic detection, enables ultrasensitive remote sensing, and quantitatively monitors bio-interaction in real time. Although SPR has these attractive features that can satisfy most research/clinic requirements, there still exist problems that limit its applications. First, the reflection geometry of the prism coupling scheme adds limitations for high throughput screening application. Additionally, SPR instrumentations are bulky and not suitable for point-of-care settings. Moreover, the SPR sensor is embedded in conventional micro-fluidic cells, in which the sensor performance is limited by inefficient analyte transport. Suspended plasmonic nanohole array (PNA) offers an opportunity to overcome these limitations. A collinear excitation/collection coupling scheme combined with the small footprint of PNA provides unique platform for multiplexing and system minimization. The suspended nanohole structure also offers a unique configuration to integrate nano-photonics with nano-fluidics. This thesis focuses on developing a lab-on-a-chip PNA platform for point-of-care bio-detection. To achieve this, we first demonstrate that the figure-of-merit of our PNA sensor surpasses that of the prism coupled SPR. We also show that the ultrasensitive label-free PNA sensor is able to directly detect intact viruses from biological media at clinically relevant concentrations with little sample preparation. We then present a plasmonic microarray with over one million PNA sensors on a microscope slide for high throughput screening applications. A dual-color filter imaging method is introduced to increase the accuracy, reliability, and signal-to-noise ratio in a highly multiplexed manner. Finally

  5. Longevity and plasticity of CFTR provide an argument for noncanonical SNP organization in hominid DNA.

    PubMed

    Hill, Aubrey E; Plyler, Zackery E; Tiwari, Hemant; Patki, Amit; Tully, Joel P; McAtee, Christopher W; Moseley, Leah A; Sorscher, Eric J

    2014-01-01

    Like many other ancient genes, the cystic fibrosis transmembrane conductance regulator (CFTR) has survived for hundreds of millions of years. In this report, we consider whether such prodigious longevity of an individual gene--as opposed to an entire genome or species--should be considered surprising in the face of eons of relentless DNA replication errors, mutagenesis, and other causes of sequence polymorphism. The conventions that modern human SNP patterns result either from purifying selection or random (neutral) drift were not well supported, since extant models account rather poorly for the known plasticity and function (or the established SNP distributions) found in a multitude of genes such as CFTR. Instead, our analysis can be taken as a polemic indicating that SNPs in CFTR and many other mammalian genes may have been generated--and continue to accrue--in a fundamentally more organized manner than would otherwise have been expected. The resulting viewpoint contradicts earlier claims of 'directional' or 'intelligent design-type' SNP formation, and has important implications regarding the pace of DNA adaptation, the genesis of conserved non-coding DNA, and the extent to which eukaryotic SNP formation should be viewed as adaptive. PMID:25350658

  6. The use of SNP data for the monitoring of genetic diversity in cattle breeds

    Technology Transfer Automated Retrieval System (TEKTRAN)

    LD between SNPs contains information about effective population size. In this study, we investigate the use of genome-wide SNP data for marker based estimation of effective population size for two taurine cattle breeds of Africa and two local cattle breeds of Switzerland. Estimated recombination rat...

  7. Application of RAD LongRead sequencing for SNP discovery in sugarcane

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The sugarcane (hybrid Saccharum spp.) genome presents a difficult challenge for SNP discovery and analysis due to its complex polyploid nature. This is compounded further due to the absence of a reference genome sequence. We report the discovery of SNPs in sugarcane through reductive sequencing and ...

  8. SNP-revealed genetic diversity in wild emmer wheat correlates with ecological factors

    PubMed Central

    2013-01-01

    Background Patterns of genetic diversity between and within natural plant populations and their driving forces are of great interest in evolutionary biology. However, few studies have been performed on the genetic structure and population divergence in wild emmer wheat using a large number of EST-related single nucleotide polymorphism (SNP) markers. Results In the present study, twenty-five natural wild emmer wheat populations representing a wide range of ecological conditions in Israel and Turkey were used. Genetic diversity and genetic structure were investigated using over 1,000 SNP markers. A moderate level of genetic diversity was detected due to the biallelic property of SNP markers. Clustering based on Bayesian model showed that grouping pattern is related to the geographical distribution of the wild emmer wheat. However, genetic differentiation between populations was not necessarily dependent on the geographical distances. A total of 33 outlier loci under positive selection were identified using a FST-outlier method. Significant correlations between loci and ecogeographical factors were observed. Conclusions Natural selection appears to play a major role in generating adaptive structures in wild emmer wheat. SNP markers are appropriate for detecting selectively-channeled adaptive genetic diversity in natural populations of wild emmer wheat. This adaptive genetic diversity is significantly associated with ecological factors. PMID:23937410

  9. Applying SNP marker technology in the cacao breeding program at the Cocoa Research Institute of Ghana

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In this investigation 45 parental cacao plants and five progeny derived from the parental stock studied were genotyped using six SNP markers to determine off-types or mislabeled clones and to authenticate crosses made in the Cocoa Research Institute of Ghana (CRIG) breeding program. Investigation wa...

  10. SNP Discovery in Swine by Reduced Representation and High Throughput Pyrosequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Relatively little information is available for sequence variation in the pig. Because reduced representation reduces the complexity of the genome being sampled by orders of magnitude and samples identical regions dispersed across the genome, it is an ideal strategy for SNP discovery in a species wit...

  11. SNP-based genotyping in lentil: linking sequence information with phenotypes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Lentil (Lens culinaris) has been late to enter the world of high throughput molecular analysis due to a general lack of genomic resources. Using a 454 sequencing-based approach, SNPs have been identified in genes across the lentil genome. Several hundred have been turned into single SNP KASP assay...

  12. Development of a SNP genotyping panel for genetic monitoring of the laboratory mouse.

    PubMed

    Petkov, Petko M; Cassell, Megan A; Sargent, Evelyn E; Donnelly, Charles J; Robinson, Phil; Crew, Victor; Asquith, Steven; Haar, Raymond Vonder; Wiles, Michael V

    2004-05-01

    We have developed a genotyping system for detecting genetic contamination in the laboratory mouse based on assaying single-nucleotide polymorphism (SNP) markers positioned on all autosomes and the X chromosome. This system provides a fast, reliable, and cost-effective way for genetic monitoring, while maintaining a very high degree of confidence. We describe the allelic distribution of 235 SNPs in 48 mouse strains, thereby creating a database of polymorphisms useful for genotyping purposes. The SNP markers used in this study were chosen from publicly available SNP databases. Four genotyping methods were evaluated, and dynamic two-tube allele-specific PCR assays were developed for each marker and tested on a set of 48 inbred mouse strains. The minimal number of assays sufficient to distinguish groups consisting of different numbers of mouse strains was estimated, and a panel of 28 SNPs sufficient to distinguish virtually all of the inbred strains tested was selected. Amplifluor SNP detection assays were developed for these markers and tested on an extended list of 96 strains. This panel was used as a genetic quality control approach to monitor the genotypes of nearly 300 inbred, wild-derived, congenic, consomic, and recombinant inbred strains maintained at The Jackson Laboratory. We have concluded that this marker panel is sufficient for genetic contamination monitoring in colonies containing a large number of genetically diverse mouse strains and that reduced versions of the panel could be implemented in facilities housing a lower number of strains. PMID:15081119

  13. Identification of a SNP marker associated with WB242 nematode resistance in sugar beet

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The beet-cyst nematode (Heterodera schachtii Schmidt) is one of the major diseases of sugar beet. The identification of molecular markers associated to the nematode resistance would be helpful for developing resistant varieties. The aim of this study was the identification of SNP (Single Nucleotide ...

  14. Field ionization process of Eu 4f76snp Rydberg states

    NASA Astrophysics Data System (ADS)

    Zhang, Jing; Shen, Li; Dai, Chang-Jian

    2015-11-01

    The field ionization process of the Eu 4f76snp Rydberg states, converging to the first ionization limit, 4f76s 9S4, is systematically investigated. The spectra of the Eu 4f76snp Rydberg states are populated with three-step laser excitation, and detected by electric field ionization (EFI) method. Two different kinds of the EFI pulses are applied after laser excitation to observe the possible impacts on the EFI process. The exact EFI ionization thresholds for the 4f76snp Rydberg states can be determined by observing the corresponding EFI spectra. In particular, some structures above the EFI threshold are found in the EFI spectra, which may be interpreted as the effect from black body radiation (BBR). Finally, the scaling law of the EFI threshold for the Eu 4f76snp Rydberg states with the effective quantum number is built. Project supported by the National Natural Science Foundation of China (Grant Nos. 11004151 and 11174218).

  15. SNP discovery in swine by reduced representation and high throughput pyrosequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A reduced representation library (RRL) of porcine genomic fragments was used to identify SNP from a pool of DNA isolated from 26 animals (52 chromosomes) relevant to current pork production. Treatment of the pooled DNA with a restriction enzyme, coupled with gel-based size selection of 450 base pair...

  16. Association Analyses of Candidate SNP on Reproductive and Production Traits in Swine

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The ability to identify young females with superior reproduction would have a large economic impact on commercial swine production. Previous studies have discovered SNP associated with economically important traits such as litter size, growth rate, and feed intake. The objective of this study was to...

  17. High-density SNP Scan of Production and Product Quality Traits in Beef Cattle

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genotypes from the BovineSNP50 BeadChip (50K) were obtained on animals derived from 150 AI sires from seven breeds (22 sires per breed; Angus, Charolais, Gelbvieh, Hereford, Limousin, Red Angus, and Simmental) as either progeny (F1; 590 steers) or grandprogeny (F1 x F1 = F1**2; 1,306 steers and 707 ...

  18. Utilization of a whole genome SNP panel for efficient genetic mapping in the mouse

    PubMed Central

    Moran, Jennifer L.; Bolton, Andrew D.; Tran, Pamela V.; Brown, Alison; Dwyer, Noelle D.; Manning, Danielle K.; Bjork, Bryan C.; Li, Cheng; Montgomery, Kate; Siepka, Sandra M.; Vitaterna, Martha Hotz; Takahashi, Joseph S.; Wiltshire, Tim; Kwiatkowski, David J.; Kucherlapati, Raju; Beier, David R.

    2006-01-01

    Phenotype-driven genetics can be used to create mouse models of human disease and birth defects. However, the utility of these mutant models is limited without identification of the causal gene. To facilitate genetic mapping, we developed a fixed single nucleotide polymorphism (SNP) panel of 394 SNPs as an alternative to analyses using simple sequence length polymorphism (SSLP) marker mapping. With the SNP panel, chromosomal locations for 22 monogenic mutants were identified. The average number of affected progeny genotyped for mapped monogenic mutations is nine. Map locations for several mutants have been obtained with as few as four affected progeny. The average size of genetic intervals obtained for these mutants is 43 Mb, with a range of 17–83 Mb. Thus, our SNP panel allows for identification of moderate resolution map position with small numbers of mice in a high-throughput manner. Importantly, the panel is suitable for mapping crosses from many inbred and wild-derived inbred strain combinations. The chromosomal localizations obtained with the SNP panel allow one to quickly distinguish between potentially novel loci or remutations in known genes, and facilitates fine mapping and positional cloning. By using this approach, we identified DNA sequence changes in two ethylnitrosourea-induced mutants. PMID:16461637

  19. Development and validation of a low-density SNP panel related to prolificacy in sheep

    Technology Transfer Automated Retrieval System (TEKTRAN)

    High-density SNP panels (e.g., 50,000 and 600,000 markers) have been used in exploratory population genetic studies with commercial and minor breeds of sheep. However, routine genetic diversity evaluations of large numbers of samples with large panels are in general cost-prohibitive for gene banks. ...

  20. SNP discovery in complex allotetraploid genomes (Gossypium spp., Malvaceae) using genotyping by sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Dramatic decreases in the cost of DNA sequencing have enabled the development of very large numbers of markers based on single nucleotide polymorphism (SNP) for phylogenetic studies, population genetics, linkage mapping, marker-assisted breeding and other applications. Using Illumina next-generatio...

  1. Association mapping of resistance to leaf rust in emmer wheat using high throughput SNP markers

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Emmer wheat (Triticum turgidum L. subsp. dicoccum) is known to be a useful source of genes for many desirable characters for improvement of modern cultivated wheat. Recently, a panel of 181 emmer wheat accessions has been genotyped with wheat 9K SNP (single nucleotide polymorphism) markers and exte...

  2. Development of gene-tagged SNP markers for gland morphogenesis in cotton

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Cotton (Gossypium spp.) plants, including cottonseed, have small, pigmented glands containing gossypol and other terpenoid compounds that are toxic to humans and non-ruminant animals. Single nucleotide polymorphism (SNP) markers involved in gland morphogenesis are useful for the discovery of candid...

  3. Association of Agronomic Traits with SNP Markers in Durum Wheat (Triticum turgidum L. durum (Desf.))

    PubMed Central

    Hu, Xin; Ren, Jing; Ren, Xifeng; Huang, Sisi; Sabiel, Salih A. I.; Luo, Mingcheng; Nevo, Eviatar; Fu, Chunjie; Peng, Junhua; Sun, Dongfa

    2015-01-01

    Association mapping is a powerful approach to detect associations between traits of interest and genetic markers based on linkage disequilibrium (LD) in molecular plant breeding. In this study, 150 accessions of worldwide originated durum wheat germplasm (Triticum turgidum spp. durum) were genotyped using 1,366 SNP markers. The extent of LD on each chromosome was evaluated. Association of single nucleotide polymorphisms (SNP) markers with ten agronomic traits measured in four consecutive years was analyzed under a mix linear model (MLM). Two hundred and one significant association pairs were detected in the four years. Several markers were associated with one trait, and also some markers were associated with multiple traits. Some of the associated markers were in agreement with previous quantitative trait loci (QTL) analyses. The function and homology analyses of the corresponding ESTs of some SNP markers could explain many of the associations for plant height, length of main spike, number of spikelets on main spike, grain number per plant, and 1000-grain weight, etc. The SNP associations for the observed traits are generally clustered in specific chromosome regions of the wheat genome, mainly in 2A, 5A, 6A, 7A, 1B, and 6B chromosomes. This study demonstrates that association mapping can complement and enhance previous QTL analyses and provide additional information for marker-assisted selection. PMID:26110423

  4. New tools and methods for direct programmatic access to the dbSNP relational database

    PubMed Central

    Saccone, Scott F.; Quan, Jiaxi; Mehta, Gaurang; Bolze, Raphael; Thomas, Prasanth; Deelman, Ewa; Tischfield, Jay A.; Rice, John P.

    2011-01-01

    Genome-wide association studies often incorporate information from public biological databases in order to provide a biological reference for interpreting the results. The dbSNP database is an extensive source of information on single nucleotide polymorphisms (SNPs) for many different organisms, including humans. We have developed free software that will download and install a local MySQL implementation of the dbSNP relational database for a specified organism. We have also designed a system for classifying dbSNP tables in terms of common tasks we wish to accomplish using the database. For each task we have designed a small set of custom tables that facilitate task-related queries and provide entity-relationship diagrams for each task composed from the relevant dbSNP tables. In order to expose these concepts and methods to a wider audience we have developed web tools for querying the database and browsing documentation on the tables and columns to clarify the relevant relational structure. All web tools and software are freely available to the public at http://cgsmd.isi.edu/dbsnpq. Resources such as these for programmatically querying biological databases are essential for viably integrating biological information into genetic association experiments on a genome-wide scale. PMID:21037260

  5. High-throughput RAD-SNP genotyping for characterization of sugar beet genotypes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    High-throughput SNP genotyping provides a rapid way of developing resourceful set of markers for delineating the genetic architecture and for effective species discrimination. In the presented research, we demonstrate a set of 192 SNPs for effective genotyping in sugar beet using high-throughput mar...

  6. Changes in variance explained by top SNP windows over generations for three traits in broiler chicken

    PubMed Central

    Fragomeni, Breno de Oliveira; Misztal, Ignacy; Lourenco, Daniela Lino; Aguilar, Ignacio; Okimoto, Ronald; Muir, William M.

    2014-01-01

    The purpose of this study was to determine if the set of genomic regions inferred as accounting for the majority of genetic variation in quantitative traits remain stable over multiple generations of selection. The data set contained phenotypes for five generations of broiler chicken for body weight, breast meat, and leg score. The population consisted of 294,632 animals over five generations and also included genotypes of 41,036 single nucleotide polymorphism (SNP) for 4,866 animals, after quality control. The SNP effects were calculated by a GWAS type analysis using single step genomic BLUP approach for generations 1–3, 2–4, 3–5, and 1–5. Variances were calculated for windows of 20 SNP. The top ten windows for each trait that explained the largest fraction of the genetic variance across generations were examined. Across generations, the top 10 windows explained more than 0.5% but less than 1% of the total variance. Also, the pattern of the windows was not consistent across generations. The windows that explained the greatest variance changed greatly among the combinations of generations, with a few exceptions. In many cases, a window identified as top for one combination, explained less than 0.1% for the other combinations. We conclude that identification of top SNP windows for a population may have little predictive power for genetic selection in the following generations for the traits here evaluated. PMID:25324857

  7. SNP-based high density genetic map and mapping of btwd1 dwarfing gene in barley

    PubMed Central

    Ren, Xifeng; Wang, Jibin; Liu, Lipan; Sun, Genlou; Li, Chengdao; Luo, Hong; Sun, Dongfa

    2016-01-01

    A high-density linkage map is a valuable tool for functional genomics and breeding. A newly developed sequence-based marker technology, restriction site associated DNA (RAD) sequencing, has been proven to be powerful for the rapid discovery and genotyping of genome-wide single nucleotide polymorphism (SNP) markers and for the high-density genetic map construction. The objective of this research was to construct a high-density genetic map of barley using RAD sequencing. 1894 high-quality SNP markers were developed and mapped onto all seven chromosomes together with 68 SSR markers. These 1962 markers constituted a total genetic length of 1375.8 cM and an average of 0.7 cM between adjacent loci. The number of markers within each linkage group ranged from 209 to 396. The new recessive dwarfing gene btwd1 in Huaai 11 was mapped onto the high density linkage maps. The result showed that the btwd1 is positioned between SNP marks 7HL_6335336 and 7_249275418 with a genetic distance of 0.9 cM and 0.7 cM on chromosome 7H, respectively. The SNP-based high-density genetic map developed and the dwarfing gene btwd1 mapped in this study provide critical information for position cloning of the btwd1 gene and molecular breeding of barley. PMID:27530597

  8. Imputation of microsatellite allele from dense SNP genotypes for parentage verification

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Microsatellite (MS) markers have recently been used for parental verification and are still the international standard despite higher cost, error rate, and turnaround time compared with Single Nucleotide Polymorphisms (SNP)-based assays. Despite domestic and international interest from producers an...

  9. selectSNP – An R package for selecting SNPs optimal for genetic evaluation

    Technology Transfer Automated Retrieval System (TEKTRAN)

    There has been a huge increase in the number of SNPs in the public repositories. This has made it a challenge to design low and medium density SNP panels, which requires careful selection of available SNPs considering many criteria, such as map position, allelic frequency, possible biological functi...

  10. SNP-based high density genetic map and mapping of btwd1 dwarfing gene in barley.

    PubMed

    Ren, Xifeng; Wang, Jibin; Liu, Lipan; Sun, Genlou; Li, Chengdao; Luo, Hong; Sun, Dongfa

    2016-01-01

    A high-density linkage map is a valuable tool for functional genomics and breeding. A newly developed sequence-based marker technology, restriction site associated DNA (RAD) sequencing, has been proven to be powerful for the rapid discovery and genotyping of genome-wide single nucleotide polymorphism (SNP) markers and for the high-density genetic map construction. The objective of this research was to construct a high-density genetic map of barley using RAD sequencing. 1894 high-quality SNP markers were developed and mapped onto all seven chromosomes together with 68 SSR markers. These 1962 markers constituted a total genetic length of 1375.8 cM and an average of 0.7 cM between adjacent loci. The number of markers within each linkage group ranged from 209 to 396. The new recessive dwarfing gene btwd1 in Huaai 11 was mapped onto the high density linkage maps. The result showed that the btwd1 is positioned between SNP marks 7HL_6335336 and 7_249275418 with a genetic distance of 0.9 cM and 0.7 cM on chromosome 7H, respectively. The SNP-based high-density genetic map developed and the dwarfing gene btwd1 mapped in this study provide critical information for position cloning of the btwd1 gene and molecular breeding of barley. PMID:27530597

  11. Development and Characterization of a High Density SNP Genotyping Assay for Cattle

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The success of genomewide association (GWA) studies for the detection of sequence variation affecting complex traits in human has spurred interest in the use of large-scale high-density single nucleotide polymorphism (SNP) genotyping for the identification of quantitative trait loci (QTL) and for ma...

  12. An improved consensus linkage map of barley based on flow-sorted chromosomes and SNP markers

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Recent advances in high-throughput genotyping have made it easier to combine information from different mapping populations into consensus genetic maps, which provide increased marker density and genome coverage compared to individual maps. Previously, a SNP-based genotyping platform was developed a...

  13. Longevity and Plasticity of CFTR Provide an Argument for Noncanonical SNP Organization in Hominid DNA

    PubMed Central

    Hill, Aubrey E.; Plyler, Zackery E.; Tiwari, Hemant; Patki, Amit; Tully, Joel P.; McAtee, Christopher W.; Moseley, Leah A.; Sorscher, Eric J.

    2014-01-01

    Like many other ancient genes, the cystic fibrosis transmembrane conductance regulator (CFTR) has survived for hundreds of millions of years. In this report, we consider whether such prodigious longevity of an individual gene – as opposed to an entire genome or species – should be considered surprising in the face of eons of relentless DNA replication errors, mutagenesis, and other causes of sequence polymorphism. The conventions that modern human SNP patterns result either from purifying selection or random (neutral) drift were not well supported, since extant models account rather poorly for the known plasticity and function (or the established SNP distributions) found in a multitude of genes such as CFTR. Instead, our analysis can be taken as a polemic indicating that SNPs in CFTR and many other mammalian genes may have been generated—and continue to accrue—in a fundamentally more organized manner than would otherwise have been expected. The resulting viewpoint contradicts earlier claims of ‘directional’ or ‘intelligent design-type’ SNP formation, and has important implications regarding the pace of DNA adaptation, the genesis of conserved non-coding DNA, and the extent to which eukaryotic SNP formation should be viewed as adaptive. PMID:25350658

  14. A web-based genome browser for 'SNP-aware' assay design

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Human and animal genomes contain an abundance of single nucleotide polymorphisms (SNPs) that are useful for genetic testing. However, the relatively large number of SNPs present in diverse populations can pose serious problems when designing assays. It is important to “mask” some SNP positions so ...

  15. The impact of SNP fingerprinting and parentage analysis on the effectiveness of variety recommendations in cacao

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Evidence for the impact of mislabeling and/or pollen contamination on consistency of field performance has been lacking to reinforce the need for strict adherence to quality control protocols in cacao seed garden and germplasm plot management. The present study used SNP fingerprinting at 64 loci to ...

  16. Single Nucleotide Polymorphism (SNP) in the Adiponectin Gene and Cardiovascular Disease.

    PubMed

    Chirumbolo, Salvatore

    2016-07-01

    Dear Editor, The recent article by Mohammadzadeh et al.[1] on the latest issue of this Journal showed that the T allele +276G/T SNP of ADIPOQ gene is more associated with the increasing risk of coronary artery disease (CAD) in subjects with type 2 diabetes. Adipocytes were described in myocardial tissue of CAD patients and their role recently discussed[2,3]. Susceptibility to CAD by polymorphism in the Q gene of adiponectin has been reported for 3'-UTR, which harbours some genetic loci associated with metabolic risks and atherosclerosis[4]. Actually, previous studies have shown that the haplotype SNP +276G>T was associated with a decreased risk of CAD, after adjustment for potential confounding factors, therefore some controversial opinion still exists[5]. This evidence should be associated with the role exerted by adipocytes and adiponectin in heart physiology. In particular, in hypertensive disorder complicating pregnancy (HDCP), by investigating the population frequency of alleles, genotypes, and haplotypes of two single nucleotide polymorphisms (SNPs), namely +45T>G (rs2241766) and +276G>T (rs1501299), some authors found that the SNP +276 TT genotype was significantly associated with protection against HDCP, when compared to the pooled G genotypes[6]. Moreover, the same +276G/T SNP haplotype was strongly associated with biliary atresia, an intractable neonatal inflammatory and obliterative cholangiopathy, leading to progressive fibrosis and cirrhosis[7]. CAD is closely related to adiponectin biology. The same isoforms of adiponectin seem to be not associated to CAD severity but to glucose metabolism and its impairment[8]. In the paper by Mohammadzadeh et al.[1], T allele in +276G/T SNP haplotype is highly associated with CAD in subjects with type 2 diabetes, but this linkage should be reappraised if related much more to diabetes rather than CAD. Association of T allele in the indicated SNP with CAD may be an indirect consequence of type 2 diabetes, as reported

  17. Thermophotovoltaic Array Optimization

    SciTech Connect

    SBurger; E Brown; K Rahner; L Danielson; J Openlander; J Vell; D Siganporia

    2004-07-29

    A systematic approach to thermophotovoltaic (TPV) array design and fabrication was used to optimize the performance of a 192-cell TPV array. The systematic approach began with cell selection criteria that ranked cells and then matched cell characteristics to maximize power output. Following cell selection, optimization continued with an array packaging design and fabrication techniques that introduced negligible electrical interconnect resistance and minimal parasitic losses while maintaining original cell electrical performance. This paper describes the cell selection and packaging aspects of array optimization as applied to fabrication of a 192-cell array.

  18. SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate.

    PubMed

    Roffler, Gretchen H; Amish, Stephen J; Smith, Seth; Cosart, Ted; Kardos, Marty; Schwartz, Michael K; Luikart, Gordon

    2016-09-01

    Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding and nearby 5' and 3' untranslated regions of chosen candidate genes. Targeted sequences were taken from bighorn sheep (Ovis canadensis) exon capture data and directly from the domestic sheep genome (Ovis aries v. 3; oviAri3). The bighorn sheep sequences used in the Dall's sheep (Ovis dalli dalli) exon capture aligned to 2350 genes on the oviAri3 genome with an average of 2 exons each. We developed a microfluidic qPCR-based SNP chip to genotype 476 Dall's sheep from locations across their range and test for patterns of selection. Using multiple corroborating approaches (lositan and bayescan), we detected 28 SNP loci potentially under selection. We additionally identified candidate loci significantly associated with latitude, longitude, precipitation and temperature, suggesting local environmental adaptation. The three methods demonstrated consistent support for natural selection on nine genes with immune and disease-regulating functions (e.g. Ovar-DRA, APC, BATF2, MAGEB18), cell regulation signalling pathways (e.g. KRIT1, PI3K, ORRC3), and respiratory health (CYSLTR1). Characterizing adaptive allele distributions from novel genetic techniques will facilitate investigation of the influence of environmental variation on local adaptation of a northern alpine ungulate throughout its range. This research demonstrated the utility of exon capture for gene-targeted SNP discovery and subsequent SNP chip genotyping using low-quality samples in a nonmodel species. PMID:27327375

  19. SNP Discovery in the Transcriptome of White Pacific Shrimp Litopenaeus vannamei by Next Generation Sequencing

    PubMed Central

    Yu, Yang; Wei, Jiankai; Zhang, Xiaojun; Liu, Jingwen; Liu, Chengzhang; Li, Fuhua; Xiang, Jianhai

    2014-01-01

    The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP) discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei) generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp) and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies. PMID:24498047

  20. Estrogen, SNP-Dependent Chemokine Expression and Selective Estrogen Receptor Modulator Regulation.

    PubMed

    Ho, Ming-Fen; Bongartz, Tim; Liu, Mohan; Kalari, Krishna R; Goss, Paul E; Shepherd, Lois E; Goetz, Matthew P; Kubo, Michiaki; Ingle, James N; Wang, Liewei; Weinshilboum, Richard M

    2016-03-01

    We previously reported, on the basis of a genome-wide association study for aromatase inhibitor-induced musculoskeletal symptoms, that single-nucleotide polymorphisms (SNPs) near the T-cell leukemia/lymphoma 1A (TCL1A) gene were associated with aromatase inhibitor-induced musculoskeletal pain and with estradiol (E2)-induced TCL1A expression. Furthermore, variation in TCL1A expression influenced the downstream expression of proinflammatory cytokines and cytokine receptors. Specifically, the top hit genome-wide association study SNP, rs11849538, created a functional estrogen response element (ERE) that displayed estrogen receptor (ER) binding and increased E2 induction of TCL1A expression only for the variant SNP genotype. In the present study, we pursued mechanisms underlying the E2-SNP-dependent regulation of TCL1A expression and, in parallel, our subsequent observations that SNPs at a distance from EREs can regulate ERα binding and that ER antagonists can reverse phenotypes associated with those SNPs. Specifically, we performed a series of functional genomic studies using a large panel of lymphoblastoid cell lines with dense genomic data that demonstrated that TCL1A SNPs at a distance from EREs can modulate ERα binding and expression of TCL1A as well as the expression of downstream immune mediators. Furthermore, 4-hydroxytamoxifen or fulvestrant could reverse these SNP-genotype effects. Similar results were found for SNPs in the IL17A cytokine and CCR6 chemokine receptor genes. These observations greatly expand our previous results and support the existence of a novel molecular mechanism that contributes to the complex interplay between estrogens and immune systems. They also raise the possibility of the pharmacological manipulation of the expression of proinflammatory cytokines and chemokines in a SNP genotype-dependent fashion. PMID:26866883

  1. Development and Characterization of a High Density SNP Genotyping Assay for Cattle

    PubMed Central

    Matukumalli, Lakshmi K.; Lawley, Cynthia T.; Schnabel, Robert D.; Taylor, Jeremy F.; Allan, Mark F.; Heaton, Michael P.; O'Connell, Jeff; Moore, Stephen S.; Smith, Timothy P. L.; Sonstegard, Tad S.; Van Tassell, Curtis P.

    2009-01-01

    The success of genome-wide association (GWA) studies for the detection of sequence variation affecting complex traits in human has spurred interest in the use of large-scale high-density single nucleotide polymorphism (SNP) genotyping for the identification of quantitative trait loci (QTL) and for marker-assisted selection in model and agricultural species. A cost-effective and efficient approach for the development of a custom genotyping assay interrogating 54,001 SNP loci to support GWA applications in cattle is described. A novel algorithm for achieving a compressed inter-marker interval distribution proved remarkably successful, with median interval of 37 kb and maximum predicted gap of <350 kb. The assay was tested on a panel of 576 animals from 21 cattle breeds and six outgroup species and revealed that from 39,765 to 46,492 SNP are polymorphic within individual breeds (average minor allele frequency (MAF) ranging from 0.24 to 0.27). The assay also identified 79 putative copy number variants in cattle. Utility for GWA was demonstrated by localizing known variation for coat color and the presence/absence of horns to their correct genomic locations. The combination of SNP selection and the novel spacing algorithm allows an efficient approach for the development of high-density genotyping platforms in species having full or even moderate quality draft sequence. Aspects of the approach can be exploited in species which lack an available genome sequence. The BovineSNP50 assay described here is commercially available from Illumina and provides a robust platform for mapping disease genes and QTL in cattle. PMID:19390634

  2. De novo sequencing of sunflower genome for SNP discovery using RAD (Restriction site Associated DNA) approach

    PubMed Central

    2013-01-01

    Background Application of Single Nucleotide Polymorphism (SNP) marker technology as a tool in sunflower breeding programs offers enormous potential to improve sunflower genetics, and facilitate faster release of sunflower hybrids to the market place. Through a National Sunflower Association (NSA) funded initiative, we report on the process of SNP discovery through reductive genome sequencing and local assembly of six diverse sunflower inbred lines that represent oil as well as confection types. Results A combination of Restriction site Associated DNA Sequencing (RAD-Seq) protocols and Illumina paired-end sequencing chemistry generated high quality 89.4 M paired end reads from the six lines which represent 5.3 GB of the sequencing data. Raw reads from the sunflower line, RHA 464 were assembled de novo to serve as a framework reference genome. About 15.2 Mb of sunflower genome distributed over 42,267 contigs were obtained upon assembly of RHA 464 sequencing data, the contig lengths ranged from 200 to 950 bp with an N50 length of 393 bp. SNP calling was performed by aligning sequencing data from the six sunflower lines to the assembled reference RHA 464. On average, 1 SNP was located every 143 bp of the sunflower genome sequence. Based on several filtering criteria, a final set of 16,467 putative sequence variants with characteristics favorable for Illumina Infinium Genotyping Technology (IGT) were mined from the sequence data generated across six diverse sunflower lines. Conclusion Here we report the molecular and computational methodology involved in SNP development for a complex genome like sunflower lacking reference assembly, offering an attractive tool for molecular breeding purposes in sunflower. PMID:23947483

  3. Single-nucleotide polymorphism-array improves detection rate of genomic alterations in core-binding factor leukemia.

    PubMed

    Costa, Ana Rosa da Silveira; Vasudevan, Anupama; Krepischi, Ana; Rosenberg, Carla; Chauffaille, Maria de Lourdes L F

    2013-01-01

    Acute myeloid leukemia (AML) is a group of clonal diseases, resulting from two classes of mutation. Investigation for additional abnormalities associated with a well-recognized subtype, core-binding factor AML (CBF-AML) can provide further understanding and discrimination to this special group of leukemia. In order to better define genetic alterations in CBF-AML and identify possible cooperating lesions, a single-nucleotide polymorphism-array (SNP-array) analysis was performed, combined to KIT mutation screening, in a set of cases. Validation of SNP-array results was done by array comparative genomic hybridization and FISH. Fifteen cases were analyzed. Three cases had microscopic lesions better delineated by arrays. One case had +22 not identified by arrays. Submicroscopic abnormalities were mostly non-recurrent between samples. Of relevance, four regions were more frequently affected: 4q28, 9p11, 16q22.1, and 16q23. One case had an uncovered unbalanced inv(16) due to submicroscopic deletion of 5´MYH11 and 3´CBFB. Telomeric and large copy number neutral loss of heterozygosity (CNN-LOH) regions (>25 Mb), likely representing uniparental disomy, were detected in four out of fifteen cases. Only three cases had mutation on KIT gene, enhancing the role of abnormalities by SNP-array as presumptive cooperating alterations. Molecular karyotyping can add valuable information to metaphase karyotype analysis, emerging as an important tool to uncover and characterize microscopic, submicroscopic genomic alterations, and CNN-LOH events in the search for cooperating lesions. PMID:23636907

  4. Superconducting Bolometer Array Architectures

    NASA Technical Reports Server (NTRS)

    Benford, Dominic; Chervenak, Jay; Irwin, Kent; Moseley, S. Harvey; Shafer, Rick; Staguhn, Johannes; Wollack, Ed; Oegerle, William (Technical Monitor)

    2002-01-01

    The next generation of far-infrared and submillimeter instruments require large arrays of detectors containing thousands of elements. These arrays will necessarily be multiplexed, and superconducting bolometer arrays are the most promising present prospect for these detectors. We discuss our current research into superconducting bolometer array technologies, which has recently resulted in the first multiplexed detections of submillimeter light and the first multiplexed astronomical observations. Prototype arrays containing 512 pixels are in production using the Pop-Up Detector (PUD) architecture, which can be extended easily to 1000 pixel arrays. Planar arrays of close-packed bolometers are being developed for the GBT (Green Bank Telescope) and for future space missions. For certain applications, such as a slewed far-infrared sky survey, feedhorncoupling of a large sparsely-filled array of bolometers is desirable, and is being developed using photolithographic feedhorn arrays. Individual detectors have achieved a Noise Equivalent Power (NEP) of -10(exp 17) W/square root of Hz at 300mK, but several orders of magnitude improvement are required and can be reached with existing technology. The testing of such ultralow-background detectors will prove difficult, as this requires optical loading of below IfW. Antenna-coupled bolometer designs have advantages for large format array designs at low powers due to their mode selectivity.

  5. Detection of Hereditary 1,25-Hydroxyvitamin D-Resistant Rickets Caused by Uniparental Disomy of Chromosome 12 Using Genome-Wide Single Nucleotide Polymorphism Array

    PubMed Central

    Tamura, Mayuko; Isojima, Tsuyoshi; Kawashima, Minae; Yoshida, Hideki; Yamamoto, Keiko; Kitaoka, Taichi; Namba, Noriyuki; Oka, Akira; Ozono, Keiichi; Tokunaga, Katsushi; Kitanaka, Sachiko

    2015-01-01

    Context Hereditary 1,25-dihydroxyvitamin D-resistant rickets (HVDRR) is an autosomal recessive disease caused by biallelic mutations in the vitamin D receptor (VDR) gene. No patients have been reported with uniparental disomy (UPD). Objective Using genome-wide single nucleotide polymorphism (SNP) array to confirm whether HVDRR was caused by UPD of chromosome 12. Materials and Methods A 2-year-old girl with alopecia and short stature and without any family history of consanguinity was diagnosed with HVDRR by typical laboratory data findings and clinical features of rickets. Sequence analysis of VDR was performed, and the origin of the homozygous mutation was investigated by target SNP sequencing, short tandem repeat analysis, and genome-wide SNP array. Results The patient had a homozygous p.Arg73Ter nonsense mutation. Her mother was heterozygous for the mutation, but her father was negative. We excluded gross deletion of the father’s allele or paternal discordance. Genome-wide SNP array of the family (the patient and her parents) showed complete maternal isodisomy of chromosome 12. She was successfully treated with high-dose oral calcium. Conclusions This is the first report of HVDRR caused by UPD, and the third case of complete UPD of chromosome 12, in the published literature. Genome-wide SNP array was useful for detecting isodisomy and the parental origin of the allele. Comprehensive examination of the homozygous state is essential for accurate genetic counseling of recurrence risk and appropriate monitoring for other chromosome 12 related disorders. Furthermore, oral calcium therapy was effective as an initial treatment for rickets in this instance. PMID:26153892

  6. Electronic Switch Arrays for Managing Microbattery Arrays

    NASA Technical Reports Server (NTRS)

    Mojarradi, Mohammad; Alahmad, Mahmoud; Sukumar, Vinesh; Zghoul, Fadi; Buck, Kevin; Hess, Herbert; Li, Harry; Cox, David

    2008-01-01

    Integrated circuits have been invented for managing the charging and discharging of such advanced miniature energy-storage devices as planar arrays of microscopic energy-storage elements [typically, microscopic electrochemical cells (microbatteries) or microcapacitors]. The architecture of these circuits enables implementation of the following energy-management options: dynamic configuration of the elements of an array into a series or parallel combination of banks (subarrarys), each array comprising a series of parallel combination of elements; direct addressing of individual banks for charging/or discharging; and, disconnection of defective elements and corresponding reconfiguration of the rest of the array to utilize the remaining functional elements to obtain the desited voltage and current performance. An integrated circuit according to the invention consists partly of a planar array of field-effect transistors that function as switches for routing electric power among the energy-storage elements, the power source, and the load. To connect the energy-storage elements to the power source for charging, a specific subset of switches is closed; to connect the energy-storage elements to the load for discharging, a different specific set of switches is closed. Also included in the integrated circuit is circuitry for monitoring and controlling charging and discharging. The control and monitoring circuitry, the switching transistors, and interconnecting metal lines are laid out on the integrated-circuit chip in a pattern that registers with the array of energy-storage elements. There is a design option to either (1) fabricate the energy-storage elements in the corresponding locations on, and as an integral part of, this integrated circuit; or (2) following a flip-chip approach, fabricate the array of energy-storage elements on a separate integrated-circuit chip and then align and bond the two chips together.

  7. Identification of Mendelian inconsistencies between SNP and pedigree information of sibs

    PubMed Central

    2011-01-01

    Background Using SNP genotypes to apply genomic selection in breeding programs is becoming common practice. Tools to edit and check the quality of genotype data are required. Checking for Mendelian inconsistencies makes it possible to identify animals for which pedigree information and genotype information are not in agreement. Methods Straightforward tests to detect Mendelian inconsistencies exist that count the number of opposing homozygous marker (e.g. SNP) genotypes between parent and offspring (PAR-OFF). Here, we develop two tests to identify Mendelian inconsistencies between sibs. The first test counts SNP with opposing homozygous genotypes between sib pairs (SIBCOUNT). The second test compares pedigree and SNP-based relationships (SIBREL). All tests iteratively remove animals based on decreasing numbers of inconsistent parents and offspring or sibs. The PAR-OFF test, followed by either SIB test, was applied to a dataset comprising 2,078 genotyped cows and 211 genotyped sires. Theoretical expectations for distributions of test statistics of all three tests were calculated and compared to empirically derived values. Type I and II error rates were calculated after applying the tests to the edited data, while Mendelian inconsistencies were introduced by permuting pedigree against genotype data for various proportions of animals. Results Both SIB tests identified animal pairs for which pedigree and genomic relationships could be considered as inconsistent by visual inspection of a scatter plot of pairwise pedigree and SNP-based relationships. After removal of 235 animals with the PAR-OFF test, SIBCOUNT (SIBREL) identified 18 (22) additional inconsistent animals. Seventeen animals were identified by both methods. The numbers of incorrectly deleted animals (Type I error), were equally low for both methods, while the numbers of incorrectly non-deleted animals (Type II error), were considerably higher for SIBREL compared to SIBCOUNT. Conclusions Tests to remove

  8. Detection of selective sweeps in cattle using genome-wide SNP data

    PubMed Central

    2013-01-01

    Background The domestication and subsequent selection by humans to create breeds and biological types of cattle undoubtedly altered the patterning of variation within their genomes. Strong selection to fix advantageous large-effect mutations underlying domesticability, breed characteristics or productivity created selective sweeps in which variation was lost in the chromosomal region flanking the selected allele. Selective sweeps have now been identified in the genomes of many animal species including humans, dogs, horses, and chickens. Here, we attempt to identify and characterise regions of the bovine genome that have been subjected to selective sweeps. Results Two datasets were used for the discovery and validation of selective sweeps via the fixation of alleles at a series of contiguous SNP loci. BovineSNP50 data were used to identify 28 putative sweep regions among 14 diverse cattle breeds. Affymetrix BOS 1 prescreening assay data for five breeds were used to identify 85 regions and validate 5 regions identified using the BovineSNP50 data. Many genes are located within these regions and the lack of sequence data for the analysed breeds precludes the nomination of selected genes or variants and limits the prediction of the selected phenotypes. However, phenotypes that we predict to have historically been under strong selection include horned-polled, coat colour, stature, ear morphology, and behaviour. Conclusions The bias towards common SNPs in the design of the BovineSNP50 assay led to the identification of recent selective sweeps associated with breed formation and common to only a small number of breeds rather than ancient events associated with domestication which could potentially be common to all European taurines. The limited SNP density, or marker resolution, of the BovineSNP50 assay significantly impacted the rate of false discovery of selective sweeps, however, we found sweeps in common between breeds which were confirmed using an ultra

  9. Strong effect of SNP rs4988300 of the LRP5 gene on bone phenotype of Caucasian postmenopausal women.

    PubMed

    Horváth, Péter; Balla, Bernadett; Kósa, János P; Tóbiás, Bálint; Szili, Balázs; Kirschner, Gyöngyi; Győri, Gabriella; Kató, Karina; Lakatos, Péter; Takács, István

    2016-01-01

    The purpose of this study was to identify relationships between single nucleotide polymorphisms (SNPs) in the genes of the Wnt pathway and bone mineral density (BMD) of postmenopausal women. We chose this pathway due to its importance in bone metabolism that was underlined in several studies. DNA samples of 932 Hungarian postmenopausal women were studied. First, their BMD values at different sites (spine, total hip) were measured, using a Lunar Prodigy DXA scanner. Thereafter, T-score values and the patients' body mass indices (BMIs) were calculated, while information about the fracture history of the sample population was also collected. We genotyped nine SNPs of the following three genes: LRP5, GPR177, and SP7, using a Sequenom MassARRAY Analyzer 4 instrument. The genomic DNA samples used for genotyping were extracted from the buccal mucosa of the subjects. Statistical analyses were carried out using the SPSS 21 and R package. The results of this analysis showed a significant association between SNP rs4988300 of the LRP5 gene and total hip BMD values. We could not reveal any associations between the markers of GPR177, SP7, and bone phenotypes. We found no effect of these genotypes on fracture risk. We could demonstrate a significant gene-gene interaction between two SNPs of LRP5 (rs4988300 and rs634008, p = 0.009) which was lost after Bonferroni correction. We could firmly demonstrate a significant association between rs4988300 of the LRP5 gene and bone density of the hip on the largest homogeneous postmenopausal study group analyzed to date. Our finding corroborates the relationship between LRP5 genotype and bone phenotype in postmenopausal women, however, the complete mechanism of this relationship requires further investigations. PMID:25762437

  10. Designing linear systolic arrays

    SciTech Connect

    Kumar, V.K.P.; Tsai, Y.C. . Dept. of Electrical Engineering)

    1989-12-01

    The authors develop a simple mapping technique to design linear systolic arrays. The basic idea of the technique is to map the computations of a certain class of two-dimensional systolic arrays onto one-dimensional arrays. Using this technique, systolic algorithms are derived for problems such as matrix multiplication and transitive closure on linearly connected arrays of PEs with constant I/O bandwidth. Compared to known designs in the literature, the technique leads to modular systolic arrays with constant hardware in each PE, few control lines, lexicographic data input/output, and improved delay time. The unidirectional flow of control and data in this design assures implementation of the linear array in the known fault models of wafer scale integration.

  11. Reconfigurable mosaic annular arrays.

    PubMed

    Thomenius, Kai E; Wodnicki, Robert; Cogan, Scott D; Fisher, Rayette A; Burdick, Bill; Smith, L Scott; Khuri-Yakub, Pierre; Lin, Der-Song; Zhuang, Xuefeng; Bonitz, Barry; Davies, Todd; Thomas, Glen; Woychik, Charles

    2014-07-01

    Mosaic annular arrays (MAA) based on reconfigurable array (RA) transducer electronics assemblies are presented as a potential solution for future highly integrated ultrasonic transducer subsystems. Advantages of MAAs include excellent beam quality and depth of field resulting from superior elevational focus compared with 1-D electronically scanned arrays, as well as potentially reduced cost, size, and power consumption resulting from the use of a limited number of beamforming channels for processing a large number of subelements. Specific design tradeoffs for these highly integrated arrays are discussed in terms of array specifications for center frequency, element pitch, and electronic switch-on resistance. Large-area RAs essentially function as RC delay lines. Efficient architectures which take into account RC delay effects are presented. Architectures for integration of the transducer and electronics layers of large-area array implementations are reviewed. PMID:24960699

  12. Carbon nanotube nanoelectrode arrays

    DOEpatents

    Ren, Zhifeng; Lin, Yuehe; Yantasee, Wassana; Liu, Guodong; Lu, Fang; Tu, Yi

    2008-11-18

    The present invention relates to microelectode arrays (MEAs), and more particularly to carbon nanotube nanoelectrode arrays (CNT-NEAs) for chemical and biological sensing, and methods of use. A nanoelectrode array includes a carbon nanotube material comprising an array of substantially linear carbon nanotubes each having a proximal end and a distal end, the proximal end of the carbon nanotubes are attached to a catalyst substrate material so as to form the array with a pre-determined site density, wherein the carbon nanotubes are aligned with respect to one another within the array; an electrically insulating layer on the surface of the carbon nanotube material, whereby the distal end of the carbon nanotubes extend beyond the electrically insulating layer; a second adhesive electrically insulating layer on the surface of the electrically insulating layer, whereby the distal end of the carbon nanotubes extend beyond the second adhesive electrically insulating layer; and a metal wire attached to the catalyst substrate material.

  13. Solar array automation limitations

    NASA Technical Reports Server (NTRS)

    Trumble, Terry M.

    1990-01-01

    Significant progress in the automation of the spacecraft electrical power systems has been made within the past few years. This is especially important with the development of the space station and the increasing demand on the electrical power systems for future satellites. The key element of the spacecraft power system, the solar arrays which supply the power, will have to grow to supply many tens of kilowatts of power within the next twenty years. This growth will be accompanied by the problems associated with large distributed power systems. The growth of the arrays, the on-array management problems and potential solutions to array degradation or failure are discussed. Multilowatt arrays for unmanned spacecraft with comments on the implications of array degradation for manned spacecraft are discussed.

  14. Development and characterization of a microheater array device for real-time DNA mutation detection

    NASA Astrophysics Data System (ADS)

    Williams, Layne; Okandan, Murat; Chagovetz, Alex; Blair, Steve

    2008-04-01

    DNA analysis, specifically single nucleotide polymorphism (SNP) detection, is becoming increasingly important in rapid diagnostics and disease detection. Temperature is often controlled to help speed reaction rates and perform melting of hybridized oligonucleotides. The difference in melting temperatures, Tm, between wild-type and SNP sequences, respectively, to a given probe oligonucleotide, is indicative of the specificity of the reaction. We have characterized Tm's in solution and on a solid substrate of three sequences from known mutations associated with Cystic Fibrosis. Taking advantage of Tm differences, a microheater array device was designed to enable individual temperature control of up to 18 specific hybridization events. The device was fabricated at Sandia National Laboratories using surface micromachining techniques. The microheaters have been characterized using an IR camera at Sandia and show individual temperature control with minimal thermal cross talk. Development of the device as a real-time DNA detection platform, including surface chemistry and associated microfluidics, is described.

  15. Development and characterization of a microheater array device for real-time DNA mutation detection

    NASA Astrophysics Data System (ADS)

    Williams, Layne; Okandan, Murat; Chagovetz, Alex; Blair, Steve

    2008-02-01

    DNA analysis, specifically single nucleotide polymorphism (SNP) detection, is becoming increasingly important in rapid diagnostics and disease detection. Temperature is often controlled to help speed reaction rates and perform melting of hybridized oligonucleotides. The difference in melting temperatures, Tm, between wild-type and SNP sequences, respectively, to a given probe oligonucleotide, is indicative of the specificity of the reaction. We have characterized Tm's in solution and on a solid substrate of three sequences from known mutations associated with Cystic Fibrosis. Taking advantage of Tm differences, a microheater array device was designed to enable individual temperature control of up to 18 specific hybridization events. The device was fabricated at Sandia National Laboratories using surface micromachining techniques. The microheaters have been characterized using an IR camera at Sandia and show individual temperature control with minimal thermal cross talk. Development of the device as a real-time DNA detection platform, including surface chemistry and associated microfluidics, is described.

  16. Pacific Array (Transportable Broadband Ocean Floor Array)

    NASA Astrophysics Data System (ADS)

    Kawakatsu, Hitoshi; Ekstrom, Goran; Evans, Rob; Forsyth, Don; Gaherty, Jim; Kennett, Brian; Montagner, Jean-Paul; Utada, Hisashi

    2016-04-01

    Based on recent developments on broadband ocean bottom seismometry, we propose a next generation large-scale array experiment in the ocean. Recent advances in ocean bottom broadband seismometry1, together with advances in the seismic analysis methodology, have enabled us to resolve the regional 1-D structure of the entire lithosphere/asthenosphere system, including seismic anisotropy (azimuthal, and hopefully radial), with deployments of ~15 broadband ocean bottom seismometers (BBOBSs). Having ~15 BBOBSs as an array unit for a 2-year deployment, and repeating such deployments in a leap-frog way or concurrently (an array of arrays) for a decade or so would enable us to cover a large portion of the Pacific basin. Such efforts, not only by giving regional constraints on the 1-D structure beneath Pacific ocean, but also by sharing waveform data for global scale waveform tomography, would drastically increase our knowledge of how plate tectonics works on this planet, as well as how it worked for the past 150 million years. International collaborations is essential: if three countries/institutions participate this endeavor together, Pacific Array may be accomplished within five-or-so years.

  17. Microstrip monpulse dipole array

    NASA Astrophysics Data System (ADS)

    Miccioli, W.; Toth, J.; Sa, N.; Lewis, M.

    1985-01-01

    The development of a microstrip radiating aperture utilizing multiple microstrip dipole radiators fed by a resonant feed configuration is described. This array combines an efficient capacitively coupled radiator feeding mechanism with a planar power divider configuration to achieve an extremely thin, lightweight antenna aperture. Linear array dipole matching theory and radiator bandwidth improvement techniques are also described. A quadrant based microstrip monopulse antenna was constructed. Experimental data from this array, its subassemblies and individual components are presented and compared to analytical predictions.

  18. Distributed array radar

    NASA Astrophysics Data System (ADS)

    Heimiller, R. C.; Belyea, J. E.; Tomlinson, P. G.

    1983-11-01

    Distributed array radar (DAR) is a concept for efficiently accomplishing surveillance and tracking using coherently internetted mini-radars. They form a long baseline, very thinned array and are capable of very accurate location of targets. This paper describes the DAR concept. Factors involving two-way effective gain patterns for deterministic and random DAR arrays are analyzed and discussed. An analysis of factors affecting signal-to-noise ratio is presented and key technical and performance issues are briefly summarized.

  19. Telescope Array Experiment

    NASA Astrophysics Data System (ADS)

    Kawai, H.; Yoshida, S.; Yoshii, H.; Tanaka, K.; Cohen, F.; Fukushima, M.; Hayashida, N.; Hiyama, K.; Ikeda, D.; Kido, E.; Kondo, Y.; Nonaka, T.; Ohnishi, M.; Ohoka, H.; Ozawa, S.; Sagawa, H.; Sakurai, N.; Shibata, T.; Shimodaira, H.; Takeda, M.; Taketa, A.; Takita, M.; Tokuno, H.; Torii, R.; Udo, S.; Yamakawa, Y.; Fujii, H.; Matsuda, T.; Tanaka, M.; Yamaoka, H.; Hibino, K.; Benno, T.; Doura, K.; Chikawa, M.; Nakamura, T.; Teshima, M.; Kadota, K.; Uchihori, Y.; Hayashi, K.; Hayashi, Y.; Kawakami, S.; Matsuyama, T.; Minamino, M.; Ogio, S.; Ohshima, A.; Okuda, T.; Shimizu, N.; Tanaka, H.; Bergman, D. R.; Hughes, G.; Stratton, S.; Thomson, G. B.; Endo, A.; Inoue, N.; Kawana, S.; Wada, Y.; Kasahara, K.; Azuma, R.; Iguchi, T.; Kakimoto, F.; Machida, S.; Misumi, K.; Murano, Y.; Tameda, Y.; Tsunesada, Y.; Chiba, J.; Miyata, K.; Abu-Zayyad, T.; Belz, J. W.; Cady, R.; Cao, Z.; Huentemeyer, P.; Jui, C. C. H.; Martens, K.; Matthews, J. N.; Mostofa, M.; Smith, J. D.; Sokolsky, P.; Springer, R. W.; Thomas, J. R.; Thomas, S. B.; Wiencke, L. R.; Doyle, T.; Taylor, M. J.; Wickwar, V. B.; Wilkerson, T. D.; Hashimoto, K.; Honda, K.; Ikuta, K.; Ishii, T.; Kanbe, T.; Tomida, T.

    2008-01-01

    The TA observatory is a hybrid detector system consisting of both a surface detector array as well as a set of fluorescence detectors. The observatory will measure the energy spectrum, anisotropy and composition of ultra-high energy cosmic rays. The surface detectors are being deployed and the array should be complete by the end of February, 2007. We will soon be collecting hybrid data at the Telecope Array.

  20. Electrically reconfigurable logic array

    NASA Technical Reports Server (NTRS)

    Agarwal, R. K.

    1982-01-01

    To compose the complicated systems using algorithmically specialized logic circuits or processors, one solution is to perform relational computations such as union, division and intersection directly on hardware. These relations can be pipelined efficiently on a network of processors having an array configuration. These processors can be designed and implemented with a few simple cells. In order to determine the state-of-the-art in Electrically Reconfigurable Logic Array (ERLA), a survey of the available programmable logic array (PLA) and the logic circuit elements used in such arrays was conducted. Based on this survey some recommendations are made for ERLA devices.

  1. Integrated avalanche photodiode arrays

    DOEpatents

    Harmon, Eric S.

    2015-07-07

    The present disclosure includes devices for detecting photons, including avalanche photon detectors, arrays of such detectors, and circuits including such arrays. In some aspects, the detectors and arrays include a virtual beveled edge mesa structure surrounded by resistive material damaged by ion implantation and having side wall profiles that taper inwardly towards the top of the mesa structures, or towards the direction from which the ion implantation occurred. Other aspects are directed to masking and multiple implantation and/or annealing steps. Furthermore, methods for fabricating and using such devices, circuits and arrays are disclosed.

  2. Spatial Structure and Climatic Adaptation in African Maize Revealed by Surveying SNP Diversity in Relation to Global Breeding and Landrace Panels

    PubMed Central

    Westengen, Ola T.; Berg, Paul R.; Kent, Matthew P.; Brysting, Anne K.

    2012-01-01

    Background Climate change threatens maize productivity in sub-Saharan Africa. To ensure food security, access to locally adapted genetic resources and varieties is an important adaptation measure. Most of the maize grown in Africa is a genetic mix of varieties introduced at different historic times following the birth of the trans-Atlantic economy, and knowledge about geographic structure and local adaptations is limited. Methodology A panel of 48 accessions of maize representing various introduction routes and sources of historic and recent germplasm introductions in Africa was genotyped with the MaizeSNP50 array. Spatial genetic structure and genetic relationships in the African panel were analysed separately and in the context of a panel of 265 inbred lines representing global breeding material (based on 26,900 SNPs) and a panel of 1127 landraces from the Americas (270 SNPs). Environmental association analysis was used to detect SNPs associated with three climatic variables based on the full 43,963 SNP dataset. Conclusions The genetic structure is consistent between subsets of the data and the markers are well suited for resolving relationships and admixture among the accessions. The African accessions are structured in three clusters reflecting historical and current patterns of gene flow from the New World and within Africa. The Sahelian cluster reflects original introductions of Meso-American landraces via Europe and a modern introduction of temperate breeding material. The Western cluster reflects introduction of Coastal Brazilian landraces, as well as a Northeast-West spread of maize through Arabic trade routes across the continent. The Eastern cluster most strongly reflects gene flow from modern introduced tropical varieties. Controlling for population history in a linear model, we identify 79 SNPs associated with maximum temperature during the growing season. The associations located in genes of known importance for abiotic stress tolerance are

  3. Pigment phenotype and biogeographical ancestry from ancient skeletal remains: inferences from multiplexed autosomal SNP analysis.

    PubMed

    Bouakaze, Caroline; Keyser, Christine; Crubézy, Eric; Montagnon, Daniel; Ludes, Bertrand

    2009-07-01

    In the present study, a multiplexed genotyping assay for ten single nucleotide polymorphisms (SNPs) located within six pigmentation candidate genes was developed on modern biological samples and applied to DNA retrieved from 25 archeological human remains from southern central Siberia dating from the Bronze and Iron Ages. SNP genotyping was successful for the majority of ancient samples and revealed that most probably had typical European pigment features, i.e., blue or green eye color, light hair color and skin type, and were likely of European individual ancestry. To our knowledge, this study reports for the first time the multiplexed typing of autosomal SNPs on aged and degraded DNA. By providing valuable information on pigment traits of an individual and allowing individual biogeographical ancestry estimation, autosomal SNP typing can improve ancient DNA studies and aid human identification in some forensic casework situations when used to complement conventional molecular markers. PMID:19415315

  4. SNPit: a federated data integration system for the purpose of functional SNP annotation.

    PubMed

    Shen, Terry H; Carlson, Christopher S; Tarczy-Hornoch, Peter

    2009-08-01

    Genome wide association studies can potentially identify the genetic causes behind the majority of human diseases. With the advent of more advanced genotyping techniques, there is now an explosion of data gathered on single nucleotide polymorphisms (SNPs). The need exists for an integrated system that can provide up-to-date functional annotation information on SNPs. We have developed the SNP Integration Tool (SNPit) system to address this need. Built upon a federated data integration system, SNPit provides current information on a comprehensive list of SNP data sources. Additional logical inference analysis was included through an inference engine plug in. The SNPit web servlet is available online for use. SNPit allows users to go to one source for up-to-date information on the functional annotation of SNPs. A tool that can help to integrate and analyze the potential functional significance of SNPs is important for understanding the results from genome wide association studies. PMID:19327864

  5. Slider—maximum use of probability information for alignment of short sequence reads and SNP detection

    PubMed Central

    Malhis, Nawar; Butterfield, Yaron S. N.; Ester, Martin; Jones, Steven J. M.

    2009-01-01

    Motivation: A plethora of alignment tools have been created that are designed to best fit different types of alignment conditions. While some of these are made for aligning Illumina Sequence Analyzer reads, none of these are fully utilizing its probability (prb) output. In this article, we will introduce a new alignment approach (Slider) that reduces the alignment problem space by utilizing each read base's probabilities given in the prb files. Results: Compared with other aligners, Slider has higher alignment accuracy and efficiency. In addition, given that Slider matches bases with probabilities other than the most probable, it significantly reduces the percentage of base mismatches. The result is that its SNP predictions are more accurate than other SNP prediction approaches used today that start from the most probable sequence, including those using base quality. Contact: nmalhis@bcgsc.ca Supplementary information and availability: http://www.bcgsc.ca/platform/bioinfo/software/slider PMID:18974170

  6. Solar array stepping to minimize array excitation

    NASA Technical Reports Server (NTRS)

    Bhat, Mahabaleshwar K. P. (Inventor); Liu, Tung Y. (Inventor); Plescia, Carl T. (Inventor)

    1989-01-01

    Mechanical oscillations of a mechanism containing a stepper motor, such as a solar-array powered spacecraft, are reduced and minimized by the execution of step movements in pairs of steps, the period between steps being equal to one-half of the period of torsional oscillation of the mechanism. Each pair of steps is repeated at needed intervals to maintain desired continuous movement of the portion of elements to be moved, such as the solar array of a spacecraft. In order to account for uncertainty as well as slow change in the period of torsional oscillation, a command unit may be provided for varying the interval between steps in a pair.

  7. Focal plane array with modular pixel array components for scalability

    SciTech Connect

    Kay, Randolph R; Campbell, David V; Shinde, Subhash L; Rienstra, Jeffrey L; Serkland, Darwin K; Holmes, Michael L

    2014-12-09

    A modular, scalable focal plane array is provided as an array of integrated circuit dice, wherein each die includes a given amount of modular pixel array circuitry. The array of dice effectively multiplies the amount of modular pixel array circuitry to produce a larger pixel array without increasing die size. Desired pixel pitch across the enlarged pixel array is preserved by forming die stacks with each pixel array circuitry die stacked on a separate die that contains the corresponding signal processing circuitry. Techniques for die stack interconnections and die stack placement are implemented to ensure that the desired pixel pitch is preserved across the enlarged pixel array.

  8. Applying SNP-Derived Molecular Coancestry Estimates to Captive Breeding Programs.

    PubMed

    Ivy, Jamie A; Putnam, Andrea S; Navarro, Asako Y; Gurr, Jessica; Ryder, Oliver A

    2016-09-01

    Captive breeding programs for wildlife species typically rely on pedigrees to inform genetic management. Although pedigree-based breeding strategies are quite effective at retaining long-term genetic variation, management of zoo-based breeding programs continues to be hampered when pedigrees are poorly known. The objective of this study was to evaluate 2 options for generating single nucleotide polymorphism (SNP) data to resolve unknown relationships within captive breeding programs. We generated SNP data for a zoo-based population of addax (Addax nasomasculatus) using both the Illumina BovineHD BeadChip and double digest restriction site-associated DNA (ddRAD) sequencing. Our results demonstrated that estimates of allele sharing (AS) between pairs of individuals exhibited low variances. Average AS variances were highest when using 50 loci (SNPchipall = 0.00159; ddRADall = 0.0249), but fell below 0.0003 for the SNP chip dataset when sampling ≥250 loci and below 0.0025 for the ddRAD dataset when sampling ≥500 loci. Furthermore, the correlation between the SNPchipall and ddRADall AS datasets was 0.88 (95%CI = 0.84-0.91) when subsampling 500 loci. Collectively, our results indicated that both SNP genotyping methods produced sufficient data for accurately estimating relationships, even within an extremely bottlenecked population. Our results also suggested that analytic assumptions historically integrated into the addax pedigree are not adversely impacting long-term pedigree-based management; kinships calculated from the analytic pedigree were significantly correlated (P < 0.001) with AS estimates. Overall, our conclusions are intended to serve as both a proof of concept and a model for applying molecular data to the genetic management of captive breeding programs. PMID:27208150

  9. MAFsnp: A Multi-Sample Accurate and Flexible SNP Caller Using Next-Generation Sequencing Data.

    PubMed

    Hu, Jiyuan; Li, Tengfei; Xiu, Zidi; Zhang, Hong

    2015-01-01

    Most existing statistical methods developed for calling single nucleotide polymorphisms (SNPs) using next-generation sequencing (NGS) data are based on Bayesian frameworks, and there does not exist any SNP caller that produces p-values for calling SNPs in a frequentist framework. To fill in this gap, we develop a new method MAFsnp, a Multiple-sample based Accurate and Flexible algorithm for calling SNPs with NGS data. MAFsnp is based on an estimated likelihood ratio test (eLRT) statistic. In practical situation, the involved parameter is very close to the boundary of the parametric space, so the standard large sample property is not suitable to evaluate the finite-sample distribution of the eLRT statistic. Observing that the distribution of the test statistic is a mixture of zero and a continuous part, we propose to model the test statistic with a novel two-parameter mixture distribution. Once the parameters in the mixture distribution are estimated, p-values can be easily calculated for detecting SNPs, and the multiple-testing corrected p-values can be used to control false discovery rate (FDR) at any pre-specified level. With simulated data, MAFsnp is shown to have much better control of FDR than the existing SNP callers. Through the application to two real datasets, MAFsnp is also shown to outperform the existing SNP callers in terms of calling accuracy. An R package "MAFsnp" implementing the new SNP caller is freely available at http://homepage.fudan.edu.cn/zhangh/softwares/. PMID:26309201

  10. Typing SNP based on the near-infrared spectroscopy and artificial neural network

    NASA Astrophysics Data System (ADS)

    Ren, Li; Wang, Wei-Peng; Gao, Yu-Zhen; Yu, Xiao-Wei; Xie, Hong-Ping

    2009-07-01

    Based on the near-infrared spectra (NIRS) of the measured samples as the discriminant variables of their genotypes, the genotype discriminant model of SNP has been established by using back-propagation artificial neural network (BP-ANN). Taking a SNP (857G > A) of N-acetyltransferase 2 (NAT2) as an example, DNA fragments containing the SNP site were amplified by the PCR method based on a pair of primers to obtain the three-genotype (GG, AA, and GA) modeling samples. The NIRS-s of the amplified samples were directly measured in transmission by using quartz cell. Based on the sample spectra measured, the two BP-ANN-s were combined to obtain the stronger ability of the three-genotype classification. One of them was established to compress the measured NIRS variables by using the resilient back-propagation algorithm, and another network established by Levenberg-Marquardt algorithm according to the compressed NIRS-s was used as the discriminant model of the three-genotype classification. For the established model, the root mean square error for the training and the prediction sample sets were 0.0135 and 0.0132, respectively. Certainly, this model could rightly predict the three genotypes (i.e. the accuracy of prediction samples was up to100%) and had a good robust for the prediction of unknown samples. Since the three genotypes of SNP could be directly determined by using the NIRS-s without any preprocessing for the analyzed samples after PCR, this method is simple, rapid and low-cost.

  11. Efficient Genome-Wide TagSNP Selection Across Populations via the Linkage Disequilibrium Criterion

    PubMed Central

    Wu, Yonghui; Lonardi, Stefano; Jiang, Tao

    2010-01-01

    Abstract In this article, we studied the tag single-nucleotide polymorphism (tagSNP) selection problem on multiple populations using the pairwise r2 linkage disequilibrium criterion. We proposed a novel combinatorial optimization model for the tagSNP selection problem, called the minimum common tagSNP selection (MCTS) problem, and presented efficient solutions for MCTS. Our approach consists of the following three main steps: (i) partitioning the SNP markers into small disjoint components, (ii) applying some data reduction rules to simplify the problem, and (iii) applying either a fast greedy algorithm or a Lagrangian relaxation algorithm to solve the remaining (general) MCTS. These algorithms also provide lower bounds on tagging (i.e., the minimum number of tagSNPs needed). The lower bounds allow us to evaluate how far our solution is from the optimum. To the best of our knowledge, it is the first time the tagging lower bounds are discussed in the literature. We assessed the performance of our algorithms on real HapMap data for genome-wide tagging. The experiments demonstrated that our algorithms run 3–4 orders of magnitude faster than the existing single-population tagging programs such as FESTA, LD-Select, and the multiple-population tagging method MultiPop-TagSelect. Our method also greatly reduced the required tagSNPs compared with LD-Select on a single population and MultiPop-TagSelect on multiple populations. Moreover, the numbers of tagSNPs selected by our algorithms are almost optimal because they are very close to the corresponding lower bounds obtained by our method. PMID:20078395

  12. Genome-wide SNP analysis of the Systemic Capillary Leak Syndrome (Clarkson disease)

    PubMed Central

    Xie, Zhihui; Nagarajan, Vijayaraj; Sturdevant, Daniel E; Iwaki, Shoko; Chan, Eunice; Wisch, Laura; Young, Michael; Nelson, Celeste M; Porcella, Stephen F; Druey, Kirk M

    2013-01-01

    The Systemic Capillary Leak Syndrome (SCLS) is an extremely rare, orphan disease that resembles, and is frequently erroneously diagnosed as, systemic anaphylaxis. The disorder is characterized by repeated, transient, and seemingly unprovoked episodes of hypotensive shock and peripheral edema due to transient endothelial hyperpermeability. SCLS is often accompanied by a monoclonal gammopathy of unknown significance (MGUS). Using Affymetrix Single Nucleotide Polymorphism (SNP) microarrays, we performed the first genome-wide SNP analysis of SCLS in a cohort of 12 disease subjects and 18 controls. Exome capture sequencing was performed on genomic DNA from nine of these patients as validation for the SNP-chip discoveries and de novo data generation. We identified candidate susceptibility loci for SCLS, which included a region flanking CAV3 (3p25.3) as well as SNP clusters in PON1 (7q21.3), PSORS1C1 (6p21.3), and CHCHD3 (7q33). Among the most highly ranked discoveries were gene-associated SNPs in the uncharacterized LOC100130480 gene (rs6417039, rs2004296). Top case-associated SNPs were observed in BTRC (rs12355803, 3rs4436485), ARHGEF18 (rs11668246), CDH13 (rs4782779), and EDG2 (rs12552348), which encode proteins with known or suspected roles in B cell function and/or vascular integrity. 61 SNPs that were significantly associated with SCLS by microarray analysis were also detected and validated by exome deep sequencing. Functional annotation of highly ranked SNPs revealed enrichment of cell projections, cell junctions and adhesion, and molecules containing pleckstrin homology, Ras/Rho regulatory, and immunoglobulin Ig-like C2/fibronectin type III domains, all of which involve mechanistic functions that correlate with the SCLS phenotype. These results highlight SNPs with potential relevance to SCLS. PMID:24808988

  13. Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT

    PubMed Central

    Neigenfind, Jost; Gyetvai, Gabor; Basekow, Rico; Diehl, Svenja; Achenbach, Ute; Gebhardt, Christiane; Selbig, Joachim; Kersten, Birgit

    2008-01-01

    Background Haplotype inference based on unphased SNP markers is an important task in population genetics. Although there are different approaches to the inference of haplotypes in diploid species, the existing software is not suitable for inferring haplotypes from unphased SNP data in polyploid species, such as the cultivated potato (Solanum tuberosum). Potato species are tetraploid and highly heterozygous. Results Here we present the software SATlotyper which is able to handle polyploid and polyallelic data. SATlo-typer uses the Boolean satisfiability problem to formulate Haplotype Inference by Pure Parsimony. The software excludes existing haplotype inferences, thus allowing for calculation of alternative inferences. As it is not known which of the multiple haplotype inferences are best supported by the given unphased data set, we use a bootstrapping procedure that allows for scoring of alternative inferences. Finally, by means of the bootstrapping scores, it is possible to optimise the phased genotypes belonging to a given haplotype inference. The program is evaluated with simulated and experimental SNP data generated for heterozygous tetraploid populations of potato. We show that, instead of taking the first haplotype inference reported by the program, we can significantly improve the quality of the final result by applying additional methods that include scoring of the alternative haplotype inferences and genotype optimisation. For a sub-population of nineteen individuals, the predicted results computed by SATlotyper were directly compared with results obtained by experimental haplotype inference via sequencing of cloned amplicons. Prediction and experiment gave similar results regarding the inferred haplotypes and phased genotypes. Conclusion Our results suggest that Haplotype Inference by Pure Parsimony can be solved efficiently by the SAT approach, even for data sets of unphased SNP from heterozygous polyploids. SATlotyper is freeware and is distributed as

  14. SNP Discovery Using Next Generation Transcriptomic Sequencing in Atlantic Herring (Clupea harengus)

    PubMed Central

    Bekkevold, Dorte; Babbucci, Massimiliano; van Houdt, Jeroen; Maes, Gregory E.; Bargelloni, Luca; Nielsen, Rasmus O.; Taylor, Martin I.; Ogden, Rob; Cariani, Alessia; Carvalho, Gary R.; Consortium, FishPopTrace; Panitz, Frank

    2012-01-01

    The introduction of Next Generation Sequencing (NGS) has revolutionised population genetics, providing studies of non-model species with unprecedented genomic coverage, allowing evolutionary biologists to address questions previously far beyond the reach of available resources. Furthermore, the simple mutation model of Single Nucleotide Polymorphisms (SNPs) permits cost-effective high-throughput genotyping in thousands of individuals simultaneously. Genomic resources are scarce for the Atlantic herring (Clupea harengus), a small pelagic species that sustains high revenue fisheries. This paper details the development of 578 SNPs using a combined NGS and high-throughput genotyping approach. Eight individuals covering the species distribution in the eastern Atlantic were bar-coded and multiplexed into a single cDNA library and sequenced using the 454 GS FLX platform. SNP discovery was performed by de novo sequence clustering and contig assembly, followed by the mapping of reads against consensus contig sequences. Selection of candidate SNPs for genotyping was conducted using an in silico approach. SNP validation and genotyping were performed simultaneously using an Illumina 1,536 GoldenGate assay. Although the conversion rate of candidate SNPs in the genotyping assay cannot be predicted in advance, this approach has the potential to maximise cost and time efficiencies by avoiding expensive and time-consuming laboratory stages of SNP validation. Additionally, the in silico approach leads to lower ascertainment bias in the resulting SNP panel as marker selection is based only on the ability to design primers and the predicted presence of intron-exon boundaries. Consequently SNPs with a wider spectrum of minor allele frequencies (MAFs) will be genotyped in the final panel. The genomic resources presented here represent a valuable multi-purpose resource for developing informative marker panels for population discrimination, microarray development and for population

  15. An extended Tajima’s D neutrality test incorporating SNP calling and imputation uncertainties

    PubMed Central

    Zhang, Qingrun; Tyler-Smith, Chris; Long, Quan

    2015-01-01

    To identify evolutionary events from the footprints left in the patterns of genetic variation in a population, people use many statistical frameworks, including neutrality tests. In datasets from current high throughput sequencing and genotyping platforms, it is common to have missing data and low-confidence SNP calls at many segregating sites. However, the traditional statistical framework for neutrality tests does not allow for these possibilities; therefore the usual way of treating missing data is to ignore segregating sites with missing/low confidence calls, regardless of the good SNP calls at these sites in other individuals. In this work, we propose a modified neutrality test, Extended Tajima’s D, which incorporates missing data and SNP-calling uncertainties. Because we do not specify any particular error-generating mechanism, this approach is robust and widely applicable. Simulations show that in most cases the power of the new test is better than the original Tajima’s D, given the same type I error. Applications to real data show that it detects fewer outliers associated with low quality data. PMID:26681995

  16. Quadruplex-single nucleotide polymorphisms (Quad-SNP) influence gene expression difference among individuals

    PubMed Central

    Baral, Aradhita; Kumar, Pankaj; Halder, Rashi; Mani, Prithvi; Yadav, Vinod Kumar; Singh, Ankita; Das, Swapan K.; Chowdhury, Shantanu

    2012-01-01

    Non-canonical guanine quadruplex structures are not only predominant but also conserved among bacterial and mammalian promoters. Moreover recent findings directly implicate quadruplex structures in transcription. These argue for an intrinsic role of the structural motif and thereby posit that single nucleotide polymorphisms (SNP) that compromise the quadruplex architecture could influence function. To test this, we analysed SNPs within quadruplex motifs (Quad-SNP) and gene expression in 270 individuals across four populations (HapMap) representing more than 14 500 genotypes. Findings reveal significant association between quadruplex-SNPs and expression of the corresponding gene in individuals (P < 0.0001). Furthermore, analysis of Quad-SNPs obtained from population-scale sequencing of 1000 human genomes showed relative selection bias against alteration of the structural motif. To directly test the quadruplex-SNP-transcription connection, we constructed a reporter system using the RPS3 promoter—remarkable difference in promoter activity in the ‘quadruplex-destabilized’ versus ‘quadruplex-intact’ promoter was noticed. As a further test, we incorporated a quadruplex motif or its disrupted counterpart within a synthetic promoter reporter construct. The quadruplex motif, and not the disrupted-motif, enhanced transcription in human cell lines of different origin. Together, these findings build direct support for quadruplex-mediated transcription and suggest quadruplex-SNPs may play significant role in mechanistically understanding variations in gene expression among individuals. PMID:22238381

  17. Application of LogitBoost Classifier for Traceability Using SNP Chip Data

    PubMed Central

    Kang, Hyunsung; Cho, Seoae; Kim, Heebal; Seo, Kang-Seok

    2015-01-01

    Consumer attention to food safety has increased rapidly due to animal-related diseases; therefore, it is important to identify their places of origin (POO) for safety purposes. However, only a few studies have addressed this issue and focused on machine learning-based approaches. In the present study, classification analyses were performed using a customized SNP chip for POO prediction. To accomplish this, 4,122 pigs originating from 104 farms were genotyped using the SNP chip. Several factors were considered to establish the best prediction model based on these data. We also assessed the applicability of the suggested model using a kinship coefficient-filtering approach. Our results showed that the LogitBoost-based prediction model outperformed other classifiers in terms of classification performance under most conditions. Specifically, a greater level of accuracy was observed when a higher kinship-based cutoff was employed. These results demonstrated the applicability of a machine learning-based approach using SNP chip data for practical traceability. PMID:26436917

  18. Novel approach for deriving genome wide SNP analysis data from archived blood spots

    PubMed Central

    2012-01-01

    Background The ability to transport and store DNA at room temperature in low volumes has the advantage of optimising cost, time and storage space. Blood spots on adapted filter papers are popular for this, with FTA (Flinders Technology Associates) Whatman™TM technology being one of the most recent. Plant material, plasmids, viral particles, bacteria and animal blood have been stored and transported successfully using this technology, however the method of porcine DNA extraction from FTA Whatman™TM cards is a relatively new approach, allowing nucleic acids to be ready for downstream applications such as PCR, whole genome amplification, sequencing and subsequent application to single nucleotide polymorphism microarrays has hitherto been under-explored. Findings DNA was extracted from FTA Whatman™TM cards (following adaptations of the manufacturer’s instructions), whole genome amplified and subsequently analysed to validate the integrity of the DNA for downstream SNP analysis. DNA was successfully extracted from 288/288 samples and amplified by WGA. Allele dropout post WGA, was observed in less than 2% of samples and there was no clear evidence of amplification bias nor contamination. Acceptable call rates on porcine SNP chips were also achieved using DNA extracted and amplified in this way. Conclusions DNA extracted from FTA Whatman cards is of a high enough quality and quantity following whole genomic amplification to perform meaningful SNP chip studies. PMID:22974252

  19. High-throughput SNP-genotyping analysis of the relationships among Ponto-Caspian sturgeon species

    PubMed Central

    Rastorguev, Sergey M; Nedoluzhko, Artem V; Mazur, Alexander M; Gruzdeva, Natalia M; Volkov, Alexander A; Barmintseva, Anna E; Mugue, Nikolai S; Prokhortchouk, Egor B

    2013-01-01

    Abstract Legally certified sturgeon fisheries require population protection and conservation methods, including DNA tests to identify the source of valuable sturgeon roe. However, the available genetic data are insufficient to distinguish between different sturgeon populations, and are even unable to distinguish between some species. We performed high-throughput single-nucleotide polymorphism (SNP)-genotyping analysis on different populations of Russian (Acipenser gueldenstaedtii), Persian (A. persicus), and Siberian (A. baerii) sturgeon species from the Caspian Sea region (Volga and Ural Rivers), the Azov Sea, and two Siberian rivers. We found that Russian sturgeons from the Volga and Ural Rivers were essentially indistinguishable, but they differed from Russian sturgeons in the Azov Sea, and from Persian and Siberian sturgeons. We identified eight SNPs that were sufficient to distinguish these sturgeon populations with 80% confidence, and allowed the development of markers to distinguish sturgeon species. Finally, on the basis of our SNP data, we propose that the A. baerii-like mitochondrial DNA found in some Russian sturgeons from the Caspian Sea arose via an introgression event during the Pleistocene glaciation. In the present study, the high-throughput genotyping analysis of several sturgeon populations was performed. SNP markers for species identification were defined. The possible explanation of the baerii-like mitotype presence in some Russian sturgeons in the Caspian Sea was suggested. PMID:24567827

  20. A functional SNP in FLT1 increases risk of coronary artery disease in a Japanese population.

    PubMed

    Konta, Atsuko; Ozaki, Kouichi; Sakata, Yasuhiko; Takahashi, Atsushi; Morizono, Takashi; Suna, Shinichiro; Onouchi, Yoshihiro; Tsunoda, Tatsuhiko; Kubo, Michiaki; Komuro, Issei; Eishi, Yoshinobu; Tanaka, Toshihiro

    2016-05-01

    Coronary artery disease (CAD) including myocardial infarction is one of the leading causes of death in many countries. Similar to other common diseases, its pathogenesis is thought to result from complex interactions among multiple genetic and environmental factors. Recent large-scale genetic association analysis for CAD identified 15 new loci. We examined the reproducibility of these previous association findings with 7990 cases and 6582 controls in a Japanese population. We found a convincing association of rs9319428 in FLT1, encoding fms-related tyrosine kinase 1 (P=5.98 × 10(-8)). Fine mapping using tag single-nucleotide polymorphisms (SNPs) at FLT1 locus revealed that another SNP (rs74412485) showed more profound genetic effect for CAD (P=2.85 × 10(-12)). The SNP, located in intron 1 in FLT1, enhanced the transcriptional level of FLT1. RNA interference experiment against FLT1 showed that the suppression of FLT1 resulted in decreased expression of inflammatory adhesion molecules. Expression of FLT1 was observed in endothelial cells of human coronary artery. Our results indicate that the genetically coded increased expression of FLT1 by a functional SNP implicates activation in an inflammatory cascade that might eventually lead to CAD. PMID:26791355

  1. Identification and SNP association analysis of a novel gene in chicken.

    PubMed

    Mei, Xingxing; Kang, Xiangtao; Liu, Xiaojun; Jia, Lijuan; Li, Hong; Li, Zhuanjian; Jiang, Ruirui

    2016-02-01

    A novel gene that was predicted to encode a long noncoding RNA (lncRNA) transcript was identified in a previous study that aimed to detect candidate genes related to growth rate differences between Chinese local breed Gushi chickens and Anka broilers. To characterise the biological function of the lncRNA, we cloned and sequenced the complete open reading frame of the gene. We performed quantitative real-time polymerase chain reaction (qPCR) to analyse the expression patterns of the lncRNA in different tissues of chicken at different development stages. The qPCR data showed that the novel lncRNA gene was expressed extensively, with the highest abundance in spleen and lung and the lowest abundance in pectoralis and leg muscle. Additionally, we identified a single nucleotide polymorphism (SNP) at the 5'-end of the gene and studied the association between the SNP and chicken growth traits using data from an F2 resource population of Gushi chickens and Anka broilers. The association analysis showed that the SNP was significantly (P < 0.05) associated with leg muscle weight, chest breadth, sternal length and body weight in chickens at 1 day, 4 weeks and 6 weeks of age. We concluded that the novel lncRNA gene, which we designated pouBW1, may play an important role in regulating chicken growth. PMID:26643990

  2. PEAS V1.0: a package for elementary analysis of SNP data.

    PubMed

    Xu, Shuhua; Gupta, Sanchit; Jin, Li

    2010-11-01

    We have developed a software package named PEAS to facilitate analyses of large data sets of single nucleotide polymorphisms (SNPs) for population genetics and molecular phylogenetics studies. PEAS reads SNP data in various formats as input and is versatile in data formatting; using PEAS, it is easy to create input files for many popular packages, such as STRUCTURE, frappe, Arlequin, Haploview, LDhat, PLINK, EIGENSOFT, PHASE, fastPHASE, MEGA and PHYLIP. In addition, PEAS fills up several analysis gaps in currently available computer programs in population genetics and molecular phylogenetics. Notably, (i) It calculates genetic distance matrices with bootstrapping for both individuals and populations from genome-wide high-density SNP data, and the output can be streamlined to MEGA and PHYLIP programs for further processing; (ii) It calculates genetic distances from STRUCTURE output and generates MEGA file to reconstruct component trees; (iii) It provides tools to conduct haplotype sharing analysis for phylogenetic studies based on high-density SNP data. To our knowledge, these analyses are not available in any other computer program. PEAS for Windows is freely available for academic users from http://www.picb.ac.cn/~xushua/index.files/Download_PEAS.htm. PMID:21565121

  3. Rapid Detection of Rare Deleterious Variants by Next Generation Sequencing with Optional Microarray SNP Genotype Data

    PubMed Central

    Watson, Christopher M.; Crinnion, Laura A.; Gurgel‐Gianetti, Juliana; Harrison, Sally M.; Daly, Catherine; Antanavicuite, Agne; Lascelles, Carolina; Markham, Alexander F.; Pena, Sergio D. J.; Bonthron, David T.

    2015-01-01

    ABSTRACT Autozygosity mapping is a powerful technique for the identification of rare, autosomal recessive, disease‐causing genes. The ease with which this category of disease gene can be identified has greatly increased through the availability of genome‐wide SNP genotyping microarrays and subsequently of exome sequencing. Although these methods have simplified the generation of experimental data, its analysis, particularly when disparate data types must be integrated, remains time consuming. Moreover, the huge volume of sequence variant data generated from next generation sequencing experiments opens up the possibility of using these data instead of microarray genotype data to identify disease loci. To allow these two types of data to be used in an integrated fashion, we have developed AgileVCFMapper, a program that performs both the mapping of disease loci by SNP genotyping and the analysis of potentially deleterious variants using exome sequence variant data, in a single step. This method does not require microarray SNP genotype data, although analysis with a combination of microarray and exome genotype data enables more precise delineation of disease loci, due to superior marker density and distribution. PMID:26037133

  4. Application of LogitBoost Classifier for Traceability Using SNP Chip Data.

    PubMed

    Kim, Kwondo; Seo, Minseok; Kang, Hyunsung; Cho, Seoae; Kim, Heebal; Seo, Kang-Seok

    2015-01-01

    Consumer attention to food safety has increased rapidly due to animal-related diseases; therefore, it is important to identify their places of origin (POO) for safety purposes. However, only a few studies have addressed this issue and focused on machine learning-based approaches. In the present study, classification analyses were performed using a customized SNP chip for POO prediction. To accomplish this, 4,122 pigs originating from 104 farms were genotyped using the SNP chip. Several factors were considered to establish the best prediction model based on these data. We also assessed the applicability of the suggested model using a kinship coefficient-filtering approach. Our results showed that the LogitBoost-based prediction model outperformed other classifiers in terms of classification performance under most conditions. Specifically, a greater level of accuracy was observed when a higher kinship-based cutoff was employed. These results demonstrated the applicability of a machine learning-based approach using SNP chip data for practical traceability. PMID:26436917

  5. Evaluation of Y chromosomal SNP haplogrouping in the HID-Ion AmpliSeq™ Identity Panel.

    PubMed

    Ochiai, Eriko; Minaguchi, Kiyoshi; Nambiar, Phrabhakaran; Kakimoto, Yu; Satoh, Fumiko; Nakatome, Masato; Miyashita, Keiko; Osawa, Motoki

    2016-09-01

    The Y chromosomal haplogroup determined from single nucleotide polymorphism (SNP) combinations is a valuable genetic marker to study ancestral male lineage and ethical distribution. Next-generation sequencing has been developed for widely diverse genetics fields. For this study, we demonstrate 34 Y-SNP typing employing the Ion PGM™ system to perform haplogrouping. DNA libraries were constructed using the HID-Ion AmpliSeq™ Identity Panel. Emulsion PCR was performed, then DNA sequences were analyzed on the Ion 314 and 316 Chip Kit v2. Some difficulties became apparent during the analytic processes. No-call was reported at rs2032599 and M479 in six samples, in which the least coverage was observed at M479. A minor misreading occurred at rs2032631 and M479. A real time PCR experiment using other pairs of oligonucleotide primers showed that these events might result from the flanking sequence. Finally, Y haplogroup was determined completely for 81 unrelated males including Japanese (n=59) and Malay (n=22) subjects. The allelic divergence differed between the two populations. In comparison with the conventional Sanger method, next-generation sequencing provides a comprehensive SNP analysis with convenient procedures, but further system improvement is necessary. PMID:27591541

  6. How to Use SNP_TATA_Comparator to Find a Significant Change in Gene Expression Caused by the Regulatory SNP of This Gene's Promoter via a Change in Affinity of the TATA-Binding Protein for This Promoter

    PubMed Central

    Ponomarenko, Mikhail; Rasskazov, Dmitry; Arkova, Olga; Ponomarenko, Petr; Suslov, Valentin; Savinkova, Ludmila; Kolchanov, Nikolay

    2015-01-01

    The use of biomedical SNP markers of diseases can improve effectiveness of treatment. Genotyping of patients with subsequent searching for SNPs more frequent than in norm is the only commonly accepted method for identification of SNP markers within the framework of translational research. The bioinformatics applications aimed at millions of unannotated SNPs of the “1000 Genomes” can make this search for SNP markers more focused and less expensive. We used our Web service involving Fisher's Z-score for candidate SNP markers to find a significant change in a gene's expression. Here we analyzed the change caused by SNPs in the gene's promoter via a change in affinity of the TATA-binding protein for this promoter. We provide examples and discuss how to use this bioinformatics application in the course of practical analysis of unannotated SNPs from the “1000 Genomes” project. Using known biomedical SNP markers, we identified 17 novel candidate SNP markers nearby: rs549858786 (rheumatoid arthritis); rs72661131 (cardiovascular events in rheumatoid arthritis); rs562962093 (stroke); rs563558831 (cyclophosphamide bioactivation); rs55878706 (malaria resistance, leukopenia), rs572527200 (asthma, systemic sclerosis, and psoriasis), rs371045754 (hemophilia B), rs587745372 (cardiovascular events); rs372329931, rs200209906, rs367732974, and rs549591993 (all four: cancer); rs17231520 and rs569033466 (both: atherosclerosis); rs63750953, rs281864525, and rs34166473 (all three: malaria resistance, thalassemia). PMID:26516624

  7. Efficient SNP Discovery by Combining Microarray and Lab-on-a-Chip Data for Animal Breeding and Selection

    PubMed Central

    Huang, Chao-Wei; Lin, Yu-Tsung; Ding, Shih-Torng; Lo, Ling-Ling; Wang, Pei-Hwa; Lin, En-Chung; Liu, Fang-Wei; Lu, Yen-Wen

    2015-01-01

    The genetic markers associated with economic traits have been widely explored for animal breeding. Among these markers, single-nucleotide polymorphism (SNPs) are gradually becoming a prevalent and effective evaluation tool. Since SNPs only focus on the genetic sequences of interest, it thereby reduces the evaluation time and cost. Compared to traditional approaches, SNP genotyping techniques incorporate informative genetic background, improve the breeding prediction accuracy and acquiesce breeding quality on the farm. This article therefore reviews the typical procedures of animal breeding using SNPs and the current status of related techniques. The associated SNP information and genotyping techniques, including microarray and Lab-on-a-Chip based platforms, along with their potential are highlighted. Examples in pig and poultry with different SNP loci linked to high economic trait values are given. The recommendations for utilizing SNP genotyping in nimal breeding are summarized. PMID:27600241

  8. Generation of large numbers of SNP in cattle by coupling reduced genome representation with high throughput sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Whole genome sequencing projects have produced draft sequences for species from diverse evolutionary clades for comparative evolutionary studies. Generally, these projects have not simultaneously created extensive single nucleotide polymorphism (SNP) resources for use in genetics studies within the...

  9. Citrus (Rutaceae) SNP markers based on Competitive Allele-Specific PCR; transferability across the Aurantioideae subfamily1

    PubMed Central

    Garcia-Lor, Andres; Ancillo, Gema; Navarro, Luis; Ollitrault, Patrick

    2013-01-01

    • Premise of the study: Single nucleotide polymorphism (SNP) markers based on Competitive Allele-Specific PCR (KASPar) were developed from sequences of three Citrus species. Their transferability was tested in 63 Citrus genotypes and 19 relative genera of the subfamily Aurantioideae to estimate the potential of SNP markers, selected from a limited intrageneric discovery panel, for ongoing broader diversity analysis at the intra- and intergeneric levels and systematic germplasm bank characterization. • Methods and Results: Forty-two SNP markers were developed using KASPar technology. Forty-one were successfully genotyped in all of the Citrus germplasm, where intra- and interspecific polymorphisms were observed. The transferability and diversity decreased with increasing taxonomic distance. • Conclusions: SNP markers based on the KASPar method developed from sequence data of a limited intrageneric discovery panel provide a valuable molecular resource for genetic diversity analysis of germplasm within a genus and should be useful for germplasm fingerprinting at a much broader diversity level. PMID:25202535

  10. Developing Single Nucleotide Polymorphism (SNP) markers from transcriptome sequences for the identification of longan (Dimocarpus longan) germplasm

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in...

  11. ISS Solar Array Management

    NASA Technical Reports Server (NTRS)

    Williams, James P.; Martin, Keith D.; Thomas, Justin R.; Caro, Samuel

    2010-01-01

    The International Space Station (ISS) Solar Array Management (SAM) software toolset provides the capabilities necessary to operate a spacecraft with complex solar array constraints. It monitors spacecraft telemetry and provides interpretations of solar array constraint data in an intuitive manner. The toolset provides extensive situational awareness to ensure mission success by analyzing power generation needs, array motion constraints, and structural loading situations. The software suite consists of several components including samCS (constraint set selector), samShadyTimers (array shadowing timers), samWin (visualization GUI), samLock (array motion constraint computation), and samJet (attitude control system configuration selector). It provides high availability and uptime for extended and continuous mission support. It is able to support two-degrees-of-freedom (DOF) array positioning and supports up to ten simultaneous constraints with intuitive 1D and 2D decision support visualizations of constraint data. Display synchronization is enabled across a networked control center and multiple methods for constraint data interpolation are supported. Use of this software toolset increases flight safety, reduces mission support effort, optimizes solar array operation for achieving mission goals, and has run for weeks at a time without issues. The SAM toolset is currently used in ISS real-time mission operations.

  12. Array for detecting microbes

    DOEpatents

    Andersen, Gary L.; DeSantis, Todd D.

    2014-07-08

    The present embodiments relate to an array system for detecting and identifying biomolecules and organisms. More specifically, the present embodiments relate to an array system comprising a microarray configured to simultaneously detect a plurality of organisms in a sample at a high confidence level.

  13. LincSNP: a database of linking disease-associated SNPs to human large intergenic non-coding RNAs

    PubMed Central

    2014-01-01

    Background Genome-wide association studies (GWAS) have successfully identified a large number of single nucleotide polymorphisms (SNPs) that are associated with a wide range of human diseases. However, many of these disease-associated SNPs are located in non-coding regions and have remained largely unexplained. Recent findings indicate that disease-associated SNPs in human large intergenic non-coding RNA (lincRNA) may lead to susceptibility to diseases through their effects on lincRNA expression. There is, therefore, a need to specifically record these SNPs and annotate them as potential candidates for disease. Description We have built LincSNP, an integrated database, to identify and annotate disease-associated SNPs in human lincRNAs. The current release of LincSNP contains approximately 140,000 disease-associated SNPs (or linkage disequilibrium SNPs), which can be mapped to around 5,000 human lincRNAs, together with their comprehensive functional annotations. The database also contains annotated, experimentally supported SNP-lincRNA-disease associations and disease-associated lincRNAs. It provides flexible search options for data extraction and searches can be performed by disease/phenotype name, SNP ID, lincRNA name and chromosome region. In addition, we provide users with a link to download all the data from LincSNP and have developed a web interface for the submission of novel identified SNP-lincRNA-disease associations. Conclusions The LincSNP database aims to integrate disease-associated SNPs and human lincRNAs, which will be an important resource for the investigation of the functions and mechanisms of lincRNAs in human disease. The database is available at http://bioinfo.hrbmu.edu.cn/LincSNP. PMID:24885522

  14. A Multiple-SNP Approach for Genome-Wide Association Study of Milk Production Traits in Chinese Holstein Cattle

    PubMed Central

    Fang, Ming; Fu, Weixuan; Jiang, Dan; Zhang, Qin; Sun, Dongxiao; Ding, Xiangdong; Liu, Jianfeng

    2014-01-01

    The multiple-SNP analysis has been studied by many researchers, in which the effects of multiple SNPs are simultaneously estimated and tested in a multiple linear regression. The multiple-SNP association analysis usually has higher power and lower false-positive rate for detecting causative SNP(s) than single marker analysis (SMA). Several methods have been proposed to simultaneously estimate and test multiple SNP effects. In this research, a fast method called MEML (Mixed model based Expectation-Maximization Lasso algorithm) was developed for simultaneously estimate of multiple SNP effects. An improved Lasso prior was assigned to SNP effects which were estimated by searching the maximum joint posterior mode. The residual polygenic effect was included in the model to absorb many tiny SNP effects, which is treated as missing data in our EM algorithm. A series of simulation experiments were conducted to validate the proposed method, and the results showed that compared with SMMA, the new method can dramatically decrease the false-positive rate. The new method was also applied to the 50k SNP-panel dataset for genome-wide association study of milk production traits in Chinese Holstein cattle. Totally, 39 significant SNPs and their nearby 25 genes were found. The number of significant SNPs is remarkably fewer than that by SMMA which found 105 significant SNPs. Among 39 significant SNPs, 8 were also found by SMMA and several well-known QTLs or genes were confirmed again; furthermore, we also got some positional candidate gene with potential function of effecting milk production traits. These novel findings in our research should be valuable for further investigation. PMID:25148050

  15. Use of the Illumina GoldenGate assay for single nucleotide polymorphism (SNP) genotyping in cereal crops.

    PubMed

    Chao, Shiaoman; Lawley, Cindy

    2015-01-01

    Highly parallel genotyping assays, such as the GoldenGate assay developed by Illumina, capable of interrogating up to 3,072 single nucleotide polymorphisms (SNPs) simultaneously, have greatly facilitated genome-wide studies, particularly for crops with large and complex genome structures. In this report, we provide detailed information and guidelines regarding genomic DNA preparation, SNP assay design, SNP assay protocols, and genotype calling using Illumina's GenomeStudio software. PMID:25373766

  16. Micromachined electrode array

    DOEpatents

    Okandan, Murat; Wessendorf, Kurt O.

    2007-12-11

    An electrode array is disclosed which has applications for neural stimulation and sensing. The electrode array, in certain embodiments, can include a plurality of electrodes each of which is flexibly attached to a common substrate using a plurality of springs to allow the electrodes to move independently. In other embodiments of the electrode array, the electrodes can be fixed to the substrate. The electrode array can be formed from a combination of bulk and surface micromachining, and can include electrode tips having an electroplated metal (e.g. platinum, iridium, gold or titanium) or a metal oxide (e.g. iridium oxide) for biocompatibility. The electrode array can be used to form a part of a neural prosthesis, and is particularly well adapted for use in an implantable retinal prosthesis.

  17. Microbial sensor cell arrays.

    PubMed

    Melamed, Sahar; Elad, Tal; Belkin, Shimshon

    2012-02-01

    Motivated by the advantages endowed by high-throughput analysis, researchers have succeeded in incorporating multiple reporter cells into a single platform; the technology now allows the simultaneous scrutiny of a large collection of sensor strains. We review current aspects in cell array technology with emphasis on microbial sensor arrays. We consider various techniques for patterning live cells on solid surfaces, describe different array-based applications and devices, and highlight recent efforts for live cell storage. We review mathematical approaches for deciphering the data emanating from bioreporter collections, and discuss the future of single cell arrays. Innovative technologies for cell patterning, preservation and interpretation are continuously being developed; when they all mature, cell arrays may become an efficient analytical tool, in a scope resembling that of DNA microarray biochips. PMID:22176747

  18. Photovoltaic array loss mechanisms

    NASA Technical Reports Server (NTRS)

    Gonzalez, Charles

    1986-01-01

    Loss mechanisms which come into play when solar cell modules are mounted in arrays are identified. Losses can occur either from a reduction in the array electrical performance or with nonoptimal extraction of power from the array. Electrical performance degradation is caused by electrical mismatch, transmission losses from cell surface soiling and steep angle of reflectance, and electrical losses from field wiring resistance and the voltage drop across blocking diodes. The second type of loss, concerned with the operating points of the array, can involve nonoptimal load impedance and limiting the operating envelope of the array to specific ranges of voltage and current. Each of the loss mechanisms are discussed and average energy losses expected from soiling, steep reflectance angles and circuit losses are calculated.

  19. Multibeam Phased Array Antennas

    NASA Technical Reports Server (NTRS)

    Popovic, Zoya; Romisch, Stefania; Rondineau, Sebastien

    2004-01-01

    In this study, a new architecture for Ka-band multi-beam arrays was developed and demonstrated experimentally. The goal of the investigation was to demonstrate a new architecture that has the potential of reducing the cost as compared to standard expensive phased array technology. The goals of this specific part of the project, as stated in the yearly statement of work in the original proposal are: 1. Investigate bounds on performance of multi-beam lens arrays in terms of beamwidths, volume (size), isolation between beams, number of simultaneous beams, etc. 2. Design a small-scale array to demonstrate the principle. The array will be designed for operation around 3OGHz (Ka-band), with two 10-degree beamwidth beams. 3. Investigate most appropriate way to accomplish fine-tuning of the beam pointing within 5 degrees around the main beam pointing angle.

  20. Microfabricated ion trap array

    DOEpatents

    Blain, Matthew G.; Fleming, James G.

    2006-12-26

    A microfabricated ion trap array, comprising a plurality of ion traps having an inner radius of order one micron, can be fabricated using surface micromachining techniques and materials known to the integrated circuits manufacturing and microelectromechanical systems industries. Micromachining methods enable batch fabrication, reduced manufacturing costs, dimensional and positional precision, and monolithic integration of massive arrays of ion traps with microscale ion generation and detection devices. Massive arraying enables the microscale ion traps to retain the resolution, sensitivity, and mass range advantages necessary for high chemical selectivity. The reduced electrode voltage enables integration of the microfabricated ion trap array with on-chip circuit-based rf operation and detection electronics (i.e., cell phone electronics). Therefore, the full performance advantages of the microfabricated ion trap array can be realized in truly field portable, handheld microanalysis systems.

  1. High density pixel array

    NASA Technical Reports Server (NTRS)

    Wiener-Avnear, Eliezer (Inventor); McFall, James Earl (Inventor)

    2004-01-01

    A pixel array device is fabricated by a laser micro-milling method under strict process control conditions. The device has an array of pixels bonded together with an adhesive filling the grooves between adjacent pixels. The array is fabricated by moving a substrate relative to a laser beam of predetermined intensity at a controlled, constant velocity along a predetermined path defining a set of grooves between adjacent pixels so that a predetermined laser flux per unit area is applied to the material, and repeating the movement for a plurality of passes of the laser beam until the grooves are ablated to a desired depth. The substrate is of an ultrasonic transducer material in one example for fabrication of a 2D ultrasonic phase array transducer. A substrate of phosphor material is used to fabricate an X-ray focal plane array detector.

  2. Diode Laser Arrays

    NASA Astrophysics Data System (ADS)

    Botez, Dan; Scifres, Don R.

    1994-08-01

    This book provides a comprehensive overview of the fundamental principles and applications of semiconductor diode laser arrays. All of the major types of arrays are discussed in detail, including coherent, incoherent, edge- and surface-emitting, horizontal- and vertical-cavity, individually addressed, lattice- matched and strained-layer systems. The initial chapters cover such topics as lasers, amplifiers, external-cavity control, theoretical modeling, and operational dynamics. Spatially incoherent arrays are then described in detail, and the uses of vertical-cavity surface emitter and edge-emitting arrays in parallel optical-signal processing and multi-channel optical recording are discussed. Researchers and graduate students in solid state physics and electrical engineering studying the properties and applications of such arrays will find this book invaluable.

  3. Identification of SNP and SSR Markers in Finger Millet Using Next Generation Sequencing Technologies

    PubMed Central

    Gimode, Davis; Odeny, Damaris A.; de Villiers, Etienne P.; Wanyonyi, Solomon; Dida, Mathews M.; Mneney, Emmarold E.; Muchugi, Alice; Machuka, Jesse; de Villiers, Santie M.

    2016-01-01

    Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS) technologies to develop both Simple Sequence Repeat (SSR) and Single Nucleotide Polymorphism (SNP) markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC) was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet. Our results also reveal an unexploited finger millet genetic resource that can be included in the regional

  4. SNP genotypes of olfactory receptor genes associated with olfactory ability in German Shepherd dogs.

    PubMed

    Yang, M; Geng, G-J; Zhang, W; Cui, L; Zhang, H-X; Zheng, J-L

    2016-04-01

    To find out the relationship between SNP genotypes of canine olfactory receptor genes and olfactory ability, 28 males and 20 females from German Shepherd dogs in police service were scored by odor detection tests and analyzed using the Beckman GenomeLab SNPstream. The representative 22 SNP loci from the exonic regions of 12 olfactory receptor genes were investigated, and three kinds of odor (human, ice drug and trinitrotoluene) were detected. The results showed that the SNP genotypes at the OR10H1-like:c.632C>T, OR10H1-like:c.770A>T, OR2K2-like:c.518G>A, OR4C11-like:c.511T>G and OR4C11-like:c.692G>A loci had a statistically significant effect on the scenting abilities (P < 0.001). The kind of odor influenced the performances of the dogs (P < 0.001). In addition, there were interactions between genotype and the kind of odor at the following loci: OR10H1-like:c.632C>T, OR10H1-like:c.770A>T, OR4C11-like:c.511T>G and OR4C11-like:c.692G>A (P < 0.001). The dogs with genotype CC at the OR10H1-like:c.632C>T, genotype AA at the OR10H1-like:c.770A>T, genotype TT at the OR4C11-like:c.511T>G and genotype GG at the OR4C11-like:c.692G>A loci did better at detecting the ice drug. We concluded that there was linkage between certain SNP genotypes and the olfactory ability of dogs and that SNP genotypes might be useful in determining dogs' scenting potential. PMID:26582499

  5. Supplementing High-Density SNP Microarrays for Additional Coverage of Disease-Related Genes: Addiction as a Paradigm

    SciTech Connect

    SacconePhD, Scott F; Chesler, Elissa J; Bierut, Laura J; Kalivas, Peter J; Lerman, Caryn; Saccone, Nancy L; Uhl, George R; Li, Chuan-Yun; Philip, Vivek M; Edenberg, Howard; Sherry, Steven; Feolo, Michael; Moyzis, Robert K; Rutter, Joni L

    2009-01-01

    Commercial SNP microarrays now provide comprehensive and affordable coverage of the human genome. However, some diseases have biologically relevant genomic regions that may require additional coverage. Addiction, for example, is thought to be influenced by complex interactions among many relevant genes and pathways. We have assembled a list of 486 biologically relevant genes nominated by a panel of experts on addiction. We then added 424 genes that showed evidence of association with addiction phenotypes through mouse QTL mappings and gene co-expression analysis. We demonstrate that there are a substantial number of SNPs in these genes that are not well represented by commercial SNP platforms. We address this problem by introducing a publicly available SNP database for addiction. The database is annotated using numeric prioritization scores indicating the extent of biological relevance. The scores incorporate a number of factors such as SNP/gene functional properties (including synonymy and promoter regions), data from mouse systems genetics and measures of human/mouse evolutionary conservation. We then used HapMap genotyping data to determine if a SNP is tagged by a commercial microarray through linkage disequilibrium. This combination of biological prioritization scores and LD tagging annotation will enable addiction researchers to supplement commercial SNP microarrays to ensure comprehensive coverage of biologically relevant regions.

  6. Supplementing High-Density SNP Microarrays for Additional Coverage of Disease-Related Genes: Addiction as a Paradigm

    PubMed Central

    Saccone, Scott F.; Bierut, Laura J.; Chesler, Elissa J.; Kalivas, Peter W.; Lerman, Caryn; Saccone, Nancy L.; Uhl, George R.; Li, Chuan-Yun; Philip, Vivek M.; Edenberg, Howard J.; Sherry, Stephen T.; Feolo, Michael; Moyzis, Robert K.; Rutter, Joni L.

    2009-01-01

    Commercial SNP microarrays now provide comprehensive and affordable coverage of the human genome. However, some diseases have biologically relevant genomic regions that may require additional coverage. Addiction, for example, is thought to be influenced by complex interactions among many relevant genes and pathways. We have assembled a list of 486 biologically relevant genes nominated by a panel of experts on addiction. We then added 424 genes that showed evidence of association with addiction phenotypes through mouse QTL mappings and gene co-expression analysis. We demonstrate that there are a substantial number of SNPs in these genes that are not well represented by commercial SNP platforms. We address this problem by introducing a publicly available SNP database for addiction. The database is annotated using numeric prioritization scores indicating the extent of biological relevance. The scores incorporate a number of factors such as SNP/gene functional properties (including synonymy and promoter regions), data from mouse systems genetics and measures of human/mouse evolutionary conservation. We then used HapMap genotyping data to determine if a SNP is tagged by a commercial microarray through linkage disequilibrium. This combination of biological prioritization scores and LD tagging annotation will enable addiction researchers to supplement commercial SNP microarrays to ensure comprehensive coverage of biologically relevant regions. PMID:19381300

  7. Interactions between SNP alleles at multiple loci contribute to skin color differences between caucasoid and mongoloid subjects.

    PubMed

    Anno, Sumiko; Abe, Takashi; Yamamoto, Takushi

    2008-01-01

    This study aimed to identify single nucleotide polymorphism (SNP) alleles at multiple loci associated with racial differences in skin color using SNP genotyping. A total of 122 Caucasians in Toledo, Ohio and 100 Mongoloids in Japan were genotyped for 20 SNPs in 7 candidate genes, encoding the Agouti signaling protein (ASIP), tyrosinase-related protein 1 (TYRP1), tyrosinase (TYR), melanocortin 1 receptor (MC1R), oculocutaneous albinism II (OCA2), microphthalmia-associated transcription factor (MITF), and myosin VA (MYO5A). Data were used to analyze associations between the 20 SNP alleles using linkage disequilibrium (LD). Combinations of SNP alleles were jointly tested under LD for associations with racial groups by performing a chi(2) test for independence. Results showed that SNP alleles at multiple loci can be considered the haplotype that contributes to significant differences between the two population groups and suggest a high probability of LD. Confirmation of these findings requires further study with other ethnic groups to analyze the associations between SNP alleles at multiple loci and skin color variation among races. PMID:18392143

  8. Alternative oxidase pathway is involved in the exogenous SNP-elevated tolerance of Medicago truncatula to salt stress.

    PubMed

    Jian, Wei; Zhang, Da-Wei; Zhu, Feng; Wang, Shuo-Xun; Pu, Xiao-Jun; Deng, Xing-Guang; Luo, Shi-Shuai; Lin, Hong-Hui

    2016-04-01

    Exogenous application of sodium nitroprusside (SNP) would enhance the tolerance of plants to stress conditions. Some evidences suggested that nitric oxide (NO) could induce the expression of alternative oxidase (AOX). In this study, Medicago truncatula (Medicago) was chosen to study the role of AOX in the SNP-elevated resistance to salt stress. Our results showed that the expression of AOX genes (especially AOX1 and AOX2b1) and cyanide-resistant respiration rate (Valt) could be significantly induced by salt stress. Exogenous application of SNP could further enhance the expression of AOX genes and Valt. Exogenous application of SNP could alleviate the oxidative damage and photosynthetic damage caused by salt stress. However, the stress resistance was significantly decreased in the plants which were pretreated with n-propyl gallate (nPG). More importantly, the damage in nPG-pretreated plants could not be alleviated by application of SNP. Further study showed that effects of nPG on the activities of antioxidant enzymes were minor. These results showed that AOX pathway played an important role in the SNP-elevated resistance of Medicago to salt stress. AOX could contribute to regulating the accumulation of reactive oxygen (ROS) and protect of photosystem, and we proposed that all these were depend on the ability of maintaining the homeostasis of redox state. PMID:26962709

  9. Characterization of polyploid wheat genomic diversity using a high-density 90 000 single nucleotide polymorphism array

    PubMed Central

    Wang, Shichen; Wong, Debbie; Forrest, Kerrie; Allen, Alexandra; Chao, Shiaoman; Huang, Bevan E; Maccaferri, Marco; Salvi, Silvio; Milner, Sara G; Cattivelli, Luigi; Mastrangelo, Anna M; Whan, Alex; Stephen, Stuart; Barker, Gary; Wieseke, Ralf; Plieske, Joerg; International Wheat Genome Sequencing Consortium; Lillemo, Morten; Mather, Diane; Appels, Rudi; Dolferus, Rudy; Brown-Guedira, Gina; Korol, Abraham; Akhunova, Alina R; Feuillet, Catherine; Salse, Jerome; Morgante, Michele; Pozniak, Curtis; Luo, Ming-Cheng; Dvorak, Jan; Morell, Matthew; Dubcovsky, Jorge; Ganal, Martin; Tuberosa, Roberto; Lawley, Cindy; Mikoulitch, Ivan; Cavanagh, Colin; Edwards, Keith J; Hayden, Matthew; Akhunov, Eduard

    2014-01-01

    High-density single nucleotide polymorphism (SNP) genotyping arrays are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships between individuals in populations and studying marker–trait associations in mapping experiments. We developed a genotyping array including about 90 000 gene-associated SNPs and used it to characterize genetic variation in allohexaploid and allotetraploid wheat populations. The array includes a significant fraction of common genome-wide distributed SNPs that are represented in populations of diverse geographical origin. We used density-based spatial clustering algorithms to enable high-throughput genotype calling in complex data sets obtained for polyploid wheat. We show that these model-free clustering algorithms provide accurate genotype calling in the presence of multiple clusters including clusters with low signal intensity resulting from significant sequence divergence at the target SNP site or gene deletions. Assays that detect low-intensity clusters can provide insight into the distribution of presence–absence variation (PAV) in wheat populations. A total of 46 977 SNPs from the wheat 90K array were genetically mapped using a combination of eight mapping populations. The developed array and cluster identification algorithms provide an opportunity to infer detailed haplotype structure in polyploid wheat and will serve as an invaluable resource for diversity studies and investigating the genetic basis of trait variation in wheat. PMID:24646323

  10. Strategies for single nucleotide polymorphism (SNP) genotyping to enhance genotype imputation in Gyr (Bos indicus) dairy cattle: Comparison of commercially available SNP chips.

    PubMed

    Boison, S A; Santos, D J A; Utsunomiya, A H T; Carvalheiro, R; Neves, H H R; O'Brien, A M Perez; Garcia, J F; Sölkner, J; da Silva, M V G B

    2015-07-01

    Genotype imputation is widely used as a cost-effective strategy in genomic evaluation of cattle. Key determinants of imputation accuracies, such as linkage disequilibrium patterns, marker densities, and ascertainment bias, differ between Bos indicus and Bos taurus breeds. Consequently, there is a need to investigate effectiveness of genotype imputation in indicine breeds. Thus, the objective of the study was to investigate strategies and factors affecting the accuracy of genotype imputation in Gyr (Bos indicus) dairy cattle. Four imputation scenarios were studied using 471 sires and 1,644 dams genotyped on Illumina BovineHD (HD-777K; San Diego, CA) and BovineSNP50 (50K) chips, respectively. Scenarios were based on which reference high-density single nucleotide polymorphism (SNP) panel (HDP) should be adopted [HD-777K, 50K, and GeneSeek GGP-75Ki (Lincoln, NE)]. Depending on the scenario, validation animals had their genotypes masked for one of the lower-density panels: Illumina (3K, 7K, and 50K) and GeneSeek (SGGP-20Ki and GGP-75Ki). We randomly selected 171 sires as reference and 300 as validation for all the scenarios. Additionally, all sires were used as reference and the 1,644 dams were imputed for validation. Genotypes of 98 individuals with 4 and more offspring were completely masked and imputed. Imputation algorithms FImpute and Beagle v3.3 and v4 were used. Imputation accuracies were measured using the correlation and allelic correct rate. FImpute resulted in highest accuracies, whereas Beagle 3.3 gave the least-accurate imputations. Accuracies evaluated as correlation (allelic correct rate) ranged from 0.910 (0.942) to 0.961 (0.974) using 50K as HDP and with 3K (7K) as low-density panels. With GGP-75Ki as HDP, accuracies were moderate for 3K, 7K, and 50K, but high for SGGP-20Ki. The use of HD-777K as HDP resulted in accuracies of 0.888 (3K), 0.941 (7K), 0.980 (SGGP-20Ki), 0.982 (50K), and 0.993 (GGP-75Ki). Ungenotyped individuals were imputed with an

  11. Genome-wide SNP scan of pooled DNA reveals nonsense mutation in FGF20 in the scaleless line of featherless chickens

    PubMed Central

    2012-01-01

    Background Scaleless (sc/sc) chickens carry a single recessive mutation that causes a lack of almost all body feathers, as well as foot scales and spurs, due to a failure of skin patterning during embryogenesis. This spontaneous mutant line, first described in the 1950s, has been used extensively to explore the tissue interactions involved in ectodermal appendage formation in embryonic skin. Moreover, the trait is potentially useful in tropical agriculture due to the ability of featherless chickens to tolerate heat, which is at present a major constraint to efficient poultry meat production in hot climates. In the interests of enhancing our understanding of feather placode development, and to provide the poultry industry with a strategy to breed heat-tolerant meat-type chickens (broilers), we mapped and identified the sc mutation. Results Through a cost-effective and labour-efficient SNP array mapping approach using DNA from sc/sc and sc/+ blood sample pools, we map the sc trait to chromosome 4 and show that a nonsense mutation in FGF20 is completely associated with the sc/sc phenotype. This mutation, common to all sc/sc individuals and absent from wild type, is predicted to lead to loss of a highly conserved region of the FGF20 protein important for FGF signalling. In situ hybridisation and quantitative RT-PCR studies reveal that FGF20 is epidermally expressed during the early stages of feather placode patterning. In addition, we describe a dCAPS genotyping assay based on the mutation, developed to facilitate discrimination between wild type and sc alleles. Conclusions This work represents the first loss of function genetic evidence supporting a role for FGF ligand signalling in feather development, and suggests FGF20 as a novel central player in the development of vertebrate skin appendages, including hair follicles and exocrine glands. In addition, this is to our knowledge the first report describing the use of the chicken SNP array to map genes based on

  12. Ultralightweight solar array technology

    SciTech Connect

    Goldsmith, P.; Kurland, R.

    1982-06-01

    Flat fold array technology is described, and performance for a range of missions and power levels is predicted. The array employs large area flat panel flexible substrates. The solar cells are adhesively bonded to a thin Kapton substrate to form individual panel assemblies. Any number of these panel assemblies may be joined together to make a blanket assembly. A container assembly protects each blanket assembly when stowed, and a tension guide wire assembly controls the flexible blanket shape when fully extended. Blanket extension and retraction are achieved through a motor powered lightweight trilongeron coilable lattice mast assembly. Ground and zero gravity flight tests on prototype array assemblies are successful.

  13. Working with Oligonucleotide Arrays.

    PubMed

    Carvalho, Benilton S

    2016-01-01

    Preprocessing microarray data consists of a number of statistical procedures that convert the observed intensities into quantities that represent biological events of interest, like gene expression and allele-specific abundances. Here, we present a summary of the theory behind microarray data preprocessing for expression, whole transcriptome and SNP designs and focus on the computational protocol used to obtain processed data that will be used on downstream analyses. We describe the main features of the oligo Bioconductor package, an application designed to support oligonucleotide microarrays using the R statistical environment and the infrastructure provided by Bioconductor, allowing the researcher to handle probe-level data and interface with advanced statistical tools under a simplified framework. We demonstrate the use of the package by preprocessing data originated from three different designs. PMID:27008013

  14. Blood Pressure Loci Identified with a Gene-Centric Array

    PubMed Central

    Johnson, Toby; Gaunt, Tom R.; Newhouse, Stephen J.; Padmanabhan, Sandosh; Tomaszewski, Maciej; Kumari, Meena; Morris, Richard W.; Tzoulaki, Ioanna; O'Brien, Eoin T.; Poulter, Neil R.; Sever, Peter; Shields, Denis C.; Thom, Simon; Wannamethee, Sasiwarang G.; Whincup, Peter H.; Brown, Morris J.; Connell, John M.; Dobson, Richard J.; Howard, Philip J.; Mein, Charles A.; Onipinla, Abiodun; Shaw-Hawkins, Sue; Zhang, Yun; Smith, George Davey; Day, Ian N.M.; Lawlor, Debbie A.; Goodall, Alison H.; Fowkes, F. Gerald; Abecasis, Gonçalo R.; Elliott, Paul; Gateva, Vesela; Braund, Peter S.; Burton, Paul R.; Nelson, Christopher P.; Tobin, Martin D.; van der Harst, Pim; Glorioso, Nicola; Neuvrith, Hani; Salvi, Erika; Staessen, Jan A.; Stucchi, Andrea; Devos, Nabila; Jeunemaitre, Xavier; Plouin, Pierre-François; Tichet, Jean; Juhanson, Peeter; Org, Elin; Putku, Margus; Sõber, Siim; Veldre, Gudrun; Viigimaa, Margus; Levinsson, Anna; Rosengren, Annika; Thelle, Dag S.; Hastie, Claire E.; Hedner, Thomas; Lee, Wai K.; Melander, Olle; Wahlstrand, Björn; Hardy, Rebecca; Wong, Andrew; Cooper, Jackie A.; Palmen, Jutta; Chen, Li; Stewart, Alexandre F.R.; Wells, George A.; Westra, Harm-Jan; Wolfs, Marcel G.M.; Clarke, Robert; Franzosi, Maria Grazia; Goel, Anuj; Hamsten, Anders; Lathrop, Mark; Peden, John F.; Seedorf, Udo; Watkins, Hugh; Ouwehand, Willem H.; Sambrook, Jennifer; Stephens, Jonathan; Casas, Juan-Pablo; Drenos, Fotios; Holmes, Michael V.; Kivimaki, Mika; Shah, Sonia; Shah, Tina; Talmud, Philippa J.; Whittaker, John; Wallace, Chris; Delles, Christian; Laan, Maris; Kuh, Diana; Humphries, Steve E.; Nyberg, Fredrik; Cusi, Daniele; Roberts, Robert; Newton-Cheh, Christopher; Franke, Lude; Stanton, Alice V.; Dominiczak, Anna F.; Farrall, Martin; Hingorani, Aroon D.; Samani, Nilesh J.; Caulfield, Mark J.; Munroe, Patricia B.

    2011-01-01

    Raised blood pressure (BP) is a major risk factor for cardiovascular disease. Previous studies have identified 47 distinct genetic variants robustly associated with BP, but collectively these explain only a few percent of the heritability for BP phenotypes. To find additional BP loci, we used a bespoke gene-centric array to genotype an independent discovery sample of 25,118 individuals that combined hypertensive case-control and general population samples. We followed up four SNPs associated with BP at our p < 8.56 × 10−7 study-specific significance threshold and six suggestively associated SNPs in a further 59,349 individuals. We identified and replicated a SNP at LSP1/TNNT3, a SNP at MTHFR-NPPB independent (r2 = 0.33) of previous reports, and replicated SNPs at AGT and ATP2B1 reported previously. An analysis of combined discovery and follow-up data identified SNPs significantly associated with BP at p < 8.56 × 10−7 at four further loci (NPR3, HFE, NOS3, and SOX6). The high number of discoveries made with modest genotyping effort can be attributed to using a large-scale yet targeted genotyping array and to the development of a weighting scheme that maximized power when meta-analyzing results from samples ascertained with extreme phenotypes, in combination with results from nonascertained or population samples. Chromatin immunoprecipitation and transcript expression data highlight potential gene regulatory mechanisms at the MTHFR and NOS3 loci. These results provide candidates for further study to help dissect mechanisms affecting BP and highlight the utility of studying SNPs and samples that are independent of those studied previously even when the sample size is smaller than that in previous studies. PMID:22100073

  15. Blood pressure loci identified with a gene-centric array.

    PubMed

    Johnson, Toby; Gaunt, Tom R; Newhouse, Stephen J; Padmanabhan, Sandosh; Tomaszewski, Maciej; Kumari, Meena; Morris, Richard W; Tzoulaki, Ioanna; O'Brien, Eoin T; Poulter, Neil R; Sever, Peter; Shields, Denis C; Thom, Simon; Wannamethee, Sasiwarang G; Whincup, Peter H; Brown, Morris J; Connell, John M; Dobson, Richard J; Howard, Philip J; Mein, Charles A; Onipinla, Abiodun; Shaw-Hawkins, Sue; Zhang, Yun; Davey Smith, George; Day, Ian N M; Lawlor, Debbie A; Goodall, Alison H; Fowkes, F Gerald; Abecasis, Gonçalo R; Elliott, Paul; Gateva, Vesela; Braund, Peter S; Burton, Paul R; Nelson, Christopher P; Tobin, Martin D; van der Harst, Pim; Glorioso, Nicola; Neuvrith, Hani; Salvi, Erika; Staessen, Jan A; Stucchi, Andrea; Devos, Nabila; Jeunemaitre, Xavier; Plouin, Pierre-François; Tichet, Jean; Juhanson, Peeter; Org, Elin; Putku, Margus; Sõber, Siim; Veldre, Gudrun; Viigimaa, Margus; Levinsson, Anna; Rosengren, Annika; Thelle, Dag S; Hastie, Claire E; Hedner, Thomas; Lee, Wai K; Melander, Olle; Wahlstrand, Björn; Hardy, Rebecca; Wong, Andrew; Cooper, Jackie A; Palmen, Jutta; Chen, Li; Stewart, Alexandre F R; Wells, George A; Westra, Harm-Jan; Wolfs, Marcel G M; Clarke, Robert; Franzosi, Maria Grazia; Goel, Anuj; Hamsten, Anders; Lathrop, Mark; Peden, John F; Seedorf, Udo; Watkins, Hugh; Ouwehand, Willem H; Sambrook, Jennifer; Stephens, Jonathan; Casas, Juan-Pablo; Drenos, Fotios; Holmes, Michael V; Kivimaki, Mika; Shah, Sonia; Shah, Tina; Talmud, Philippa J; Whittaker, John; Wallace, Chris; Delles, Christian; Laan, Maris; Kuh, Diana; Humphries, Steve E; Nyberg, Fredrik; Cusi, Daniele; Roberts, Robert; Newton-Cheh, Christopher; Franke, Lude; Stanton, Alice V; Dominiczak, Anna F; Farrall, Martin; Hingorani, Aroon D; Samani, Nilesh J; Caulfield, Mark J; Munroe, Patricia B

    2011-12-01

    Raised blood pressure (BP) is a major risk factor for cardiovascular disease. Previous studies have identified 47 distinct genetic variants robustly associated with BP, but collectively these explain only a few percent of the heritability for BP phenotypes. To find additional BP loci, we used a bespoke gene-centric array to genotype an independent discovery sample of 25,118 individuals that combined hypertensive case-control and general population samples. We followed up four SNPs associated with BP at our p < 8.56 × 10(-7) study-specific significance threshold and six suggestively associated SNPs in a further 59,349 individuals. We identified and replicated a SNP at LSP1/TNNT3, a SNP at MTHFR-NPPB independent (r(2) = 0.33) of previous reports, and replicated SNPs at AGT and ATP2B1 reported previously. An analysis of combined discovery and follow-up data identified SNPs significantly associated with BP at p < 8.56 × 10(-7) at four further loci (NPR3, HFE, NOS3, and SOX6). The high number of discoveries made with modest genotyping effort can be attributed to using a large-scale yet targeted genotyping array and to the development of a weighting scheme that maximized power when meta-analyzing results from samples ascertained with extreme phenotypes, in combination with results from nonascertained or population samples. Chromatin immunoprecipitation and transcript expression data highlight potential gene regulatory mechanisms at the MTHFR and NOS3 loci. These results provide candidates for further study to help dissect mechanisms affecting BP and highlight the utility of studying SNPs and samples that are independent of those studied previously even when the sample size is smaller than that in previous studies. PMID:22100073

  16. Flexible retinal electrode array

    DOEpatents

    Okandan, Murat; Wessendorf, Kurt O.; Christenson, Todd R.

    2006-10-24

    An electrode array which has applications for neural stimulation and sensing. The electrode array can include a large number of electrodes each of which is flexibly attached to a common substrate using a plurality of springs to allow the electrodes to move independently. The electrode array can be formed from a combination of bulk and surface micromachining, with electrode tips that can include an electroplated metal (e.g. platinum, iridium, gold or titanium) or a metal oxide (e.g. iridium oxide) for biocompatibility. The electrode array can be used to form a part of a neural prosthesis, and is particularly well adapted for use in an implantable retinal prosthesis where the electrodes can be tailored to provide a uniform gentle contact pressure with optional sensing of this contact pressure at one or more of the electrodes.

  17. Economical custom LSI arrays

    NASA Technical Reports Server (NTRS)

    Feller, A.; Smith, A.; Ramondetta, P.; Noto, R.; Lombardi, T.

    1976-01-01

    Automatic design technique uses standard circuit cells for producing large-scale integrated arrays. Computerized fabrication process provides individual cells of high density and efficiency, quick turnaround time, low cost, and ease of corrections for changes and errors.

  18. Glory Solar Array Deployment

    NASA Video Gallery

    The Glory spacecraft uses Orbital Sciences Corporation Space Systems Group's LEOStar-1 bus design, with deployable, four-panel solar arrays. This conceptual animation reveals Glory's unique solar a...

  19. Expandable LED array interconnect

    DOEpatents

    Yuan, Thomas Cheng-Hsin; Keller, Bernd

    2011-03-01

    A light emitting device that can function as an array element in an expandable array of such devices. The light emitting device comprises a substrate that has a top surface and a plurality of edges. Input and output terminals are mounted to the top surface of the substrate. Both terminals comprise a plurality of contact pads disposed proximate to the edges of the substrate, allowing for easy access to both terminals from multiple edges of the substrate. A lighting element is mounted to the top surface of the substrate. The lighting element is connected between the input and output terminals. The contact pads provide multiple access points to the terminals which allow for greater flexibility in design when the devices are used as array elements in an expandable array.

  20. Virtual center arraying

    NASA Technical Reports Server (NTRS)

    Deutsch, L. J.; Lipes, R. G.; Miller, R. L.

    1981-01-01

    Methods to increase the amount of data that can be received from outer planet missions are described with emphasis on antenna arraying systems designed to increase the total effective aperture of the receiving system. One such method is virtual center arraying (VCA). In VCA, a combined carrier reference is derived at a point that is, conceptually, the geometric center of the array. This point need not coincide with any of the actual antennas of the array. A noise analysis of the VCA system is given along with formulas for the phase jitter as a function of loop bandwidths and the amount of loop damping. If the ratio of the loop bandwidths of the center loop to the vertex loops is greater than 100, then the jitter is very nearly equal to that expected for ideal combined carrier referencing.

  1. A GWAS SNP for Schizophrenia Is Linked to the Internal MIR137 Promoter and Supports Differential Allele-Specific Expression

    PubMed Central

    Warburton, Alix; Breen, Gerome; Bubb, Vivien J.; Quinn, John P.

    2016-01-01

    Single nucleotide polymorphisms (SNPs) within the MIR137 gene locus have been shown to confer risk for schizophrenia through genome-wide association studies (GWAS). The expression levels of microRNA-137 (miR-137) and its validated gene targets have also been shown to be disrupted in several neuropsychiatric conditions, including schizophrenia. Regulation of miR-137 expression is thus imperative for normal neuronal functioning. We previously characterized an internal promoter domain within the MIR137 gene that contained a variable number tandem repeat (VNTR) polymorphism and could alter the in vitro levels of miR-137 in a stimulus-induced and allele-specific manner. We now demonstrate that haplotype tagging-SNP analysis linked the rs1625579 GWAS SNP for schizophrenia to this internal MIR137 promoter through a proxy SNP rs2660304 located at this domain. We postulated that the rs2660304 promoter SNP may act as predisposing factor for schizophrenia through altering the levels of miR-137 expression in a genotype-dependent manner. Reporter gene analysis of the internal MIR137 promoter containing the common VNTR variant demonstrated genotype-dependent differences in promoter activity with respect to rs2660304. In line with previous reports, the major allele of the rs2660304 proxy SNP, which has previously been linked with schizophrenia risk through genetic association, resulted in downregulation of reporter gene expression in a tissue culture model. The genetic influence of the rs2660304 proxy SNP on the transcriptional activity of the internal MIR137 promoter, and thus the levels of miR-137 expression, therefore offers a distinct regulatory mechanism to explain the functional significance of the rs1625579 GWAS SNP for schizophrenia risk. PMID:26429811

  2. A GWAS SNP for Schizophrenia Is Linked to the Internal MIR137 Promoter and Supports Differential Allele-Specific Expression.

    PubMed

    Warburton, Alix; Breen, Gerome; Bubb, Vivien J; Quinn, John P

    2016-07-01

    Single nucleotide polymorphisms (SNPs) within the MIR137 gene locus have been shown to confer risk for schizophrenia through genome-wide association studies (GWAS). The expression levels of microRNA-137 (miR-137) and its validated gene targets have also been shown to be disrupted in several neuropsychiatric conditions, including schizophrenia. Regulation of miR-137 expression is thus imperative for normal neuronal functioning. We previously characterized an internal promoter domain within the MIR137 gene that contained a variable number tandem repeat (VNTR) polymorphism and could alter the in vitro levels of miR-137 in a stimulus-induced and allele-specific manner. We now demonstrate that haplotype tagging-SNP analysis linked the rs1625579 GWAS SNP for schizophrenia to this internal MIR137 promoter through a proxy SNP rs2660304 located at this domain. We postulated that the rs2660304 promoter SNP may act as predisposing factor for schizophrenia through altering the levels of miR-137 expression in a genotype-dependent manner. Reporter gene analysis of the internal MIR137 promoter containing the common VNTR variant demonstrated genotype-dependent differences in promoter activity with respect to rs2660304. In line with previous reports, the major allele of the rs2660304 proxy SNP, which has previously been linked with schizophrenia risk through genetic association, resulted in downregulation of reporter gene expression in a tissue culture model. The genetic influence of the rs2660304 proxy SNP on the transcriptional activity of the internal MIR137 promoter, and thus the levels of miR-137 expression, therefore offers a distinct regulatory mechanism to explain the functional significance of the rs1625579 GWAS SNP for schizophrenia risk. PMID:26429811

  3. Imputation of microsatellite alleles from dense SNP genotypes for parentage verification across multiple Bos taurus and Bos indicus breeds

    PubMed Central

    McClure, Matthew C.; Sonstegard, Tad S.; Wiggans, George R.; Van Eenennaam, Alison L.; Weber, Kristina L.; Penedo, Cecilia T.; Berry, Donagh P.; Flynn, John; Garcia, Jose F.; Carmo, Adriana S.; Regitano, Luciana C. A.; Albuquerque, Milla; Silva, Marcos V. G. B.; Machado, Marco A.; Coffey, Mike; Moore, Kirsty; Boscher, Marie-Yvonne; Genestout, Lucie; Mazza, Raffaele; Taylor, Jeremy F.; Schnabel, Robert D.; Simpson, Barry; Marques, Elisa; McEwan, John C.; Cromie, Andrew; Coutinho, Luiz L.; Kuehn, Larry A.; Keele, John W.; Piper, Emily K.; Cook, Jim; Williams, Robert; Van Tassell, Curtis P.

    2013-01-01

    To assist cattle producers transition from microsatellite (MS) to single nucleotide polymorphism (SNP) genotyping for parental verification we previously devised an effective and inexpensive method to impute MS alleles from SNP haplotypes. While the reported method was verified with only a limited data set (N = 479) from Brown Swiss, Guernsey, Holstein, and Jersey cattle, some of the MS-SNP haplotype associations were concordant across these phylogenetically diverse breeds. This implied that some haplotypes predate modern breed formation and remain in strong linkage disequilibrium. To expand the utility of MS allele imputation across breeds, MS and SNP data from more than 8000 animals representing 39 breeds (Bos taurus and B. indicus) were used to predict 9410 SNP haplotypes, incorporating an average of 73 SNPs per haplotype, for which alleles from 12 MS markers could be accurately be imputed. Approximately 25% of the MS-SNP haplotypes were present in multiple breeds (N = 2 to 36 breeds). These shared haplotypes allowed for MS imputation in breeds that were not represented in the reference population with only a small increase in Mendelian inheritance inconsistancies. Our reported reference haplotypes can be used for any cattle breed and the reported methods can be applied to any species to aid the transition from MS to SNP genetic markers. While ~91% of the animals with imputed alleles for 12 MS markers had ≤1 Mendelian inheritance conflicts with their parents' reported MS genotypes, this figure was 96% for our reference animals, indicating potential errors in the reported MS genotypes. The workflow we suggest autocorrects for genotyping errors and rare haplotypes, by MS genotyping animals whose imputed MS alleles fail parentage verification, and then incorporating those animals into the reference dataset. PMID:24065982

  4. Deriving Gene Networks from SNP Associated with Triacylglycerol and Phospholipid Fatty Acid Fractions from Ribeyes of Angus Cattle

    PubMed Central

    Buchanan, Justin W.; Reecy, James M.; Garrick, Dorian J.; Duan, Qing; Beitz, Don C.; Koltes, James E.; Saatchi, Mahdi; Koesterke, Lars; Mateescu, Raluca G.

    2016-01-01

    The fatty acid profile of beef is a complex trait that can benefit from gene-interaction network analysis to understand relationships among loci that contribute to phenotypic variation. Phenotypic measures of fatty acid profile from triacylglycerol and phospholipid fractions of longissimus muscle, pedigree information, and Illumina 54 k bovine SNP genotypes were utilized to derive an annotated gene network associated with fatty acid composition in 1,833 Angus beef cattle. The Bayes-B statistical model was utilized to perform a genome wide association study to estimate associations between 54 k SNP genotypes and 39 individual fatty acid phenotypes within each fraction. Posterior means of the effects were estimated for each of the 54 k SNP and for the collective effects of all the SNP in every 1-Mb genomic window in terms of the proportion of genetic variance explained by the window. Windows that explained the largest proportions of genetic variance for individual lipids were found in the triacylglycerol fraction. There was almost no overlap in the genomic regions explaining variance between the triacylglycerol and phospholipid fractions. Partial correlations were used to identify correlated regions of the genome for the set of largest 1 Mb windows that explained up to 35% genetic variation in either fatty acid fraction. SNP were allocated to windows based on the bovine UMD3.1 assembly. Gene network clusters were generated utilizing a partial correlation and information theory algorithm. Results were used in conjunction with network scoring and visualization software to analyze correlated SNP across 39 fatty acid phenotypes to identify SNP of significance. Significant pathways implicated in fatty acid metabolism through GO term enrichment analysis included homeostasis of number of cells, homeostatic process, coenzyme/cofactor activity, and immunoglobulin. These results suggest different metabolic pathways regulate the development of different types of lipids found in

  5. Random array grid collimator

    DOEpatents

    Fenimore, E.E.

    1980-08-22

    A hexagonally shaped quasi-random no-two-holes touching grid collimator. The quasi-random array grid collimator eliminates contamination from small angle off-axis rays by using a no-two-holes-touching pattern which simultaneously provides for a self-supporting array increasng throughput by elimination of a substrate. The presentation invention also provides maximum throughput using hexagonally shaped holes in a hexagonal lattice pattern for diffraction limited applications. Mosaicking is also disclosed for reducing fabrication effort.

  6. The NMR phased array.

    PubMed

    Roemer, P B; Edelstein, W A; Hayes, C E; Souza, S P; Mueller, O M

    1990-11-01

    We describe methods for simultaneously acquiring and subsequently combining data from a multitude of closely positioned NMR receiving coils. The approach is conceptually similar to phased array radar and ultrasound and hence we call our techniques the "NMR phased array." The NMR phased array offers the signal-to-noise ratio (SNR) and resolution of a small surface coil over fields-of-view (FOV) normally associated with body imaging with no increase in imaging time. The NMR phased array can be applied to both imaging and spectroscopy for all pulse sequences. The problematic interactions among nearby surface coils is eliminated (a) by overlapping adjacent coils to give zero mutual inductance, hence zero interaction, and (b) by attaching low input impedance preamplifiers to all coils, thus eliminating interference among next nearest and more distant neighbors. We derive an algorithm for combining the data from the phased array elements to yield an image with optimum SNR. Other techniques which are easier to implement at the cost of lower SNR are explored. Phased array imaging is demonstrated with high resolution (512 x 512, 48-cm FOV, and 32-cm FOV) spin-echo images of the thoracic and lumbar spine. Data were acquired from four-element linear spine arrays, the first made of 12-cm square coils and the second made of 8-cm square coils. When compared with images from a single 15 x 30-cm rectangular coil and identical imaging parameters, the phased array yields a 2X and 3X higher SNR at the depth of the spine (approximately 7 cm). PMID:2266841

  7. Association of rs2072183 SNP and serum lipid levels in the Mulao and Han populations

    PubMed Central

    2012-01-01

    Background Niemann-pick C1-like 1 (NPC1L1) is a key protein for intestinal cholesterol transportation. Common single nucleotide polymorphisms (SNPs) in the NPC1L1 gene have been associated with cholesterol absorption and serum lipid levels. The present study was undertaken to explore the possible association of NPC1L1 rs2072183 1735 C > G SNP and several environmental factors with serum lipid levels in the Mulao and Han populations. Methods Genotyping of the rs2072183 SNP was performed in 688 subjects of Mulao and 738 participants of Han Chinese. The interactions between NPC1L1 1735 C > G polymorphism and several environmental factors on serum lipid phenotypes were tested using the factorial design covariance analysis after controlling for potential confounders. Results The frequency of G allele was lower in Mulao than in Han (29.72% vs. 37.26%, P < 0.001). The frequency of CC, CG and GG genotypes was 49.85%, 40.84% and 9.31% in Mulao, and 39.30%, 46.88% and 13.82% in Han (P < 0.001); respectively. The levels of low-density lipoprotein cholesterol (LDL-C), apolipoprotein (Apo) B and the ratio of ApoAI/ApoB in Han but not in Mulao were different among the three genotypes (P < 0.05 for all), the subjects with GG and CG genotypes had higher LDL-C, ApoB levels and lower ApoAI/ApoB ratio than the subjects with CC genotype. Subgroup analysis showed that the G allele carriers in Han had higher total cholesterol (TC), LDL-C and ApoB levels in males (P < 0.05) and lower ApoAI/ApoB ratio in both sexes (P < 0.05) than the G allele noncarriers. The G allele carriers in Mulao had higher TC and LDL-C levels in males (P < 0.05) and lower high-density lipoprotein cholesterol (HDL-C) levels in both sexes (P < 0.05) than the G allele noncarriers. Serum TC, LDL-C, ApoB levels and ApoAI/ApoB ratio were correlated with genotypes in Han males (P < 0.05) but not in females. Serum lipid parameters were also correlated with several environmental

  8. Carbon nanotube array actuators

    NASA Astrophysics Data System (ADS)

    Geier, S.; Mahrholz, T.; Wierach, P.; Sinapius, M.

    2013-09-01

    Experimental investigations of highly vertically aligned carbon nanotubes (CNTs), also known as CNT-arrays, are the main focus of this paper. The free strain as result of an active material behavior is analyzed via a novel experimental setup. Previous test experiences of papers made of randomly oriented CNTs, also called Bucky-papers, reveal comparably low free strain. The anisotropy of aligned CNTs promises better performance. Via synthesis techniques like chemical vapor deposition (CVD) or plasma enhanced CVD (PECVD), highly aligned arrays of multi-walled carbon nanotubes (MWCNTs) are synthesized. Two different types of CNT-arrays are analyzed, morphologically first, and optically tested for their active characteristics afterwards. One type of the analyzed arrays features tube lengths of 750-2000 μm with a large variety of diameters between 20 and 50 nm and a wave-like CNT-shape. The second type features a maximum, almost uniform, length of 12 μm and a constant diameter of 50 nm. Different CNT-lengths and array types are tested due to their active behavior. As result of the presented tests, it is reported that the quality of orientation is the most decisive property for excellent active behavior. Due to their alignment, CNT-arrays feature the opportunity to clarify the actuation mechanism of architectures made of CNTs.

  9. No association of IFNG+874T/A SNP and NOS2A-954G/C SNP variants with nitric oxide radical serum levels or susceptibility to tuberculosis in a Brazilian population subset.

    PubMed

    Leandro, Ana Cristina C S; Rocha, Márcia Andrade; Lamoglia-Souza, Andreia; VandeBerg, John L; Rolla, Valeria Cavalcanti; Bonecini-Almeida, Maria da Gloria

    2013-01-01

    Tuberculosis (TB) is one of the most common infectious diseases in the world. Mycobacterium tuberculosis infection leads to pulmonary active disease in approximately 5-10% of exposed individuals. Both bacteria- and host-related characteristics influence latent infection and disease. Host genetic predisposition to develop TB may involve multiple genes and their polymorphisms. It was reported previously that interferon gamma (IFN-γ) and nitric oxide synthase 2 (NOS2) are expressed on alveolar macrophages from TB patients and are responsible for bacilli control; thus, we aimed this study at genotyping single nucleotide polymorphisms IFNG+874T/A SNP and NOS2A-954G/C SNP to estimate their role on TB susceptibility and determine whether these polymorphisms influence serum nitrite and NOx(-) production. This case-control study enrolled 172 TB patients and 179 healthy controls. Neither polymorphism was associated with susceptibility to TB. NOS2A-954G/C SNP was not associated with serum levels of nitrite and NOx(-). These results indicate that variants of IFNG+874T/A SNP and NOS2A-954G/C SNP do not influence TB susceptibility or the secretion of nitric oxide radicals in the study population. PMID:24024215

  10. Identification of SNP and SSR markers in eggplant using RAD tag sequencing

    PubMed Central

    2011-01-01

    Background The eggplant (Solanum melongena L.) genome is relatively unexplored, especially compared to those of the other major Solanaceae crops tomato and potato. In particular, no SNP markers are publicly available; on the other hand, over 1,000 SSR markers were developed and publicly available. We have combined the recently developed Restriction-site Associated DNA (RAD) approach with Illumina DNA sequencing for rapid and mass discovery of both SNP and SSR markers for eggplant. Results RAD tags were generated from the genomic DNA of a pair of eggplant mapping parents, and sequenced to produce ~17.5 Mb of sequences arrangeable into ~78,000 contigs. The resulting non-redundant genomic sequence dataset consisted of ~45,000 sequences, of which ~29% were putative coding sequences and ~70% were in common between the mapping parents. The shared sequences allowed the discovery of ~10,000 SNPs and nearly 1,000 indels, equivalent to a SNP frequency of 0.8 per Kb and an indel frequency of 0.07 per Kb. Over 2,000 of the SNPs are likely to be mappable via the Illumina GoldenGate assay. A subset of 384 SNPs was used to successfully fingerprint a panel of eggplant germplasm, producing a set of informative diversity data. The RAD sequences also included nearly 2,000 putative SSRs, and primer pairs were designed to amplify 1,155 loci. Conclusion The high throughput sequencing of the RAD tags allowed the discovery of a large number of DNA markers, which will prove useful for extending our current knowledge of the genome organization of eggplant, for assisting in marker-aided selection and for carrying out comparative genomic analyses within the Solanaceae family. PMID:21663628

  11. SNP variation in ADRB3 gene reflects the breed difference of sheep populations.

    PubMed

    Wu, Jianliang; Qiao, Liying; Liu, Jianhua; Yuan, Yanan; Liu, Wenzhong

    2012-08-01

    The β3-adrenergic receptor (ADRB3), a G-protein coupled receptor, plays a major role in energy metabolism and regulation of lipolysis and homeostasis. We detect the single nucleotide polymorphism (SNP) variation in full-length sequence of ovine ADRB3 gene in 12 domestic sheep populations within four types by polymerase chain reaction-single strand conformation polymorphism and sequencing to reveal the breed difference. Twenty-two SNPs, 12 of which in the exon 1 and ten in the intron, were detected, and 12 new exonic and four new intronic SNPs were found. Most SNPs presented in Shanxi Dam Line and least ones in Dorset. The average SNP number in both meat and dual purpose for meat and wool breeds was significantly higher than general and dual purpose breeds for wool and meat. Frequency of each SNP in studied breeds or types was different. The 18C Del and 1617T Ins majorly existed in dual purpose breeds for wool and meat. The 25A Del, 119C>G and 130C>T were mostly found in the meat and dual purpose for meat and wool breeds. The 1764C>A more frequently presented in meat than in other types. The majority of variations came from within the populations as suggested by analysis of molecular variance. Close relationship presented among the Chinese and western breeds, respectively. In conclusion, SNPs of ovine ADRB3 gene can reflect the breed difference and within- and between-population variations, and to a great extent, the breed relationship. PMID:22711302

  12. A Whole Methylome CpG-SNP Association Study of Psychosis in Blood and Brain Tissue.

    PubMed

    van den Oord, Edwin J C G; Clark, Shaunna L; Xie, Lin Ying; Shabalin, Andrey A; Dozmorov, Mikhail G; Kumar, Gaurav; Vladimirov, Vladimir I; Magnusson, Patrik K E; Aberg, Karolina A

    2016-07-01

    Mutated CpG sites (CpG-SNPs) are potential hotspots for human diseases because in addition to the sequence variation they may show individual differences in DNA methylation. We performed methylome-wide association studies (MWAS) to test whether methylation differences at those sites were associated with schizophrenia. We assayed all common CpG-SNPs with methyl-CpG binding domain protein-enriched genome sequencing (MBD-seq) using DNA extracted from 1408 blood samples and 66 postmortem brain samples (BA10) of schizophrenia cases and controls. Seven CpG-SNPs passed our FDR threshold of 0.1 in the blood MWAS. Of the CpG-SNPs methylated in brain, 94% were also methylated in blood. This significantly exceeded the 46.2% overlap expected by chance (P-value < 1.0×10(-8)) and justified replicating findings from blood in brain tissue. CpG-SNP rs3796293 in IL1RAP replicated (P-value = .003) with the same direction of effects. This site was further validated through targeted bisulfite pyrosequencing in 736 independent case-control blood samples (P-value < 9.5×10(-4)). Our top result in the brain MWAS (P-value = 8.8×10(-7)) was CpG-SNP rs16872141 located in the potential promoter of ENC1. Overall, our results suggested that CpG-SNP methylation may reflect effects of environmental insults and can provide biomarkers in blood that could potentially improve disease management. PMID:26656881

  13. PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations

    PubMed Central

    Bendl, Jaroslav; Stourac, Jan; Salanda, Ondrej; Pavelka, Antonin; Wieben, Eric D.; Zendulka, Jaroslav; Brezovsky, Jan; Damborsky, Jiri

    2014-01-01

    Single nucleotide variants represent a prevalent form of genetic variation. Mutations in the coding regions are frequently associated with the development of various genetic diseases. Computational tools for the prediction of the effects of mutations on protein function are very important for analysis of single nucleotide variants and their prioritization for experimental characterization. Many computational tools are already widely employed for this purpose. Unfortunately, their comparison and further improvement is hindered by large overlaps between the training datasets and benchmark datasets, which lead to biased and overly optimistic reported performances. In this study, we have constructed three independent datasets by removing all duplicities, inconsistencies and mutations previously used in the training of evaluated tools. The benchmark dataset containing over 43,000 mutations was employed for the unbiased evaluation of eight established prediction tools: MAPP, nsSNPAnalyzer, PANTHER, PhD-SNP, PolyPhen-1, PolyPhen-2, SIFT and SNAP. The six best performing tools were combined into a consensus classifier PredictSNP, resulting into significantly improved prediction performance, and at the same time returned results for all mutations, confirming that consensus prediction represents an accurate and robust alternative to the predictions delivered by individual tools. A user-friendly web interface enables easy access to all eight prediction tools, the consensus classifier PredictSNP and annotations from the Protein Mutant Database and the UniProt database. The web server and the datasets are freely available to the academic community at http://loschmidt.chemi.muni.cz/predictsnp. PMID:24453961

  14. Plastid DNA sequencing and nuclear SNP genotyping help resolve the puzzle of central American Platanus

    PubMed Central

    De Castro, Olga; Di Maio, Antonietta; Lozada García, José Armando; Piacenti, Danilo; Vázquez-Torres, Mario; De Luca, Paolo

    2013-01-01

    Background and Aims Recent research on the history of Platanus reveals that hybridization phenomena occurred in the central American species. This study has two goals: to help resolve the evolutive puzzle of central American Platanus, and to test the potential of real-time polymerase chain reaction (PCR) for detecting ancient hybridization. Methods Sequencing of a uniparental plastid DNA marker [psbA-trnH(GUG) intergenic spacer] and qualitative and quantitative single nucleotide polymorphism (SNP) genotyping of biparental nuclear ribosomal DNA (nrDNA) markers [LEAFY intron 2 (LFY-i2) and internal transcribed spacer 2 (ITS2)] were used. Key Results Based on the SNP genotyping results, several Platanus accessions show the presence of hybridization/introgression, including some accessions of P. rzedowskii and of P. mexicana var. interior and one of P. mexicana var. mexicana from Oaxaca (= P. oaxacana). Based on haplotype analyses of the psbA-trnH spacer, five haplotypes were detected. The most common of these is present in taxa belonging to P. orientalis, P. racemosa sensu lato, some accessions of P. occidentalis sensu stricto (s.s.) from Texas, P. occidentalis var. palmeri, P. mexicana s.s. and P. rzedowskii. This is highly relevant to genetic relationships with the haplotypes present in P. occidentalis s.s. and P. mexicana var. interior. Conclusions Hybridization and introgression events between lineages ancestral to modern central and eastern North American Platanus species occurred. Plastid haplotypes and qualitative and quantitative SNP genotyping provide information critical for understanding the complex history of Mexican Platanus. Compared with the usual molecular techniques of sub-cloning, sequencing and genotyping, real-time PCR assay is a quick and sensitive technique for analysing complex evolutionary patterns. PMID:23798602

  15. Role of an SNP in Alternative Splicing of Bovine NCF4 and Mastitis Susceptibility

    PubMed Central

    Wang, Xiuge; Yang, Chunhong; Sun, Yan; Jiang, Qiang; Wang, Fei; Li, Mengjiao; Zhong, Jifeng; Huang, Jinming

    2015-01-01

    Neutrophil cytosolic factor 4 (NCF4) is component of the nicotinamide dinucleotide phosphate oxidase complex, a key factor in biochemical pathways and innate immune responses. In this study, splice variants and functional single-nucleotide polymorphism (SNP) of NCF4 were identified to determine the variability and association of the gene with susceptibility to bovine mastitis characterized by inflammation. A novel splice variant, designated as NCF4-TV and characterized by the retention of a 48 bp sequence in intron 9, was detected in the mammary gland tissues of infected cows. The expression of the NCF4-reference main transcript in the mastitic mammary tissues was higher than that in normal tissues. A novel SNP, g.18174 A>G, was also found in the retained 48 bp region of intron 9. To determine whether NCF4-TV could be due to the g.18174 A>G mutation, we constructed two mini-gene expression vectors with the wild-type or mutant NCF4 g.18174 A>G fragment. The vectors were then transiently transfected into 293T cells, and alternative splicing of NCF4 was analyzed by reverse transcription-PCR and sequencing. Mini-gene splicing assay demonstrated that the aberrantly spliced NCF4-TV with 48 bp retained fragment in intron 9 could be due to g.18174 A>G, which was associated with milk somatic count score and increased risk of mastitis infection in cows. NCF4 expression was also regulated by alternative splicing. This study proposes that NCF4 splice variants generated by functional SNP are important risk factors for mastitis susceptibility in dairy cows. PMID:26600390

  16. Integrated platform for detection of DNA sequence variants using capillary array electrophoresis

    SciTech Connect

    Qingbro, Li; Liu, Zhaowei; Monroe, Heidi M; Culiat, Cymbeline T

    2002-08-01

    We have developed a highly versatile platform that performs temperature gradient capillary electrophoresis (TGCE) for mutation/single-nucleotide polymorphism (SNP) detection, sequencing and mutation/SNP genotyping for identification of sequence variants on an automated 24-, 96- or 192-capillary array instrument. In the first mode, multiple DNA samples consisting of homoduplexes and heteroduplexes are separated by CE, during which a temperature gradient is applied that covers all possible temperatures of 50% melting equilibrium (Tms) for the samples. The differences in Tms result in separation of homoduplexes from heteroduplexes, thereby identifying the presence of DNA variants. The sequencing mode is then used to determine the exact location of the mutation/SNPs in the DNA variants. The first two modes allow the rapid identification of variants from the screening of a large number of samples. Only the variants need to be sequenced. The third mode utilizes multiplexed single-base extensions (SBEs) to survey mutations and SNPs at the known sites of DNA sequence. The TGCE approach combined with sequencing and SBE is fast and cost-effective for high-throughput mutation/SNP detection.

  17. CLUSTAG & WCLUSTAG: Hierarchical Clustering Algorithms for Efficient Tag-SNP Selection

    NASA Astrophysics Data System (ADS)

    Ao, Sio-Iong

    More than 6 million single nucleotide polymorphisms (SNPs) in the human genome have been genotyped by the HapMap project. Although only a pro portion of these SNPs are functional, all can be considered as candidate markers for indirect association studies to detect disease-related genetic variants. The complete screening of a gene or a chromosomal region is nevertheless an expensive undertak ing for association studies. A key strategy for improving the efficiency of association studies is to select a subset of informative SNPs, called tag SNPs, for analysis. In the chapter, hierarchical clustering algorithms have been proposed for efficient tag SNP selection.

  18. mrsFAST-Ultra: a compact, SNP-aware mapper for high performance sequencing applications

    PubMed Central

    Hach, Faraz; Sarrafi, Iman; Hormozdiari, Farhad; Alkan, Can; Eichler, Evan E.; Sahinalp, S. Cenk

    2014-01-01

    High throughput sequencing (HTS) platforms generate unprecedented amounts of data that introduce challenges for processing and downstream analysis. While tools that report the ‘best’ mapping location of each read provide a fast way to process HTS data, they are not suitable for many types of downstream analysis such as structural variation detection, where it is important to report multiple mapping loci for each read. For this purpose we introduce mrsFAST-Ultra, a fast, cache oblivious, SNP-aware aligner that can handle the multi-mapping of HTS reads very efficiently. mrsFAST-Ultra improves mrsFAST, our first cache oblivious read aligner capable of handling multi-mapping reads, through new and compact index structures that reduce not only the overall memory usage but also the number of CPU operations per alignment. In fact the size of the index generated by mrsFAST-Ultra is 10 times smaller than that of mrsFAST. As importantly, mrsFAST-Ultra introduces new features such as being able to (i) obtain the best mapping loci for each read, and (ii) return all reads that have at most n mapping loci (within an error threshold), together with these loci, for any user specified n. Furthermore, mrsFAST-Ultra is SNP-aware, i.e. it can map reads to reference genome while discounting the mismatches that occur at common SNP locations provided by db-SNP; this significantly increases the number of reads that can be mapped to the reference genome. Notice that all of the above features are implemented within the index structure and are not simple post-processing steps and thus are performed highly efficiently. Finally, mrsFAST-Ultra utilizes multiple available cores and processors and can be tuned for various memory settings. Our results show that mrsFAST-Ultra is roughly five times faster than its predecessor mrsFAST. In comparison to newly enhanced popular tools such as Bowtie2, it is more sensitive (it can report 10 times or more mappings per read) and much faster (six times

  19. SNP discovery in non-model organisms using 454 next generation sequencing.

    PubMed

    Wheat, Christopher W

    2012-01-01

    Roche 454 sequencing of the transcriptome has become a standard approach for efficiently obtaining single nucleotide polymorphisms (SNPs) in non-model species. In this chapter, the primary issues facing the development of SNPs from the transcriptome in non-model species are presented: tissue and sampling choices, mRNA preparation, considerations of normalization, pooling and barcoding, how much to sequence, how to assemble the data and assess the assembly, calling transcriptome SNPs, developing these into genomic SNPs, and publishing the work. Discussion also covers the comparison of this approach to RAD-tag sequencing and the potential of using other sequencing platforms for SNP development. PMID:22665274

  20. SNP annotation-based whole genomic prediction and selection: an application to feed efficiency and its component traits in pigs.

    PubMed

    Do, D N; Janss, L L G; Jensen, J; Kadarmideen, H N

    2015-05-01

    The study investigated genetic architecture and predictive ability using genomic annotation of residual feed intake (RFI) and its component traits (daily feed intake [DFI], ADG, and back fat [BF]). A total of 1,272 Duroc pigs had both genotypic and phenotypic records, and the records were split into a training (968 pigs) and a validation dataset (304 pigs) by assigning records as before and after January 1, 2012, respectively. SNP were annotated by 14 different classes using Ensembl variant effect prediction. Predictive accuracy and prediction bias were calculated using Bayesian Power LASSO, Bayesian A, B, and Cπ, and genomic BLUP (GBLUP) methods. Predictive accuracy ranged from 0.508 to 0.531, 0.506 to 0.532, 0.276 to 0.357, and 0.308 to 0.362 for DFI, RFI, ADG, and BF, respectively. BayesCπ100.1 increased accuracy slightly compared to the GBLUP model and other methods. The contribution per SNP to total genomic variance was similar among annotated classes across different traits. Predictive performance of SNP classes did not significantly differ from randomized SNP groups. Genomic prediction has accuracy comparable to observed phenotype, and use of genomic prediction can be cost effective by replacing feed intake measurement. Genomic annotation had less impact on predictive accuracy traits considered here but may be different for other traits. It is the first study to provide useful insights into biological classes of SNP driving the whole genomic prediction for complex traits in pigs. PMID:26020301

  1. Sampled Longest Common Prefix Array

    NASA Astrophysics Data System (ADS)

    Sirén, Jouni

    When augmented with the longest common prefix (LCP) array and some other structures, the suffix array can solve many string processing problems in optimal time and space. A compressed representation of the LCP array is also one of the main building blocks in many compressed suffix tree proposals. In this paper, we describe a new compressed LCP representation: the sampled LCP array. We show that when used with a compressed suffix array (CSA), the sampled LCP array often offers better time/space trade-offs than the existing alternatives. We also show how to construct the compressed representations of the LCP array directly from a CSA.

  2. Imaging antenna arrays

    NASA Technical Reports Server (NTRS)

    Rutledge, D. B.; Muha, M. S.

    1982-01-01

    Many millimeter and far-infrared imaging systems are limited in sensitivity and speed because they depend on a single scanned element. Because of recent advances in planar detectors such as Schottky diodes, superconducting tunnel junctions, and microbolometers, an attractive approach to this problem is a planar antenna array with integrated detectors. A planar line antenna array and optical system for imaging has been developed. The significant advances are a 'reverse-microscope' optical configuration and a modified bow-tie antenna design. In the 'reverse-microscope' configuration, a lens is attached to the bottom of the substrate containing the antennas. Imaging is done through the substrate. This configuration eliminates the troublesome effects of substrate surface waves. The substrate lens has only a single refracting surface, making possible a virtually aplanatic system, with little spherical aberration or coma. The array is characterized by an optical transfer function that is easily measured. An array with 19 dB crosstalk levels between adjacent antennas has been tested and it was found that the array captured 50 percent of the available power. This imaging system was diffraction limited.

  3. Striped tertiary storage arrays

    NASA Technical Reports Server (NTRS)

    Drapeau, Ann L.

    1993-01-01

    Data stripping is a technique for increasing the throughput and reducing the response time of large access to a storage system. In striped magnetic or optical disk arrays, a single file is striped or interleaved across several disks; in a striped tape system, files are interleaved across tape cartridges. Because a striped file can be accessed by several disk drives or tape recorders in parallel, the sustained bandwidth to the file is greater than in non-striped systems, where access to the file are restricted to a single device. It is argued that applying striping to tertiary storage systems will provide needed performance and reliability benefits. The performance benefits of striping for applications using large tertiary storage systems is discussed. It will introduce commonly available tape drives and libraries, and discuss their performance limitations, especially focusing on the long latency of tape accesses. This section will also describe an event-driven tertiary storage array simulator that is being used to understand the best ways of configuring these storage arrays. The reliability problems of magnetic tape devices are discussed, and plans for modeling the overall reliability of striped tertiary storage arrays to identify the amount of error correction required are described. Finally, work being done by other members of the Sequoia group to address latency of accesses, optimizing tertiary storage arrays that perform mostly writes, and compression is discussed.

  4. Folding of Nucleosome Arrays

    NASA Astrophysics Data System (ADS)

    Howell, Steven; Jimenez-Useche, Isabel; Andresen, Kurt; Yuan, Chongli; Qiu, Xiangyun

    2014-03-01

    Chromatin conformation and dynamics is central to gene functions including packaging, regulation, and repair. At the molecular level, the basic building block of chromatin is a nucleosome core particle (NCP) made of 147 base pairs (bp) of dsDNA wrapped around an octamer of histone proteins. These NCPs are connected by short 10-90 bps of linker DNA as beads on a string. Key factors determining the packaging of NCP arrays to form chromatin include ionic condition, linker DNA length, and epigenetic modifications, especially of the histone tails. We have investigated how the conformations of model tetra-NCP arrays are modulated by these factors using small angle x-ray scattering (SAXS). Here we present recent studies of the effects of ion (KCl and MgCl2), linker length, and histone modification (tail deletions) on NCP arrays. Our SAXS measurement makes it possible to learn about both the global compaction of NCP arrays and local inter-NCP spatial correlations within the same array.

  5. Solar array flight experiment

    NASA Technical Reports Server (NTRS)

    1986-01-01

    Emerging satellite designs require increasing amounts of electrical power to operate spacecraft instruments and to provide environments suitable for human habitation. In the past, electrical power was generated by covering rigid honeycomb panels with solar cells. This technology results in unacceptable weight and volume penalties when large amounts of power are required. To fill the need for large-area, lightweight solar arrays, a fabrication technique in which solar cells are attached to a copper printed circuit laminated to a plastic sheet was developed. The result is a flexible solar array with one-tenth the stowed volume and one-third the weight of comparably sized rigid arrays. An automated welding process developed to attack the cells to the printed circuit guarantees repeatable welds that are more tolerant of severe environments than conventional soldered connections. To demonstrate the flight readiness of this technology, the Solar Array Flight Experiment (SAFE) was developed and flown on the space shuttle Discovery in September 1984. The tests showed the modes and frequencies of the array to be very close to preflight predictions. Structural damping, however, was higher than anticipated. Electrical performance of the active solar panel was also tested. The flight performance and postflight data evaluation are described.

  6. SNPsplit: Allele-specific splitting of alignments between genomes with known SNP genotypes

    PubMed Central

    Krueger, Felix; Andrews, Simon R.

    2016-01-01

    Sequencing reads overlapping polymorphic sites in diploid mammalian genomes may be assigned to one allele or the other. This holds the potential to detect gene expression, chromatin modifications, DNA methylation or nuclear interactions in an allele-specific fashion. SNPsplit is an allele-specific alignment sorter designed to read files in SAM/BAM format and determine the allelic origin of reads or read-pairs that cover known single nucleotide polymorphic (SNP) positions. For this to work libraries must have been aligned to a genome in which all known SNP positions were masked with the ambiguity base 'N' and aligned using a suitable mapping program such as Bowtie2, TopHat, STAR, HISAT2, HiCUP or Bismark. SNPsplit also provides an automated solution to generate N-masked reference genomes for hybrid mouse strains based on the variant call information provided by the Mouse Genomes Project. The unique ability of SNPsplit to work with various different kinds of sequencing data including RNA-Seq, ChIP-Seq, Bisulfite-Seq or Hi-C opens new avenues for the integrative exploration of allele-specific data. PMID:27429743

  7. A Genome Wide Survey of SNP Variation Reveals the Genetic Structure of Sheep Breeds

    PubMed Central

    Kijas, James W.; Townley, David; Dalrymple, Brian P.; Heaton, Michael P.; Maddox, Jillian F.; McGrath, Annette; Wilson, Peter; Ingersoll, Roxann G.; McCulloch, Russell; McWilliam, Sean; Tang, Dave; McEwan, John; Cockett, Noelle; Oddy, V. Hutton; Nicholas, Frank W.; Raadsma, Herman

    2009-01-01

    The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identifying the first genome-wide set of SNP for sheep, we report on levels of genetic variability both within and between a diverse sample of ovine populations. Then, using cluster analysis and the partitioning of genetic variation, we demonstrate sheep are characterised by weak phylogeographic structure, overlapping genetic similarity and generally low differentiation which is consistent with their short evolutionary history. The degree of population substructure was, however, sufficient to cluster individuals based on geographic origin and known breed history. Specifically, African and Asian populations clustered separately from breeds of European origin sampled from Australia, New Zealand, Europe and North America. Furthermore, we demonstrate the presence of stratification within some, but not all, ovine breeds. The results emphasize that careful documentation of genetic structure will be an essential prerequisite when mapping the genetic basis of complex traits. Furthermore, the identification of a subset of SNP able to assign individuals into broad groupings demonstrates even a small panel of markers may be suitable for applications such as traceability. PMID:19270757

  8. GStream: Improving SNP and CNV Coverage on Genome-Wide Association Studies

    PubMed Central

    Alonso, Arnald; Marsal, Sara; Tortosa, Raül; Canela-Xandri, Oriol; Julià, Antonio

    2013-01-01

    We present GStream, a method that combines genome-wide SNP and CNV genotyping in the Illumina microarray platform with unprecedented accuracy. This new method outperforms previous well-established SNP genotyping software. More importantly, the CNV calling algorithm of GStream dramatically improves the results obtained by previous state-of-the-art methods and yields an accuracy that is close to that obtained by purely CNV-oriented technologies like Comparative Genomic Hybridization (CGH). We demonstrate the superior performance of GStream using microarray data generated from HapMap samples. Using the reference CNV calls generated by the 1000 Genomes Project (1KGP) and well-known studies on whole genome CNV characterization based either on CGH or genotyping microarray technologies, we show that GStream can increase the number of reliably detected variants up to 25% compared to previously developed methods. Furthermore, the increased genome coverage provided by GStream allows the discovery of CNVs in close linkage disequilibrium with SNPs, previously associated with disease risk in published Genome-Wide Association Studies (GWAS). These results could provide important insights into the biological mechanism underlying the detected disease risk association. With GStream, large-scale GWAS will not only benefit from the combined genotyping of SNPs and CNVs at an unprecedented accuracy, but will also take advantage of the computational efficiency of the method. PMID:23844243

  9. Linkage Disequilibrium Estimation of Chinese Beef Simmental Cattle Using High-density SNP Panels

    PubMed Central

    Zhu, M.; Zhu, B.; Wang, Y. H.; Wu, Y.; Xu, L.; Guo, L. P.; Yuan, Z. R.; Zhang, L. P.; Gao, X.; Gao, H. J.; Xu, S. Z.; Li, J. Y.

    2013-01-01

    Linkage disequilibrium (LD) plays an important role in genomic selection and mapping quantitative trait loci (QTL). In this study, the pattern of LD and effective population size (Ne) were investigated in Chinese beef Simmental cattle. A total of 640 bulls were genotyped with IlluminaBovinSNP50BeadChip and IlluminaBovinHDBeadChip. We estimated LD for each autosomal chromosome at the distance between two random SNPs of <0 to 25 kb, 25 to 50 kb, 50 to 100 kb, 100 to 500 kb, 0.5 to 1 Mb, 1 to 5 Mb and 5 to 10 Mb. The mean values of r2 were 0.30, 0.16 and 0.08, when the separation between SNPs ranged from 0 to 25 kb to 50 to 100 kb and then to 0.5 to 1 Mb, respectively. The LD estimates decreased as the distance increased in SNP pairs, and increased with the increase of minor allelic frequency (MAF) and with the decrease of sample sizes. Estimates of effective population size for Chinese beef Simmental cattle decreased in the past generations and Ne was 73 at five generations ago. PMID:25049849

  10. Fast and Rigorous Computation of Gene and Pathway Scores from SNP-Based Summary Statistics

    PubMed Central

    Rueedi, Rico; Kutalik, Zoltán; Bergmann, Sven

    2016-01-01

    Integrating single nucleotide polymorphism (SNP) p-values from genome-wide association studies (GWAS) across genes and pathways is a strategy to improve statistical power and gain biological insight. Here, we present Pascal (Pathway scoring algorithm), a powerful tool for computing gene and pathway scores from SNP-phenotype association summary statistics. For gene score computation, we implemented analytic and efficient numerical solutions to calculate test statistics. We examined in particular the sum and the maximum of chi-squared statistics, which measure the strongest and the average association signals per gene, respectively. For pathway scoring, we use a modified Fisher method, which offers not only significant power improvement over more traditional enrichment strategies, but also eliminates the problem of arbitrary threshold selection inherent in any binary membership based pathway enrichment approach. We demonstrate the marked increase in power by analyzing summary statistics from dozens of large meta-studies for various traits. Our extensive testing indicates that our method not only excels in rigorous type I error control, but also results in more biologically meaningful discoveries. PMID:26808494

  11. Performance of different SNP panels for parentage testing in two East Asian cattle breeds.

    PubMed

    Strucken, E M; Gudex, B; Ferdosi, M H; Lee, H K; Song, K D; Gibson, J P; Kelly, M; Piper, E K; Porto-Neto, L R; Lee, S H; Gondro, C

    2014-08-01

    The International Society for Animal Genetics (ISAG) proposed a panel of single nucleotide polymorphisms (SNPs) for parentage testing in cattle (a core panel of 100 SNPs and an additional list of 100 SNPs). However, markers specific to East Asian taurine cattle breeds were not included, and no information is available as to whether the ISAG panel performs adequately for these breeds. We tested ISAG's core (100 SNP) and full (200 SNP) panels on two East Asian taurine breeds: the Korean Hanwoo and the Japanese Wagyu, the latter from the Australian herd. Even though the power of exclusion was high at 0.99 for both ISAG panels, the core panel performed poorly with 3.01% false-positive assignments in the Hanwoo population and 3.57% in the Wagyu. The full ISAG panel identified all sire-offspring relations correctly in both populations with 0.02% of relations wrongly excluded in the Hanwoo population. Based on these results, we created and tested two population-specific marker panels: one for the Wagyu population, which showed no false-positive assignments with either 100 or 200 SNPs, and a second panel for the Hanwoo, which still had some false-positive assignments with 100 SNPs but no false positives using 200 SNPs. In conclusion, for parentage assignment in East Asian cattle breeds, only the full ISAG panel is adequate for parentage testing. If fewer markers should be used, it is advisable to use population-specific markers rather than the ISAG panel. PMID:24730981

  12. SNP discovery in complex allotetraploid genomes (Gossypium spp., Malvaceae) using genotyping by sequencing1

    PubMed Central

    Logan-Young, Carla Jo; Yu, John Z.; Verma, Surender K.; Percy, Richard G.; Pepper, Alan E.

    2015-01-01

    Premise of the study: Single-nucleotide polymorphism (SNP) marker discovery in plants with complex allotetraploid genomes is often confounded by the presence of homeologous loci (along with paralogous and orthologous loci). Here we present a strategy to filter for SNPs representing orthologous loci. Methods and Results: Using Illumina next-generation sequencing, 54 million reads were collected from restriction enzyme–digested DNA libraries of a diversity of Gossypium taxa. Loci with one to three SNPs were discovered using the Stacks software package, yielding 25,529 new cotton SNP combinations, including those that are polymorphic at both interspecific and intraspecific levels. Frequencies of predicted dual-homozygous (aa/bb) marker polymorphisms ranged from 6.7–11.6% of total shared fragments in intraspecific comparisons and from 15.0–16.4% in interspecific comparisons. Conclusions: This resource provides dual-homozygous (aa/bb) marker polymorphisms. Both in silico and experimental validation efforts demonstrated that these markers are enriched for single orthologous loci that are homozygous for alternative alleles. PMID:25798340

  13. Use of Microsatellite and SNP Markers for Biotype Characterization in Hessian Fly

    PubMed Central

    Crane, Yan Ma; Cambron, Sue E.; Crane, Charles F.; Shukle, Richard H.

    2015-01-01

    Exploration of the biotype structure of Hessian fly, Mayetiola destructor (Say) (Diptera: Cecidomyiidae), would improve our knowledge regarding variation in virulence phenotypes and difference in genetic background. Microsatellites (simple sequence repeats) and single-nucleotide polymorphisms (SNPs) are highly variable genetic markers that are widely used in population genetic studies. This study developed and tested a panel of 18 microsatellite and 22 SNP markers to investigate the genetic structure of nine Hessian fly biotypes: B, C, D, E, GP, L, O, vH9, and vH13. The simple sequence repeats were more polymorphic than the SNP markers, and their neighbor-joining trees differed in consequence. Microsatellites suggested a simple geographic association of related biotypes that did not progressively gain virulence with increasing genetic distance from a founder type. Use of the k-means clustering algorithm in the STRUCTURE program shows that the nine biotypes comprise six to eight populations that are related to geography or history within laboratory cultures. PMID:26543089

  14. SNP diversity within and among Brassica rapa accessions reveals no geographic differentiation.

    PubMed

    Tanhuanpää, P; Erkkilä, M; Tenhola-Roininen, T; Tanskanen, J; Manninen, O

    2016-01-01

    Genetic diversity was studied in a collection of 61 accessions of Brassica rapa, which were mostly oil-type turnip rapes but also included two oil-type subsp. dichotoma and five subsp. trilocularis accessions, as well as three leaf-type subspecies (subsp. japonica, pekinensis, and chinensis) and five turnip cultivars (subsp. rapa). Two-hundred and nine SNP markers, which had been discovered by amplicon resequencing, were used to genotype 893 plants from the B. rapa collection using Illumina BeadXpress. There was great variation in the diversity indices between accessions. With STRUCTURE analysis, the plant collection could be divided into three groups that seemed to correspond to morphotype and flowering habit but not to geography. According to AMOVA analysis, 65% of the variation was due to variation within accessions, 25% among accessions, and 10% among groups. A smaller subset of the plant collection, 12 accessions, was also studied with 5727 GBS-SNPs. Diversity indices obtained with GBS-SNPs correlated well with those obtained with Illumina BeadXpress SNPs. The developed SNP markers have already been used and will be used in future plant breeding programs as well as in mapping and diversity studies. PMID:26694015

  15. PrimerMapper: high throughput primer design and graphical assembly for PCR and SNP detection.

    PubMed

    O'Halloran, Damien M

    2016-01-01

    Primer design represents a widely employed gambit in diverse molecular applications including PCR, sequencing, and probe hybridization. Variations of PCR, including primer walking, allele-specific PCR, and nested PCR provide specialized validation and detection protocols for molecular analyses that often require screening large numbers of DNA fragments. In these cases, automated sequence retrieval and processing become important features, and furthermore, a graphic that provides the user with a visual guide to the distribution of designed primers across targets is most helpful in quickly ascertaining primer coverage. To this end, I describe here, PrimerMapper, which provides a comprehensive graphical user interface that designs robust primers from any number of inputted sequences while providing the user with both, graphical maps of primer distribution for each inputted sequence, and also a global assembled map of all inputted sequences with designed primers. PrimerMapper also enables the visualization of graphical maps within a browser and allows the user to draw new primers directly onto the webpage. Other features of PrimerMapper include allele-specific design features for SNP genotyping, a remote BLAST window to NCBI databases, and remote sequence retrieval from GenBank and dbSNP. PrimerMapper is hosted at GitHub and freely available without restriction. PMID:26853558

  16. MultiBLUP: improved SNP-based prediction for complex traits.

    PubMed

    Speed, Doug; Balding, David J

    2014-09-01

    BLUP (best linear unbiased prediction) is widely used to predict complex traits in plant and animal breeding, and increasingly in human genetics. The BLUP mathematical model, which consists of a single random effect term, was adequate when kinships were measured from pedigrees. However, when genome-wide SNPs are used to measure kinships, the BLUP model implicitly assumes that all SNPs have the same effect-size distribution, which is a severe and unnecessary limitation. We propose MultiBLUP, which extends the BLUP model to include multiple random effects, allowing greatly improved prediction when the random effects correspond to classes of SNPs with distinct effect-size variances. The SNP classes can be specified in advance, for example, based on SNP functional annotations, and we also provide an adaptive procedure for determining a suitable partition of SNPs. We apply MultiBLUP to genome-wide association data from the Wellcome Trust Case Control Consortium (seven diseases), and from much larger studies of celiac disease and inflammatory bowel disease, finding that it consistently provides better prediction than alternative methods. Moreover, MultiBLUP is computationally very efficient; for the largest data set, which includes 12,678 individuals and 1.5 M SNPs, the total analysis can be run on a single desktop PC in less than a day and can be parallelized to run even faster. Tools to perform MultiBLUP are freely available in our software LDAK. PMID:24963154

  17. Coding region SNP analysis to enhance dog mtDNA discrimination power in forensic casework.

    PubMed

    Verscheure, Sophie; Backeljau, Thierry; Desmyter, Stijn

    2015-01-01

    The high population frequencies of three control region haplotypes contribute to the low discrimination power of the dog mtDNA control region. It also diminishes the evidential power of a match with one of these haplotypes in forensic casework. A mitochondrial genome study of 214 Belgian dogs suggested 26 polymorphic coding region sites that successfully resolved dogs with the three most frequent control region haplotypes. In this study, three SNP assays were developed to determine the identity of the 26 informative sites. The control region of 132 newly sampled dogs was sequenced and added to the study of 214 dogs. The assays were applied to 58 dogs of the haplotypes of interest, which confirmed their suitability for enhancing dog mtDNA discrimination power. In the Belgian population study of 346 dogs, the set of 26 sites divided the dogs into 25 clusters of mtGenome sequences with substantially lower population frequency estimates than their control region sequences. In case of a match with one of the three control region haplotypes, using these three SNP assays in conjunction with control region sequencing would augment the exclusion probability of dog mtDNA analysis from 92.9% to 97.0%. PMID:25299153

  18. Differentiation of drug and non-drug Cannabis using a single nucleotide polymorphism (SNP) assay.

    PubMed

    Rotherham, D; Harbison, S A

    2011-04-15

    Cannabis sativa is both an illegal drug and a legitimate crop. The differentiation of illegal drug Cannabis from non-drug forms of Cannabis is relevant in the context of the growth of fibre and seed oil varieties of Cannabis for commercial purposes. This differentiation is currently determined based on the levels of tetrahydrocannabinol (THC) in adult plants. DNA based methods have the potential to assay Cannabis material unsuitable for analysis using conventional means including seeds, pollen and severely degraded material. The purpose of this research was to develop a single nucleotide polymorphism (SNP) assay for the differentiation of "drug" and "non-drug"Cannabis plants. An assay was developed based on four polymorphisms within a 399 bp fragment of the tetrahydrocannabinolic acid (THCA) synthase gene, utilising the snapshot multiplex kit. This SNP assay was tested on 94 Cannabis plants, which included 10 blind samples, and was able to differentiate between "drug" and "non-drug"Cannabis in all cases, while also differentiating between Cannabis and other species. Non-drug plants were found to be homozygous at the four sites assayed while drug Cannabis plants were either homozygous or heterozygous. PMID:21036496

  19. SNP Miniplexes for Individual Identification of Random-Bred Domestic Cats.

    PubMed

    Brooks, Ashley; Creighton, Erica K; Gandolfi, Barbara; Khan, Razib; Grahn, Robert A; Lyons, Leslie A

    2016-05-01

    Phenotypic and genotypic characteristics of the cat can be obtained from