Schoeman, Elizna M; Lopez, Genghis H; McGowan, Eunike C; Millard, Glenda M; O'Brien, Helen; Roulis, Eileen V; Liew, Yew-Wah; Martin, Jacqueline R; McGrath, Kelli A; Powley, Tanya; Flower, Robert L; Hyland, Catherine A
2017-04-01
Blood group single nucleotide polymorphism genotyping probes for a limited range of polymorphisms. This study investigated whether massively parallel sequencing (also known as next-generation sequencing), with a targeted exome strategy, provides an extended blood group genotype and the extent to which massively parallel sequencing correctly genotypes in homologous gene systems, such as RH and MNS. Donor samples (n = 28) that were extensively phenotyped and genotyped using single nucleotide polymorphism typing, were analyzed using the TruSight One Sequencing Panel and MiSeq platform. Genes for 28 protein-based blood group systems, GATA1, and KLF1 were analyzed. Copy number variation analysis was used to characterize complex structural variants in the GYPC and RH systems. The average sequencing depth per target region was 66.2 ± 39.8. Each sample harbored on average 43 ± 9 variants, of which 10 ± 3 were used for genotyping. For the 28 samples, massively parallel sequencing variant sequences correctly matched expected sequences based on single nucleotide polymorphism genotyping data. Copy number variation analysis defined the Rh C/c alleles and complex RHD hybrids. Hybrid RHD*D-CE-D variants were correctly identified, but copy number variation analysis did not confidently distinguish between D and CE exon deletion versus rearrangement. The targeted exome sequencing strategy employed extended the range of blood group genotypes detected compared with single nucleotide polymorphism typing. This single-test format included detection of complex MNS hybrid cases and, with copy number variation analysis, defined RH hybrid genes along with the RHCE*C allele hitherto difficult to resolve by variant detection. The approach is economical compared with whole-genome sequencing and is suitable for a red blood cell reference laboratory setting. © 2017 AABB.
Whole-genome sequencing and genetic variant analysis of a Quarter Horse mare.
Doan, Ryan; Cohen, Noah D; Sawyer, Jason; Ghaffari, Noushin; Johnson, Charlie D; Dindot, Scott V
2012-02-17
The catalog of genetic variants in the horse genome originates from a few select animals, the majority originating from the Thoroughbred mare used for the equine genome sequencing project. The purpose of this study was to identify genetic variants, including single nucleotide polymorphisms (SNPs), insertion/deletion polymorphisms (INDELs), and copy number variants (CNVs) in the genome of an individual Quarter Horse mare sequenced by next-generation sequencing. Using massively parallel paired-end sequencing, we generated 59.6 Gb of DNA sequence from a Quarter Horse mare resulting in an average of 24.7X sequence coverage. Reads were mapped to approximately 97% of the reference Thoroughbred genome. Unmapped reads were de novo assembled resulting in 19.1 Mb of new genomic sequence in the horse. Using a stringent filtering method, we identified 3.1 million SNPs, 193 thousand INDELs, and 282 CNVs. Genetic variants were annotated to determine their impact on gene structure and function. Additionally, we genotyped this Quarter Horse for mutations of known diseases and for variants associated with particular traits. Functional clustering analysis of genetic variants revealed that most of the genetic variation in the horse's genome was enriched in sensory perception, signal transduction, and immunity and defense pathways. This is the first sequencing of a horse genome by next-generation sequencing and the first genomic sequence of an individual Quarter Horse mare. We have increased the catalog of genetic variants for use in equine genomics by the addition of novel SNPs, INDELs, and CNVs. The genetic variants described here will be a useful resource for future studies of genetic variation regulating performance traits and diseases in equids.
Chowanadisai, Winyoo; Kelleher, Shannon L; Nemeth, Jennifer F; Yachetti, Stephen; Kuhlman, Charles F; Jackson, Joan G; Davis, Anne M; Lien, Eric L; Lönnerdal, Bo
2005-05-01
Variability in the protein composition of breast milk has been observed in many women and is believed to be due to natural variation of the human population. Single nucleotide polymorphisms (SNPs) are present throughout the entire human genome, but the impact of this variation on human milk composition and biological activity and infant nutrition and health is unclear. The goals of this study were to characterize a variant of human alpha-lactalbumin observed in milk from a Filipino population by determining the location of the polymorphism in the amino acid and genomic sequences of alpha-lactalbumin. Milk and blood samples were collected from 20 Filipino women, and milk samples were collected from an additional 450 women from nine different countries. alpha-Lactalbumin concentration was measured by high-performance liquid chromatography (HPLC), and milk samples containing the variant form of the protein were identified with both HPLC and mass spectrometry (MS). The molecular weight of the variant form was measured by MS, and the location of the polymorphism was narrowed down by protein reduction, alkylation and trypsin digestion. Genomic DNA was isolated from whole blood, and the polymorphism location and subject genotype were determined by amplifying the entire coding sequence of human alpha-lactalbumin by PCR, followed by DNA sequencing. A variant form of alpha-lactalbumin was observed in HPLC chromatograms, and the difference in molecular weight was determined by MS (wild type=14,070 Da, variant=14,056 Da). Protein reduction and digestion narrowed the polymorphism between the 33rd and 77th amino acid of the protein. The genetic polymorphism was identified as adenine to guanine, which translates to a substitution from isoleucine to valine at amino acid 46. The frequency of variation was higher in milk from China, Japan and Philippines, which suggests that this polymorphism is most prevalent in Asia. There are SNPs in the genome for human milk proteins and their implications for protein bioactivity and infant nutrition need to be considered.
A statistical method for the detection of variants from next-generation resequencing of DNA pools.
Bansal, Vikas
2010-06-15
Next-generation sequencing technologies have enabled the sequencing of several human genomes in their entirety. However, the routine resequencing of complete genomes remains infeasible. The massive capacity of next-generation sequencers can be harnessed for sequencing specific genomic regions in hundreds to thousands of individuals. Sequencing-based association studies are currently limited by the low level of multiplexing offered by sequencing platforms. Pooled sequencing represents a cost-effective approach for studying rare variants in large populations. To utilize the power of DNA pooling, it is important to accurately identify sequence variants from pooled sequencing data. Detection of rare variants from pooled sequencing represents a different challenge than detection of variants from individual sequencing. We describe a novel statistical approach, CRISP [Comprehensive Read analysis for Identification of Single Nucleotide Polymorphisms (SNPs) from Pooled sequencing] that is able to identify both rare and common variants by using two approaches: (i) comparing the distribution of allele counts across multiple pools using contingency tables and (ii) evaluating the probability of observing multiple non-reference base calls due to sequencing errors alone. Information about the distribution of reads between the forward and reverse strands and the size of the pools is also incorporated within this framework to filter out false variants. Validation of CRISP on two separate pooled sequencing datasets generated using the Illumina Genome Analyzer demonstrates that it can detect 80-85% of SNPs identified using individual sequencing while achieving a low false discovery rate (3-5%). Comparison with previous methods for pooled SNP detection demonstrates the significantly lower false positive and false negative rates for CRISP. Implementation of this method is available at http://polymorphism.scripps.edu/~vbansal/software/CRISP/.
USDA-ARS?s Scientific Manuscript database
Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a mor...
Lorenzetti, Mario Alejandro; Gantuz, Magdalena; Altcheh, Jaime; De Matteo, Elena; Chabay, Paola Andrea; Preciado, María Victoria
2012-03-01
The ubiquitous Epstein-Barr virus (EBV) is related to the development of lymphoma and is also the etiological agent for infectious mononucleosis (IM). Sequence variations in the gene encoding LMP1 have been deeply studied in different pathologies and geographic regions. Controversial results propose the existence of tumor-related variants, while others argued in favor of a geographical distribution of these variants. Reports assessing EBV variants in IM were performed in adult patients who displayed multiple variant infections. In the present study, LMP1 variants in 15 pediatric patients with IM and 20 pediatric patients with EBV-associated lymphomas from Argentina were analyzed as representatives of benign and malignant infections in children, respectively. A 3-month follow-up study of LMP1 variants in peripheral blood cells and in oral secretions of patients with IM was performed. Moreover, an integrated linkage analysis was performed with variants of EBNA1 and the promoter region of BZLF1. Similar sequence polymorphisms were detected in both pathological conditions, IM and lymphoma, but these differ from those previously described in healthy donors from Argentina and Brazil. The results suggest that certain LMP1 polymorphisms, namely, the 30-bp deletion and high copy number of the 33-bp repeats, are associated with EBV-related pathologies, either benign or malignant, instead of just being tumor related. Additionally, this is the first study to describe the Alaskan variant in EBV-related lymphomas that previously was restricted to nasopharyngeal carcinomas from North America.
Gantuz, Magdalena; Altcheh, Jaime; De Matteo, Elena; Chabay, Paola Andrea; Preciado, María Victoria
2012-01-01
The ubiquitous Epstein-Barr virus (EBV) is related to the development of lymphoma and is also the etiological agent for infectious mononucleosis (IM). Sequence variations in the gene encoding LMP1 have been deeply studied in different pathologies and geographic regions. Controversial results propose the existence of tumor-related variants, while others argued in favor of a geographical distribution of these variants. Reports assessing EBV variants in IM were performed in adult patients who displayed multiple variant infections. In the present study, LMP1 variants in 15 pediatric patients with IM and 20 pediatric patients with EBV-associated lymphomas from Argentina were analyzed as representatives of benign and malignant infections in children, respectively. A 3-month follow-up study of LMP1 variants in peripheral blood cells and in oral secretions of patients with IM was performed. Moreover, an integrated linkage analysis was performed with variants of EBNA1 and the promoter region of BZLF1. Similar sequence polymorphisms were detected in both pathological conditions, IM and lymphoma, but these differ from those previously described in healthy donors from Argentina and Brazil. The results suggest that certain LMP1 polymorphisms, namely, the 30-bp deletion and high copy number of the 33-bp repeats, are associated with EBV-related pathologies, either benign or malignant, instead of just being tumor related. Additionally, this is the first study to describe the Alaskan variant in EBV-related lymphomas that previously was restricted to nasopharyngeal carcinomas from North America. PMID:22205789
Meurs, Kathryn M; Olsen, Lisbeth H; Reimann, Maria J; Keene, Bruce W; Atkins, Clarke E; Adin, Darcy; Aona, Brent; Condit, Julia; DeFrancesco, Teresa; Reina-Doreste, Yamir; Stern, Joshua A; Tou, Sandra; Ward, Jessica; Woodruff, Kathleen
2018-02-01
Myxomatous mitral valve disease (MMVD) is the most common heart disease in the dog. It is particularly common in the Cavalier King Charles Spaniel (CKCS) breed and affected dogs are frequently managed with angiotensin-converting enzyme inhibitors (ACE-I). We have previously identified a canine ACE gene polymorphism associated with a decrease in angiotensin-converting enzyme (ACE) activity. The aim of this study was to evaluate for the prevalence of the ACE polymorphism in CKCS with mitral valve disease and to determine whether the presence of the polymorphism is associated with alterations in ACE activity at different stages of cardiac disease. Seventy-three dogs with a diagnosis of mitral valve disease were evaluated and a blood sample was drawn for ACE polymorphism genotyping and ACE activity measurement. Forty-three dogs were homozygous for the ACE polymorphism; five were heterozygous and 25 were homozygous wild type. The mean age and the median severity of disease were not different for dogs with the polymorphism and dogs with the wild-type sequence. The median baseline ACE activity was significantly lower for the ACE polymorphism (27.0 U/l) than the wild-type sequence dogs (31.0 U/l) (P=0.02). Dogs with more severe disease and the ACE polymorphism had significantly lower levels of ACE activity than dogs with the wild-type sequence (P=0.03). The CKCS appears to have a high prevalence of the ACE variant. Dogs with the ACE variant had lower levels of ACE activity even in more advanced mitral valve disease than dogs without the variant. The clinical significance of this finding and its impact on the need for ACE-I in dogs with the polymorphism and heart disease deserves further study.
USDA-ARS?s Scientific Manuscript database
Salmonid genomes are considered to be in a pseudo-tetraploid state as a result of an evolutionarily recent genome duplication event. This situation complicates single nucleotide polymorphism (SNP) discovery in rainbow trout as many putative SNPs are actually paralogous sequence variants (PSVs) and ...
López-Urrutia, Eduardo; Valdés, Jesús; Bonilla-Moreno, Raúl; Martínez-Salazar, Martha; Martínez-Garcia, Martha; Berumen, Jaime; Villegas-Sepúlveda, Nicolás
2012-06-01
The HPV-16 E6/E7 genes, which contain intron 1, are processed by alternative splicing and its transcripts are detected with a heterogeneous profile in tumours cells. Frequently, the HPV-16 positive carcinoma cells bear viral variants that contain single nucleotide polymorphisms into its DNA sequence. We were interested in analysing the contribution of this polymorphism to the heterogeneity in the pattern of the E6/E7 spliced transcripts. Using the E6/E7 sequences from three closely related HPV-16 variants, we have shown that a few nucleotide changes are sufficient to produce heterogeneity in the splicing profile. Furthermore, using mutants that contained a single SNP, we also showed that one nucleotide change was sufficient to reproduce the heterogeneous splicing profile. Additionally, a difference of two or three SNPs among these viral sequences was sufficient to recruit differentially several splicing factors to the polymorphic E6/E7 transcripts. Moreover, only one SNP was sufficient to alter the binding site of at least one splicing factor, changing the ability of splicing factors to bind the transcript. Finally, the factors that were differentially bound to the short form of intron 1 of one of these E6/E7 variants were identified as TIA1 and/or TIAR and U1-70k, while U2AF65, U5-52k and PTB were preferentially bound to the transcript of the other variants. Copyright © 2012 Elsevier B.V. All rights reserved.
Hakenberg, Jörg; Cheng, Wei-Yi; Thomas, Philippe; Wang, Ying-Chih; Uzilov, Andrew V; Chen, Rong
2016-01-08
Data from a plethora of high-throughput sequencing studies is readily available to researchers, providing genetic variants detected in a variety of healthy and disease populations. While each individual cohort helps gain insights into polymorphic and disease-associated variants, a joint perspective can be more powerful in identifying polymorphisms, rare variants, disease-associations, genetic burden, somatic variants, and disease mechanisms. We have set up a Reference Variant Store (RVS) containing variants observed in a number of large-scale sequencing efforts, such as 1000 Genomes, ExAC, Scripps Wellderly, UK10K; various genotyping studies; and disease association databases. RVS holds extensive annotations pertaining to affected genes, functional impacts, disease associations, and population frequencies. RVS currently stores 400 million distinct variants observed in more than 80,000 human samples. RVS facilitates cross-study analysis to discover novel genetic risk factors, gene-disease associations, potential disease mechanisms, and actionable variants. Due to its large reference populations, RVS can also be employed for variant filtration and gene prioritization. A web interface to public datasets and annotations in RVS is available at https://rvs.u.hpc.mssm.edu/.
Chou, A; Burke, J
1999-05-01
DNA sequence clustering has become a valuable method in support of gene discovery and gene expression analysis. Our interest lies in leveraging the sequence diversity within clusters of expressed sequence tags (ESTs) to model gene structure for the study of gene variants that arise from, among other things, alternative mRNA splicing, polymorphism, and divergence after gene duplication, fusion, and translocation events. In previous work, CRAW was developed to discover gene variants from assembled clusters of ESTs. Most importantly, novel gene features (the differing units between gene variants, for example alternative exons, polymorphisms, transposable elements, etc.) that are specialized to tissue, disease, population, or developmental states can be identified when these tools collate DNA source information with gene variant discrimination. While the goal is complete automation of novel feature and gene variant detection, current methods are far from perfect and hence the development of effective tools for visualization and exploratory data analysis are of paramount importance in the process of sifting through candidate genes and validating targets. We present CRAWview, a Java based visualization extension to CRAW. Features that vary between gene forms are displayed using an automatically generated color coded index. The reporting format of CRAWview gives a brief, high level summary report to display overlap and divergence within clusters of sequences as well as the ability to 'drill down' and see detailed information concerning regions of interest. Additionally, the alignment viewing and editing capabilities of CRAWview make it possible to interactively correct frame-shifts and otherwise edit cluster assemblies. We have implemented CRAWview as a Java application across windows NT/95 and UNIX platforms. A beta version of CRAWview will be freely available to academic users from Pangea Systems (http://www.pangeasystems.com). Contact :
Jasim, Anfal A.; Al-Bustan, Suzanne A.; Al-Kandari, Wafa; Al-Serri, Ahmad; AlAskar, Huda
2018-01-01
Common variants of Apolipoprotein A5 (APOA5) have been associated with lipid levels yet very few studies have reported full sequence data from various ethnic groups. The purpose of this study was to analyse the full APOA5 gene sequence to identify variants in 100 healthy Kuwaitis of Arab ethnicities and assess their association with variation in lipid levels in a cohort of 733 samples. Sanger method was used in the direct sequencing of the full 3.7 Kb APOA5 and multiple sequence alignment was used to identify variants. The complete APOA5 sequence in Kuwaiti Arabs has been deposited in GenBank (KJ401315). A total of 20 reported single nucleotide polymorphisms (SNPs) were identified. Two novel SNPs were also identified: a synonymous 2197G>A polymorphism at genomic position 116661525 and a 3′ UTR 3222 C>T polymorphism at genomic position 116660500 based on human genome assembly GRCh37/hg:19. Five SNPs along with the two novel SNPs were selected for validation in the cohort. Association of those SNPs with lipid levels was tested and minor alleles of three SNPs (rs2072560, rs2266788, and rs662799) were found significantly associated with TG and VLDL levels. This is the first study to report the full APOA5 sequence and SNPs in an Arab ethnic group. Analysis of the variants identified and comparison to other populations suggests a distinctive genetic component in Arabs. The positive association observed for rs2072560 and rs2266788 with TG and VLDL levels confirms their role in lipid metabolism. PMID:29686695
Jasim, Anfal A; Al-Bustan, Suzanne A; Al-Kandari, Wafa; Al-Serri, Ahmad; AlAskar, Huda
2018-01-01
Common variants of Apolipoprotein A5 ( APOA 5) have been associated with lipid levels yet very few studies have reported full sequence data from various ethnic groups. The purpose of this study was to analyse the full APOA5 gene sequence to identify variants in 100 healthy Kuwaitis of Arab ethnicities and assess their association with variation in lipid levels in a cohort of 733 samples. Sanger method was used in the direct sequencing of the full 3.7 Kb APOA5 and multiple sequence alignment was used to identify variants. The complete APOA5 sequence in Kuwaiti Arabs has been deposited in GenBank (KJ401315). A total of 20 reported single nucleotide polymorphisms (SNPs) were identified. Two novel SNPs were also identified: a synonymous 2197G>A polymorphism at genomic position 116661525 and a 3' UTR 3222 C>T polymorphism at genomic position 116660500 based on human genome assembly GRCh37/hg:19. Five SNPs along with the two novel SNPs were selected for validation in the cohort. Association of those SNPs with lipid levels was tested and minor alleles of three SNPs (rs2072560, rs2266788, and rs662799) were found significantly associated with TG and VLDL levels. This is the first study to report the full APOA5 sequence and SNPs in an Arab ethnic group. Analysis of the variants identified and comparison to other populations suggests a distinctive genetic component in Arabs. The positive association observed for rs2072560 and rs2266788 with TG and VLDL levels confirms their role in lipid metabolism.
USDA-ARS?s Scientific Manuscript database
Copy number variants (CNV) are large scale duplications or deletions of genomic sequence that are caused by a diverse set of molecular phenomena that are distinct from single nucleotide polymorphism (SNP) formation. Due to their different mechanisms of formation, CNVs are often difficult to track us...
Kuhn, Alexandre; Ong, Yao Min; Quake, Stephen R; Burkholder, William F
2015-07-08
Like other structural variants, transposable element insertions can be highly polymorphic across individuals. Their functional impact, however, remains poorly understood. Current genome-wide approaches for genotyping insertion-site polymorphisms based on targeted or whole-genome sequencing remain very expensive and can lack accuracy, hence new large-scale genotyping methods are needed. We describe a high-throughput method for genotyping transposable element insertions and other types of structural variants that can be assayed by breakpoint PCR. The method relies on next-generation sequencing of multiplex, site-specific PCR amplification products and read count-based genotype calls. We show that this method is flexible, efficient (it does not require rounds of optimization), cost-effective and highly accurate. This method can benefit a wide range of applications from the routine genotyping of animal and plant populations to the functional study of structural variants in humans.
Korchagin, V I; Badaeva, T N; Tokarskaya, O N; Martirosyan, I A; Darevsky, I S; Ryskov, A P
2007-05-01
Populations of parthenogenetic lizards of the genus Darevskia consist of genetically identical animals, and represent a unique model for studying the molecular mechanisms underlying the variability and evolution of hypervariable DNA repeats. As unisexual lineages, parthenogenetic lizards are characterized by some level of genetic diversity at microsatellite loci. We cloned and sequenced a number of (GATA)n microsatellite loci of Darevskia unisexualis. PCR products from these loci were also sequenced and the degree of intraspecific polymorphism was assessed. Among the five (GATA)n loci analysed, two (Du215 and Du281) were polymorphic. Cross-species analysis of Du215 and Du281 indicate that the priming sites at the D. unisexualis loci are conserved in the bisexual parental species, D. raddei and D. valentini. Sequencing the PCR products amplified from Du215 and Du281 and from monomorphic Du323 showed that allelic differences at the polymorphic loci are caused by microsatellite mutations and by point mutations in the flanking regions. The haplotypes identified among the allelic variants of Du281 and among its orthologues in the parental species provide new evidence of the cross-species origin of D. unisexualis. To our knowledge, these data are the first to characterize the nucleotide sequences of allelic variants at microsatellite loci within parthenogenetic vertebrate animals.
Kim, Dae-Wi; Thawng, Cung Nawl; Choi, Jung-Hye; Lee, Kihyun; Cha, Chang-Jun
2018-01-01
The environmental resistome has been recognized as the origin and reservoir of antibiotic resistance genes and considered to be dynamic and ever expanding. In this study, a targeted gene sequencing approach revealed that the polymorphic diversity of the aminoglycoside-inactivating enzyme AAC(6')-Ib was ecological niche-specific. AAC(6')-Ib-cr, previously known as a clinical variant, was prevalent in various soils and the intestines of chickens and humans, suggesting that this variant might not have arisen from adaptive mutations in the clinic but instead originated from the environment. Furthermore, ecologically dominant polymorphic variants of AAC(6')-Ib were characterized and found to display different substrate specificities for quinolones and aminoglycosides, conferring the altered resistance spectra. Interestingly, a novel variant with the D179Y substitution showed an extended resistance spectrum to the recently developed fluoroquinolone gemifloxacin. Our results suggest that soil and animal microbiomes could be major reservoirs of antibiotic resistance; polymorphic diversity expands the antibiotic resistome in the environment, resulting in the potential emergence of novel resistance.
Deep whole-genome sequencing of 90 Han Chinese genomes.
Lan, Tianming; Lin, Haoxiang; Zhu, Wenjuan; Laurent, Tellier Christian Asker Melchior; Yang, Mengcheng; Liu, Xin; Wang, Jun; Wang, Jian; Yang, Huanming; Xu, Xun; Guo, Xiaosen
2017-09-01
Next-generation sequencing provides a high-resolution insight into human genetic information. However, the focus of previous studies has primarily been on low-coverage data due to the high cost of sequencing. Although the 1000 Genomes Project and the Haplotype Reference Consortium have both provided powerful reference panels for imputation, low-frequency and novel variants remain difficult to discover and call with accuracy on the basis of low-coverage data. Deep sequencing provides an optimal solution for the problem of these low-frequency and novel variants. Although whole-exome sequencing is also a viable choice for exome regions, it cannot account for noncoding regions, sometimes resulting in the absence of important, causal variants. For Han Chinese populations, the majority of variants have been discovered based upon low-coverage data from the 1000 Genomes Project. However, high-coverage, whole-genome sequencing data are limited for any population, and a large amount of low-frequency, population-specific variants remain uncharacterized. We have performed whole-genome sequencing at a high depth (∼×80) of 90 unrelated individuals of Chinese ancestry, collected from the 1000 Genomes Project samples, including 45 Northern Han Chinese and 45 Southern Han Chinese samples. Eighty-three of these 90 have been sequenced by the 1000 Genomes Project. We have identified 12 568 804 single nucleotide polymorphisms, 2 074 210 short InDels, and 26 142 structural variations from these 90 samples. Compared to the Han Chinese data from the 1000 Genomes Project, we have found 7 000 629 novel variants with low frequency (defined as minor allele frequency < 5%), including 5 813 503 single nucleotide polymorphisms, 1 169 199 InDels, and 17 927 structural variants. Using deep sequencing data, we have built a greatly expanded spectrum of genetic variation for the Han Chinese genome. Compared to the 1000 Genomes Project, these Han Chinese deep sequencing data enhance the characterization of a large number of low-frequency, novel variants. This will be a valuable resource for promoting Chinese genetics research and medical development. Additionally, it will provide a valuable supplement to the 1000 Genomes Project, as well as to other human genome projects. © The Authors 2017. Published by Oxford University Press.
Gymoese, Pernille; Sørensen, Gitte; Litrup, Eva; Olsen, John Elmerdal; Nielsen, Eva Møller
2017-01-01
Whole-genome sequencing is rapidly replacing current molecular typing methods for surveillance purposes. Our study evaluates core-genome single-nucleotide polymorphism analysis for outbreak detection and linking of sources of Salmonella enterica serovar Typhimurium and its monophasic variants during a 7-month surveillance period in Denmark. We reanalyzed and defined 8 previously characterized outbreaks from the phylogenetic relatedness of the isolates, epidemiologic data, and food traceback investigations. All outbreaks were identified, and we were able to exclude unrelated and include additional related human cases. We were furthermore able to link possible food and veterinary sources to the outbreaks. Isolates clustered according to sequence types (STs) 19, 34, and 36. Our study shows that core-genome single-nucleotide polymorphism analysis is suitable for surveillance and outbreak investigation for Salmonella Typhimurium (ST19 and ST36), but whole genome–wide analysis may be required for the tight genetic clone of monophasic variants (ST34). PMID:28930002
Gao, Guangtu; Nome, Torfinn; Pearse, Devon E; Moen, Thomas; Naish, Kerry A; Thorgaard, Gary H; Lien, Sigbjørn; Palti, Yniv
2018-01-01
Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout ( Oncorhynchus mykiss ), SNP discovery has been previously done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL) and RNA sequencing. Recently we have performed high coverage whole genome resequencing with 61 unrelated samples, representing a wide range of rainbow trout and steelhead populations, with 49 new samples added to 12 aquaculture samples from AquaGen (Norway) that we previously used for SNP discovery. Of the 49 new samples, 11 were double-haploid lines from Washington State University (WSU) and 38 represented wild and hatchery populations from a wide range of geographic distribution and with divergent migratory phenotypes. We then mapped the sequences to the new rainbow trout reference genome assembly (GCA_002163495.1) which is based on the Swanson YY doubled haploid line. Variant calling was conducted with FreeBayes and SAMtools mpileup , followed by filtering of SNPs based on quality score, sequence complexity, read depth on the locus, and number of genotyped samples. Results from the two variant calling programs were compared and genotypes of the double haploid samples were used for detecting and filtering putative paralogous sequence variants (PSVs) and multi-sequence variants (MSVs). Overall, 30,302,087 SNPs were identified on the rainbow trout genome 29 chromosomes and 1,139,018 on unplaced scaffolds, with 4,042,723 SNPs having high minor allele frequency (MAF > 0.25). The average SNP density on the chromosomes was one SNP per 64 bp, or 15.6 SNPs per 1 kb. Results from the phylogenetic analysis that we conducted indicate that the SNP markers contain enough population-specific polymorphisms for recovering population relationships despite the small sample size used. Intra-Population polymorphism assessment revealed high level of polymorphism and heterozygosity within each population. We also provide functional annotation based on the genome position of each SNP and evaluate the use of clonal lines for filtering of PSVs and MSVs. These SNPs form a new database, which provides an important resource for a new high density SNP array design and for other SNP genotyping platforms used for genetic and genomics studies of this iconic salmonid fish species.
Kumar, Akash; Dougherty, Max; Findlay, Gregory M; Geisheker, Madeleine; Klein, Jason; Lazar, John; Machkovech, Heather; Resnick, Jesse; Resnick, Rebecca; Salter, Alexander I; Talebi-Liasi, Faezeh; Arakawa, Christopher; Baudin, Jacob; Bogaard, Andrew; Salesky, Rebecca; Zhou, Qian; Smith, Kelly; Clark, John I; Shendure, Jay; Horwitz, Marshall S
2014-01-01
Even in cases where there is no obvious family history of disease, genome sequencing may contribute to clinical diagnosis and management. Clinical application of the genome has not yet become routine, however, in part because physicians are still learning how best to utilize such information. As an educational research exercise performed in conjunction with our medical school human anatomy course, we explored the potential utility of determining the whole genome sequence of a patient who had died following a clinical diagnosis of idiopathic pulmonary fibrosis (IPF). Medical students performed dissection and whole genome sequencing of the cadaver. Gross and microscopic findings were more consistent with the fibrosing variant of nonspecific interstitial pneumonia (NSIP), as opposed to IPF per se. Variants in genes causing Mendelian disorders predisposing to IPF were not detected. However, whole genome sequencing identified several common variants associated with IPF, including a single nucleotide polymorphism (SNP), rs35705950, located in the promoter region of the gene encoding mucin glycoprotein MUC5B. The MUC5B promoter polymorphism was recently found to markedly elevate risk for IPF, though a particular association with NSIP has not been previously reported, nor has its contribution to disease risk previously been evaluated in the genome-wide context of all genetic variants. We did not identify additional predicted functional variants in a region of linkage disequilibrium (LD) adjacent to MUC5B, nor did we discover other likely risk-contributing variants elsewhere in the genome. Whole genome sequencing thus corroborates the association of rs35705950 with MUC5B dysregulation and interstitial lung disease. This novel exercise additionally served a unique mission in bridging clinical and basic science education.
Hybridization capture reveals evolution and conservation across the entire Koala retrovirus genome.
Tsangaras, Kyriakos; Siracusa, Matthew C; Nikolaidis, Nikolas; Ishida, Yasuko; Cui, Pin; Vielgrader, Hanna; Helgen, Kristofer M; Roca, Alfred L; Greenwood, Alex D
2014-01-01
The koala retrovirus (KoRV) is the only retrovirus known to be in the midst of invading the germ line of its host species. Hybridization capture and next generation sequencing were used on modern and museum DNA samples of koala (Phascolarctos cinereus) to examine ca. 130 years of evolution across the full KoRV genome. Overall, the entire proviral genome appeared to be conserved across time in sequence, protein structure and transcriptional binding sites. A total of 138 polymorphisms were detected, of which 72 were found in more than one individual. At every polymorphic site in the museum koalas, one of the character states matched that of modern KoRV. Among non-synonymous polymorphisms, radical substitutions involving large physiochemical differences between amino acids were elevated in env, potentially reflecting anti-viral immune pressure or avoidance of receptor interference. Polymorphisms were not detected within two functional regions believed to affect infectivity. Host sequences flanking proviral integration sites were also captured; with few proviral loci shared among koalas. Recently described variants of KoRV, designated KoRV-B and KoRV-J, were not detected in museum samples, suggesting that these variants may be of recent origin.
Hybridization Capture Reveals Evolution and Conservation across the Entire Koala Retrovirus Genome
Ishida, Yasuko; Cui, Pin; Vielgrader, Hanna; Helgen, Kristofer M.; Roca, Alfred L.; Greenwood, Alex D.
2014-01-01
The koala retrovirus (KoRV) is the only retrovirus known to be in the midst of invading the germ line of its host species. Hybridization capture and next generation sequencing were used on modern and museum DNA samples of koala (Phascolarctos cinereus) to examine ca. 130 years of evolution across the full KoRV genome. Overall, the entire proviral genome appeared to be conserved across time in sequence, protein structure and transcriptional binding sites. A total of 138 polymorphisms were detected, of which 72 were found in more than one individual. At every polymorphic site in the museum koalas, one of the character states matched that of modern KoRV. Among non-synonymous polymorphisms, radical substitutions involving large physiochemical differences between amino acids were elevated in env, potentially reflecting anti-viral immune pressure or avoidance of receptor interference. Polymorphisms were not detected within two functional regions believed to affect infectivity. Host sequences flanking proviral integration sites were also captured; with few proviral loci shared among koalas. Recently described variants of KoRV, designated KoRV-B and KoRV-J, were not detected in museum samples, suggesting that these variants may be of recent origin. PMID:24752422
Polymorphic human somatostatin gene is located on chromosome 3.
Naylor, S L; Sakaguchi, A Y; Shen, L P; Bell, G I; Rutter, W J; Shows, T B
1983-01-01
Somatostatin is a 14-amino-acid neuropeptide and hormone that inhibits the secretion of several peptide hormones. The human gene for somatostatin SST has been cloned, and the sequence has been determined. This clone was used as a probe in chromosome mapping studies to detect the human somatostatin sequence in human-rodent hybrids. Southern blot analysis of 41 hybrids, including some containing translocations of human chromosomes, placed SST in the q21 leads to qter region of chromosome 3. Human DNAs from unrelated individuals were screened for restriction fragment polymorphisms detectable by the somatostatin gene probe. Two polymorphisms were found: (i) an EcoRI variant located at the 3' end of the gene, found in Caucasian, U.S. Black, and Asian populations with a frequency of approximately 0.10 and (ii) a BamHI variant in the intron, which occurs in Caucasians at a frequency of 0.13. Images PMID:6133281
Blanco-Marchite, Cristina; Sánchez-Sánchez, Francisco; López-Garrido, María-Pilar; Iñigez-de-Onzoño, Mercedes; López-Martínez, Francisco; López-Sánchez, Enrique; Alvarez, Lydia; Rodríguez-Calvo, Pedro-Pablo; Méndez-Hernández, Carmen; Fernández-Vega, Luis; García-Sánchez, Julián; Coca-Prados, Miguel; García-Feijoo, Julián
2011-01-01
Purpose. To investigate the role of WDR36 and P53 sequence variations in POAG susceptibility. Methods. The authors performed a case-control genetic association study in 268 unrelated Spanish patients (POAG1) and 380 control subjects matched for sex, age, and ethnicity. WDR36 sequence variations were screened by either direct DNA sequencing or denaturing high-performance liquid chromatography. P53 polymorphisms p.R72P and c.97–147ins16bp were analyzed by single-nucleotide polymorphism (SNP) genotyping and PCR, respectively. Positive SNP and haplotype associations were reanalyzed in a second sample of 211 patients and in combined cases (n = 479). Results. The authors identified almost 50 WDR36 sequence variations, of which approximately two-thirds were rare and one-third were polymorphisms. Approximately half the variants were novel. Eight patients (2.9%) carried rare mutations that were not identified in the control group (P = 0.001). Six Tag SNPs were expected to be structured in three common haplotypes. Haplotype H2 was consistently associated with the disease (P = 0.0024 in combined cases). According to a dominant model, genotypes containing allele P of the P53 p.R72P SNP slightly increased glaucoma risk. Glaucoma susceptibility associated with different WDR36 genotypes also increased significantly in combination with the P53 RP risk genotype, indicating the existence of a genetic interaction. For instance, the OR of the H2 diplotype estimated for POAG1 and combined cases rose approximately 1.6 times in the two-locus genotype H2/RP. Conclusions. Rare WDR36 variants and the P53 p.R72P polymorphism behaved as moderate glaucoma risk factors in Spanish patients. The authors provide evidence for a genetic interaction between WDR36 and P53 variants in POAG susceptibility, although this finding must be confirmed in other populations. PMID:21931130
Implication of common and disease specific variants in CLU, CR1, and PICALM.
Ferrari, Raffaele; Moreno, Jorge H; Minhajuddin, Abu T; O'Bryant, Sid E; Reisch, Joan S; Barber, Robert C; Momeni, Parastoo
2012-08-01
Two recent genome-wide association studies (GWAS) for late onset Alzheimer's disease (LOAD) revealed 3 new genes: clusterin (CLU), phosphatidylinositol binding clathrin assembly protein (PICALM), and complement receptor 1 (CR1). In order to evaluate association with these genome-wide association study-identified genes and to isolate the variants contributing to the pathogenesis of LOAD, we genotyped the top single nucleotide polymorphisms (SNPs), rs11136000 (CLU), rs3818361 (CR1), and rs3851179 (PICALM), and sequenced the entire coding regions of these genes in our cohort of 342 LOAD patients and 277 control subjects. We confirmed the association of rs3851179 (PICALM) (p = 7.4 × 10(-3)) with the disease status. Through sequencing we identified 18 variants in CLU, 3 of which were found exclusively in patients; 8 variants (out of 65) in CR1 gene were only found in patients and the 16 variants identified in PICALM gene were present in both patients and controls. In silico analysis of the variants in PICALM did not predict any damaging effect on the protein. The haplotype analysis of the variants in each gene predicted a common haplotype when the 3 single nucleotide polymorphisms rs11136000 (CLU), rs3818361 (CR1), and rs3851179 (PICALM), respectively, were included. For each gene the haplotype structure and size differed between patients and controls. In conclusion, we confirmed association of CLU, CR1, and PICALM genes with the disease status in our cohort through identification of a number of disease-specific variants among patients through the sequencing of the coding region of these genes. Published by Elsevier Inc.
A High Proportion of Chromosome 21 Promoter Polymorphisms Influence Transcriptional Activity
Buckland, Paul R.; Coleman, Sharol L.; Hoogendoorn, Bastiaan; Guy, Carol; Smith, S. Kaye; O’Donovan, Michael C.
2004-01-01
We have sought to obtain an unbiased estimate of the proportion of polymorphisms in promoters of human genes that have functional effects. We carried out polymorphism discovery on a randomly selected group of 51 gene promoters mapping to human chromosome 21 and successfully analyzed the effect on transcription of 38 of the sequence variants. To achieve this, a total of 53 different haplotypes from 20 promoters were cloned into a modified pGL3 luciferase reporter gene vector and were tested for their abilities to promote transcription in HEK293t and JEG-3 cells. Up to seven (18%) of the 38 tested variants altered transcription by 1.5-fold, confirming that a surprisingly high proportion of promoter region polymorphisms are likely to be functionally important. The functional variants were distributed across the promoters of CRYAA, IFNAR1, KCNJ15, NCAM2, IGSF5, and B3GALT5. Three of the genes (NCAM2, IFNAR1, and CRYAA) have been previously associated with human phenotypes and the polymorphisms we describe here may therefore play a role in those phenotypes. PMID:15200235
Abraham, Paul E; Wang, Xiaojing; Ranjan, Priya; Nookaew, Intawat; Zhang, Bing; Tuskan, Gerald A; Hettich, Robert L
2015-12-04
Next-generation sequencing has transformed the ability to link genotypes to phenotypes and facilitates the dissection of genetic contribution to complex traits. However, it is challenging to link genetic variants with the perturbed functional effects on proteins encoded by such genes. Here we show how RNA sequencing can be exploited to construct genotype-specific protein sequence databases to assess natural variation in proteins, providing information about the molecular toolbox driving cellular processes. For this study, we used two natural genotypes selected from a recent genome-wide association study of Populus trichocarpa, an obligate outcrosser with tremendous phenotypic variation across the natural population. This strategy allowed us to comprehensively catalogue proteins containing single amino acid polymorphisms (SAAPs), as well as insertions and deletions. We profiled the frequency of 128 types of naturally occurring amino acid substitutions, including both expected (neutral) and unexpected (non-neutral) SAAPs, with a subset occurring in regions of the genome having strong polymorphism patterns consistent with recent positive and/or divergent selection. By zeroing in on the molecular signatures of these important regions that might have previously been uncharacterized, we now provide a high-resolution molecular inventory that should improve accessibility and subsequent identification of natural protein variants in future genotype-to-phenotype studies.
Metzger, Julia; Tonda, Raul; Beltran, Sergi; Agueda, Lídia; Gut, Marta; Distl, Ottmar
2014-07-04
Domestication has shaped the horse and lead to a group of many different types. Some have been under strong human selection while others developed in close relationship with nature. The aim of our study was to perform next generation sequencing of breed and non-breed horses to provide an insight into genetic influences on selective forces. Whole genome sequencing of five horses of four different populations revealed 10,193,421 single nucleotide polymorphisms (SNPs) and 1,361,948 insertion/deletion polymorphisms (indels). In comparison to horse variant databases and previous reports, we were able to identify 3,394,883 novel SNPs and 868,525 novel indels. We analyzed the distribution of individual variants and found significant enrichment of private mutations in coding regions of genes involved in primary metabolic processes, anatomical structures, morphogenesis and cellular components in non-breed horses and in contrast to that private mutations in genes affecting cell communication, lipid metabolic process, neurological system process, muscle contraction, ion transport, developmental processes of the nervous system and ectoderm in breed horses. Our next generation sequencing data constitute an important first step for the characterization of non-breed in comparison to breed horses and provide a large number of novel variants for future analyses. Functional annotations suggest specific variants that could play a role for the characterization of breed or non-breed horses.
Sadsad, Rosemarie; Martinez, Elena; Jelfs, Peter; Hill-Cawthorne, Grant A.; Gilbert, Gwendolyn L.; Marais, Ben J.; Sintchenko, Vitali
2016-01-01
Background Improved tuberculosis control and the need to contain the spread of drug-resistant strains provide a strong rationale for exploring tuberculosis transmission dynamics at the population level. Whole-genome sequencing provides optimal strain resolution, facilitating detailed mapping of potential transmission pathways. Methods We sequenced 22 isolates from a Mycobacterium tuberculosis cluster in New South Wales, Australia, identified during routine 24-locus mycobacterial interspersed repetitive unit typing. Following high-depth paired-end sequencing using the Illumina HiSeq 2000 platform, two independent pipelines were employed for analysis, both employing read mapping onto reference genomes as well as de novo assembly, to control biases in variant detection. In addition to single-nucleotide polymorphisms, the analyses also sought to identify insertions, deletions and structural variants. Results Isolates were highly similar, with a distance of 13 variants between the most distant members of the cluster. The most sensitive analysis classified the 22 isolates into 18 groups. Four of the isolates did not appear to share a recent common ancestor with the largest clade; another four isolates had an uncertain ancestral relationship with the largest clade. Conclusion Whole genome sequencing, with analysis of single-nucleotide polymorphisms, insertions, deletions, structural variants and subpopulations, enabled the highest possible level of discrimination between cluster members, clarifying likely transmission pathways and exposing the complexity of strain origin. The analysis provides a basis for targeted public health intervention and enhanced classification of future isolates linked to the cluster. PMID:26938641
Duplication polymorphisms in exon 4 of κ-casein gene in yak breeds/populations.
Pingcuo, S; Gao, J; Jiang, Z R; Jin, S Y; Fu, C Y; Liu, X; Huang, L; Zheng, Y C
2015-08-28
The objective of this study was to compare 12 bp-duplication polymorphisms in exon 4 of the κ-casein gene among 3 breeds/populations of yak (Bos grunniens). Genomic DNA was extracted from yak blood or muscle samples (N = 211) and a partial sequence of exon 4 of κ-casein gene was amplified by polymerase chain reaction. A polyacrylamide gel electrophoresis assay of the products (169 bp) revealed 2 variants. These variants differed in a 12-bp duplication of the nucleotide sequence corresponding to amino acids 147-150 (Glu-Ala-Ser-Pro) or 148-151 (Ala-Ser-Pro-Glu). The genotype frequency and gene frequency of the 2 κ-casein variants differed among the 3 yak breeds/populations. The long form of the κ-casein gene was the predominant allele, and the Jiulong yak showed the highest frequency of the short form variant of the κ-casein gene. In addition, 2 nucleotide differences resulting in amino acid substitutions were also identified in yaks. These results are significant for designing a breeding strategy to improve the genetic makeup of yak herds.
Cingolani, Pablo; Patel, Viral M.; Coon, Melissa; Nguyen, Tung; Land, Susan J.; Ruden, Douglas M.; Lu, Xiangyi
2012-01-01
This paper describes a new program SnpSift for filtering differential DNA sequence variants between two or more experimental genomes after genotoxic chemical exposure. Here, we illustrate how SnpSift can be used to identify candidate phenotype-relevant variants including single nucleotide polymorphisms, multiple nucleotide polymorphisms, insertions, and deletions (InDels) in mutant strains isolated from genome-wide chemical mutagenesis of Drosophila melanogaster. First, the genomes of two independently isolated mutant fly strains that are allelic for a novel recessive male-sterile locus generated by genotoxic chemical exposure were sequenced using the Illumina next-generation DNA sequencer to obtain 20- to 29-fold coverage of the euchromatic sequences. The sequencing reads were processed and variants were called using standard bioinformatic tools. Next, SnpEff was used to annotate all sequence variants and their potential mutational effects on associated genes. Then, SnpSift was used to filter and select differential variants that potentially disrupt a common gene in the two allelic mutant strains. The potential causative DNA lesions were partially validated by capillary sequencing of polymerase chain reaction-amplified DNA in the genetic interval as defined by meiotic mapping and deletions that remove defined regions of the chromosome. Of the five candidate genes located in the genetic interval, the Pka-like gene CG12069 was found to carry a separate pre-mature stop codon mutation in each of the two allelic mutants whereas the other four candidate genes within the interval have wild-type sequences. The Pka-like gene is therefore a strong candidate gene for the male-sterile locus. These results demonstrate that combining SnpEff and SnpSift can expedite the identification of candidate phenotype-causative mutations in chemically mutagenized Drosophila strains. This technique can also be used to characterize the variety of mutations generated by genotoxic chemicals. PMID:22435069
Functional genetic variants in the vesicular monoamine transporter 1 modulate emotion processing.
Lohoff, F W; Hodge, R; Narasimhan, S; Nall, A; Ferraro, T N; Mickey, B J; Heitzeg, M M; Langenecker, S A; Zubieta, J-K; Bogdan, R; Nikolova, Y S; Drabant, E; Hariri, A R; Bevilacqua, L; Goldman, D; Doyle, G A
2014-01-01
Emotional behavior is in part heritable and often disrupted in psychopathology. Identification of specific genetic variants that drive this heritability may provide important new insight into molecular and neurobiological mechanisms involved in emotionality. Our results demonstrate that the presynaptic vesicular monoamine transporter 1 (VMAT1) Thr136Ile (rs1390938) polymorphism is functional in vitro, with the Ile allele leading to increased monoamine transport into presynaptic vesicles. Moreover, we show that the Thr136Ile variant predicts differential responses in emotional brain circuits consistent with its effects in vitro. Lastly, deep sequencing of bipolar disorder (BPD) patients and controls identified several rare novel VMAT1 variants. The variant Phe84Ser was only present in individuals with BPD and leads to marked increase monoamine transport in vitro. Taken together, our data show that VMAT1 polymorphisms influence monoamine signaling, the functional response of emotional brain circuits and risk for psychopathology.
Screening of SHOX gene sequence variants in Saudi Arabian children with idiopathic short stature.
Alharthi, Abdulla A; El-Hallous, Ehab I; Talaat, Iman M; Alghamdi, Hamed A; Almalki, Matar I; Gaber, Ahmed
2017-10-01
Short stature affects approximately 2%-3% of children, representing one of the most frequent disorders for which clinical attention is sought during childhood. Despite assumed genetic heterogeneity, mutations or deletions in the short stature homeobox-containing gene ( SHOX ) are frequently detected in subjects with short stature. Idiopathic short stature (ISS) refers to patients with short stature for various unknown reasons. The goal of this study was to screen all the exons of SHOX to identify related mutations. We screened all the exons of SHOX for mutations analysis in 105 ISS children patients (57 girls and 48 boys) living in Taif governorate, KSA using a direct DNA sequencing method. Height, arm span, and sitting height were recorded, and subischial leg length was calculated. A total of 30 of 105 ISS patients (28%) contained six polymorphic variants in exons 1, 2, 4, and 6. One mutation was found in the DNA domain binding region of exon 4. Three of these polymorphic variants were novel, while the others were reported previously. There were no significant differences in anthropometric measures in ISS patients with and without identifiable polymorphic variants in SHOX . In Saudi Arabia ISS patients, rather than SHOX , it is possible that new genes are involved in longitudinal growth. Additional molecular analysis is required to diagnose and understand the etiology of this disease.
Carr, Ian M; Morgan, Joanne; Watson, Christopher; Melnik, Svitlana; Diggle, Christine P; Logan, Clare V; Harrison, Sally M; Taylor, Graham R; Pena, Sergio D J; Markham, Alexander F; Alkuraya, Fowzan S; Black, Graeme C M; Ali, Manir; Bonthron, David T
2013-07-01
Massively parallel ("next generation") DNA sequencing (NGS) has quickly become the method of choice for seeking pathogenic mutations in rare uncharacterized monogenic diseases. Typically, before DNA sequencing, protein-coding regions are enriched from patient genomic DNA, representing either the entire genome ("exome sequencing") or selected mapped candidate loci. Sequence variants, identified as differences between the patient's and the human genome reference sequences, are then filtered according to various quality parameters. Changes are screened against datasets of known polymorphisms, such as dbSNP and the 1000 Genomes Project, in the effort to narrow the list of candidate causative variants. An increasing number of commercial services now offer to both generate and align NGS data to a reference genome. This potentially allows small groups with limited computing infrastructure and informatics skills to utilize this technology. However, the capability to effectively filter and assess sequence variants is still an important bottleneck in the identification of deleterious sequence variants in both research and diagnostic settings. We have developed an approach to this problem comprising a user-friendly suite of programs that can interactively analyze, filter and screen data from enrichment-capture NGS data. These programs ("Agile Suite") are particularly suitable for small-scale gene discovery or for diagnostic analysis. © 2013 WILEY PERIODICALS, INC.
Transposon Variants and Their Effects on Gene Expression in Arabidopsis
Wang, Xi; Weigel, Detlef; Smith, Lisa M.
2013-01-01
Transposable elements (TEs) make up the majority of many plant genomes. Their transcription and transposition is controlled through siRNAs and epigenetic marks including DNA methylation. To dissect the interplay of siRNA–mediated regulation and TE evolution, and to examine how TE differences affect nearby gene expression, we investigated genome-wide differences in TEs, siRNAs, and gene expression among three Arabidopsis thaliana accessions. Both TE sequence polymorphisms and presence of linked TEs are positively correlated with intraspecific variation in gene expression. The expression of genes within 2 kb of conserved TEs is more stable than that of genes next to variant TEs harboring sequence polymorphisms. Polymorphism levels of TEs and closely linked adjacent genes are positively correlated as well. We also investigated the distribution of 24-nt-long siRNAs, which mediate TE repression. TEs targeted by uniquely mapping siRNAs are on average farther from coding genes, apparently because they more strongly suppress expression of adjacent genes. Furthermore, siRNAs, and especially uniquely mapping siRNAs, are enriched in TE regions missing in other accessions. Thus, targeting by uniquely mapping siRNAs appears to promote sequence deletions in TEs. Overall, our work indicates that siRNA–targeting of TEs may influence removal of sequences from the genome and hence evolution of gene expression in plants. PMID:23408902
Frequency of genetic polymorphisms of PXR gene in the Brazilian population.
Moreira, Ricardo P P; Jorge, Alexander A L; Mendonca, Berenice B; Bachega, Tânia A S S
2011-01-01
PXR polymorphisms have been implicated in modulating CYP3A4 and PXR expression, potentially accounting for interindividual differences in drug metabolism. The prevalence of PXR polymorphisms varies among ethnic groups and data on the allelic distribution in the highly mixed Brazilian population is lacking. The aim of this study was to analyze genetic variations in the PXR gene in Brazilians and to compare the results to other ethnic groups. DNA samples from 117 healthy Brazilians underwent PCR amplification and sequencing. Eleven polymorphisms were identified, 3 of which are highly associated with differences in CYP3A4 expression. We also identified 1 new synonymous variant in 1.3% of the alleles. Among the functional polymorphisms, -25913 C>T and -6994T>C occurred at a higher frequency comparedtothe Africanalleles (p < 0.05) but at a lower frequency compared to Caucasian alleles. The 8055 C>T allele was found at a similar frequency to those described in Caucasians and Africans (p > 0.05). We observed that functional variants of the PXR were frequent in our sample of the Brazilian population. Our results suggest that PXR gene variants may be of interest in pharmacogenetic studies involving Brazilians.
Allelic polymorphism in the T cell receptor and its impact on immune responses.
Gras, Stephanie; Chen, Zhenjun; Miles, John J; Liu, Yu Chih; Bell, Melissa J; Sullivan, Lucy C; Kjer-Nielsen, Lars; Brennan, Rebekah M; Burrows, Jacqueline M; Neller, Michelle A; Khanna, Rajiv; Purcell, Anthony W; Brooks, Andrew G; McCluskey, James; Rossjohn, Jamie; Burrows, Scott R
2010-07-05
In comparison to human leukocyte antigen (HLA) polymorphism, the impact of allelic sequence variation within T cell receptor (TCR) loci is much less understood. Particular TCR loci have been associated with autoimmunity, but the molecular basis for this phenomenon is undefined. We examined the T cell response to an HLA-B*3501-restricted epitope (HPVGEADYFEY) from Epstein-Barr virus (EBV), which is frequently dominated by a TRBV9*01(+) public TCR (TK3). However, the common allelic variant TRBV9*02, which differs by a single amino acid near the CDR2beta loop (Gln55-->His55), was never used in this response. The structure of the TK3 TCR, its allelic variant, and a nonnaturally occurring mutant (Gln55-->Ala55) in complex with HLA-B*3501(HPVGEADYFEY) revealed that the Gln55-->His55 polymorphism affected the charge complementarity at the TCR-peptide-MHC interface, resulting in reduced functional recognition of the cognate and naturally occurring variants of this EBV peptide. Thus, polymorphism in the TCR loci may contribute toward variability in immune responses and the outcome of infection.
Cheng, Hsin-Lin; Liu, Yu-Fan; Su, Chun-Wen; Su, Shih-Chi; Chen, Mu-Kuan; Yang, Shun-Fa; Lin, Chiao-Wen
2016-10-25
In Taiwan, oral cancer is the fourth leading cancer in males and is associated with exposure to environmental carcinogens. WW domain-containing oxidoreductase (WWOX), a tumor suppressor gene, is associated with the development of various cancers. We hypothesized that genetic variants of WWOX influence the susceptibility to oral cancer. Five polymorphisms of WWOX gene from 761 male patients with oral cancer and 1199 male cancer-free individuals were genotyped. We observed that individuals carrying the polymorphic allele of WWOX rs11545028 are more susceptible to oral cancer. Furthermore, patients with advanced-stage oral cancer were associated with a higher frequency of WWOX rs11545028 polymorphisms with the variant genotype TT than did patients with the wild-type gene. An additional integrated in silico analysis confirmed that rs11545028 affects WWOX expression, which significantly correlates with tumor expression and subsequently with tumor development and aggressiveness. In conclusion, genetic variants of WWOX contribute to the occurrence of oral cancer, and the findings regarding these biomarkers provided a prediction model for risk assessment.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Villard, E.; Soubrier, F.; Tiret, L.
1996-06-01
Plasma angiotensin I-converting enzyme (ACE) levels are highly genetically determined. A previous segregation-linkage analysis suggested the existence of a functional mutation located within or close to the ACE locus, in almost complete linkage disequilibrium (LD) with the ACE insertion/deletion (I/D) polymorphism and accounting for half the ACE variance. In order to identify the functional variant at the molecular level, we compared ACE gene sequences between four subjects selected for having contrasted ACE levels and I/D genotypes. We identified 10 new polymorphisms, among which 8 were genotyped in 95 healthy nuclear families, in addition to the I/D polymorphism. These polymorphisms couldmore » be divided into two groups: five polymorphisms in the 5{prime} region and three in the coding sequence and the 3{prime} UTR. Within each group, polymorphisms were in nearly complete association, whereas polymorphisms from the two groups were in strong negative LD. After adjustment for the I/D polymorphism, all polymorphisms of the 5{prime} group remained significantly associated with ACE levels, which suggests the existence of two quantitative trait loci (QTL) acting additively on ACE levels. Segregation-linkage analyses including one or two ACE-linked QTLs in LD with two ACE markers were performed to test this hypothesis. The two QTLs and the two markers were assumed to be in complete LD. Results supported the existence of two ACE-linked QTLs, which would explain 38% and 49% of the ACE variance in parents and offspring, respectively. One of these QTLs might be the I/D polymorphism itself or the newly characterized 4656(CT){sub 2/3} polymorphism. The second QTL would have a frequency of {approximately}.20, which is incompatible with any of the yet-identified polymorphisms. More extensive sequencing and extended analyses in larger samples and in other populations will be necessary to characterize definitely the functional variants. 30 refs., 1 fig., 6 tabs.« less
[Fine mapping of complex disease susceptibility loci].
Song, Qingfeng; Zhang, Hongxing; Ma, Yilong; Zhou, Gangqiao
2014-01-01
Genome-wide association studies (GWAS) using single nucleotide polymorphism (SNP) markers have identified more than 3800 susceptibility loci for more than 660 diseases or traits. However, the most significantly associated variants or causative variants in these loci and their biological functions have remained to be clarified. These causative variants can help to elucidate the pathogenesis and discover new biomarkers of complex diseases. One of the main goals in the post-GWAS era is to identify the causative variants and susceptibility genes, and clarify their functional aspects by fine mapping. For common variants, imputation or re-sequencing based strategies were implemented to increase the number of analyzed variants and help to identify the most significantly associated variants. In addition, functional element, expression quantitative trait locus (eQTL) and haplotype analyses were performed to identify functional common variants and susceptibility genes. For rare variants, fine mapping was carried out by re-sequencing, rare haplotype analysis, family-based analysis, burden test, etc.This review summarizes the strategies and problems for fine mapping.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Erlich, H.; Zangenberg, G.; Bugawan, T.
The rate at which allelic diversity at the HLA class I and class II loci evolves has been the subject of considerable controversy as have the mechanisms which generate new alleles. The patchwork pattern of polymorphism, particularly within the second exon of the HLA-DPB1 locus where the polymorphic sequence motifs are localized to 6 discrete regions, is consistent with the hypothesis that much of the allelic sequence variation may have been generated by segmental exchange (gene conversion). To measure the rate of new DPB1 variant generation, we have developed a strategy in which DPB1 second exon sequences are amplified frommore » pools of FACS-sorted sperm (n=50) from a heterozygous sperm donor. Pools of sperm from these heterozygous individuals are amplified with an allele-specific primer for one allele and analyzed with sequence-specific oligonucleotide probes (SSOP) complementary to the other allele. This screening procedure, which is capable of detecting a single variant molecule in a pool of parental alleles, allows the identification of new variants that have been generated by recombination and/or gene conversion between the two parental alleles. To control for potential PCR artifacts, the same screening procedure was carried out with mixtures of sperm from DPB1 *0301/*0301 and DPB1 *0401/ 0401 individuals. Pools containing putative new variants DPB1 alleles were analyzed further by cloning into M13 and sequencing the M13 clones. Our current estimate is that about 1/10,000 sperm from these heterozygous individuals represents a new DPB1 allele generated by micro-gene conversion within the second exon.« less
Identification of a novel valosin-containing protein polymorphism in late-onset Alzheimer's disease.
Kaleem, M; Zhao, A; Hamshere, M; Myers, A J
2007-01-01
Recently, mutations in the valosin-containing protein gene (VCP) were found to be causative for a rare form of dementia [Watts GDJ, et al.: Nat Genet 2004;36:377-381]. This gene lies within a region on the genome that has been linked to late onset Alzheimer's disease (LOAD) [Myers A, et al.: Am J Med Genet 2002;114:233-242]. In this study, we investigated whether variation within VCP could account for the LOAD linkage peak on chromosome 9. We sequenced 188 individuals from the set of sibling pairs we had used to obtain the linkage results for chromosome 9 to look for novel polymorphisms that could explain the linkage signal. Any variant that was found was then typed in 2 additional sets of neuropathologically confirmed samples to look for associations with Alzheimer's disease. We found 2 variants when we sequenced VCP. One was a novel rare variant (R92H) and the other is already reported within the publicly available databases (rs10972300). Neither explained the chromosome 9 linkage signal for LOAD. We have found a novel rare variant within the VCP gene, but we did not find a variant that could explain the linkage signal for LOAD on chromosome 9. Copyright (c) 2007 S. Karger AG, Basel.
267 Spanish Exomes Reveal Population-Specific Differences in Disease-Related Genetic Variation
Dopazo, Joaquín; Amadoz, Alicia; Bleda, Marta; Garcia-Alonso, Luz; Alemán, Alejandro; García-García, Francisco; Rodriguez, Juan A.; Daub, Josephine T.; Muntané, Gerard; Rueda, Antonio; Vela-Boza, Alicia; López-Domingo, Francisco J.; Florido, Javier P.; Arce, Pablo; Ruiz-Ferrer, Macarena; Méndez-Vidal, Cristina; Arnold, Todd E.; Spleiss, Olivia; Alvarez-Tejado, Miguel; Navarro, Arcadi; Bhattacharya, Shomi S.; Borrego, Salud; Santoyo-López, Javier; Antiñolo, Guillermo
2016-01-01
Recent results from large-scale genomic projects suggest that allele frequencies, which are highly relevant for medical purposes, differ considerably across different populations. The need for a detailed catalog of local variability motivated the whole-exome sequencing of 267 unrelated individuals, representative of the healthy Spanish population. Like in other studies, a considerable number of rare variants were found (almost one-third of the described variants). There were also relevant differences in allelic frequencies in polymorphic variants, including ∼10,000 polymorphisms private to the Spanish population. The allelic frequencies of variants conferring susceptibility to complex diseases (including cancer, schizophrenia, Alzheimer disease, type 2 diabetes, and other pathologies) were overall similar to those of other populations. However, the trend is the opposite for variants linked to Mendelian and rare diseases (including several retinal degenerative dystrophies and cardiomyopathies) that show marked frequency differences between populations. Interestingly, a correspondence between differences in allelic frequencies and disease prevalence was found, highlighting the relevance of frequency differences in disease risk. These differences are also observed in variants that disrupt known drug binding sites, suggesting an important role for local variability in population-specific drug resistances or adverse effects. We have made the Spanish population variant server web page that contains population frequency information for the complete list of 170,888 variant positions we found publicly available (http://spv.babelomics.org/), We show that it if fundamental to determine population-specific variant frequencies to distinguish real disease associations from population-specific polymorphisms. PMID:26764160
Valine/isoleucine variants drive selective pressure in the VP1 sequence of EV-A71 enteroviruses.
Duy, Nghia Ngu; Huong, Le Thi Thanh; Ravel, Patrice; Huong, Le Thi Song; Dwivedi, Ankit; Sessions, October Michael; Hou, Yan'An; Chua, Robert; Kister, Guilhem; Afelt, Aneta; Moulia, Catherine; Gubler, Duane J; Thiem, Vu Dinh; Thanh, Nguyen Thi Hien; Devaux, Christian; Duong, Tran Nhu; Hien, Nguyen Tran; Cornillot, Emmanuel; Gavotte, Laurent; Frutos, Roger
2017-05-08
In 2011-2012, Northern Vietnam experienced its first large scale hand foot and mouth disease (HFMD) epidemic. In 2011, a major HFMD epidemic was also reported in South Vietnam with fatal cases. This 2011-2012 outbreak was the first one to occur in North Vietnam providing grounds to study the etiology, origin and dynamic of the disease. We report here the analysis of the VP1 gene of strains isolated throughout North Vietnam during the 2011-2012 outbreak and before. The VP1 gene of 106 EV-A71 isolates from North Vietnam and 2 from Central Vietnam were sequenced. Sequence alignments were analyzed at the nucleic acid and protein level. Gene polymorphism was also analyzed. A Factorial Correspondence Analysis was performed to correlate amino acid mutations with clinical parameters. The sequences were distributed into four phylogenetic clusters. Three clusters corresponded to the subgenogroup C4 and the last one corresponded to the subgenogroup C5. Each cluster displayed different polymorphism characteristics. Proteins were highly conserved but three sites bearing only Isoleucine (I) or Valine (V) were characterized. The isoleucine/valine variability matched the clusters. Spatiotemporal analysis of the I/V variants showed that all variants which emerged in 2011 and then in 2012 were not the same but were all present in the region prior to the 2011-2012 outbreak. Some correlation was found between certain I/V variants and ethnicity and severity. The 2011-2012 outbreak was not caused by an exogenous strain coming from South Vietnam or elsewhere but by strains already present and circulating at low level in North Vietnam. However, what triggered the outbreak remains unclear. A selective pressure is applied on I/V variants which matches the genetic clusters. I/V variants were shown on other viruses to correlate with pathogenicity. This should be investigated in EV-A71. I/V variants are an easy and efficient way to survey and identify circulating EV-A71 strains.
Quantitative trait nucleotide analysis using Bayesian model selection.
Blangero, John; Goring, Harald H H; Kent, Jack W; Williams, Jeff T; Peterson, Charles P; Almasy, Laura; Dyer, Thomas D
2005-10-01
Although much attention has been given to statistical genetic methods for the initial localization and fine mapping of quantitative trait loci (QTLs), little methodological work has been done to date on the problem of statistically identifying the most likely functional polymorphisms using sequence data. In this paper we provide a general statistical genetic framework, called Bayesian quantitative trait nucleotide (BQTN) analysis, for assessing the likely functional status of genetic variants. The approach requires the initial enumeration of all genetic variants in a set of resequenced individuals. These polymorphisms are then typed in a large number of individuals (potentially in families), and marker variation is related to quantitative phenotypic variation using Bayesian model selection and averaging. For each sequence variant a posterior probability of effect is obtained and can be used to prioritize additional molecular functional experiments. An example of this quantitative nucleotide analysis is provided using the GAW12 simulated data. The results show that the BQTN method may be useful for choosing the most likely functional variants within a gene (or set of genes). We also include instructions on how to use our computer program, SOLAR, for association analysis and BQTN analysis.
Wik, Lotta; Mikko, Sofia; Klingeborn, Mikael; Stéen, Margareta; Simonsson, Magnus; Linné, Tommy
2012-01-01
The prion protein (PrP) sequence of European moose, reindeer, roe deer and fallow deer in Scandinavia has high homology to the PrP sequence of North American cervids. Variants in the European moose PrP sequence were found at amino acid position 109 as K or Q. The 109Q variant is unique in the PrP sequence of vertebrates. During the 1980s a wasting syndrome in Swedish moose, Moose Wasting Syndrome (MWS), was described. SNP analysis demonstrated a difference in the observed genotype proportions of the heterozygous Q/K and homozygous Q/Q variants in the MWS animals compared with the healthy animals. In MWS moose the allele frequencies for 109K and 109Q were 0.73 and 0.27, respectively, and for healthy animals 0.69 and 0.31. Both alleles were seen as heterozygotes and homozygotes. In reindeer, PrP sequence variation was demonstrated at codon 176 as D or N and codon 225 as S or Y. The PrP sequences in roe deer and fallow deer were identical with published GenBank sequences. PMID:22441661
Literak, Ivan; Manga, Ivan; Wojczulanis-Jakubas, Katarzyna; Chroma, Magdalena; Jamborova, Ivana; Dobiasova, Hana; Sedlakova, Miroslava Htoutou; Cizek, Alois
2014-07-16
We aimed at Escherichia coli and Enterobacter cloacae isolates resistant to cephalosporins and fluoroquinolones and Salmonella isolates in wild birds in Arctic Svalbard, Norway. Cloacal swabs of little auks (Alle alle, n=215) and samples of faeces of glaucous gulls (Larus hyperboreus, n=15) were examined. Inducible production of AmpC enzyme was detected in E. cloacae KW218 isolate. Sequence analysis of the 1146 bp PCR product of the ampC gene from this isolate revealed 99% sequence homology with the blaACT-14 and blaACT-5 AmpC beta-lactamase genes. Four, respectively six of the identified single nucleotide polymorphisms generated amino acid substitutions in the amino acid chain. As the ampC sequence polymorphism in the investigated E. cloacae strain was identified as unique, we revealed a novel variant of the ampC beta-lactamase gene blaACT-23. Copyright © 2014 Elsevier B.V. All rights reserved.
Lesniak, Anna; Walczak, Marta; Jezierski, Tadeusz; Sacharczuk, Mariusz; Gawkowski, Maciej; Jaszczak, Kazimierz
2008-01-01
The outstanding sensitivity of the canine olfactory system has been acknowledged by using sniffer dogs in military and civilian service for detection of a variety of odors. It is hypothesized that the canine olfactory ability is determined by polymorphisms in olfactory receptor (OR) genes. We investigated 5 OR genes for polymorphic sites which might affect the olfactory ability of service dogs in different fields of specific substance detection. All investigated OR DNA sequences proved to have allelic variants, the majority of which lead to protein sequence alteration. Homozygous individuals at 2 gene loci significantly differed in their detection skills from other genotypes. This suggests a role of specific alleles in odor detection and a linkage between single-nucleotide polymorphism and odor recognition efficiency.
USDA-ARS?s Scientific Manuscript database
One of the key aims of livestock genetics and genomics research is to discover the genetic variants underlying economically important traits such as reproductive performance, feed efficiency, disease susceptibility, and product quality. Next generation sequencing has recently emerged as an economica...
Genetic analysis of LRRK2 functional domains in Brazilian patients with Parkinson's disease.
Abdalla-Carvalho, C B; Santos-Rebouças, C B; Guimarães, B C; Campos, M; Pereira, J S; de Rosso, A L Zuma; Nicaretta, D H; Marinho e Silva, M; dos Santos, Mendonça J; Pimentel, M M G
2010-12-01
Mutations in the leucine-rich repeat kinase 2 gene (LRRK2) have been associated with Parkinson's disease (PD), and the majority of the pathogenic variants are located in the ROC and MAPKKK domains. Exons 29-31 and 38-44 (ROC and MAPKKK domains) were sequenced in 204 patients with PD, mostly Brazilian. We identified four polymorphisms, a novel silent variant p.R1398R and four substitutions: p.T1410M, p.G2019S, p.Y2189C and the novel variant p.C2139S. The most prevalent mutation was the p.G2019S (2.4%). We consider that the p.T1410M and the p.Y2189C variants are probably polymorphisms and that the p.C2139S mutation is potentially pathogenic. © 2010 The Author(s). European Journal of Neurology © 2010 EFNS.
Tyler, S D; Johnson, W M; Lior, H; Wang, G; Rozee, K R
1991-01-01
A set of synthetic oligonucleotide primers was designed for use in a polymerase chain reaction protocol to specifically detect the B subunit genes in vtx2ha and vtx2hb, which code for the production of the VT2 (Shiga-like toxin II) variant cytotoxins VT2v-a and VT2v-b, respectively. An additional set of primers amplified a fragment common to the B subunits of the VT2 and the VT2 variant genes. Subsequent restriction endonuclease digestion of this amplicon permitted prediction of specific VT2 and variant genotypes on the basis of predetermined restriction fragment length polymorphisms. Genotypes of 21 VT2-producing strains of Escherichia coli were determined using this polymerase chain reaction-restriction fragment length polymorphism procedure. Four strains contained B subunit target sequences only for VT2 genes, 9 strains contained sequences only for VT2v-a genes, and 3 strains contained sequences only for VT2v-b. For genes in combination, one strain contained B subunit genes for both VT2 and VT2v-a and two strains contained B subunit genes for VT2 and VT2v-b. Two strains of E. coli O91:H21 contained both VT2v-a and VT2v-b B subunit genes. The VT2 reference strain of E. coli, E32511, was found to contain the targeted sequences from both VT2 and VT2v-a genes, whereas the recombinant E. coli, pEB1, possessed only that of the VT2 gene. The specific activities of extracellular VT2 determined in HeLa cells ranged from 0.3 to 41.7 TCD50 per microgram of protein in strains carrying the VT2 gene target and from 0 to 50.0 TCD50 per microgram of protein in strains carrying only the VT2 variant target (TCD50 is the tissue culture dose by which 50% of the cells were affected), suggesting that phenotypic expression does not correlate with genotype. Images PMID:1679436
Rodriguez-Flores, Juan L.; Fakhro, Khalid; Hackett, Neil R.; Salit, Jacqueline; Fuller, Jennifer; Agosto-Perez, Francisco; Gharbiah, Maey; Malek, Joel A.; Zirie, Mahmoud; Jayyousi, Amin; Badii, Ramin; Al-Marri, Ajayeb Al-Nabet; Chouchane, Lotfi; Stadler, Dora J.; Hunter-Zinck, Haley; Mezey, Jason G.; Crystal, Ronald G.
2013-01-01
Exome sequencing of families of related individuals has been highly successful in identifying genetic polymorphisms responsible for Mendelian disorders. Here, we demonstrate the value of the reverse approach, where we use exome sequencing of a sample of unrelated individuals to analyze allele frequencies of known causal mutations for Mendelian diseases. We sequenced the exomes of 100 individuals representing the three major genetic subgroups of the Qatari population (Q1 Bedouin, Q2 Persian-South Asian, Q3 African) and identified 37 variants in 33 genes with effects on 36 clinically significant Mendelian diseases. These include variants not present in 1000 Genomes and variants at high frequency when compared to 1000 Genomes populations. Several of these Mendelian variants were only segregating in one Qatari subpopulation, where the observed subpopulation specificity trends were confirmed in an independent population of 386 Qataris. Pre-marital genetic screening in Qatar tests for only 4 out of the 37, such that this study provides a set of Mendelian disease variants with potential impact on the epidemiological profile of the population that could be incorporated into the testing program if further experimental and clinical characterization confirms high penetrance. PMID:24123366
Sawada, Akihisa; Croom-Carter, Deborah; Kondo, Osamu; Yasui, Masahiro; Koyama-Sato, Maho; Inoue, Masami; Kawa, Keisei; Rickinson, Alan B; Tierney, Rosemary J
2011-05-01
Polymorphisms in Epstein-Barr virus (EBV) latent genes can identify virus strains from different human populations and individual strains within a population. An Asian EBV signature has been defined almost exclusively from Chinese viruses, with little information from other Asian countries. Here we sequenced polymorphic regions of the EBNA1, 2, 3A, 3B, 3C and LMP1 genes of 31 Japanese strains from control donors and EBV-associated T/NK-cell lymphoproliferative disease (T/NK-LPD) patients. Though identical to Chinese strains in their dominant EBNA1 and LMP1 alleles, Japanese viruses were subtly different at other loci. Thus, while Chinese viruses mainly fall into two families with strongly linked 'Wu' or 'Li' alleles at EBNA2 and EBNA3A/B/C, Japanese viruses all have the consensus Wu EBNA2 allele but fall into two families at EBNA3A/B/C. One family has variant Li-like sequences at EBNA3A and 3B and the consensus Li sequence at EBNA3C; the other family has variant Wu-like sequences at EBNA3A, variants of a low frequency Chinese allele 'Sp' at EBNA3B and a consensus Sp sequence at EBNA3C. Thus, EBNA3A/B/C allelotypes clearly distinguish Japanese from Chinese strains. Interestingly, most Japanese viruses also lack those immune-escape mutations in the HLA-A11 epitope-encoding region of EBNA3B that are so characteristic of viruses from the highly A11-positive Chinese population. Control donor-derived and T/NK-LPD-derived strains were similarly distributed across allelotypes and, by using allelic polymorphisms to track virus strains in patients pre- and post-haematopoietic stem-cell transplant, we show that a single strain can induce both T/NK-LPD and B-cell-lymphoproliferative disease in the same patient.
Karas, Vlad O; Sinnott-Armstrong, Nicholas A; Varghese, Vici; Shafer, Robert W; Greenleaf, William J; Sherlock, Gavin
2018-01-01
Abstract Much of the within species genetic variation is in the form of single nucleotide polymorphisms (SNPs), typically detected by whole genome sequencing (WGS) or microarray-based technologies. However, WGS produces mostly uninformative reads that perfectly match the reference, while microarrays require genome-specific reagents. We have developed Diff-seq, a sequencing-based mismatch detection assay for SNP discovery without the requirement for specialized nucleic-acid reagents. Diff-seq leverages the Surveyor endonuclease to cleave mismatched DNA molecules that are generated after cross-annealing of a complex pool of DNA fragments. Sequencing libraries enriched for Surveyor-cleaved molecules result in increased coverage at the variant sites. Diff-seq detected all mismatches present in an initial test substrate, with specific enrichment dependent on the identity and context of the variation. Application to viral sequences resulted in increased observation of variant alleles in a biologically relevant context. Diff-Seq has the potential to increase the sensitivity and efficiency of high-throughput sequencing in the detection of variation. PMID:29361139
Molecular mechanisms for protein-encoded inheritance
Wiltzius, Jed J. W.; Landau, Meytal; Nelson, Rebecca; Sawaya, Michael R.; Apostol, Marcin I.; Goldschmidt, Lukasz; Soriaga, Angela B.; Cascio, Duilio; Rajashankar, Kanagalaghatta; Eisenberg, David
2013-01-01
Strains are phenotypic variants, encoded by nucleic acid sequences in chromosomal inheritance and by protein “conformations” in prion inheritance and transmission. But how is a protein “conformation” stable enough to endure transmission between cells or organisms? Here new polymorphic crystal structures of segments of prion and other amyloid proteins offer structural mechanisms for prion strains. In packing polymorphism, prion strains are encoded by alternative packings (polymorphs) of β-sheets formed by the same segment of a protein; in a second mechanism, segmental polymorphism, prion strains are encoded by distinct β-sheets built from different segments of a protein. Both forms of polymorphism can produce enduring “conformations,” capable of encoding strains. These molecular mechanisms for transfer of information into prion strains share features with the familiar mechanism for transfer of information by nucleic acid inheritance, including sequence specificity and recognition by non-covalent bonds. PMID:19684598
Glessner, Joseph T; Bick, Alexander G; Ito, Kaoru; Homsy, Jason; Rodriguez-Murillo, Laura; Fromer, Menachem; Mazaika, Erica; Vardarajan, Badri; Italia, Michael; Leipzig, Jeremy; DePalma, Steven R; Golhar, Ryan; Sanders, Stephan J; Yamrom, Boris; Ronemus, Michael; Iossifov, Ivan; Willsey, A Jeremy; State, Matthew W; Kaltman, Jonathan R; White, Peter S; Shen, Yufeng; Warburton, Dorothy; Brueckner, Martina; Seidman, Christine; Goldmuntz, Elizabeth; Gelb, Bruce D; Lifton, Richard; Seidman, Jonathan; Hakonarson, Hakon; Chung, Wendy K
2014-10-24
Congenital heart disease (CHD) is among the most common birth defects. Most cases are of unknown pathogenesis. To determine the contribution of de novo copy number variants (CNVs) in the pathogenesis of sporadic CHD. We studied 538 CHD trios using genome-wide dense single nucleotide polymorphism arrays and whole exome sequencing. Results were experimentally validated using digital droplet polymerase chain reaction. We compared validated CNVs in CHD cases with CNVs in 1301 healthy control trios. The 2 complementary high-resolution technologies identified 63 validated de novo CNVs in 51 CHD cases. A significant increase in CNV burden was observed when comparing CHD trios with healthy trios, using either single nucleotide polymorphism array (P=7×10(-5); odds ratio, 4.6) or whole exome sequencing data (P=6×10(-4); odds ratio, 3.5) and remained after removing 16% of de novo CNV loci previously reported as pathogenic (P=0.02; odds ratio, 2.7). We observed recurrent de novo CNVs on 15q11.2 encompassing CYFIP1, NIPA1, and NIPA2 and single de novo CNVs encompassing DUSP1, JUN, JUP, MED15, MED9, PTPRE SREBF1, TOP2A, and ZEB2, genes that interact with established CHD proteins NKX2-5 and GATA4. Integrating de novo variants in whole exome sequencing and CNV data suggests that ETS1 is the pathogenic gene altered by 11q24.2-q25 deletions in Jacobsen syndrome and that CTBP2 is the pathogenic gene in 10q subtelomeric deletions. We demonstrate a significantly increased frequency of rare de novo CNVs in CHD patients compared with healthy controls and suggest several novel genetic loci for CHD. © 2014 American Heart Association, Inc.
Comparison and evaluation of two exome capture kits and sequencing platforms for variant calling.
Zhang, Guoqiang; Wang, Jianfeng; Yang, Jin; Li, Wenjie; Deng, Yutian; Li, Jing; Huang, Jun; Hu, Songnian; Zhang, Bing
2015-08-05
To promote the clinical application of next-generation sequencing, it is important to obtain accurate and consistent variants of target genomic regions at low cost. Ion Proton, the latest updated semiconductor-based sequencing instrument from Life Technologies, is designed to provide investigators with an inexpensive platform for human whole exome sequencing that achieves a rapid turnaround time. However, few studies have comprehensively compared and evaluated the accuracy of variant calling between Ion Proton and Illumina sequencing platforms such as HiSeq 2000, which is the most popular sequencing platform for the human genome. The Ion Proton sequencer combined with the Ion TargetSeq Exome Enrichment Kit together make up TargetSeq-Proton, whereas SureSelect-Hiseq is based on the Agilent SureSelect Human All Exon v4 Kit and the HiSeq 2000 sequencer. Here, we sequenced exonic DNA from four human blood samples using both TargetSeq-Proton and SureSelect-HiSeq. We then called variants in the exonic regions that overlapped between the two exome capture kits (33.6 Mb). The rates of shared variant loci called by two sequencing platforms were from 68.0 to 75.3% in four samples, whereas the concordance of co-detected variant loci reached 99%. Sanger sequencing validation revealed that the validated rate of concordant single nucleotide polymorphisms (SNPs) (91.5%) was higher than the SNPs specific to TargetSeq-Proton (60.0%) or specific to SureSelect-HiSeq (88.3%). With regard to 1-bp small insertions and deletions (InDels), the Sanger sequencing validated rates of concordant variants (100.0%) and SureSelect-HiSeq-specific (89.6%) were higher than those of TargetSeq-Proton-specific (15.8%). In the sequencing of exonic regions, a combination of using of two sequencing strategies (SureSelect-HiSeq and TargetSeq-Proton) increased the variant calling specificity for concordant variant loci and the sensitivity for variant loci called by any one platform. However, for the sequencing of platform-specific variants, the accuracy of variant calling by HiSeq 2000 was higher than that of Ion Proton, specifically for the InDel detection. Moreover, the variant calling software also influences the detection of SNPs and, specifically, InDels in Ion Proton exome sequencing.
Multi-species Identification of Polymorphic Peptide Variants via Propagation in Spectral Networks
DOE Office of Scientific and Technical Information (OSTI.GOV)
Na, Seungjin; Payne, Samuel H.; Bandeira, Nuno
The spectral networks approach enables the detection of pairs of spectra from related peptides and thus allows for the propagation of annotations from identified peptides to unidentified spectra. Beyond allowing for unbiased discovery of unexpected post-translational modifications, spectral networks are also applicable to multi-species comparative proteomics or metaproteomics to identify numerous orthologous versions of a protein. We present algorithmic and statistical advances in spectral networks that have made it possible to rigorously assess the statistical significance of spectral pairs and accurately estimate the error rate of identifications via propagation. In the analysis of three related Cyanothece species, a model organismmore » for biohydrogen production, spectral networks identified peptides with highly divergent sequences with up to dozens of variants per peptide, including many novel peptides in species that lack a sequenced genome. Furthermore, spectral networks strongly suggested the presence of novel peptides even in genomically characterized species (i.e. missing from databases) in that a significant portion of unidentified multi-species networks included at least two polymorphic peptide variants.« less
2012-01-01
Background The central role of the somatotrophic axis in animal post-natal growth, development and fertility is well established. Therefore, the identification of genetic variants affecting quantitative traits within this axis is an attractive goal. However, large sample numbers are a pre-requisite for the identification of genetic variants underlying complex traits and although technologies are improving rapidly, high-throughput sequencing of large numbers of complete individual genomes remains prohibitively expensive. Therefore using a pooled DNA approach coupled with target enrichment and high-throughput sequencing, the aim of this study was to identify polymorphisms and estimate allele frequency differences across 83 candidate genes of the somatotrophic axis, in 150 Holstein-Friesian dairy bulls divided into two groups divergent for genetic merit for fertility. Results In total, 4,135 SNPs and 893 indels were identified during the resequencing of the 83 candidate genes. Nineteen percent (n = 952) of variants were located within 5' and 3' UTRs. Seventy-two percent (n = 3,612) were intronic and 9% (n = 464) were exonic, including 65 indels and 236 SNPs resulting in non-synonymous substitutions (NSS). Significant (P < 0.01) mean allele frequency differentials between the low and high fertility groups were observed for 720 SNPs (58 NSS). Allele frequencies for 43 of the SNPs were also determined by genotyping the 150 individual animals (Sequenom® MassARRAY). No significant differences (P > 0.1) were observed between the two methods for any of the 43 SNPs across both pools (i.e., 86 tests in total). Conclusions The results of the current study support previous findings of the use of DNA sample pooling and high-throughput sequencing as a viable strategy for polymorphism discovery and allele frequency estimation. Using this approach we have characterised the genetic variation within genes of the somatotrophic axis and related pathways, central to mammalian post-natal growth and development and subsequent lactogenesis and fertility. We have identified a large number of variants segregating at significantly different frequencies between cattle groups divergent for calving interval plausibly harbouring causative variants contributing to heritable variation. To our knowledge, this is the first report describing sequencing of targeted genomic regions in any livestock species using groups with divergent phenotypes for an economically important trait. PMID:22235840
Han, Jun Hyun; Lee, Yong Seong; Kim, Hae Jong; Lee, Shin Young; Myung, Soon Chul
2015-01-01
In this study, we evaluated genetic variants of the androgen metabolism genes CYP17A1, CYP3A4, and CYP3A43 to determine whether they play a role in the development of prostate cancer (PCa) in Korean men. The study population included 240 pathologically diagnosed cases of PCa and 223 age-matched controls. Among the 789 single-nucleotide polymorphism (SNP) database variants detected, 129 were reported in two Asian groups (Han Chinese and Japanese) in the HapMap database. Only 21 polymorphisms of CYP17A1, CYP3A4, and CYP3A43 were selected based on linkage disequilibrium in Asians (r2 = 1), locations (SNPs in exons were preferred), and amino acid changes and were assessed. In addition, we performed haplotype analysis for the 21 SNPs in CYP17A1, CYP3A4, and CYP3A43 genes. To determine the association between genotype and haplotype distributions of patients and controls, logistic analyses were carried out, controlling for age. Twelve sequence variants and five major haplotypes were identified in CYP17A1. Five sequence variants and two major haplotypes were identified in CYP3A4. Four sequence variants and four major haplotypes were observed in CYP3A43. CYP17A1 haplotype-2 (Ht-2) (odds ratio [OR], 1.51; 95% confidence interval [CI], 1.04–2.18) was associated with PCa susceptibility. CYP3A4 Ht-2 (OR: 1.87; 95% CI: 1.02–3.43) was associated with PCa metastatic potential according to tumor stage. rs17115149 (OR: 1.96; 95% CI: 1.04–3.68) and CYP17A1 Ht-4 (OR: 2.01; 95% CI: 1.07–4.11) showed a significant association with histologic aggressiveness according to Gleason score. Genetic variants of CYP17A1 and CYP3A4 may play a role in the development of PCa in Korean men. PMID:25337833
Han, Jun Hyun; Lee, Yong Seong; Kim, Hae Jong; Lee, Shin Young; Myung, Soon Chul
2015-01-01
In this study, we evaluated genetic variants of the androgen metabolism genes CYP17A1, CYP3A4, and CYP3A43 to determine whether they play a role in the development of prostate cancer (PCa) in Korean men. The study population included 240 pathologically diagnosed cases of PCa and 223 age-matched controls. Among the 789 single-nucleotide polymorphism (SNP) database variants detected, 129 were reported in two Asian groups (Han Chinese and Japanese) in the HapMap database. Only 21 polymorphisms of CYP17A1, CYP3A4, and CYP3A43 were selected based on linkage disequilibrium in Asians (r2 = 1), locations (SNPs in exons were preferred), and amino acid changes and were assessed. In addition, we performed haplotype analysis for the 21 SNPs in CYP17A1, CYP3A4, and CYP3A43 genes. To determine the association between genotype and haplotype distributions of patients and controls, logistic analyses were carried out, controlling for age. Twelve sequence variants and five major haplotypes were identified in CYP17A1. Five sequence variants and two major haplotypes were identified in CYP3A4. Four sequence variants and four major haplotypes were observed in CYP3A43. CYP17A1 haplotype-2 (Ht-2) (odds ratio [OR], 1.51; 95% confidence interval [CI], 1.04-2.18) was associated with PCa susceptibility. CYP3A4 Ht-2 (OR: 1.87; 95% CI: 1.02-3.43) was associated with PCa metastatic potential according to tumor stage. rs17115149 (OR: 1.96; 95% CI: 1.04-3.68) and CYP17A1 Ht-4 (OR: 2.01; 95% CI: 1.07-4.11) showed a significant association with histologic aggressiveness according to Gleason score. Genetic variants of CYP17A1 and CYP3A4 may play a role in the development of PCa in Korean men.
Safra, Noa; Hayward, Louisa J; Aguilar, Miriam; Sacks, Benjamin N; Westropp, Jodi L; Mohr, F Charles; Mellersh, Cathryn S; Bannasch, Danika L
2015-01-01
The aim of this study was to investigate the frequency of regional DNA variants upstream to the translation initiation site of the canine Cyclooxygenase-2 (Cox-2) gene in healthy dogs. Cox-2 plays a role in various disease conditions such as acute and chronic inflammation, osteoarthritis and malignancy. A role for Cox-2 DNA variants in genetic predisposition to canine renal dysplasia has been proposed and dog breeders have been encouraged to select against these DNA variants. We sequenced 272-422 bases in 152 dogs unaffected by renal dysplasia and found 19 different haplotypes including 11 genetic variants which had not been described previously. We genotyped 7 gray wolves to ascertain the wildtype variant and found that the wolves we analyzed had predominantly the second most common DNA variant found in dogs. Our results demonstrate an elevated level of regional polymorphism that appears to be a feature of healthy domesticated dogs.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Abraham, Paul E.; Wang, Xiaojing; Ranjan, Priya
The availability of next-generation sequencing technologies has rapidly transformed our ability to link genotypes to phenotypes, and as such, promises to facilitate the dissection of genetic contribution to complex traits. Although discoveries of genetic associations will further our understanding of biology, once candidate variants have been identified, investigators are faced with the challenge of characterizing the functional effects on proteins encoded by such genes. Here we show how next-generation RNA sequencing data can be exploited to construct genotype-specific protein sequence databases, which provide a clearer picture of the molecular toolbox underlying cellular and organismal processes and their variation in amore » natural population. For this study, we used two individual genotypes (DENA-17-3 and VNDL-27-4) from a recent genome wide association (GWA) study of Populus trichocarpa, an obligate outcrosser that exhibits tremendous phenotypic variation across the natural population. This strategy allowed us to comprehensively catalogue proteins containing single amino acid polymorphisms (SAAPs) and insertions and deletions (INDELS). Based on large-scale identification of SAAPs, we profiled the frequency of 128 types of naturally occurring amino acid substitutions, with a subset of SAAPs occurring in regions of the genome having strong polymorphism patterns consistent with recent positive and/or divergent selection. In addition, we were able to explore the diploid landscape of Populus at the proteome-level, allowing the characterization of heterozygous variants.« less
Abraham, Paul E.; Wang, Xiaojing; Ranjan, Priya; ...
2015-10-20
The availability of next-generation sequencing technologies has rapidly transformed our ability to link genotypes to phenotypes, and as such, promises to facilitate the dissection of genetic contribution to complex traits. Although discoveries of genetic associations will further our understanding of biology, once candidate variants have been identified, investigators are faced with the challenge of characterizing the functional effects on proteins encoded by such genes. Here we show how next-generation RNA sequencing data can be exploited to construct genotype-specific protein sequence databases, which provide a clearer picture of the molecular toolbox underlying cellular and organismal processes and their variation in amore » natural population. For this study, we used two individual genotypes (DENA-17-3 and VNDL-27-4) from a recent genome wide association (GWA) study of Populus trichocarpa, an obligate outcrosser that exhibits tremendous phenotypic variation across the natural population. This strategy allowed us to comprehensively catalogue proteins containing single amino acid polymorphisms (SAAPs) and insertions and deletions (INDELS). Based on large-scale identification of SAAPs, we profiled the frequency of 128 types of naturally occurring amino acid substitutions, with a subset of SAAPs occurring in regions of the genome having strong polymorphism patterns consistent with recent positive and/or divergent selection. In addition, we were able to explore the diploid landscape of Populus at the proteome-level, allowing the characterization of heterozygous variants.« less
USDA-ARS?s Scientific Manuscript database
We needed to obtain an alternative to conventional cloning to generate high-quality DNA sequences from a variety of nuclear orthologs for phylogenetic studies in potato, to save time and money and to avoid problems typically encountered in cloning. We tested a variety of SSCP protocols to include pu...
267 Spanish Exomes Reveal Population-Specific Differences in Disease-Related Genetic Variation.
Dopazo, Joaquín; Amadoz, Alicia; Bleda, Marta; Garcia-Alonso, Luz; Alemán, Alejandro; García-García, Francisco; Rodriguez, Juan A; Daub, Josephine T; Muntané, Gerard; Rueda, Antonio; Vela-Boza, Alicia; López-Domingo, Francisco J; Florido, Javier P; Arce, Pablo; Ruiz-Ferrer, Macarena; Méndez-Vidal, Cristina; Arnold, Todd E; Spleiss, Olivia; Alvarez-Tejado, Miguel; Navarro, Arcadi; Bhattacharya, Shomi S; Borrego, Salud; Santoyo-López, Javier; Antiñolo, Guillermo
2016-05-01
Recent results from large-scale genomic projects suggest that allele frequencies, which are highly relevant for medical purposes, differ considerably across different populations. The need for a detailed catalog of local variability motivated the whole-exome sequencing of 267 unrelated individuals, representative of the healthy Spanish population. Like in other studies, a considerable number of rare variants were found (almost one-third of the described variants). There were also relevant differences in allelic frequencies in polymorphic variants, including ∼10,000 polymorphisms private to the Spanish population. The allelic frequencies of variants conferring susceptibility to complex diseases (including cancer, schizophrenia, Alzheimer disease, type 2 diabetes, and other pathologies) were overall similar to those of other populations. However, the trend is the opposite for variants linked to Mendelian and rare diseases (including several retinal degenerative dystrophies and cardiomyopathies) that show marked frequency differences between populations. Interestingly, a correspondence between differences in allelic frequencies and disease prevalence was found, highlighting the relevance of frequency differences in disease risk. These differences are also observed in variants that disrupt known drug binding sites, suggesting an important role for local variability in population-specific drug resistances or adverse effects. We have made the Spanish population variant server web page that contains population frequency information for the complete list of 170,888 variant positions we found publicly available (http://spv.babelomics.org/), We show that it if fundamental to determine population-specific variant frequencies to distinguish real disease associations from population-specific polymorphisms. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Jaligot, E; Beulé, T; Baurens, F-C; Billotte, N; Rival, A
2004-02-01
The methylation-sensitive amplification polymorphism (MSAP) technique has been employed on somatic embryo-derived oil palms (Elaeis guineensis Jacq.) to identify methylation polymorphisms correlated with the "mantled" somaclonal variation. The variant phenotype displays an unstable feminization of male organs in both male and female flowers. Using MSAP, the methylation status of CCGG sites was compared in three normal versus three mantled regenerants sampled in clonal populations obtained through somatic embryogenesis from four genotypically distinct mother palms. Overall, 64 selective primer combinations were used and they have amplified 23 markers exhibiting a differential methylation pattern between the two phenotypes. Our results indicate that CCGG sites are poorly affected by the considerable decrease in global DNA methylation that has been previously associated with the mantled phenotype. Each of the 23 markers isolated in the present study could discriminate between the two phenotypes only when they were from the same genetic origin. This result hampers at the moment the direct use of MSAP markers for the early detection of variants, even though valuable information on putative target sequences will be obtained from a further characterization of these polymorphic markers.
Functional analysis of regulatory single-nucleotide polymorphisms.
Pampín, Sandra; Rodríguez-Rey, José C
2007-04-01
The identification of regulatory polymorphisms has become a key problem in human genetics. In the past few years there has been a conceptual change in the way in which regulatory single-nucleotide polymorphisms are studied. We revise the new approaches and discuss how gene expression studies can contribute to a better knowledge of the genetics of common diseases. New techniques for the association of single-nucleotide polymorphisms with changes in gene expression have been recently developed. This, together with a more comprehensive use of the old in-vitro methods, has produced a great amount of genetic information. When added to current databases, it will help to design better tools for the detection of regulatory single-nucleotide polymorphisms. The identification of functional regulatory single-nucleotide polymorphisms cannot be done by the simple inspection of DNA sequence. In-vivo techniques, based on primer-extension, and the more recently developed 'haploChIP' allow the association of gene variants to changes in gene expression. Gene expression analysis by conventional in-vitro techniques is the only way to identify the functional consequences of regulatory single-nucleotide polymorphisms. The amount of information produced in the last few years will help to refine the tools for the future analysis of regulatory gene variants.
A global reference for human genetic variation
2016-01-01
The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies. PMID:26432245
Schmidt, S; Pericak-Vance, M A; Sawcer, S; Barcellos, L F; Hart, J; Sims, J; Prokop, A M; van der Walt, J; DeLoa, C; Lincoln, R R; Oksenberg, J R; Compston, A; Hauser, S L; Haines, J L; Gregory, S G
2006-07-01
Discrepant findings have been reported regarding an association of the apolipoprotein E (APOE) gene with the clinical course of multiple sclerosis (MS). To resolve these discrepancies, we examined common sequence variation in six candidate genes residing in a 380-kb genomic region surrounding and including the APOE locus for an association with MS severity. We genotyped at least three polymorphisms in each of six candidate genes in 1,540 Caucasian MS families (729 single-case and multiple-case families from the United States, 811 single-case families from the UK). By applying the quantitative transmission/disequilibrium test to a recently proposed MS severity score, the only statistically significant (P=0.003) association with MS severity was found for an intronic variant in the Herpes Virus Entry Mediator-B Gene PVRL2. Additional genotyping extended the association to a 16.6 kb block spanning intron 1 to intron 2 of the gene. Sequencing of PVRL2 failed to identify variants with an obvious functional role. In conclusion, the analysis of a very large data set suggests that genetic polymorphisms in PVRL2 may influence MS severity and supports the possibility that viral factors may contribute to the clinical course of MS, consistent with previous reports.
From genomics to functional markers in the era of next-generation sequencing.
Salgotra, R K; Gupta, B B; Stewart, C N
2014-03-01
The availability of complete genome sequences, along with other genomic resources for Arabidopsis, rice, pigeon pea, soybean and other crops, has revolutionized our understanding of the genetic make-up of plants. Next-generation DNA sequencing (NGS) has facilitated single nucleotide polymorphism discovery in plants. Functionally-characterized sequences can be identified and functional markers (FMs) for important traits can be developed at an ever-increasing ease. FMs are derived from sequence polymorphisms found in allelic variants of a functional gene. Linkage disequilibrium-based association mapping and homologous recombinants have been developed for identification of "perfect" markers for their use in crop improvement practices. Compared with many other molecular markers, FMs derived from the functionally characterized sequence genes using NGS techniques and their use provide opportunities to develop high-yielding plant genotypes resistant to various stresses at a fast pace.
Goettel, Wolfgang; Xia, Eric; Upchurch, Robert; Wang, Ming-Li; Chen, Pengyin; An, Yong-Qiang Charles
2014-04-23
Variation in seed oil composition and content among soybean varieties is largely attributed to differences in transcript sequences and/or transcript accumulation of oil production related genes in seeds. Discovery and analysis of sequence and expression variations in these genes will accelerate soybean oil quality improvement. In an effort to identify these variations, we sequenced the transcriptomes of soybean seeds from nine lines varying in oil composition and/or total oil content. Our results showed that 69,338 distinct transcripts from 32,885 annotated genes were expressed in seeds. A total of 8,037 transcript expression polymorphisms and 50,485 transcript sequence polymorphisms (48,792 SNPs and 1,693 small Indels) were identified among the lines. Effects of the transcript polymorphisms on their encoded protein sequences and functions were predicted. The studies also provided independent evidence that the lack of FAD2-1A gene activity and a non-synonymous SNP in the coding sequence of FAB2C caused elevated oleic acid and stearic acid levels in soybean lines M23 and FAM94-41, respectively. As a proof-of-concept, we developed an integrated RNA-seq and bioinformatics approach to identify and functionally annotate transcript polymorphisms, and demonstrated its high effectiveness for discovery of genetic and transcript variations that result in altered oil quality traits. The collection of transcript polymorphisms coupled with their predicted functional effects will be a valuable asset for further discovery of genes, gene variants, and functional markers to improve soybean oil quality.
Huang, Zhen; Peng, Gary; Liu, Xunjia; Deora, Abhinandan; Falk, Kevin C.; Gossen, Bruce D.; McDonald, Mary R.; Yu, Fengqun
2017-01-01
Clubroot, caused by Plasmodiophora brassicae, is an important disease of canola (Brassica napus) in western Canada and worldwide. In this study, a clubroot resistance gene (Rcr2) was identified and fine mapped in Chinese cabbage cv. “Jazz” using single-nucleotide polymorphisms (SNP) markers identified from bulked segregant RNA sequencing (BSR-Seq) and molecular markers were developed for use in marker assisted selection. In total, 203.9 million raw reads were generated from one pooled resistant (R) and one pooled susceptible (S) sample, and >173,000 polymorphic SNP sites were identified between the R and S samples. One significant peak was observed between 22 and 26 Mb of chromosome A03, which had been predicted by BSR-Seq to contain the causal gene Rcr2. There were 490 polymorphic SNP sites identified in the region. A segregating population consisting of 675 plants was analyzed with 15 SNP sites in the region using the Kompetitive Allele Specific PCR method, and Rcr2 was fine mapped between two SNP markers, SNP_A03_32 and SNP_A03_67 with 0.1 and 0.3 cM from Rcr2, respectively. Five SNP markers co-segregated with Rcr2 in this region. Variants were identified in 14 of 36 genes annotated in the Rcr2 target region. The numbers of poly variants differed among the genes. Four genes encode TIR-NBS-LRR proteins and two of them Bra019410 and Bra019413, had high numbers of polymorphic variants and so are the most likely candidates of Rcr2. PMID:28894454
Efficient analysis of mouse genome sequences reveal many nonsense variants
Steeland, Sophie; Timmermans, Steven; Van Ryckeghem, Sara; Hulpiau, Paco; Saeys, Yvan; Van Montagu, Marc; Vandenbroucke, Roosmarijn E.; Libert, Claude
2016-01-01
Genetic polymorphisms in coding genes play an important role when using mouse inbred strains as research models. They have been shown to influence research results, explain phenotypical differences between inbred strains, and increase the amount of interesting gene variants present in the many available inbred lines. SPRET/Ei is an inbred strain derived from Mus spretus that has ∼1% sequence difference with the C57BL/6J reference genome. We obtained a listing of all SNPs and insertions/deletions (indels) present in SPRET/Ei from the Mouse Genomes Project (Wellcome Trust Sanger Institute) and processed these data to obtain an overview of all transcripts having nonsynonymous coding sequence variants. We identified 8,883 unique variants affecting 10,096 different transcripts from 6,328 protein-coding genes, which is about 28% of all coding genes. Because only a subset of these variants results in drastic changes in proteins, we focused on variations that are nonsense mutations that ultimately resulted in a gain of a stop codon. These genes were identified by in silico changing the C57BL/6J coding sequences to the SPRET/Ei sequences, converting them to amino acid (AA) sequences, and comparing the AA sequences. All variants and transcripts affected were also stored in a database, which can be browsed using a SPRET/Ei M. spretus variants web tool (www.spretus.org), including a manual. We validated the tool by demonstrating the loss of function of three proteins predicted to be severely truncated, namely Fas, IRAK2, and IFNγR1. PMID:27147605
Identification of Blastocystis Subtype 1 Variants in the Home for Girls, Bangkok, Thailand
Thathaisong, Umaporn; Siripattanapipong, Suradej; Mungthin, Mathirut; Pipatsatitpong, Duangnate; Tan-ariya, Peerapan; Naaglor, Tawee; Leelayoova, Saovanee
2013-01-01
A cross-sectional study of Blastocystis infection was conducted to evaluate the prevalence, risk factors, and subtypes of Blastocystis at the Home for Girls, Bangkok, Thailand in November 2008. Of 370 stool samples, 118 (31.9%) were infected with Blastocystis. Genotypic characterization of Blastocystis was performed by polymerase chain reaction and sequence analysis of the partial small subunit ribosomal RNA (SSU rRNA) gene. Subtype 1 was the most predominant (94.8%), followed by subtype 6 (3.5%) and subtype 2 (1.7%). Sequence analyses revealed nucleotide polymorphisms for Blastocystis subtype 1, which were described as subtype 1/variant 1, subtype 1/variant 2. Blastocystis subtype 1/variant 1 was the most predominant infection occurring in almost every house. The results showed that subtype analysis of Blastocystis was useful for molecular epidemiological study. PMID:23166199
Wujcicka, Wioletta; Wilczyński, Jan; Nowakowska, Dorota
2017-09-01
The research was conducted to evaluate the role of genotypes, haplotypes and multiple-SNP variants in the range of TLR2, TLR4 and TLR9 single nucleotide polymorphisms (SNPs) in the development of Toxoplasma gondii infection among Polish pregnant women. The study was performed for 116 Polish pregnant women, including 51 patients infected with T. gondii, and 65 age-matched control pregnant individuals. Genotypes in TLR2 2258 G>A, TLR4 896 A>G, TLR4 1196 C>T and TLR9 2848 G>A SNPs were estimated by self-designed, nested PCR-RFLP assays. Randomly selected PCR products, representative for distinct genotypes in the studied polymorphisms, were confirmed by sequencing. All the genotypes were calculated for Hardy-Weinberg (H-W) equilibrium and TLR4 variants were tested for linkage disequilibrium. Relationships were assessed between alleles, genotypes, haplotypes or multiple-SNP variants in TLR polymorphisms and the occurrence of T. gondii infection in pregnant women, using a logistic regression model. All the analyzed genotypes preserved the H-W equilibrium among the studied groups of patients (P>0.050). Similar distribution of distinct alleles and individual genotypes in TLR SNPs, as well as of haplotypes in TLR4 polymorphisms, were observed in T. gondii infected and control uninfected pregnant women. However, the GACG multiple-SNP variant, within the range of all the four studied polymorphisms, was correlated with a decreased risk of the parasitic infection (OR 0.52, 95% CI 0.28-0.97; P≤0.050). The polymorphisms, located within TLR2, TLR4 and TLR9 genes, may be involved together in occurrence of T. gondii infection among Polish pregnant women. Copyright © 2017 Medical University of Bialystok. Published by Elsevier B.V. All rights reserved.
Albrechtsen, A; Grarup, N; Li, Y; Sparsø, T; Tian, G; Cao, H; Jiang, T; Kim, S Y; Korneliussen, T; Li, Q; Nie, C; Wu, R; Skotte, L; Morris, A P; Ladenvall, C; Cauchi, S; Stančáková, A; Andersen, G; Astrup, A; Banasik, K; Bennett, A J; Bolund, L; Charpentier, G; Chen, Y; Dekker, J M; Doney, A S F; Dorkhan, M; Forsen, T; Frayling, T M; Groves, C J; Gui, Y; Hallmans, G; Hattersley, A T; He, K; Hitman, G A; Holmkvist, J; Huang, S; Jiang, H; Jin, X; Justesen, J M; Kristiansen, K; Kuusisto, J; Lajer, M; Lantieri, O; Li, W; Liang, H; Liao, Q; Liu, X; Ma, T; Ma, X; Manijak, M P; Marre, M; Mokrosiński, J; Morris, A D; Mu, B; Nielsen, A A; Nijpels, G; Nilsson, P; Palmer, C N A; Rayner, N W; Renström, F; Ribel-Madsen, R; Robertson, N; Rolandsson, O; Rossing, P; Schwartz, T W; Slagboom, P E; Sterner, M; Tang, M; Tarnow, L; Tuomi, T; van't Riet, E; van Leeuwen, N; Varga, T V; Vestmar, M A; Walker, M; Wang, B; Wang, Y; Wu, H; Xi, F; Yengo, L; Yu, C; Zhang, X; Zhang, J; Zhang, Q; Zhang, W; Zheng, H; Zhou, Y; Altshuler, D; 't Hart, L M; Franks, P W; Balkau, B; Froguel, P; McCarthy, M I; Laakso, M; Groop, L; Christensen, C; Brandslund, I; Lauritzen, T; Witte, D R; Linneberg, A; Jørgensen, T; Hansen, T; Wang, J; Nielsen, R; Pedersen, O
2013-02-01
Human complex metabolic traits are in part regulated by genetic determinants. Here we applied exome sequencing to identify novel associations of coding polymorphisms at minor allele frequencies (MAFs) >1% with common metabolic phenotypes. The study comprised three stages. We performed medium-depth (8×) whole exome sequencing in 1,000 cases with type 2 diabetes, BMI >27.5 kg/m(2) and hypertension and in 1,000 controls (stage 1). We selected 16,192 polymorphisms nominally associated (p < 0.05) with case-control status, from four selected annotation categories or from loci reported to associate with metabolic traits. These variants were genotyped in 15,989 Danes to search for association with 12 metabolic phenotypes (stage 2). In stage 3, polymorphisms showing potential associations were genotyped in a further 63,896 Europeans. Exome sequencing identified 70,182 polymorphisms with MAF >1%. In stage 2 we identified 51 potential associations with one or more of eight metabolic phenotypes covered by 45 unique polymorphisms. In meta-analyses of stage 2 and stage 3 results, we demonstrated robust associations for coding polymorphisms in CD300LG (fasting HDL-cholesterol: MAF 3.5%, p = 8.5 × 10(-14)), COBLL1 (type 2 diabetes: MAF 12.5%, OR 0.88, p = 1.2 × 10(-11)) and MACF1 (type 2 diabetes: MAF 23.4%, OR 1.10, p = 8.2 × 10(-10)). We applied exome sequencing as a basis for finding genetic determinants of metabolic traits and show the existence of low-frequency and common coding polymorphisms with impact on common metabolic traits. Based on our study, coding polymorphisms with MAF above 1% do not seem to have particularly high effect sizes on the measured metabolic traits.
Sequence variants of Toll-like receptor 4 and susceptibility to prostate cancer.
Chen, Yen-Ching; Giovannucci, Edward; Lazarus, Ross; Kraft, Peter; Ketkar, Shamika; Hunter, David J
2005-12-15
Chronic inflammation has been hypothesized to be a risk factor for prostate cancer. The Toll-like receptor 4 (TLR4) presents the bacterial lipopolysaccharide (LPS), which interacts with ligand-binding protein and CD14 (LPS receptor) and activates expression of inflammatory genes through nuclear factor-kappaB and mitogen-activated protein kinase signaling. A previous case-control study found a modest association of a polymorphism in the TLR4 gene [11381G/C, GG versus GC/CC: odds ratio (OR), 1.26] with risk of prostate cancer. We assessed if sequence variants of TLR4 were associated with the risk of prostate cancer. In a nested case-control design within the Health Professionals Follow-up Study, we identified 700 participants with prostate cancer diagnosed after they had provided a blood specimen in 1993 and before January 2000. Controls were 700 age-matched men without prostate cancer who had had a prostate-specific antigen test after providing a blood specimen. We genotyped 16 common (>5%) single nucleotide polymorphisms (SNP) discovered in a resequencing study spanning TLR4 to test for association between sequence variation in TLR4 and prostate cancer. Homozygosity for the variant alleles of eight SNPs was associated with a statistically significantly lower risk of prostate cancer (TLR4_1893, TLR4_2032, TLR4_2437, TLR4_7764, TLR4_11912, TLR4_16649, TLR4_17050, and TLR4_17923), but the TLR4_15844 polymorphism corresponding to 11381G/C was not associated with prostate cancer (GG versus CG/CC: OR, 1.01; 95% confidence interval, 0.79-1.29). Six common haplotypes (cumulative frequency, 81%) were observed; the global test for association between haplotypes and prostate cancer was statistically significant (chi(2) = 14.8 on 6 degrees of freedom; P = 0.02). Two common haplotypes were statistically significantly associated with altered risk of prostate cancer. Inherited polymorphisms of the innate immune gene TLR4 are associated with risk of prostate cancer.
Hu, Hao; Wienker, Thomas F; Musante, Luciana; Kalscheuer, Vera M; Kahrizi, Kimia; Najmabadi, Hossein; Ropers, H Hilger
2014-12-01
Next-generation sequencing has greatly accelerated the search for disease-causing defects, but even for experts the data analysis can be a major challenge. To facilitate the data processing in a clinical setting, we have developed a novel medical resequencing analysis pipeline (MERAP). MERAP assesses the quality of sequencing, and has optimized capacity for calling variants, including single-nucleotide variants, insertions and deletions, copy-number variation, and other structural variants. MERAP identifies polymorphic and known causal variants by filtering against public domain databases, and flags nonsynonymous and splice-site changes. MERAP uses a logistic model to estimate the causal likelihood of a given missense variant. MERAP considers the relevant information such as phenotype and interaction with known disease-causing genes. MERAP compares favorably with GATK, one of the widely used tools, because of its higher sensitivity for detecting indels, its easy installation, and its economical use of computational resources. Upon testing more than 1,200 individuals with mutations in known and novel disease genes, MERAP proved highly reliable, as illustrated here for five families with disease-causing variants. We believe that the clinical implementation of MERAP will expedite the diagnostic process of many disease-causing defects. © 2014 WILEY PERIODICALS, INC.
Tang, Z; Diamond, M A; Chen, J-M; Holly, T A; Bonow, R O; Dasgupta, A; Hyslop, T; Purzycki, A; Wagner, J; McNamara, D M; Kukulski, T; Wos, S; Velazquez, E J; Ardlie, K; Feldman, A M
2007-10-01
The goal of this experiment was to identify the presence of genetic variants in the adenosine receptor genes and assess their relationship to infarct size in a population of patients with ischemic cardiomyopathy. Adenosine receptors play an important role in protecting the heart during ischemia and in mediating the effects of ischemic preconditioning. We sequenced DNA samples from 273 individuals with ischemic cardiomyopathy and from 203 normal controls to identify the presence of genetic variants in the adenosine receptor genes. Subsequently, we analyzed the relationship between the identified genetic variants and infarct size, left ventricular size, and left ventricular function. Three variants in the 3'-untranslated region of the A(1)-adenosine gene (nt 1689 C/A, nt 2206 Tdel, nt 2683del36) and an informative polymorphism in the coding region of the A3-adenosine gene (nt 1509 A/C I248L) were associated with changes in infarct size. These results suggest that genetic variants in the adenosine receptor genes may predict the heart's response to ischemia or injury and might also influence an individual's response to adenosine therapy.
Genetic variations in NADPH-CYP450 oxidoreductase in a Czech Slavic cohort
Tomková, Mária; Panda, Satya Prakash; Šeda, Ondřej; Baxová, Alice; Hůlková, Martina; Masters, Bettie Sue Siler; Martásek, Pavel
2015-01-01
Background Gene polymorphisms encoding the enzyme NADPH–cytochrome P450 oxidoreductase (POR) contribute to inter-individual differences in drug response. Aim To estimate polymorphic allele frequencies of the POR gene in a Czech Slavic population. Materials & Methods The gene POR was analyzed in 322 Czech Slavic individuals from a control cohort by sequencing and HRM analysis. Results Twenty-five SNP genetic variations were identified. Of these variants, 7 were new, unreported SNPs, including two SNPs in the 5´flanking region (g.4965 C>T and g.4994 G>T), one intronic variant (c.1899 −20C>T), one synonymous SNP (p.20Ala=) and three nonsynonymous SNPs (p.Thr29Ser, p.Pro384Leu and p.Thr529Met). The p.Pro384Leu variant exhibited reduced enzymatic activities compared to wild type. Conclusion New POR variant identification indicates that the number of uncommon variants might be specific for each subpopulation being investigated, particularly germane to the singular role that POR plays in providing reducing equivalents to all CYPs in the endoplasmic reticulum. PMID:25712184
Pausch, Hubert; Wurmser, Christine; Reinhardt, Friedrich; Emmerling, Reiner; Fries, Ruedi
2015-06-01
Most association studies for pinpointing trait-associated variants are performed within breed. The availability of sequence data from key ancestors of several cattle breeds now enables immediate assessment of the frequency of trait-associated variants in populations different from the mapping population and their imputation into large validation populations. The objective of this study was to validate the effects of 4 putatively causative variants on milk production traits, male fertility, and stature in German Fleckvieh and Holstein-Friesian animals using targeted sequence imputation. We used whole-genome sequence data of 456 animals to impute 4 missense mutations in DGAT1, GHR, PRLR, and PROP1 into 10,363 Fleckvieh and 8,812 Holstein animals. The accuracy of the imputed genotypes exceeded 95% for all variants. Association testing with imputed variants revealed consistent antagonistic effects of the DGAT1 p.A232K and GHR p.F279Y variants on milk yield and protein and fat contents, respectively, in both breeds. The allele frequency of both polymorphisms has changed considerably in the past 20 yr, indicating that they were targets of recent selection for milk production traits. The PRLR p.S18N variant was associated with yield traits in Fleckvieh but not in Holstein, suggesting that it may be in linkage disequilibrium with a mutation affecting yield traits rather than being causal. The reported effects of the PROP1 p.H173R variant on milk production, male fertility, and stature could not be confirmed. Our results demonstrate that population-wide imputation of candidate causal variants from sequence data is feasible, enabling their rapid validation in large independent populations. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Simmons, Sheri L; Dibartolo, Genevieve; Denef, Vincent J; Goltsman, Daniela S Aliaga; Thelen, Michael P; Banfield, Jillian F
2008-07-22
Deeply sampled community genomic (metagenomic) datasets enable comprehensive analysis of heterogeneity in natural microbial populations. In this study, we used sequence data obtained from the dominant member of a low-diversity natural chemoautotrophic microbial community to determine how coexisting closely related individuals differ from each other in terms of gene sequence and gene content, and to uncover evidence of evolutionary processes that occur over short timescales. DNA sequence obtained from an acid mine drainage biofilm was reconstructed, taking into account the effects of strain variation, to generate a nearly complete genome tiling path for a Leptospirillum group II species closely related to L. ferriphilum (sampling depth approximately 20x). The population is dominated by one sequence type, yet we detected evidence for relatively abundant variants (>99.5% sequence identity to the dominant type) at multiple loci, and a few rare variants. Blocks of other Leptospirillum group II types ( approximately 94% sequence identity) have recombined into one or more variants. Variant blocks of both types are more numerous near the origin of replication. Heterogeneity in genetic potential within the population arises from localized variation in gene content, typically focused in integrated plasmid/phage-like regions. Some laterally transferred gene blocks encode physiologically important genes, including quorum-sensing genes of the LuxIR system. Overall, results suggest inter- and intrapopulation genetic exchange involving distinct parental genome types and implicate gain and loss of phage and plasmid genes in recent evolution of this Leptospirillum group II population. Population genetic analyses of single nucleotide polymorphisms indicate variation between closely related strains is not maintained by positive selection, suggesting that these regions do not represent adaptive differences between strains. Thus, the most likely explanation for the observed patterns of polymorphism is divergence of ancestral strains due to geographic isolation, followed by mixing and subsequent recombination.
Denef, Vincent J; Goltsman, Daniela S. Aliaga; Thelen, Michael P; Banfield, Jillian F
2008-01-01
Deeply sampled community genomic (metagenomic) datasets enable comprehensive analysis of heterogeneity in natural microbial populations. In this study, we used sequence data obtained from the dominant member of a low-diversity natural chemoautotrophic microbial community to determine how coexisting closely related individuals differ from each other in terms of gene sequence and gene content, and to uncover evidence of evolutionary processes that occur over short timescales. DNA sequence obtained from an acid mine drainage biofilm was reconstructed, taking into account the effects of strain variation, to generate a nearly complete genome tiling path for a Leptospirillum group II species closely related to L. ferriphilum (sampling depth ∼20×). The population is dominated by one sequence type, yet we detected evidence for relatively abundant variants (>99.5% sequence identity to the dominant type) at multiple loci, and a few rare variants. Blocks of other Leptospirillum group II types (∼94% sequence identity) have recombined into one or more variants. Variant blocks of both types are more numerous near the origin of replication. Heterogeneity in genetic potential within the population arises from localized variation in gene content, typically focused in integrated plasmid/phage-like regions. Some laterally transferred gene blocks encode physiologically important genes, including quorum-sensing genes of the LuxIR system. Overall, results suggest inter- and intrapopulation genetic exchange involving distinct parental genome types and implicate gain and loss of phage and plasmid genes in recent evolution of this Leptospirillum group II population. Population genetic analyses of single nucleotide polymorphisms indicate variation between closely related strains is not maintained by positive selection, suggesting that these regions do not represent adaptive differences between strains. Thus, the most likely explanation for the observed patterns of polymorphism is divergence of ancestral strains due to geographic isolation, followed by mixing and subsequent recombination. PMID:18651792
Abu-Farha, Mohamed; Melhem, Motasem; Abubaker, Jehad; Behbehani, Kazem; Alsmadi, Osama; Elkum, Naser
2016-02-11
ANGPTL8 (betatrophin) has been recently identified as a regulator of lipid metabolism through its interaction with ANGPTL3. A sequence variant in ANGPTL8 has been shown to associate with lower level of Low Density Lipoprotein (LDL) and High Density Lipoprotein (HDL). The objective of this study is to identify sequence variants in ANGPTL8 gene in Arabs and investigate their association with ANGPTL8 plasma level and clinical parameters. A cross sectional study was designed to examine the level of ANGPTL8 in 283 non-diabetic Arabs, and to identify its sequence variants using Sanger sequencing and their association with various clinical parameters. Using Sanger sequencing, we sequenced the full ANGPTL8 gene in 283 Arabs identifying two single nucleotide polymorphisms (SNPs) Rs.892066 and Rs.2278426 in the coding region. Our data shows for the first time that Arabs with the heterozygote form of (c.194C > T Rs.2278426) had higher level of Fasting Blood Glucose (FBG) compared to the CC homozygotes. LDL and HDL level in these subjects did not show significant difference between the two subgroups. Circulation level of ANGPTL8 did not vary between the two forms. No significant changes were observed between the various forms of Rs.892066 variant and FBG, LDL or HDL. Our data shows for the first time that heterozygote form of ANGPTL8 Rs.2278426 variant was associated with higher FBG level in Arabs highlighting the importance of these variants in controlling the function of betatrophin.
Ramasamy, Ranjith; Bakırcıoğlu, M Emre; Cengiz, Cenk; Karaca, Ender; Scovell, Jason; Jhangiani, Shalini N; Akdemir, Zeynep C; Bainbridge, Matthew; Yu, Yao; Huff, Chad; Gibbs, Richard A; Lupski, James R; Lamb, Dolores J
2015-08-01
To investigate the genetic cause of nonobstructive azoospermia (NOA) in a consanguineous Turkish family through homozygosity mapping followed by targeted exon/whole-exome sequencing to identify genetic variations. Whole-exome sequencing (WES). Research laboratory. Two siblings in a consanguineous family with NOA. Validating all variants passing filter criteria with Sanger sequencing to confirm familial segregation and absence in the control population. Discovery of a mutation that could potentially cause NOA. A novel nonsynonymous mutation in the neuronal PAS-2 domain (NPAS2) was identified in a consanguineous family from Turkey. This mutation in exon 14 (chr2: 101592000 C>G) of NPAS2 is likely a disease-causing mutation as it is predicted to be damaging, it is a novel variant, and it segregates with the disease. Family segregation of the variants showed the presence of the homozygous mutation in the three brothers with NOA and a heterozygous mutation in the mother as well as one brother and one sister who were both fertile. The mutation is not found in the single-nucleotide polymorphism database, the 1000 Genomes Project, the Baylor College of Medicine cohort of 500 Turkish patients (not a population-specific polymorphism), or the matching 50 fertile controls. With the use of WES we identified a novel homozygous mutation in NPAS2 as a likely disease-causing variant in a Turkish family diagnosed with NOA. Our data reinforce the clinical role of WES in the molecular diagnosis of highly heterogeneous genetic diseases for which conventional genetic approaches have previously failed to find a molecular diagnosis. Copyright © 2015 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Molecular characterization of a Toxocara variant from cats in Kuala Lumpur, Malaysia.
Zhu, X Q; Jacobs, D E; Chilton, N B; Sani, R A; Cheng, N A; Gasser, R B
1998-08-01
The ascaridoid nematode of cats from Kuala Lumpur, Malaysia, previously identified morphologically as Toxocara canis, was characterized using a molecular approach. The nuclear ribosomal DNA (rDNA) region spanning the first internal transcribed spacer (ITS-1), the 5.8S gene and the second internal transcribed spacer (ITS-2) was amplified and sequenced. The sequences for the parasite from Malaysian cats were compared with those for T. canis and T. cati. The sequence data showed that this taxon was genetically more similar to T. cati than to T. canis in the ITS-1, 5.8S and ITS-2. Differences in the ITS-1 and ITS-2 sequences between the taxa (9.4-26.1%) were markedly higher than variation between samples within T. canis and T. cati (0-2.9%). The sequence data demonstrate that the parasite from Malaysian cats is neither T. canis nor T. cati and indicate that it is a distinct species. Based on these data, PCR-linked restriction fragment length polymorphism (RFLP) and single-strand conformation polymorphism (SSCP) methods were employed for the unequivocal differentiation of the Toxocara variant from T. canis and T. cati. These methods should provide valuable tools for studying the life-cycle, transmission pattern(s) and zoonotic potential of this parasite.
Woods, D E; Edge, M D; Colten, H R
1984-01-01
Complementary DNA (cDNA) clones corresponding to the major histocompatibility (MHC) class III antigen, complement protein C2, have been isolated from human liver cDNA libraries with the use of a complex mixture of synthetic oligonucleotides (17 mer) that contains 576 different oligonucleotide sequences. The C2 cDNA were used to identify a DNA restriction enzyme fragment length polymorphism that provides a genetic marker within the MHC that was not detectable at the protein level. An extensive search for genomic polymorphisms using a cDNA clone for another MHC class III gene, factor B, failed to reveal any DNA variants. The genomic variants detected with the C2 cDNA probe provide an additional genetic marker for analysis of MHC-linked diseases. Images PMID:6086718
Androgen Receptor Gene Polymorphisms and Alterations in Prostate Cancer: Of Humanized Mice and Men
Robins, Diane M.
2011-01-01
Germline polymorphisms and somatic mutations of the androgen receptor (AR) have been intensely investigated in prostate cancer but even with genomic approaches their impact remains controversial. To assess the functional significance of AR genetic variation, we converted the mouse gene to the human sequence by germline recombination and engineered alleles to query the role of a polymorphic glutamine (Q) tract implicated in cancer risk. In a prostate cancer model, AR Q tract length influences progression and castration response. Mutation profiling in mice provides direct evidence that somatic AR variants are selected by therapy, a finding validated in human metastases from distinct treatment groups. Mutant ARs exploit multiple mechanisms to resist hormone ablation, including alterations in ligand specificity, target gene selectivity, chaperone interaction and nuclear localization. Regardless of their frequency, these variants permute normal function to reveal novel means to target wild type AR and its key interacting partners. PMID:21689727
Han, Soo-Jin; Marshall, Vickie; Barsov, Eugene; Quiñones, Octavio; Ray, Alex; Labo, Nazzarena; Trivett, Matthew; Ott, David; Renne, Rolf
2013-01-01
Kaposi's sarcoma-associated herpesvirus (KSHV) encodes 12 pre-microRNAs that can produce 25 KSHV mature microRNAs. We previously reported single-nucleotide polymorphisms (SNPs) in KSHV-encoded pre-microRNA and mature microRNA sequences from clinical samples (V. Marshall et al., J. Infect. Dis., 195:645–659, 2007). To determine whether microRNA SNPs affect pre-microRNA processing and, ultimately, mature microRNA expression levels, we performed a detailed comparative analysis of (i) mature microRNA expression levels, (ii) in vitro Drosha/Dicer processing, and (iii) RNA-induced silencing complex-dependent targeting of wild-type (wt) and variant microRNA genes. Expression of pairs of wt and variant pre-microRNAs from retroviral vectors and measurement of KSHV mature microRNA expression by real-time reverse transcription-PCR (RT-PCR) revealed differential expression levels that correlated with the presence of specific sequence polymorphisms. Measurement of KSHV mature microRNA expression in a panel of primary effusion lymphoma cell lines by real-time RT-PCR recapitulated some observed expression differences but suggested a more complex relationship between sequence differences and expression of mature microRNA. Furthermore, in vitro maturation assays demonstrated significant SNP-associated changes in Drosha/DGCR8 and/or Dicer processing. These data demonstrate that SNPs within KSHV-encoded pre-microRNAs are associated with differential microRNA expression levels. Given the multiple reports on the involvement of microRNAs in cancer, the biological significance of these phenotypic and genotypic variants merits further studies in patients with KSHV-associated malignancies. PMID:24006441
2014-01-01
Background Variation in seed oil composition and content among soybean varieties is largely attributed to differences in transcript sequences and/or transcript accumulation of oil production related genes in seeds. Discovery and analysis of sequence and expression variations in these genes will accelerate soybean oil quality improvement. Results In an effort to identify these variations, we sequenced the transcriptomes of soybean seeds from nine lines varying in oil composition and/or total oil content. Our results showed that 69,338 distinct transcripts from 32,885 annotated genes were expressed in seeds. A total of 8,037 transcript expression polymorphisms and 50,485 transcript sequence polymorphisms (48,792 SNPs and 1,693 small Indels) were identified among the lines. Effects of the transcript polymorphisms on their encoded protein sequences and functions were predicted. The studies also provided independent evidence that the lack of FAD2-1A gene activity and a non-synonymous SNP in the coding sequence of FAB2C caused elevated oleic acid and stearic acid levels in soybean lines M23 and FAM94-41, respectively. Conclusions As a proof-of-concept, we developed an integrated RNA-seq and bioinformatics approach to identify and functionally annotate transcript polymorphisms, and demonstrated its high effectiveness for discovery of genetic and transcript variations that result in altered oil quality traits. The collection of transcript polymorphisms coupled with their predicted functional effects will be a valuable asset for further discovery of genes, gene variants, and functional markers to improve soybean oil quality. PMID:24755115
Zhou, Jie; Kherani, Femida; Bardakjian, Tanya M.; Katowitz, James; Hughes, Nkecha; Schimmenti, Lisa A.; Schneider, Adele
2008-01-01
Purpose Mutations in the SOX2 and CHX10 genes have been reported in patients with anophthalmia and/or microphthalmia. In this study, we evaluated 34 anophthalmic/microphthalmic patient DNA samples (two sets of siblings included) for mutations and sequence variants in SOX2 and CHX10. Methods Conformational sensitive gel electrophoresis (CSGE) was used for the initial SOX2 and CHX10 screening of 34 affected individuals (two sets of siblings), five unaffected family members, and 80 healthy controls. Patient samples containing heteroduplexes were selected for sequence analysis. Base pair changes in SOX2 and CHX10 were confirmed by sequencing bidirectionally in patient samples. Results Two novel heterozygous mutations and two sequence variants (one known) in SOX2 were identified in this cohort. Mutation c.310 G>T (p. Glu104X), found in one patient, was in the region encoding the high mobility group (HMG) DNA-binding domain and resulted in a change from glutamic acid to a stop codon. The second mutation, noted in two affected siblings, was a single nucleotide deletion c.549delC (p. Pro184ArgfsX19) in the region encoding the activation domain, resulting in a frameshift and premature termination of the coding sequence. The shortened protein products may result in the loss of function. In addition, a novel nucleotide substitution c.*557G>A was identified in the 3′-untranslated region in one patient. The relationship between the nucleotide change and the protein function is indeterminate. A known single nucleotide polymorphism (c. *469 C>A, SNP rs11915160) was also detected in 2 of the 34 patients. Screening of CHX10 identified two synonymous sequence variants, c.471 C>T (p.Ser157Ser, rs35435463) and c.579 G>A (p. Gln193Gln, novel SNP), and one non-synonymous sequence variant, c.871 G>A (p. Asp291Asn, novel SNP). The non-synonymous polymorphism was also present in healthy controls, suggesting non-causality. Conclusions These results support the role of SOX2 in ocular development. Loss of SOX2 function results in severe eye malformation. CHX10 was not implicated with microphthalmia/anophthalmia in our patient cohort. PMID:18385794
Zheng, Yanying; Liu, Li; Sun, Yi; Chen, Jie; Wang, Jianrong; Zhu, Changle; Lai, Rensheng; Xie, Ling
2016-07-30
BAT-26 is one of the representative markers for microsatellite instability evaluation and presents different polymorphisms in different ethnic populations. The current knowledge of its comparative polymorphism between healthy individuals and cancer patients in the Chinese population is insufficient. This study aims to analyze germline polymorphic variations of BAT-26 between healthy individuals and cancer patients in Chinese from Jiangsu province and the associated cancer risk implications. The various BAT-26 alleles and their percentages in cervical cells from 500 healthy women were assessed by direct sequencing. Twenty of these samples were also analyzed by fragment analysis. BAT-26 of blood DNA from 24 healthy individuals and 247 cancer patients was analyzed by fragment analysis. Compared with the sequencing results, 122.6-122.9 bp, 123.4-123.8 bp and 124.1-124.8 bp corresponded to the A25, A26 and A27 alleles, respectively. The 524 healthy individuals showed 4.58%, 92.18% and 3.24% of A25, A26 and A27, respectively. The variant alleles A18, A24, A28, A29 and A32 were only found in cancer patients, accounting for 0.81%, 0.40%, 0.40%, 0.40% and 0.40%, respectively; the A25, A26 and A27 alleles in cancer patients accounted for 6.48%, 77.33% and 13.77%. Healthy individuals had a stable BAT-26 profile within the quasimonomorphic variation range (QMVR), but cancer patients harbored variant alleles outside QMVR and showed a trend from quasimonomorph to polymonomorph, suggesting that variant alleles of BAT-26 in germline cells may be regarded as a potential marker of higher cancer risk in the Chinese population from Jiangsu province.
Flanagan, Jonathan M.; Vege, Sunitha; Luban, Naomi L. C.; Brown, R. Clark; Ware, Russell E.; Westhoff, Connie M.
2017-01-01
RH genes are highly polymorphic and encode the most complex of the 35 human blood group systems. This genetic diversity contributes to Rh alloimmunization in patients with sickle cell anemia (SCA) and is not avoided by serologic Rh-matched red cell transfusions. Standard serologic testing does not distinguish variant Rh antigens. Single nucleotide polymorphism (SNP)–based DNA arrays detect many RHD and RHCE variants, but the number of alleles tested is limited. We explored a next-generation sequencing (NGS) approach using whole-exome sequencing (WES) in 27 Rh alloimmunized and 27 matched non-alloimmunized patients with SCA who received chronic red cell transfusions and were enrolled in a multicenter study. We demonstrate that WES provides a comprehensive RH genotype, identifies SNPs not interrogated by DNA array, and accurately determines RHD zygosity. Among this multicenter cohort, we demonstrate an association between an altered RH genotype and Rh alloimmunization: 52% of Rh immunized vs 19% of non-immunized patients expressed variant Rh without co-expression of the conventional protein. Our findings suggest that RH allele variation in patients with SCA is clinically relevant, and NGS technology can offer a comprehensive alternative to targeted SNP-based testing. This is particularly relevant as NGS data becomes more widely available and could provide the means for reducing Rh alloimmunization in children with SCA. PMID:29296782
Investigating intra-host and intra-herd sequence diversity of foot-and-mouth disease virus.
King, David J; Freimanis, Graham L; Orton, Richard J; Waters, Ryan A; Haydon, Daniel T; King, Donald P
2016-10-01
Due to the poor-fidelity of the enzymes involved in RNA genome replication, foot-and-mouth disease (FMD) virus samples comprise of unique polymorphic populations. In this study, deep sequencing was utilised to characterise the diversity of FMD virus (FMDV) populations in 6 infected cattle present on a single farm during the series of outbreaks in the UK in 2007. A novel RT-PCR method was developed to amplify a 7.6kb nucleotide fragment encompassing the polyprotein coding region of the FMDV genome. Illumina sequencing of each sample identified the fine polymorphic structures at each nucleotide position, from consensus level changes to variants present at a 0.24% frequency. These data were used to investigate population dynamics of FMDV at both herd and host levels, evaluate the impact of host on the viral swarm structure and to identify transmission links with viruses recovered from other farms in the same series of outbreaks. In 7 samples, from 6 different animals, a total of 5 consensus level variants were identified, in addition to 104 sub-consensus variants of which 22 were shared between 2 or more animals. Further analysis revealed differences in swarm structures from samples derived from the same animal suggesting the presence of distinct viral populations evolving independently at different lesion sites within the same infected animal. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lappalainen, J.; Dean, M.; Virkkunen, M.
1995-04-24
Abnormal brain serotonin function may be characteristic of several neuropsychiatric disorders. Thus, it is important to identify polymorphic genes and screen for functional variants at loci coding for genes that control normal serotonin functions. 5-HT{sub 1D{beta}} is a terminal serotonin autoreceptor which may play a role in regulating serotonin synthesis and release. Using an SSCP technique we screened for 5-HT{sub 1D{beta}} coding sequence variants in psychiatrically interviewed populations, which included controls, alcoholics, and alcoholic arsonists and alcoholic violent offenders with low CSF concentrations of the main serotonin metabolite 5-HIAA. A common polymorphism was identified in the 5-HT{sub 1D{beta}} gene withmore » allele frequencies of 0.72 and 0.28. The SSCP variant was caused by a silent G to C substitution at nucleotide 861 of the coding region. This polymorphism could also be detected as a HincII RFLP of amplified DNA. DNAs from informative CEPH families were typed for the HincII RFLP and analyzed with respect to 20 linked markers on chromosome 6. Multipoint analysis placed the 5-HT{sub 1D{beta}} receptor gene between markers D6S286 and D6S275. A maximum two-point lod score of 10.90 was obtained to D6S26, which had been previously localized on 6q14-15. Chromosomal aberrations involving this region have been previously shown to cause retinal anomalies, developmental delay, and abnormal brain development. This region also contains the gene for North Carolina-type macular dystrophy. 34 refs., 3 figs., 1 tab.« less
Novel human CRYGD rare variant in a Brazilian family with congenital cataract
Giordano, Gabriel Gorgone; Tavares, Anderson; da Silva, Márcio José; de Vasconcellos, José Paulo Cabral; Arieta, Carlos Eduardo Leite; de Melo, Mônica Barbosa
2011-01-01
Purpose To describe a novel polymorphism in the γD-crystallin (CRYGD) gene in a Brazilian family with congenital cataract. Methods A Brazilian four-generation family was analyzed. The proband had bilateral lamellar cataract and the phenotypes were classified by slit lamp examination. Genomic DNA was extracted from peripheral blood and coding regions and intron/exon boundaries of the αA-crystallin (CRYAA), γC-crystallin (CRYGC), and CRYGD genes were amplified by polymerase chain reaction and directly sequenced. Results Sequencing of the coding regions of CRYGD showed the presence of a heterozygous A→G transversion at c.401 position, which results in the substitution of a tyrosine to a cysteine (Y134C). The polymorphism was identified in three individuals, two affected and one unaffected. Conclusions A novel rare variant in CRYGD (Y134C) was detected in a Brazilian family with congenital cataract. Because there is no segregation between the substitution and the phenotypes in this family, other genetic alterations are likely to be present. PMID:21866214
Lessons from the canine Oxtr gene: populations, variants and functional aspects.
Bence, M; Marx, P; Szantai, E; Kubinyi, E; Ronai, Z; Banlaki, Z
2017-04-01
Oxytocin receptor (OXTR) acts as a key behavioral modulator of the central nervous system, affecting social behavior, stress, affiliation and cognitive functions. Variants of the Oxtr gene are known to influence behavior both in animals and humans; however, canine Oxtr polymorphisms are less characterized in terms of possible relevance to function, selection criteria in breeding and domestication. In this report, we provide a detailed characterization of common variants of the canine Oxtr gene. In particular (1) novel polymorphisms were identified by direct sequencing of wolf and dog samples, (2) allelic distributions and pairwise linkage disequilibrium patterns of several canine populations were compared, (3) neighbor joining (NJ) tree based on common single nucleotide polymorphisms (SNPs) was constructed, (4) mRNA expression features were assessed, (5) a novel splice variant was detected and (6) in vitro functional assays were performed. Results indicate marked differences regarding Oxtr variations between purebred dogs of different breeds, free-ranging dog populations, wolf subspecies and golden jackals. This, together with existence of explicitly dog-specific alleles and data obtained from the NJ tree implies that Oxtr could indeed have been a target gene during domestication and selection for human preferred aspects of temperament and social behavior. This assumption is further supported by the present observations on gene expression patterns within the brain and luciferase reporter experiments, providing a molecular level link between certain canine Oxtr polymorphisms and differences in nervous system function and behavior. © 2016 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.
NASA Astrophysics Data System (ADS)
Nardo, Luca; Tosi, Giovanna; Bondani, Maria; Accolla, Roberto; Andreoni, Alessandra
2012-06-01
By tens-of-picosecond resolved fluorescence detection we study Förster resonance energy transfer between a donor and a black-hole-quencher bound at the 5'- and 3'-positions of an oligonucleotide probe matching the highly polymorphic region between codons 51 and 58 of the human leukocyte antigen DQB1 0201 allele, conferring susceptibility to type-1 diabetes. The probe is annealed with non-amplified genomic DNAs carrying either the 0201 sequence or other DQB1 allelic variants. We detect the longest-lived donor fluorescence in the case of hybridization with the 0201 allele and definitely faster and distinct decays for the other allelic variants, some of which are single-nucleotide polymorphic.
Nomenclature for alleles of the thiopurine methyltransferase gene
Appell, Malin L.; Berg, Jonathan; Duley, John; Evans, William E.; Kennedy, Martin A.; Lennard, Lynne; Marinaki, Tony; McLeod, Howard L.; Relling, Mary V.; Schaeffeler, Elke; Schwab, Matthias; Weinshilboum, Richard; Yeoh, Allen E.J.; McDonagh, Ellen M.; Hebert, Joan M.; Klein, Teri E.; Coulthard, Sally A.
2013-01-01
The drug-metabolizing enzyme thiopurine methyltransferase (TPMT) has become one of the best examples of pharmacogenomics to be translated into routine clinical practice. TPMT metabolizes the thiopurines 6-mercaptopurine, 6-thioguanine, and azathioprine, drugs that are widely used for treatment of acute leukemias, inflammatory bowel diseases, and other disorders of immune regulation. Since the discovery of genetic polymorphisms in the TPMT gene, many sequence variants that cause a decreased enzyme activity have been identified and characterized. Increasingly, to optimize dose, pretreatment determination of TPMT status before commencing thiopurine therapy is now routine in many countries. Novel TPMT sequence variants are currently numbered sequentially using PubMed as a source of information; however, this has caused some problems as exemplified by two instances in which authors’ articles appeared on PubMed at the same time, resulting in the same allele numbers given to different polymorphisms. Hence, there is an urgent need to establish an order and consensus to the numbering of known and novel TPMT sequence variants. To address this problem, a TPMT nomenclature committee was formed in 2010, to define the nomenclature and numbering of novel variants for the TPMT gene. A website (http://www.imh.liu.se/tpmtalleles) serves as a platform for this work. Researchers are encouraged to submit novel TPMT alleles to the committee for designation and reservation of unique allele numbers. The committee has decided to renumber two alleles: nucleotide position 106 (G > A) from TPMT*24 to TPMT*30 and position 611 (T > C, rs79901429) from TPMT*28 to TPMT*31. Nomenclature for all other known alleles remains unchanged. PMID:23407052
Identification of a null allele of cytochrome P450 3A7: CYP3A7 polymorphism in a Korean population.
Lee, Sang Seop; Jung, Hyun-Ju; Park, Jung Soon; Cha, In-June; Cho, Doo-Yeoun; Shin, Jae-Gook
2010-01-01
Cytochrome P450 3A7 (CYP3A7) is expressed in the human fetal liver and plays a role in the metabolism of hormones, drugs, and toxic compounds. Genetic variants of CYP3A7 are associated with serum estrone level, bone density, and hepatic CYP3A activity in adults. We analyzed the genetic variations of CYP3A7 in a Korean population. From direct sequencing of all exons and flanking regions of the CYP3A7 gene in 48 Koreans, we found five genetic variants, including three novel variants. One variant, a thymidine insertion in exon 2 (4011insT), causes premature termination of CYP3A7 translation, which may result in a null phenotype. The novel variant was assigned to the CYP3A7*3 allele by the CYP allele nomenclature committee. For further screen of this novel variant in other ethnic populations, we used pyrosequencing to analyze an additional 185 Koreans, 100 African Americans, 100 Caucasians, and 159 Vietnamese for the presence of this variant. The variant was not found in any other individuals, except for one Korean subject. The frequencies of two known functional alleles, CYP3A7*2 and CYP3A7*1C, were 26 and 0%, respectively, in Koreans. The frequencies of the functional CYP3A7 polymorphisms in Koreans were significantly different from those in Caucasians and African Americans. This is the first report of a null-type allele of the CYP3A7 gene. It also provides population-level genetic data on CYP3A7 in Koreans to reveal the wide ethnic variation in CYP3A7 polymorphism.
Lim, Eileen C P; Brett, Maggie; Lai, Angeline H M; Lee, Siew-Peng; Tan, Ee-Shien; Jamuar, Saumya S; Ng, Ivy S L; Tan, Ene-Choo
2015-12-14
Next-generation sequencing (NGS) has revolutionized genetic research and offers enormous potential for clinical application. Sequencing the exome has the advantage of casting the net wide for all known coding regions while targeted gene panel sequencing provides enhanced sequencing depths and can be designed to avoid incidental findings in adult-onset conditions. A HaloPlex panel consisting of 180 genes within commonly altered chromosomal regions is available for use on both the Ion Personal Genome Machine (PGM) and MiSeq platforms to screen for causative mutations in these genes. We used this Haloplex ICCG panel for targeted sequencing of 15 patients with clinical presentations indicative of an abnormality in one of the 180 genes. Sequencing runs were done using the Ion 318 Chips on the Ion Torrent PGM. Variants were filtered for known polymorphisms and analysis was done to identify possible disease-causing variants before validation by Sanger sequencing. When possible, segregation of variants with phenotype in family members was performed to ascertain the pathogenicity of the variant. More than 97% of the target bases were covered at >20×. There was an average of 9.6 novel variants per patient. Pathogenic mutations were identified in five genes for six patients, with two novel variants. There were another five likely pathogenic variants, some of which were unreported novel variants. In a cohort of 15 patients, we were able to identify a likely genetic etiology in six patients (40%). Another five patients had candidate variants for which further evaluation and segregation analysis are ongoing. Our results indicate that the HaloPlex ICCG panel is useful as a rapid, high-throughput and cost-effective screening tool for 170 of the 180 genes. There is low coverage for some regions in several genes which might have to be supplemented by Sanger sequencing. However, comparing the cost, ease of analysis, and shorter turnaround time, it is a good alternative to exome sequencing for patients whose features are suggestive of a genetic etiology involving one of the genes in the panel.
A Bioinformatics Workflow for Variant Peptide Detection in Shotgun Proteomics*
Li, Jing; Su, Zengliu; Ma, Ze-Qiang; Slebos, Robbert J. C.; Halvey, Patrick; Tabb, David L.; Liebler, Daniel C.; Pao, William; Zhang, Bing
2011-01-01
Shotgun proteomics data analysis usually relies on database search. However, commonly used protein sequence databases do not contain information on protein variants and thus prevent variant peptides and proteins from been identified. Including known coding variations into protein sequence databases could help alleviate this problem. Based on our recently published human Cancer Proteome Variation Database, we have created a protein sequence database that comprehensively annotates thousands of cancer-related coding variants collected in the Cancer Proteome Variation Database as well as noncancer-specific ones from the Single Nucleotide Polymorphism Database (dbSNP). Using this database, we then developed a data analysis workflow for variant peptide identification in shotgun proteomics. The high risk of false positive variant identifications was addressed by a modified false discovery rate estimation method. Analysis of colorectal cancer cell lines SW480, RKO, and HCT-116 revealed a total of 81 peptides that contain either noncancer-specific or cancer-related variations. Twenty-three out of 26 variants randomly selected from the 81 were confirmed by genomic sequencing. We further applied the workflow on data sets from three individual colorectal tumor specimens. A total of 204 distinct variant peptides were detected, and five carried known cancer-related mutations. Each individual showed a specific pattern of cancer-related mutations, suggesting potential use of this type of information for personalized medicine. Compatibility of the workflow has been tested with four popular database search engines including Sequest, Mascot, X!Tandem, and MyriMatch. In summary, we have developed a workflow that effectively uses existing genomic data to enable variant peptide detection in proteomics. PMID:21389108
Kinoti, Wycliff M; Constable, Fiona E; Nancarrow, Narelle; Plummer, Kim M; Rodoni, Brendan
2017-01-01
PCR amplicon next generation sequencing (NGS) analysis offers a broadly applicable and targeted approach to detect populations of both high- or low-frequency virus variants in one or more plant samples. In this study, amplicon NGS was used to explore the diversity of the tripartite genome virus, Prunus necrotic ringspot virus (PNRSV) from 53 PNRSV-infected trees using amplicons from conserved gene regions of each of PNRSV RNA1, RNA2 and RNA3. Sequencing of the amplicons from 53 PNRSV-infected trees revealed differing levels of polymorphism across the three different components of the PNRSV genome with a total number of 5040, 2083 and 5486 sequence variants observed for RNA1, RNA2 and RNA3 respectively. The RNA2 had the lowest diversity of sequences compared to RNA1 and RNA3, reflecting the lack of flexibility tolerated by the replicase gene that is encoded by this RNA component. Distinct PNRSV phylo-groups, consisting of closely related clusters of sequence variants, were observed in each of PNRSV RNA1, RNA2 and RNA3. Most plant samples had a single phylo-group for each RNA component. Haplotype network analysis showed that smaller clusters of PNRSV sequence variants were genetically connected to the largest sequence variant cluster within a phylo-group of each RNA component. Some plant samples had sequence variants occurring in multiple PNRSV phylo-groups in at least one of each RNA and these phylo-groups formed distinct clades that represent PNRSV genetic strains. Variants within the same phylo-group of each Prunus plant sample had ≥97% similarity and phylo-groups within a Prunus plant sample and between samples had less ≤97% similarity. Based on the analysis of diversity, a definition of a PNRSV genetic strain was proposed. The proposed definition was applied to determine the number of PNRSV genetic strains in each of the plant samples and the complexity in defining genetic strains in multipartite genome viruses was explored.
Mass Spectrometric Determination of ILPR G-quadruplex Binding Sites in Insulin and IGF-2
Xiao, JunFeng
2009-01-01
The insulin-linked polymorphic region (ILPR) of the human insulin gene promoter region forms G-quadruplex structures in vitro. Previous studies show that insulin and insulin-like growth factor-2 (IGF-2) exhibit high affinity binding in vitro to 2-repeat sequences of ILPR variants a and h, but negligible binding to variant i. Two-repeat sequences of variants a and h form intramolecular G-quadruplex structures that are not evidenced for variant i. Here we report on the use of protein digestion combined with affinity capture and MALDI-MS detection to pinpoint ILPR binding sites in insulin and IGF-2. Peptides captured by ILPR variants a and h were sequenced by MALDI-MS/MS, LC-MS and in silico digestion. On-bead digestion of insulin-ILPR variant a complexes supported the conclusions. The results indicate that the sequence VCG(N)RGF is generally present in the captured peptides and is likely involved in the affinity binding interactions of the proteins with the ILPR G-quadruplexes. The significance of arginine in the interactions was studied by comparing the affinities of synthesized peptides VCGERGF and VCGEAGF with ILPR variant a. Peptides from other regions of the proteins that are connected through disulfide linkages were also detected in some capture experiments. Identification of binding sites could facilitate design of DNA binding ligands for capture and detection of insulin and IGF-2. The interactions may have biological significance as well. PMID:19747845
Genetic variants of dopamine D2 receptor impact heterodimerization with dopamine D1 receptor.
Błasiak, Ewa; Łukasiewicz, Sylwia; Szafran-Pilch, Kinga; Dziedzicka-Wasylewska, Marta
2017-04-01
The human dopamine D2 receptor gene has three polymorphic variants that alter its amino acid sequence: alanine substitution by valine in position 96 (V96A), proline substitution by serine in position 310 (P310S) and serine substitution by cysteine in position 311 (S311C). Their functional role has never been the object of extensive studies, even though there is some evidence that their occurrence correlates with schizophrenia. The HEK293 cell line was transfected with dopamine D1 and D2 receptors (or genetic variants of the D2 receptor), coupled to fluorescent proteins which allowed us to measure the extent of dimerization of these receptors, using a highly advanced biophysical approach (FLIM-FRET). Additionally, Fluoro-4 AM was used to examine changes in the level of calcium release after ligand stimulation of cells expressing different combinations of dopamine receptors. Using FLIM-FRET experiments we have shown that in HEK 293 expressing dopamine receptors, polymorphic mutations in the D2 receptor play a role in dimmer formation with the dopamine D1 receptor. The association level of dopamine receptors is affected by ligand administration, with variable effects depending on polymorphic variant of the D2 dopamine receptor. We have found that the level of heteromer formation is reflected by calcium ion release after ligand stimulation and have observed variations of this effect dependent on the polymorphic variant and the ligand. The data presented in this paper support the hypothesis on the role of calcium signaling regulated by the D1-D2 heteromer which may be of relevance for schizophrenia etiology. Copyright © 2016 Institute of Pharmacology, Polish Academy of Sciences. Published by Elsevier Urban & Partner Sp. z o.o. All rights reserved.
High-resolution genetic mapping of allelic variants associated with cell wall chemistry in Populus
Muchero, Wellington; Guo, Jianjun; Difazio, Stephen P.; ...
2015-01-23
We report the identification of six genetic loci and the allelic-variants associated with Populus cell wall phenotypes determined independently using pyrolysis Molecular Beam Mass Spectrometry (pyMBMS), saccharification assay and wet chemistry in two partially overlapping populations of P. trichocarpa genotypes sampled from multiple environments in the Pacific Northwest of North America. All 6 variants co-located with a quantitative trait locus (QTL) hotspot on chromosome XIV for lignin content, syringyl to guaiacyl (S/G) ratio, 5- and 6- carbon sugars identified in an interspecific P. trichocarpa x P. deltoides pseudo-backcross mapping pedigree. Genomic intervals containing an amino acid transporter, a MYB transcriptionmore » factor, an angustifolia CtBP transcription factor, a copper transport protein ATOX1-related, a Ca 2+ transporting ATPase and a protein kinase were identified within 5 QTL regions. Each interval contained single nucleotide polymorphisms (SNPs) that were significantly associated to cell-wall phenotypes, with associations exceeding the chromosome-wise Bonferroni-adjusted p-values in at least one environment. cDNA sequencing for allelic variants of 3 of the 6 genes identified polymorphisms leading to premature stop codons in the MYB transcription factor and protein kinase. On the other hand, variants of the Angustifolia CtBP transcription factor exhibited a polyglutamine (PolyQ) length polymorphism. Results from transient protoplast assays suggested that each of the polymorphisms conferred allelic differences in activation of cellulose, hemicelluloses and lignin pathway marker genes, with truncated and short PolyQ alleles exhibiting significantly reduced marker gene activation. Genes identified in this study represent novel targets for reducing cell wall recalcitrance for lignocellulosic biofuels production using plant biomass.« less
Neanderthal and Denisova tooth protein variants in present-day humans
Zanolli, Clément; Hourset, Mathilde; Esclassan, Rémi
2017-01-01
Environment parameters, diet and genetic factors interact to shape tooth morphostructure. In the human lineage, archaic and modern hominins show differences in dental traits, including enamel thickness, but variability also exists among living populations. Several polymorphisms, in particular in the non-collagenous extracellular matrix proteins of the tooth hard tissues, like enamelin, are involved in dental structure variation and defects and may be associated with dental disorders or susceptibility to caries. To gain insights into the relationships between tooth protein polymorphisms and dental structural morphology and defects, we searched for non-synonymous polymorphisms in tooth proteins from Neanderthal and Denisova hominins. The objective was to identify archaic-specific missense variants that may explain the dental morphostructural variability between extinct and modern humans, and to explore their putative impact on present-day dental phenotypes. Thirteen non-collagenous extracellular matrix proteins specific to hard dental tissues have been selected, searched in the publicly available sequence databases of Neanderthal and Denisova individuals and compared with modern human genome data. A total of 16 non-synonymous polymorphisms were identified in 6 proteins (ameloblastin, amelotin, cementum protein 1, dentin matrix acidic phosphoprotein 1, enamelin and matrix Gla protein). Most of them are encoded by dentin and enamel genes located on chromosome 4, previously reported to show signs of archaic introgression within Africa. Among the variants shared with modern humans, two are ancestral (common with apes) and one is the derived enamelin major variant, T648I (rs7671281), associated with a thinner enamel and specific to the Homo lineage. All the others are specific to Neanderthals and Denisova, and are found at a very low frequency in modern Africans or East and South Asians, suggesting that they may be related to particular dental traits or disease susceptibility in these populations. This modern regional distribution of archaic dental polymorphisms may reflect persistence of archaic variants in some populations and may contribute in part to the geographic dental variations described in modern humans. PMID:28902892
LPA and PLG sequence variation and kringle IV-2 copy number in two populations.
Crawford, Dana C; Peng, Ze; Cheng, Jan-Fang; Boffelli, Dario; Ahearn, Magdalena; Nguyen, Dan; Shaffer, Tristan; Yi, Qian; Livingston, Robert J; Rieder, Mark J; Nickerson, Deborah A
2008-01-01
Lp(a) levels have long been recognized as a potential risk factor for coronary heart disease that is almost completely under genetic control. Much of the genetics impacting Lp(a) levels has been attributed to the highly polymorphic LPA kringle IV-2 copy number variant, and most of the variance in Lp(a) levels in populations of European-descent is inversely correlated with kringle IV copy number. However, less of the variance is explained in African-descent populations for the same structural variation. African-descent populations have, on average, higher levels of Lp(a), suggesting other genetic factors contribute to Lp(a) level variability across populations. To identify potential cis-acting factors, we re-sequenced the gene LPA for single nucleotide polymorphism (SNP) discovery in 23 European-Americans and 24 African-Americans. We also re- sequenced the neighboring gene plasminogen (PLG) and genotyped the kringle IV copy number variant in the same reference samples. These data are the most comprehensive description of sequence variation in LPA and its relationship with the kringle IV copy number variant. With these data, we demonstrate that only a fraction of LPA sequence diversity has been previously documented. Also, we identify several high frequency SNPs present in the African-American sample but absent in the European-American sample. Finally, we show that SNPs within PLG are not in linkage disequilibrium with SNPs in LPA, and we show that kringle IV copy number variation is not in linkage disequilibrium with either LPA or PLG SNPs. Together, these data suggest that LPA SNPs could independently contribute to Lp(a) levels in the general population. Copyright (c) 2008 S. Karger AG, Basel.
Gui, Linsheng; Jiang, Bijie; Zhang, Yaran; Zan, Linsen
2015-03-15
Silent information regulator 6 (SIRT6) belongs to the family of class III nicotinamide adenine dinucleotide (NAD)-dependent deacetylase and plays an essential role in DNA repair and metabolism. This study was conducted to detect potential polymorphisms of the bovine SIRT6 gene and explore their relationships with body measurement and carcass quality in Qinchuan cattle. Four sequence variants (SVs) were identified in intron 6, exon 7, exon 9, and 3' UTR, via sequencing technology conducted in 468 individual Qinchuan cattle. Eleven different haplotypes were identified, of which two major haplotypes had a frequency of 45.7% (-CACT-) and 14.8% (-CGTC-). Three SVs (SV2, SV3 and SV4) were significantly associated with some of the body measurements and carcass quality traits (P<0.05 or P<0.01), and the H2H7 (CC-GA-TT-TC) diplotype had better performance than other combinations. Our results suggest that some polymorphisms in SIRT6 are associated with production traits and may be used as candidates for marker-assisted selection (MAS) and management in beef cattle breeding programs. Copyright © 2015 Elsevier B.V. All rights reserved.
Liu, Siyang; Huang, Shujia; Rao, Junhua; Ye, Weijian; Krogh, Anders; Wang, Jun
2015-01-01
Comprehensive recognition of genomic variation in one individual is important for understanding disease and developing personalized medication and treatment. Many tools based on DNA re-sequencing exist for identification of single nucleotide polymorphisms, small insertions and deletions (indels) as well as large deletions. However, these approaches consistently display a substantial bias against the recovery of complex structural variants and novel sequence in individual genomes and do not provide interpretation information such as the annotation of ancestral state and formation mechanism. We present a novel approach implemented in a single software package, AsmVar, to discover, genotype and characterize different forms of structural variation and novel sequence from population-scale de novo genome assemblies up to nucleotide resolution. Application of AsmVar to several human de novo genome assemblies captures a wide spectrum of structural variants and novel sequences present in the human population in high sensitivity and specificity. Our method provides a direct solution for investigating structural variants and novel sequences from de novo genome assemblies, facilitating the construction of population-scale pan-genomes. Our study also highlights the usefulness of the de novo assembly strategy for definition of genome structure.
Copy Number Variation across European Populations
Chen, Wanting; Hayward, Caroline; Wright, Alan F.; Hicks, Andrew A.; Vitart, Veronique; Knott, Sara; Wild, Sarah H.; Pramstaller, Peter P.; Wilson, James F.; Rudan, Igor; Porteous, David J.
2011-01-01
Genome analysis provides a powerful approach to test for evidence of genetic variation within and between geographical regions and local populations. Copy number variants which comprise insertions, deletions and duplications of genomic sequence provide one such convenient and informative source. Here, we investigate copy number variants from genome wide scans of single nucleotide polymorphisms in three European population isolates, the island of Vis in Croatia, the islands of Orkney in Scotland and the South Tyrol in Italy. We show that whereas the overall copy number variant frequencies are similar between populations, their distribution is highly specific to the population of origin, a finding which is supported by evidence for increased kinship correlation for specific copy number variants within populations. PMID:21829696
2014-01-01
Background Central core disease is a congenital myopathy, characterized by presence of central core-like areas in muscle fibers. Patients have mild or moderate weakness, hypotonia and motor developmental delay. The disease is caused by mutations in the human ryanodine receptor gene (RYR1), which encodes a calcium-release channel. Since the RYR1 gene is huge, containing 106 exons, mutation screening has been limited to three ‘hot spots’, with particular attention to the C-terminal region. Recent next- generation sequencing methods are now identifying multiple numbers of variants in patients, in which interpretation and phenotype prevision is difficult. Case presentation In a Brazilian Caucasian family, clinical, histopathological and molecular analysis identified a new case of central core disease in a 48-year female. Sanger sequencing of the C-terminal region of the RYR1 gene identified two different missense mutations: c.14256 A > C polymorphism in exon 98 and c.14693 T > C in exon 102, which have already been described as pathogenic. Trans-position of the 2 mutations was confirmed because patient’s daughter, mother and sister carried only the exon 98’s mutation, a synonymous variant that was subsequently found in the frequency of 013–0,05 of alleles. Further next generation sequencing study of the whole RYR1 gene in the patient revealed the presence of additional 5 common silent polymorphisms in homozygosis and 8 polymorphisms in heterozygosis. Conclusions Considering that patient’s relatives showed no pathologic phenotype, and the phenotype presented by the patient is within the range observed in other central core disease patients with the same mutation, it was concluded that the c.14256 A > C polymorphism alone is not responsible for disease, and the associated additional silent polymorphisms are not acting as modifiers of the primary pathogenic mutation in the affected patient. The case described above illustrates the present reality where new methods for wide genome screening are becoming more accessible and able to identify a great variety of mutations and polymorphisms of unknown function in patients and their families. PMID:25084811
Ali, Syeda Hafiza Benish; Bangash, Kashif Sardar; Rauf, Abdur; Younis, Muhammad; Anwar, Khursheed; Khurram, Raja; Khawaja, Muhammad Athar; Azam, Maleeha; Qureshi, Abid Ali; Akhter, Saeed; Kiemeney, Lambertus A; Qamar, Raheel
2017-10-01
Urothelial bladder carcinoma (UBC) is the most common among urinary bladder neoplasms. We carried out a preliminary study to determine the genetic etiology of UBC in Pakistani population, for this 25 sequence variants from 17 candidate genes were studied in 400 individuals by using polymerase chain reaction-based techniques. Multivariate logistic regression analysis was performed for association analysis of the overall data as well as the data stratified by smoking status, tumor grade and tumor stage. Variants of GSTM1, IGFBP3, LEPR and ACE were found to be associated with altered UBC risk in the overall comparison. CYP1B1 and CDKN1A variants displayed a risk modulation among smokers; IGFBP3 and LEPR variants among non-smokers while GSTM1 polymorphism exhibited association with both. GSTM1 and LEPR variants conferred an altered susceptibility to low grade UBC; GSTT1, IGFBP3 and PPARG variants to high grade UBC while ACE polymorphism to both grades. GSTM1 and LEPR variants exhibited risk modulation for non-muscle-invasive bladder cancer (NMIBC); GSTT1 and PPARG variants for muscle-invasive bladder cancer (MIBC), and ACE variant for NMIBC as well as MIBC. In general, the susceptibility markers were common for low grade and NMIBC; and distinct from those for high grade and MIBC indicating the distinct pathologies of both groups. In brief, our results conform to reports of previously associated variants in addition to identifying novel potential genetic predictors of UBC susceptibility.
Vembar, Shruthi Sridhar; Seetin, Matthew; Lambert, Christine; Nattestad, Maria; Schatz, Michael C.; Baybayan, Primo; Scherf, Artur; Smith, Melissa Laird
2016-01-01
The application of next-generation sequencing to estimate genetic diversity of Plasmodium falciparum, the most lethal malaria parasite, has proved challenging due to the skewed AT-richness [∼80.6% (A + T)] of its genome and the lack of technology to assemble highly polymorphic subtelomeric regions that contain clonally variant, multigene virulence families (Ex: var and rifin). To address this, we performed amplification-free, single molecule, real-time sequencing of P. falciparum genomic DNA and generated reads of average length 12 kb, with 50% of the reads between 15.5 and 50 kb in length. Next, using the Hierarchical Genome Assembly Process, we assembled the P. falciparum genome de novo and successfully compiled all 14 nuclear chromosomes telomere-to-telomere. We also accurately resolved centromeres [∼90–99% (A + T)] and subtelomeric regions and identified large insertions and duplications that add extra var and rifin genes to the genome, along with smaller structural variants such as homopolymer tract expansions. Overall, we show that amplification-free, long-read sequencing combined with de novo assembly overcomes major challenges inherent to studying the P. falciparum genome. Indeed, this technology may not only identify the polymorphic and repetitive subtelomeric sequences of parasite populations from endemic areas but may also evaluate structural variation linked to virulence, drug resistance and disease transmission. PMID:27345719
Checking of individuality by DNA profiling.
Brdicka, R; Nürnberg, P
1993-08-25
A review of methods of DNA analysis used in forensic medicine for identification, paternity testing, etc. is provided. Among other techniques, DNA fingerprinting using different probes and polymerase chain reaction-based techniques such as amplified sequence polymorphisms and minisatellite variant repeat mapping are thoroughly described and both theoretical and practical aspects are discussed.
Rong, Rong; Tao, Ya-Xiong; Cheung, Bernard M Y; Xu, Aimin; Cheung, Grace C N; Lam, Karen S L
2006-08-01
Mutations in the melanocortin-4 receptor gene (MC4R) are the most common monogenic form of human obesity. However, the contribution of MC4R mutations to obesity in Chinese has not been investigated. We studied the frequency of MC4R mutations in an obese southern Chinese population and the functional consequences of the novel variants identified. We screened for MC4R mutations in 227 obese [body mass index (BMI) 35.29 +/- 5.75 kg/m2] and 100 lean (BMI 21.57 +/- 0.29 kg/m2) southern Chinese subjects using PCR-direct sequencing. In vitro functional studies, including cell surface expression, ligand binding, and cyclic adenosine monophosphate (cAMP) accumulation, were performed to examine the functional properties of three novel missense mutations. Apart from two previously reported polymorphisms, V103I and -176 A > C, three novel missense heterozygous variants (Y35C, C40R and M218T) were identified. The polymorphisms -176 A > C and Y35C were detected in both obese and normal subjects with similar frequency. C40R was identified only in an obese subject. Pedigree analysis revealed M218T carriers in both lean and obese subjects. The prevalence of V103I carriers in normal-weight controls was significantly higher than that in obese subjects (5.3%vs. 1.3%, P < 0.05). In vitro functional studies showed that all three novel missense variants have normal functions. Two known polymorphisms and three novel variants of the MC4R were identified. No overt functional defects were observed for the three novel MC4R variants, suggesting that they might not be the cause of obesity in variant carriers.
Mukda, Ekchol; Trachoo, Objoon; Pasomsub, Ekawat; Tiyasirichokchai, Rawiphorn; Iemwimangsa, Nareenart; Sosothikul, Darintr; Chantratita, Wasun; Pakakasama, Samart
2017-08-01
In the present study, we used exome sequencing to analyze PRF1, UNC13D, STX11, and STXBP2, as well as genes associated with primary immunodeficiency disease (RAB27A, LYST, AP3B1, SH2D1A, ITK, CD27, XIAP, and MAGT1) in Thai children with hemophagocytic lymphohistiocytosis (HLH). We performed mutation analysis of HLH-associated genes in 25 Thai children using an exome sequencing method. Genetic variations found within these target genes were compared to exome sequencing data from 133 healthy individuals. Variants identified with minor allele frequencies <5% and novel mutations were confirmed using Sanger sequencing. Exome sequencing data revealed 101 non-synonymous single nucleotide polymorphisms (SNPs) in all subjects. These SNPs were classified as pathogenic (n = 1), likely pathogenic (n = 16), variant of unknown significance (n = 12), or benign variant (n = 72). Homozygous, compound heterozygous, and double-gene heterozygous variants, involving mutations in PRF1 (n = 3), UNC13D (n = 2), STXBP2 (n = 3), LYST (n = 3), XIAP (n = 2), AP3B1 (n = 1), RAB27A (n = 1), and MAGT1 (n = 1), were demonstrated in 12 patients. Novel mutations were found in most patients in this study. In conclusion, exome sequencing demonstrated the ability to identify rare genetic variants in HLH patients. This method is useful in the detection of mutations in multi-gene associated diseases.
Al-Bustan, Suzanne A; Al-Serri, Ahmad; Annice, Babitha G; Alnaqeeb, Majed A; Al-Kandari, Wafa Y; Dashti, Mohammed
2018-01-01
The role interethnic genetic differences play in plasma lipid level variation across populations is a global health concern. Several genes involved in lipid metabolism and transport are strong candidates for the genetic association with lipid level variation especially lipoprotein lipase (LPL). The objective of this study was to re-sequence the full LPL gene in Kuwaiti Arabs, analyse the sequence variation and identify variants that could attribute to variation in plasma lipid levels for further genetic association. Samples (n = 100) of an Arab ethnic group from Kuwait were analysed for sequence variation by Sanger sequencing across the 30 Kb LPL gene and its flanking sequences. A total of 293 variants including 252 single nucleotide polymorphisms (SNPs) and 39 insertions/deletions (InDels) were identified among which 47 variants (32 SNPs and 15 InDels) were novel to Kuwaiti Arabs. This study is the first to report sequence data and analysis of frequencies of variants at the LPL gene locus in an Arab ethnic group with a novel "rare" variant (LPL:g.18704C>A) significantly associated to HDL (B = -0.181; 95% CI (-0.357, -0.006); p = 0.043), TG (B = 0.134; 95% CI (0.004-0.263); p = 0.044) and VLDL (B = 0.131; 95% CI (-0.001-0.263); p = 0.043) levels. Sequence variation in Kuwaiti Arabs was compared to other populations and was found to be similar with regards to the number of SNPs, InDels and distribution of the number of variants across the LPL gene locus and minor allele frequency (MAF). Moreover, comparison of the identified variants and their MAF with other reports provided a list of 46 potential variants across the LPL gene to be considered for future genetic association studies. The findings warrant further investigation into the association of g.18704C>A with lipid levels in other ethnic groups and with clinical manifestations of dyslipidemia.
Al-Serri, Ahmad; Annice, Babitha G.; Alnaqeeb, Majed A.; Al-Kandari, Wafa Y.; Dashti, Mohammed
2018-01-01
The role interethnic genetic differences play in plasma lipid level variation across populations is a global health concern. Several genes involved in lipid metabolism and transport are strong candidates for the genetic association with lipid level variation especially lipoprotein lipase (LPL). The objective of this study was to re-sequence the full LPL gene in Kuwaiti Arabs, analyse the sequence variation and identify variants that could attribute to variation in plasma lipid levels for further genetic association. Samples (n = 100) of an Arab ethnic group from Kuwait were analysed for sequence variation by Sanger sequencing across the 30 Kb LPL gene and its flanking sequences. A total of 293 variants including 252 single nucleotide polymorphisms (SNPs) and 39 insertions/deletions (InDels) were identified among which 47 variants (32 SNPs and 15 InDels) were novel to Kuwaiti Arabs. This study is the first to report sequence data and analysis of frequencies of variants at the LPL gene locus in an Arab ethnic group with a novel “rare” variant (LPL:g.18704C>A) significantly associated to HDL (B = -0.181; 95% CI (-0.357, -0.006); p = 0.043), TG (B = 0.134; 95% CI (0.004–0.263); p = 0.044) and VLDL (B = 0.131; 95% CI (-0.001–0.263); p = 0.043) levels. Sequence variation in Kuwaiti Arabs was compared to other populations and was found to be similar with regards to the number of SNPs, InDels and distribution of the number of variants across the LPL gene locus and minor allele frequency (MAF). Moreover, comparison of the identified variants and their MAF with other reports provided a list of 46 potential variants across the LPL gene to be considered for future genetic association studies. The findings warrant further investigation into the association of g.18704C>A with lipid levels in other ethnic groups and with clinical manifestations of dyslipidemia. PMID:29438437
Uncovering Local Trends in Genetic Effects of Multiple Phenotypes via Functional Linear Models.
Vsevolozhskaya, Olga A; Zaykin, Dmitri V; Barondess, David A; Tong, Xiaoren; Jadhav, Sneha; Lu, Qing
2016-04-01
Recent technological advances equipped researchers with capabilities that go beyond traditional genotyping of loci known to be polymorphic in a general population. Genetic sequences of study participants can now be assessed directly. This capability removed technology-driven bias toward scoring predominantly common polymorphisms and let researchers reveal a wealth of rare and sample-specific variants. Although the relative contributions of rare and common polymorphisms to trait variation are being debated, researchers are faced with the need for new statistical tools for simultaneous evaluation of all variants within a region. Several research groups demonstrated flexibility and good statistical power of the functional linear model approach. In this work we extend previous developments to allow inclusion of multiple traits and adjustment for additional covariates. Our functional approach is unique in that it provides a nuanced depiction of effects and interactions for the variables in the model by representing them as curves varying over a genetic region. We demonstrate flexibility and competitive power of our approach by contrasting its performance with commonly used statistical tools and illustrate its potential for discovery and characterization of genetic architecture of complex traits using sequencing data from the Dallas Heart Study. Published 2016. This article is a U.S. Government work and is in the public domain in the USA.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ruggles, Kelly V.; Tang, Zuojian; Wang, Xuya
Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations and splice variants identified in cancer cells are translated. Herein we therefore describe a proteogenomic data integration tool (QUILTS) and illustrate its application to whole genome, transcriptome and global MS peptide sequence datasets generated from a pair of luminal and basal-like breast cancer patient derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS process replicates. Despite over thirty sample replicates, only about 10% of all SNV (somatic andmore » germline) were detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNV without a detectable mRNA transcript were also observed demonstrating the transcriptome coverage was also incomplete (~80%). In contrast to germ-line variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than the luminal tumor raising the possibility of differential translation or protein degradation effects. In conclusion, the QUILTS program integrates DNA, RNA and peptide sequencing to assess the degree to which somatic mutations are translated and therefore biologically active. By identifying gaps in sequence coverage QUILTS benchmarks current technology and assesses progress towards whole cancer proteome and transcriptome analysis.« less
Haynes, Edward; Helgason, Thorunn; Young, J Peter W; Thwaites, Richard; Budge, Giles E
2013-08-01
Melissococcus plutonius is the bacterial pathogen that causes European Foulbrood of honeybees, a globally important honeybee brood disease. We have used next-generation sequencing to identify highly polymorphic regions in an otherwise genetically homogenous organism, and used these loci to create a modified MLST scheme. This synthesis of a proven typing scheme format with next-generation sequencing combines reliability and low costs with insights only available from high-throughput sequencing technologies. Using this scheme we show that the global distribution of M.plutonius variants is not uniform. We use the scheme in epidemiological studies to trace movements of infective material around England, insights that would have been impossible to confirm without the typing scheme. We also demonstrate the persistence of local variants over time. © 2013 Crown copyright. Reproduced with the permission of the Controller of Her Majesty's Stationary Office/Queen’s Printer for Scotland and Food and Environment Research Agency.
Sequence variants in oxytocin pathway genes and preterm birth: a candidate gene association study
2013-01-01
Background Preterm birth (PTB) is a complex disorder associated with significant neonatal mortality and morbidity and long-term adverse health consequences. Multiple lines of evidence suggest that genetic factors play an important role in its etiology. This study was designed to identify genetic variation associated with PTB in oxytocin pathway genes whose role in parturition is well known. Methods To identify common genetic variants predisposing to PTB, we genotyped 16 single nucleotide polymorphisms (SNPs) in the oxytocin (OXT), oxytocin receptor (OXTR), and leucyl/cystinyl aminopeptidase (LNPEP) genes in 651 case infants from the U.S. and one or both of their parents. In addition, we examined the role of rare genetic variation in susceptibility to PTB by conducting direct sequence analysis of OXTR in 1394 cases and 1112 controls from the U.S., Argentina, Denmark, and Finland. This study was further extended to maternal triads (maternal grandparents-mother of a case infant, N=309). We also performed in vitro analysis of selected rare OXTR missense variants to evaluate their functional importance. Results Maternal genetic effect analysis of the SNP genotype data revealed four SNPs in LNPEP that show significant association with prematurity. In our case–control sequence analysis, we detected fourteen coding variants in exon 3 of OXTR, all but four of which were found in cases only. Of the fourteen variants, three were previously unreported novel rare variants. When the sequence data from the maternal triads were analyzed using the transmission disequilibrium test, two common missense SNPs (rs4686302 and rs237902) in OXTR showed suggestive association for three gestational age subgroups. In vitro functional assays showed a significant difference in ligand binding between wild-type and two mutant receptors. Conclusions Our study suggests an association between maternal common polymorphisms in LNPEP and susceptibility to PTB. Maternal OXTR missense SNPs rs4686302 and rs237902 may have gestational age-dependent effects on prematurity. Most of the OXTR rare variants identified do not appear to significantly contribute to the risk of PTB, but those shown to affect receptor function in our in vitro study warrant further investigation. Future studies with larger sample sizes are needed to confirm the findings of this study. PMID:23889750
Shabana, -; Hasnain, Shahida
2016-06-01
Leptin is a protein hormone synthesized by adipocytes and is involved in the regulation of food intake and energy expenditure. We hypothesized that any change in the promoter sequence can affect the expression of the gene and hence leptin protein levels in the serum. The aim of the current study was to investigate the relationship of such a promoter variant of the leptin gene, G-2548A polymorphism, with obesity and its effect on various anthropometric and metabolic parameters in a Pakistani cohort consisting of 250 obese and 225 non-obese control subjects. Body weight, height, waist circumference (WC), hip circumference (HC) and blood pressure (BP) were measured by standard methods and levels of fasting blood glucose (FBG), total cholesterol, triglycerides, HDLC, LDLC, and leptin were determined. Genotyping was done by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). The results showed that the LEP G-2548A polymorphism showed significant association with obesity in Pakistan. In addition, the polymorphism showed association with weight, height, BMI, WC, HDLC and serum leptin levels. The findings suggest that the leptin promoter G-2548A variant may play its part in the progression to obesity by not only affecting the body's fat distribution but also by changing the serum leptin and HDLC levels.
Genetic Mapping and Exome Sequencing Identify Variants Associated with Five Novel Diseases
Puffenberger, Erik G.; Jinks, Robert N.; Sougnez, Carrie; Cibulskis, Kristian; Willert, Rebecca A.; Achilly, Nathan P.; Cassidy, Ryan P.; Fiorentini, Christopher J.; Heiken, Kory F.; Lawrence, Johnny J.; Mahoney, Molly H.; Miller, Christopher J.; Nair, Devika T.; Politi, Kristin A.; Worcester, Kimberly N.; Setton, Roni A.; DiPiazza, Rosa; Sherman, Eric A.; Eastman, James T.; Francklyn, Christopher; Robey-Bond, Susan; Rider, Nicholas L.; Gabriel, Stacey; Morton, D. Holmes; Strauss, Kevin A.
2012-01-01
The Clinic for Special Children (CSC) has integrated biochemical and molecular methods into a rural pediatric practice serving Old Order Amish and Mennonite (Plain) children. Among the Plain people, we have used single nucleotide polymorphism (SNP) microarrays to genetically map recessive disorders to large autozygous haplotype blocks (mean = 4.4 Mb) that contain many genes (mean = 79). For some, uninformative mapping or large gene lists preclude disease-gene identification by Sanger sequencing. Seven such conditions were selected for exome sequencing at the Broad Institute; all had been previously mapped at the CSC using low density SNP microarrays coupled with autozygosity and linkage analyses. Using between 1 and 5 patient samples per disorder, we identified sequence variants in the known disease-causing genes SLC6A3 and FLVCR1, and present evidence to strongly support the pathogenicity of variants identified in TUBGCP6, BRAT1, SNIP1, CRADD, and HARS. Our results reveal the power of coupling new genotyping technologies to population-specific genetic knowledge and robust clinical data. PMID:22279524
Reference genotype and exome data from an Australian Aboriginal population for health-based research
Tang, Dave; Anderson, Denise; Francis, Richard W.; Syn, Genevieve; Jamieson, Sarra E.; Lassmann, Timo; Blackwell, Jenefer M.
2016-01-01
Genetic analyses, including genome-wide association studies and whole exome sequencing (WES), provide powerful tools for the analysis of complex and rare genetic diseases. To date there are no reference data for Aboriginal Australians to underpin the translation of health-based genomic research. Here we provide a catalogue of variants called after sequencing the exomes of 72 Aboriginal individuals to a depth of 20X coverage in ∼80% of the sequenced nucleotides. We determined 320,976 single nucleotide variants (SNVs) and 47,313 insertions/deletions using the Genome Analysis Toolkit. We had previously genotyped a subset of the Aboriginal individuals (70/72) using the Illumina Omni2.5 BeadChip platform and found ~99% concordance at overlapping sites, which suggests high quality genotyping. Finally, we compared our SNVs to six publicly available variant databases, such as dbSNP and the Exome Sequencing Project, and 70,115 of our SNVs did not overlap any of the single nucleotide polymorphic sites in all the databases. Our data set provides a useful reference point for genomic studies on Aboriginal Australians. PMID:27070114
Tang, Dave; Anderson, Denise; Francis, Richard W; Syn, Genevieve; Jamieson, Sarra E; Lassmann, Timo; Blackwell, Jenefer M
2016-04-12
Genetic analyses, including genome-wide association studies and whole exome sequencing (WES), provide powerful tools for the analysis of complex and rare genetic diseases. To date there are no reference data for Aboriginal Australians to underpin the translation of health-based genomic research. Here we provide a catalogue of variants called after sequencing the exomes of 72 Aboriginal individuals to a depth of 20X coverage in ∼80% of the sequenced nucleotides. We determined 320,976 single nucleotide variants (SNVs) and 47,313 insertions/deletions using the Genome Analysis Toolkit. We had previously genotyped a subset of the Aboriginal individuals (70/72) using the Illumina Omni2.5 BeadChip platform and found ~99% concordance at overlapping sites, which suggests high quality genotyping. Finally, we compared our SNVs to six publicly available variant databases, such as dbSNP and the Exome Sequencing Project, and 70,115 of our SNVs did not overlap any of the single nucleotide polymorphic sites in all the databases. Our data set provides a useful reference point for genomic studies on Aboriginal Australians.
Gimm, O; Gössling, A; Marsh, D J; Dahia, P L M; Mulligan, L M; Deimling, A von; Eng, C
1999-01-01
Glial cell line-derived neurotrophic factor (GDNF) plays a key role in the control of vertebrate neuron survival and differentiation in both the central and peripheral nervous systems. GDNF preferentially binds to GFRα-1 which then interacts with the receptor tyrosine kinase RET. We investigated a panel of 36 independent cases of mainly advanced sporadic brain tumours for the presence of mutations in GDNF and GFRα-1. No mutations were found in the coding region of GDNF. We identified six previously described GFRα-1 polymorphisms, two of which lead to an amino acid change. In 15 of 36 brain tumours, all polymorphic variants appeared to be homozygous. Of these 15 tumours, one also had a rare, apparently homozygous, sequence variant at codon 361. Because of the rarity of the combination of homozygous sequence variants, analysis for hemizygous deletion was pursued in the 15 samples and loss of heterozygosity was found in 11 tumours. Our data suggest that intragenic point mutations of GDNF or GFRα-1 are not a common aetiologic event in brain tumours. However, either deletion of GFRα-1 and/or nearby genes may contribute to the pathogenesis of these tumours. © 1999 Cancer Research Campaign PMID:10408842
Agouti sequence polymorphisms in coyotes, wolves and dogs suggest hybridization.
Schmutz, Sheila M; Berryere, Thomas G; Barta, Jodi L; Reddick, Kimberley D; Schmutz, Josef K
2007-01-01
Domestic dogs have been shown to have multiple alleles of the Agouti Signal Peptide (ASIP) in exon 4 and we wished to determine the level of polymorphism in the common wild canids of Canada, wolves and coyotes, in comparison. All Canadian coyotes and most wolves have banded hairs. The ASIP coding sequence of the wolf did not vary from the domestic dog but one variant was detected in exon 4 of coyotes that did not alter the arginine at this position. Two other differences were found in the sequence flanking exon 4 of coyotes compared with the 45 dogs and 1 wolf. The coyotes also demonstrated a relatively common polymorphism in the 3' UTR sequence that could be used for population studies. One of the ASIP alleles (R96C) in domestic dogs causes a solid black coat color in homozygotes. Although some wolves are melanistic, this phenotype does not appear to be caused by this same mutation. However, one wolf, potentially a dog-wolf hybrid or descendant thereof, was heterozygous for this allele. Likewise 2 coyotes, potentially dog-coyote or wolf-coyote hybrid descendants, were heterozygous for the several polymorphisms in and flanking exon 4. We could conclude that these were coyote-dog hybrids because both were heterozygous for 2 mutations causing fawn coat color in dogs.
King, Lanikea B.; Walum, Hasse; Inoue, Kiyoshi; Eyrich, Nicholas W.; Young, Larry J.
2015-01-01
Background Oxytocin (OXT) modulates several aspects of social behavior. Intranasal OXT is a leading candidate for treating social deficits in autism spectrum disorder (ASD) and common genetic variants in the human oxytocin receptor (OXTR) are associated with emotion recognition, relationship quality and ASD. Animal models have revealed that individual differences in Oxtr expression in the brain drive social behavior variation. Our understanding of how genetic variation contributes to brain OXTR expression is very limited. Methods We investigated Oxtr expression in monogamous prairie voles, which have a well characterized OXT system. We quantified brain region-specific levels of Oxtr mRNA and OXTR protein with established neuroanatomical methods. We used pyrosequencing to investigate allelic imbalance of Oxtr mRNA, a molecular signature of polymorphic genetic regulatory elements. We performed next-generation sequencing to discover variants in and near the Oxtr gene. We investigated social attachment using the partner preference test. Results Our allelic imbalance data demonstrates that genetic variants contribute to individual differences in Oxtr expression, but only in particular brain regions, including the nucleus accumbens (NAcc), where OXTR signaling facilitates social attachment. Next-generation sequencing identified one polymorphism in the Oxtr intron, near a putative cis-regulatory element, explaining 74% of the variance in striatal Oxtr expression specifically. Males homozygous for the high expressing allele display enhanced social attachment. Discussion Taken together, these findings provide convincing evidence for robust genetic influence on Oxtr expression and provide novel insights into how non-coding polymorphisms in the OXTR might influence individual differences in human social cognition and behavior PMID:26893121
FY*A silencing by the GATA-motif variant FY*A(-69C) in a Caucasian family.
Písačka, Martin; Marinov, Iuri; Králová, Miroslava; Králová, Jana; Kořánová, Michaela; Bohoněk, Miloš; Sood, Chhavi; Ochoa-Garay, Gorka
2015-11-01
The c.1-67C variant polymorphism in a GATA motif of the FY promoter is known to result in erythroid-specific FY silencing, that is, in Fy(a-) and Fy(b-) phenotypes. A Caucasian donor presented with the very rare Fy(a-b-) phenotype and was further investigated. Genomic DNA was analyzed by sequencing to identify the cause of the Fy(a-b-) phenotype. Samples were collected from some of his relatives to establish a correlation between the serology and genotyping results. Red blood cells were analyzed by gel column agglutination and flow cytometry. Genomic DNA was analyzed on genotyping microarrays, by DNA sequencing and by allele-specific PCR. In the donor, a single-nucleotide polymorphism T>C within the GATA motif was found at Position c.1-69 of the FY promoter and shown to occur in the FY*A allele. His genotype was found to be FY*A(-69C), FY*BW.01. In six FY*A/FY*B heterozygous members of the family, a perfect correlation was found between the presence vs. absence of the FY*A(-69C) variant allele and a Fy(a-) vs. Fy(a+) phenotype. The location of the c.1-69C polymorphism in a GATA motif whose disruption is known to result in a Fy null phenotype, together with the perfect correlation between the presence of the FY*A(-69C) allele and the Fy(a-) phenotype support a cause-effect relationship between the two. © 2015 AABB.
Pettigrew, Christopher; Wayte, Nicola; Lovelock, Paul K; Tavtigian, Sean V; Chenevix-Trench, Georgia; Spurdle, Amanda B; Brown, Melissa A
2005-01-01
Introduction Aberrant pre-mRNA splicing can be more detrimental to the function of a gene than changes in the length or nature of the encoded amino acid sequence. Although predicting the effects of changes in consensus 5' and 3' splice sites near intron:exon boundaries is relatively straightforward, predicting the possible effects of changes in exonic splicing enhancers (ESEs) remains a challenge. Methods As an initial step toward determining which ESEs predicted by the web-based tool ESEfinder in the breast cancer susceptibility gene BRCA1 are likely to be functional, we have determined their evolutionary conservation and compared their location with known BRCA1 sequence variants. Results Using the default settings of ESEfinder, we initially detected 669 potential ESEs in the coding region of the BRCA1 gene. Increasing the threshold score reduced the total number to 464, while taking into consideration the proximity to splice donor and acceptor sites reduced the number to 211. Approximately 11% of these ESEs (23/211) either are identical at the nucleotide level in human, primates, mouse, cow, dog and opossum Brca1 (conserved) or are detectable by ESEfinder in the same position in the Brca1 sequence (shared). The frequency of conserved and shared predicted ESEs between human and mouse is higher in BRCA1 exons (2.8 per 100 nucleotides) than in introns (0.6 per 100 nucleotides). Of conserved or shared putative ESEs, 61% (14/23) were predicted to be affected by sequence variants reported in the Breast Cancer Information Core database. Applying the filters described above increased the colocalization of predicted ESEs with missense changes, in-frame deletions and unclassified variants predicted to be deleterious to protein function, whereas they decreased the colocalization with known polymorphisms or unclassified variants predicted to be neutral. Conclusion In this report we show that evolutionary conservation analysis may be used to improve the specificity of an ESE prediction tool. This is the first report on the prediction of the frequency and distribution of ESEs in the BRCA1 gene, and it is the first reported attempt to predict which ESEs are most likely to be functional and therefore which sequence variants in ESEs are most likely to be pathogenic. PMID:16280041
Verbist, Bie M P; Thys, Kim; Reumers, Joke; Wetzels, Yves; Van der Borght, Koen; Talloen, Willem; Aerssens, Jeroen; Clement, Lieven; Thas, Olivier
2015-01-01
In virology, massively parallel sequencing (MPS) opens many opportunities for studying viral quasi-species, e.g. in HIV-1- and HCV-infected patients. This is essential for understanding pathways to resistance, which can substantially improve treatment. Although MPS platforms allow in-depth characterization of sequence variation, their measurements still involve substantial technical noise. For Illumina sequencing, single base substitutions are the main error source and impede powerful assessment of low-frequency mutations. Fortunately, base calls are complemented with quality scores (Qs) that are useful for differentiating errors from the real low-frequency mutations. A variant calling tool, Q-cpileup, is proposed, which exploits the Qs of nucleotides in a filtering strategy to increase specificity. The tool is imbedded in an open-source pipeline, VirVarSeq, which allows variant calling starting from fastq files. Using both plasmid mixtures and clinical samples, we show that Q-cpileup is able to reduce the number of false-positive findings. The filtering strategy is adaptive and provides an optimized threshold for individual samples in each sequencing run. Additionally, linkage information is kept between single-nucleotide polymorphisms as variants are called at the codon level. This enables virologists to have an immediate biological interpretation of the reported variants with respect to their antiviral drug responses. A comparison with existing SNP caller tools reveals that calling variants at the codon level with Q-cpileup results in an outstanding sensitivity while maintaining a good specificity for variants with frequencies down to 0.5%. The VirVarSeq is available, together with a user's guide and test data, at sourceforge: http://sourceforge.net/projects/virtools/?source=directory. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Burgos, Mariana; Arenas, Alvaro; Cabrera, Rodrigo
2016-08-01
Inherited long QT syndrome (LQTS) is a cardiac channelopathy characterized by a prolongation of QT interval and the risk of syncope, cardiac arrest, and sudden cardiac death. Genetic diagnosis of LQTS is critical in medical practice as results can guide adequate management of patients and distinguish phenocopies such as catecholaminergic polymorphic ventricular tachycardia (CPVT). However, extensive screening of large genomic regions is required in order to reliably identify genetic causes. Semiconductor whole exome sequencing (WES) is a promising approach for the identification of variants in the coding regions of most human genes. DNA samples from 21 Colombian patients clinically diagnosed with LQTS were enriched for coding regions using multiplex polymerase chain reaction (PCR) and subjected to WES using a semiconductor sequencer. Semiconductor WES showed mean coverage of 93.6 % for all coding regions relevant to LQTS at >10× depth with high intra- and inter-assay depth heterogeneity. Fifteen variants were detected in 12 patients in genes associated with LQTS. Three variants were identified in three patients in genes associated with CPVT. Co-segregation analysis was performed when possible. All variants were analyzed with two pathogenicity prediction algorithms. The overall prevalence of LQTS and CPVT variants in our cohort was 71.4 %. All LQTS variants previously identified through commercial genetic testing were identified. Standardized WES assays can be easily implemented, often at a lower cost than sequencing panels. Our results show that WES can identify LQTS-causing mutations and permits differential diagnosis of related conditions in a real-world clinical setting. However, high heterogeneity in sequencing depth and low coverage in the most relevant genes is expected to be associated with reduced analytical sensitivity.
Keel, B N; Nonneman, D J; Rohrer, G A
2017-08-01
Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a more significant effect on phenotypic variation than do other types of genetic variants. Hence, a comprehensive list of these functional variants would be of considerable interest in swine genomic studies, particularly those targeting fertility and production traits. Whole-genome sequence was obtained from 72 of the founders of an intensely phenotyped experimental swine herd at the U.S. Meat Animal Research Center (USMARC). These animals included all 24 of the founding boars (12 Duroc and 12 Landrace) and 48 Yorkshire-Landrace composite sows. Sequence reads were mapped to the Sscrofa10.2 genome build, resulting in a mean of 6.1 fold (×) coverage per genome. A total of 22 342 915 high confidence SNPs were identified from the sequenced genomes. These included 21 million previously reported SNPs and 79% of the 62 163 SNPs on the PorcineSNP60 BeadChip assay. Variation was detected in the coding sequence or untranslated regions (UTRs) of 87.8% of the genes in the porcine genome: loss-of-function variants were predicted in 504 genes, 10 202 genes contained nonsynonymous variants, 10 773 had variation in UTRs and 13 010 genes contained synonymous variants. Approximately 139 000 SNPs were classified as loss-of-function, nonsynonymous or regulatory, which suggests that over 99% of the variation detected in our pigs could potentially be ignored, allowing us to focus on a much smaller number of functional SNPs during future analyses. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.
Kim, Junho; Maeng, Ju Heon; Lim, Jae Seok; Son, Hyeonju; Lee, Junehawk; Lee, Jeong Ho; Kim, Sangwoo
2016-10-15
Advances in sequencing technologies have remarkably lowered the detection limit of somatic variants to a low frequency. However, calling mutations at this range is still confounded by many factors including environmental contamination. Vector contamination is a continuously occurring issue and is especially problematic since vector inserts are hardly distinguishable from the sample sequences. Such inserts, which may harbor polymorphisms and engineered functional mutations, can result in calling false variants at corresponding sites. Numerous vector-screening methods have been developed, but none could handle contamination from inserts because they are focusing on vector backbone sequences alone. We developed a novel method-Vecuum-that identifies vector-originated reads and resultant false variants. Since vector inserts are generally constructed from intron-less cDNAs, Vecuum identifies vector-originated reads by inspecting the clipping patterns at exon junctions. False variant calls are further detected based on the biased distribution of mutant alleles to vector-originated reads. Tests on simulated and spike-in experimental data validated that Vecuum could detect 93% of vector contaminants and could remove up to 87% of variant-like false calls with 100% precision. Application to public sequence datasets demonstrated the utility of Vecuum in detecting false variants resulting from various types of external contamination. Java-based implementation of the method is available at http://vecuum.sourceforge.net/ CONTACT: swkim@yuhs.acSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Feline hypersomatotropism and acromegaly tumorigenesis: a potential role for the AIP gene.
Scudder, C J; Niessen, S J; Catchpole, B; Fowkes, R C; Church, D B; Forcada, Y
2017-04-01
Acromegaly in humans is usually sporadic, however up to 20% of familial isolated pituitary adenomas are caused by germline sequence variants of the aryl-hydrocarbon-receptor interacting protein (AIP) gene. Feline acromegaly has similarities to human acromegalic families with AIP mutations. The aim of this study was to sequence the feline AIP gene, identify sequence variants and compare the AIP gene sequence between feline acromegalic and control cats, and in acromegalic siblings. The feline AIP gene was amplified through PCR using whole blood genomic DNA from 10 acromegalic and 10 control cats, and 3 sibling pairs affected by acromegaly. PCR products were sequenced and compared with the published predicted feline AIP gene. A single nonsynonymous SNP was identified in exon 1 (AIP:c.9T > G) of two acromegalic cats and none of the control cats, as well as both members of one sibling pair. The region of this SNP is considered essential for the interaction of the AIP protein with its receptor. This sequence variant has not previously been reported in humans. Two additional synonymous sequence variants were identified (AIP:c.481C > T and AIP:c.826C > T). This is the first molecular study to investigate a potential genetic cause of feline acromegaly and identified a nonsynonymous AIP single nucleotide polymorphism in 20% of the acromegalic cat population evaluated, as well as in one of the sibling pairs evaluated. Copyright © 2016 Elsevier Inc. All rights reserved.
Jang, Su; Lee, Yunjoo; Lee, Gileung; Seo, Jeonghwan; Lee, Dongryung; Yu, Yoye; Chin, Joong Hyoun; Koh, Hee-Jong
2018-01-15
Balancing panicle-related traits such as panicle length and the numbers of primary and secondary branches per panicle, is key to improving the number of spikelets per panicle in rice. Identifying genetic information contributes to a broader understanding of the roles of gene and provides candidate alleles for use as DNA markers. Discovering relations between panicle-related traits and sequence variants allows opportunity for molecular application in rice breeding to improve the number of spikelets per panicle. In total, 142 polymorphic sites, which constructed 58 haplotypes, were detected in coding regions of ten panicle development gene and 35 sequence variants in six genes were significantly associated with panicle-related traits. Rice cultivars were clustered according to their sequence variant profiles. One of the four resultant clusters, which contained only indica and tong-il varieties, exhibited the largest average number of favorable alleles and highest average number of spikelets per panicle, suggesting that the favorable allele combination found in this cluster was beneficial in increasing the number of spikelets per panicle. Favorable alleles identified in this study can be used to develop functional markers for rice breeding programs. Furthermore, stacking several favorable alleles has the potential to substantially improve the number of spikelets per panicle in rice.
Suyama, Yoshihisa; Matsuki, Yu
2015-01-01
Restriction-enzyme (RE)-based next-generation sequencing methods have revolutionized marker-assisted genetic studies; however, the use of REs has limited their widespread adoption, especially in field samples with low-quality DNA and/or small quantities of DNA. Here, we developed a PCR-based procedure to construct reduced representation libraries without RE digestion steps, representing de novo single-nucleotide polymorphism discovery, and its genotyping using next-generation sequencing. Using multiplexed inter-simple sequence repeat (ISSR) primers, thousands of genome-wide regions were amplified effectively from a wide variety of genomes, without prior genetic information. We demonstrated: 1) Mendelian gametic segregation of the discovered variants; 2) reproducibility of genotyping by checking its applicability for individual identification; and 3) applicability in a wide variety of species by checking standard population genetic analysis. This approach, called multiplexed ISSR genotyping by sequencing, should be applicable to many marker-assisted genetic studies with a wide range of DNA qualities and quantities. PMID:26593239
Boldogköi, Zsolt
2004-09-01
Population genetics, the mathematical theory of modern evolutionary biology, defines evolution as the alteration of the frequency of distinct gene variants (alleles) differing in fitness over the time. The major problem with this view is that in gene and protein sequences we can find little evidence concerning the molecular basis of phenotypic variance, especially those that would confer adaptive benefit to the bearers. Some novel data, however, suggest that a large amount of genetic variation exists in the regulatory region of genes within populations. In addition, comparison of homologous DNA sequences of various species shows that evolution appears to depend more strongly on gene expression than on the genes themselves. Furthermore, it has been demonstrated in several systems that genes form functional networks, whose products exhibit interrelated expression profiles. Finally, it has been found that regulatory circuits of development behave as evolutionary units. These data demonstrate that our view of evolution calls for a new synthesis. In this article I propose a novel concept, termed the selfish gene network hypothesis, which is based on an overall consideration of the above findings. The major statements of this hypothesis are as follows. (1) Instead of individual genes, gene networks (GNs) are responsible for the determination of traits and behaviors. (2) The primary source of microevolution is the intraspecific polymorphism in GNs and not the allelic variation in either the coding or the regulatory sequences of individual genes. (3) GN polymorphism is generated by the variation in the regulatory regions of the component genes and not by the variance in their coding sequences. (4) Evolution proceeds through continuous restructuring of the composition of GNs rather than fixing of specific alleles or GN variants.
Endothelial nitric oxide synthase polymorphism and prognosis in systolic heart failure patients.
Azzam, Naiel; Zafrir, Barak; Fares, Fuad; Smith, Yoav; Salman, Nabeeh; Nevzorov, Roman; Amir, Offer
2015-05-01
The endothelial nitric oxide synthase (eNOS) gene single nucleotide polymorphism G894T is associated with thrombotic vascular diseases. However, its functional significance is controversial and data are scarce concerning its influence in heart failure (HF). We studied 215 patients with chronic systolic HF. DNA was analyzed for eNOS gene G894T polymorphism using PCR and DNA sequencing. Evaluation of clinical characteristics and analysis of factors associated with 2-year mortality were performed for the homozygous G-allele G894T variant (GG), relative to the TT and GT variants. The genotype distributions of eNOS G894T alleles were: GG 135 patients (63%) and TT/GT 80 (37%). Two-year mortality was significantly higher in the GG variant (48%) than the combined TT/GT group (32%). The usage of nitrates was associated with increased 2-year mortality (HR 2.0, 95% CI 1.28-3.17; p = 0.003), which was most significant in the GG group treated with nitrates (73.5%) in comparison to the TT/GT group not treated with nitrates (34%); HR 2.75, 95% CI 1.57-4.79, P < 0.001. Homozygosity for the G allele of the eNOS G894T polymorphism was associated with worse survival in systolic HF patients, especially in those treated with nitrates. ENOS polymorphism may result in different mechanistic interactions in HF than in thrombotic vascular diseases, suggesting that overexpression of NO may be associated with deleterious effects in systolic HF. Copyright © 2015 Elsevier Inc. All rights reserved.
A Novel Center Star Multiple Sequence Alignment Algorithm Based on Affine Gap Penalty and K-Band
NASA Astrophysics Data System (ADS)
Zou, Quan; Shan, Xiao; Jiang, Yi
Multiple sequence alignment is one of the most important topics in computational biology, but it cannot deal with the large data so far. As the development of copy-number variant(CNV) and Single Nucleotide Polymorphisms(SNP) research, many researchers want to align numbers of similar sequences for detecting CNV and SNP. In this paper, we propose a novel multiple sequence alignment algorithm based on affine gap penalty and k-band. It can align more quickly and accurately, that will be helpful for mining CNV and SNP. Experiments prove the performance of our algorithm.
Germline sequence variants in TGM3 and RGS22 confer risk of basal cell carcinoma
Stacey, Simon N.; Sulem, Patrick; Gudbjartsson, Daniel F.; Jonasdottir, Aslaug; Thorleifsson, Gudmar; Gudjonsson, Sigurjon A.; Masson, Gisli; Gudmundsson, Julius; Sigurgeirsson, Bardur; Benediktsdottir, Kristrun R.; Thorisdottir, Kristin; Ragnarsson, Rafn; Fuentelsaz, Victoria; Corredera, Cristina; Grasa, Matilde; Planelles, Dolores; Sanmartin, Onofre; Rudnai, Peter; Gurzau, Eugene; Koppova, Kvetoslava; Hemminki, Kari; Nexø, Bjørn A; Tjønneland, Anne; Overvad, Kim; Johannsdottir, Hrefna; Helgadottir, Hafdis T.; Thorsteinsdottir, Unnur; Kong, Augustine; Vogel, Ulla; Kumar, Rajiv; Nagore, Eduardo; Mayordomo, José I.; Rafnar, Thorunn; Olafsson, Jon H.; Stefansson, Kari
2014-01-01
To search for new sequence variants that confer risk of cutaneous basal cell carcinoma (BCC), we conducted a genome-wide association study of 38.5 million single nucleotide polymorphisms (SNPs) and small indels identified through whole-genome sequencing of 2230 Icelanders. We imputed genotypes for 4208 BCC patients and 109 408 controls using Illumina SNP chip typing data, carried out association tests and replicated the findings in independent population samples. We found new BCC susceptibility loci at TGM3 (rs214782[G], P = 5.5 × 10−17, OR = 1.29) and RGS22 (rs7006527[C], P = 8.7 × 10−13, OR = 0.77). TGM3 encodes transglutaminase type 3, which plays a key role in production of the cornified envelope during epidermal differentiation. PMID:24403052
Common variants of the EPDR1 gene and the risk of Dupuytren’s disease.
Dębniak, T; Żyluk, A; Puchalski, P; Serrano-Fernandez, P
2013-10-01
The object of this study was the investigation of 3 common variants of single nucleotide polymorphisms of the ependymin-related gene 1 and its association with the occurrence of Dupuytren's disease. DNA samples were obtained from the peripheral blood of 508 consecutive patients. The control group comprised 515 healthy adults who were age-matched with the Dupuytren's patients. 3 common variants were analysed using TaqMan® genotyping assays and sequencing. The differences in the frequencies of variants of single nucleotide polymorphisms in patients and the control group were statistically tested. Additionally, haplotype frequency and linkage disequilibrium were analysed for these variants. A statistically significant association was noted between rs16879765_CT, rs16879765_TT and rs13240429_AA variants and Dupuytren's disease. 2 haplotypes: rs2722280_C+rs13240429_A+rs16879765_C and rs2722280_C+rs13240429_G+rs16879765_T were found to be statistically significantly associated with Dupuytren's disease. Moreover, we found that rs13240429 and rs16879765 variants were in strong linkage disequilibrium, while rs2722280 was only in moderate linkage disequilibrium. No significant differences were found in the frequencies of the variants of the gene between the groups with a positive and negative familial history of Dupuytren's disease. In conclusion, results of this study suggest that EPDR1 gene can be added to a growing list of genes associated with Dupuytren's disease development. © Georg Thieme Verlag KG Stuttgart · New York.
Uncommon Pathways of Immune Escape Attenuate HIV-1 Integrase Replication Capacity
Chopera, Denis R.; Olvera, Alex; Brumme, Chanson J.; Sela, Jennifer; Markle, Tristan J.; Martin, Eric; Carlson, Jonathan M.; Le, Anh Q.; McGovern, Rachel; Cheung, Peter K.; Kelleher, Anthony D.; Jessen, Heiko; Markowitz, Martin; Rosenberg, Eric; Frahm, Nicole; Sanchez, Jorge; Mallal, Simon; John, Mina; Harrigan, P. Richard; Heckerman, David; Brander, Christian; Walker, Bruce D.; Brumme, Zabrina L.
2012-01-01
An attenuation of the HIV-1 replication capacity (RC) has been observed for immune-mediated escape mutations in Gag restricted by protective HLA alleles. However, the extent to which escape mutations affect other viral proteins during natural infection is not well understood. We generated recombinant viruses encoding plasma HIV-1 RNA integrase sequences from antiretroviral-naïve individuals with early (n = 88) and chronic (n = 304) infections and measured the in vitro RC of each. In contrast to data from previous studies of Gag, we observed little evidence that host HLA allele expression was associated with integrase RC. A modest negative correlation was observed between the number of HLA-B-associated integrase polymorphisms and RC in chronic infection (R = −0.2; P = 0.003); however, this effect was not driven by mutations restricted by protective HLA alleles. Notably, the integrase variants S119R, G163E, and I220L, which represent uncommon polymorphisms associated with HLA-C*05, -A*33, and -B*52, respectively, correlated with lower RC (all q < 0.2). We identified a novel C*05-restricted epitope (HTDNGSNF114–121) that likely contributes to the selection of the S119R variant, the polymorphism most significantly associated with lower RC in patient sequences. An NL4-3 mutant encoding the S119R polymorphism displayed a ∼35%-reduced function that was rescued by a single compensatory mutation of A91E. Together, these data indicate that substantial HLA-driven attenuation of integrase is not a general phenomenon during HIV-1 adaptation to host immunity. However, uncommon polymorphisms selected by HLA alleles that are not conventionally regarded to be protective may be associated with impaired protein function. Vulnerable epitopes in integrase might therefore be considered for future vaccine strategies. PMID:22496233
Uncommon pathways of immune escape attenuate HIV-1 integrase replication capacity.
Brockman, Mark A; Chopera, Denis R; Olvera, Alex; Brumme, Chanson J; Sela, Jennifer; Markle, Tristan J; Martin, Eric; Carlson, Jonathan M; Le, Anh Q; McGovern, Rachel; Cheung, Peter K; Kelleher, Anthony D; Jessen, Heiko; Markowitz, Martin; Rosenberg, Eric; Frahm, Nicole; Sanchez, Jorge; Mallal, Simon; John, Mina; Harrigan, P Richard; Heckerman, David; Brander, Christian; Walker, Bruce D; Brumme, Zabrina L
2012-06-01
An attenuation of the HIV-1 replication capacity (RC) has been observed for immune-mediated escape mutations in Gag restricted by protective HLA alleles. However, the extent to which escape mutations affect other viral proteins during natural infection is not well understood. We generated recombinant viruses encoding plasma HIV-1 RNA integrase sequences from antiretroviral-naïve individuals with early (n = 88) and chronic (n = 304) infections and measured the in vitro RC of each. In contrast to data from previous studies of Gag, we observed little evidence that host HLA allele expression was associated with integrase RC. A modest negative correlation was observed between the number of HLA-B-associated integrase polymorphisms and RC in chronic infection (R = -0.2; P = 0.003); however, this effect was not driven by mutations restricted by protective HLA alleles. Notably, the integrase variants S119R, G163E, and I220L, which represent uncommon polymorphisms associated with HLA-C*05, -A*33, and -B*52, respectively, correlated with lower RC (all q < 0.2). We identified a novel C*05-restricted epitope (HTDNGSNF(114-121)) that likely contributes to the selection of the S119R variant, the polymorphism most significantly associated with lower RC in patient sequences. An NL4-3 mutant encoding the S119R polymorphism displayed a ~35%-reduced function that was rescued by a single compensatory mutation of A91E. Together, these data indicate that substantial HLA-driven attenuation of integrase is not a general phenomenon during HIV-1 adaptation to host immunity. However, uncommon polymorphisms selected by HLA alleles that are not conventionally regarded to be protective may be associated with impaired protein function. Vulnerable epitopes in integrase might therefore be considered for future vaccine strategies.
Tria, Antje; Hiort, Olaf; Sinnecker, Gernot H G
2004-01-01
Defects in the steroid 5alpha-reductase type 2 (SRD5A2) activity cause decreased formation of dihydrotestosterone (DHT) from testosterone (T), resulting in defective masculinization of external genitalia; the T/DHT ratio is increased. We investigated 10 patients with elevated T/DHT ratios in whom mutations in the SRD5A2 and AR genes had been excluded to find out whether structural alterations of the SRD5A1 gene could contribute to their genital malformations. Single-strand conformation polymorphism analysis and direct sequencing were used to detect variations in the SRD5A1 gene of the patients and of 49 adult fertile men who served as controls. The sequence analysis of exon 3 of the SRD5A1 gene indicated an adenine-to-guanine change (ACA vs. ACG), both triplets encoding the amino acid residue threonine. The ACG sequence was detected in 57% of all subjects and was equally distributed in patients and controls. The T/DHT ratio was significantly higher in controls with the ACG variant as compared with those having the ACA variant. However, no particular sequence aberration was found in the SRD5A1 genes of either group. Mutant SRD5A1 isoenzyme does not seem to play a crucial role in the development of hypospadias. Copyright 2004 S. Karger AG, Basel
Edwards, Stefan M.; Sørensen, Izel F.; Sarup, Pernille; Mackay, Trudy F. C.; Sørensen, Peter
2016-01-01
Predicting individual quantitative trait phenotypes from high-resolution genomic polymorphism data is important for personalized medicine in humans, plant and animal breeding, and adaptive evolution. However, this is difficult for populations of unrelated individuals when the number of causal variants is low relative to the total number of polymorphisms and causal variants individually have small effects on the traits. We hypothesized that mapping molecular polymorphisms to genomic features such as genes and their gene ontology categories could increase the accuracy of genomic prediction models. We developed a genomic feature best linear unbiased prediction (GFBLUP) model that implements this strategy and applied it to three quantitative traits (startle response, starvation resistance, and chill coma recovery) in the unrelated, sequenced inbred lines of the Drosophila melanogaster Genetic Reference Panel. Our results indicate that subsetting markers based on genomic features increases the predictive ability relative to the standard genomic best linear unbiased prediction (GBLUP) model. Both models use all markers, but GFBLUP allows differential weighting of the individual genetic marker relationships, whereas GBLUP weighs the genetic marker relationships equally. Simulation studies show that it is possible to further increase the accuracy of genomic prediction for complex traits using this model, provided the genomic features are enriched for causal variants. Our GFBLUP model using prior information on genomic features enriched for causal variants can increase the accuracy of genomic predictions in populations of unrelated individuals and provides a formal statistical framework for leveraging and evaluating information across multiple experimental studies to provide novel insights into the genetic architecture of complex traits. PMID:27235308
Constable, Fiona E.; Nancarrow, Narelle; Plummer, Kim M.; Rodoni, Brendan
2017-01-01
PCR amplicon next generation sequencing (NGS) analysis offers a broadly applicable and targeted approach to detect populations of both high- or low-frequency virus variants in one or more plant samples. In this study, amplicon NGS was used to explore the diversity of the tripartite genome virus, Prunus necrotic ringspot virus (PNRSV) from 53 PNRSV-infected trees using amplicons from conserved gene regions of each of PNRSV RNA1, RNA2 and RNA3. Sequencing of the amplicons from 53 PNRSV-infected trees revealed differing levels of polymorphism across the three different components of the PNRSV genome with a total number of 5040, 2083 and 5486 sequence variants observed for RNA1, RNA2 and RNA3 respectively. The RNA2 had the lowest diversity of sequences compared to RNA1 and RNA3, reflecting the lack of flexibility tolerated by the replicase gene that is encoded by this RNA component. Distinct PNRSV phylo-groups, consisting of closely related clusters of sequence variants, were observed in each of PNRSV RNA1, RNA2 and RNA3. Most plant samples had a single phylo-group for each RNA component. Haplotype network analysis showed that smaller clusters of PNRSV sequence variants were genetically connected to the largest sequence variant cluster within a phylo-group of each RNA component. Some plant samples had sequence variants occurring in multiple PNRSV phylo-groups in at least one of each RNA and these phylo-groups formed distinct clades that represent PNRSV genetic strains. Variants within the same phylo-group of each Prunus plant sample had ≥97% similarity and phylo-groups within a Prunus plant sample and between samples had less ≤97% similarity. Based on the analysis of diversity, a definition of a PNRSV genetic strain was proposed. The proposed definition was applied to determine the number of PNRSV genetic strains in each of the plant samples and the complexity in defining genetic strains in multipartite genome viruses was explored. PMID:28632759
Identification and expression analysis of cDNA encoding insulin-like growth factor 2 in horses
KIKUCHI, Kohta; SASAKI, Keisuke; AKIZAWA, Hiroki; TSUKAHARA, Hayato; BAI, Hanako; TAKAHASHI, Masashi; NAMBO, Yasuo; HATA, Hiroshi; KAWAHARA, Manabu
2017-01-01
Insulin-like growth factor 2 (IGF2) is responsible for a broad range of physiological processes during fetal development and adulthood, but genomic analyses of IGF2 containing the 5ʹ- and 3ʹ-untranslated regions (UTRs) in equines have been limited. In this study, we characterized the IGF2 mRNA containing the UTRs, and determined its expression pattern in the fetal tissues of horses. The complete equine IGF2 mRNA sequence harboring another exon approximately 2.8 kb upstream from the canonical transcription start site was identified as a new transcript variant. As this upstream exon did not contain the start codon, the amino acid sequence was identical to the canonical variant. Analysis of the deduced amino acid sequence revealed that the protein possessed two major domains, IlGF and IGF2_C, and analysis of IGF2 sequence polymorphism in fetal tissues of Hokkaido native horse and Thoroughbreds revealed a single nucleotide polymorphism (T to C transition) at position 398 in Thoroughbreds, which caused an amino acid substitution at position 133 in the IGF2 sequence. Furthermore, the expression pattern of the IGF2 mRNA in the fetal tissues of horses was determined for the first time, and was found to be consistent with those of other species. Taken together, these results suggested that the transcriptional and translational products of the IGF2 gene have conserved functions in the fetal development of mammals, including horses. PMID:29151450
Single nucleotide polymorphisms associated with nonsyndromic cryptorchidism in Mexican patients.
Chávez-Saldaña, M; Vigueras-Villaseñor, R M; Yokoyama-Rebollar, E; Landero-Huerta, D A; Rojas-Castañeda, J C; Taja-Chayeb, L; Cuevas-Alpuche, J O; Zambrano, E
2018-02-01
Cryptorchidism is a frequent genitourinary malformation considered as an important risk factor for infertility and testicular malignancy. The aetiology of cryptorchidism is multifactorial in which certain SNPs, capable of inhibiting the development of the gubernaculum, are implicated. We analysed 16 SNPs by allelic discrimination and automated sequencing in 85 patients and 99 healthy people, with the objective to identify the association between these variants and isolated cryptorchidism. In two different patients with unilateral cryptorchidism, we found the variants rs121912556 and p.R105R of INSL3 gene in a heterozygous form associated with cryptorchidism, so we could considered them as risk factors for cryptorchidism. On the other hand, SNPs rs10421916 of INSL3 gene, as well as the variants rs1555633 and rs7325513 in the RXFP2 gene, and rs3779456 variant of the HOXA10 gene were statistically significant, when the patients and controls were compared and could be considered as protective factors since are predominantly present in controls. The genotype-phenotype correlation did not show statistical significance. With these results, we could conclude that these polymorphisms can be considered as important variants in our population and would contribute in the future knowledge of the aetiology and physiopathology of cryptorchidism. © 2017 Blackwell Verlag GmbH.
GSTM1, GSTP1, and GSTT1 genetic variability in Turkish and worldwide populations.
Karaca, Sefayet; Karaca, Mehmet; Cesuroglu, Tomris; Erge, Sema; Polimanti, Renato
2015-01-01
Glutathione S-transferase (GST) variants have been widely investigated to better understand their role in several pathologic conditions. To our knowledge, no data about these genetic polymorphisms within the Turkish population are currently available. The aim of this study was to analyze GSTM1 positive/null, GSTT1 positive/null, GSTP1*I105V (rs1695), and GSTP1*A114V (rs1138272) variants in the general Turkish population, to provide information about its genetic diversity, and predisposition to GST-related diseases. Genotyping was performed in 500 Turkish individuals using the Sequenom MassARRAY platform. A comparative analysis was executed using the data from the HapMap and Human Genome Diversity Projects (HGDP). Sequence variation was deeply explored using the Phase 1 data of the 1,000 Genomes Project. The variability of GSTM1, GSTT1, and GSTP1 polymorphisms in the Turkish population was similar to that observed in Central Asian, European, and Middle Eastern populations. The high linkage disequilibrium between GSTP1*I105V and GSTP1*A114V in these populations may have a confounding effect on GSTP1 genetic association studies. In analyzing GSTM1, GSTT1, and GSTP1 sequence variation, we observed other common functional variants that may be candidates for associated studies of diseases related to GST genes (e.g., cancer, cardiovascular disease, and allergy). This study provides novel data about GSTM1 positive/null, GSTT1 positive/null, GSTP1*I105V, and GSTP1*A114V variants in the Turkish population, and other functional variants that may affect GSTM1, GSTT1, and GSTP1 functions among worldwide populations. This information can assist in the design of future genetic association studies investigating oxidative stress-related diseases. © 2014 Wiley Periodicals, Inc.
Association studies on ghrelin and ghrelin receptor gene polymorphisms with obesity.
Gueorguiev, Maria; Lecoeur, Cécile; Meyre, David; Benzinou, Michael; Mein, Charles A; Hinney, Anke; Vatin, Vincent; Weill, Jacques; Heude, Barbara; Hebebrand, Johannes; Grossman, Ashley B; Korbonits, Márta; Froguel, Philippe
2009-04-01
Ghrelin exerts a stimulatory effect on appetite and regulates energy homeostasis. Ghrelin gene variants have been shown to be associated with metabolic traits, although there is evidence suggesting linkage and association with obesity and the ghrelin receptor (GHSR). We hypothesized that these genes are good candidates for susceptibility to obesity. Direct sequencing identified 12 ghrelin single-nucleotide polymorphisms (SNPs) and 8 GHSR SNPs. The 10 common SNPs were genotyped in 1,275 obese subjects and in 1,059 subjects from a general population cohort of European origin. In the obesity case-control study, the GHSR SNP rs572169 was found to be associated with obesity (P = 0.007 in additive model, P = 0.001 in dominant model, odds ratio (OR) 1.73, 95% confidence interval (1.23-2.44)). The ghrelin variant, g.A265T (rs4684677), showed an association with obesity (P = 0.009, BMI adjusted for age and sex) in obese families. The ghrelin variant, g.A-604G (rs27647), showed an association with insulin levels at 2-h post-oral glucose tolerance test (OGTT) (P = 0.009) in obese families. We found an association between the eating behavior "overeating" and the GHSR SNP rs2232169 (P = 0.02) in obese subjects. However, none of these associations remained significant when corrected for multiple comparisons. Replication of the nominal associations with obesity could not be confirmed in a German genome-wide association (GWA) study for rs4684677 and rs572169 polymorphisms. Our data suggest that common polymorphisms in ghrelin and its receptor genes are not major contributors to the development of polygenic obesity, although common variants may alter body weight and eating behavior and contribute to insulin resistance, in particular in the context of early-onset obesity.
Isolation and characterization of polymorphic microsatellite markers for blue fox (Alopex lagopus).
Li, Y M; Guo, P C; Lu, J Y; Bai, C Y; Zhao, Z H; Yan, S Q
2016-06-03
The blue fox, belonging to the family Canidae, is a coat color variant of the native arctic fox (Alopex lagopus). To date, microsatellite loci in blue fox are typically amplified using canine simple sequence repeat primers. In the present study, we constructed an (AC)n enrichment library, and isolated and identified 17 polymorphic microsatellite markers for blue fox. The number of alleles per locus is from two to seven based on 24 examined individuals. The expected and observed heterozygosities were in the range of 0.3112 to 0.8236 and 0.2917 to 0.8750, respectively. The polymorphic information content per locus ranged from 0.2583 to 0.8022. These polymorphic markers can be useful for future population genetic studies of both farmed blue foxes and wild arctic foxes.
Deep whole-genome sequencing of 100 southeast Asian Malays.
Wong, Lai-Ping; Ong, Rick Twee-Hee; Poh, Wan-Ting; Liu, Xuanyao; Chen, Peng; Li, Ruoying; Lam, Kevin Koi-Yau; Pillai, Nisha Esakimuthu; Sim, Kar-Seng; Xu, Haiyan; Sim, Ngak-Leng; Teo, Shu-Mei; Foo, Jia-Nee; Tan, Linda Wei-Lin; Lim, Yenly; Koo, Seok-Hwee; Gan, Linda Seo-Hwee; Cheng, Ching-Yu; Wee, Sharon; Yap, Eric Peng-Huat; Ng, Pauline Crystal; Lim, Wei-Yen; Soong, Richie; Wenk, Markus Rene; Aung, Tin; Wong, Tien-Yin; Khor, Chiea-Chuen; Little, Peter; Chia, Kee-Seng; Teo, Yik-Ying
2013-01-10
Whole-genome sequencing across multiple samples in a population provides an unprecedented opportunity for comprehensively characterizing the polymorphic variants in the population. Although the 1000 Genomes Project (1KGP) has offered brief insights into the value of population-level sequencing, the low coverage has compromised the ability to confidently detect rare and low-frequency variants. In addition, the composition of populations in the 1KGP is not complete, despite the fact that the study design has been extended to more than 2,500 samples from more than 20 population groups. The Malays are one of the Austronesian groups predominantly present in Southeast Asia and Oceania, and the Singapore Sequencing Malay Project (SSMP) aims to perform deep whole-genome sequencing of 100 healthy Malays. By sequencing at a minimum of 30× coverage, we have illustrated the higher sensitivity at detecting low-frequency and rare variants and the ability to investigate the presence of hotspots of functional mutations. Compared to the low-pass sequencing in the 1KGP, the deeper coverage allows more functional variants to be identified for each person. A comparison of the fidelity of genotype imputation of Malays indicated that a population-specific reference panel, such as the SSMP, outperforms a cosmopolitan panel with larger number of individuals for common SNPs. For lower-frequency (<5%) markers, a larger number of individuals might have to be whole-genome sequenced so that the accuracy currently afforded by the 1KGP can be achieved. The SSMP data are expected to be the benchmark for evaluating the value of deep population-level sequencing versus low-pass sequencing, especially in populations that are poorly represented in population-genetics studies. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Deep Whole-Genome Sequencing of 100 Southeast Asian Malays
Wong, Lai-Ping; Ong, Rick Twee-Hee; Poh, Wan-Ting; Liu, Xuanyao; Chen, Peng; Li, Ruoying; Lam, Kevin Koi-Yau; Pillai, Nisha Esakimuthu; Sim, Kar-Seng; Xu, Haiyan; Sim, Ngak-Leng; Teo, Shu-Mei; Foo, Jia-Nee; Tan, Linda Wei-Lin; Lim, Yenly; Koo, Seok-Hwee; Gan, Linda Seo-Hwee; Cheng, Ching-Yu; Wee, Sharon; Yap, Eric Peng-Huat; Ng, Pauline Crystal; Lim, Wei-Yen; Soong, Richie; Wenk, Markus Rene; Aung, Tin; Wong, Tien-Yin; Khor, Chiea-Chuen; Little, Peter; Chia, Kee-Seng; Teo, Yik-Ying
2013-01-01
Whole-genome sequencing across multiple samples in a population provides an unprecedented opportunity for comprehensively characterizing the polymorphic variants in the population. Although the 1000 Genomes Project (1KGP) has offered brief insights into the value of population-level sequencing, the low coverage has compromised the ability to confidently detect rare and low-frequency variants. In addition, the composition of populations in the 1KGP is not complete, despite the fact that the study design has been extended to more than 2,500 samples from more than 20 population groups. The Malays are one of the Austronesian groups predominantly present in Southeast Asia and Oceania, and the Singapore Sequencing Malay Project (SSMP) aims to perform deep whole-genome sequencing of 100 healthy Malays. By sequencing at a minimum of 30× coverage, we have illustrated the higher sensitivity at detecting low-frequency and rare variants and the ability to investigate the presence of hotspots of functional mutations. Compared to the low-pass sequencing in the 1KGP, the deeper coverage allows more functional variants to be identified for each person. A comparison of the fidelity of genotype imputation of Malays indicated that a population-specific reference panel, such as the SSMP, outperforms a cosmopolitan panel with larger number of individuals for common SNPs. For lower-frequency (<5%) markers, a larger number of individuals might have to be whole-genome sequenced so that the accuracy currently afforded by the 1KGP can be achieved. The SSMP data are expected to be the benchmark for evaluating the value of deep population-level sequencing versus low-pass sequencing, especially in populations that are poorly represented in population-genetics studies. PMID:23290073
Ribeiro, Antonio; Golicz, Agnieszka; Hackett, Christine Anne; Milne, Iain; Stephen, Gordon; Marshall, David; Flavell, Andrew J; Bayer, Micha
2015-11-11
Single Nucleotide Polymorphisms (SNPs) are widely used molecular markers, and their use has increased massively since the inception of Next Generation Sequencing (NGS) technologies, which allow detection of large numbers of SNPs at low cost. However, both NGS data and their analysis are error-prone, which can lead to the generation of false positive (FP) SNPs. We explored the relationship between FP SNPs and seven factors involved in mapping-based variant calling - quality of the reference sequence, read length, choice of mapper and variant caller, mapping stringency and filtering of SNPs by read mapping quality and read depth. This resulted in 576 possible factor level combinations. We used error- and variant-free simulated reads to ensure that every SNP found was indeed a false positive. The variation in the number of FP SNPs generated ranged from 0 to 36,621 for the 120 million base pairs (Mbp) genome. All of the experimental factors tested had statistically significant effects on the number of FP SNPs generated and there was a considerable amount of interaction between the different factors. Using a fragmented reference sequence led to a dramatic increase in the number of FP SNPs generated, as did relaxed read mapping and a lack of SNP filtering. The choice of reference assembler, mapper and variant caller also significantly affected the outcome. The effect of read length was more complex and suggests a possible interaction between mapping specificity and the potential for contributing more false positives as read length increases. The choice of tools and parameters involved in variant calling can have a dramatic effect on the number of FP SNPs produced, with particularly poor combinations of software and/or parameter settings yielding tens of thousands in this experiment. Between-factor interactions make simple recommendations difficult for a SNP discovery pipeline but the quality of the reference sequence is clearly of paramount importance. Our findings are also a stark reminder that it can be unwise to use the relaxed mismatch settings provided as defaults by some read mappers when reads are being mapped to a relatively unfinished reference sequence from e.g. a non-model organism in its early stages of genomic exploration.
Screening of Variations in CD22 Gene in Children with B-Precursor Acute Lymphoblastic Leukemia.
Aslar Oner, Deniz; Akin, Dilara Fatma; Sipahi, Kadir; Mumcuoglu, Mine; Ezer, Ustun; Kürekci, A Emin; Akar, Nejat
2016-09-01
CD22 is expressed on the surface of B-cell lineage cells from the early progenitor stage of pro-B cell until terminal differentiation to mature B cells. It plays a role in signal transduction and as a regulator of B-cell receptor signaling in B-cell development. We aimed to screen exons 9-14 of the CD22 gene, which is a mutational hot spot region in B-precursor acute lymphoblastic leukemia (pre-B ALL) patients, to find possible genetic variants that could play role in the pathogenesis of pre-B ALL in Turkish children. This study included 109 Turkish children with pre-B ALL who were diagnosed at Losante Hospital for Children with Leukemia. Genomic DNA was extracted from both peripheral blood and bone marrow leukocytes. Gene amplification was performed with PCR, and all samples were screened for the variants by single strand conformation polymorphism. Samples showing band shifts were sequenced on an automated sequencer. In our patient group a total of 9 variants were identified in the CD22 gene by sequencing: a novel variant in intron 10 (T2199G); a missense variant in exon 12; 5 intronic variants between exon 12 and intron 13; a novel intronic variant (C2424T); and a synonymous in exon 13. Thirteen of 109 children (11.9%) carried the T2199G novel intronic variant located in intron 10, and 17 of 109 children (15.6%) carried the C2424T novel intronic variant. Novel variants in the CD22 gene in children with pre-B ALL in Turkey that are not present, in the Human Gene Mutation Database or NCBI SNP database, were found.
[Genetic variants in miRNAs and its association with breast cancer].
Méndez-Gómez, Susana; Ruiz Esparza-Garrido, Ruth; Velázquez-Flores, Miguel; Dolores-Vergara, Maria; Salamanca-Gómez, Fabio; Arenas-Aranda, Diego Julio
2014-01-01
In Mexico, breast cancer represents the first cause of cancer death in females. At the molecular level, non-coding RNAs and especially microRNAs have played an important role in the origin and development of this neoplasm In the Anglo-Saxon population, diverse genetic variants in microRNA genes and in their targets are associated with the development of this disease. In the Mexican population it is not known if these or other variants exist. Identification of these or new variants in our population is fundamental in order to have a better understanding of cancer development and to help establish a better diagnostic strategy. DNA was isolated from mammary tumors, adjacent tissue and peripheral blood of Mexican females with or without cancer. From DNA, five microRNA genes and three of their targets were amplified and sequenced. Genetic variants associated with breast cancer in an Anglo- Saxon population have been previously identified in these sequences. In the samples studied we identified seven single nucleotide polymorphisms (SNPs). Two had not been previously described and were identified only in women with cancer. The new variants may be genetic predisposition factors for the development of breast cancer in our population. Further experiments are needed to determine the involvement of these variants in the development, establishment and progression of breast cancer.
Norman, Paul J.; Parham, Peter
2012-01-01
Pinnipeds, marine carnivores, diverged from terrestrial carnivores ~45 million years ago, before their adaptation to marine environments. This lifestyle change exposed pinnipeds to different microbiota and pathogens, with probable impact on their MHC class I genes. Investigating this question, genomic sequences were determined for 71 MHC class I variants: 27 from harbor seal and 44 from gray seal. These variants form three MHC class I gene lineages, one comprising a pseudogene. The second, a candidate nonclassical MHC class I gene, comprises a nonpolymorphic transcribed gene related to dog DLA-79 and giant panda Aime-1906. The third is the diversity lineage, which includes 62 of the 71 seal MHC class I variants. All are transcribed, and they minimally represent six harbor and 12 gray seal MHC class I genes. Besides species-specific differences in gene number, seal MHC class I haplotypes exhibit gene content variation and allelic polymorphism. Patterns of sequence variation, and of positions for positively selected sites, indicate the diversity lineage genes are the seals’ classical MHC class I genes. Evidence that expansion of diversity lineage genes began before gray and harbor seals diverged is the presence in both species of two distinctive sublineages of diversity lineage genes. Pointing to further expansion following the divergence are the presence of species-specific genes and greater MHC class I diversity in gray seals than harbor seals. The elaboration of a complex variable family of classical MHC class I genes in pinnipeds contrasts with the single, highly polymorphic classical MHC class I gene of dog and giant panda, terrestrial carnivores. PMID:23001684
Márki-Zay, János; Klein, Christoph L; Gancberg, David; Schimmel, Heinz G; Dux, László
2009-04-01
Depending on the method used, rare sequence variants adjacent to the single nucleotide polymorphism (SNP) of interest may cause unusual or erroneous genotyping results. Because such rare variants are known for many genes commonly tested in diagnostic laboratories, we organized a proficiency study to assess their influence on the accuracy of reported laboratory results. Four external quality control materials were processed and sent to 283 laboratories through 3 EQA organizers for analysis of the prothrombin 20210G>A mutation. Two of these quality control materials contained sequence variants introduced by site-directed mutagenesis. One hundred eighty-nine laboratories participated in the study. When samples gave a usual result with the method applied, the error rate was 5.1%. Detailed analysis showed that more than 70% of the failures were reported from only 9 laboratories. Allele-specific amplification-based PCR had a much higher error rate than other methods (18.3% vs 2.9%). The variants 20209C>T and [20175T>G; 20179_20180delAC] resulted in unusual genotyping results in 67 and 85 laboratories, respectively. Eighty-three (54.6%) of these unusual results were not recognized, 32 (21.1%) were attributed to technical issues, and only 37 (24.3%) were recognized as another sequence variant. Our findings revealed that some of the participating laboratories were not able to recognize and correctly interpret unusual genotyping results caused by rare SNPs. Our study indicates that the majority of the failures could be avoided by improved training and careful selection and validation of the methods applied.
New genetic variants of LATS1 detected in urinary bladder and colon cancer.
Saadeldin, Mona K; Shawer, Heba; Mostafa, Ahmed; Kassem, Neemat M; Amleh, Asma; Siam, Rania
2014-01-01
LATS1, the large tumor suppressor 1 gene, encodes for a serine/threonine kinase protein and is implicated in cell cycle progression. LATS1 is down-regulated in various human cancers, such as breast cancer, and astrocytoma. Point mutations in LATS1 were reported in human sarcomas. Additionally, loss of heterozygosity of LATS1 chromosomal region predisposes to breast, ovarian, and cervical tumors. In the current study, we investigated LATS1 genetic variations including single nucleotide polymorphisms (SNPs), in 28 Egyptian patients with either urinary bladder or colon cancers. The LATS1 gene was amplified and sequenced and the expression of LATS1 at the RNA level was assessed in 12 urinary bladder cancer samples. We report, the identification of a total of 29 variants including previously identified SNPs within LATS1 coding and non-coding sequences. A total of 18 variants were novel. Majority of the novel variants, 13, were mapped to intronic sequences and un-translated regions of the gene. Four of the five novel variants located in the coding region of the gene, represented missense mutations within the serine/threonine kinase catalytic domain. Interestingly, LATS1 RNA steady state levels was lost in urinary bladder cancerous tissue harboring four specific SNPs (16045 + 41736 + 34614 + 56177) positioned in the 5'UTR, intron 6, and two silent mutations within exon 4 and exon 8, respectively. This study identifies novel single-base-sequence alterations in the LATS1 gene. These newly identified variants could potentially be used as novel diagnostic or prognostic tools in cancer.
Santos, V C; Grecco, M; Pereira, K M C; Terzian, C C N; Andrade, L E C; Silva, N P
2016-10-01
The objective of this study was to evaluate the association between Fc gamma receptor IIIb polymorphism and susceptibility to systemic lupus erythematosus and clinical traits of the disease. Genomic DNA was obtained from 303 consecutive systemic lupus erythematosus patients and 300 healthy blood donors from the southeastern region of Brazil. The polymorphic region of the FCGR3B gene was sequenced and the alleles FCGR3B*01, FCGR3B*02 and FCGR3B*03 were analyzed. The FCGR3B*01 allele was more frequent in systemic lupus erythematosus patients (43.1%) while the FCGR3B*02 allele prevailed among controls (63.7%) (P = 0.001). The FCGR3B*03 allele was found equally in both groups. The FCGR3B*01/*01 (20.7%) and FCGR3B*01/*02 (41.1%) genotypes were more frequent among systemic lupus erythematosus patients (P = 0.028 and P = 0.012, respectively) while the FCGR3B*02/*02 genotype was more frequent in controls (45.5%) (P < 0.001). One variant of the FCGR3B*01 allele previously described in Germany was found in only one control. A new variant of the FCGR3B*01 allele with two substitutions (A227G/G277A) was found in one control. Three variants of the FCGR3B*02 allele previously described in African-Americans, Brazilians, Chinese and Japanese were found in ten 10 patients and two controls. In addition, several single nucleotide polymorphisms at non-polymorphic positions were identified in both patients and controls. Susceptibility to systemic lupus erythematosus was associated with the FCGR3B*01 allele, as well as with the FCGR3B*01/*01 and FCGR3B*01/*02 genotypes. No association was found between FCGR3B genotypes and clinical manifestations, disease severity or the presence of autoantibodies. © The Author(s) 2016.
Suenaga, Mitsukuni; Schirripa, Marta; Cao, Shu; Zhang, Wu; Yang, Dongyun; Ning, Yan; Cremolini, Chiara; Antoniotti, Carlotta; Borelli, Beatrice; Mashima, Tetsuo; Okazaki, Satoshi; Berger, Martin D; Miyamoto, Yuji; Gopez, Roel; Barzi, Afsaneh; Lonardi, Sara; Yamaguchi, Toshiharu; Falcone, Alfredo; Loupakis, Fotios; Lenz, Heinz-Josef
2018-06-01
The C-C motif chemokine ligand 5/C-C motif chemokine receptor 5 (CCL5/CCR5) pathway has been shown to induce endothelial progenitor cell migration, resulting in increased vascular endothelial growth factor A expression. We hypothesized that genetic polymorphisms in the CCL5/CCR5 pathway predict efficacy and toxicity in patients with metastatic colorectal cancer (mCRC) treated with regorafenib. We analyzed genomic DNA extracted from 229 tumor samples from 2 different cohorts of patients who received regorafenib: an evaluation cohort of 79 Japanese patients and a validation cohort of 150 Italian patients. Single nucleotide polymorphisms of CCL5/CCR5 pathway-related genes were analyzed by PCR-based direct sequencing. CCL4 rs1634517 and CCL3 rs1130371 were associated with progression-free survival in the evaluation cohort (hazard ratio [HR] 1.54, P = .043; HR 1.48, P = .064), and progression-free survival (HR 1.74, P < .001; HR 1.66, P = .002) and overall survival (HR 1.65, P = .004; HR 1.65, P = .004) in the validation cohort. The allelic frequencies of CCL5 single nucleotide polymorphisms varied between the evaluation and validation cohorts (G/G variant in rs2280789, 21.5% vs. 1.3%, P < .001; T/T variant in rs3817655, 22.8% vs. 2.7%, P < .001). In the evaluation cohort, patients with the G/G variant in rs2280789 had a higher incidence of grade 3+ hand-foot skin reaction compared to any A allele (53% vs. 27%, P = .078), and similarly to the T/T variant in rs3817655 compared to any A allele (56% vs. 26%, P = .026). Genetic variants in the CCL5/CCR5 pathway may serve as prognostic markers and may predict severe hand-foot skin reaction in mCRC patients receiving regorafenib therapy. Copyright © 2018 Elsevier Inc. All rights reserved.
Zumaraga, Mark Pretzel; Medina, Paul Julius; Recto, Juan Miguel; Abrahan, Lauro; Azurin, Edelyn; Tanchoco, Celeste C; Jimeno, Cecilia A; Palmes-Saloma, Cynthia
2017-03-01
This study aimed to discover genetic variants in the entire 101 kB vitamin D receptor (VDR) gene for vitamin D deficiency in a group of postmenopausal Filipino women using targeted next generation sequencing (TNGS) approach in a case-control study design. A total of 50 women with and without osteoporotic fracture seen at the Philippine Orthopedic Center were included. Blood samples were collected for determination of serum vitamin D, calcium, phosphorus, glucose, blood urea nitrogen, creatinine, aspartate aminotransferase, alanine aminotransferase and as primary source for targeted VDR gene sequencing using the Ion Torrent Personal Genome Machine. The variant calling was based on the GATK best practice workflow and annotated using Annovar tool. A total of 1496 unique variants in the whole 101-kb VDR gene were identified. Novel sequence variations not registered in the dbSNP database were found among cases and controls at a rate of 23.1% and 16.6% of total discovered variants, respectively. One disease-associated enhancer showed statistically significant association to low serum 25-hydroxy vitamin D levels (Pearson chi-square P-value=0.009). The transcription factor binding site prediction program PROMO predicted the disruption of three transcription factor binding sites in this enhancer region. These findings show the power of TNGS in identifying sequence variations in a very large gene and the surprising results obtained in this study greatly expand the catalog of known VDR sequence variants that may represent an important clue in the emergence of vitamin D deficiency. Such information will also provide the additional guidance necessary toward a personalized nutritional advice to reach sufficient vitamin D status. Copyright © 2016 Elsevier Inc. All rights reserved.
Study of the S427G polymorphism and of MYBL2 variants in patients with acute myeloid leukemia.
Dolz, Sandra; García, Paloma; Llop, Marta; Fuster, Óscar; Luna, Irene; Ibáñez, Mariam; Gómez, Inés; López, María; Such, Esperanza; Cervera, José; Sanz, Miguel A; De Juan, Inmaculada; Palanca, Sarai; Murria, Rosa; Bolufer, Pascual; Barragán, Eva
2015-06-19
Dysregulation of MYBL2 has been associated to tumorigenesis and the S427G polymorphism could induce partial inactivation of MYBL2, associating it with cancer risk. It has previously been shown that MYBL2 was over-expressed in some acute myeloid leukemias (AML), portending poor prognosis. However, to date no studies have investigated the S427G or other genetic variants of MYBL2 in AML. This study analyzed the S427G in 197 AML patients and 179 controls and screened the MYBL2 sequence in patients. In contrast to other studies in solid tumors, the S427G was not associated with the incidence of AML. This study detected four unannotated genetic alterations, of which the Q67X could be involved in MYBL2 dysfunction. Eight polymorphisms were identified, among which the rs73116571, located in a splicing region, was associated with higher incidence in AML and weaker MYBL2 expression, suggesting pre-disposition to AML. Additional functional studies should be performed to verify these genetic variations as possible targets in AML.
Regulatory polymorphisms modulate the expression of HLA class II molecules and promote autoimmunity
Raj, Prithvi; Rai, Ekta; Song, Ran; Khan, Shaheen; Wakeland, Benjamin E; Viswanathan, Kasthuribai; Arana, Carlos; Liang, Chaoying; Zhang, Bo; Dozmorov, Igor; Carr-Johnson, Ferdicia; Mitrovic, Mitja; Wiley, Graham B; Kelly, Jennifer A; Lauwerys, Bernard R; Olsen, Nancy J; Cotsapas, Chris; Garcia, Christine K; Wise, Carol A; Harley, John B; Nath, Swapan K; James, Judith A; Jacob, Chaim O; Tsao, Betty P; Pasare, Chandrashekhar; Karp, David R; Li, Quan Zhen; Gaffney, Patrick M; Wakeland, Edward K
2016-01-01
Targeted sequencing of sixteen SLE risk loci among 1349 Caucasian cases and controls produced a comprehensive dataset of the variations causing susceptibility to systemic lupus erythematosus (SLE). Two independent disease association signals in the HLA-D region identified two regulatory regions containing 3562 polymorphisms that modified thirty-seven transcription factor binding sites. These extensive functional variations are a new and potent facet of HLA polymorphism. Variations modifying the consensus binding motifs of IRF4 and CTCF in the XL9 regulatory complex modified the transcription of HLA-DRB1, HLA-DQA1 and HLA-DQB1 in a chromosome-specific manner, resulting in a 2.5-fold increase in the surface expression of HLA-DR and DQ molecules on dendritic cells with SLE risk genotypes, which increases to over 4-fold after stimulation. Similar analyses of fifteen other SLE risk loci identified 1206 functional variants tightly linked with disease-associated SNPs and demonstrated that common disease alleles contain multiple causal variants modulating multiple immune system genes. DOI: http://dx.doi.org/10.7554/eLife.12089.001 PMID:26880555
High resolution identity testing of inactivated poliovirus vaccines
Mee, Edward T.; Minor, Philip D.; Martin, Javier
2015-01-01
Background Definitive identification of poliovirus strains in vaccines is essential for quality control, particularly where multiple wild-type and Sabin strains are produced in the same facility. Sequence-based identification provides the ultimate in identity testing and would offer several advantages over serological methods. Methods We employed random RT-PCR and high throughput sequencing to recover full-length genome sequences from monovalent and trivalent poliovirus vaccine products at various stages of the manufacturing process. Results All expected strains were detected in previously characterised products and the method permitted identification of strains comprising as little as 0.1% of sequence reads. Highly similar Mahoney and Sabin 1 strains were readily discriminated on the basis of specific variant positions. Analysis of a product known to contain incorrect strains demonstrated that the method correctly identified the contaminants. Conclusion Random RT-PCR and shotgun sequencing provided high resolution identification of vaccine components. In addition to the recovery of full-length genome sequences, the method could also be easily adapted to the characterisation of minor variant frequencies and distinction of closely related products on the basis of distinguishing consensus and low frequency polymorphisms. PMID:26049003
2014-01-01
Background Foxtail millet (Setaria italica (L.) Beauv.) is an important gramineous grain-food and forage crop. It is grown worldwide for human and livestock consumption. Its small genome and diploid nature have led to foxtail millet fast becoming a novel model for investigating plant architecture, drought tolerance and C4 photosynthesis of grain and bioenergy crops. Therefore, cost-effective, reliable and highly polymorphic molecular markers covering the entire genome are required for diversity, mapping and functional genomics studies in this model species. Result A total of 5,020 highly repetitive microsatellite motifs were isolated from the released genome of the genotype 'Yugu1’ by sequence scanning. Based on sequence comparison between S. italica and S. viridis, a set of 788 SSR primer pairs were designed. Of these primers, 733 produced reproducible amplicons and were polymorphic among 28 Setaria genotypes selected from diverse geographical locations. The number of alleles detected by these SSR markers ranged from 2 to 16, with an average polymorphism information content of 0.67. The result obtained by neighbor-joining cluster analysis of 28 Setaria genotypes, based on Nei’s genetic distance of the SSR data, showed that these SSR markers are highly polymorphic and effective. Conclusions A large set of highly polymorphic SSR markers were successfully and efficiently developed based on genomic sequence comparison between different genotypes of the genus Setaria. The large number of new SSR markers and their placement on the physical map represent a valuable resource for studying diversity, constructing genetic maps, functional gene mapping, QTL exploration and molecular breeding in foxtail millet and its closely related species. PMID:24472631
Zhang, Shuo; Tang, Chanjuan; Zhao, Qiang; Li, Jing; Yang, Lifang; Qie, Lufeng; Fan, Xingke; Li, Lin; Zhang, Ning; Zhao, Meicheng; Liu, Xiaotong; Chai, Yang; Zhang, Xue; Wang, Hailong; Li, Yingtao; Li, Wen; Zhi, Hui; Jia, Guanqing; Diao, Xianmin
2014-01-28
Foxtail millet (Setaria italica (L.) Beauv.) is an important gramineous grain-food and forage crop. It is grown worldwide for human and livestock consumption. Its small genome and diploid nature have led to foxtail millet fast becoming a novel model for investigating plant architecture, drought tolerance and C4 photosynthesis of grain and bioenergy crops. Therefore, cost-effective, reliable and highly polymorphic molecular markers covering the entire genome are required for diversity, mapping and functional genomics studies in this model species. A total of 5,020 highly repetitive microsatellite motifs were isolated from the released genome of the genotype 'Yugu1' by sequence scanning. Based on sequence comparison between S. italica and S. viridis, a set of 788 SSR primer pairs were designed. Of these primers, 733 produced reproducible amplicons and were polymorphic among 28 Setaria genotypes selected from diverse geographical locations. The number of alleles detected by these SSR markers ranged from 2 to 16, with an average polymorphism information content of 0.67. The result obtained by neighbor-joining cluster analysis of 28 Setaria genotypes, based on Nei's genetic distance of the SSR data, showed that these SSR markers are highly polymorphic and effective. A large set of highly polymorphic SSR markers were successfully and efficiently developed based on genomic sequence comparison between different genotypes of the genus Setaria. The large number of new SSR markers and their placement on the physical map represent a valuable resource for studying diversity, constructing genetic maps, functional gene mapping, QTL exploration and molecular breeding in foxtail millet and its closely related species.
McClure, Matthew C; Bickhart, Derek; Null, Dan; Vanraden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B; Van Tassell, Curtis P; Sonstegard, Tad S
2014-01-01
The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array.
McClure, Matthew C.; Bickhart, Derek; Null, Dan; VanRaden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B.; Van Tassell, Curtis P.; Sonstegard, Tad S.
2014-01-01
The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array. PMID:24667746
Integrating common and rare genetic variation in diverse human populations.
Altshuler, David M; Gibbs, Richard A; Peltonen, Leena; Altshuler, David M; Gibbs, Richard A; Peltonen, Leena; Dermitzakis, Emmanouil; Schaffner, Stephen F; Yu, Fuli; Peltonen, Leena; Dermitzakis, Emmanouil; Bonnen, Penelope E; Altshuler, David M; Gibbs, Richard A; de Bakker, Paul I W; Deloukas, Panos; Gabriel, Stacey B; Gwilliam, Rhian; Hunt, Sarah; Inouye, Michael; Jia, Xiaoming; Palotie, Aarno; Parkin, Melissa; Whittaker, Pamela; Yu, Fuli; Chang, Kyle; Hawes, Alicia; Lewis, Lora R; Ren, Yanru; Wheeler, David; Gibbs, Richard A; Muzny, Donna Marie; Barnes, Chris; Darvishi, Katayoon; Hurles, Matthew; Korn, Joshua M; Kristiansson, Kati; Lee, Charles; McCarrol, Steven A; Nemesh, James; Dermitzakis, Emmanouil; Keinan, Alon; Montgomery, Stephen B; Pollack, Samuela; Price, Alkes L; Soranzo, Nicole; Bonnen, Penelope E; Gibbs, Richard A; Gonzaga-Jauregui, Claudia; Keinan, Alon; Price, Alkes L; Yu, Fuli; Anttila, Verneri; Brodeur, Wendy; Daly, Mark J; Leslie, Stephen; McVean, Gil; Moutsianas, Loukas; Nguyen, Huy; Schaffner, Stephen F; Zhang, Qingrun; Ghori, Mohammed J R; McGinnis, Ralph; McLaren, William; Pollack, Samuela; Price, Alkes L; Schaffner, Stephen F; Takeuchi, Fumihiko; Grossman, Sharon R; Shlyakhter, Ilya; Hostetter, Elizabeth B; Sabeti, Pardis C; Adebamowo, Clement A; Foster, Morris W; Gordon, Deborah R; Licinio, Julio; Manca, Maria Cristina; Marshall, Patricia A; Matsuda, Ichiro; Ngare, Duncan; Wang, Vivian Ota; Reddy, Deepa; Rotimi, Charles N; Royal, Charmaine D; Sharp, Richard R; Zeng, Changqing; Brooks, Lisa D; McEwen, Jean E
2010-09-02
Despite great progress in identifying genetic variants that influence human disease, most inherited risk remains unexplained. A more complete understanding requires genome-wide studies that fully examine less common alleles in populations with a wide range of ancestry. To inform the design and interpretation of such studies, we genotyped 1.6 million common single nucleotide polymorphisms (SNPs) in 1,184 reference individuals from 11 global populations, and sequenced ten 100-kilobase regions in 692 of these individuals. This integrated data set of common and rare alleles, called 'HapMap 3', includes both SNPs and copy number polymorphisms (CNPs). We characterized population-specific differences among low-frequency variants, measured the improvement in imputation accuracy afforded by the larger reference panel, especially in imputing SNPs with a minor allele frequency of
Improved imputation of low-frequency and rare variants using the UK10K haplotype reference panel.
Huang, Jie; Howie, Bryan; McCarthy, Shane; Memari, Yasin; Walter, Klaudia; Min, Josine L; Danecek, Petr; Malerba, Giovanni; Trabetti, Elisabetta; Zheng, Hou-Feng; Gambaro, Giovanni; Richards, J Brent; Durbin, Richard; Timpson, Nicholas J; Marchini, Jonathan; Soranzo, Nicole
2015-09-14
Imputing genotypes from reference panels created by whole-genome sequencing (WGS) provides a cost-effective strategy for augmenting the single-nucleotide polymorphism (SNP) content of genome-wide arrays. The UK10K Cohorts project has generated a data set of 3,781 whole genomes sequenced at low depth (average 7x), aiming to exhaustively characterize genetic variation down to 0.1% minor allele frequency in the British population. Here we demonstrate the value of this resource for improving imputation accuracy at rare and low-frequency variants in both a UK and an Italian population. We show that large increases in imputation accuracy can be achieved by re-phasing WGS reference panels after initial genotype calling. We also present a method for combining WGS panels to improve variant coverage and downstream imputation accuracy, which we illustrate by integrating 7,562 WGS haplotypes from the UK10K project with 2,184 haplotypes from the 1000 Genomes Project. Finally, we introduce a novel approximation that maintains speed without sacrificing imputation accuracy for rare variants.
Sequence variants of the DFNB31 gene among Usher syndrome patients of diverse origin
Aller, Elena; Jaijo, Teresa; van Wijk, Erwin; Ebermann, Inga; Kersten, Ferry; García-García, Gema; Voesenek, Krysta; Aparisi, María José; Hoefsloot, Lies; Cremers, Cor; Díaz-Llopis, Manuel; Pennings, Ronald; Bolz, Hanno J.; Kremer, Hannie; Millán, José M.
2010-01-01
Purpose It has been demonstrated that mutations in deafness, autosomal recessive 31 (DFNB31), the gene encoding whirlin, is responsible for nonsyndromic hearing loss (NSHL; DFNB31) and Usher syndrome type II (USH2D). We screened DFNB31 in a large cohort of patients with different clinical subtypes of Usher syndrome (USH) to determine the prevalence of DFNB31 mutations among USH patients. Methods DFNB31 was screened in 149 USH2, 29 USH1, six atypical USH, and 11 unclassified USH patients from diverse ethnic backgrounds. Mutation detection was performed by direct sequencing of all coding exons. Results We identified 38 different variants among 195 patients. Most variants were clearly polymorphic, but at least two out of the 15 nonsynonymous variants (p.R350W and p.R882S) are predicted to impair whirlin structure and function, suggesting eventual pathogenicity. No putatively pathogenic mutation was found in the second allele of patients with these mutations. Conclusions DFNB31 is not a major cause of USH. PMID:20352026
Ataxia telangiectasia presenting as dopa-responsive cervical dystonia
Mohire, Mahavir D.; Schneider, Susanne A.; Stamelou, Maria; Wood, Nicholas W.; Bhatia, Kailash P.
2013-01-01
Objective: To identify the cause of cervical dopa-responsive dystonia (DRD) in a Muslim Indian family inherited in an apparently autosomal recessive fashion, as previously described in this journal. Methods: Previous testing for mutations in the genes known to cause DRD (GCH1, TH, and SPR) had been negative. Whole exome sequencing was performed on all 3 affected individuals for whom DNA was available to identify potentially pathogenic shared variants. Genotyping data obtained for all 3 affected individuals using the OmniExpress single nucleotide polymorphism chip (Illumina, San Diego, CA) were used to perform linkage analysis, autozygosity mapping, and copy number variation analysis. Sanger sequencing was used to confirm all variants. Results: After filtering of the variants, exome sequencing revealed 2 genes harboring potentially pathogenic compound heterozygous variants (ATM and LRRC16A). Of these, the variants in ATM segregated perfectly with the cervical DRD. Both mutations detected in ATM have been shown to be pathogenic, and α-fetoprotein, a marker of ataxia telangiectasia, was increased in all affected individuals. Conclusion: Biallelic mutations in ATM can cause DRD, and mutations in this gene should be considered in the differential diagnosis of unexplained DRD, particularly if the dystonia is cervical and if there is a recessive family history. ATM has previously been reported to cause isolated cervical dystonia, but never, to our knowledge, DRD. Individuals with dystonia related to ataxia telangiectasia may benefit from a trial of levodopa. PMID:23946315
Thiele, Sonja; Borschewski, Aljona; Küchler, Judit; Bieberbach, Marc; Voigt, Sebastian; Ehlers, Bernhard
2011-07-01
To prevent complications that might follow an infection with varicella-zoster virus (VZV), the live attenuated Oka strain (V-Oka) is administered to children in many developed countries. Three vaccine brands (Varivax from Sanofi Pasteur MSD; Varilrix and Priorix-Tetra, both from Glaxo-Smith-Kline) are licensed in Germany and have been associated with both different degrees of vaccine effectiveness and adverse effects. To identify genetic variants in the vaccines that might contribute to rash-associated syndromes, single nucleotide polymorphism (SNP) profiles of variants from the three vaccines and rash-associated vaccine-type VZV from German vaccinees were quantitatively compared by PCR-based pyrosequencing (PSQ). The Varivax vaccine contained an estimated 3-fold higher diversity of VZV variants, with 20% more wild-type (wt) SNPs than Varilrix and Priorix-Tetra. These minor VZV variants in the vaccines were identified by analyzing cloned full-length open reading frame (ORF) orf62 sequences by chain termination sequencing and PSQ. Some of these sequences amplified from vaccine VZV were very similar or identical to those of the rash-associated vaccine-type VZV from vaccinees and were almost exclusively detected in Varivax. Therefore, minorities of rash-associated VZV variants are present in varicella vaccine formulations, and it can be concluded that the analysis of a core set of four SNPs is required as a minimum for a firm diagnostic differentiation of vaccine-type VZV from wt VZV.
Stam, Remco; Scheikl, Daniela; Tellier, Aurélien
2016-01-01
Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. PMID:27189991
A complex dominance hierarchy is controlled by polymorphism of small RNAs and their targets.
Yasuda, Shinsuke; Wada, Yuko; Kakizaki, Tomohiro; Tarutani, Yoshiaki; Miura-Uno, Eiko; Murase, Kohji; Fujii, Sota; Hioki, Tomoya; Shimoda, Taiki; Takada, Yoshinobu; Shiba, Hiroshi; Takasaki-Yasuda, Takeshi; Suzuki, Go; Watanabe, Masao; Takayama, Seiji
2016-12-22
In diploid organisms, phenotypic traits are often biased by effects known as Mendelian dominant-recessive interactions between inherited alleles. Phenotypic expression of SP11 alleles, which encodes the male determinants of self-incompatibility in Brassica rapa, is governed by a complex dominance hierarchy 1-3 . Here, we show that a single polymorphic 24 nucleotide small RNA, named SP11 methylation inducer 2 (Smi2), controls the linear dominance hierarchy of the four SP11 alleles (S 44 > S 60 > S 40 > S 29 ). In all dominant-recessive interactions, small RNA variants derived from the linked region of dominant SP11 alleles exhibited high sequence similarity to the promoter regions of recessive SP11 alleles and acted in trans to epigenetically silence their expression. Together with our previous study 4 , we propose a new model: sequence similarity between polymorphic small RNAs and their target regulates mono-allelic gene expression, which explains the entire five-phased linear dominance hierarchy of the SP11 phenotypic expression in Brassica.
Huang, Yong-Zhen; Wang, Qin; Zhang, Chun-Lei; Fang, Xing-Tang; Song, En-Liang; Chen, Hong
2016-01-01
Identification of the genes and polymorphisms underlying quantitative traits, and understanding these genes and polymorphisms affect economic growth traits, are important for successful marker-assisted selection and more efficient management strategies in commercial cattle (Bos taurus) population. Syndecan-3 (SDC3), a member of the syndecan family of type I transmembrane heparan sulfate proteoglycans is a novel regulator of feeding behavior and body weight. The aim of this study is to examine the association of the SDC3 polymorphism with growth traits in Chinese Jiaxian and Qinchuan cattle breeds (). Four single nucleotide polymorphisms (SNPs: 1-4) were detected in 555 cows from three Chinese native cattle breeds by means of sequencing pooled DNA samples and polymerase chain reaction-single stranded conformational polymorphism (PCR-SSCP) methods. We found one SNP (g.28362A > G) in intron and three SNPs (g.30742T > G, g.30821C > T and 33418 A > G) in exons. The statistical analyses indicated that these SNPs of SDC3 gene were associated with bovine body height, body length, chest circumference, and circumference of cannon bone (P < 0.05). The mutant-type variant was superior for growth traits; the heterozygote was associated with higher growth traits compared to wild-type homozygote. Our result confirms the polymorphisms in the SDC3 gene are associated with growth traits that may be used for marker-assisted selection in beef cattle breeding programs.
Yang, Xunjun; Zhang, Yuning; Ma, Yin; Zhao, Qiongya; Lyu, Jianxin
2015-12-01
To explore the role of mitochondrial DNA 5178 C/A (Mt5178) polymorphism of NADH-dehydrogenase subunit 2 (ND2) gene in type-2 diabetes mellitus (T2DM) among ethnic Han Chinese through a case-control study. The Mt5178C/A polymorphism was determined by sequencing 1103 T2DM patients and 791 healthy controls. Logistic regression analysis was conducted to estimate odds ratios (OR) and 95% confidence intervals (CI). To confirm the results, a meta-analysis was conducted based on published literature on the association of Mt5178 variant with T2DM. No significant association was found between the Mt5178C/A variant and T2DM either by our study or the meta-analysis which included eight published studies. Nevertheless, it was found that the T2DM patients with 5178C genotype were at a higher risk for nephropathy complication (OR=1.49, 95%CI: 1.005-2.197, P<0.05) and at significantly lower risk for hypertension complication (OR=0.744, 95%CI: 0.556-0.996, P<0.05) compared with those carrying a 5178A genotype. No association was found between the Mt5178C/A polymorphism of mitochondrial ND2 gene with the increased risk of T2DM. However, the polymorphism may affect the development of nephropathy and hypertension complications among T2DM patients.
Breitfeld, Jana; Martens, Susanne; Klammt, Jürgen; Schlicke, Marina; Pfäffle, Roland; Krause, Kerstin; Weidle, Kerstin; Schleinitz, Dorit; Stumvoll, Michael; Führer, Dagmar; Kovacs, Peter; Tönjes, Anke
2013-12-01
The complex process of development of the pituitary gland is regulated by a number of signalling molecules and transcription factors. Mutations in these factors have been identified in rare cases of congenital hypopituitarism but for most subjects with combined pituitary hormone deficiency (CPHD) genetic causes are unknown. Bone morphogenetic proteins (BMPs) affect induction and growth of the pituitary primordium and thus represent plausible candidates for mutational screening of patients with CPHD. We sequenced BMP2, 4 and 7 in 19 subjects with CPHD. For validation purposes, novel genetic variants were genotyped in 1046 healthy subjects. Additionally, potential functional relevance for most promising variants has been assessed by phylogenetic analyses and prediction of effects on protein structure. Sequencing revealed two novel variants and confirmed 30 previously known polymorphisms and mutations in BMP2, 4 and 7. Although phylogenetic analyses indicated that these variants map within strongly conserved gene regions, there was no direct support for their impact on protein structure when applying predictive bioinformatics tools. A mutation in the BMP4 coding region resulting in an amino acid exchange (p.Arg300Pro) appeared most interesting among the identified variants. Further functional analyses are required to ultimately map the relevance of these novel variants in CPHD.
2013-01-01
Background The complex process of development of the pituitary gland is regulated by a number of signalling molecules and transcription factors. Mutations in these factors have been identified in rare cases of congenital hypopituitarism but for most subjects with combined pituitary hormone deficiency (CPHD) genetic causes are unknown. Bone morphogenetic proteins (BMPs) affect induction and growth of the pituitary primordium and thus represent plausible candidates for mutational screening of patients with CPHD. Methods We sequenced BMP2, 4 and 7 in 19 subjects with CPHD. For validation purposes, novel genetic variants were genotyped in 1046 healthy subjects. Additionally, potential functional relevance for most promising variants has been assessed by phylogenetic analyses and prediction of effects on protein structure. Results Sequencing revealed two novel variants and confirmed 30 previously known polymorphisms and mutations in BMP2, 4 and 7. Although phylogenetic analyses indicated that these variants map within strongly conserved gene regions, there was no direct support for their impact on protein structure when applying predictive bioinformatics tools. Conclusions A mutation in the BMP4 coding region resulting in an amino acid exchange (p.Arg300Pro) appeared most interesting among the identified variants. Further functional analyses are required to ultimately map the relevance of these novel variants in CPHD. PMID:24289245
Huszar, Tunde I; Jobling, Mark A; Wetton, Jon H
2018-04-12
Short tandem repeats on the male-specific region of the Y chromosome (Y-STRs) are permanently linked as haplotypes, and therefore Y-STR sequence diversity can be considered within the robust framework of a phylogeny of haplogroups defined by single nucleotide polymorphisms (SNPs). Here we use massively parallel sequencing (MPS) to analyse the 23 Y-STRs in Promega's prototype PowerSeq™ Auto/Mito/Y System kit (containing the markers of the PowerPlex® Y23 [PPY23] System) in a set of 100 diverse Y chromosomes whose phylogenetic relationships are known from previous megabase-scale resequencing. Including allele duplications and alleles resulting from likely somatic mutation, we characterised 2311 alleles, demonstrating 99.83% concordance with capillary electrophoresis (CE) data on the same sample set. The set contains 267 distinct sequence-based alleles (an increase of 58% compared to the 169 detectable by CE), including 60 novel Y-STR variants phased with their flanking sequences which have not been reported previously to our knowledge. Variation includes 46 distinct alleles containing non-reference variants of SNPs/indels in both repeat and flanking regions, and 145 distinct alleles containing repeat pattern variants (RPV). For DYS385a,b, DYS481 and DYS390 we observed repeat count variation in short flanking segments previously considered invariable, and suggest new MPS-based structural designations based on these. We considered the observed variation in the context of the Y phylogeny: several specific haplogroup associations were observed for SNPs and indels, reflecting the low mutation rates of such variant types; however, RPVs showed less phylogenetic coherence and more recurrence, reflecting their relatively high mutation rates. In conclusion, our study reveals considerable additional diversity at the Y-STRs of the PPY23 set via MPS analysis, demonstrates high concordance with CE data, facilitates nomenclature standardisation, and places Y-STR sequence variants in their phylogenetic context. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
A Sequence Polymorphism in MSTN Predicts Sprinting Ability and Racing Stamina in Thoroughbred Horses
Hill, Emmeline W.; Gu, Jingjing; Eivers, Suzanne S.; Fonseca, Rita G.; McGivney, Beatrice A.; Govindarajan, Preethi; Orr, Nick; Katz, Lisa M.; MacHugh, David
2010-01-01
Variants of the MSTN gene encoding myostatin are associated with muscle hypertrophy phenotypes in a range of mammalian species, most notably cattle, dogs, mice, and humans. Using a sample of registered Thoroughbred horses (n = 148), we have identified a novel MSTN sequence polymorphism that is strongly associated (g.66493737C>T, P = 4.85×10−8) with best race distance among elite racehorses (n = 79). This observation was independently validated (P = 1.91×10−6) in a resampled group of Thoroughbreds (n = 62) and in a cohort of Thoroughbreds (n = 37, P = 0.0047) produced by the same trainer. We observed that C/C horses are suited to fast, short-distance races; C/T horses compete favorably in middle-distance races; and T/T horses have greater stamina. Evaluation of retrospective racecourse performance (n = 142) and stallion progeny performance predict that C/C and C/T horses are more likely to be successful two-year-old racehorses than T/T animals. Here we describe for the first time the identification of a gene variant in Thoroughbred racehorses that is predictive of genetic potential for an athletic phenotype. PMID:20098749
Hill, Emmeline W; Gu, Jingjing; Eivers, Suzanne S; Fonseca, Rita G; McGivney, Beatrice A; Govindarajan, Preethi; Orr, Nick; Katz, Lisa M; MacHugh, David E; MacHugh, David
2010-01-20
Variants of the MSTN gene encoding myostatin are associated with muscle hypertrophy phenotypes in a range of mammalian species, most notably cattle, dogs, mice, and humans. Using a sample of registered Thoroughbred horses (n = 148), we have identified a novel MSTN sequence polymorphism that is strongly associated (g.66493737C>T, P = 4.85x10(-8)) with best race distance among elite racehorses (n = 79). This observation was independently validated (P = 1.91x10(-6)) in a resampled group of Thoroughbreds (n = 62) and in a cohort of Thoroughbreds (n = 37, P = 0.0047) produced by the same trainer. We observed that C/C horses are suited to fast, short-distance races; C/T horses compete favorably in middle-distance races; and T/T horses have greater stamina. Evaluation of retrospective racecourse performance (n = 142) and stallion progeny performance predict that C/C and C/T horses are more likely to be successful two-year-old racehorses than T/T animals. Here we describe for the first time the identification of a gene variant in Thoroughbred racehorses that is predictive of genetic potential for an athletic phenotype.
A map of human genome variation from population-scale sequencing.
Abecasis, Gonçalo R; Altshuler, David; Auton, Adam; Brooks, Lisa D; Durbin, Richard M; Gibbs, Richard A; Hurles, Matt E; McVean, Gil A
2010-10-28
The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype. Here we present results of the pilot phase of the project, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms. We undertook three projects: low-coverage whole-genome sequencing of 179 individuals from four populations; high-coverage sequencing of two mother-father-child trios; and exon-targeted sequencing of 697 individuals from seven populations. We describe the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants, most of which were previously undescribed. We show that, because we have catalogued the vast majority of common variation, over 95% of the currently accessible variants found in any individual are present in this data set. On average, each person is found to carry approximately 250 to 300 loss-of-function variants in annotated genes and 50 to 100 variants previously implicated in inherited disorders. We demonstrate how these results can be used to inform association and functional studies. From the two trios, we directly estimate the rate of de novo germline base substitution mutations to be approximately 10(-8) per base pair per generation. We explore the data with regard to signatures of natural selection, and identify a marked reduction of genetic variation in the neighbourhood of genes, due to selection at linked sites. These methods and public data will support the next phase of human genetic research.
2012-01-01
Background Although modern sequencing technologies permit the ready detection of numerous DNA sequence variants in any organisms, converting such information to PCR-based genetic markers is hampered by a lack of simple, scalable tools. Onion is an example of an under-researched crop with a complex, heterozygous genome where genome-based research has previously been hindered by limited sequence resources and genetic markers. Results We report the development of generic tools for large-scale web-based PCR-based marker design in the Galaxy bioinformatics framework, and their application for development of next-generation genetics resources in a wide cross of bulb onion (Allium cepa L.). Transcriptome sequence resources were developed for the homozygous doubled-haploid bulb onion line ‘CUDH2150’ and the genetically distant Indian landrace ‘Nasik Red’, using 454™ sequencing of normalised cDNA libraries of leaf and shoot. Read mapping of ‘Nasik Red’ reads onto ‘CUDH2150’ assemblies revealed 16836 indel and SNP polymorphisms that were mined for portable PCR-based marker development. Tools for detection of restriction polymorphisms and primer set design were developed in BioPython and adapted for use in the Galaxy workflow environment, enabling large-scale and targeted assay design. Using PCR-based markers designed with these tools, a framework genetic linkage map of over 800cM spanning all chromosomes was developed in a subset of 93 F2 progeny from a very large F2 family developed from the ‘Nasik Red’ x ‘CUDH2150’ inter-cross. The utility of tools and genetic resources developed was tested by designing markers to transcription factor-like polymorphic sequences. Bin mapping these markers using a subset of 10 progeny confirmed the ability to place markers within 10 cM bins, enabling increased efficiency in marker assignment and targeted map refinement. The major genetic loci conditioning red bulb colour (R) and fructan content (Frc) were located on this map by QTL analysis. Conclusions The generic tools developed for the Galaxy environment enable rapid development of sets of PCR assays targeting sequence variants identified from Illumina and 454 sequence data. They enable non-specialist users to validate and exploit large volumes of next-generation sequence data using basic equipment. PMID:23157543
Baldwin, Samantha; Revanna, Roopashree; Thomson, Susan; Pither-Joyce, Meeghan; Wright, Kathryn; Crowhurst, Ross; Fiers, Mark; Chen, Leshi; Macknight, Richard; McCallum, John A
2012-11-19
Although modern sequencing technologies permit the ready detection of numerous DNA sequence variants in any organisms, converting such information to PCR-based genetic markers is hampered by a lack of simple, scalable tools. Onion is an example of an under-researched crop with a complex, heterozygous genome where genome-based research has previously been hindered by limited sequence resources and genetic markers. We report the development of generic tools for large-scale web-based PCR-based marker design in the Galaxy bioinformatics framework, and their application for development of next-generation genetics resources in a wide cross of bulb onion (Allium cepa L.). Transcriptome sequence resources were developed for the homozygous doubled-haploid bulb onion line 'CUDH2150' and the genetically distant Indian landrace 'Nasik Red', using 454™ sequencing of normalised cDNA libraries of leaf and shoot. Read mapping of 'Nasik Red' reads onto 'CUDH2150' assemblies revealed 16836 indel and SNP polymorphisms that were mined for portable PCR-based marker development. Tools for detection of restriction polymorphisms and primer set design were developed in BioPython and adapted for use in the Galaxy workflow environment, enabling large-scale and targeted assay design. Using PCR-based markers designed with these tools, a framework genetic linkage map of over 800cM spanning all chromosomes was developed in a subset of 93 F(2) progeny from a very large F(2) family developed from the 'Nasik Red' x 'CUDH2150' inter-cross. The utility of tools and genetic resources developed was tested by designing markers to transcription factor-like polymorphic sequences. Bin mapping these markers using a subset of 10 progeny confirmed the ability to place markers within 10 cM bins, enabling increased efficiency in marker assignment and targeted map refinement. The major genetic loci conditioning red bulb colour (R) and fructan content (Frc) were located on this map by QTL analysis. The generic tools developed for the Galaxy environment enable rapid development of sets of PCR assays targeting sequence variants identified from Illumina and 454 sequence data. They enable non-specialist users to validate and exploit large volumes of next-generation sequence data using basic equipment.
Kamel, Katarzyna A; Kroc, Magdalena; Święcicki, Wojciech
2015-01-01
Sequence tagged site (STS) markers are valuable tools for genetic and physical mapping that can be successfully used in comparative analyses among related species. Current challenges for molecular markers genotyping in plants include the lack of fast, sensitive and inexpensive methods suitable for sequence variant detection. In contrast, high resolution melting (HRM) is a simple and high-throughput assay, which has been widely applied in sequence polymorphism identification as well as in the studies of genetic variability and genotyping. The present study is the first attempt to use the HRM analysis to genotype STS markers in narrow-leafed lupin (Lupinus angustifolius L.). The sensitivity and utility of this method was confirmed by the sequence polymorphism detection based on melting curve profiles in the parental genotypes and progeny of the narrow-leafed lupin mapping population. Application of different approaches, including amplicon size and a simulated heterozygote analysis, has allowed for successful genetic mapping of 16 new STS markers in the narrow-leafed lupin genome.
HomSI: a homozygous stretch identifier from next-generation sequencing data.
Görmez, Zeliha; Bakir-Gungor, Burcu; Sagiroglu, Mahmut Samil
2014-02-01
In consanguineous families, as a result of inheriting the same genomic segments through both parents, the individuals have stretches of their genomes that are homozygous. This situation leads to the prevalence of recessive diseases among the members of these families. Homozygosity mapping is based on this observation, and in consanguineous families, several recessive disease genes have been discovered with the help of this technique. The researchers typically use single nucleotide polymorphism arrays to determine the homozygous regions and then search for the disease gene by sequencing the genes within this candidate disease loci. Recently, the advent of next-generation sequencing enables the concurrent identification of homozygous regions and the detection of mutations relevant for diagnosis, using data from a single sequencing experiment. In this respect, we have developed a novel tool that identifies homozygous regions using deep sequence data. Using *.vcf (variant call format) files as an input file, our program identifies the majority of homozygous regions found by microarray single nucleotide polymorphism genotype data. HomSI software is freely available at www.igbam.bilgem.tubitak.gov.tr/softwares/HomSI, with an online manual.
Screening for rare variants in the PNPLA3 gene in obese liver biopsy patients.
Zegers, Doreen; Verrijken, An; Francque, Sven; de Freitas, Fenna; Beckers, Sigri; Aerts, Evi; Ruppert, Martin; Hubens, Guy; Michielsen, Peter; Van Hul, Wim; Van Gaal, Luc F
2016-12-01
Previous research has clearly implicated the PNPLA3 gene in the etiology of nonalcoholic fatty liver disease as a polymorphism in the gene was found to be robustly associated to the disease. However, data on the involvement of rare PNPLA3 variants in the development of nonalcoholic fatty liver disease (NAFLD) is currently limited. Therefore, we performed an extensive mutation analysis study on a cohort of obese liver biopsy patients to determine PNPLA3 variation and its correlation with fatty liver disease. We screened the entire coding region of the PNPLA3 gene in DNA samples of 393 obese liver biopsy patients with varying degrees of fatty liver disease. Mutation analysis was performed by high-resolution melting curve analysis in combination with direct sequencing. We identified several common polymorphisms as well as one rare synonymous variant (c.867G>A rs139896256), one rare intronic variant (c.979+13C>T) and 3 nonsynonymous coding variants (p.A76T, p.A104V and p.T200M) in the PNPLA3 gene. In silico analysis indicated that the p.A104V variant will probably have no functional effect, whereas for the p.A76T and p.T200M variant a possible pathogenic effect is suggested. Overall, we showed that novel variants in PNPLA3 are very rare in our liver biopsy cohort, thereby indicating that their impact on the etiology of NAFLD is probably limited. Nevertheless, for the three rare coding variants that were identified in patients with advanced liver disease, further functional characterization will be essential to verify their potential disease causality. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Camps, Carme; Petousi, Nayia; Bento, Celeste; Cario, Holger; Copley, Richard R.; McMullin, Mary Frances; van Wijk, Richard; Ratcliffe, Peter J.; Robbins, Peter A.; Taylor, Jenny C.
2016-01-01
Erythrocytosis is a rare disorder characterized by increased red cell mass and elevated hemoglobin concentration and hematocrit. Several genetic variants have been identified as causes for erythrocytosis in genes belonging to different pathways including oxygen sensing, erythropoiesis and oxygen transport. However, despite clinical investigation and screening for these mutations, the cause of disease cannot be found in a considerable number of patients, who are classified as having idiopathic erythrocytosis. In this study, we developed a targeted next-generation sequencing panel encompassing the exonic regions of 21 genes from relevant pathways (~79 Kb) and sequenced 125 patients with idiopathic erythrocytosis. The panel effectively screened 97% of coding regions of these genes, with an average coverage of 450×. It identified 51 different rare variants, all leading to alterations of protein sequence, with 57 out of 125 cases (45.6%) having at least one of these variants. Ten of these were known erythrocytosis-causing variants, which had been missed following existing diagnostic algorithms. Twenty-two were novel variants in erythrocytosis-associated genes (EGLN1, EPAS1, VHL, BPGM, JAK2, SH2B3) and in novel genes included in the panel (e.g. EPO, EGLN2, HIF3A, OS9), some with a high likelihood of functionality, for which future segregation, functional and replication studies will be useful to provide further evidence for causality. The rest were classified as polymorphisms. Overall, these results demonstrate the benefits of using a gene panel rather than existing methods in which focused genetic screening is performed depending on biochemical measurements: the gene panel improves diagnostic accuracy and provides the opportunity for discovery of novel variants. PMID:27651169
Camps, Carme; Petousi, Nayia; Bento, Celeste; Cario, Holger; Copley, Richard R; McMullin, Mary Frances; van Wijk, Richard; Ratcliffe, Peter J; Robbins, Peter A; Taylor, Jenny C
2016-11-01
Erythrocytosis is a rare disorder characterized by increased red cell mass and elevated hemoglobin concentration and hematocrit. Several genetic variants have been identified as causes for erythrocytosis in genes belonging to different pathways including oxygen sensing, erythropoiesis and oxygen transport. However, despite clinical investigation and screening for these mutations, the cause of disease cannot be found in a considerable number of patients, who are classified as having idiopathic erythrocytosis. In this study, we developed a targeted next-generation sequencing panel encompassing the exonic regions of 21 genes from relevant pathways (~79 Kb) and sequenced 125 patients with idiopathic erythrocytosis. The panel effectively screened 97% of coding regions of these genes, with an average coverage of 450×. It identified 51 different rare variants, all leading to alterations of protein sequence, with 57 out of 125 cases (45.6%) having at least one of these variants. Ten of these were known erythrocytosis-causing variants, which had been missed following existing diagnostic algorithms. Twenty-two were novel variants in erythrocytosis-associated genes (EGLN1, EPAS1, VHL, BPGM, JAK2, SH2B3) and in novel genes included in the panel (e.g. EPO, EGLN2, HIF3A, OS9), some with a high likelihood of functionality, for which future segregation, functional and replication studies will be useful to provide further evidence for causality. The rest were classified as polymorphisms. Overall, these results demonstrate the benefits of using a gene panel rather than existing methods in which focused genetic screening is performed depending on biochemical measurements: the gene panel improves diagnostic accuracy and provides the opportunity for discovery of novel variants. Copyright© Ferrata Storti Foundation.
Rozman, Vita; Kunej, Tanja
2018-05-10
Harnessing the genomics big data requires innovation in how we extract and interpret biologically relevant variants. Currently, there is no established catalog of prioritized missense variants associated with deleterious protein function phenotypes. We report in this study, to the best of our knowledge, the first genome-wide prioritization of sequence variants with the most deleterious effect on protein function (potentially deleterious variants [pDelVars]) in nine vertebrate species: human, cattle, horse, sheep, pig, dog, rat, mouse, and zebrafish. The analysis was conducted using the Ensembl/BioMart tool. Genes comprising pDelVars in the highest number of examined species were identified using a Python script. Multiple genomic alignments of the selected genes were built to identify interspecies orthologous potentially deleterious variants, which we defined as the "ortho-pDelVars." Genome-wide prioritization revealed that in humans, 0.12% of the known variants are predicted to be deleterious. In seven out of nine examined vertebrate species, the genes encoding the multiple PDZ domain crumbs cell polarity complex component (MPDZ) and the transforming acidic coiled-coil containing protein 2 (TACC2) comprise pDelVars. Five interspecies ortho-pDelVars were identified in three genes. These findings offer new ways to harness genomics big data by facilitating the identification of functional polymorphisms in humans and animal models and thus provide a future basis for optimization of protocols for whole genome prioritization of pDelVars and screening of orthologous sequence variants. The approach presented here can inform various postgenomic applications such as personalized medicine and multiomics study of health interventions (iatromics).
de Manuel, Marc; Shiina, Takashi; Suzuki, Shingo; Dereuddre-Bosquet, Nathalie; Garchon, Henri-Jean; Tanaka, Masayuki; Congy-Jolivet, Nicolas; Aarnink, Alice; Le Grand, Roger; Marques-Bonet, Tomas; Blancher, Antoine
2018-05-08
In the Mauritian macaque experimentally inoculated with SIV, gene polymorphisms potentially associated with the plasma virus load at a set point, approximately 100 days post inoculation, were investigated. Among the 42 animals inoculated with 50 AID 50 of the same strain of SIV, none of which received any preventive or curative treatment, nine individuals were selected: three with a plasma virus load (PVL) among the lowest, three with intermediate PVL values and three among the highest PVL values. The complete genomes of these nine animals were then analyzed. Initially, attention was focused on variants with a potential functional impact on protein encoding genes (non-synonymous SNPs (NS-SNPs) and splicing variants). Thus, 424 NS-SNPs possibly associated with PVL were detected. The 424 candidates SNPs were genotyped in these 42 SIV experimentally infected animals (including the nine animals subjected to whole genome sequencing). The genes containing variants most probably associated with PVL at a set time point are analyzed herein.
Sun, Yujia; Lan, Xianyong; Lei, Chuzhao; Zhang, Chunlei; Chen, Hong
2015-06-01
The aim of this study was to examine the association of cofilin2 (CFL2) gene polymorphisms with growth traits in Chinese Qinchuan cattle. Three single nucleotide polymorphisms (SNPs) were identified in the bovine CFL2 gene using DNA sequencing and (forced) PCR-RFLP methods. These polymorphisms included a missense mutation (NC_007319.5: g. C 2213 G) in exon 4, one synonymous mutation (NC_007319.5: g. T 1694 A) in exon 4, and a mutation (NC_007319.5: g. G 1500 A) in intron 2, respectively. In addition, we evaluated the haplotype frequency and linkage disequilibrium coefficient of three sequence variants in 488 individuals in QC cattle. All the three SNPs in QC cattle belonged to an intermediate level of genetic diversity (0.25
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shiang, R.; Lidral, A.C.; Ardinger, H.H.
1993-10-01
Genetic analysis and tissue-specific expression studies support a role for transforming growth-factor alpha (TGFA) in craniofacial development. Previous studies have confirmed an association of alleles for TGFA with nonsyndromic cleft lip with or without cleft palate (CL/P) in humans. The authors carried out a retrospective association study to determine whether specific allelic variants of the TGFA gene are also associated with cleft palate only (CPO). The PCR products from 12 overlapping sets of primers to the TGFA cDNA were examined by using single-strand conformational polymorphism analysis. Four DNA polymorphic sites for TGFA were identified in the 3[prime] untranslated region ofmore » the TGFA gene. These variants, as well as previously identified RFLPs for TGFA, were characterized in case and control populations for CPO by using X[sup 2] analysis. A significant association between alleles of TGFA and CPO was identified which further supports a role for this gene as one of the genetic determinants of craniofacial development. Sequence analysis of the variants disclosed a cluster of three variable sites within 30 bp of each other in the 3[prime] untranslated region previously associated with an antisense transcript. These studies extend the role for TGFA in craniofacial morphogenesis and support an interrelated mechanism underlying nonsyndromic forms of CL/P. 46 refs., 3 figs., 3 tabs.« less
Cao, Lili; Li, Tianfeng; Zhu, Yanbei; Zhou, Wei; Guo, Wenwen; Cai, Zhenming; Xie, Yuan; He, Xuan; Li, Xinxiu; Zhu, Dalong; Wang, Yaping
2013-04-01
Mosaicism refers to the presence of genetically distinct cell lines within an organism or a tissue. Somatic mosaicism exists in distinct populations of somatic cells and commonly arises as a result of somatic mutations, mainly in early embryonic development. SNPs are important markers that distinguish between different individuals in heterogeneous biological samples and contribute greatly to disease risk association studies. In this work, we investigated the relationship between the functional variants in the 5'-UTR of the hOGG1 gene and the risk of type 2 diabetes. Upon detection of the polymorphisms c.-53G>C, c.-23A>G, and c.-18G>T in the hOGG1 gene, we found that mosaicism was present in 3/28 (10.71%), 7/51 (13.73%), and 1/44 (2.27%) patients respectively, who were carriers of these single nucleotide variations, by cloning and sequence analysis and pyrosequencing. Statistical analysis showed that the frequency of the variation c.-23A>G in the hOGG1 5'-UTR in type 2 diabetic patients was significantly higher than that in healthy controls. However, sequencing of the mutant alleles in mosaic individuals showed weak peaks that may affect detection of the SNPs and impair association-based investigations. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Yang, Q L; Huang, X Y; Kong, J J; Zhao, S G; Liu, L X; Gun, S B
2016-08-19
Piglet diarrhea is one of the primary factors that affects the benefits of the swine industry. Recent studies have shown that exon 2 of the swine leukocyte antigen-DQA gene is associated with piglet resistance to diarrhea; however, the contributions of additional exon coding regions of this gene remain unclear. Here, we detected and sequenced variants in the exon 3 region and examined their associations with diarrhea infection in 425 suckling piglets using the polymerase chain reaction-single-strand conformational polymorphism and sequencing analysis. The results revealed that exon 3 of the swine leukocyte antigen-DQA gene is highly polymorphic and pivotal to both diarrhea susceptibility and resistance in piglets. We identified 14 genotypes (AA, AB, BB, BC, CC, EE, EF, BE, BF, CF, DD, DH, GG, and GF) and eight alleles (A-H) that were generated by 14 nucleotide variants, eight of which were novel, and three nucleotide deletions. Statistical analyses revealed that the genotypes AB and EF were associated with resistance to diarrheal disease (P < 0.05), and the genotype DD may contribute to diarrhea susceptibility but was unique to Large White pigs (P > 0.05). These results elucidate the genetic and immunological background to piglet diarrhea, and provide useful information for resistance breeding programs.
Arrhythmogenic KCNE gene variants: current knowledge and future challenges
Crump, Shawn M.; Abbott, Geoffrey W.
2014-01-01
There are twenty-five known inherited cardiac arrhythmia susceptibility genes, all of which encode either ion channel pore-forming subunits or proteins that regulate aspects of ion channel biology such as function, trafficking, and localization. The human KCNE gene family comprises five potassium channel regulatory subunits, sequence variants in each of which are associated with cardiac arrhythmias. KCNE gene products exhibit promiscuous partnering and in some cases ubiquitous expression, hampering efforts to unequivocally correlate each gene to specific native potassium currents. Likewise, deducing the molecular etiology of cardiac arrhythmias in individuals harboring rare KCNE gene variants, or more common KCNE polymorphisms, can be challenging. In this review we provide an update on putative arrhythmia-causing KCNE gene variants, and discuss current thinking and future challenges in the study of molecular mechanisms of KCNE-associated cardiac rhythm disturbances. PMID:24478792
Effect of S267F variant of NTCP on the patients with chronic hepatitis B.
Lee, Hye Won; Park, Hye Jung; Jin, Bora; Dezhbord, Mehrangiz; Kim, Do Young; Han, Kwang-Hyub; Ryu, Wang-Shick; Kim, Seungtaek; Ahn, Sang Hoon
2017-12-15
Sodium taurocholate cotransporting polypeptide (NTCP) was identified as an entry receptor for hepatitis B virus (HBV) infection. The substitution of serine at position 267 of NTCP with phenylalanine (S267F) is an Asian-specific variation that hampers HBV entry in vitro. In this study, we aimed to evaluate the prevalence of S267F polymorphism in Korean patients with chronic hepatitis B (CHB) and its association with disease progression and potential viral evolution in the preS1 domain of HBV. We found that the frequency of the S267F variant of NTCP in CHB patients and controls was 2.7% and 5.7% (P = 0.031), respectively, and that those who had S267F variant were less susceptible to chronic HBV infection. The frequency of the S267F variant in CHB, cirrhosis and hepatocellular carcinoma (HCC) patients was 3.3%, 0.9%, and 3.5%, respectively. Thus, the S267F variant correlated significantly with a lower risk for cirrhosis (P = 0.036). Sequencing preS1 domain of HBV from the patients who had S267F variant revealed no significant sequence change compared to the wild type. In conclusion, the S267F variant of NTCP is clinically associated with a lower risk of chronic HBV infection and cirrhosis development, which implicates suppressing HBV entry could reduce the disease burden.
Mutation Update for GNE Gene Variants Associated with GNE Myopathy
Celeste, Frank V.; Vilboux, Thierry; Ciccone, Carla; de Dios, John Karl; Malicdan, May Christine V.; Leoyklang, Petcharat; McKew, John C.; Gahl, William A.; Carrillo-Carrasco, Nuria; Huizing, Marjan
2014-01-01
The GNE gene encodes the rate-limiting, bifunctional enzyme of sialic acid biosynthesis, UDP-N-acetylglucosamine 2-epimerase/N-acetylmannosamine kinase (GNE). Biallelic GNE mutations underlie GNE myopathy, an adult-onset progressive myopathy. GNE myopathy-associated GNE mutations are predominantly missense, resulting in reduced, but not absent, GNE enzyme activities. The exact pathomechanism of GNE myopathy remains unknown, but likely involves aberrant (muscle) sialylation. Here we summarize 154 reported and novel GNE variants associated with GNE myopathy, including 122 missense, 11 nonsense, 14 insertion/deletions and 7 intronic variants. All variants were deposited in the online GNE variation database (http://www.dmd.nl/nmdb2/home.php?select_db=GNE). We report the predicted effects on protein function of all variants as well as the predicted effects on epimerase and/or kinase enzymatic activities of selected variants. By analyzing exome sequence databases, we identified three frequently occurring, unreported GNE missense variants/polymorphisms, important for future sequence interpretations. Based on allele frequencies, we estimate the world-wide prevalence of GNE myopathy to be ~ 4–21/1,000,000. This previously unrecognized high prevalence confirms suspicions that many patients may escape diagnosis. Awareness among physicians for GNE myopathy is essential for the identification of new patients, which is required for better understanding of the disorder’s pathomechanism and for the success of ongoing treatment trials. PMID:24796702
Ghafarian-Alipour, Farzaneh; Ziaee, Shayan; Ashoori, Mohamad Reza; Zakeri, Mir Saeid; Boroumand, Mohammad Ali; Aghamohammadzadeh, Naser; Abbasi-Majdi, Maryam; Shool, Fatemeh; Asbaghi, Navid Sarakhs; Mohammadi, Abolghasem; Zarghami, Nosratollah
2018-01-30
Recent studies show that FTO single nucleotide polymorphisms (SNPs) are associated with obesity and type 2 diabetes mellitus (T2DM). On the other hand, many animal models and clinical studies have demonstrated that apelin, an adipocytokine, is related to the obesity and T2DM. Additionally, obese women are at risk of Hyperandrogenemia. So, the aim of this study was to investigate the relationship between FTO variants (rs763967273, rs759031579, rs141115189, rs9926289, rs76804286 and rs9939609) with T2DM, serum apelin and androgenic hormones in Iranian obese women. 197 obese women (123 women with T2DM and 74 women as healthy control) were participated in this study. Anthropometrical and biochemical characteristics were measured. Serum apelin and androgen hormones levels were determined in 66 subjects consisting of 33 cases and 33 controls. PCR were carried out and subsequently, the PCR production was genotyped by Sanger sequencing assay. Our observations showed that all SNPs are related to T2DM. The rs9926289 FTO variant had a strong association with serum apelin and dehydroepiandrosterone-sulfate levels (P=0.04 and P=0.03, respectively) among SNPs. In addition, apelin and androgenic hormones were correlated with T2DM. Two polymorphisms including rs9939609 and rs9926289 had a strong Linkage disequilibrium (r 2 =1). FTO variants not only were associated with T2DM, but also some variants had a strong association with apelin and androgenic hormones profile. Copyright © 2017 Elsevier B.V. All rights reserved.
Bertolini, Francesca; Scimone, Concetta; Geraci, Claudia; Schiavo, Giuseppina; Utzeri, Valerio Joe; Chiofalo, Vincenzo; Fontanesi, Luca
2015-01-01
Few studies investigated the donkey (Equus asinus) at the whole genome level so far. Here, we sequenced the genome of two male donkeys using a next generation semiconductor based sequencing platform (the Ion Proton sequencer) and compared obtained sequence information with the available donkey draft genome (and its Illumina reads from which it was originated) and with the EquCab2.0 assembly of the horse genome. Moreover, the Ion Torrent Personal Genome Analyzer was used to sequence reduced representation libraries (RRL) obtained from a DNA pool including donkeys of different breeds (Grigio Siciliano, Ragusano and Martina Franca). The number of next generation sequencing reads aligned with the EquCab2.0 horse genome was larger than those aligned with the draft donkey genome. This was due to the larger N50 for contigs and scaffolds of the horse genome. Nucleotide divergence between E. caballus and E. asinus was estimated to be ~ 0.52-0.57%. Regions with low nucleotide divergence were identified in several autosomal chromosomes and in the whole chromosome X. These regions might be evolutionally important in equids. Comparing Y-chromosome regions we identified variants that could be useful to track donkey paternal lineages. Moreover, about 4.8 million of single nucleotide polymorphisms (SNPs) in the donkey genome were identified and annotated combining sequencing data from Ion Proton (whole genome sequencing) and Ion Torrent (RRL) runs with Illumina reads. A higher density of SNPs was present in regions homologous to horse chromosome 12, in which several studies reported a high frequency of copy number variants. The SNPs we identified constitute a first resource useful to describe variability at the population genomic level in E. asinus and to establish monitoring systems for the conservation of donkey genetic resources. PMID:26151450
Bertolini, Francesca; Scimone, Concetta; Geraci, Claudia; Schiavo, Giuseppina; Utzeri, Valerio Joe; Chiofalo, Vincenzo; Fontanesi, Luca
2015-01-01
Few studies investigated the donkey (Equus asinus) at the whole genome level so far. Here, we sequenced the genome of two male donkeys using a next generation semiconductor based sequencing platform (the Ion Proton sequencer) and compared obtained sequence information with the available donkey draft genome (and its Illumina reads from which it was originated) and with the EquCab2.0 assembly of the horse genome. Moreover, the Ion Torrent Personal Genome Analyzer was used to sequence reduced representation libraries (RRL) obtained from a DNA pool including donkeys of different breeds (Grigio Siciliano, Ragusano and Martina Franca). The number of next generation sequencing reads aligned with the EquCab2.0 horse genome was larger than those aligned with the draft donkey genome. This was due to the larger N50 for contigs and scaffolds of the horse genome. Nucleotide divergence between E. caballus and E. asinus was estimated to be ~ 0.52-0.57%. Regions with low nucleotide divergence were identified in several autosomal chromosomes and in the whole chromosome X. These regions might be evolutionally important in equids. Comparing Y-chromosome regions we identified variants that could be useful to track donkey paternal lineages. Moreover, about 4.8 million of single nucleotide polymorphisms (SNPs) in the donkey genome were identified and annotated combining sequencing data from Ion Proton (whole genome sequencing) and Ion Torrent (RRL) runs with Illumina reads. A higher density of SNPs was present in regions homologous to horse chromosome 12, in which several studies reported a high frequency of copy number variants. The SNPs we identified constitute a first resource useful to describe variability at the population genomic level in E. asinus and to establish monitoring systems for the conservation of donkey genetic resources.
Steele, Katherine A; Quinton-Tulloch, Mark J; Amgai, Resham B; Dhakal, Rajeev; Khatiwada, Shambhu P; Vyas, Darshna; Heine, Martin; Witcombe, John R
2018-01-01
Few public sector rice breeders have the capacity to use NGS-derived markers in their breeding programmes despite rapidly expanding repositories of rice genome sequence data. They rely on > 18,000 mapped microsatellites (SSRs) for marker-assisted selection (MAS) using gel analysis. Lack of knowledge about target SNP and InDel variant loci has hampered the uptake by many breeders of Kompetitive allele-specific PCR (KASP), a proprietary technology of LGC genomics that can distinguish alleles at variant loci. KASP is a cost-effective single-step genotyping technology, cheaper than SSRs and more flexible than genotyping by sequencing (GBS) or array-based genotyping when used in selection programmes. Before this study, there were 2015 rice KASP marker loci in the public domain, mainly identified by array-based screening, leaving large proportions of the rice genome with no KASP coverage. Here we have addressed the urgent need for a wide choice of appropriate rice KASP assays and demonstrated that NGS can detect many more KASP to give full genome coverage. Through re-sequencing of nine indica rice breeding lines or released varieties, this study has identified 2.5 million variant sites. Stringent filtering of variants generated 1.3 million potential KASP assay designs, including 92,500 potential functional markers. This strategy delivers a 650-fold increase in potential selectable KASP markers at a density of 3.1 per 1 kb in the indica crosses analysed and 377,178 polymorphic KASP design sites on average per cross. This knowledge is available to breeders and has been utilised to improve the efficiency of public sector breeding in Nepal, enabling identification of polymorphic KASP at any region or quantitative trait loci in relevant crosses. Validation of 39 new KASP was carried out by genotyping progeny from a range of crosses to show that they detected segregating alleles. The new KASP have replaced SSRs to aid trait selection during marker-assisted backcrossing in these crosses, where target traits include rice blast and BLB resistance loci. Furthermore, we provide the software for plant breeders to generate KASP designs from their own datasets.
Bayesian reconstruction of transmission within outbreaks using genomic variants.
De Maio, Nicola; Worby, Colin J; Wilson, Daniel J; Stoesser, Nicole
2018-04-01
Pathogen genome sequencing can reveal details of transmission histories and is a powerful tool in the fight against infectious disease. In particular, within-host pathogen genomic variants identified through heterozygous nucleotide base calls are a potential source of information to identify linked cases and infer direction and time of transmission. However, using such data effectively to model disease transmission presents a number of challenges, including differentiating genuine variants from those observed due to sequencing error, as well as the specification of a realistic model for within-host pathogen population dynamics. Here we propose a new Bayesian approach to transmission inference, BadTrIP (BAyesian epiDemiological TRansmission Inference from Polymorphisms), that explicitly models evolution of pathogen populations in an outbreak, transmission (including transmission bottlenecks), and sequencing error. BadTrIP enables the inference of host-to-host transmission from pathogen sequencing data and epidemiological data. By assuming that genomic variants are unlinked, our method does not require the computationally intensive and unreliable reconstruction of individual haplotypes. Using simulations we show that BadTrIP is robust in most scenarios and can accurately infer transmission events by efficiently combining information from genetic and epidemiological sources; thanks to its realistic model of pathogen evolution and the inclusion of epidemiological data, BadTrIP is also more accurate than existing approaches. BadTrIP is distributed as an open source package (https://bitbucket.org/nicofmay/badtrip) for the phylogenetic software BEAST2. We apply our method to reconstruct transmission history at the early stages of the 2014 Ebola outbreak, showcasing the power of within-host genomic variants to reconstruct transmission events.
Ruggles, Kelly V; Tang, Zuojian; Wang, Xuya; Grover, Himanshu; Askenazi, Manor; Teubl, Jennifer; Cao, Song; McLellan, Michael D; Clauser, Karl R; Tabb, David L; Mertins, Philipp; Slebos, Robbert; Erdmann-Gilmore, Petra; Li, Shunqiang; Gunawardena, Harsha P; Xie, Ling; Liu, Tao; Zhou, Jian-Ying; Sun, Shisheng; Hoadley, Katherine A; Perou, Charles M; Chen, Xian; Davies, Sherri R; Maher, Christopher A; Kinsinger, Christopher R; Rodland, Karen D; Zhang, Hui; Zhang, Zhen; Ding, Li; Townsend, R Reid; Rodriguez, Henry; Chan, Daniel; Smith, Richard D; Liebler, Daniel C; Carr, Steven A; Payne, Samuel; Ellis, Matthew J; Fenyő, David
2016-03-01
Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations, and splice variants identified in cancer cells are translated. Herein, we apply a proteogenomic data integration tool (QUILTS) to illustrate protein variant discovery using whole genome, whole transcriptome, and global proteome datasets generated from a pair of luminal and basal-like breast-cancer-patient-derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS sample process replicates defined here as an independent tandem MS experiment using identical sample material. Despite analysis of over 30 sample process replicates, only about 10% of SNVs (somatic and germline) detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNVs without a detectable mRNA transcript were also observed, suggesting that transcriptome coverage was incomplete (∼80%). In contrast to germline variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than in the luminal tumor, raising the possibility of differential translation or protein degradation effects. In conclusion, this large-scale proteogenomic integration allowed us to determine the degree to which mutations are translated and identify gaps in sequence coverage, thereby benchmarking current technology and progress toward whole cancer proteome and transcriptome analysis. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Robinson, James; Guethlein, Lisbeth A; Cereb, Nezih; Yang, Soo Young; Norman, Paul J; Marsh, Steven G E; Parham, Peter
2017-06-01
HLA class I glycoproteins contain the functional sites that bind peptide antigens and engage lymphocyte receptors. Recently, clinical application of sequence-based HLA typing has uncovered an unprecedented number of novel HLA class I alleles. Here we define the nature and extent of the variation in 3,489 HLA-A, 4,356 HLA-B and 3,111 HLA-C alleles. This analysis required development of suites of methods, having general applicability, for comparing and analyzing large numbers of homologous sequences. At least three amino-acid substitutions are present at every position in the polymorphic α1 and α2 domains of HLA-A, -B and -C. A minority of positions have an incidence >1% for the 'second' most frequent nucleotide, comprising 70 positions in HLA-A, 85 in HLA-B and 54 in HLA-C. The majority of these positions have three or four alternative nucleotides. These positions were subject to positive selection and correspond to binding sites for peptides and receptors. Most alleles of HLA class I (>80%) are very rare, often identified in one person or family, and they differ by point mutation from older, more common alleles. These alleles with single nucleotide polymorphisms reflect the germ-line mutation rate. Their frequency predicts the human population harbors 8-9 million HLA class I variants. The common alleles of human populations comprise 42 core alleles, which represent all selected polymorphism, and recombinants that have assorted this polymorphism.
Cereb, Nezih; Yang, Soo Young; Marsh, Steven G. E.; Parham, Peter
2017-01-01
HLA class I glycoproteins contain the functional sites that bind peptide antigens and engage lymphocyte receptors. Recently, clinical application of sequence-based HLA typing has uncovered an unprecedented number of novel HLA class I alleles. Here we define the nature and extent of the variation in 3,489 HLA-A, 4,356 HLA-B and 3,111 HLA-C alleles. This analysis required development of suites of methods, having general applicability, for comparing and analyzing large numbers of homologous sequences. At least three amino-acid substitutions are present at every position in the polymorphic α1 and α2 domains of HLA-A, -B and -C. A minority of positions have an incidence >1% for the ‘second’ most frequent nucleotide, comprising 70 positions in HLA-A, 85 in HLA-B and 54 in HLA-C. The majority of these positions have three or four alternative nucleotides. These positions were subject to positive selection and correspond to binding sites for peptides and receptors. Most alleles of HLA class I (>80%) are very rare, often identified in one person or family, and they differ by point mutation from older, more common alleles. These alleles with single nucleotide polymorphisms reflect the germ-line mutation rate. Their frequency predicts the human population harbors 8–9 million HLA class I variants. The common alleles of human populations comprise 42 core alleles, which represent all selected polymorphism, and recombinants that have assorted this polymorphism. PMID:28650991
Butte, Nancy F; Voruganti, V Saroja; Cole, Shelley A; Haack, Karin; Comuzzie, Anthony G; Muzny, Donna M; Wheeler, David A; Chang, Kyle; Hawes, Alicia; Gibbs, Richard A
2011-09-22
Our objective was to resequence insulin receptor substrate 2 (IRS2) to identify variants associated with obesity- and diabetes-related traits in Hispanic children. Exonic and intronic segments, 5' and 3' flanking regions of IRS2 (∼14.5 kb), were bidirectionally sequenced for single nucleotide polymorphism (SNP) discovery in 934 Hispanic children using 3730XL DNA Sequencers. Additionally, 15 SNPs derived from Illumina HumanOmni1-Quad BeadChips were analyzed. Measured genotype analysis tested associations between SNPs and obesity and diabetes-related traits. Bayesian quantitative trait nucleotide analysis was used to statistically infer the most likely functional polymorphisms. A total of 140 SNPs were identified with minor allele frequencies (MAF) ranging from 0.001 to 0.47. Forty-two of the 70 coding SNPs result in nonsynonymous amino acid substitutions relative to the consensus sequence; 28 SNPs were detected in the promoter, 12 in introns, 28 in the 3'-UTR, and 2 in the 5'-UTR. Two insertion/deletions (indels) were detected. Ten independent rare SNPs (MAF = 0.001-0.009) were associated with obesity-related traits (P = 0.01-0.00002). SNP 10510452_139 in the promoter region was shown to have a high posterior probability (P = 0.77-0.86) of influencing BMI, fat mass, and waist circumference in Hispanic children. SNP 10510452_139 contributed between 2 and 4% of the population variance in body weight and composition. None of the SNPs or indels were associated with diabetes-related traits or accounted for a previously identified quantitative trait locus on chromosome 13 for fasting serum glucose. Rare but not common IRS2 variants may play a role in the regulation of body weight but not an essential role in fasting glucose homeostasis in Hispanic children.
Informative genomic microsatellite markers for efficient genotyping applications in sugarcane.
Parida, Swarup K; Kalia, Sanjay K; Kaul, Sunita; Dalal, Vivek; Hemaprabha, G; Selvi, Athiappan; Pandit, Awadhesh; Singh, Archana; Gaikwad, Kishor; Sharma, Tilak R; Srivastava, Prem Shankar; Singh, Nagendra K; Mohapatra, Trilochan
2009-01-01
Genomic microsatellite markers are capable of revealing high degree of polymorphism. Sugarcane (Saccharum sp.), having a complex polyploid genome requires more number of such informative markers for various applications in genetics and breeding. With the objective of generating a large set of microsatellite markers designated as Sugarcane Enriched Genomic MicroSatellite (SEGMS), 6,318 clones from genomic libraries of two hybrid sugarcane cultivars enriched with 18 different microsatellite repeat-motifs were sequenced to generate 4.16 Mb high-quality sequences. Microsatellites were identified in 1,261 of the 5,742 non-redundant clones that accounted for 22% enrichment of the libraries. Retro-transposon association was observed for 23.1% of the identified microsatellites. The utility of the microsatellite containing genomic sequences were demonstrated by higher primer designing potential (90%) and PCR amplification efficiency (87.4%). A total of 1,315 markers including 567 class I microsatellite markers were designed and placed in the public domain for unrestricted use. The level of polymorphism detected by these markers among sugarcane species, genera, and varieties was 88.6%, while cross-transferability rate was 93.2% within Saccharum complex and 25% to cereals. Cloning and sequencing of size variant amplicons revealed that the variation in the number of repeat-units was the main source of SEGMS fragment length polymorphism. High level of polymorphism and wide range of genetic diversity (0.16-0.82 with an average of 0.44) assayed with the SEGMS markers suggested their usefulness in various genotyping applications in sugarcane.
Tan, Wei; Dean, Michael; Law, Amanda J.
2010-01-01
ErbB4 is a growth factor receptor tyrosine kinase essential for neurodevelopment. Genetic variation in ErbB4 is associated with schizophrenia and risk-associated polymorphisms predict overexpression of ErbB4 CYT-1 isoforms in the brain in the disorder. The molecular mechanism of association is unclear because the polymorphisms flank exon 3 of the gene and reside 700 kb distal to the CYT-1 defining exon. We hypothesized that the polymorphisms are indirectly associated with ErbB4 CYT-1 via splicing of exon 3 on the CYT-1 background. We report via cloning and sequencing of adult and fetal human brain cDNA libraries the identification of novel splice isoforms of ErbB4, whereby exon 3 is skipped (del.3). ErbB4 del.3 transcripts exist as CYT-2 isoforms and are predicted to produce truncated proteins. Furthermore, our data refine the structure of the human ErbB4 gene, clarify that juxtamembrane (JM) splice variants of ErbB4, JM-a and JM-b respectively, are characterized by the replacement of a 75 nucleotide (nt) sequence with a 45-nt insertion, and demonstrate that there are four alternative exons in the gene. Our analyses reveal that novel splice variants of ErbB4 exist in the developing and adult human brain and, given the failure to identify ErbB4 del.3 CYT-1 transcripts, suggest that the association of risk polymorphisms in the ErbB4 gene with CYT-1 transcript levels is not mediated via an exon 3 splicing event. PMID:20886074
Galan, Maxime; Guivier, Emmanuel; Caraux, Gilles; Charbonnel, Nathalie; Cosson, Jean-François
2010-05-11
High-throughput sequencing technologies offer new perspectives for biomedical, agronomical and evolutionary research. Promising progresses now concern the application of these technologies to large-scale studies of genetic variation. Such studies require the genotyping of high numbers of samples. This is theoretically possible using 454 pyrosequencing, which generates billions of base pairs of sequence data. However several challenges arise: first in the attribution of each read produced to its original sample, and second, in bioinformatic analyses to distinguish true from artifactual sequence variation. This pilot study proposes a new application for the 454 GS FLX platform, allowing the individual genotyping of thousands of samples in one run. A probabilistic model has been developed to demonstrate the reliability of this method. DNA amplicons from 1,710 rodent samples were individually barcoded using a combination of tags located in forward and reverse primers. Amplicons consisted in 222 bp fragments corresponding to DRB exon 2, a highly polymorphic gene in mammals. A total of 221,789 reads were obtained, of which 153,349 were finally assigned to original samples. Rules based on a probabilistic model and a four-step procedure, were developed to validate sequences and provide a confidence level for each genotype. The method gave promising results, with the genotyping of DRB exon 2 sequences for 1,407 samples from 24 different rodent species and the sequencing of 392 variants in one half of a 454 run. Using replicates, we estimated that the reproducibility of genotyping reached 95%. This new approach is a promising alternative to classical methods involving electrophoresis-based techniques for variant separation and cloning-sequencing for sequence determination. The 454 system is less costly and time consuming and may enhance the reliability of genotypes obtained when high numbers of samples are studied. It opens up new perspectives for the study of evolutionary and functional genetics of highly polymorphic genes like major histocompatibility complex genes in vertebrates or loci regulating self-compatibility in plants. Important applications in biomedical research will include the detection of individual variation in disease susceptibility. Similarly, agronomy will benefit from this approach, through the study of genes implicated in productivity or disease susceptibility traits.
Naturally selected hepatitis C virus polymorphisms confer broad neutralizing antibody resistance.
Bailey, Justin R; Wasilewski, Lisa N; Snider, Anna E; El-Diwany, Ramy; Osburn, William O; Keck, Zhenyong; Foung, Steven K H; Ray, Stuart C
2015-01-01
For hepatitis C virus (HCV) and other highly variable viruses, broadly neutralizing mAbs are an important guide for vaccine development. The development of resistance to anti-HCV mAbs is poorly understood, in part due to a lack of neutralization testing against diverse, representative panels of HCV variants. Here, we developed a neutralization panel expressing diverse, naturally occurring HCV envelopes (E1E2s) and used this panel to characterize neutralizing breadth and resistance mechanisms of 18 previously described broadly neutralizing anti-HCV human mAbs. The observed mAb resistance could not be attributed to polymorphisms in E1E2 at known mAb-binding residues. Additionally, hierarchical clustering analysis of neutralization resistance patterns revealed relationships between mAbs that were not predicted by prior epitope mapping, identifying 3 distinct neutralization clusters. Using this clustering analysis and envelope sequence data, we identified polymorphisms in E2 that confer resistance to multiple broadly neutralizing mAbs. These polymorphisms, which are not at mAb contact residues, also conferred resistance to neutralization by plasma from HCV-infected subjects. Together, our method of neutralization clustering with sequence analysis reveals that polymorphisms at noncontact residues may be a major immune evasion mechanism for HCV, facilitating viral persistence and presenting a challenge for HCV vaccine development.
El Hamouchi, Adil; El Kacem, Sofia; Ejghal, Rajaa; Lemrani, Meryem
2018-06-14
Leishmania infantum is the causative agent of human visceral leishmaniasis (VL) and sporadic human cutaneous leishmaniasis (CL) in the Mediterranean region. The genetic variation of the Leishmania parasites may result in different phenotypes that can be associated with the geographical distribution and diversity of the clinical manifestations. The main objective of this study was to explore the genetic polymorphism in L. infantum isolates from human and animal hosts in different regions of Morocco. The intraspecific genetic variability of 40 Moroccan L. infantum MON-1 strains isolated from patients with VL (n = 31) and CL (n = 2) and from dogs (n = 7) was evaluated by PCR-RFLP of nagt, a single-copy gene encoding N-acetylglucosamine-1-phosphate transferase. For a more complete analysis of L. infantum polymorphism, we included the restriction patterns of nagt from 17 strains available in the literature and patterns determined by in-silico digestion of three sequences from the GenBank database. Moroccan L. infantum strains presented a certain level of genetic diversity and six distinct nagt-RFLP genotypes were identified. Three of the six genotypes were exclusively identified in the Moroccan population of L. infantum: variant M1 (15%), variant M2 (7.5%), and variant M3 (2.5%). The most common genotype (65%), variant 2 (2.5%), and variant 4 (7.5%), were previously described in several countries with endemic leishmaniasis. Phylogenetic analysis segregated our L. infantum population into two distinct clusters, whereas variant M2 was clearly distinguished from both cluster I and cluster II. This distribution highlights the degree of genetic variability among the Moroccan L. infantum population. The nagt PCR-RFLP method presented here showed an important genetic heterogeneity among Moroccan L. infantum strains isolated from human and canine reservoirs with 6 genotypes identified. Three of the six Moroccan nagt genotypes, have not been previously described and support the particular genetic diversity of the Moroccan L. infantum population reported in other studies.
Genovar: a detection and visualization tool for genomic variants.
Jung, Kwang Su; Moon, Sanghoon; Kim, Young Jin; Kim, Bong-Jo; Park, Kiejung
2012-05-08
Along with single nucleotide polymorphisms (SNPs), copy number variation (CNV) is considered an important source of genetic variation associated with disease susceptibility. Despite the importance of CNV, the tools currently available for its analysis often produce false positive results due to limitations such as low resolution of array platforms, platform specificity, and the type of CNV. To resolve this problem, spurious signals must be separated from true signals by visual inspection. None of the previously reported CNV analysis tools support this function and the simultaneous visualization of comparative genomic hybridization arrays (aCGH) and sequence alignment. The purpose of the present study was to develop a useful program for the efficient detection and visualization of CNV regions that enables the manual exclusion of erroneous signals. A JAVA-based stand-alone program called Genovar was developed. To ascertain whether a detected CNV region is a novel variant, Genovar compares the detected CNV regions with previously reported CNV regions using the Database of Genomic Variants (DGV, http://projects.tcag.ca/variation) and the Single Nucleotide Polymorphism Database (dbSNP). The current version of Genovar is capable of visualizing genomic data from sources such as the aCGH data file and sequence alignment format files. Genovar is freely accessible and provides a user-friendly graphic user interface (GUI) to facilitate the detection of CNV regions. The program also provides comprehensive information to help in the elimination of spurious signals by visual inspection, making Genovar a valuable tool for reducing false positive CNV results. http://genovar.sourceforge.net/.
Kang, Ho-Jin; Song, Im-Sook; Shin, Ho Jung; Kim, Woo-Young; Lee, Choong-Hee; Shim, Joo-Cheol; Zhou, Hong-Hao; Lee, Sang Seop; Shin, Jae-Gook
2007-04-01
Genetic variants of three human organic cation transporter genes (hOCTs) were extensively explored in a Korean population. The functional changes of hOCT2 variants were evaluated in vitro, and those genetic polymorphisms of hOCTs were compared among different ethnic populations. From direct DNA sequencing, 7 of 13 coding variants were nonsynonymous single-nucleotide polymorphisms (SNPs), including four variants from hOCT1 (F160L, P283L, P341L, and M408V) and three from hOCT2 (T199I, T201M, and A270S), whereas 6 were synonymous SNPs. The linkage disequilibrium analysis presented for three independent LD blocks for each hOCT gene showed no significant linkage among all three hOCT genes. The transporter activities of MDCK cells that overexpress the hOCT2-T199I, -T201M, and -A270S variants showed significantly decreased uptake of [(3)H]methyl-4-phenylpyridinium acetate (MPP(+)) or [(14)C]tetraethylammonium compared with those cells that overexpress wild-type hOCT2, and the estimated kinetic parameters of these variants for [(3)H]MPP(+) uptake in oocytes showed a 2- to 5-fold increase in K(m) values and a 10- to 20-fold decrease in V(max) values. The allele frequencies of the five functional variants hOCT1-P283L, -P341L, and hOCT2-T199I, -T201M, and -A270S were 1.3, 17, 0.7, 0.7, and 11%, respectively, in a Korean population; the frequency distributions of these variants were not significantly different from those of Chinese and Vietnamese populations. These findings suggest that genetic variants of hOCTs are not linked among three genes in a Korean population, and several of the hOCT genetic variants cause decreased transport activity in vitro compared with the wild type, although the clinical relevance of these variants remains to be evaluated.
Ait-Arkoub, Zaïna; Voujon, Delphine; Deback, Claire; Abrao, Emiliana P.; Agut, Henri; Boutolleau, David
2013-01-01
The complete 154-kbp linear double-stranded genomic DNA sequence of herpes simplex virus 2 (HSV-2), consisting of two extended regions of unique sequences bounded by a pair of inverted repeat elements, was published in 1998 and since then has been widely employed in a wide range of studies. Throughout the HSV-2 genome are scattered 150 microsatellites (also referred to as short tandem repeats) of 1- to 6-nucleotide motifs, mainly distributed in noncoding regions. Microsatellites are considered reliable markers for genetic mapping to differentiate herpesvirus strains, as shown for cytomegalovirus and HSV-1. The aim of this work was to characterize 12 polymorphic microsatellites within the HSV-2 genome by use of 3 multiplex PCR assays in combination with length polymorphism analysis for the rapid genetic differentiation of 56 HSV-2 clinical isolates and 2 HSV-2 laboratory strains (gHSV-2 and MS). This new system was applied to a specific new HSV-2 variant recently identified in HIV-1-infected patients originating from West Africa. Our results confirm that microsatellite polymorphism analysis is an accurate tool for studying the epidemiology of HSV-2 infections. PMID:23966512
Sahana, G; Guldbrandtsen, B; Thomsen, B; Holm, L-E; Panitz, F; Brøndum, R F; Bendixen, C; Lund, M S
2014-11-01
Mastitis is a mammary disease that frequently affects dairy cattle. Despite considerable research on the development of effective prevention and treatment strategies, mastitis continues to be a significant issue in bovine veterinary medicine. To identify major genes that affect mastitis in dairy cattle, 6 chromosomal regions on Bos taurus autosome (BTA) 6, 13, 16, 19, and 20 were selected from a genome scan for 9 mastitis phenotypes using imputed high-density single nucleotide polymorphism arrays. Association analyses using sequence-level variants for the 6 targeted regions were carried out to map causal variants using whole-genome sequence data from 3 breeds. The quantitative trait loci (QTL) discovery population comprised 4,992 progeny-tested Holstein bulls, and QTL were confirmed in 4,442 Nordic Red and 1,126 Jersey cattle. The targeted regions were imputed to the sequence level. The highest association signal for clinical mastitis was observed on BTA 6 at 88.97 Mb in Holstein cattle and was confirmed in Nordic Red cattle. The peak association region on BTA 6 contained 2 genes: vitamin D-binding protein precursor (GC) and neuropeptide FF receptor 2 (NPFFR2), which, based on known biological functions, are good candidates for affecting mastitis. However, strong linkage disequilibrium in this region prevented conclusive determination of the causal gene. A different QTL on BTA 6 located at 88.32 Mb in Holstein cattle affected mastitis. In addition, QTL on BTA 13 and 19 were confirmed to segregate in Nordic Red cattle and QTL on BTA 16 and 20 were confirmed in Jersey cattle. Although several candidate genes were identified in these targeted regions, it was not possible to identify a gene or polymorphism as the causal factor for any of these regions. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Genotype Diversity and Distribution of Orientia tsutsugamushi Causing Scrub Typhus in Thailand
2011-07-01
typhus assay and vaccine development. Orientia tsutsugamushi, formerly known as Rickettsia tsutsug- amushi, is the causative agent of scrub typhus, a...Sunderland, MA. 13. Horinoucbi, H., et al. 1996. Genotypic identification of Rickettsia tsutsuga- mushi by restriction fragment length polymorphism... Rickettsia tsutsu· gamushi. Sequence and comparative analyses of the genes encoding TSA homologues from four antigenic variants. J. Bioi. Chern. 267:12728
Stevenson, M; Haggerty, S; Lamonica, C; Mann, A M; Meier, C; Wasiak, A
1990-01-01
The phenomenon of interference was exploited to isolate low-abundance noncytopathic human immunodeficiency virus type 1 (HIV-1) variants from a primary HIV-1 isolate from an asymptomatic HIV-1-seropositive hemophiliac. Successive rounds of virus infection of a cytolysis-susceptible CD4+ cell line and isolation of surviving cells resulted in selective amplification of an HIV-1 variant reduced in the ability to induce cytolysis. The presence of a PvuII polymorphism facilitated subsequent amplification and cloning of cytopathic and noncytopathic HIV-1 variants from the primary isolate. Cloned virus stocks from cytopathic and noncytopathic variants exhibited similar replication kinetics, infectivity, and syncytium induction in susceptible host cells. The noncytopathic HIV-1 variant was unable, however, to induce single-cell killing in susceptible host cells. Construction of viral hybrids in which regions of cytopathic and noncytopathic variants were exchanged indicated that determinants for the noncytopathic phenotype map to the envelope glycoprotein. Sequence analysis of the envelope coding regions indicated the absence of two highly conserved N-linked glycosylation sites in the noncytopathic HIV-1 variant, which accompanied differences in processing of precursor gp160 envelope glycoprotein. These results demonstrate that determinants for syncytium-independent single-cell killing are located within the envelope glycoprotein and suggest that single-cell killing is profoundly influenced by alterations in envelope sequence which affect posttranslational processing of HIV-1 envelope glycoprotein within the infected cell. Images PMID:1695254
G23D: Online tool for mapping and visualization of genomic variants on 3D protein structures.
Solomon, Oz; Kunik, Vered; Simon, Amos; Kol, Nitzan; Barel, Ortal; Lev, Atar; Amariglio, Ninette; Somech, Raz; Rechavi, Gidi; Eyal, Eran
2016-08-26
Evaluation of the possible implications of genomic variants is an increasingly important task in the current high throughput sequencing era. Structural information however is still not routinely exploited during this evaluation process. The main reasons can be attributed to the partial structural coverage of the human proteome and the lack of tools which conveniently convert genomic positions, which are the frequent output of genomic pipelines, to proteins and structure coordinates. We present G23D, a tool for conversion of human genomic coordinates to protein coordinates and protein structures. G23D allows mapping of genomic positions/variants on evolutionary related (and not only identical) protein three dimensional (3D) structures as well as on theoretical models. By doing so it significantly extends the space of variants for which structural insight is feasible. To facilitate interpretation of the variant consequence, pathogenic variants, functional sites and polymorphism sites are displayed on protein sequence and structure diagrams alongside the input variants. G23D also provides modeling of the mutant structure, analysis of intra-protein contacts and instant access to functional predictions and predictions of thermo-stability changes. G23D is available at http://www.sheba-cancer.org.il/G23D . G23D extends the fraction of variants for which structural analysis is applicable and provides better and faster accessibility for structural data to biologists and geneticists who routinely work with genomic information.
Petrat-Melin, B; Andersen, P; Rasmussen, J T; Poulsen, N A; Larsen, L B; Young, J F
2015-01-01
Genetic polymorphisms of bovine milk proteins affect the protein profile of the milk and, hence, certain technological properties, such as casein (CN) number and cheese yield. However, reports show that such polymorphisms may also affect the health-related properties of milk. Therefore, to gain insight into their digestion pattern and bioactive potential, β-CN was purified from bovine milk originating from cows homozygous for the variants A(1), A(2), B, and I by a combination of cold storage, ultracentrifugation, and acid precipitation. The purity of the isolated β-CN was determined by HPLC, variants were verified by mass spectrometry, and molar extinction coefficients at λ=280nm were determined. β-Casein from each of the variants was subjected to in vitro digestion using pepsin and pancreatic enzymes. Antioxidant and angiotensin-converting enzyme (ACE) inhibitory capacities of the hydrolysates were assessed at 3 stages of digestion and related to that of the undigested samples. Neither molar extinction coefficients nor overall digestibility varied significantly between these 4 variants; however, clear differences in digestion pattern were indicated by gel electrophoresis. In particular, after 60min of pepsin followed by 5min of pancreatic enzyme digestion, one ≈4kDa peptide with the N-terminal sequence (106)H-K-E-M-P-F-P-K- was absent from β-CN variant B. This is likely a result of the (122)Ser to (122)Arg substitution in variant B introducing a novel trypsin cleavage site, leading to the changed digestion pattern. All investigated β-CN variants exhibited a significant increase in antioxidant capacity upon digestion, as measured by the Trolox-equivalent antioxidant capacity assay. After 60min of pepsin + 120min of pancreatic enzyme digestion, the accumulated increase in antioxidant capacity was ≈1.7-fold for the 4 β-CN variants. The ACE inhibitory capacity was also significantly increased by digestion, with the B variant reaching the highest inhibitory capacity at the end of digestion (60min of pepsin + 120min of pancreatic enzymes), possibly because of the observed alternative digestion pattern. These results demonstrate that genetic polymorphisms affect the digestion pattern and bioactivity of milk proteins. Moreover, their capacity for radical scavenging and ACE inhibition is affected by digestion. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Association of α-, β-, and γ-Synuclein With Diffuse Lewy Body Disease
Nishioka, Kenya; Wider, Christian; Vilariño-Güell, Carles; Soto-Ortolaza, Alexandra I.; Lincoln, Sarah J.; Kachergus, Jennifer M.; Jasinska-Myga, Barbara; Ross, Owen A.; Rajput, Alex; Robinson, Christopher A.; Ferman, Tanis J.; Wszolek, Zbigniew K.; Dickson, Dennis W.; Farrer, Matthew J.
2016-01-01
Objective To determine the association of the genes that encode α-, β-, and γ-synuclein (SNCA, SNCB, and SNCG, respectively) with diffuse Lewy body disease (DLBD). Design Case-control study. Subjects A total of 172 patients with DLBD consistent with a clinical diagnosis of Parkinson disease dementia/dementia with Lewy bodies and 350 clinically and 97 pathologically normal controls. Interventions Sequencing of SNCA, SNCB, and SNCG and genotyping of single-nucleotide polymorphisms performed on an Applied Biosystems capillary sequencer and a Sequenom MassArray pLEX platform, respectively. Associations were determined using χ2 or Fisher exact tests. Results Initial sequencing studies of the coding regions of each gene in 89 patients with DLBD did not detect any pathogenic substitutions. Nevertheless, genotyping of known polymorphic variability in sequence-conserved regions detected several single-nucleotide polymorphisms in the SNCA and SNCG genes that were significantly associated with disease (P=.05 to <.001). Significant association was also observed for 3 single-nucleotide polymorphisms located in SNCB when comparing DLBD cases and pathologically confirmed normal controls (P=.03-.01); however, this association was not significant for the clinical controls alone or the combined clinical and pathological controls (P>.05). After correction for multiple testing, only 1 single-nucleotide polymorphism in SNCG (rs3750823) remained significant in all of the analyses (P=.05-.009). Conclusion These findings suggest that variants in all 3 members of the synuclein gene family, particularly SNCA and SNCG, affect the risk of developing DLBD and warrant further investigation in larger, pathologically defined data sets as well as clinically diagnosed Parkinson disease/dementia with Lewy bodies case-control series. PMID:20697047
Stam, Remco; Scheikl, Daniela; Tellier, Aurélien
2016-06-02
Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Systematic screening for CYP3A4 genetic polymorphisms in a Han Chinese population.
Hu, Guo-Xin; Dai, Da-Peng; Wang, Hao; Huang, Xiang-Xin; Zhou, Xiao-Yang; Cai, Jie; Chen, Hao; Cai, Jian-Ping
2017-03-01
To systematically investigate the genetic polymorphisms of the CYP3A4 gene in a Han Chinese population. The promoter and exons of CYP3A4 gene in 1114 unrelated, healthy Han Chinese subjects were amplified and genotyped by direct sequencing. In total, five previously reported alleles (*1G, *4, *5, *18B and *23) were detected, of which one allele (*23) was reported for the first time in Han Chinese population. Additionally, seven novel exonic variants were also identified and designated as new alleles CYP3A4*28-*34. This study provides the most comprehensive data of CYP3A4 polymorphisms in Han Chinese population and detects the largest number of novel CYP3A4 alleles in one ethnic group.
Genomics in Cardiovascular Disease
Roberts, Robert; Marian, A.J.; Dandona, Sonny; Stewart, Alexandre F.R.
2013-01-01
A paradigm shift towards biology occurred in the 1990’s subsequently catalyzed by the sequencing of the human genome in 2000. The cost of DNA sequencing has gone from millions to thousands of dollars with sequencing of one’s entire genome costing only $1,000. Rapid DNA sequencing is being embraced for single gene disorders, particularly for sporadic cases and those from small families. Transmission of lethal genes such as associated with Huntington’s disease can, through in-vitro fertilization, avoid passing it on to one’s offspring. DNA sequencing will meet the challenge of elucidating the genetic predisposition for common polygenic diseases, especially in determining the function of the novel common genetic risk variants and identifying the rare variants, which may also partially ascertain the source of the missing heritability. The challenge for DNA sequencing remains great, despite human genome sequences being 99.5% identical, the 3 million single nucleotide polymorphisms (SNPs) responsible for most of the unique features add up to 60 new mutations per person which, for 7 billion people, is 420 billion mutations. It is claimed that DNA sequencing has increased 10,000 fold while information storage and retrieval only 16 fold. The physician and health user will be challenged by the convergence of two major trends, whole genome sequencing and the storage/retrieval and integration of the data. PMID:23524054
A novel variant of aquaporin 3 is expressed in killifish (Fundulus heteroclitus) intestine
Jung, Dawoon; Adamo, Meredith A.; Lehman, Rebecca M.; Barnaby, Roxanna; Jackson, Craig E.; Jackson, Brian P.; Shaw, Joseph R.; Stanton, Bruce A.
2015-01-01
Killifish (Fundulus heteroclitus) are euryhaline teleosts that are widely used in environmental and toxicological studies, and they are tolerant to arsenic, in part due to very low assimilation of arsenic from the environment. The mechanism of arsenic uptake by the intestine, a major route of arsenic uptake in humans is unknown. Thus, the goal of this study was to determine if aquaglyceroporins (AQP), which transport water and other small molecules including arsenite across cell membranes, are expressed in the killifish intestine, and whether AQP expression is affected by osmotic stress. Through RT-PCR and sequence analysis of PCR amplicons, we demonstrated that the intestine expresses kfAQP3a and kfAQP3b, two previously identified variants, and also identified a novel variant of killifish AQP3 (kfAQP3c) in the intestine. The variants likely represent alternate splice forms. A BLAST search of the F. heteroclitus reference genome revealed that the AQP3 gene resides on a single locus, while an alignment of the AQP3 sequence among 384 individuals from eight population ranging from Rhode Island to North Carolina revealed that its coding sequence was remarkably conserved with no fixed polymorphism residing in the region that distinguishes these variants. We further demonstrate that the novel variant transports arsenite into HEK293T cells. Whereas kfAQP3a, which does not transport arsenite, was expressed in both freshwater (FW) and saltwater (SW) acclimated fish, kfAQP3b, an arsenic transporter, was expressed only in FW acclimated fish, and kfAQP3c was expressed only in SW acclimated fish. Thus, we have identified a novel, putative splice variant of kfAQP3, kfAQP3c, which transports arsenic and is expressed only in SW acclimated fish. PMID:25766383
Demonstration of Protein-Based Human Identification Using the Hair Shaft Proteome
Leppert, Tami; Anex, Deon S.; Hilmer, Jonathan K.; Matsunami, Nori; Baird, Lisa; Stevens, Jeffery; Parsawar, Krishna; Durbin-Johnson, Blythe P.; Rocke, David M.; Nelson, Chad; Fairbanks, Daniel J.; Wilson, Andrew S.; Rice, Robert H.; Woodward, Scott R.; Bothner, Brian; Hart, Bradley R.; Leppert, Mark
2016-01-01
Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 single nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). This study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts. PMID:27603779
Leyland-Jones, Brian; Gray, Kathryn P; Abramovitz, Mark; Bouzyk, Mark; Young, Brandon; Long, Bradley; Kammler, Roswitha; Dell'Orto, Patrizia; Biasi, Maria Olivia; Thürlimann, Beat; Harvey, Vernon; Neven, Patrick; Arnould, Laurent; Maibach, Rudolf; Price, Karen N; Coates, Alan S; Goldhirsch, Aron; Gelber, Richard D; Pagani, Olivia; Viale, Giuseppe; Rae, James M; Regan, Meredith M
2015-12-01
Estrogen receptor 1 (ESR1) and ESR2 gene polymorphisms have been associated with endocrine-mediated physiological mechanisms, and inconsistently with breast cancer risk and outcomes, bone mineral density changes, and hot flushes/night sweats. DNA was isolated and genotyped for six ESR1 and two ESR2 single-nucleotide polymorphisms (SNPs) from tumor specimens from 3691 postmenopausal women with hormone receptor-positive breast cancer enrolled in the BIG 1-98 trial to receive tamoxifen and/or letrozole for 5 years. Associations with recurrence and adverse events (AEs) were assessed using Cox proportional hazards models. 3401 samples were successfully genotyped for five SNPs. ESR1 rs9340799(XbaI) (T>C) variants CC or TC were associated with reduced breast cancer risk (HR = 0.82,95% CI = 0.67-1.0), and ESR1 rs2077647 (T>C) variants CC or TC was associated with reduced distant recurrence risk (HR = 0.69, 95% CI = 0.53-0.90), both regardless of the treatments. No differential treatment effects (letrozole vs. tamoxifen) were observed for the association of outcome with any of the SNPs. Letrozole-treated patients with rs2077647 (T>C) variants CC and TC had a reduced risk of bone AE (HR = 0.75, 95% CI = 0.58-0.98, P interaction = 0.08), whereas patients with rs4986938 (G>A) genotype variants AA and AG had an increased risk of bone AE (HR = 1.37, 95% CI = 1.01-1.84, P interaction = 0.07). We observed that (1) rare ESR1 homozygous polymorphisms were associated with lower recurrence, and (2) ESR1 and ESR2 SNPs were associated with bone AEs in letrozole-treated patients. Genes that are involved in estrogen signaling and synthesis have the potential to affect both breast cancer recurrence and side effects, suggesting that individual treatment strategies can incorporate not only oncogenic drivers but also SNPs related to estrogen activity.
Li, Yanwei; Kang, Xing; Yang, Ge; Dai, Penggao; Chen, Chao; Wang, Huijuan
2016-09-01
CYP2W1 is an orphan member of the cytochrome P450 superfamily. Recently, CYP2W1 has gained great research interest because of its unknown enzymatic function and tumor-specific expression property. This study aims to investigate the genetic polymorphisms of the CYP2W1 gene in Chinese populations and explore the functions of the detected variants. All of the nine exons and exon-intron junction regions of the CYP2W1 gene were sequenced in 150 Chinese subjects, including 50 Han Chinese, 50 Tibetans, and 50 Uighurs. A total of 26 genetic variants were identified in this study, and 19 polymorphisms were detected in each population. Frequency comparison between populations showed that nine variants exhibited significantly different allelic distributions. A total of 12 different haplotypes were inferred from 150 samples by using the genotype data of nine exonic variants found in this study. CYP2W1*1A, *1B, *2, *4, and *6 were detected as the main alleles/haplotypes. Moreover, one, three, and two ethnically specific haplotypes were observed in the Han, Tibetan, and Uighur samples, respectively. Then, the effects of four detected missense mutations (Ala181Thr, Gly376Ser, Val432Ile, and Pro488Leu) on the CYP2W1 protein function were predicted using three in silico tools: Polymorphism Phenotyping v2, Sorts Intolerant from Tolerant, and MutationTaster. The results showed that Gly376Ser and Pro488Leu may have deleterious effects. In summary, this study showed that the genetic pattern of CYP2W1 is interethnically different among the three Chinese populations, and this finding can extend our understanding of population genetics of CYP2W1 in the Chinese population. Copyright © 2016 by The American Society for Pharmacology and Experimental Therapeutics.
High resolution identity testing of inactivated poliovirus vaccines.
Mee, Edward T; Minor, Philip D; Martin, Javier
2015-07-09
Definitive identification of poliovirus strains in vaccines is essential for quality control, particularly where multiple wild-type and Sabin strains are produced in the same facility. Sequence-based identification provides the ultimate in identity testing and would offer several advantages over serological methods. We employed random RT-PCR and high throughput sequencing to recover full-length genome sequences from monovalent and trivalent poliovirus vaccine products at various stages of the manufacturing process. All expected strains were detected in previously characterised products and the method permitted identification of strains comprising as little as 0.1% of sequence reads. Highly similar Mahoney and Sabin 1 strains were readily discriminated on the basis of specific variant positions. Analysis of a product known to contain incorrect strains demonstrated that the method correctly identified the contaminants. Random RT-PCR and shotgun sequencing provided high resolution identification of vaccine components. In addition to the recovery of full-length genome sequences, the method could also be easily adapted to the characterisation of minor variant frequencies and distinction of closely related products on the basis of distinguishing consensus and low frequency polymorphisms. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.
A short review of variants calling for single-cell-sequencing data with applications.
Wei, Zhuohui; Shu, Chang; Zhang, Changsheng; Huang, Jingying; Cai, Hongmin
2017-11-01
The field of single-cell sequencing is fleetly expanding, and many techniques have been developed in the past decade. With this technology, biologists can study not only the heterogeneity between two adjacent cells in the same tissue or organ, but also the evolutionary relationships and degenerative processes in a single cell. Calling variants is the main purpose in analyzing single cell sequencing (SCS) data. Currently, some popular methods used for bulk-cell-sequencing data analysis are tailored directly to be applied in dealing with SCS data. However, SCS requires an extra step of genome amplification to accumulate enough quantity for satisfying sequencing needs. The amplification yields large biases and thus raises challenge for using the bulk-cell-sequencing methods. In order to provide guidance for the development of specialized analyzed methods as well as using currently developed tools for SNS, this paper aims to bridge the gap. In this paper, we firstly introduced two popular genome amplification methods and compared their capabilities. Then we introduced a few popular models for calling single-nucleotide polymorphisms and copy-number variations. Finally, break-through applications of SNS were summarized to demonstrate its potential in researching cell evolution. Copyright © 2017 Elsevier Ltd. All rights reserved.
Habibi, Imen; Sfar, Imen; Kort, Fedra; Bouraoui, Rim; Chebil, Ahmed; Limaiem, Rim; Ayed, Saloua; Ben Abdallah, Taïeb; El Matri, Leila; Gorgi, Yousr
2017-04-01
Purpose To explore the association between the polymorphism (S/F) p.R102G in the complement component 3 ( C3 ) gene and age-related macular degeneration (AMD) in a Tunisian population. Methods The molecular study was performed by polymerase chain reaction using sequence-specific primers (PCR-SSP) in 207 control subjects free of any eye disease (fundus normal) and 145 patients with exudative AMD. The CH50 activity and quantification of C3 and C4 have been made by technical home method and nephelometry, respectively. Results The prevalence of C3 GG genotype polymorphism was significantly higher in AMD patients compared to controls (OR: 2.41, IC 95% [1.90-3.05], p = 0.0007). However, no correlation was found between this allelic variant and the type of neovascularization. Similarly, there is no association between this polymorphism and the presence of functional and/or quantitative hypocomplementemia. Conclusions The C3 GG genotype of the gene could be a susceptibility factor for AMD in the Tunisian population. However, it does not seem to influence the clinical profile of the disease. Georg Thieme Verlag KG Stuttgart · New York.
Sherpas share genetic variations with Tibetans for high-altitude adaptation.
Bhandari, Sushil; Zhang, Xiaoming; Cui, Chaoying; Yangla; Liu, Lan; Ouzhuluobu; Baimakangzhuo; Gonggalanzi; Bai, Caijuan; Bianba; Peng, Yi; Zhang, Hui; Xiang, Kun; Shi, Hong; Liu, Shiming; Gengdeng; Wu, Tianyi; Qi, Xuebin; Su, Bing
2017-01-01
Sherpas, a highlander population living in Khumbu region of Nepal, are well known for their superior climbing ability in Himalayas. However, the genetic basis of their adaptation to high-altitude environments remains elusive. We collected DNA samples of 582 Sherpas from Nepal and Tibetan Autonomous Region of China, and we measured their hemoglobin levels and degrees of blood oxygen saturation. We genotyped 29 EPAS1 SNPs, two EGLN1 SNPs and the TED polymorphism (3.4 kb deletion) in Sherpas. We also performed genetic association analysis among these sequence variants with phenotypic data. We found similar allele frequencies on the tested 32 variants of these genes in Sherpas and Tibetans. Sherpa individuals carrying the derived alleles of EPAS1 (rs113305133, rs116611511 and rs12467821), EGLN1 (rs186996510 and rs12097901) and TED have lower hemoglobin levels when compared with those wild-type allele carriers. Most of the EPAS1 variants showing significant association with hemoglobin levels in Tibetans were replicated in Sherpas. The shared sequence variants and hemoglobin trait between Sherpas and Tibetans indicate a shared genetic basis for high-altitude adaptation, consistent with the proposal that Sherpas are in fact a recently derived population from Tibetans and they inherited adaptive variants for high-altitude adaptation from their Tibetan ancestors.
Dardano, Angela; Falzoni, Simonetta; Caraccio, Nadia; Polini, Antonio; Tognini, Sara; Solini, Anna; Berti, Piero; Di Virgilio, Francesco; Monzani, Fabio
2009-02-01
The modulation of the purinergic receptor P2X7 may be implicated in human carcinogenesis. The 1513A>C and 489C>T polymorphisms of P2X7R gene induce loss of function and gain of function, respectively. The aim of the study was to assess the frequency of both 1513A>C and 489C>T polymorphisms in patients with papillary thyroid carcinoma (PTC) and to evaluate the possible association with clinical and histological features. P2X7R analysis was performed in lymphocytes from 121 PTC patients (100 women, 21 men; aged 43.4 +/- 13.6 yr), 100 matched healthy subjects, and 80 patients with nodular goiter. The minor allele frequency for 1513A>C polymorphism in PTC patients with the classical variant was similar to controls (0.21 and 0.20, respectively), whereas it resulted in a significant increase in patients with the follicular variant (0.36; P = 0.01 vs. classical variant, and P = 0.005 vs. controls). In detail, 13.6% of patients with PTC follicular variant were homozygous for the 1513C allele, compared to 2.6% of patients with the classical variant and 2% of controls. Moreover, a positive relationship between 1513A>C polymorphism and either cancer diameter (Rho = 0.22; P = 0.02) or TNM stage (Rho = 0.38; P < 0.001) was found. No significant difference in the genotype frequency of 489C>T polymorphism between PTC patients and healthy controls was observed (0.42 and 0.47, respectively). Our data show, for the first time, a strong association between 1513A>C polymorphism of P2X7R gene and the follicular variant of PTC. Further studies are needed to confirm the possible role of this polymorphism as a novel clinical marker of PTC follicular variant and its usefulness in selecting patients with different clinical outcome.
Canary: an atomic pipeline for clinical amplicon assays.
Doig, Kenneth D; Ellul, Jason; Fellowes, Andrew; Thompson, Ella R; Ryland, Georgina; Blombery, Piers; Papenfuss, Anthony T; Fox, Stephen B
2017-12-15
High throughput sequencing requires bioinformatics pipelines to process large volumes of data into meaningful variants that can be translated into a clinical report. These pipelines often suffer from a number of shortcomings: they lack robustness and have many components written in multiple languages, each with a variety of resource requirements. Pipeline components must be linked together with a workflow system to achieve the processing of FASTQ files through to a VCF file of variants. Crafting these pipelines requires considerable bioinformatics and IT skills beyond the reach of many clinical laboratories. Here we present Canary, a single program that can be run on a laptop, which takes FASTQ files from amplicon assays through to an annotated VCF file ready for clinical analysis. Canary can be installed and run with a single command using Docker containerization or run as a single JAR file on a wide range of platforms. Although it is a single utility, Canary performs all the functions present in more complex and unwieldy pipelines. All variants identified by Canary are 3' shifted and represented in their most parsimonious form to provide a consistent nomenclature, irrespective of sequencing variation. Further, proximate in-phase variants are represented as a single HGVS 'delins' variant. This allows for correct nomenclature and consequences to be ascribed to complex multi-nucleotide polymorphisms (MNPs), which are otherwise difficult to represent and interpret. Variants can also be annotated with hundreds of attributes sourced from MyVariant.info to give up to date details on pathogenicity, population statistics and in-silico predictors. Canary has been used at the Peter MacCallum Cancer Centre in Melbourne for the last 2 years for the processing of clinical sequencing data. By encapsulating clinical features in a single, easily installed executable, Canary makes sequencing more accessible to all pathology laboratories. Canary is available for download as source or a Docker image at https://github.com/PapenfussLab/Canary under a GPL-3.0 License.
Parental origin of sequence variants associated with complex diseases.
Kong, Augustine; Steinthorsdottir, Valgerdur; Masson, Gisli; Thorleifsson, Gudmar; Sulem, Patrick; Besenbacher, Soren; Jonasdottir, Aslaug; Sigurdsson, Asgeir; Kristinsson, Kari Th; Jonasdottir, Adalbjorg; Frigge, Michael L; Gylfason, Arnaldur; Olason, Pall I; Gudjonsson, Sigurjon A; Sverrisson, Sverrir; Stacey, Simon N; Sigurgeirsson, Bardur; Benediktsdottir, Kristrun R; Sigurdsson, Helgi; Jonsson, Thorvaldur; Benediktsson, Rafn; Olafsson, Jon H; Johannsson, Oskar Th; Hreidarsson, Astradur B; Sigurdsson, Gunnar; Ferguson-Smith, Anne C; Gudbjartsson, Daniel F; Thorsteinsdottir, Unnur; Stefansson, Kari
2009-12-17
Effects of susceptibility variants may depend on from which parent they are inherited. Although many associations between sequence variants and human traits have been discovered through genome-wide associations, the impact of parental origin has largely been ignored. Here we show that for 38,167 Icelanders genotyped using single nucleotide polymorphism (SNP) chips, the parental origin of most alleles can be determined. For this we used a combination of genealogy and long-range phasing. We then focused on SNPs that associate with diseases and are within 500 kilobases of known imprinted genes. Seven independent SNP associations were examined. Five-one with breast cancer, one with basal-cell carcinoma and three with type 2 diabetes-have parental-origin-specific associations. These variants are located in two genomic regions, 11p15 and 7q32, each harbouring a cluster of imprinted genes. Furthermore, we observed a novel association between the SNP rs2334499 at 11p15 and type 2 diabetes. Here the allele that confers risk when paternally inherited is protective when maternally transmitted. We identified a differentially methylated CTCF-binding site at 11p15 and demonstrated correlation of rs2334499 with decreased methylation of that site.
htsint: a Python library for sequencing pipelines that combines data through gene set generation.
Richards, Adam J; Herrel, Anthony; Bonneaud, Camille
2015-09-24
Sequencing technologies provide a wealth of details in terms of genes, expression, splice variants, polymorphisms, and other features. A standard for sequencing analysis pipelines is to put genomic or transcriptomic features into a context of known functional information, but the relationships between ontology terms are often ignored. For RNA-Seq, considering genes and their genetic variants at the group level enables a convenient way to both integrate annotation data and detect small coordinated changes between experimental conditions, a known caveat of gene level analyses. We introduce the high throughput data integration tool, htsint, as an extension to the commonly used gene set enrichment frameworks. The central aim of htsint is to compile annotation information from one or more taxa in order to calculate functional distances among all genes in a specified gene space. Spectral clustering is then used to partition the genes, thereby generating functional modules. The gene space can range from a targeted list of genes, like a specific pathway, all the way to an ensemble of genomes. Given a collection of gene sets and a count matrix of transcriptomic features (e.g. expression, polymorphisms), the gene sets produced by htsint can be tested for 'enrichment' or conditional differences using one of a number of commonly available packages. The database and bundled tools to generate functional modules were designed with sequencing pipelines in mind, but the toolkit nature of htsint allows it to also be used in other areas of genomics. The software is freely available as a Python library through GitHub at https://github.com/ajrichards/htsint.
Whole-Genome Sequencing and Variant Analysis of Human Papillomavirus 16 Infections.
van der Weele, Pascal; Meijer, Chris J L M; King, Audrey J
2017-10-01
Human papillomavirus (HPV) is a strongly conserved DNA virus, high-risk types of which can cause cervical cancer in persistent infections. The most common type found in HPV-attributable cancer is HPV16, which can be subdivided into four lineages (A to D) with different carcinogenic properties. Studies have shown HPV16 sequence diversity in different geographical areas, but only limited information is available regarding HPV16 diversity within a population, especially at the whole-genome level. We analyzed HPV16 major variant diversity and conservation in persistent infections and performed a single nucleotide polymorphism (SNP) comparison between persistent and clearing infections. Materials were obtained in the Netherlands from a cohort study with longitudinal follow-up for up to 3 years. Our analysis shows a remarkably large variant diversity in the population. Whole-genome sequences were obtained for 57 persistent and 59 clearing HPV16 infections, resulting in 109 unique variants. Interestingly, persistent infections were completely conserved through time. One reinfection event was identified where the initial and follow-up samples clustered differently. Non-A1/A2 variants seemed to clear preferentially ( P = 0.02). Our analysis shows that population-wide HPV16 sequence diversity is very large. In persistent infections, the HPV16 sequence was fully conserved. Sequencing can identify HPV16 reinfections, although occurrence is rare. SNP comparison identified no strongly acting effect of the viral genome affecting HPV16 infection clearance or persistence in up to 3 years of follow-up. These findings suggest the progression of an early HPV16 infection could be host related. IMPORTANCE Human papillomavirus 16 (HPV16) is the predominant type found in cervical cancer. Progression of initial infection to cervical cancer has been linked to sequence properties; however, knowledge of variants circulating in European populations, especially with longitudinal follow-up, is limited. By sequencing a number of infections with known follow-up for up to 3 years, we gained initial insights into the genetic diversity of HPV16 and the effects of the viral genome on the persistence of infections. A SNP comparison between sequences obtained from clearing and persistent infections did not identify strongly acting DNA variations responsible for these infection outcomes. In addition, we identified an HPV16 reinfection event where sequencing of initial and follow-up samples showed different HPV16 variants. Based on conventional genotyping, this infection would incorrectly be considered a persistent HPV16 infection. In the context of vaccine efficacy and monitoring studies, such infections could potentially cause reduced reported efficacy or efficiency. Copyright © 2017 van der Weele et al.
Podralska, Marta; Ziółkowska-Suchanek, Iwona; Żurawek, Magdalena; Dzikiewicz-Krawczyk, Agnieszka; Słomski, Ryszard; Nowak, Jerzy; Stembalska, Agnieszka; Pesz, Karolina; Mosor, Maria
2018-04-20
DNA damage repair is a complex process, which can trigger the development of cancer if disturbed. In this study, we hypothesize a role of variants in the ATM, H2AFX and MRE11 genes in determining breast cancer (BC) susceptibility. We examined the whole sequence of the ATM kinase domain and estimated the frequency of founder mutations in the ATM gene (c.5932G > T, c.6095G > A, and c.7630-2A > C) and single nucleotide polymorphisms (SNPs) in H2AFX (rs643788, rs8551, rs7759, and rs2509049) and MRE11 (rs1061956 and rs2155209) among 315 breast cancer patients and 515 controls. The analysis was performed using high-resolution melting for new variants and the polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) method for recurrent ATM mutations. H2AFX and MRE11 polymorphisms were analyzed using TaqMan assays. The cumulative genetic risk scores (CGRS) were calculated using unweighted and weighted approaches. We identified four mutations (c.6067G > A, c.8314G > A, c.8187A > T, and c.6095G > A) in the ATM gene in three BC cases and two control subjects. We observed a statistically significant association of H2AFX variants with BC. Risk alleles (the G of rs7759 and the T of rs8551 and rs2509049) were observed more frequently in BC cases compared to the control group, with P values, odds ratios (OR) and 95% confidence intervals (CIs) of 0.0018, 1.47 (1.19 to 1.82); 0.018, 1.33 (1.09 to 1.64); and 0.024, 1.3 (1.06 to 1.59), respectively. Haplotype-based tests identified a significant association of the H2AFX CACT haplotype with BC (P < 0.0001, OR = 27.29, 95% CI 3.56 to 209.5). The risk of BC increased with the growing number of risk alleles. The OR (95% CI) for carriers of ≥ four risk alleles was 1.71 (1.11 to 2.62) for the CGRS. This study confirms that H2AFX variants are associated with an increased risk of BC. The above-reported sequence variants of MRE11 genes may not constitute a risk factor of breast cancer in the Polish population. The contribution of mutations detected in the ATM gene to the development of breast cancer needs further detailed study.
Lorenzetti, Mario Alejandro; Gutiérrez, Marina Inés; Altcheh, Jaime; Moscatelli, Guillermo; Moroni, Samanta; Chabay, Paola Andrea; Preciado, María Victoria
2009-11-01
Epstein-Barr virus genotypes can be distinguished by polymorphic variations in the genes encoding EBNA2, 3A, 3B, and 3C. The immediate early gene BZLF1 plays a key role in modulating the switch from latency to lytic replication and therefore enabling viral propagation. The aim of this study was to investigate and compare BZLF1 promoter sequence (Zp) variation in pediatric infectious mononucleosis (IM) and in pediatric EBV positive lymphoma biopsies. Zp was sequenced from peripheral blood mononuclear cells (PBMC) and throat swabs from 10 patients with IM at the time of diagnosis (D0) and during convalescence; and from 13 lymphoma biopsies. Zp - P and Zp - V3 variants were found in eight and one IM patients, as well as in five and six tumor biopsies, respectively. A correlation between viral genotype and Zp variant was found significant for Zp - V3 and EBV2 (P = 0.0002). One IM patient harbored two concomitant Zp variants. Regardless of anatomical compartment or stage of disease all IM patients displayed the same Zp variant along the course of the study. No new infections or adaptative selection of different variants was evidenced. A new Zp variant (Zp - V3 + 49) was described in two Hodgkin lymphomas, but not in IM. This is the first study to describe Zp variants compartmentalization in children with acute EBV infection and convalescence in a developing country; and comparing them with Zp variants in pediatric lymphomas from the same geographic area.
Dubé, Marie-Pier; Castonguay, Yves; Cloutier, Jean; Michaud, Josée; Bertrand, Annick
2013-03-01
Dehydrin defines a complex family of intrinsically disordered proteins with potential adaptive value with regard to freeze-induced cell dehydration. Search within an expressed sequence tags library from cDNAs of cold-acclimated crowns of alfalfa (Medicago sativa spp. sativa L.) identified transcripts putatively encoding K(3)-type dehydrins. Analysis of full-length coding sequences unveiled two highly homologous sequence variants, K(3)-A and K(3)-B. An increase in the frequency of genotypes yielding positive genomic amplification of the K(3)-dehydrin variants in response to selection for superior tolerance to freezing and the induction of their expression at low temperature strongly support a link with cold adaptation. The presence of multiple allelic forms within single genotypes and independent segregation indicate that the two K(3) dehydrin variants are encoded by distinct genes located at unlinked loci. The co-inheritance of the K(3)-A dehydrin with a Y(2)K(4) dehydrin restriction fragment length polymorphism with a demonstrated impact on freezing tolerance suggests the presence of a genome domain where these functionally related genes are located. These results provide additional evidence that dehydrin play important roles with regard to tolerance to subfreezing temperatures. They also underscore the value of recurrent selection to help identify variants within a large multigene family in allopolyploid species like alfalfa.
Unterseer, Sandra; Bauer, Eva; Haberer, Georg; Seidel, Michael; Knaak, Carsten; Ouzunova, Milena; Meitinger, Thomas; Strom, Tim M; Fries, Ruedi; Pausch, Hubert; Bertani, Christofer; Davassi, Alessandro; Mayer, Klaus Fx; Schön, Chris-Carolin
2014-09-29
High density genotyping data are indispensable for genomic analyses of complex traits in animal and crop species. Maize is one of the most important crop plants worldwide, however a high density SNP genotyping array for analysis of its large and highly dynamic genome was not available so far. We developed a high density maize SNP array composed of 616,201 variants (SNPs and small indels). Initially, 57 M variants were discovered by sequencing 30 representative temperate maize lines and then stringently filtered for sequence quality scores and predicted conversion performance on the array resulting in the selection of 1.2 M polymorphic variants assayed on two screening arrays. To identify high-confidence variants, 285 DNA samples from a broad genetic diversity panel of worldwide maize lines including the samples used for sequencing, important founder lines for European maize breeding, hybrids, and proprietary samples with European, US, semi-tropical, and tropical origin were used for experimental validation. We selected 616 k variants according to their performance during validation, support of genotype calls through sequencing data, and physical distribution for further analysis and for the design of the commercially available Affymetrix® Axiom® Maize Genotyping Array. This array is composed of 609,442 SNPs and 6,759 indels. Among these are 116,224 variants in coding regions and 45,655 SNPs of the Illumina® MaizeSNP50 BeadChip for study comparison. In a subset of 45,974 variants, apart from the target SNP additional off-target variants are detected, which show only a minor bias towards intermediate allele frequencies. We performed principal coordinate and admixture analyses to determine the ability of the array to detect and resolve population structure and investigated the extent of LD within a worldwide validation panel. The high density Affymetrix® Axiom® Maize Genotyping Array is optimized for European and American temperate maize and was developed based on a diverse sample panel by applying stringent quality filter criteria to ensure its suitability for a broad range of applications. With 600 k variants it is the largest currently publically available genotyping array in crop species.
Natarajan, Chandrasekhar; Hoffmann, Federico G.; Lanier, Hayley C.; Wolf, Cole J.; Cheviron, Zachary A.; Spangler, Matthew L.; Weber, Roy E.; Fago, Angela; Storz, Jay F.
2015-01-01
Major challenges for illuminating the genetic basis of phenotypic evolution are to identify causative mutations, to quantify their functional effects, to trace their origins as new or preexisting variants, and to assess the manner in which segregating variation is transduced into species differences. Here, we report an experimental analysis of genetic variation in hemoglobin (Hb) function within and among species of Peromyscus mice that are native to different elevations. A multilocus survey of sequence variation in the duplicated HBA and HBB genes in Peromyscus maniculatus revealed that function-altering amino acid variants are widely shared among geographically disparate populations from different elevations, and numerous amino acid polymorphisms are also shared with closely related species. Variation in Hb-O2 affinity within and among populations of P. maniculatus is attributable to numerous amino acid mutations that have individually small effects. One especially surprising feature of the Hb polymorphism in P. maniculatus is that an appreciable fraction of functional standing variation in the two transcriptionally active HBA paralogs is attributable to recurrent gene conversion from a tandemly linked HBA pseudogene. Moreover, transpecific polymorphism in the duplicated HBA genes is not solely attributable to incomplete lineage sorting or introgressive hybridization; instead, it is mainly attributable to recurrent interparalog gene conversion that has occurred independently in different species. Partly as a result of concerted evolution between tandemly duplicated globin genes, the same amino acid changes that contribute to variation in Hb function within P. maniculatus also contribute to divergence in Hb function among different species of Peromyscus. In the case of function-altering Hb mutations in Peromyscus, there is no qualitative or quantitative distinction between segregating variants within species and fixed differences between species. PMID:25556236
Fernández-Real, J M; Corella, D; Goumidi, L; Mercader, J M; Valdés, S; Rojo Martínez, G; Ortega, F; Martinez-Larrad, M-T; Gómez-Zumaquero, J M; Salas-Salvadó, J; Martinez González, M A; Covas, M I; Botas, P; Delgado, E; Cottel, D; Ferrieres, J; Amouyel, P; Ricart, W; Ros, E; Meirhaeghe, A; Serrano-Rios, M; Soriguer, F; Estruch, R
2013-11-01
Thyroid hormone receptor-beta resistance has been associated with metabolic traits. THRA gene sequencing of an obese woman (index case) who presented as empirical thyroid hormone receptor-α (THRA) resistance, disclosed a polymorphism (rs12939700) in a critical region involved in TRα alternative processing. THRA gene variants were evaluated in three independent europid populations (i) in two population cohorts at baseline (n=3417 and n=2265), 6 years later (n=2139) and (ii) in 4734 high cardiovascular risk subjects (HCVR, PREDIMED trial). The minor allele of the index case polymorphism (rs12939700), despite having a very low frequency (4%), was significantly associated with higher body mass index (BMI) (P=0.042) in HCVR subjects. A more frequent THRA polymorphism (rs1568400) was associated with higher BMI in subjects from the population (P=0.00008 and P=0.05) after adjusting for several confounders. Rs1568400 was also strongly associated with fasting triglycerides (P dominant=3.99 × 10(-5)). In the same sample, 6 years later, age and sex-adjusted risk of developing obesity was significantly increased in GG homozygotes (odds ratio 2.93 (95% confidence interval, 1.05-6.95)). In contrast, no association between rs1568400 and BMI was observed in HCVR subjects, in whom obesity was highly prevalent. This might be explained by the presence of an interaction (P <0.001) among the rs1568400 variant, BMI and saturated fat intake. Only when saturated fat intake was high (>24.5 g d(-1)), GG carriers showed a significantly higher BMI than A carriers after controlling for energy intake and physical activity. THRA gene polymorphisms are associated with obesity development. This is a novel observation linking the THRA locus to metabolic phenotypes.
Altools: a user friendly NGS data analyser.
Camiolo, Salvatore; Sablok, Gaurav; Porceddu, Andrea
2016-02-17
Genotyping by re-sequencing has become a standard approach to estimate single nucleotide polymorphism (SNP) diversity, haplotype structure and the biodiversity and has been defined as an efficient approach to address geographical population genomics of several model species. To access core SNPs and insertion/deletion polymorphisms (indels), and to infer the phyletic patterns of speciation, most such approaches map short reads to the reference genome. Variant calling is important to establish patterns of genome-wide association studies (GWAS) for quantitative trait loci (QTLs), and to determine the population and haplotype structure based on SNPs, thus allowing content-dependent trait and evolutionary analysis. Several tools have been developed to investigate such polymorphisms as well as more complex genomic rearrangements such as copy number variations, presence/absence variations and large deletions. The programs available for this purpose have different strengths (e.g. accuracy, sensitivity and specificity) and weaknesses (e.g. low computation speed, complex installation procedure and absence of a user-friendly interface). Here we introduce Altools, a software package that is easy to install and use, which allows the precise detection of polymorphisms and structural variations. Altools uses the BWA/SAMtools/VarScan pipeline to call SNPs and indels, and the dnaCopy algorithm to achieve genome segmentation according to local coverage differences in order to identify copy number variations. It also uses insert size information from the alignment of paired-end reads and detects potential large deletions. A double mapping approach (BWA/BLASTn) identifies precise breakpoints while ensuring rapid elaboration. Finally, Altools implements several processes that yield deeper insight into the genes affected by the detected polymorphisms. Altools was used to analyse both simulated and real next-generation sequencing (NGS) data and performed satisfactorily in terms of positive predictive values, sensitivity, the identification of large deletion breakpoints and copy number detection. Altools is fast, reliable and easy to use for the mining of NGS data. The software package also attempts to link identified polymorphisms and structural variants to their biological functions thus providing more valuable information than similar tools.
Human immunoglobulin allotypes
Lefranc, Marie-Paule
2009-01-01
More than twenty recombinant monoclonal antibodies are approved as therapeutics. Almost all of these are based on the whole IgG isotype format, but vary in the origin of the variable regions between mouse (chimeric), humanized mouse and fully human sequences; all of those with whole IgG format employ human constant region sequences. Currently, the opposing merits of the four IgG subclasses are considered with respect to the in vivo biological activities considered to be appropriate to the disease indication being treated. Human heavy chain genes also exhibit extensive structural polymorphism(s) and, being closely linked, are inherited as a haplotype. Polymorphisms (allotypes) within the IgG isotype were originally discovered and described using serological reagents derived from humans; demonstrating that allotypic variants can be immunogenic and provoke antibody responses as a result of allo-immunization. The serologically defined allotypes differ widely within and between population groups; therefore, a mAb of a given allotype will, inevitably, be delivered to a cohort of patients homozygous for the alternative allotype. This publication reviews the serologically defined human IgG allotypes and considers the potential for allotype differences to contribute to or potentiate immunogenicity. PMID:20073133
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fennelly, J.; Laval, S.; Wright, E.
1996-04-01
We have identified a genomic locus (DXYH1) that is polymorphic and hypervariable within the CBA/H colony. Using a panel of C57BL/6 x Mus spretus backcross offspring, it was mapped to the distal end of the X chromosome. Pseudoautosomal inheritance was demonstrated through three generations of CBA/H x CBA/H and CBA/H x C57BL/6 crosses and confirmed through linkage to the Sxr locus in X/Y Sxr x 3H1 crosses. Meiotic recombination frequencies place DXYH1 {approximately}28% into the pseudoautosomal region from the boundary. The de novo generation of CBA/H variant DXYH1 restriction fragment length polymorphisms during spermatogenesis is suggestive of the germline instabilitymore » associated with hypermutable human minisatellites. The absence of DXY1-related sequences in Mus spretus provides DNA sequence evidence to support the observed failure of X-Y pairing during meiosis and consequent hybrid infertility in C57BL/6 x Mus spretus male F1 offspring. 19 refs., 4 figs.« less
Dai, Ronghua; Fang, Yu; Zhao, Wenjing; Liu, Shuyun; Ding, Jinmei; Xu, Ke; Yang, Lingyu; He, Chuan; Ding, Fangmei; Meng, He
2016-08-01
The study reported in this Regional Research Communication aimed to analyse the genetic polymorphisms of β-casein in Chinese Holstein cows. β-casein has received considerable research interest in the dairy industry and animal breeding in recent years as a source not only of high quality protein, but also of bioactive peptides that may be linked to health effects. Morever, the polymorphic nature of β-casein and its association with milk production traits, composition, and quality also attracted several efforts in evaluating the allelic distribution of β-casein locus as a potential dairy trait marker. However, few data on beta-casein variants are available for the Chinese Holstein cow. In the present paper, one hundred and thirty three Holstein cows were included in the analysis. Results revealed the presence of 5 variants (A1, A2, A3, B and I), preponderance of the genotype A1A2 (0·353) and superiorities of A1/A2 alleles (0·432 and 0·459, respectively) in the population. Sequence analysis of β-casein gene in the cows showed four nucleotide changes in exon 7. Our study can provide reference and guidance for selection for superior milk for industrial applications and crossbreeding and genetic improvement programmes.
Chen, Qingsong; Lang, Li; Xiao, Bin; Lin, Hansheng; Yang, Aichu; Li, Hongling; Tang, Shichuan; Huang, Hanlin
2016-10-05
To explore whether polymorphic variants of the HTR1B gene are associated with the susceptibility of Raynauds' Phenomenon (RP) coursed by vibration. 148 subjects exposed to vibration for more than 2 years were classified into either induced white finger (VWF) group (n = 72), or non-VWF group (n = 76). Vibration exposure levels were measured and assessed following ISO 5349-1:2001 protocol. All workers were genotyped by sequencing for the single nucleotide polymorphisms (SNPs) in the 5'-flanking and coding region of HTR1B. Genetic characteristics and linkage disequilibrium (LD) were analyzed with Haploview. Serum serotonin levels of each subject were detected using ELISA. The association between the susceptibility of vascular damage and genotype was analyzed via logistic regression. 7 known SNPs were obtained and their allele frequencies were inserted into the Hardy-Weinberg equilibrium. rs6297 variant genotype had an increased risk of VWF compared with wild genotype (OR = 2.14, 95% CI = 1.04- 4.58, P < 0.05). rs6298 mutant type (AG+GG) was found to have a significant interaction on vibration exposure LN(CEI), accounting for VWF occurrence. LN(5-HT) level is significantly different between the VWF group (x¯±s= 1.99±1.09 ng/mL) and the non-VWF group (x¯±s= 2.72±1.47 ng/mL). Serotonin levels may affect the progression of secondary RP. Polymorphic variants of the HTR1B gene are associated with the susceptibility of secondary RP in vibration-exposed occupational populations of Chinese Han people.
Prodosmo, Andrea; Buffone, Amelia; Mattioni, Manlio; Barnabei, Agnese; Persichetti, Agnese; De Leo, Aurora; Appetecchia, Marialuisa; Nicolussi, Arianna; Coppa, Anna; Sciacchitano, Salvatore; Giordano, Carolina; Pinnarò, Paola; Sanguineti, Giuseppe; Strigari, Lidia; Alessandrini, Gabriele; Facciolo, Francesco; Cosimelli, Maurizio; Grazi, Gian Luca; Corrado, Giacomo; Vizza, Enrico; Giannini, Giuseppe; Soddu, Silvia
2016-09-06
Variant ATM heterozygotes have an increased risk of developing cancer, cardiovascular diseases, and diabetes. Costs and time of sequencing and ATM variant complexity make large-scale, general population screenings not cost-effective yet. Recently, we developed a straightforward, rapid, and inexpensive test based on p53 mitotic centrosomal localization (p53-MCL) in peripheral blood mononuclear cells (PBMCs) that diagnoses mutant ATM zygosity and recognizes tumor-associated ATM polymorphisms. Fresh PBMCs from 496 cancer patients were analyzed by p53-MCL: 90 cases with familial BRCA1/2-positive and -negative breast and/or ovarian cancer, 337 with sporadic cancers (ovarian, lung, colon, and post-menopausal breast cancers), and 69 with breast/thyroid cancer. Variants were confirmed by ATM sequencing. A total of seven individuals with ATM variants were identified, 5/65 (7.7 %) in breast cancer cases of familial breast and/or ovarian cancer and 2/69 (2.9 %) in breast/thyroid cancer. No variant ATM carriers were found among the other cancer cases. Excluding a single case in which both BRCA1 and ATM were mutated, no p53-MCL alterations were observed in BRCA1/2-positive cases. These data validate p53-MCL as reliable and specific test for germline ATM variants, confirm ATM as breast cancer susceptibility gene, and highlight a possible association with breast/thyroid cancers.
Cole, Shelley A; Voruganti, V Saroja; Cai, Guowen; Haack, Karin; Kent, Jack W; Blangero, John; Comuzzie, Anthony G; McPherson, John D; Gibbs, Richard A
2010-01-01
Background: Melanocortin-4-receptor (MC4R) haploinsufficiency is the most common form of monogenic obesity; however, the frequency of MC4R variants and their functional effects in general populations remain uncertain. Objective: The aim was to identify and characterize the effects of MC4R variants in Hispanic children. Design: MC4R was resequenced in 376 parents, and the identified single nucleotide polymorphisms (SNPs) were genotyped in 613 parents and 1016 children from the Viva la Familia cohort. Measured genotype analysis (MGA) tested associations between SNPs and phenotypes. Bayesian quantitative trait nucleotide (BQTN) analysis was used to infer the most likely functional polymorphisms influencing obesity-related traits. Results: Seven rare SNPs in coding and 18 SNPs in flanking regions of MC4R were identified. MGA showed suggestive associations between MC4R variants and body size, adiposity, glucose, insulin, leptin, ghrelin, energy expenditure, physical activity, and food intake. BQTN analysis identified SNP 1704 in a predicted micro-RNA target sequence in the downstream flanking region of MC4R as a strong, probable functional variant influencing total, sedentary, and moderate activities with posterior probabilities of 1.0. SNP 2132 was identified as a variant with a high probability (1.0) of exerting a functional effect on total energy expenditure and sleeping metabolic rate. SNP rs34114122 was selected as having likely functional effects on the appetite hormone ghrelin, with a posterior probability of 0.81. Conclusion: This comprehensive investigation provides strong evidence that MC4R genetic variants are likely to play a functional role in the regulation of weight, not only through energy intake but through energy expenditure. PMID:19889825
Liguori, Renato; Quaranta, Sandro; Di Fiore, Rosanna; Elce, Ausilia; Castaldo, Giuseppe; Amato, Felice
2014-12-01
Plasminogen activator inhibitor-1 (PAI-1) is the major physiological inhibitor of tissue-type plasminogen activator in plasma and the most important regulator of the fibrinolytic pathway. The 4G/5G polymorphism (rs1799889) in the PAI-1 promoter is associated with altered PAI-1 transcription. We have identified a new 4G/5G allele, in which a T is inserted near the 4G tract or replaces a G in the 5G tract, forming a T plus 4G (T4G) region. This new variant was first identified in two women, one had experienced juvenile myocardial infarction, the other repeated miscarriage; both had increased PAI-1 plasma activity. In view of the important influence of this promoter region on PAI-1 protein plasma level, we performed in vitro evaluation of the effects of the T4G variant on the transcription activity of the PAI-1 gene promoter. In silico prediction analysis showed that presence of the T4G allele disrupts the E-Box region upstream of the T4G variant, altering the affinity of the target sequence for E-Box binding factors like upstream stimulatory factor-1 (USF-1). Basal T4G promoter activity was 50% higher compared to 4G and 5G variants, but it was less stimulated by USF-1 overexpression. We also analyzed the effects of IL-1β and IL-6 on the PAI-1 promoter activity of our three constructs and showed that the T4G variant was less affected by IL-1β than the other variants. These findings indicate that the T4G variant may be a novel risk factor for thrombotic events. Copyright © 2014 Elsevier Ltd. All rights reserved.
Marrón-Liñares, Grecia M; Núñez, Lucía; Crespo-Leiro, María G; Álvarez-López, Eloy; Barge-Caballero, Eduardo; Barge-Caballero, Gonzalo; Couto-Mallón, David; Pradas-Irun, Concepción; Muñiz, Javier; Tan, Carmela; Rodríguez, E Rene; Vázquez-Rodríguez, José Manuel; Hermida-Prieto, Manuel
2018-04-25
Heart transplantation (HT) is a well-established lifesaving treatment for endstage cardiac failure. Antibody-mediated rejection (AMR) represents one of the main problems after HT because of its diagnostic complexity and the poor evidence for supporting treatments. Complement cascade and B-cells play a key role in AMR and contribute to graft damage. This study explored the importance of variants in genes related to complement pathway and B-cell biology in HT and AMR in donors and in donor-recipient pairs.Methods and Results:Genetic variants in 112 genes (51 complement and 61 B-cell biology genes) were analyzed on next-generation sequencing in 28 donor-recipient pairs, 14 recipients with and 14 recipients without AMR. Statistical analysis was performed with SNPStats, R, and EPIDAT3.1. We identified one single nucleotide polymorphism (SNP) in donors in genes related to B-cell biology,interleukin-4 receptor subunitα (p.Ile75Val-IL4Rα), which correlated with the development of AMR. Moreover, in the analysis of recipient-donor genotype discrepancies, we identified another SNP, in this case inadenosine deaminase(ADA; p.Val178(p=)), which was related to B-cell biology, associated with the absence of AMR. Donor polymorphisms and recipient-donor discrepancies in genes related to the biology of B-cells, could have an important role in the development of AMR. In contrast, no variants in donor or in donor-recipient pairs in complement pathways seem to have an impact on AMR.
Kühne, Annett; Kaiser, Rolf; Schirmer, Markus; Heider, Ulrike; Muhlke, Sabine; Niere, Wiebke; Overbeck, Tobias; Hohloch, Karin; Trümper, Lorenz; Sezer, Orhan; Brockmöller, Jürgen
2007-07-01
Melphalan is widely used in the treatment of multiple myeloma. Pharmacokinetics of this alkylating drug shows high inter-individual variability. As melphalan is a phenylalanine derivative, the pharmacokinetic variability may be determined by genetic polymorphisms in the L-type amino acid transporters LAT1 (SLC7A5) and LAT2 (SLC7A8). Pharmacokinetics were analysed in 64 patients after first administration of intravenous melphalan. Severity of side effects was documented according to WHO criteria. Genomic DNA was analysed for polymorphisms in LAT1 and LAT2 by sequencing of the entire coding region, intron-exon boundaries and 2 kb upstream promoter region. Selected polymorphisms in the common heavy chain of both transporters, the protein 4F2hc (SLC3A2), were analysed by single nucleotide primer extension. Melphalan pharmacokinetics was highly variable with up to 6.2-fold differences in total clearance. A total of 44 polymorphisms were identified in LAT1 and 21 polymorphisms in LAT2. From all variants, only five were in the coding region and only one heterozygous non-synonymous polymorphism (Ala94Thr) was found in LAT2. Numerous polymorphisms were found in the LAT1 and LAT2 5'-flanking regions but did not correlate with expression of the respective genes. No significant correlations could be observed between the polymorphisms in 4F2hc, LAT1, and LAT2 with melphalan pharmacokinetics or with melphalan side effects. The study confirmed that these transporter genes are highly conserved, particularly in the coding sequences. Genetic variation in 4F2hc, LAT1, and LAT2 does not appear to be a major cause of inter-individual variability in pharmacokinetics and of adverse reactions to melphalan.
Kambe, Yoshikazu; Nakata, Katsushi; Yasuda, Shumpei P; Suzuki, Hitoshi
2012-01-01
We examined pelage color variation in wild populations of black rats (the Rattus rattus species complex) in the Yambaru forest area, northern Okinawa Island, Ryukyu Archipelago, Japan. Our field study revealed that 8.7% (38/438) and 0.2% (4/2500) of rats exhibited two types of coat color: white spotting and melanism, respectively. Using 34 representative animals, the phylogeography of the population was inferred using a nuclear gene marker, i.e., sequences (954 bp) of the melanocortin-1 receptor (Mc1r) gene responsible for the melanistic form in black rats. Four sequences from Okinawa were characterized as R. tanezumi, the Asian strain of black rat. Notably, neither of the phenotypic characters of white spotting or melanism was associated with the Mc1r haplotypes. Analysis of mitochondrial cytochrome b (Cytb) sequences (1140 bp) revealed that four haplotypes recovered from Okinawa clustered with the clade of R. tanezumi and differed by one or more bases from haplotypes at other localities in Japan and Asian countries. Thus, both variants may have arisen in the native rat population of Okinawa without interaction with the lineage of R. rattus, which exhibits a worldwide distribution and displays such coat color variants. The Yambaru population of black rats has thus experienced its own evolutionary history in allopatry for a substantial period of time (e.g., 10,000 years), which has preserved valuable genetic polymorphisms and will be useful for assessing the ecological consequences of genetic variation in natural populations.
Ekhart, Corine; Doodeman, Valerie D; Rodenhuis, Sjoerd; Smits, Paul H M; Beijnen, Jos H; Huitema, Alwin D R
2009-01-01
AIMS Thiotepa is widely used in high-dose chemotherapy. Previous studies have shown relations between exposure and severe organ toxicity. Thiotepa is metabolized by cytochrome P450 and glutathione S-transferase enzymes. Polymorphisms of these enzymes may affect elimination of thiotepa and tepa, its main metabolite. The purpose of this study was to evaluate effects of known allelic variants in CYP2B6, CYP3A4, CYP3A5, GSTA1 and GSTP1 genes on pharmacokinetics of thiotepa and tepa. METHODS White patients (n = 124) received a high-dose regimen consisting of cyclophosphamide, thiotepa and carboplatin as intravenous infusions. Genomic DNA was analysed using polymerase chain reaction and sequencing. Plasma concentrations of thiotepa and tepa were determined using validated GC and LC-MS/MS methods. Relations between allelic variants and elimination pharmacokinetic parameters were evaluated using nonlinear mixed effects modelling (nonmem). RESULTS The polymorphisms CYP2B6 C1459T, CYP3A4*1B, CYP3A5*3, GSTA1 (C-69T, G-52A) and GSTP1 C341T had a significant effect on clearance of thiotepa or tepa. Although significant, most effects were generally not large. Clearance of thiotepa and tepa was predominantly affected by GSTP1 C341T polymorphism, which had a frequency of 9.3%. This polymorphism increased non-inducible thiotepa clearance by 52% [95% confidence interval (CI) 41, 64, P < 0.001] and decreased tepa clearance by 32% (95% CI 29, 35, P < 0.001) in heterozygous patients, which resulted in an increase in combined exposure to thiotepa and tepa of 45% in homozygous patients. CONCLUSIONS This study indicates that the presently evaluated variant alleles explain only a small part of the substantial interindividual variability in thiotepa and tepa pharmacokinetics. Patients homozygous for the GSTP1 C341T allele may have enhanced exposure to thiotepa and tepa. PMID:19076156
Couldrey, C; Keehan, M; Johnson, T; Tiplady, K; Winkelman, A; Littlejohn, M D; Scott, A; Kemper, K E; Hayes, B; Davis, S R; Spelman, R J
2017-07-01
Single nucleotide polymorphisms have been the DNA variant of choice for genomic prediction, largely because of the ease of single nucleotide polymorphism genotype collection. In contrast, structural variants (SV), which include copy number variants (CNV), translocations, insertions, and inversions, have eluded easy detection and characterization, particularly in nonhuman species. However, evidence increasingly shows that SV not only contribute a substantial proportion of genetic variation but also have significant influence on phenotypes. Here we present the discovery of CNV in a prominent New Zealand dairy bull using long-read PacBio (Pacific Biosciences, Menlo Park, CA) sequencing technology and the Sniffles SV discovery tool (version 0.0.1; https://github.com/fritzsedlazeck/Sniffles). The CNV identified from long reads were compared with CNV discovered in the same bull from Illumina sequencing using CNVnator (read depth-based tool; Illumina Inc., San Diego, CA) as a means of validation. Subsequently, further validation was undertaken using whole-genome Illumina sequencing of 556 cattle representing the wider New Zealand dairy cattle population. Very limited overlap was observed in CNV discovered from the 2 sequencing platforms, in part because of the differences in size of CNV detected. Only a few CNV were therefore able to be validated using this approach. However, the ability to use CNVnator to genotype the 557 cattle for copy number across all regions identified as putative CNV allowed a genome-wide assessment of transmission level of copy number based on pedigree. The more highly transmissible a putative CNV region was observed to be, the more likely the distribution of copy number was multimodal across the 557 sequenced animals. Furthermore, visual assessment of highly transmissible CNV regions provided evidence supporting the presence of CNV across the sequenced animals. This transmission-based approach was able to confirm a subset of CNV that segregates in the New Zealand dairy cattle population. Genome-wide identification and validation of CNV is an important step toward their inclusion in genomic selection strategies. The Authors. Published by the Federation of Animal Science Societies and Elsevier Inc. on behalf of the American Dairy Science Association®. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/3.0/).
Natural gene expression variation studies in yeast.
Thompson, Dawn A; Cubillos, Francisco A
2017-01-01
The rise of sequence information across different yeast species and strains is driving an increasing number of studies in the emerging field of genomics to associate polymorphic variants, mRNA abundance and phenotypic differences between individuals. Here, we gathered evidence from recent studies covering several layers that define the genotype-phenotype gap, such as mRNA abundance, allele-specific expression and translation efficiency to demonstrate how genetic variants co-evolve and define an individual's genome. Moreover, we exposed several antecedents where inter- and intra-specific studies led to opposite conclusions, probably owing to genetic divergence. Future studies in this area will benefit from the access to a massive array of well-annotated genomes and new sequencing technologies, which will allow the fine breakdown of the complex layers that delineate the genotype-phenotype map. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ozaki, N.; Lappalainen, J.; Linnoila, M.
Serotonin (5-HT){sub ID} receptors are 5-HT release-regulating autoreceptors in the human brain. Abnormalities in brain 5-HT function have been hypothesized in the pathophysiology of various psychiatric disorders, including obsessive-compulsive disorder, autism, mood disorders, eating disorders, impulsive violent behavior, and alcoholism. Thus, mutations occurring in 5-HT autoreceptors may cause or increase the vulnerability to any of these conditions. 5-HT{sub 1D{alpha}} and 5-HT{sub 1D{Beta}} subtypes have been previously localized to chromosomes 1p36.3-p34.3 and 6q13, respectively, using rodent-human hybrids and in situ localization. In this communication, we report the detection of a 5-HT{sub 1D{alpha}} receptor gene polymorphism by single strand conformation polymorphism (SSCP)more » analysis of the coding sequence. The polymorphism was used for fine scale linkage mapping of 5-HT{sub 1D{alpha}} on chromosome 1. This polymorphism should also be useful for linkage studies in populations and in families. Our analysis also demonstrates that functionally significant coding sequence variants of the 5-HT{sub 1D{alpha}} are probably not abundant either among alcoholics or in the general population. 14 refs., 1 fig., 1 tab.« less
McInnis, Opal A; McQuaid, Robyn J; Matheson, Kimberly; Anisman, Hymie
2017-01-01
Two single-nucleotide polymorphisms (SNPs) on oxytocin-related genes, specifically the oxytocin receptor (OXTR) rs53576 and the CD38 rs3796863 variants, have been associated with alterations in prosocial behaviors. A cross-sectional study was conducted among undergraduate students (N = 476) to examine associations between the OXTR and CD38 polymorphisms and unsupportive social interactions and mood states. Results revealed no association between perceived levels of unsupportive social interactions and the OXTR polymorphism. However, A carriers of the CD38 polymorphism, a variant previously associated with elevated oxytocin, reported greater perceived peer unsupportive interactions compared to CC carriers. As expected, perceived unsupportive interactions from peers was associated with greater negative affect, which was moderated by the CD38 polymorphism. Specifically, this relation was stronger among CC carriers of the CD38 polymorphism (a variant thought to be linked to lower oxytocin). When examining whether the OXTR polymorphism moderated the relation between unsupportive social interactions from peers and negative affect there was a trend toward significance, however, this did not withstand multiple testing corrections. These findings are consistent with the perspective that a variant on an oxytocin polymorphism that may be tied to lower oxytocin is related to poor mood outcomes in association with negative social interactions. At the same time, having a genetic constitution presumed to be associated with higher oxytocin was related to increased perceptions of unsupportive social interactions. These seemingly paradoxical findings could be related to previous reports in which variants associated with prosocial behaviors were also tied to relatively more effective coping styles to deal with challenges.
Arlt, Martin F.; Ozdemir, Alev Cagla; Birkeland, Shanda R.; Lyons, Robert H.; Glover, Thomas W.; Wilson, Thomas E.
2011-01-01
Copy-number variants (CNVs) are a major source of genetic variation in human health and disease. Previous studies have implicated replication stress as a causative factor in CNV formation. However, existing data are technically limited in the quality of comparisons that can be made between human CNVs and experimentally induced variants. Here, we used two high-resolution strategies—single nucleotide polymorphism (SNP) arrays and mate-pair sequencing—to compare CNVs that occur constitutionally to those that arise following aphidicolin-induced DNA replication stress in the same human cells. Although the optimized methods provided complementary information, sequencing was more sensitive to small variants and provided superior structural descriptions. The majority of constitutional and all aphidicolin-induced CNVs appear to be formed via homology-independent mechanisms, while aphidicolin-induced CNVs were of a larger median size than constitutional events even when mate-pair data were considered. Aphidicolin thus appears to stimulate formation of CNVs that closely resemble human pathogenic CNVs and the subset of larger nonhomologous constitutional CNVs. PMID:21212237
Prevalence of the prion protein gene E211K variant in U.S. cattle
Heaton, Michael P; Keele, John W; Harhay, Gregory P; Richt, Jürgen A; Koohmaraie, Mohammad; Wheeler, Tommy L; Shackelford, Steven D; Casas, Eduardo; King, D Andy; Sonstegard, Tad S; Van Tassell, Curtis P; Neibergs, Holly L; Chase, Chad C; Kalbfleisch, Theodore S; Smith, Timothy PL; Clawson, Michael L; Laegreid, William W
2008-01-01
Background In 2006, an atypical U.S. case of bovine spongiform encephalopathy (BSE) was discovered in Alabama and later reported to be polymorphic for glutamate (E) and lysine (K) codons at position 211 in the bovine prion protein gene (Prnp) coding sequence. A bovine E211K mutation is important because it is analogous to the most common pathogenic mutation in humans (E200K) which causes hereditary Creutzfeldt – Jakob disease, an autosomal dominant form of prion disease. The present report describes a high-throughput matrix-associated laser desorption/ionization-time-of-flight mass spectrometry assay for scoring the Prnp E211K variant and its use to determine an upper limit for the K211 allele frequency in U.S. cattle. Results The K211 allele was not detected in 6062 cattle, including those from five commercial beef processing plants (3892 carcasses) and 2170 registered cattle from 42 breeds. Multiple nearby polymorphisms in Prnp coding sequence of 1456 diverse purebred cattle (42 breeds) did not interfere with scoring E211 or K211 alleles. Based on these results, the upper bounds for prevalence of the E211K variant was estimated to be extremely low, less than 1 in 2000 cattle (Bayesian analysis based on 95% quantile of the posterior distribution with a uniform prior). Conclusion No groups or breeds of U.S. cattle are presently known to harbor the Prnp K211 allele. Because a carrier was not detected, the number of additional atypical BSE cases with K211 will also be vanishingly low. PMID:18625065
Blumenfeld, Olga O
2002-04-01
Recent advances in molecular biology and technology have provided evidence, at a molecular level, for long-known observations that the human genome is not unique but is characterized by individual sequence variation. At the present time, documentation of genetic variation occurring in a large number of genes is increasing exponentially. The characterization of alleles that encode a variety of blood group antigens has been particularly fruitful for transfusion medicine. Phenotypic variation, as identified by the serologic study of blood group variants, is required to identify the presence of a variant allele. Many of the other alleles currently recorded have been selected and identified on the basis of inherited disease traits. New approaches document single nucleotide polymorphisms that occur throughout the genome and best show how the DNA sequence varies in the human population. The primary data dealing with variant alleles or more general genomic variation are scattered throughout the scientific literature and only within the last few years has information begun to be organized into databases. This article provides guidance on how to access those databases online as a source of information about genetic variation for purposes of molecular, clinical, and diagnostic medicine, research, and teaching. The attributes of the sites are described. A more detailed view of the database dealing specifically with alleles of genes encoding the blood group antigens includes a brief preliminary analysis of the molecular basis for observed polymorphisms. Other online sites that may be particularly useful to the transfusion medicine readership as well as a brief historical account are also presented. Copyright 2002, Elsevier Science (USA). All rights reserved.
The genomic landscape shaped by selection on transposable elements across 18 mouse strains.
Nellåker, Christoffer; Keane, Thomas M; Yalcin, Binnaz; Wong, Kim; Agam, Avigail; Belgard, T Grant; Flint, Jonathan; Adams, David J; Frankel, Wayne N; Ponting, Chris P
2012-06-15
Transposable element (TE)-derived sequence dominates the landscape of mammalian genomes and can modulate gene function by dysregulating transcription and translation. Our current knowledge of TEs in laboratory mouse strains is limited primarily to those present in the C57BL/6J reference genome, with most mouse TEs being drawn from three distinct classes, namely short interspersed nuclear elements (SINEs), long interspersed nuclear elements (LINEs) and the endogenous retrovirus (ERV) superfamily. Despite their high prevalence, the different genomic and gene properties controlling whether TEs are preferentially purged from, or are retained by, genetic drift or positive selection in mammalian genomes remain poorly defined. Using whole genome sequencing data from 13 classical laboratory and 4 wild-derived mouse inbred strains, we developed a comprehensive catalogue of 103,798 polymorphic TE variants. We employ this extensive data set to characterize TE variants across the Mus lineage, and to infer neutral and selective processes that have acted over 2 million years. Our results indicate that the majority of TE variants are introduced though the male germline and that only a minority of TE variants exert detectable changes in gene expression. However, among genes with differential expression across the strains there are twice as many TE variants identified as being putative causal variants as expected. Most TE variants that cause gene expression changes appear to be purged rapidly by purifying selection. Our findings demonstrate that past TE insertions have often been highly deleterious, and help to prioritize TE variants according to their likely contribution to gene expression or phenotype variation.
Jha, Ruchira Menka; Koleck, Theresa A; Puccio, Ava M; Okonkwo, David O; Park, Seo-Young; Zusman, Benjamin E; Clark, Robert S B; Shutter, Lori A; Wallisch, Jessica S; Empey, Philip E; Kochanek, Patrick M; Conley, Yvette P
2018-04-19
ABCC8 encodes sulfonylurea receptor 1, a key regulatory protein of cerebral oedema in many neurological disorders including traumatic brain injury (TBI). Sulfonylurea-receptor-1 inhibition has been promising in ameliorating cerebral oedema in clinical trials. We evaluated whether ABCC8 tag single-nucleotide polymorphisms predicted oedema and outcome in TBI. DNA was extracted from 485 prospectively enrolled patients with severe TBI. 410 were analysed after quality control. ABCC8 tag single-nucleotide polymorphisms (SNPs) were identified (Hapmap, r 2 >0.8, minor-allele frequency >0.20) and sequenced (iPlex-Gold, MassArray). Outcomes included radiographic oedema, intracranial pressure (ICP) and 3-month Glasgow Outcome Scale (GOS) score. Proxy SNPs, spatial modelling, amino acid topology and functional predictions were determined using established software programs. Wild-type rs7105832 and rs2237982 alleles and genotypes were associated with lower average ICP (β=-2.91, p=0.001; β=-2.28, p=0.003) and decreased radiographic oedema (OR 0.42, p=0.012; OR 0.52, p=0.017). Wild-type rs2237982 also increased favourable 3-month GOS (OR 2.45, p=0.006); this was partially mediated by oedema (p=0.03). Different polymorphisms predicted 3-month outcome: variant rs11024286 increased (OR 1.84, p=0.006) and wild-type rs4148622 decreased (OR 0.40, p=0.01) the odds of favourable outcome. Significant tag and concordant proxy SNPs regionally span introns/exons 2-15 of the 39-exon gene. This study identifies four ABCC8 tag SNPs associated with cerebral oedema and/or outcome in TBI, tagging a region including 33 polymorphisms. In polymorphisms predictive of oedema, variant alleles/genotypes confer increased risk. Different variant polymorphisms were associated with favourable outcome, potentially suggesting distinct mechanisms. Significant polymorphisms spatially clustered flanking exons encoding the sulfonylurea receptor site and transmembrane domain 0/loop 0 (juxtaposing the channel pore/binding site). This, if validated, may help build a foundation for developing future strategies that may guide individualised care, treatment response, prognosis and patient selection for clinical trials. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Somatic and germline TP53 alterations in second malignant neoplasms from pediatric cancer survivors
Sherborne, Amy L.; Lavergne, Vincent; Yu, Katharine; Lee, Leah; Davidson, Philip R.; Mazor, Tali; Smirnoff, Ivan; Horvai, Andrew; Loh, Mignon; DuBois, Steven G.; Goldsby, Robert E.; Neglia, Joseph; Hammond, Sue; Robison, Leslie L.; Wustrack, Rosanna; Costello, Joseph; Nakamura, Alice O.; Shannon, Kevin; Bhatia, Smita; Nakamura, Jean L.
2016-01-01
Purpose Second malignant neoplasms (SMNs) are severe late complications that occur in pediatric cancer survivors exposed to radiotherapy and other genotoxic treatments. To characterize the mutational landscape of treatment-induced sarcomas and to identify candidate SMN-predisposing variants we analyzed germline and SMN samples from pediatric cancer survivors. Experimental Design We performed whole exome sequencing (WES) and RNA sequencing on radiation-induced sarcomas arising from two pediatric cancer survivors. To assess the frequency of germline TP53 variants in SMNs, Sanger sequencing was performed to analyze germline TP53 in thirty-seven pediatric cancer survivors from the Childhood Cancer Survivor Study (CCSS) without history of a familial cancer predisposition syndrome but known to have developed SMNs. Results WES revealed TP53 mutations involving p53’s DNA binding domain in both index cases, one of which was also present in the germline. The germline and somatic TP53 mutant variants were enriched in the transcriptomes for both sarcomas. Analysis of TP53 coding exons in germline specimens from the CCSS survivor cohort identified a G215C variant encoding an R72P amino acid substitution in six patients and a synonymous single nucleotide polymorphism A639G in four others, resulting in ten out of 37 evaluable patients (27%) harboring a germline TP53 variant. Conclusions Currently, germline TP53 is not routinely assessed in pediatric cancer patients. These data support the concept that identifying germline TP53 variants at the time a primary cancer is diagnosed may identify patients at high risk for SMN development, who could benefit from modified therapeutic strategies and/or intensive post-treatment monitoring. PMID:27683180
Carbapenem-Resistant Acinetobacter baumannii from Serbia: Revision of CarO Classification
Novovic, Katarina; Mihajlovic, Sanja; Vasiljevic, Zorica; Filipic, Brankica; Begovic, Jelena; Jovcic, Branko
2015-01-01
Carbapenem-resistant A. baumannii present a significant therapeutic challenge for the treatment of nosocomial infections in many European countries. Although it is known that the gradient of A. baumannii prevalence increases from northern to southern Europe, this study provides the first data from Serbia. Twenty-eight carbapenem-resistant A. baumannii clinical isolates were collected at a Serbian pediatric hospital during a 2-year period. The majority of isolates (67.68%) belonged to the sequence type Group 1, European clonal complex II. All isolates harbored intrinsic OXA-51 and AmpC cephalosporinase. OXA-23 was detected in 16 isolates (57.14%), OXA-24 in 23 isolates (82.14%) and OXA-58 in 11 isolates (39.29%). Six of the isolates (21.43%) harbored all of the analyzed oxacillinases, except OXA-143 and OXA-235 that were not detected in this study. Production of oxacillinases was detected in different pulsotypes indicating the presence of horizontal gene transfer. NDM-1, VIM and IMP were not detected in analyzed clinical A. baumannii isolates. ISAba1 insertion sequence was present upstream of OXA-51 in one isolate, upstream of AmpC in 13 isolates and upstream of OXA-23 in 10 isolates. In silico analysis of carO sequences from analyzed A. baumannii isolates revealed the existence of two out of six highly polymorphic CarO variants. The phylogenetic analysis of CarO protein among Acinetobacter species revised the previous classification CarO variants into three groups based on strong bootstraps scores in the tree analysis. Group I comprises four variants (I-IV) while Groups II and III contain only one variant each. One half of the Serbian clinical isolates belong to Group I variant I, while the other half belongs to Group I variant III. PMID:25822626
Reed, K M; Dorschner, M O; Todd, T N; Phillips, R B
1998-09-01
Sequence variation in the control region (D-loop) of the mitochondrial DNA (mtDNA) was examined to assess the genetic distinctiveness of the shortjaw cisco (Coregonus zenithicus). Individuals from within the Great Lakes Basin as well as inland lakes outside the basin were sampled. DNA fragments containing the entire D-loop were amplified by PCR from specimens of C. zenithicus and the related species C. artedi, C. hoyi, C. kiyi, and C. clupeaformis. DNA sequence analysis revealed high similarity within and among species and shared polymorphism for length variants. Based on this analysis, the shortjaw cisco is not genetically distinct from other cisco species.
Fukunaga, Kenji; Ichitani, Katsuyuki; Taura, Satoru; Sato, Muneharu; Kawase, Makoto
2005-02-01
We determined the sequence of ribosomal DNA (rDNA) intergenic spacer (IGS) of foxtail millet isolated in our previous study, and identified subrepeats in the polymorphic region. We also developed a PCR-based method for identifying rDNA types based on sequence information and assessed 153 accessions of foxtail millet. Results were congruent with our previous works. This study provides new findings regarding the geographical distribution of rDNA variants. This new method facilitates analyses of numerous foxtail millet accessions. It is helpful for typing of foxtail millet germplasms and elucidating the evolution of this millet.
Lin, You-Yu; Hsieh, Chia-Hung; Chen, Jiun-Hong; Lu, Xuemei; Kao, Jia-Horng; Chen, Pei-Jer; Chen, Ding-Shinn; Wang, Hurng-Yi
2017-04-26
The accuracy of metagenomic assembly is usually compromised by high levels of polymorphism due to divergent reads from the same genomic region recognized as different loci when sequenced and assembled together. A viral quasispecies is a group of abundant and diversified genetically related viruses found in a single carrier. Current mainstream assembly methods, such as Velvet and SOAPdenovo, were not originally intended for the assembly of such metagenomics data, and therefore demands for new methods to provide accurate and informative assembly results for metagenomic data. In this study, we present a hybrid method for assembling highly polymorphic data combining the partial de novo-reference assembly (PDR) strategy and the BLAST-based assembly pipeline (BBAP). The PDR strategy generates in situ reference sequences through de novo assembly of a randomly extracted partial data set which is subsequently used for the reference assembly for the full data set. BBAP employs a greedy algorithm to assemble polymorphic reads. We used 12 hepatitis B virus quasispecies NGS data sets from a previous study to assess and compare the performance of both PDR and BBAP. Analyses suggest the high polymorphism of a full metagenomic data set leads to fragmentized de novo assembly results, whereas the biased or limited representation of external reference sequences included fewer reads into the assembly with lower assembly accuracy and variation sensitivity. In comparison, the PDR generated in situ reference sequence incorporated more reads into the final PDR assembly of the full metagenomics data set along with greater accuracy and higher variation sensitivity. BBAP assembly results also suggest higher assembly efficiency and accuracy compared to other assembly methods. Additionally, BBAP assembly recovered HBV structural variants that were not observed amongst assembly results of other methods. Together, PDR/BBAP assembly results were significantly better than other compared methods. Both PDR and BBAP independently increased the assembly efficiency and accuracy of highly polymorphic data, and assembly performances were further improved when used together. BBAP also provides nucleotide frequency information. Together, PDR and BBAP provide powerful tools for metagenomic data studies.
HPV16 variants distribution in invasive cancers of the cervix, vulva, vagina, penis, and anus.
Nicolás-Párraga, Sara; Gandini, Carolina; Pimenoff, Ville N; Alemany, Laia; de Sanjosé, Silvia; Xavier Bosch, F; Bravo, Ignacio G
2016-10-01
Human papillomavirus (HPV)16 is the most oncogenic human papillomavirus, responsible for most papillomavirus-induced anogenital cancers. We have explored by sequencing and phylogenetic analysis the viral variant lineages present in 692 HPV16-monoinfected invasive anogenital cancers from Europe, Asia, and Central/South America. We have assessed the contribution of geography and anatomy to the differential prevalence of HPV16 variants and to the nonsynonymous E6 T350G polymorphism. Most (68%) of the variance in the distribution of HPV16 variants was accounted for by the differential abundance of the different viral lineages. The most prevalent variant (above 70% prevalence) in all regions and in all locations was HPV16_A1-3, except in Asia, where HPV16_A4 predominated in anal cancers. The differential prevalence of variants as a function of geographical origin explained 9% of the variance, and the differential prevalence of variants as a function of anatomical location accounted for less than 3% of the variance. Despite containing similar repertoires of HPV16 variants, we confirm the worldwide trend of cervical cancers being diagnosed significantly earlier than other anogenital cancers (early fifties vs. early sixties). Frequencies for alleles in the HPV16 E6 T350G polymorphism were similar across anogenital cancers from the same geographical origin. Interestingly, anogenital cancers from Central/South America displayed higher 350G allele frequencies also within HPV16_A1-3 lineage compared with Europe. Our results demonstrate ample variation in HPV16 variants prevalence in anogenital cancers, which is partly explained by the geographical origin of the sample and only marginally explained by the anatomical location of the lesion, suggesting that tissue specialization is not essential evolutionary forces shaping HPV16 diversity in anogenital cancers. © 2016 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.
Screening of whole genome sequences identified high-impact variants for stallion fertility.
Schrimpf, Rahel; Gottschalk, Maren; Metzger, Julia; Martinsson, Gunilla; Sieme, Harald; Distl, Ottmar
2016-04-14
Stallion fertility is an economically important trait due to the increase of artificial insemination in horses. The availability of whole genome sequence data facilitates identification of rare high-impact variants contributing to stallion fertility. The aim of our study was to genotype rare high-impact variants retrieved from next-generation sequencing (NGS)-data of 11 horses in order to unravel harmful genetic variants in large samples of stallions. Gene ontology (GO) terms and search results from public databases were used to obtain a comprehensive list of human und mice genes predicted to participate in the regulation of male reproduction. The corresponding equine orthologous genes were searched in whole genome sequence data of seven stallions and four mares and filtered for high-impact genetic variants using SnpEFF, SIFT and Polyphen 2 software. All genetic variants with the missing homozygous mutant genotype were genotyped on 337 fertile stallions of 19 breeds using KASP genotyping assays or PCR-RFLP. Mixed linear model analysis was employed for an association analysis with de-regressed estimated breeding values of the paternal component of the pregnancy rate per estrus (EBV-PAT). We screened next generation sequenced data of whole genomes from 11 horses for equine genetic variants in 1194 human and mice genes involved in male fertility and linked through common gene ontology (GO) with male reproductive processes. Variants were filtered for high-impact on protein structure and validated through SIFT and Polyphen 2. Only those genetic variants were followed up when the homozygote mutant genotype was missing in the detection sample comprising 11 horses. After this filtering process, 17 single nucleotide polymorphism (SNPs) were left. These SNPs were genotyped in 337 fertile stallions of 19 breeds using KASP genotyping assays or PCR-RFLP. An association analysis in 216 Hanoverian stallions revealed a significant association of the splice-site disruption variant g.37455302G>A in NOTCH1 with the de-regressed estimated breeding values of the paternal component of the pregnancy rate per estrus (EBV-PAT). For 9 high-impact variants within the genes CFTR, OVGP1, FBXO43, TSSK6, PKD1, FOXP1, TCP11, SPATA31E1 and NOTCH1 (g.37453246G>C) absence of the homozygous mutant genotype in the validation sample of all 337 fertile stallions was obvious. Therefore, these variants were considered as potentially deleterious factors for stallion fertility. In conclusion, this study revealed 17 genetic variants with a predicted high damaging effect on protein structure and missing homozygous mutant genotype. The g.37455302G>A NOTCH1 variant was identified as a significant stallion fertility locus in Hanoverian stallions and further 9 candidate fertility loci with missing homozygous mutant genotypes were validated in a panel including 19 horse breeds. To our knowledge this is the first study in horses using next generation sequencing data to uncover strong candidate factors for stallion fertility.
Parker, Glendon J.; Leppert, Tami; Anex, Deon S.; ...
2016-09-07
Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 singlemore » nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). Furthermore, this study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts.« less
Association between FOXP3 polymorphisms and vitiligo in a Han Chinese population.
Song, P; Wang, X-W; Li, H-X; Li, K; Liu, L; Wei, C; Jian, Z; Yi, X-L; Li, Q; Wang, G; Li, C-Y; Gao, T-W
2013-09-01
Vitiligo is an autoimmune chronic depigmentation disorder caused by melanocyte loss. Previous studies found that CD4(+)CD25(+) regulatory T-cell (Treg) dysfunction was involved in the pathogenesis of vitiligo and that gene polymorphisms in forkhead box P3 (FOXP3) - a master regulator of Treg development and function - were associated with susceptibility to some autoimmune disorders. Therefore, we hypothesized that functional polymorphisms of the FOXP3 gene might be associated with vitiligo via dysregulation of Treg cells. To evaluate whether FOXP3 polymorphisms are associated with vitiligo risk. In this hospital-based case-control study of 682 patients with vitiligo and 682 vitiligo-free age- and sex-matched controls, we genotyped three single nucleotide polymorphisms (SNPs) of the FOXP3 gene - rs2232365, rs3761548 and rs5902434 - by performing polymerase chain reaction with sequence-specific primers (PCR-SSP). Significantly increased vitiligo risk was associated with the rs2232365 GG [odds ratio (OR) 1·68, 95% confidence interval (CI) 1·17-2·39, P = 0·004] and rs3761548 AA (OR 1·82, 95% CI 1·10-3·01, P = 0·033) genotypes compared with the rs2232365 AA and rs3761548 CC genotypes. On combined analysis of these three variant alleles, we found that individuals carrying 2-6 variant alleles had significantly increased vitiligo risk (OR 1·34, 95% CI 1·08-1·66). This risk was more pronounced in the following subgroups: age > 20 years, male sex, active vitiligo, nonsegmental vitiligo and other accompanying autoimmune diseases. FOXP3 gene polymorphisms contributed to vitiligo risk in a Han Chinese population. © 2013 The Authors BJD © 2013 British Association of Dermatologists.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Parker, Glendon J.; Leppert, Tami; Anex, Deon S.
Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 singlemore » nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). Furthermore, this study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts.« less
Costa, Valerio; Federico, Antonio; Pollastro, Carla; Ziviello, Carmela; Cataldi, Simona; Formisano, Pietro; Ciccodicola, Alfredo
2016-01-01
Type 2 diabetes (T2D) is one of the most frequent mortality causes in western countries, with rapidly increasing prevalence. Anti-diabetic drugs are the first therapeutic approach, although many patients develop drug resistance. Most drug responsiveness variability can be explained by genetic causes. Inter-individual variability is principally due to single nucleotide polymorphisms, and differential drug responsiveness has been correlated to alteration in genes involved in drug metabolism (CYP2C9) or insulin signaling (IRS1, ABCC8, KCNJ11 and PPARG). However, most genome-wide association studies did not provide clues about the contribution of DNA variations to impaired drug responsiveness. Thus, characterizing T2D drug responsiveness variants is needed to guide clinicians toward tailored therapeutic approaches. Here, we extensively investigated polymorphisms associated with altered drug response in T2D, predicting their effects in silico. Combining different computational approaches, we focused on the expression pattern of genes correlated to drug resistance and inferred evolutionary conservation of polymorphic residues, computationally predicting the biochemical properties of polymorphic proteins. Using RNA-Sequencing followed by targeted validation, we identified and experimentally confirmed that two nucleotide variations in the CAPN10 gene—currently annotated as intronic—fall within two new transcripts in this locus. Additionally, we found that a Single Nucleotide Polymorphism (SNP), currently reported as intergenic, maps to the intron of a new transcript, harboring CAPN10 and GPR35 genes, which undergoes non-sense mediated decay. Finally, we analyzed variants that fall into non-coding regulatory regions of yet underestimated functional significance, predicting that some of them can potentially affect gene expression and/or post-transcriptional regulation of mRNAs affecting the splicing. PMID:27347941
Peeters, H; Vander, C; Laukens, D; Coucke, P; Marichal, D; Van Den Berghe, M; Cuvelier, C; Remaut, E; Mielants, H; De Keyser, F; Vos, M
2004-01-01
Background: Sacroiliitis is a common extraintestinal manifestation of Crohn's disease but its association with the HLA-B27 phenotype is less evident. Polymorphisms in the CARD15 gene have been linked to higher susceptibility for Crohn's disease. In particular, associations have been found with ileal and fibrostenosing disease, young age at onset of disease, and familial cases. Objectives: To investigate whether the presence of sacroiliitis in patients with Crohn's disease is linked to the carriage of CARD15 polymorphisms. Methods: 102 consecutive patients with Crohn's disease were clinically evaluated by a rheumatologist. Radiographs of the sacroiliac joints were taken and assessed blindly by two investigators. The RFLP-PCR technique was used to genotype all patients for three single nucleotide polymorphisms (SNP) in the CARD15 gene. Every SNP was verified by direct sequencing. The HLA-B27 phenotype was determined. Results: Radiological evidence of sacroiliitis with or without ankylosing spondylitis was found in 23 patients (23%), of whom only three were HLA-B27 positive. In contrast, 78% of patients with sacroiliitis carried a CARD15 variant v 48% of those without sacroiliitis (p = 0.01; odds ratio 3.8 (95% confidence interval, 1.3 to 11.5)). Multivariate analysis (logistic regression) showed that the association between sacroiliitis and CARD15 polymorphisms was independent of other CARD15 related phenotypes (ileal and fibrostenosing disease, young age at onset of disease, familial Crohn's disease) (p = 0.039). Conclusions: CARD15 variants were identified as genetic predictors of Crohn's disease related sacroiliitis. An association was demonstrated between these polymorphisms and an extraintestinal manifestation of Crohn's disease. PMID:15308523
DOE Office of Scientific and Technical Information (OSTI.GOV)
Griffon, N.; Pilon, C.; Martres, M.P.
1996-02-16
DNA fragments from a genomic library were used to establish the partial structure of the human dopamine D{sub 3} receptor gene (DRD3). Its coding sequence contains 6 exons and stretches over 40,000 base pairs. The complete DRD3 transcript and three shorter variants, in which the second and/or third exon are deleted, were detected in similar proportions in brains from four controls and three psychiatric patients. The Msp I polymorphism was localized in the fifth intron of the gene, 40,000 base pairs downstream the Bal I polymorphism and a PCR-based method was developed for genotyping this polymorphism. The distributions of themore » Msp I and Bal I genotypes were not independent in 297 individuals ({chi}{sup 2} = 10.5, df = 4, P = 0.03), but only a weak association was found between allele 1 of the Bal I polymorphism and allele 2 of the Msp I polymorphism ({chi}{sup 2} = 3.99, df = 1, P = 0.04). The previously reported association between homozygosity at both alleles of the Bal I polymorphism and schizophrenia was presently maintained in an extended sample, comprising 119 DSM-III-R chronic schizophrenics and 85 controls ({chi}{sup 2}= 5.3, df = 1, P = 0.02) and found more important in males than in females. The presence of the Bal I allele 2 is associated with an early age at onset, particularly in males (df = 35, t value = 2.6, P = 0.014). In the same sample, allelic frequencies, genotype counts, and proportion of homozygotes for the Msp I polymorphism did not differ between schizophrenics and controls ({chi}{sup 2}= 0.06, df = 1, P = 0.80, {chi}{sup 2} = 0.22, df = 1, P = 0.90 and {chi}{sup 2} = 0.16, df = 1, P = 0.69, respectively). The large distance of the Msp I polymorphism from the Bal I polymorphism and its localization in the 3{prime} part of the gene may explain the discrepant results obtained with the two polymorphisms. 36 refs., 2 figs., 4 tabs.« less
Association of ghrelin Leu72Met polymorphism with type 2 diabetes mellitus in Chinese population.
Liu, Jing; Liu, Jia; Tian, Li-min; Liu, Ju-xiang; Bing, Ya-jun; Zhang, Ji-ping; Wang, Yun-Fang; Zhang, Lu-yan
2012-08-10
Ghrelin, a novel endogenous ligand for the growth hormone secretagogue receptor, is considered to implicate the development of the type 2 diabetes mellitus (T2DM). The Leu72Met (+408C>A) polymorphism of the preproghrelin, has been linked to obesity, insulin resistance and diabetes. To investigate the distribution of ghrelin gene Leu72Met polymorphism and its association with the type 2 diabetes mellitus in Chinese population. We conducted a case-control study on 877 patients with T2DM and 864 controls, which were genotyped by the polymerase chain reaction (PCR) technique, denaturing high performance liquid chromatography (DHPLC) and DNA sequence analysis. Laboratory analyses were carried out in the hospital laboratory. No significant difference in the Leu72Met genotype distributions and allele frequency was observed between type 2 diabetes mellitus and controls (both P>0.05). The polymorphism was not associated with T2DM. However, among the T2DM group, the patients carrying Leu72Leu genotype had significantly increased levels of FPG and serum creatinine compared with variant genotypes (Leu72Met and Met72Met) (P<0.05). In the control group, the subjects with variant genotypes had significantly increased levels of FINS, HOMA-IR compared with Leu72Leu genotype (P<0.05). The Leu72Met polymorphism of the preproghrelin gene was not associated with T2DM in Chinese population. However, it may have some roles in the etiology of insulin resistance. Copyright © 2012 Elsevier B.V. All rights reserved.
Raimondi, Daniele; Gazzo, Andrea M; Rooman, Marianne; Lenaerts, Tom; Vranken, Wim F
2016-06-15
There are now many predictors capable of identifying the likely phenotypic effects of single nucleotide variants (SNVs) or short in-frame Insertions or Deletions (INDELs) on the increasing amount of genome sequence data. Most of these predictors focus on SNVs and use a combination of features related to sequence conservation, biophysical, and/or structural properties to link the observed variant to either neutral or disease phenotype. Despite notable successes, the mapping between genetic variants and their phenotypic effects is riddled with levels of complexity that are not yet fully understood and that are often not taken into account in the predictions, despite their promise of significantly improving the prediction of deleterious mutants. We present DEOGEN, a novel variant effect predictor that can handle both missense SNVs and in-frame INDELs. By integrating information from different biological scales and mimicking the complex mixture of effects that lead from the variant to the phenotype, we obtain significant improvements in the variant-effect prediction results. Next to the typical variant-oriented features based on the evolutionary conservation of the mutated positions, we added a collection of protein-oriented features that are based on functional aspects of the gene affected. We cross-validated DEOGEN on 36 825 polymorphisms, 20 821 deleterious SNVs, and 1038 INDELs from SwissProt. The multilevel contextualization of each (variant, protein) pair in DEOGEN provides a 10% improvement of MCC with respect to current state-of-the-art tools. The software and the data presented here is publicly available at http://ibsquare.be/deogen : wvranken@vub.ac.be Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Furniss, Dominic; Lettice, Laura A.; Taylor, Indira B.; Critchley, Paul S.; Giele, Henk; Hill, Robert E.; Wilkie, Andrew O.M.
2008-01-01
A locus for triphalangeal thumb, variably associated with pre-axial polydactyly, was previously identified in the zone of polarizing activity regulatory sequence (ZRS), a long range limb-specific enhancer of the Sonic Hedgehog (SHH) gene at human chromosome 7q36.3. Here, we demonstrate that a 295T>C variant in the human ZRS, previously thought to represent a neutral polymorphism, acts as a dominant allele with reduced penetrance. We found this variant in three independently ascertained probands from southern England with triphalangeal thumb, demonstrated significant linkage of the phenotype to the variant (LOD = 4.1), and identified a shared microsatellite haplotype around the ZRS, suggesting that the probands share a common ancestor. An individual homozygous for the 295C allele presented with isolated bilateral triphalangeal thumb resembling the heterozygous phenotype, suggesting that the variant is largely dominant to the wild-type allele. As a functional test of the pathogenicity of the 295C allele, we utilized a mutated ZRS construct to demonstrate that it can drive ectopic anterior expression of a reporter gene in the developing mouse forelimb. We conclude that the 295T>C variant is in fact pathogenic and, in southern England, appears to be the most common cause of triphalangeal thumb. Depending on the dispersal of the founding mutation, it may play a wider role in the aetiology of this disorder. PMID:18463159
Kulanuwat, S; Santiprabhob, J; Phonrat, B; Limwongse, C; Tungtrongchitr, A; Chongviriyaphan, N; Tungtrongchitr, R
2015-08-07
Genetic variants of the POMC and PCSK1 genes cause severe obesity among patients in the early stages of childhood. This family-based study analyzed the links between single nucleotide polymorphisms (SNPs) in either the POMC or PCSK1 genes and obesity, as well as obesity-related traits among obese Thai children and their families. The variants rs1042571 and rs6713532 in the POMC gene in a sample of 83 obese children and their family members were investigated using polymerase chain reaction (PCR)-restriction fragment length polymorphism. In addition, the SNPs rs6232, rs155971, rs3762986, rs3811942, and rs371897784 of PCSK1 were analyzed in all samples using PCR and gene sequencing methods. Participants with the homozygous variant genotype in rs155971 had significantly elevated cholesterol and low-density lipoprotein cholesterol (LDL-C) levels (P = 0.011, OR = 1.025, 95%CI = 1.006-1.045; and P = 0.006, OR = 1.030, 95%CI = 1.009-1.053, respectively) after adjustment for age, gender, and body mass index (BMI). In addition, patients with the heterozygous variant genotype in rs371897784 of PCSK1 had a 1.249- fold higher risk (95%CI = 1.081-1.444, P = 0.027) of increased waist circumference than patients with the normal genotype, after adjustment for age, gender, and BMI. However, this analysis did not find any correlation between obesity and SNPs in PCSK1 and POMC. Therefore, these common variants in PCSK1 and POMC were not the major cause of obesity in the Thai subjects sampled. However, variants in PCSK1 did affect cholesterol level, LDL-C level, and waist circumference.
Vidal-Taboada, José M; Pugliese, Marco; Salvadó, Maria; Gámez, Josep; Mahy, Nicole; Rodríguez, Manuel J
2018-02-28
The ATP-sensitive potassium (K ATP ) channel directly regulates the microglia-mediated inflammatory response following CNS injury. To determine the putative role of the K ATP channel in amyotrophic lateral sclerosis (ALS) pathology, we investigated whether ALS induces changes in K ATP channel expression in the spinal cord and motor cortex. We also characterized new functional variants of human ABCC8, ABCC9, KCNJ8, and KCNJ11 genes encoding for the K ATP channel and analyzed their association with ALS risk, rate of progression, and survival in a Spanish ALS cohort. The expression of ABCC8 and KCNJ8 genes was enhanced in the spinal cord of ALS samples, and KCNJ11 increased in motor cortex of ALS samples, as determined by real-time polymerase chain reaction. We then sequenced the exons and regulatory regions of K ATP channel genes from a subset of 28 ALS patients and identified 50 new genetic variants. For the case-control association analysis, we genotyped five selected polymorphisms with predicted functional relevance in 185 Spanish ALS (134 spinal ALS and 51 bulbar ALS) patients and 493 controls. We found that bulbar ALS patients presenting the G/G genotype of the rs4148646 variant of ABCC8 and the T/T genotype of the rs5219 variant of KCNJ11 survived longer than other ALS patients presenting other genotypes. Also, the C/C genotype of the rs4148642 variant of ABCC8 and the T/C genotype of the rs148416760 variant of ABCC9 modified the progression rate in spinal ALS patients. Our results suggest that the K ATP channel plays a role in the pathophysiological mechanisms of ALS.
Utility of Post-Mortem Genetic Testing in Cases of Sudden Arrhythmic Death Syndrome.
Lahrouchi, Najim; Raju, Hariharan; Lodder, Elisabeth M; Papatheodorou, Efstathios; Ware, James S; Papadakis, Michael; Tadros, Rafik; Cole, Della; Skinner, Jonathan R; Crawford, Jackie; Love, Donald R; Pua, Chee J; Soh, Bee Y; Bhalshankar, Jaydutt D; Govind, Risha; Tfelt-Hansen, Jacob; Winkel, Bo G; van der Werf, Christian; Wijeyeratne, Yanushi D; Mellor, Greg; Till, Jan; Cohen, Marta C; Tome-Esteban, Maria; Sharma, Sanjay; Wilde, Arthur A M; Cook, Stuart A; Bezzina, Connie R; Sheppard, Mary N; Behr, Elijah R
2017-05-02
Sudden arrhythmic death syndrome (SADS) describes a sudden death with negative autopsy and toxicological analysis. Cardiac genetic disease is a likely etiology. This study investigated the clinical utility and combined yield of post-mortem genetic testing (molecular autopsy) in cases of SADS and comprehensive clinical evaluation of surviving relatives. We evaluated 302 expertly validated SADS cases with suitable DNA (median age: 24 years; 65% males) who underwent next-generation sequencing using an extended panel of 77 primary electrical disorder and cardiomyopathy genes. Pathogenic and likely pathogenic variants were classified using American College of Medical Genetics (ACMG) consensus guidelines. The yield of combined molecular autopsy and clinical evaluation in 82 surviving families was evaluated. A gene-level rare variant association analysis was conducted in SADS cases versus controls. A clinically actionable pathogenic or likely pathogenic variant was identified in 40 of 302 cases (13%). The main etiologies established were catecholaminergic polymorphic ventricular tachycardia and long QT syndrome (17 [6%] and 11 [4%], respectively). Gene-based rare variants association analysis showed enrichment of rare predicted deleterious variants in RYR2 (p = 5 × 10 -5 ). Combining molecular autopsy with clinical evaluation in surviving families increased diagnostic yield from 26% to 39%. Molecular autopsy for electrical disorder and cardiomyopathy genes, using ACMG guidelines for variant classification, identified a modest but realistic yield in SADS. Our data highlighted the predominant role of catecholaminergic polymorphic ventricular tachycardia and long QT syndrome, especially the RYR2 gene, as well as the minimal yield from other genes. Furthermore, we showed the enhanced utility of combined clinical and genetic evaluation. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Volkan-Salanci, Bilge; Aksoy, Hakan; Kiratli, Pınar Özgen; Tülümen, Erol; Güler, Nilüfer; Öksüzoglu, Berna; Tokgözoğlu, Lale; Erbaş, Belkıs; Alikaşifoğlu, Mehmet
2012-10-01
The aim of this prospective clinical study is to evaluate the relationship between changes in functional cardiac parameters following anthracycline therapy and carbonyl reductase 3 (CBR3p.V244M) and glutathione S transferase Pi (GSTP1p.I105V) polymorphisms. Seventy patients with normal cardiac function and no history of cardiac disease scheduled to undergo anthracycline chemotherapy were included in the study. The patients' cardiac function was evaluated by gated blood pool scintigraphy and echocardiography before and after chemotherapy, as well as 1 year following therapy. Gene polymorphisms were genotyped in 70 patients using TaqMan probes, validated by DNA sequencing. A deteriorating trend was observed in both systolic and diastolic parameters from GG to AA in CBR3p.V244M polymorphism. Patients with G-allele carriers of GSTP1p.I105V polymorphism were common (60%), with significantly decreased PFR compared to patiens with AA genotype. Variants of CBR3 and GSTP1 enzymes may be associated with changes in short-term functional cardiac parameters.
CNTNAP2 Is Significantly Associated With Speech Sound Disorder in the Chinese Han Population.
Zhao, Yun-Jing; Wang, Yue-Ping; Yang, Wen-Zhu; Sun, Hong-Wei; Ma, Hong-Wei; Zhao, Ya-Ru
2015-11-01
Speech sound disorder is the most common communication disorder. Some investigations support the possibility that the CNTNAP2 gene might be involved in the pathogenesis of speech-related diseases. To investigate single-nucleotide polymorphisms in the CNTNAP2 gene, 300 unrelated speech sound disorder patients and 200 normal controls were included in the study. Five single-nucleotide polymorphisms were amplified and directly sequenced. Significant differences were found in the genotype (P = .0003) and allele (P = .0056) frequencies of rs2538976 between patients and controls. The excess frequency of the A allele in the patient group remained significant after Bonferroni correction (P = .0280). A significant haplotype association with rs2710102T/+rs17236239A/+2538976A/+2710117A (P = 4.10e-006) was identified. A neighboring single-nucleotide polymorphism, rs10608123, was found in complete linkage disequilibrium with rs2538976, and the genotypes exactly corresponded to each other. The authors propose that these CNTNAP2 variants increase the susceptibility to speech sound disorder. The single-nucleotide polymorphisms rs10608123 and rs2538976 may merge into one single-nucleotide polymorphism. © The Author(s) 2015.
Khan, Imran A; Jahan, Parveen; Hasan, Qurratulain; Rao, Pragna
2014-12-01
Gestational diabetes mellitus (GDM) is defined as glucose intolerance first recognized during pregnancy. Insertion/deletion (I/D) polymorphism of a 287 bp Alu repetitive sequence in intron 16 of the angiotensin-converting enzyme (ACE) gene has been widely investigated in Asian Indian populations with different ethnic origins. The present study examined possible association between I/D polymorphism of the ACE gene and GDM in Asian Indian pregnant women. A total of 200 pregnant women (100 GDM and 100 non-GDM) were recruited in this study and I/D polymorphism of a 287 bp Alu1 element inside intron 16 of the ACE gene was examined by polymerase chain reaction (PCR)-based gel electrophoresis. The distribution of the variants like II, ID, and DD genotypes of ACE gene showed differences between normal GDM versus non-GDM subjects, and the frequency of the ID+ DD Vs II genotype was significant (p=0.0002) in the GDM group. ACE gene polymorphism was associated with GDM in Asian Indian pregnant women. © The Author(s) 2013.
Pattaradilokrat, Sittiporn; Sawaswong, Vorthon; Simpalipan, Phumin; Kaewthamasorn, Morakot; Siripoon, Napaporn; Harnyuttanakorn, Pongchai
2016-10-21
An effective malaria vaccine is an urgently needed tool to fight against human malaria, the most deadly parasitic disease of humans. One promising candidate is the merozoite surface protein-3 (MSP-3) of Plasmodium falciparum. This antigenic protein, encoded by the merozoite surface protein (msp-3) gene, is polymorphic and classified according to size into the two allelic types of K1 and 3D7. A recent study revealed that both the K1 and 3D7 alleles co-circulated within P. falciparum populations in Thailand, but the extent of the sequence diversity and variation within each allelic type remains largely unknown. The msp-3 gene was sequenced from 59 P. falciparum samples collected from five endemic areas (Mae Hong Son, Kanchanaburi, Ranong, Trat and Ubon Ratchathani) in Thailand and analysed for nucleotide sequence diversity, haplotype diversity and deduced amino acid sequence diversity. The gene was also subject to population genetic analysis (F st ) and neutrality tests (Tajima's D, Fu and Li D* and Fu and Li' F* tests) to determine any signature of selection. The sequence analyses revealed eight unique DNA haplotypes and seven amino acid sequence variants, with a haplotype and nucleotide diversity of 0.828 and 0.049, respectively. Neutrality tests indicated that the polymorphism detected in the alanine heptad repeat region of MSP-3 was maintained by positive diversifying selection, suggesting its role as a potential target of protective immune responses and supporting its role as a vaccine candidate. Comparison of MSP-3 variants among parasite populations in Thailand, India and Nigeria also inferred a close genetic relationship between P. falciparum populations in Asia. This study revealed the extent of the msp-3 gene diversity in P. falciparum in Thailand, providing the fundamental basis for the better design of future blood stage malaria vaccines against P. falciparum.
Salehi, Samaneh; Emadi-Baygi, Modjtaba; Rezaei, Majdaddin; Kelishadi, Roya; Nikpour, Parvaneh
2017-01-01
Metabolic syndrome (MetS) is a common disorder which is a constellation of clinical features including abdominal obesity, increased level of serum triglycerides (TGs) and decrease of serum high-density lipoprotein-cholesterol (HDL-C), elevated blood pressure, and glucose intolerance. The apolipoprotein A5 (APOA5) is involved in lipid metabolism, influencing the level of plasma TG and HDL-C. In the present study, we aimed to investigate the associations between four INDEL variants of APOA5 gene and the MetS risk. In this case-control study, we genotyped 116 Iranian children and adolescents with/without MetS by using Sanger sequencing method for these INDELs. Then, we explored the association of INDELs with MetS risk and their clinical components by logistic regression and one-way analysis of variance analyses. We identified a novel insertion polymorphism, c. *282-283 insAG/c. *282-283 insG variant, which appears among case and control groups. rs72525532 showed a significant difference for TG levels between various genotype groups. In addition, there were significant associations between newly identified single-nucleotide polymorphism (SNP) and rs72525532 with MetS risk. These results show that rs72525532 and the newly identified SNP may influence the susceptibility of the individuals to MetS.
Primary hyperoxaluria type 1: update and additional mutation analysis of the AGXT gene.
Williams, Emma L; Acquaviva, Cecile; Amoroso, Antonio; Chevalier, Francoise; Coulter-Mackie, Marion; Monico, Carla G; Giachino, Daniela; Owen, Tricia; Robbiano, Angela; Salido, Eduardo; Waterham, Hans; Rumsby, Gill
2009-06-01
Primary hyperoxaluria type 1 (PH1) is an autosomal recessive, inherited disorder of glyoxylate metabolism arising from a deficiency of the alanine:glyoxylate aminotransferase (AGT) enzyme, encoded by the AGXT gene. The disease is manifested by excessive endogenous oxalate production, which leads to impaired renal function and associated morbidity. At least 146 mutations have now been described, 50 of which are newly reported here. The mutations, which occur along the length of the AGXT gene, are predominantly single-nucleotide substitutions (75%), 73 are missense, 19 nonsense, and 18 splice mutations; but 36 major and minor deletions and insertions are also included. There is little association of mutation with ethnicity, the most obvious exception being the p.Ile244Thr mutation, which appears to have North African/Spanish origins. A common, polymorphic variant encoding leucine at codon 11, the so-called minor allele, has significantly lower catalytic activity in vitro, and has a higher frequency in PH1 compared to the rest of the population. This polymorphism influences enzyme targeting in the presence of the most common Gly170Arg mutation and potentiates the effect of several other pathological sequence variants. This review discusses the spectrum of AGXT mutations and polymorphisms, their clinical significance, and their diagnostic relevance.
Xu, Gaolian; You, Qimin; Pickerill, Sam; Zhong, Huayan; Wang, Hongying; Shi, Jian; Luo, Ying; You, Paul; Kong, Huimin; Lu, Fengmin; Hu, Lin
2010-07-01
Chronic hepatitis B virus (CHBV) infection causes cirrhosis and hepatocellular carcinoma. Lamivudine (LAM) has been successfully used to treat CHBV infections but prolonged use leads to the emergence of drug-resistant variants. This is primarily linked to a mutation in the tyrosine-methionine-aspartate-aspartate (YMDD) motif of the HBV polymerase gene at position 204. Rapid diagnosis of drug-resistant HBV is necessary for a prompt treatment response. Common diagnostic methods such as sequencing and restriction fragment length polymorphism (RFLP) analysis lack sensitivity and require significant processing. The aim of this study was to demonstrate the usefulness of a novel diagnostic method that combines polymerase chain reaction (PCR), ligase detection reaction (LDR) and a nucleic acid detection strip (NADS) in detecting site-specific mutations related to HBV LAM resistance. We compared this method (PLNA) to direct sequencing and RFLP analysis in 50 clinical samples from HBV infected patients. There was 90% concordance between all three results. PLNA detected more samples containing mutant variants than both sequencing and RFLP analysis and was more sensitive in detecting mixed variant populations. Plasmid standards indicated that the sensitivity of PLNA is at or below 3,000 copies per ml and that it can detect a minor variant at 5% of the total viral population. This warrants its further development and suggests that the PLNA method could be a useful tool in detecting LAM resistance. (c) 2010 Wiley-Liss, Inc.
Fonseca, Dora Janeth; Ortega-Recalde, Oscar; Esteban-Perez, Clara; Moreno-Ortiz, Harold; Patiño, Liliana Catherine; Bermúdez, Olga María; Ortiz, Angela María; Restrepo, Carlos M; Lucena, Elkin; Laissue, Paul
2014-11-01
BMP15 has drawn particular attention in the pathophysiology of reproduction, as its mutations in mammalian species have been related to different reproductive phenotypes. In humans, BMP15 coding regions have been sequenced in large panels of women with premature ovarian failure (POF), but only some mutations have been definitely validated as causing the phenotype. A functional association between the BMP15 c.-9C>G promoter polymorphism and cause of POF have been reported. The aim of this study was to determine the potential functional effect of this sequence variant on specific BMP15 promoter transactivation disturbances. Bioinformatics was used to identify transcription factor binding sites located on the promoter region of BMP15. Reverse transcription polymerase chain reaction was used to study specific gene expression in ovarian tissue. Luciferase reporter assays were used to establish transactivation disturbances caused by the BMP15 c.-9C>G variant. The c.-9C>G variant was found to modify the PITX1 transcription factor binding site. PITX1 and BMP15 co-expressed in human and mouse ovarian tissue, and PITX1 transactivated both BMP15 promoter versions (-9C and -9G). It was found that the BMP15 c.-9G allele was related to BMP15 increased transcription, supporting c.-9C>G as a causal agent of POF. Copyright © 2014 Reproductive Healthcare Ltd. Published by Elsevier Ltd. All rights reserved.
GenProBiS: web server for mapping of sequence variants to protein binding sites.
Konc, Janez; Skrlj, Blaz; Erzen, Nika; Kunej, Tanja; Janezic, Dusanka
2017-07-03
Discovery of potentially deleterious sequence variants is important and has wide implications for research and generation of new hypotheses in human and veterinary medicine, and drug discovery. The GenProBiS web server maps sequence variants to protein structures from the Protein Data Bank (PDB), and further to protein-protein, protein-nucleic acid, protein-compound, and protein-metal ion binding sites. The concept of a protein-compound binding site is understood in the broadest sense, which includes glycosylation and other post-translational modification sites. Binding sites were defined by local structural comparisons of whole protein structures using the Protein Binding Sites (ProBiS) algorithm and transposition of ligands from the similar binding sites found to the query protein using the ProBiS-ligands approach with new improvements introduced in GenProBiS. Binding site surfaces were generated as three-dimensional grids encompassing the space occupied by predicted ligands. The server allows intuitive visual exploration of comprehensively mapped variants, such as human somatic mis-sense mutations related to cancer and non-synonymous single nucleotide polymorphisms from 21 species, within the predicted binding sites regions for about 80 000 PDB protein structures using fast WebGL graphics. The GenProBiS web server is open and free to all users at http://genprobis.insilab.org. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Intra-isolate genome variation in arbuscular mycorrhizal fungi persists in the transcriptome.
Boon, E; Zimmerman, E; Lang, B F; Hijri, M
2010-07-01
Arbuscular mycorrhizal fungi (AMF) are heterokaryotes with an unusual genetic makeup. Substantial genetic variation occurs among nuclei within a single mycelium or isolate. AMF reproduce through spores that contain varying fractions of this heterogeneous population of nuclei. It is not clear whether this genetic variation on the genome level actually contributes to the AMF phenotype. To investigate the extent to which polymorphisms in nuclear genes are transcribed, we analysed the intra-isolate genomic and cDNA sequence variation of two genes, the large subunit ribosomal RNA (LSU rDNA) of Glomus sp. DAOM-197198 (previously known as G. intraradices) and the POL1-like sequence (PLS) of Glomus etunicatum. For both genes, we find high sequence variation at the genome and transcriptome level. Reconstruction of LSU rDNA secondary structure shows that all variants are functional. Patterns of PLS sequence polymorphism indicate that there is one functional gene copy, PLS2, which is preferentially transcribed, and one gene copy, PLS1, which is a pseudogene. This is the first study that investigates AMF intra-isolate variation at the transcriptome level. In conclusion, it is possible that, in AMF, multiple nuclear genomes contribute to a single phenotype.
Wendt, Frank R; Warshauer, David H; Zeng, Xiangpei; Churchill, Jennifer D; Novroski, Nicole M M; Song, Bing; King, Jonathan L; LaRue, Bobby L; Budowle, Bruce
2016-11-01
Short tandem repeat (STR) loci are the traditional markers used for kinship, missing persons, and direct comparison human identity testing. These markers hold considerable value due to their highly polymorphic nature, amplicon size, and ability to be multiplexed. However, many STRs are still too large for use in analysis of highly degraded DNA. Small bi-allelic polymorphisms, such as insertions/deletions (INDELs), may be better suited for analyzing compromised samples, and their allele size differences are amenable to analysis by capillary electrophoresis. The INDEL marker allelic states range in size from 2 to 6 base pairs, enabling small amplicon size. In addition, heterozygote balance may be increased by minimizing preferential amplification of the smaller allele, as is more common with STR markers. Multiplexing a large number of INDELs allows for generating panels with high discrimination power. The Nextera™ Rapid Capture Custom Enrichment Kit (Illumina, Inc., San Diego, CA) and massively parallel sequencing (MPS) on the Illumina MiSeq were used to sequence 68 well-characterized INDELs in four major US population groups. In addition, the STR Allele Identification Tool: Razor (STRait Razor) was used in a novel way to analyze INDEL sequences and detect adjacent single nucleotide polymorphisms (SNPs) and other polymorphisms. This application enabled the discovery of unique allelic variants, which increased the discrimination power and decreased the single-locus random match probabilities (RMPs) of 22 of these well-characterized INDELs which can be considered as microhaplotypes. These findings suggest that additional microhaplotypes containing human identification (HID) INDELs may exist elsewhere in the genome. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Savio, Andrea J.; Bapat, Bharati
2017-01-01
ABSTRACT The MLH1 promoter polymorphism rs1800734 is associated with MLH1 CpG island hypermethylation and expression loss in colorectal cancer (CRC). Conversely, variant rs1800734 is associated with MLH1 shore, but not island, hypomethylation in peripheral blood mononuclear cell DNA. To explore these distinct patterns, MLH1 CpG island and shore methylation was assessed in CRC cell lines stratified by rs1800734 genotype. Cell lines containing the variant A allele demonstrated MLH1 shore hypomethylation compared to wild type (GG). There was significant enrichment of transcription factor AP4 at the MLH1 promoter in GG and GA cell lines, but not the AA cell line, by chromatin immunoprecipitation studies. Preferential binding to the G allele was confirmed by sequencing in the GA cell line. The enhancer-associated histone modification H3K4me1 was enriched at the MLH1 shore; however, H3K27ac was not, indicating the shore is an inactive enhancer. These results demonstrate the role of variant rs1800734 in altering transcription factor binding as well as epigenetics at regions beyond the MLH1 CpG island in which it is located. PMID:28304185
Savio, Andrea J; Bapat, Bharati
2017-06-03
The MLH1 promoter polymorphism rs1800734 is associated with MLH1 CpG island hypermethylation and expression loss in colorectal cancer (CRC). Conversely, variant rs1800734 is associated with MLH1 shore, but not island, hypomethylation in peripheral blood mononuclear cell DNA. To explore these distinct patterns, MLH1 CpG island and shore methylation was assessed in CRC cell lines stratified by rs1800734 genotype. Cell lines containing the variant A allele demonstrated MLH1 shore hypomethylation compared to wild type (GG). There was significant enrichment of transcription factor AP4 at the MLH1 promoter in GG and GA cell lines, but not the AA cell line, by chromatin immunoprecipitation studies. Preferential binding to the G allele was confirmed by sequencing in the GA cell line. The enhancer-associated histone modification H3K4me1 was enriched at the MLH1 shore; however, H3K27ac was not, indicating the shore is an inactive enhancer. These results demonstrate the role of variant rs1800734 in altering transcription factor binding as well as epigenetics at regions beyond the MLH1 CpG island in which it is located.
Voruganti, V. Saroja; Cole, Shelley A.; Haack, Karin; Comuzzie, Anthony G.; Muzny, Donna M.; Wheeler, David A.; Chang, Kyle; Hawes, Alicia; Gibbs, Richard A.
2011-01-01
Our objective was to resequence insulin receptor substrate 2 (IRS2) to identify variants associated with obesity- and diabetes-related traits in Hispanic children. Exonic and intronic segments, 5′ and 3′ flanking regions of IRS2 (∼14.5 kb), were bidirectionally sequenced for single nucleotide polymorphism (SNP) discovery in 934 Hispanic children using 3730XL DNA Sequencers. Additionally, 15 SNPs derived from Illumina HumanOmni1-Quad BeadChips were analyzed. Measured genotype analysis tested associations between SNPs and obesity and diabetes-related traits. Bayesian quantitative trait nucleotide analysis was used to statistically infer the most likely functional polymorphisms. A total of 140 SNPs were identified with minor allele frequencies (MAF) ranging from 0.001 to 0.47. Forty-two of the 70 coding SNPs result in nonsynonymous amino acid substitutions relative to the consensus sequence; 28 SNPs were detected in the promoter, 12 in introns, 28 in the 3′-UTR, and 2 in the 5′-UTR. Two insertion/deletions (indels) were detected. Ten independent rare SNPs (MAF = 0.001–0.009) were associated with obesity-related traits (P = 0.01–0.00002). SNP 10510452_139 in the promoter region was shown to have a high posterior probability (P = 0.77–0.86) of influencing BMI, fat mass, and waist circumference in Hispanic children. SNP 10510452_139 contributed between 2 and 4% of the population variance in body weight and composition. None of the SNPs or indels were associated with diabetes-related traits or accounted for a previously identified quantitative trait locus on chromosome 13 for fasting serum glucose. Rare but not common IRS2 variants may play a role in the regulation of body weight but not an essential role in fasting glucose homeostasis in Hispanic children. PMID:21771880
Linkage and mutational analysis of familial Alzheimer disease kindreds for the APP gene region
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kamino, K.; Anderson, L.; O'dahl, S.
1992-11-01
A large number of familial Alzheimer disease (FAD) kindreds were examined to determine whether mutations in the amyloid precursor protein (APP) gene could be responsible for the disease. Previous studies have identified three mutations at APP codon 717 which are pathogenic for Alzheimer disease (AD). Samples from affected subjects were examined for mutations in exons 16 and 17 of the APP gene. A combination of direct sequencing and single-strand conformational polymorphism analysis was used. Sporadic AD and normal controls were also examined by the same methods. Five sequence variants were identified. One variant at APP codon 693 resulted in amore » Glu[yields]Gly change. This is the same codon as the hereditary cerebral hemorrhage with amyloidosis-Dutch type Glu[yields]Gln mutation. Another single-base change at APP codon 708 did not alter the amino acid encoded at this site. Two point mutations and a 6-bp deletion were identified in the intronic sequences surrounding exon 17. None of the variants could be unambigously determined to be responsible for FAD. The larger families were also analyzed by testing for linkage of FAD to a highly polymorphic short tandem repeat marker (D21S210) that is tightly linked to APP. Highly negative LOD scores were obtained for the family groups tested, and linkage was formally excluded beyond [theta] = .10 for the Volga German kindreds, [theta] = .20 for early-onset non-Volga Germans, and [theta] = .10 for late-onset families. LOD scores for linkage of FAD to markers centromeric to APP (D21S1/S11, D21S13, and D21S215) were also negative in the three family groups. These studies show that APP mutations account for AD in only a small fraction of FAD kindreds. 49 refs., 6 figs., 4 tabs.« less
Insertion and deletion polymorphisms of the ancient AluS family in the human genome.
Kryatova, Maria S; Steranka, Jared P; Burns, Kathleen H; Payer, Lindsay M
2017-01-01
Polymorphic Alu elements account for 17% of structural variants in the human genome. The majority of these belong to the youngest AluY subfamilies, and most structural variant discovery efforts have focused on identifying Alu polymorphisms from these currently retrotranspositionally active subfamilies. In this report we analyze polymorphisms from the evolutionarily older AluS subfamily, whose peak activity was tens of millions of years ago. We annotate the AluS polymorphisms, assess their likely mechanism of origin, and evaluate their contribution to structural variation in the human genome. Of 52 previously reported polymorphic AluS elements ascertained for this study, 48 were confirmed to belong to the AluS subfamily using high stringency subfamily classification criteria. Of these, the majority (77%, 37/48) appear to be deletion polymorphisms. Two polymorphic AluS elements (4%) have features of non-classical Alu insertions and one polymorphic AluS element (2%) likely inserted by a mechanism involving internal priming. Seven AluS polymorphisms (15%) appear to have arisen by the classical target-primed reverse transcription (TPRT) retrotransposition mechanism. These seven TPRT products are 3' intact with 3' poly-A tails, and are flanked by target site duplications; L1 ORF2p endonuclease cleavage sites were also observed, providing additional evidence that these are L1 ORF2p endonuclease-mediated TPRT insertions. Further sequence analysis showed strong conservation of both the RNA polymerase III promoter and SRP9/14 binding sites, important for mediating transcription and interaction with retrotransposition machinery, respectively. This conservation of functional features implies that some of these are fairly recent insertions since they have not diverged significantly from their respective retrotranspositionally competent source elements. Of the polymorphic AluS elements evaluated in this report, 15% (7/48) have features consistent with TPRT-mediated insertion, thus suggesting that some AluS elements have been more active recently than previously thought, or that fixation of AluS insertion alleles remains incomplete. These data expand the potential significance of polymorphic AluS elements in contributing to structural variation in the human genome. Future discovery efforts focusing on polymorphic AluS elements are likely to identify more such polymorphisms, and approaches tailored to identify deletion alleles may be warranted.
Proposal for the nomenclature of human plasminogen (PLG) polymorphism.
Skoda, U; Bertrams, J; Dykes, D; Eiberg, H; Hobart, M; Hummel, K; Kühnl, P; Mauff, G; Nakamura, S; Nishimukai, H
1986-01-01
Since its discovery, human plasminogen (PLG) polymorphism has received widespread acceptance in population genetics and forensic haematology. Due to the large number of variant alleles described, a PLG reference typing and Plasminogen Symposium was held, at which a nomenclature proposal was inaugurated. The technology of comparing PLG variants was based on isoelectric focusing and subsequent detection by caseinolytic overlay and 'Western' blotting. Typing results permitted comparison of so far described variant designations and resulted in a new nomenclature proposal for PLG polymorphism. It is recommended that the two most common alleles found in all investigated races be called: PLG*A (previously also PLG*1) and PLG*B (previously also PLG*2), the known variants with acidic pI: PLG*A1 to *A3, intermediate variants: PLG*M1 to *M5, PLG*M5 being functionally inactive, and basic variants: PLG*B1 to *B3. For future classification of newly discovered variants, samples should be compared at any of the laboratories participating in the reference typing.
Gu, Hong; Sun, Erdan; Cui, Lei; Yang, Xiufen; Lim, Apiradee; Xu, Jun; Snellingen, Torkel; Liu, Xipu; Wang, Ningli; Liu, Ningpu
2012-10-01
To investigate the association between single-nucleotide polymorphisms in the pi isoform of glutathione S-transferase (GSTP1) gene and the risk of exudative age-related macular degeneration (AMD) in a Chinese case-control cohort. A total of 131 Chinese patients with exudative AMD and 138 control individuals were recruited. Genomic DNA was extracted from venous blood leukocytes. Two common nonsynonymous single-nucleotide polymorphisms in GSTP1 (rs1695 and rs1138272) were genotyped by polymerase chain reaction followed by allele-specific restriction enzyme digestion and direct sequencing. Significant association with exudative AMD was detected for single-nucleotide polymorphism, rs1695 (P = 0.019). The risk G allele frequencies were 21.8% in AMD patients and 12.7% in control subjects (P = 0.007). Compared with the wild-type AA genotype, odds ratio for the risk of AMD was 1.91 (95% confidence interval, 1.09-3.35) for the heterozygous AG genotype and 2.52 (95% confidence interval, 0.6-10.61) for the homozygous GG genotype. In contrast, rs1138272 was not associated with exudative AMD (P = 1.00). The risk G allele frequencies of rs1138272 were 0.4% in AMD patients and 0.4% in control subjects (P = 1.00). Our data suggest that the GSTP1 variant rs1695 moderately increases the risk of exudative AMD. The variant rs1138272 was rare and was not associated with exudative AMD in this Chinese cohort.
Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows
Torri, Federica; Dinov, Ivo D.; Zamanyan, Alen; Hobel, Sam; Genco, Alex; Petrosyan, Petros; Clark, Andrew P.; Liu, Zhizhong; Eggert, Paul; Pierce, Jonathan; Knowles, James A.; Ames, Joseph; Kesselman, Carl; Toga, Arthur W.; Potkin, Steven G.; Vawter, Marquis P.; Macciardi, Fabio
2012-01-01
Whole-genome and exome sequencing have already proven to be essential and powerful methods to identify genes responsible for simple Mendelian inherited disorders. These methods can be applied to complex disorders as well, and have been adopted as one of the current mainstream approaches in population genetics. These achievements have been made possible by next generation sequencing (NGS) technologies, which require substantial bioinformatics resources to analyze the dense and complex sequence data. The huge analytical burden of data from genome sequencing might be seen as a bottleneck slowing the publication of NGS papers at this time, especially in psychiatric genetics. We review the existing methods for processing NGS data, to place into context the rationale for the design of a computational resource. We describe our method, the Graphical Pipeline for Computational Genomics (GPCG), to perform the computational steps required to analyze NGS data. The GPCG implements flexible workflows for basic sequence alignment, sequence data quality control, single nucleotide polymorphism analysis, copy number variant identification, annotation, and visualization of results. These workflows cover all the analytical steps required for NGS data, from processing the raw reads to variant calling and annotation. The current version of the pipeline is freely available at http://pipeline.loni.ucla.edu. These applications of NGS analysis may gain clinical utility in the near future (e.g., identifying miRNA signatures in diseases) when the bioinformatics approach is made feasible. Taken together, the annotation tools and strategies that have been developed to retrieve information and test hypotheses about the functional role of variants present in the human genome will help to pinpoint the genetic risk factors for psychiatric disorders. PMID:23139896
Effects of human SAMHD1 polymorphisms on HIV-1 susceptibility
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, Tommy E.; Brandariz-Nuñez, Alberto; Valle-Casuso, Jose Carlos
SAMHD1 is a human restriction factor that prevents efficient infection of macrophages, dendritic cells and resting CD4+ T cells by HIV-1. Here we explored the antiviral activity and biochemical properties of human SAMHD1 polymorphisms. Our studies focused on human SAMHD1 polymorphisms that were previously identified as evolving under positive selection for rapid amino acid replacement during primate speciation. The different human SAMHD1 polymorphisms were tested for their ability to block HIV-1, HIV-2 and equine infectious anemia virus (EIAV). All studied SAMHD1 variants block HIV-1, HIV-2 and EIAV infection when compared to wild type. We found that these variants did notmore » lose their ability to oligomerize or to bind RNA. Furthermore, all tested variants were susceptible to degradation by Vpx, and localized to the nuclear compartment. We tested the ability of human SAMHD1 polymorphisms to decrease the dNTP cellular levels. In agreement, none of the different SAMHD1 variants lost their ability to reduce cellular levels of dNTPs. Finally, we found that none of the tested human SAMHD1 polymorphisms affected the ability of the protein to block LINE-1 retrotransposition. - Highlights: • Human SAMHD1 single-nucleotide polymorphisms block HIV-1 and HIV-2 infection. • SAMHD1 polymorphisms do not affect its ability to block LINE-1 retrotransposition. • SAMHD1 polymorphisms decrease the cellular levels of dNTPs.« less
Iacono, William G; Malone, Stephen M; Vaidyanathan, Uma; Vrieze, Scott I
2014-12-01
This article provides an introductory overview of the investigative strategy employed to evaluate the genetic basis of 17 endophenotypes examined as part of a 20-year data collection effort from the Minnesota Center for Twin and Family Research. Included are characterization of the study samples, descriptive statistics for key properties of the psychophysiological measures, and rationale behind the steps taken in the molecular genetic study design. The statistical approach included (a) biometric analysis of twin and family data, (b) heritability analysis using 527,829 single nucleotide polymorphisms (SNPs), (c) genome-wide association analysis of these SNPs and 17,601 autosomal genes, (d) follow-up analyses of candidate SNPs and genes hypothesized to have an association with each endophenotype, (e) rare variant analysis of nonsynonymous SNPs in the exome, and (f) whole genome sequencing association analysis using 27 million genetic variants. These methods were used in the accompanying empirical articles comprising this special issue, Genome-Wide Scans of Genetic Variants for Psychophysiological Endophenotypes. Copyright © 2014 Society for Psychophysiological Research.
Quantifying evolutionary dynamics from variant-frequency time series
NASA Astrophysics Data System (ADS)
Khatri, Bhavin S.
2016-09-01
From Kimura’s neutral theory of protein evolution to Hubbell’s neutral theory of biodiversity, quantifying the relative importance of neutrality versus selection has long been a basic question in evolutionary biology and ecology. With deep sequencing technologies, this question is taking on a new form: given a time-series of the frequency of different variants in a population, what is the likelihood that the observation has arisen due to selection or neutrality? To tackle the 2-variant case, we exploit Fisher’s angular transformation, which despite being discovered by Ronald Fisher a century ago, has remained an intellectual curiosity. We show together with a heuristic approach it provides a simple solution for the transition probability density at short times, including drift, selection and mutation. Our results show under that under strong selection and sufficiently frequent sampling these evolutionary parameters can be accurately determined from simulation data and so they provide a theoretical basis for techniques to detect selection from variant or polymorphism frequency time-series.
Quantifying evolutionary dynamics from variant-frequency time series.
Khatri, Bhavin S
2016-09-12
From Kimura's neutral theory of protein evolution to Hubbell's neutral theory of biodiversity, quantifying the relative importance of neutrality versus selection has long been a basic question in evolutionary biology and ecology. With deep sequencing technologies, this question is taking on a new form: given a time-series of the frequency of different variants in a population, what is the likelihood that the observation has arisen due to selection or neutrality? To tackle the 2-variant case, we exploit Fisher's angular transformation, which despite being discovered by Ronald Fisher a century ago, has remained an intellectual curiosity. We show together with a heuristic approach it provides a simple solution for the transition probability density at short times, including drift, selection and mutation. Our results show under that under strong selection and sufficiently frequent sampling these evolutionary parameters can be accurately determined from simulation data and so they provide a theoretical basis for techniques to detect selection from variant or polymorphism frequency time-series.
Genetic variations in NADPH-CYP450 oxidoreductase in a Czech Slavic cohort.
Tomková, Mária; Panda, Satya Prakash; Šeda, Ondřej; Baxová, Alice; Hůlková, Martina; Siler Masters, Bettie Sue; Martásek, Pavel
2015-01-01
Estimating polymorphic allele frequencies of the NADPH-CYP450 oxidoreductase (POR) gene in a Czech Slavic population. The POR gene was analyzed in 322 individuals from a control cohort by sequencing and high resolution melting analysis. We identified seven unreported SNP genetic variations, including two SNPs in the 5' flanking region (g.4965C>T and g.4994G>T), one intronic variant (c.1899-20C>T), one synonymous SNP (p.20Ala=) and three nonsynonymous SNPs (p.Thr29Ser, p.Pro384Leu and p.Thr529Met). The p.Pro384Leu variant exhibited reduced enzymatic activities compared with wild-type. New POR variant identification indicates the number of uncommon variants might be specific for each subpopulation being investigated, particularly germane to the singular role that POR plays in providing reducing equivalents to all CYP450s in the endoplasmic reticulum. Original submitted 15 September 2014; Revision submitted 17 November 2014.
Global variation in CYP2C8–CYP2C9 functional haplotypes
Speed, William C; Kang, Soonmo Peter; Tuck, David P; Harris, Lyndsay N; Kidd, Kenneth K
2009-01-01
We have studied the global frequency distributions of 10 single nucleotide polymorphisms (SNPs) across 132 kb of CYP2C8 and CYP2C9 in ∼2500 individuals representing 45 populations. Five of the SNPs were in noncoding sequences; the other five involved the more common missense variants (four in CYP2C8, one in CYP2C9) that change amino acids in the gene products. One haplotype containing two CYP2C8 coding variants and one CYP2C9 coding variant reaches an average frequency of 10% in Europe; a set of haplotypes with a different CYP2C8 coding variant reaches 17% in Africa. In both cases these haplotypes are found in other regions of the world at <1%. This considerable geographic variation in haplotype frequencies impacts the interpretation of CYP2C8/CYP2C9 association studies, and has pharmacogenomic implications for drug interactions. PMID:19381162
2012-01-01
Background Water stress limits plant survival and production in many parts of the world. Identification of genes and alleles responding to water stress conditions is important in breeding plants better adapted to drought. Currently there are no studies examining the transcriptome wide gene and allelic expression patterns under water stress conditions. We used RNA sequencing (RNA-seq) to identify the candidate genes and alleles and to explore the evolutionary signatures of selection. Results We studied the effect of water stress on gene expression in Eucalyptus camaldulensis seedlings derived from three natural populations. We used reference-guided transcriptome mapping to study gene expression. Several genes showed differential expression between control and stress conditions. Gene ontology (GO) enrichment tests revealed up-regulation of 140 stress-related gene categories and down-regulation of 35 metabolic and cell wall organisation gene categories. More than 190,000 single nucleotide polymorphisms (SNPs) were detected and 2737 of these showed differential allelic expression. Allelic expression of 52% of these variants was correlated with differential gene expression. Signatures of selection patterns were studied by estimating the proportion of nonsynonymous to synonymous substitution rates (Ka/Ks). The average Ka/Ks ratio among the 13,719 genes was 0.39 indicating that most of the genes are under purifying selection. Among the positively selected genes (Ka/Ks > 1.5) apoptosis and cell death categories were enriched. Of the 287 positively selected genes, ninety genes showed differential expression and 27 SNPs from 17 positively selected genes showed differential allelic expression between treatments. Conclusions Correlation of allelic expression of several SNPs with total gene expression indicates that these variants may be the cis-acting variants or in linkage disequilibrium with such variants. Enrichment of apoptosis and cell death gene categories among the positively selected genes reveals the past selection pressures experienced by the populations used in this study. PMID:22853646
Genetic study of intracranial aneurysms.
Yan, Junxia; Hitomi, Toshiaki; Takenaka, Katsunobu; Kato, Masayasu; Kobayashi, Hatasu; Okuda, Hiroko; Harada, Kouji H; Koizumi, Akio
2015-03-01
Rupture of intracranial aneurysms (IAs) causes subarachnoid hemorrhage, leading to immediate death or severe disability. Identification of the genetic factors involved is critical for disease prevention and treatment. We aimed to identify the susceptibility genes for IAs. Exome sequencing was performed in 12 families with histories of multiple cases of IA (number of cases per family ≥3), with a total of 42 cases. Various filtering strategies were used to select the candidate variants. Replicate association studies of several candidate variants were performed in probands of 24 additional IA families and 426 sporadic IA cases. Functional analysis for the mutations was conducted. After sequencing and filtering, 78 variants were selected for the following reasons: allele frequencies of variants in 42 patients was significantly (P<0.05) larger than expected; variants were completely shared by all patients with IA within ≥1 family; variants predicted damage to the structure or function of the protein by PolyPhen-2 (Polymorphism Phenotyping V2) and SIFT (Sorting Intolerance From Tolerant). We selected 10 variants from 9 genes (GPR63, ADAMST15, MLL2, IL10RA, PAFAH2, THBD, IL11RA, FILIP1L, and ZNF222) to form 78 candidate variants by considering commonness in families, known disease genes, or ontology association with angiogenesis. Replicate association studies revealed that only p.E133Q in ADAMTS15 was aggregated in the familial IA cases (odds ratio, 5.96; 95% confidence interval, 2.40-14.82; P=0.0001; significant after the Bonferroni correction [P=0.05/78=0.0006]). Silencing ADAMTS15 and overexpression of ADAMTS15 p.E133Q accelerated endothelial cell migration, suggesting that ADAMTS15 may have antiangiogenic activity. ADAMTS15 is a candidate gene for IAs. © 2015 American Heart Association, Inc.
Monroe, Glen R; Kappen, Isabelle FPM; Stokman, Marijn F; Terhal, Paulien A; van den Boogaard, Marie-José H; Savelberg, Sanne MC; van der Veken, Lars T; van Es, Robert JJ; Lens, Susanne M; Hengeveld, Rutger C; Creton, Marijn A; Janssen, Nard G; Mink van der Molen, Aebele B; Ebbeling, Michelle B; Giles, Rachel H; Knoers, Nine V; van Haaften, Gijs
2016-01-01
The oral-facial-digital (OFD) syndromes comprise a group of related disorders with a combination of oral, facial and digital anomalies. Variants in several ciliary genes have been associated with subtypes of OFD syndrome, yet in most OFD patients the underlying cause remains unknown. We investigated the molecular basis of disease in two brothers with OFD type II, Mohr syndrome, by performing single-nucleotide polymorphism (SNP)-array analysis on the brothers and their healthy parents to identify homozygous regions and candidate genes. Subsequently, we performed whole-exome sequencing (WES) on the family. Using WES, we identified compound heterozygous variants c.[464G>C][1226G>A] in NIMA (Never in Mitosis Gene A)-Related Kinase 1 (NEK1). The novel variant c.464G>C disturbs normal splicing in an essential region of the kinase domain. The nonsense variant c.1226G>A, p.(Trp409*), results in nonsense-associated alternative splicing, removing the first coiled-coil domain of NEK1. Candidate variants were confirmed with Sanger sequencing and alternative splicing assessed with cDNA analysis. Immunocytochemistry was used to assess cilia number and length. Patient-derived fibroblasts showed severely reduced ciliation compared with control fibroblasts (18.0 vs 48.9%, P<0.0001), but showed no significant difference in cilia length. In conclusion, we identified compound heterozygous deleterious variants in NEK1 in two brothers with Mohr syndrome. Ciliation in patient fibroblasts is drastically reduced, consistent with a ciliary defect pathogenesis. Our results establish NEK1 variants involved in the etiology of a subset of patients with OFD syndrome type II and support the consideration of including (routine) NEK1 analysis in patients suspected of OFD. PMID:27530628
Monroe, Glen R; Kappen, Isabelle Fpm; Stokman, Marijn F; Terhal, Paulien A; van den Boogaard, Marie-José H; Savelberg, Sanne Mc; van der Veken, Lars T; van Es, Robert Jj; Lens, Susanne M; Hengeveld, Rutger C; Creton, Marijn A; Janssen, Nard G; Mink van der Molen, Aebele B; Ebbeling, Michelle B; Giles, Rachel H; Knoers, Nine V; van Haaften, Gijs
2016-12-01
The oral-facial-digital (OFD) syndromes comprise a group of related disorders with a combination of oral, facial and digital anomalies. Variants in several ciliary genes have been associated with subtypes of OFD syndrome, yet in most OFD patients the underlying cause remains unknown. We investigated the molecular basis of disease in two brothers with OFD type II, Mohr syndrome, by performing single-nucleotide polymorphism (SNP)-array analysis on the brothers and their healthy parents to identify homozygous regions and candidate genes. Subsequently, we performed whole-exome sequencing (WES) on the family. Using WES, we identified compound heterozygous variants c.[464G>C];[1226G>A] in NIMA (Never in Mitosis Gene A)-Related Kinase 1 (NEK1). The novel variant c.464G>C disturbs normal splicing in an essential region of the kinase domain. The nonsense variant c.1226G>A, p.(Trp409*), results in nonsense-associated alternative splicing, removing the first coiled-coil domain of NEK1. Candidate variants were confirmed with Sanger sequencing and alternative splicing assessed with cDNA analysis. Immunocytochemistry was used to assess cilia number and length. Patient-derived fibroblasts showed severely reduced ciliation compared with control fibroblasts (18.0 vs 48.9%, P<0.0001), but showed no significant difference in cilia length. In conclusion, we identified compound heterozygous deleterious variants in NEK1 in two brothers with Mohr syndrome. Ciliation in patient fibroblasts is drastically reduced, consistent with a ciliary defect pathogenesis. Our results establish NEK1 variants involved in the etiology of a subset of patients with OFD syndrome type II and support the consideration of including (routine) NEK1 analysis in patients suspected of OFD.
Tajamolian, Masoud; Kolahdouz, Parisa; Nikpour, Parvaneh; Forouzannia, Seyed Khalil; Sheikhha, Mohammad Hasan; Yazd, Ehsan Farashahi
2018-01-01
Background: Familial hypercholesterolemia (FH) is a disorder that is inherited by autosomal dominant pattern. The main cause of FH disease is the occurrence of mutations in low-density lipoprotein receptor (LDLR) gene sequence, as well as apolipoprotein B and proprotein convertase subtilisin/kexin type 9 genes, located in the next ranks, respectively. Materials and Methods: Forty-five unrelated Iranian patients with FH were screened using a high-resolution melting (HRM) method for exon 9 along with intron/exon boundaries of LDLR gene. Samples with shift in resultant HRM curves were compared to normal ones, sequenced, and analyzed. Results: Our findings revealed a missense mutation c. 1246C>T and a known variant IVS9-30C>T (rs1003723) that was recognized in 71% of the patients (22%: homozygous and 49%: heterozygous genotypes). In silico analysis, predicted the pathological effect of the c. 1246C>T mutation in LDLR protein structure, but IVS9-30C>T variant had no predicted effect on splice site and branch point function. Conclusion: FH is a hereditary type of hypercholesterolemia that leads to premature cardiovascular disease and atherosclerosis, and early diagnosis is needed. We detected a rare missense mutation (1246C>T) and a common single nucleotide polymorphism (SNP) in the Iranian population. These reports could help in the genetic diagnosis and counseling of FH patients. PMID:29531935
Borrego, Salud; Fernández, Raquel M; Dziema, Heather; Japón, Miguel A; Marcos, Irene; Eng, Charis; Antiñolo, Guillermo
2002-11-01
The etiology of sporadic medullary thyroid carcinoma (sMTC) remains elusive. While germline gain-of-function mutations in the RET proto-oncogene cause hereditary MTC, somatic RET mutations have been described in a variable number of sMTC. So far, S836S of RET, is the only variant whose association with sMTC has been found in several European cohorts. Because RET variants seem to be associated with MTC, it is plausible that variants in genes encoding for RET coreceptors may play a role in the pathogenesis of sMTC. Recently, we described two possible low penetrance susceptibility alleles in the gene encoding RET coreceptor GFRalpha1, -193C > G and 537T > C, in a German series of sMTC. In this study, we have genotyped nine polymorphisms within GFRA1-3 genes for 51 Spanish sMTC, and 100 normal controls. Our results show that no statistical signification was found when Spanish sMTC patients were compared to controls. Taken together with the observations in the German sMTC series, the present findings suggest that GFRA1-193C > G and 537T > C could be in linkage disequilibrium with other loci responsible for the disease with a founder effect in Germany. Alternatively, the combined observations might also suggest that, if indeed the polymorphisms are functional, the effect is small.
Single-nucleotide polymorphisms of TNFA and IL1 in allergic rhinitis.
Nasiri, R; Amirzargar, A Akbar; Movahedi, M; Hirbod-Mobarakeh, A; Farhadi, E; Behniafard, N; Tavakkol, M; Ansaripour, B; Moradi, B; Zare, A; Rezaei, N
2013-01-01
Allergic rhinitis is a complex polygenic disorder of the upper respiratory tract. Given that proinflammatory cytokines such as tumor necrosis factor (TNF) and interleukin (IL) 1 seem to play a role in the development of allergic rhinitis, we evaluated the associations between various single-nucleotide polymorphisms (SNPs) of the TNF and IL1 genes in a case-control study. The study population comprised 98 patients with allergic rhinitis. Genotyping was performed using polymerase chain reaction with sequence-specific primers for 2 TNFA promoter variants (rs1800629 and rs361525), 1 variant in the promoter region of IL1A (rs1800587), 2 SNPs in the IL1B gene (rs16944 and rs1 143634), 1 variant in the IL1 receptor (rs2234650), and 1 in IL1RA (rs315952). Patients who were homozygous for the T allele of rs16944 in IL1B had an 8.1-fold greater risk of allergic rhinitis than those with the C allele. In TNFA, a significant relationship was also detected between rs1800629 and rs361525 and allergic rhinitis. Except for rs1800587 in IL1A and rs315952 in IL1RA, significant differences were found between the patient and control groups for all other SNPs. We found that allelic variants in the TNFA and IL1 genes were not only associated with the risk of developing allergic rhinitis, but also affected disease course and severity.
Gok, Ilhami; Huseyinoglu, Nergiz; Ilhan, Dogan
2015-08-01
To investigate the relationship of IL-1β and IL-6 cytokine gene polymorphisms with obstructive sleep apnea syndrome (OSAS) in 61 patients admitted to the neurology clinic in Kafkas University Hospital with insomnia problem who were diagnosed with OSAS in sleeping labs, and 80 healthy subjects not associated with the syndrome. METHODS :Blood samples were taken to isolate DNA from patients diagnosed with OSAS based on polysomnography results and healthy controls. DNA amplification of the genes was performed with PCR. Amplification products were cut with the restriction enzymes in order to determine IL-1 gene (TaqI) and IL-6 gene (Lwel) polymorphisms. The cut DNA fragments were carried out in agarose gel electrophoresis, and RFLP analysis was performed by utilizing the images with gel imaging system. PCR products were sequenced with an Applied Biosystems Automated Sequencer. Polymorphic changes were observed for IL-1β gene in 26 of 62 patients (41.9%), and 16 of the 80 (25.8%) in the control group. The incidence of polymorphic changes in IL-6 gene was in seen in seven (of the 62 patients) (11.3%), and in the 16 (20%) controls. The findings on the genomic level in OSAS may provide an important contribution to diagnosis of obstructive sleep apnea syndrome in clinical practice, as well as it helps to obtain the results easily about environmental and genetic interaction of OSAS patients. Copyright© by the Medical Assotiation of Zenica-Doboj Canton.
Reed, Kent M.; Dorschner, Michael O.; Todd, Thomas N.; Phillips, Ruth B.
1998-01-01
Sequence variation in the control region (D-loop) of the mitochondrial DNA (mtDNA) was examined to assess the genetic distinctiveness of the shortjaw cisco (Coregonus zenithicus). Individuals from within the Great Lakes Basin as well as inland lakes outside the basin were sampled. DNA fragments containing the entire D-loop were amplified by PCR from specimens ofC. zenithicus and the related species C. artedi, C. hoyi, C. kiyi, and C. clupeaformis. DNA sequence analysis revealed high similarity within and among species and shared polymorphism for length variants. Based on this analysis, the shortjaw cisco is not genetically distinct from other cisco species.
HBS1L-MYB intergenic variants modulate fetal hemoglobin via long-range MYB enhancers
Stadhouders, Ralph; Aktuna, Suleyman; Thongjuea, Supat; Aghajanirefah, Ali; Pourfarzad, Farzin; van IJcken, Wilfred; Lenhard, Boris; Rooks, Helen; Best, Steve; Menzel, Stephan; Grosveld, Frank; Thein, Swee Lay; Soler, Eric
2014-01-01
Genetic studies have identified common variants within the intergenic region (HBS1L-MYB) between GTP-binding elongation factor HBS1L and myeloblastosis oncogene MYB on chromosome 6q that are associated with elevated fetal hemoglobin (HbF) levels and alterations of other clinically important human erythroid traits. It is unclear how these noncoding sequence variants affect multiple erythrocyte characteristics. Here, we determined that several HBS1L-MYB intergenic variants affect regulatory elements that are occupied by key erythroid transcription factors within this region. These elements interact with MYB, a critical regulator of erythroid development and HbF levels. We found that several HBS1L-MYB intergenic variants reduce transcription factor binding, affecting long-range interactions with MYB and MYB expression levels. These data provide a functional explanation for the genetic association of HBS1L-MYB intergenic polymorphisms with human erythroid traits and HbF levels. Our results further designate MYB as a target for therapeutic induction of HbF to ameliorate sickle cell and β-thalassemia disease severity. PMID:24614105
Newsum, Astrid M; Ho, Cynthia K Y; Lieveld, Faydra I; van de Laar, Thijs J W; Koekkoek, Sylvie M; Rebers, Sjoerd P; van der Meer, Jan T M; Wensing, Anne M J; Boland, Greet J; Arends, Joop E; van Erpecum, Karel J; Prins, Maria; Molenkamp, Richard; Schinkel, Janke
2017-01-02
The Q80K polymorphism is a naturally occurring resistance-associated variant in the hepatitis C virus (HCV) nonstructural protein 3 (NS3) region and is likely transmissible between hosts. This study describes the Q80K origin and prevalence among HCV risk groups in the Netherlands and examines whether Q80K is linked to specific transmission networks. Stored blood samples from HCV genotype 1a-infected patients were used for PCR and sequencing to reconstruct the NS3 maximum likelihood phylogeny. The most recent common ancestor was estimated with a coalescent-based model within a Bayesian statistical framework. Study participants (n = 150) were either MSM (39%), people who inject drugs (17%), or patients with other (15%) or unknown/unreported (29%) risk behavior. Overall 45% was coinfected with HIV. Q80K was present in 36% (95% confidence interval 28-44%) of patients throughout the sample collection period (2000-2015) and was most prevalent in MSM (52%, 95% confidence interval 38-65%). Five MSM-specific transmission clusters were identified, of which three exclusively contained sequences with Q80K. The HCV-1a most recent common ancestor in the Netherlands was estimated in 1914 (95% higher posterior density 1879-1944) and Q80K originated in 1957 (95% higher posterior density 1942-1970) within HCV-1a clade I. All Q80K lineages could be traced back to this single origin. Q80K is a highly stable and transmissible resistance-associated variant and was present in a large part of Dutch HIV-coinfected MSM. The introduction and expansion of Q80K variants in this key population suggest a founder effect, potentially jeopardizing future treatment with simeprevir.
P450 oxidoreductase deficiency: a disorder of steroidogenesis with multiple clinical manifestations.
Miller, Walter L
2012-10-23
Cytochrome P450 enzymes catalyze the biosynthesis of steroid hormones and metabolize drugs. There are seven human type I P450 enzymes in mitochondria and 50 type II enzymes in endoplasmic reticulum. Type II enzymes, including both drug-metabolizing and some steroidogenic enzymes, require electron donation from a two-flavin protein, P450 oxidoreductase (POR). Although knockout of the POR gene causes embryonic lethality in mice, we discovered human POR deficiency as a disorder of steroidogenesis associated with the Antley-Bixler skeletal malformation syndrome and found mild POR mutations in phenotypically normal adults with infertility. Assay results of mutant forms of POR using the traditional but nonphysiologic assay (reduction of cytochrome c) did not correlate with patient phenotypes; assays based on the 17,20 lyase activity of P450c17 (CYP17) correlated with clinical phenotypes. The POR sequence in 842 normal individuals revealed many polymorphisms; amino acid sequence variant A503V is encoded by ~28% of human alleles. POR A503V has about 60% of wild-type activity in assays with CYP17, CYP2D6, and CYP3A4, but nearly wild-type activity with P450c21, CYP1A2, and CYP2C19. Activity of a particular POR variant with one P450 enzyme will not predict its activity with another P450 enzyme: Each POR-P450 combination must be studied individually. Human POR transcription, initiated from an untranslated exon, is regulated by Smad3/4, thyroid receptors, and the transcription factor AP-2. A promoter polymorphism reduces transcription to 60% in liver cells and to 35% in adrenal cells. POR deficiency is a newly described disorder of steroidogenesis, and POR variants may account for some genetic variation in drug metabolism.
Permuth-Wey, Jennifer; Lawrenson, Kate; Shen, Howard C.; Velkova, Aneliya; Tyrer, Jonathan P.; Chen, Zhihua; Lin, Hui-Yi; Chen, Y. Ann; Tsai, Ya-Yu; Qu, Xiaotao; Ramus, Susan J.; Karevan, Rod; Lee, Janet; Lee, Nathan; Larson, Melissa C.; Aben, Katja K.; Anton-Culver, Hoda; Antonenkova, Natalia; Antoniou, Antonis; Armasu, Sebastian M.; Bacot, François; Baglietto, Laura; Bandera, Elisa V.; Barnholtz-Sloan, Jill; Beckmann, Matthias W.; Birrer, Michael J.; Bloom, Greg; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Brown, Robert; Butzow, Ralf; Cai, Qiuyin; Campbell, Ian; Chang-Claude, Jenny; Chanock, Stephen; Chenevix-Trench, Georgia; Cheng, Jin Q.; Cicek, Mine S.; Coetzee, Gerhard A.; Cook, Linda S.; Couch, Fergus J.; Cramer, Daniel W.; Cunningham, Julie M.; Dansonka-Mieszkowska, Agnieszka; Despierre, Evelyn; Doherty, Jennifer A; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Easton, Douglas F; Eccles, Diana; Edwards, Robert; Ekici, Arif B.; Fasching, Peter A.; Fenstermacher, David A.; Flanagan, James M.; Garcia-Closas, Montserrat; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind M.; Gonzalez-Bosquet, Jesus; Goodman, Marc T.; Gore, Martin; Górski, Bohdan; Gronwald, Jacek; Hall, Per; Halle, Mari K.; Harter, Philipp; Heitz, Florian; Hillemanns, Peter; Hoatlin, Maureen; Høgdall, Claus K.; Høgdall, Estrid; Hosono, Satoyo; Jakubowska, Anna; Jensen, Allan; Jim, Heather; Kalli, Kimberly R.; Karlan, Beth Y.; Kaye, Stanley B.; Kelemen, Linda E.; Kiemeney, Lambertus A.; Kikkawa, Fumitaka; Konecny, Gottfried E.; Krakstad, Camilla; Kjaer, Susanne Krüger; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Lancaster, Johnathan M.; Le, Nhu D.; Leminen, Arto; Levine, Douglas A.; Liang, Dong; Lim, Boon Kiong; Lin, Jie; Lissowska, Jolanta; Lu, Karen H.; Lubiński, Jan; Lurie, Galina; Massuger, Leon F.A.G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B.; Nakanishi, Toru; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Nevanlinna, Heli; Nickels, Stefan; Noushmehr, Houtan; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Paul, James; Pearce, Celeste L; Pejovic, Tanja; Pelttari, Liisa M.; Pike, Malcolm C.; Poole, Elizabeth M.; Raska, Paola; Renner, Stefan P.; Risch, Harvey A.; Rodriguez-Rodriguez, Lorna; Rossing, Mary Anne; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schwaab, Ira; Severi, Gianluca; Shridhar, Vijayalakshmi; Shu, Xiao-Ou; Shvetsov, Yurii B.; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Spiewankiewicz, Beata; Stram, Daniel; Sutphen, Rebecca; Teo, Soo-Hwang; Terry, Kathryn L.; Tessier, Daniel C.; Thompson, Pamela J.; Tworoger, Shelley S.; van Altena, Anne M.; Vergote, Ignace; Vierkant, Robert A.; Vincent, Daniel; Vitonis, Allison F.; Wang-Gohrke, Shan; Weber, Rachel Palmieri; Wentzensen, Nicolas; Whittemore, Alice S.; Wik, Elisabeth; Wilkens, Lynne R.; Winterhoff, Boris; Woo, Yin Ling; Wu, Anna H.; Xiang, Yong-Bing; Yang, Hannah P.; Zheng, Wei; Ziogas, Argyrios; Zulkifli, Famida; Phelan, Catherine M.; Iversen, Edwin; Schildkraut, Joellen M.; Berchuck, Andrew; Fridley, Brooke L.; Goode, Ellen L.; Pharoah, Paul D. P.; Monteiro, Alvaro N.A.; Sellers, Thomas A.; Gayther, Simon A.
2013-01-01
Epithelial ovarian cancer (EOC) has a heritable component that remains to be fully characterized. Most identified common susceptibility variants lie in non-protein-coding sequences. We hypothesized that variants in the 3′ untranslated region at putative microRNA (miRNA) binding sites represent functional targets that influence EOC susceptibility. Here, we evaluate the association between 767 miRNA binding site single nucleotide polymorphisms (miRSNPs) and EOC risk in 18,174 EOC cases and 26,134 controls from 43 studies genotyped through the Collaborative Oncological Gene-environment Study. We identify several miRSNPs associated with invasive serous EOC risk (OR=1.12, P=10−8) mapping to an inversion polymorphism at 17q21.31. Additional genotyping of non-miRSNPs at 17q21.31 reveals stronger signals outside the inversion (P=10−10). Variation at 17q21.31 associates with neurological diseases, and our collaboration is the first to report an association with EOC susceptibility. An integrated molecular analysis in this region provides evidence for ARHGAP27 and PLEKHM1 as candidate EOC susceptibility genes. PMID:23535648
Forouzanfar, Narjes; Baranova, Ancha; Milanizadeh, Saman; Heravi-Moussavi, Alireza; Jebelli, Amir; Abbaszadegan, Mohammad Reza
2017-05-01
Esophageal squamous cell carcinoma is one of the deadliest of all the cancers. Its metastatic properties portend poor prognosis and high rate of recurrence. A more advanced method to identify new molecular biomarkers predicting disease prognosis can be whole exome sequencing. Here, we report the most effective genetic variants of the Notch signaling pathway in esophageal squamous cell carcinoma susceptibility by whole exome sequencing. We analyzed nine probands in unrelated familial esophageal squamous cell carcinoma pedigrees to identify candidate genes. Genomic DNA was extracted and whole exome sequencing performed to generate information about genetic variants in the coding regions. Bioinformatics software applications were utilized to exploit statistical algorithms to demonstrate protein structure and variants conservation. Polymorphic regions were excluded by false-positive investigations. Gene-gene interactions were analyzed for Notch signaling pathway candidates. We identified novel and damaging variants of the Notch signaling pathway through extensive pathway-oriented filtering and functional predictions, which led to the study of 27 candidate novel mutations in all nine patients. Detection of the trinucleotide repeat containing 6B gene mutation (a slice site alteration) in five of the nine probands, but not in any of the healthy samples, suggested that it may be a susceptibility factor for familial esophageal squamous cell carcinoma. Noticeably, 8 of 27 novel candidate gene mutations (e.g. epidermal growth factor, signal transducer and activator of transcription 3, MET) act in a cascade leading to cell survival and proliferation. Our results suggest that the trinucleotide repeat containing 6B mutation may be a candidate predisposing gene in esophageal squamous cell carcinoma. In addition, some of the Notch signaling pathway genetic mutations may act as key contributors to esophageal squamous cell carcinoma.
Zhang, Yiwen; Cao, Man; Wang, Mengting; Ding, Xianping; Jing, Yaling; Chen, Zuyi; Ma, Tengjiao; Chen, Honghan
2016-07-01
Human papillomavirus (HPV) is the major causative agent of cervical cancer, which accounts for the second highest cancer burden in women worldwide. HPV-52, the prevalent subtype in Asia, especially in southwest China, was analyzed in this study. To analyze polymorphisms, intratypic variants, and genetic variability in the E6-E7 (n=26) and L1 (n=53) genes of HPV-52, these genes were sequenced and the sequences were submitted to GenBank. Phylogenetic trees were constructed using the neighbor-joining and Kimura 2-parameters methods, followed by analysis of the diversity of secondary structure. Finally, we estimated the selection pressures acting on the E6-E7 and L1 genes. Fifty-one novel variants of HPV-52 L1, and two novel variants of HPV-52 E6-E7 were identified in this study. Thirty single nucleotide changes were observed in HPV-52 E6-E7 sequences with 19/30 non-synonymous mutations and 11/30 synonymous mutations (five in the alpha helix and five in the beta sheet). Fifty-five single nucleotide changes were observed in HPV-52 L1 sequences with 17/55 non-synonymous mutations (seven in the alpha helix and fourteen in the beta sheet) and 38/55 synonymous mutations. Selective pressure analysis predicted that most of these mutations reflect positive selection. Identifying new variants in HPV-52 may inform the rational design of new vaccines specifically for women in southwest China. Knowledge of genetic variation in HPV may be useful as an epidemiologic correlate of cervical cancer risk, or may even provide critical information for developing diagnostic probes. Copyright © 2016 Elsevier B.V. All rights reserved.
Lescat, Mathilde; Hoede, Claire; Clermont, Olivier; Garry, Louis; Darlu, Pierre; Tuffery, Pierre; Denamur, Erick; Picard, Bertrand
2009-12-29
Previous studies have established a correlation between electrophoretic polymorphism of esterase B, and virulence and phylogeny of Escherichia coli. Strains belonging to the phylogenetic group B2 are more frequently implicated in extraintestinal infections and include esterase B2 variants, whereas phylogenetic groups A, B1 and D contain less virulent strains and include esterase B1 variants. We investigated esterase B as a marker of phylogeny and/or virulence, in a thorough analysis of the esterase B-encoding gene. We identified the gene encoding esterase B as the acetyl-esterase gene (aes) using gene disruption. The analysis of aes nucleotide sequences in a panel of 78 reference strains, including the E. coli reference (ECOR) strains, demonstrated that the gene is under purifying selection. The phylogenetic tree reconstructed from aes sequences showed a strong correlation with the species phylogenetic history, based on multi-locus sequence typing using six housekeeping genes. The unambiguous distinction between variants B1 and B2 by electrophoresis was consistent with Aes amino-acid sequence analysis and protein modelling, which showed that substituted amino acids in the two esterase B variants occurred mostly at different sites on the protein surface. Studies in an experimental mouse model of septicaemia using mutant strains did not reveal a direct link between aes and extraintestinal virulence. Moreover, we did not find any genes in the chromosomal region of aes to be associated with virulence. Our findings suggest that aes does not play a direct role in the virulence of E. coli extraintestinal infection. However, this gene acts as a powerful marker of phylogeny, illustrating the extensive divergence of B2 phylogenetic group strains from the rest of the species.
Finno, C.J.; Famula, T.; Aleman, M.; Higgins, R.J.; Madigan, J.E.; Bannasch, D.L.
2015-01-01
Background Equine neuroaxonal dystrophy/equine degenerative myeloencephalopathy (NAD/EDM) is a neurodegenerative disorder affecting young horses of various breeds that resembles ataxia with vitamin E deficiency in humans, an inherited disorder caused by mutations in the alpha-tocopherol transfer protein gene (TTPA). To evaluate variants found upon sequencing TTPA in the horse, the mode of inheritance for NAD/EDM had to be established. Hypothesis NAD/EDM in the American Quarter Horse (QH) is caused by a mutation in TTPA. Animals 88 clinically phenotyped (35 affected [ataxia score ≥2], 53 unaffected) QHs with a diagnosis of NAD/EDM with 6 affected and 4 unaffected cases confirmed at postmortem examination. Procedures Pedigrees and genotypes across 54,000 single nucleotide polymorphism (SNP) markers were assessed to determine heritability and mode of inheritance of NAD/EDM. TTPA sequence of exon/intron boundaries was evaluated in 2 affected and 2 control horses. An association analysis was performed by 71 SNPs surrounding TTPA and 8 SNPs within TTPA that were discovered by sequencing. RT-PCR for TTPA was performed on mRNA from the liver of 4 affected and 4 control horses. Results Equine NAD/EDM appears to be inherited as a polygenic trait and, within this family of QHs, demonstrates high heritability. Sequencing of TTPA identified 12 variants. No significant association was found using the 79 available variants in and surrounding TTPA. RT-PCR yielded PCR products of equivalent sizes between affected cases and controls. Conclusions and Clinical Importance NAD/EDM demonstrates heritability in this family of QHs. Variants in TTPA are not responsible for NAD/EDM in this study population. PMID:23186252
Qualtieri, Antonio; Le, Pera Maria; Pedace, Vera; Magariello, Angela; Brancati, Carlo
2002-02-01
We have identified a new neutral hemoglobin variant in a pregnant Italian woman, that resulted from a GTG-->CTG replacement at codon 126 of the beta chain, corresponding to a Val-->Leu amino acid change at position beta126(H4). Thermal and isopropanol stability tests were normal and there were no abnormal clinical features. Routine electrophoretic and ion exchange chromatographic methods for hemoglobin separation failed to show this variant, but reversed phase high performance liquid chromatography revealed an abnormal peak eluting near the normal beta chain. No abnormal tryptic peptide was revealed on the high performance liquid chromatographic elution pattern of the total globin digest. The mutation was determined at the DNA level by amplification of the three beta exons by polymerase chain reaction and direct sequencing of one exon that showed an abnormal migration on single strand conformational polymorphism analysis.
Coval: Improving Alignment Quality and Variant Calling Accuracy for Next-Generation Sequencing Data
Kosugi, Shunichi; Natsume, Satoshi; Yoshida, Kentaro; MacLean, Daniel; Cano, Liliana; Kamoun, Sophien; Terauchi, Ryohei
2013-01-01
Accurate identification of DNA polymorphisms using next-generation sequencing technology is challenging because of a high rate of sequencing error and incorrect mapping of reads to reference genomes. Currently available short read aligners and DNA variant callers suffer from these problems. We developed the Coval software to improve the quality of short read alignments. Coval is designed to minimize the incidence of spurious alignment of short reads, by filtering mismatched reads that remained in alignments after local realignment and error correction of mismatched reads. The error correction is executed based on the base quality and allele frequency at the non-reference positions for an individual or pooled sample. We demonstrated the utility of Coval by applying it to simulated genomes and experimentally obtained short-read data of rice, nematode, and mouse. Moreover, we found an unexpectedly large number of incorrectly mapped reads in ‘targeted’ alignments, where the whole genome sequencing reads had been aligned to a local genomic segment, and showed that Coval effectively eliminated such spurious alignments. We conclude that Coval significantly improves the quality of short-read sequence alignments, thereby increasing the calling accuracy of currently available tools for SNP and indel identification. Coval is available at http://sourceforge.net/projects/coval105/. PMID:24116042
Widespread Site-Dependent Buffering of Human Regulatory Polymorphism
Kutyavin, Tanya; Stamatoyannopoulos, John A.
2012-01-01
The average individual is expected to harbor thousands of variants within non-coding genomic regions involved in gene regulation. However, it is currently not possible to interpret reliably the functional consequences of genetic variation within any given transcription factor recognition sequence. To address this, we comprehensively analyzed heritable genome-wide binding patterns of a major sequence-specific regulator (CTCF) in relation to genetic variability in binding site sequences across a multi-generational pedigree. We localized and quantified CTCF occupancy by ChIP-seq in 12 related and unrelated individuals spanning three generations, followed by comprehensive targeted resequencing of the entire CTCF–binding landscape across all individuals. We identified hundreds of variants with reproducible quantitative effects on CTCF occupancy (both positive and negative). While these effects paralleled protein–DNA recognition energetics when averaged, they were extensively buffered by striking local context dependencies. In the significant majority of cases buffering was complete, resulting in silent variants spanning every position within the DNA recognition interface irrespective of level of binding energy or evolutionary constraint. The prevalence of complex partial or complete buffering effects severely constrained the ability to predict reliably the impact of variation within any given binding site instance. Surprisingly, 40% of variants that increased CTCF occupancy occurred at positions of human–chimp divergence, challenging the expectation that the vast majority of functional regulatory variants should be deleterious. Our results suggest that, even in the presence of “perfect” genetic information afforded by resequencing and parallel studies in multiple related individuals, genomic site-specific prediction of the consequences of individual variation in regulatory DNA will require systematic coupling with empirical functional genomic measurements. PMID:22457641
Dodds, Peter N.; Lawrence, Gregory J.; Catanzariti, Ann-Maree; Teh, Trazel; Wang, Ching-I. A.; Ayliffe, Michael A.; Kobe, Bostjan; Ellis, Jeffrey G.
2006-01-01
Plant resistance proteins (R proteins) recognize corresponding pathogen avirulence (Avr) proteins either indirectly through detection of changes in their host protein targets or through direct R–Avr protein interaction. Although indirect recognition imposes selection against Avr effector function, pathogen effector molecules recognized through direct interaction may overcome resistance through sequence diversification rather than loss of function. Here we show that the flax rust fungus AvrL567 genes, whose products are recognized by the L5, L6, and L7 R proteins of flax, are highly diverse, with 12 sequence variants identified from six rust strains. Seven AvrL567 variants derived from Avr alleles induce necrotic responses when expressed in flax plants containing corresponding resistance genes (R genes), whereas five variants from avr alleles do not. Differences in recognition specificity between AvrL567 variants and evidence for diversifying selection acting on these genes suggest they have been involved in a gene-specific arms race with the corresponding flax R genes. Yeast two-hybrid assays indicate that recognition is based on direct R–Avr protein interaction and recapitulate the interaction specificity observed in planta. Biochemical analysis of Escherichia coli-produced AvrL567 proteins shows that variants that escape recognition nevertheless maintain a conserved structure and stability, suggesting that the amino acid sequence differences directly affect the R–Avr protein interaction. We suggest that direct recognition associated with high genetic diversity at corresponding R and Avr gene loci represents an alternative outcome of plant–pathogen coevolution to indirect recognition associated with simple balanced polymorphisms for functional and nonfunctional R and Avr genes. PMID:16731621
Clinical evaluation incorporating a personal genome
Ashley, Euan A.; Butte, Atul J.; Wheeler, Matthew T.; Chen, Rong; Klein, Teri E.; Dewey, Frederick E.; Dudley, Joel T.; Ormond, Kelly E.; Pavlovic, Aleksandra; Hudgins, Louanne; Gong, Li; Hodges, Laura M.; Berlin, Dorit S.; Thorn, Caroline F.; Sangkuhl, Katrin; Hebert, Joan M.; Woon, Mark; Sagreiya, Hersh; Whaley, Ryan; Morgan, Alexander A.; Pushkarev, Dmitry; Neff, Norma F; Knowles, Joshua W.; Chou, Mike; Thakuria, Joseph; Rosenbaum, Abraham; Zaranek, Alexander Wait; Church, George; Greely, Henry T.; Quake, Stephen R.; Altman, Russ B.
2010-01-01
Background The cost of genomic information has fallen steeply but the path to clinical translation of risk estimates for common variants found in genome wide association studies remains unclear. Since the speed and cost of sequencing complete genomes is rapidly declining, more comprehensive means of analyzing these data in concert with rare variants for genetic risk assessment and individualisation of therapy are required. Here, we present the first integrated analysis of a complete human genome in a clinical context. Methods An individual with a family history of vascular disease and early sudden death was evaluated. Clinical assessment included risk prediction for coronary artery disease, screening for causes of sudden cardiac death, and genetic counselling. Genetic analysis included the development of novel methods for the integration of whole genome sequence data including 2.6 million single nucleotide polymorphisms and 752 copy number variations. The algorithm focused on predicting genetic risk of genes associated with known Mendelian disease, recognised drug responses, and pathogenicity for novel variants. In addition, since integration of risk ratios derived from case control studies is challenging, we estimated posterior probabilities from age and sex appropriate prior probability and likelihood ratios derived for each genotype. In addition, we developed a visualisation approach to account for gene-environment interactions and conditionally dependent risks. Findings We found increased genetic risk for myocardial infarction, type II diabetes and certain cancers. Rare variants in LPA are consistent with the family history of coronary artery disease. Pharmacogenomic analysis suggested a positive response to lipid lowering therapy, likely clopidogrel resistance, and a low initial dosing requirement for warfarin. Many variants of uncertain significance were reported. Interpretation Although challenges remain, our results suggest that whole genome sequencing can yield useful and clinically relevant information for individual patients, especially for those with a strong family history of significant disease. PMID:20435227
Fong, Wai-Ying; Ho, Chi-Chun; Poon, Wing-Tat
2017-05-12
Thiopurine intolerance and treatment-related toxicity, such as fatal myelosuppression, is related to non-function genetic variants encoding thiopurine S-methyltransferase (TPMT) and Nudix hydrolase 15 (NUDT15). Genetic testing of the common variants NUDT15:NM_018283.2:c.415C>T (Arg139Cys, dbSNP rs116855232 T allele) and TPMT: NM_000367.4:c.719A>G (TPMT*3C, dbSNP rs1142345 G allele) in East Asians including Chinese can potentially prevent treatment-related complications. Two complementary genotyping approaches, real-time PCR-high resolution melt (PCR-HRM) and PCR-restriction fragment length morphism (PCR-RFLP) analysis were evaluated using conventional PCR and Sanger sequencing genotyping as the gold standard. Sixty patient samples were tested, revealing seven patients (11.7%) heterozygous for NUDT15 c.415C>T, one patient homozygous for the variant and one patient heterozygous for the TPMT*3C non-function allele. No patient was found to harbor both variants. In total, nine out of 60 (15%) patients tested had genotypic evidence of thiopurine intolerance, which may require dosage adjustment or alternative medication should they be started on azathioprine, mercaptopurine or thioguanine. The two newly developed assays were more efficient and showed complete concordance (60/60, 100%) compared to the Sanger sequencing results. Accurate and cost-effective genotyping assays by real-time PCR-HRM and PCR-RFLP for NUDT15 c.415C>T and TPMT*3C were successfully developed. Further studies may establish their roles in genotype-informed clinical decision-making in the prevention of morbidity and mortality due to thiopurine intolerance.
Gao, Li; Bin, Lianghua; Rafaels, Nicholas M; Huang, Lili; Potee, Joseph; Ruczinski, Ingo; Beaty, Terri H; Paller, Amy S; Schneider, Lynda C; Gallo, Rich; Hanifin, Jon M; Beck, Lisa A; Geha, Raif S; Mathias, Rasika A; Barnes, Kathleen C; Leung, Donald Y M
2015-12-01
A subset of atopic dermatitis is associated with increased susceptibility to eczema herpeticum (ADEH+). We previously reported that common single nucleotide polymorphisms (SNPs) in the IFN-γ (IFNG) and IFN-γ receptor 1 (IFNGR1) genes were associated with the ADEH+ phenotype. We sought to interrogate the role of rare variants in interferon pathway genes for the risk of ADEH+. We performed targeted sequencing of interferon pathway genes (IFNG, IFNGR1, IFNAR1, and IL12RB1) in 228 European American patients with AD selected according to their eczema herpeticum status, and severity was measured by using the Eczema Area and Severity Index. Replication genotyping was performed in independent samples of 219 European American and 333 African American subjects. Functional investigation of loss-of-function variants was conducted by using site-directed mutagenesis. We identified 494 single nucleotide variants encompassing 105 kb of sequence, including 145 common, 349 (70.6%) rare (minor allele frequency <5%), and 86 (17.4%) novel variants, of which 2.8% were coding synonymous, 93.3% were noncoding (64.6% intronic), and 3.8% were missense. We identified 6 rare IFNGR1 missense variants, including 3 damaging variants (Val14Met [V14M], Val61Ile, and Tyr397Cys [Y397C]) conferring a higher risk for ADEH+ (P = .031). Variants V14M and Y397C were confirmed to be deleterious, leading to partial IFNGR1 deficiency. Seven common IFNGR1 SNPs, along with common protective haplotypes (2-7 SNPs), conferred a reduced risk of ADEH+ (P = .015-.002 and P = .0015-.0004, respectively), and both SNP and haplotype associations were replicated in an independent African American sample (P = .004-.0001 and P = .001-.0001, respectively). Our results provide evidence that both genetic variants in the gene encoding IFNGR1 are implicated in susceptibility to the ADEH+ phenotype. Copyright © 2015 American Academy of Allergy, Asthma & Immunology. Published by Elsevier Inc. All rights reserved.
Xu, Shuhua
2015-01-01
Noncoding DNA sequences (NCS) have attracted much attention recently due to their functional potentials. Here we attempted to reveal the functional roles of noncoding sequences from the point of view of natural selection that typically indicates the functional potentials of certain genomic elements. We analyzed nearly 37 million single nucleotide polymorphisms (SNPs) of Phase I data of the 1000 Genomes Project. We estimated a series of key parameters of population genetics and molecular evolution to characterize sequence variations of the noncoding genome within and between populations, and identified the natural selection footprints in NCS in worldwide human populations. Our results showed that purifying selection is prevalent and there is substantial constraint of variations in NCS, while positive selectionis more likely to be specific to some particular genomic regions and regional populations. Intriguingly, we observed larger fraction of non-conserved NCS variants with lower derived allele frequency in the genome, indicating possible functional gain of non-conserved NCS. Notably, NCS elements are enriched for potentially functional markers such as eQTLs, TF motif, and DNase I footprints in the genome. More interestingly, some NCS variants associated with diseases such as Alzheimer's disease, Type 1 diabetes, and immune-related bowel disorder (IBD) showed signatures of positive selection, although the majority of NCS variants, reported as risk alleles by genome-wide association studies, showed signatures of negative selection. Our analyses provided compelling evidence of natural selection forces on noncoding sequences in the human genome and advanced our understanding of their functional potentials that play important roles in disease etiology and human evolution. PMID:26053627
Ultrasensitive Genotypic Detection of Antiviral Resistance in Hepatitis B Virus Clinical Isolates▿ †
Fang, Jie; Wichroski, Michael J.; Levine, Steven M.; Baldick, Carl J.; Mazzucco, Charles E.; Walsh, Ann W.; Kienzle, Bernadette K.; Rose, Ronald E.; Pokornowski, Kevin A.; Colonno, Richard J.; Tenney, Daniel J.
2009-01-01
Amino acid substitutions that confer reduced susceptibility to antivirals arise spontaneously through error-prone viral polymerases and are selected as a result of antiviral therapy. Resistance substitutions first emerge in a fraction of the circulating virus population, below the limit of detection by nucleotide sequencing of either the population or limited sets of cloned isolates. These variants can expand under drug pressure to dominate the circulating virus population. To enhance detection of these viruses in clinical samples, we established a highly sensitive quantitative, real-time allele-specific PCR assay for hepatitis B virus (HBV) DNA. Sensitivity was accomplished using a high-fidelity DNA polymerase and oligonucleotide primers containing locked nucleic acid bases. Quantitative measurement of resistant and wild-type variants was accomplished using sequence-matched standards. Detection methodology that was not reliant on hybridization probes, and assay modifications, minimized the effect of patient-specific sequence polymorphisms. The method was validated using samples from patients chronically infected with HBV through parallel sequencing of large numbers of cloned isolates. Viruses with resistance to lamivudine and other l-nucleoside analogs and entecavir, involving 17 different nucleotide substitutions, were reliably detected at levels at or below 0.1% of the total population. The method worked across HBV genotypes. Longitudinal analysis of patient samples showed earlier emergence of resistance on therapy than was seen with sequencing methodologies, including some cases of resistance that existed prior to treatment. In summary, we established and validated an ultrasensitive method for measuring resistant HBV variants in clinical specimens, which enabled earlier, quantitative measurement of resistance to therapy. PMID:19433559
Schaschl, Helmut; Huber, Susanne; Schaefer, Katrin; Windhager, Sonja; Wallner, Bernard; Fieder, Martin
2015-05-13
The evolutionary highly conserved neurohypophyseal hormones oxytocin and arginine vasopressin play key roles in regulating social cognition and behaviours. The effects of these two peptides are meditated by their specific receptors, which are encoded by the oxytocin receptor (OXTR) and arginine vasopressin receptor 1a genes (AVPR1A), respectively. In several species, polymorphisms in these genes have been linked to various behavioural traits. Little, however, is known about whether positive selection acts on sequence variants in genes influencing variation in human behaviours. We identified, in both neuroreceptor genes, signatures of balancing selection in the cis-regulative acting sequences such as transcription factor binding and enhancer sequences, as well as in a transcriptional repressor sequence motif. Additionally, in the intron 3 of the OXTR gene, the SNP rs59190448 appears to be under positive directional selection. For rs59190448, only one phenotypical association is known so far, but it is in high LD' (>0.8) with loci of known association; i.e., variants associated with key pro-social behaviours and mental disorders in humans. Only for one SNP on the OXTR gene (rs59190448) was a sign of positive directional selection detected with all three methods of selection detection. For rs59190448, however, only one phenotypical association is known, but rs59190448 is in high LD' (>0.8), with variants associated with important pro-social behaviours and mental disorders in humans. We also detected various signatures of balancing selection on both neuroreceptor genes.
Azevedo, Ana P; Silva, Susana N; De Lima, João P; Reichert, Alice; Lima, Fernando; Júnior, Esmeraldina; Rueff, José
2017-06-01
The role of base excision repair (BER) genes in Philadelphia-negative (PN)-myeloproliferative neoplasms (MPNs) susceptibility was evaluated by genotyping eight polymorphisms [apurinic/apyrimidinic endodeoxyribonuclease 1, mutY DNA glycosylase, earlier mutY homolog ( E. coli ) (MUTYH), 8-oxoguanine DNA glycosylase 1, poly (ADP-ribose) polymerase (PARP) 1, PARP4 and X-ray repair cross-complementing 1 (XRCC1)] in a case-control study involving 133 Caucasian Portuguese patients. The results did not reveal a correlation between individual BER polymorphisms and PN-MPNs when considered as a whole. However, stratification for essential thrombocythaemia revealed i) borderline effect/tendency to increased risk when carrying at least one variant allele for XRCC1_399 single-nucleotide polymorphism (SNP); ii) decreased risk for Janus kinase 2-positive patients carrying at least one variant allele for XRCC1_399 SNP; and iii) decreased risk in females carrying at least one variant allele for MUTYH SNP. Combination of alleles demonstrated an increased risk to PN-MPNs for one specific haplogroup. These findings may provide evidence for gene variants in susceptibility to MPNs. Indeed, common variants in DNA repair genes may hamper the capacity to repair DNA, thus increasing cancer susceptibility.
Azevedo, Ana P.; Silva, Susana N.; De Lima, João P.; Reichert, Alice; Lima, Fernando; Júnior, Esmeraldina; Rueff, José
2017-01-01
The role of base excision repair (BER) genes in Philadelphia-negative (PN)-myeloproliferative neoplasms (MPNs) susceptibility was evaluated by genotyping eight polymorphisms [apurinic/apyrimidinic endodeoxyribonuclease 1, mutY DNA glycosylase, earlier mutY homolog (E. coli) (MUTYH), 8-oxoguanine DNA glycosylase 1, poly (ADP-ribose) polymerase (PARP) 1, PARP4 and X-ray repair cross-complementing 1 (XRCC1)] in a case-control study involving 133 Caucasian Portuguese patients. The results did not reveal a correlation between individual BER polymorphisms and PN-MPNs when considered as a whole. However, stratification for essential thrombocythaemia revealed i) borderline effect/tendency to increased risk when carrying at least one variant allele for XRCC1_399 single-nucleotide polymorphism (SNP); ii) decreased risk for Janus kinase 2-positive patients carrying at least one variant allele for XRCC1_399 SNP; and iii) decreased risk in females carrying at least one variant allele for MUTYH SNP. Combination of alleles demonstrated an increased risk to PN-MPNs for one specific haplogroup. These findings may provide evidence for gene variants in susceptibility to MPNs. Indeed, common variants in DNA repair genes may hamper the capacity to repair DNA, thus increasing cancer susceptibility. PMID:28599464
Hölzemer, Angelique; Thobakgale, Christina F; Jimenez Cruz, Camilo A; Garcia-Beltran, Wilfredo F; Carlson, Jonathan M; van Teijlingen, Nienke H; Mann, Jaclyn K; Jaggernath, Manjeetha; Kang, Seung-gu; Körner, Christian; Chung, Amy W; Schafer, Jamie L; Evans, David T; Alter, Galit; Walker, Bruce D; Goulder, Philip J; Carrington, Mary; Hartmann, Pia; Pertel, Thomas; Zhou, Ruhong; Ndung'u, Thumbi; Altfeld, Marcus
2015-11-01
Viruses can evade immune surveillance, but the underlying mechanisms are insufficiently understood. Here, we sought to understand the mechanisms by which natural killer (NK) cells recognize HIV-1-infected cells and how this virus can evade NK-cell-mediated immune pressure. Two sequence mutations in p24 Gag associated with the presence of specific KIR/HLA combined genotypes were identified in HIV-1 clade C viruses from a large cohort of infected, untreated individuals in South Africa (n = 392), suggesting viral escape from KIR+ NK cells through sequence variations within HLA class I-presented epitopes. One sequence polymorphism at position 303 of p24 Gag (TGag303V), selected for in infected individuals with both KIR2DL3 and HLA-C*03:04, enabled significantly better binding of the inhibitory KIR2DL3 receptor to HLA-C*03:04-expressing cells presenting this variant epitope compared to the wild-type epitope (wild-type mean 18.01 ± 10.45 standard deviation [SD] and variant mean 44.67 ± 14.42 SD, p = 0.002). Furthermore, activation of primary KIR2DL3+ NK cells from healthy donors in response to HLA-C*03:04+ target cells presenting the variant epitope was significantly reduced in comparison to cells presenting the wild-type sequence (wild-type mean 0.78 ± 0.07 standard error of the mean [SEM] and variant mean 0.63 ± 0.07 SEM, p = 0.012). Structural modeling and surface plasmon resonance of KIR/peptide/HLA interactions in the context of the different viral sequence variants studied supported these results. Future studies will be needed to assess processing and antigen presentation of the investigated HIV-1 epitope in natural infection, and the consequences for viral control. These data provide novel insights into how viruses can evade NK cell immunity through the selection of mutations in HLA-presented epitopes that enhance binding to inhibitory NK cell receptors. Better understanding of the mechanisms by which HIV-1 evades NK-cell-mediated immune pressure and the functional validation of a structural modeling approach will facilitate the development of novel targeted immune interventions to harness the antiviral activities of NK cells.
Fine-scale population structure and the era of next-generation sequencing.
Henn, Brenna M; Gravel, Simon; Moreno-Estrada, Andres; Acevedo-Acevedo, Suehelay; Bustamante, Carlos D
2010-10-15
Fine-scale population structure characterizes most continents and is especially pronounced in non-cosmopolitan populations. Roughly half of the world's population remains non-cosmopolitan and even populations within cities often assort along ethnic and linguistic categories. Barriers to random mating can be ecologically extreme, such as the Sahara Desert, or cultural, such as the Indian caste system. In either case, subpopulations accumulate genetic differences if the barrier is maintained over multiple generations. Genome-wide polymorphism data, initially with only a few hundred autosomal microsatellites, have clearly established differences in allele frequency not only among continental regions, but also within continents and within countries. We review recent evidence from the analysis of genome-wide polymorphism data for genetic boundaries delineating human population structure and the main demographic and genomic processes shaping variation, and discuss the implications of population structure for the distribution and discovery of disease-causing genetic variants, in the light of the imminent availability of sequencing data for a multitude of diverse human genomes.
Miller, John J; Eackles, Michael S.; Stauffer, Jay R; King, Timothy L.
2015-01-01
We characterized variation within the mitochondrial genomes of the invasive silver carp (Hypophthalmichthys molitrix) and bighead carp (H. nobilis) from the Mississippi River drainage by mapping our Next-Generation sequences to their publicly available genomes. Variant detection resulted in 338 single-nucleotide polymorphisms for H. molitrix and 39 for H. nobilis. The much greater genetic variation in H. molitrix mitochondria relative to H. nobilis may be indicative of a greater North American female effective population size of the former. When variation was quantified by gene, many tRNA loci appear to have little or no variability based on our results whereas protein-coding regions were more frequently polymorphic. These results provide biologists with additional regions of DNA to be used as markers to study the invasion dynamics of these species.
Tang, Clara S; Zhang, He; Cheung, Chloe Y Y; Xu, Ming; Ho, Jenny C Y; Zhou, Wei; Cherny, Stacey S; Zhang, Yan; Holmen, Oddgeir; Au, Ka-Wing; Yu, Haiyi; Xu, Lin; Jia, Jia; Porsch, Robert M; Sun, Lijie; Xu, Weixian; Zheng, Huiping; Wong, Lai-Yung; Mu, Yiming; Dou, Jingtao; Fong, Carol H Y; Wang, Shuyu; Hong, Xueyu; Dong, Liguang; Liao, Yanhua; Wang, Jiansong; Lam, Levina S M; Su, Xi; Yan, Hua; Yang, Min-Lee; Chen, Jin; Siu, Chung-Wah; Xie, Gaoqiang; Woo, Yu-Cho; Wu, Yangfeng; Tan, Kathryn C B; Hveem, Kristian; Cheung, Bernard M Y; Zöllner, Sebastian; Xu, Aimin; Eugene Chen, Y; Jiang, Chao Qiang; Zhang, Youyi; Lam, Tai-Hing; Ganesh, Santhi K; Huo, Yong; Sham, Pak C; Lam, Karen S L; Willer, Cristen J; Tse, Hung-Fat; Gao, Wei
2015-12-22
Blood lipids are important risk factors for coronary artery disease (CAD). Here we perform an exome-wide association study by genotyping 12,685 Chinese, using a custom Illumina HumanExome BeadChip, to identify additional loci influencing lipid levels. Single-variant association analysis on 65,671 single nucleotide polymorphisms reveals 19 loci associated with lipids at exome-wide significance (P<2.69 × 10(-7)), including three Asian-specific coding variants in known genes (CETP p.Asp459Gly, PCSK9 p.Arg93Cys and LDLR p.Arg257Trp). Furthermore, missense variants at two novel loci-PNPLA3 p.Ile148Met and PKD1L3 p.Thr429Ser-also influence levels of triglycerides and low-density lipoprotein cholesterol, respectively. Another novel gene, TEAD2, is found to be associated with high-density lipoprotein cholesterol through gene-based association analysis. Most of these newly identified coding variants show suggestive association (P<0.05) with CAD. These findings demonstrate that exome-wide genotyping on samples of non-European ancestry can identify additional population-specific possible causal variants, shedding light on novel lipid biology and CAD.
Molecular analysis of MLH1 variants in Chinese sporadic colorectal cancer patients.
Peng, H X; Xu, X; Yang, R; Chu, Y M; Yang, D M; Xu, Y; Zhou, F L; Ma, W Z; Zhang, X J; Guan, M; Yang, Z H; Jin, Z D
2016-04-26
Single nucleotide polymorphisms (SNPs) in mismatch repair genes, especially in the MLH1 gene, are closely associated with susceptibility to hereditary nonpolyposis colorectal cancer. However, few relevant findings are available regarding the association between sporadic colorectal cancer (SCRC) and SNPs of MLH1 in Chinese patients. Therefore, the present study aimed to describe the pathogenic association between three important MLH1 polymorphisms and SCRC in the Chinese population. Peripheral blood samples from 156 SCRC patients and 311 healthy controls were collected. DNA was purified from peripheral blood, and the V384D, R217C, and I219V polymorphisms were evaluated using high-resolution melting analysis and direct sequencing. The association between the three important MLH1 polymorphisms and clinical pathological features of the SCRC patients was analyzed. In addition, PMS2-MLH1 protein interactions were determined by co-immunoprecipitation (Co-IP) to determine the protein functional alteration induced by these SNPs. Among the three polymorphisms, V384D was significantly associated with the risk of SCRC (OR = 31.36, P < 0.0001). The allele frequencies were 4.81 and 0.16% in the SCRC group. No association was found between SCRC and R217C, or between SCRC and I219V. Moreover, the allele frequency of R217C was significantly higher in the SCRC patients younger than 60 years than in those older than 60 years. Co-IP showed that the MLH1 R217C, V384D, and I219V variants had relative binding abilities with PMS2 of 0.59, 0.70, and 0.80, respectively, compared with the wild-type. These findings suggest that MLH1 V384D could be a promising genetic marker for susceptibility to SCRC.
Bigi, María Mercedes; Lopez, Beatriz; Blanco, Federico Carlos; Sasiain, María Del Carmen; De la Barrera, Silvia; Marti, Marcelo A; Sosa, Ezequiel Jorge; Fernández Do Porto, Darío Augusto; Ritacco, Viviana; Bigi, Fabiana; Soria, Marcelo Abel
2017-03-01
Globally, about 4.5% of new tuberculosis (TB) cases are multi-drug-resistant (MDR), i.e. resistant to the two most powerful first-line anti-TB drugs. Indeed, 480,000 people developed MDR-TB in 2015 and 190,000 people died because of MDR-TB. The MDR Mycobacterium tuberculosis M family, which belongs to the Haarlem lineage, is highly prosperous in Argentina and capable of building up further drug resistance without impairing its ability to spread. In this study, we sequenced the whole genomes of a highly prosperous M-family strain (Mp) and its contemporary variant, strain 410, which produced only one recorded tuberculosis case in the last two decades. Previous reports have demonstrated that Mp induced dysfunctional CD8 + cytotoxic T cell activity, suggesting that this strain has the ability to evade the immune response against M. tuberculosis. Comparative analysis of Mp and 410 genomes revealed non-synonymous polymorphisms in eleven genes and five intergenic regions with polymorphisms between both strains. Some of these genes and promoter regions are involved in the metabolism of cell wall components, others in drug resistance and a SNP in Rv1861, a gene encoding a putative transglycosylase that produces a truncated protein in Mp. The mutation in Rv3787c, a putative S-adenosyl-l-methionine-dependent methyltransferase, is conserved in all of the other prosperous M strains here analysed and absent in non-prosperous M strains. Remarkably, three polymorphic promoter regions displayed differential transcriptional activity between Mp and 410. We speculate that the observed mutations/polymorphisms are associated with the reported higher capacity of Mp for modulating the host's immune response. Copyright © 2017 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Xi, T; Jones, I M; Mohrenweiser, H W
2003-11-03
Over 520 different amino acid substitution variants have been previously identified in the systematic screening of 91 human DNA repair genes for sequence variation. Two algorithms were employed to predict the impact of these amino acid substitutions on protein activity. Sorting Intolerant From Tolerant (SIFT) classified 226 of 508 variants (44%) as ''Intolerant''. Polymorphism Phenotyping (PolyPhen) classed 165 of 489 amino acid substitutions (34%) as ''Probably or Possibly Damaging''. Another 9-15% of the variants were classed as ''Potentially Intolerant or Damaging''. The results from the two algorithms are highly associated, with concordance in predicted impact observed for {approx}62% of themore » variants. Twenty one to thirty one percent of the variant proteins are predicted to exhibit reduced activity by both algorithms. These variants occur at slightly lower individual allele frequency than do the variants classified as ''Tolerant'' or ''Benign''. Both algorithms correctly predicted the impact of 26 functionally characterized amino acid substitutions in the APE1 protein on biochemical activity, with one exception. It is concluded that a substantial fraction of the missense variants observed in the general human population are functionally relevant. These variants are expected to be the molecular genetic and biochemical basis for the associations of reduced DNA repair capacity phenotypes with elevated cancer risk.« less
The polymorphisms of LCR, E6, and E7 of HPV-58 isolates in Yunnan, Southwest China.
Xi, Juemin; Chen, Junying; Xu, Miaoling; Yang, Hongying; Wen, Songjiao; Pan, Yue; Wang, Xiaodan; Ye, Chao; Qiu, Lijuan; Sun, Qiangming
2018-04-25
Variations in HPV LCR/E6/E7 have been shown to be associated with the viral persistence and cervical cancer development. So far, there are few reports about the polymorphisms of the HPV-58 LCR/E6/E7 sequences in Southwest China. This study aims to characterize the gene polymorphisms of the HPV-58 LCR/E6/E7 sequences in women of Southwest China, and assess the effects of variations on the immune recognition of viral E6 and E7 antigens. Twelve LCR/E6/E7 of the HPV-58 isolates were amplified and sequenced. A neighbor-joining phylogenetic tree was constructed by MEGA 7.0, followed by the secondary structure prediction of the related proteins using PSIPRED v3.3. The selection pressure acting on the HPV-58 E6 and E7 coding regions was estimated by Bayes empirical Bayes analysis of PAML 4.8. Meanwhile, the MHC class-I and II binding peptides were predicted by the ProPred-I server and ProPred server. The transcription factor binding sites in the HPV-58 LCR were analyzed using the JASPAR database. Twenty nine SNPs (20 in the LCR, 3 in the E6, 6 in the E7) were identified at 27 nucleotide sites across the HPV-58 LCR/E6/E7. From the most variable to the least variable, the nucleotide variations were LCR > E7 > E6. The combinations of all the SNPs resulted in 11 unique sequences, which were clustered into the A lineage (7 belong to A1, 2 belong to A2, and 2 belong to A3). An insertion (TGTCAGTTTCCT) was found between the nucleotide sites 7280 and 7281 in 2 variants, and a deletion (TTTAT) was found between 7429 and 7433 in 1 variant. The most common non-synonymous substitution V77A in the E7 was observed in the sequences encoding the α-helix. 63G in the E7 was determined to be the only one positively selected site in the HPV-58 E6/E7 sequences. Six non-synonymous amino acid substitutions (including S71F and K93 N in the E6, and T20I, G41R, G63S/D, and V77A in the E7) were affecting multiple putative epitopes for both CD4 + and CD8 + T-cells. In the LCR, C7265G and C7266T were the most variable sites and were the potential binding sites for the transcription factor SOX10. These results provide an insight into the intrinsic geographical relatedness and biological differences of the HPV-58 variants, and contribute to further research on the HPV-58 epidemiology, carcinogenesis, and therapeutic vaccine development.
Trujillano, D; Ramos, M D; González, J; Tornador, C; Sotillo, F; Escaramis, G; Ossowski, S; Armengol, L; Casals, T; Estivill, X
2013-07-01
Here we have developed a novel and much more efficient strategy for the complete molecular characterisation of the cystic fibrosis (CF) transmembrane regulator (CFTR) gene, based on multiplexed targeted resequencing. We have tested this approach in a cohort of 92 samples with previously characterised CFTR mutations and polymorphisms. After enrichment of the pooled barcoded DNA libraries with a custom NimbleGen SeqCap EZ Choice array (Roche) and sequencing with a HiSeq2000 (Illumina) sequencer, we applied several bioinformatics tools to call mutations and polymorphisms in CFTR. The combination of several bioinformatics tools allowed us to detect all known pathogenic variants (point mutations, short insertions/deletions, and large genomic rearrangements) and polymorphisms (including the poly-T and poly-thymidine-guanine polymorphic tracts) in the 92 samples. In addition, we report the precise characterisation of the breakpoints of seven genomic rearrangements in CFTR, including those of a novel deletion of exon 22 and a complex 85 kb inversion which includes two large deletions affecting exons 4-8 and 12-21, respectively. This work is a proof-of-principle that targeted resequencing is an accurate and cost-effective approach for the genetic testing of CF and CFTR-related disorders (ie, male infertility) amenable to the routine clinical practice, and ready to substitute classical molecular methods in medical genetics.
Masny, Aleksander; Jagiełło, Agata; Płucienniczak, Grażyna; Golab, Elzbieta
2012-09-01
Ribo HRM, a single-tube PCR and high resolution melting (HRM) assay for detection of polymorphisms in the large subunit ribosomal DNA expansion segment V, was developed on a Trichinella model. Four Trichinella species: T. spiralis (isolates ISS3 and ISS160), T. nativa (isolates ISS10 and ISS70), T. britovi (isolates ISS2 and ISS392) and T. pseudospiralis (isolates ISS13 and ISS1348) were genotyped. Cloned allelic variants of the expansion segment V were used as standards to prepare reference HRM curves characteristic for single sequences and mixtures of several cloned sequences imitating allelic composition detected in Trichinella isolates. Using the primer pair Tsr1 and Trich1bi, it was possible to amplify a fragment of the ESV and detect PCR products obtained from the genomic DNA of pools of larvae belonging to the four investigated species: T. pseudospiralis, T. spiralis, T. britovi and T. nativa, in a single tube Real-Time PCR reaction. Differences in the shape of the HRM curves of Trichinella isolates suggested the presence of differences between examined isolates of T. nativa, T. britovi and T. pseudospiralis species. No differences were observed between T. spiralis isolates. The presence of polymorphisms within the amplified ESV sequence fragment of T. nativa T. britovi and T. pseudospiralis was confirmed by sequencing of the cloned PCR products. Novel sequences were discovered and deposited in GenBank (GenBank IDs: JN971020-JN971027, JN120902.1, JN120903.1, JN120904.1, JN120906.1, JN120905.1). Screening the ESV region of Trichinella for polymorphism is possible using the genotyping assay Ribo HRM at the current state of its development. The Ribo HRM assay could be useful in phylogenetic studies of the Trichinella genus. Copyright © 2012 Elsevier B.V. All rights reserved.
Wroblewski, Emily E; Norman, Paul J; Guethlein, Lisbeth A; Rudicell, Rebecca S; Ramirez, Miguel A; Li, Yingying; Hahn, Beatrice H; Pusey, Anne E; Parham, Peter
2015-05-01
Major histocompatibility complex (MHC) class I molecules determine immune responses to viral infections. These polymorphic cell-surface glycoproteins bind peptide antigens, forming ligands for cytotoxic T and natural killer cell receptors. Under pressure from rapidly evolving viruses, hominoid MHC class I molecules also evolve rapidly, becoming diverse and species-specific. Little is known of the impact of infectious disease epidemics on MHC class I variant distributions in human populations, a context in which the chimpanzee is the superior animal model. Population dynamics of the chimpanzees inhabiting Gombe National Park, Tanzania have been studied for over 50 years. This population is infected with SIVcpz, the precursor of human HIV-1. Because HLA-B is the most polymorphic human MHC class I molecule and correlates strongly with HIV-1 progression, we determined sequences for its ortholog, Patr-B, in 125 Gombe chimpanzees. Eleven Patr-B variants were defined, as were their frequencies in Gombe's three communities, changes in frequency with time, and effect of SIVcpz infection. The growing populations of the northern and central communities, where SIVcpz is less prevalent, have stable distributions comprising a majority of low-frequency Patr-B variants and a few high-frequency variants. Driving the latter to high frequency has been the fecundity of immigrants to the northern community, whereas in the central community, it has been the fecundity of socially dominant individuals. In the declining population of the southern community, where greater SIVcpz prevalence is associated with mortality and emigration, Patr-B variant distributions have been changing. Enriched in this community are Patr-B variants that engage with natural killer cell receptors. Elevated among SIVcpz-infected chimpanzees, the Patr-B*06:03 variant has striking structural and functional similarities to HLA-B*57, the human allotype most strongly associated with delayed HIV-1 progression. Like HLA-B*57, Patr-B*06:03 correlates with reduced viral load, as assessed by detection of SIVcpz RNA in feces.
Kadri, Naveen K; Guldbrandtsen, Bernt; Lund, Mogens S; Sahana, Goutam
2015-12-01
Intense selection to increase milk yield has had negative consequences for mastitis incidence in dairy cattle. Due to low heritability of mastitis resistance and an unfavorable genetic correlation with milk yield, a reduction in mastitis through traditional breeding has been difficult to achieve. Here, we examined quantitative trait loci (QTL) that segregate for clinical mastitis and milk yield on Bos taurus autosome 20 (BTA20) to determine whether both traits are affected by a single polymorphism (pleiotropy) or by multiple closely linked polymorphisms. In the latter but not the former situation, undesirable genetic correlation could potentially be broken by selecting animals that have favorable variants for both traits. First, we performed a within-breed association study using a haplotype-based method in Danish Holstein cattle (HOL). Next, we analyzed Nordic Red dairy cattle (RDC) and Danish Jersey cattle (JER) with the goal of determining whether these QTL identified in Holsteins were segregating across breeds. Genotypes for 12,566 animals (5,966 HOL, 5,458 RDC, and 1,142 JER) were determined by using the Illumina Bovine SNP50 BeadChip (50K; Illumina, San Diego, CA), which identifies 1,568 single nucleotide polymorphisms on BTA20. Data were combined, phased, and clustered into haplotype states, followed by within- and across-breed haplotype-based association analyses using a linear mixed model. Association signals for both clinical mastitis and milk yield peaked in the 26- to 40-Mb region on BTA20 in HOL. Single-variant association analyses were carried out in the QTL region using whole sequence level variants imputed from references of 2,036 HD genotypes (BovineHD BeadChip; Illumina) and 242 whole-genome sequences. The milk QTL were also segregating in RDC and JER on the BTA20-targeted region; however, an indication of differences in the causal factor(s) was observed across breeds. A previously reported F279Y mutation (rs385640152) within the growth hormone receptor gene showed strong association with milk, fat, and protein yields. In HOL, the highest peaks for milk yield and susceptibility to mastitis were separated by over 3.5 Mb (3.8 Mb by haplotype analysis, 3.6 Mb by single nucleotide polymorphism analysis), suggesting separate genetic variants for the traits. Further analysis yielded 2 candidate mutations for the mastitis QTL, at 33,642,072 bp (rs378947583) in an intronic region of the caspase recruitment domain protein 6 gene and 35,969,994 bp (rs133596506) in an intronic region of the leukemia-inhibitory factor receptor gene. These findings suggest that it may be possible to separate these beneficial and detrimental genetic factors through targeted selective breeding. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Generalization of Associations of Kidney-Related Genetic Loci to American Indians
Haack, Karin; Almasy, Laura; Laston, Sandra; Lee, Elisa T.; Best, Lyle G.; Fabsitz, Richard R.; MacCluer, Jean W.; Howard, Barbara V.; Umans, Jason G.; Cole, Shelley A.
2014-01-01
Summary Background and objectives CKD disproportionally affects American Indians, who similar to other populations, show genetic susceptibility to kidney outcomes. Recent studies have identified several loci associated with kidney traits, but their relevance in American Indians is unknown. Design, setting, participants, & measurements This study used data from a large, family-based genetic study of American Indians (the Strong Heart Family Study), which includes 94 multigenerational families enrolled from communities located in Oklahoma, the Dakotas, and Arizona. Individuals were recruited from the Strong Heart Study, a population-based study of cardiovascular disease in American Indians. This study selected 25 single nucleotide polymorphisms in 23 loci identified from recently published kidney-related genome-wide association studies in individuals of European ancestry to evaluate their associations with kidney function (estimated GFR; individuals 18 years or older, up to 3282 individuals) and albuminuria (urinary albumin to creatinine ratio; n=3552) in the Strong Heart Family Study. This study also examined the association of single nucleotide polymorphisms in the APOL1 region with estimated GFR in 1121 Strong Heart Family Study participants. GFR was estimated using the abbreviated Modification of Diet in Renal Disease Equation. Additive genetic models adjusted for age and sex were used. Results This study identified significant associations of single nucleotide polymorphisms with estimated GFR in or nearby PRKAG2, SLC6A13, UBE2Q2, PIP5K1B, and WDR72 (P<2.1 × 10-3 to account for multiple testing). Single nucleotide polymorphisms in these loci explained 2.2% of the estimated GFR total variance and 2.9% of its heritability. An intronic variant of BCAS3 was significantly associated with urinary albumin to creatinine ratio. APOL1 single nucleotide polymorphisms were not associated with estimated GFR in a single variant test or haplotype analyses, and the at-risk variants identified in individuals with African ancestry were not detected in DNA sequencing of American Indians. Conclusion This study extends the genetic associations of loci affecting kidney function to American Indians, a population at high risk of kidney disease, and provides additional support for a potential biologic relevance of these loci across ancestries. PMID:24311711
Pandith, Arshad A; Qasim, Iqbal; Zahoor, Wani; Shah, Parveen; Bhat, Abdul R
2018-01-10
ACE I/D and MTHFR C677T gene polymorphisms can be seen as candidate genes for glioma on the basis of their biological functions and their involvement in different cancers. The aim of this study was to analyze potential association and overall survival between MTHFR C677T and ACE I/D polymorphism in glioma patients in our population. We tested genotype distribution of 112 glioma patients against 141 cancer-free controls from the same region. Kaplan-Meier survival analysis was performed to evaluate overall survival of patients for both genes. No significant differences were found among MTHFR C677T wild type C and variant genotypes CT/TT with glioma patients. In ACE, the distribution of variant ID and DD was found to be significantly higher in glioma cases as compared to controls (p<0.0001). ACE DD genotypes were highly presented in glioma cases 26.8% versus 10.6% in controls (p<0.0001) and conferred 5-fold risk for predisposition in glioma cases. Per copy D allele frequency was found higher in cases than in controls (0.54 versus 0.25: p<0.0001). Interestingly we found a significant overall survival (with log rank p<0.01) in patients who presented with ACE DD genotypes had the least estimated overall survival of 13.4months in comparison to 21. 7 and 17.6months for ACE II and I/D genotypes respectively. We conclude ACE I/D polymorphism plays a vital role in predisposition of higher risk for glioma. We also suggest that ACE DD genotypes may act as an important predictive biomarker for overall survival of glioma patients. Copyright © 2017. Published by Elsevier B.V.
Hinney, Anke; Hoch, Anne; Geller, Frank; Schäfer, Helmut; Siegfried, Wolfgang; Goldschmidt, Hanspeter; Remschmidt, Helmut; Hebebrand, Johannes
2002-06-01
Ghrelin induces obesity via central and peripheral mechanisms. Administration of ghrelin leads to increased food intake and decreased fat utilisation in rodents. Ghrelin levels are decreased in obese individuals. Recently, a polymorphism (Arg-51-Gln) within the ghrelin gene (GHRL) was described to be associated with obesity. We screened the GHRL coding region in 215 extremely obese German Children and adolescents (study group 1) and 93 normal weight students (study group 2) by single strand conformation polymorphism analysis (SSCP). We found the two previously described single nucleotide polymorphisms (SNP: Arg-51-Gln and Leu-72-Met) in similar frequencies in study groups 1 and 2 (allele frequencies were: 0.019 and 0.016 for the 51-Gln allele and 0.091 and 0.086 for the 72-Met allele, respectively). Hence, we could not confirm the previous finding. Additionally, two novel variants were identified within the coding region: (1) We detected one healthy normal weight individual with a frameshift mutation (2bp deletion at codon 34). This frameshift mutation affects the coding region of the mature ghrelin. Hence, it is highly likely that the normal weight student is haplo-insufficient for ghrelin. (2) An A to T transversion leads to an amino acid exchange from Gln to Leu at amino acid position 90. The frequency of the 90-Leu allele was significantly higher in the extremely obese children and adolescents (0.063) than in the normal weight students (0.016; nominal p = 0.011). Additionally, we genotyped 134 underweight students and 44 normal weight adults for this SNP. Genotype frequencies were similar in extremely obese children and adolescents, underweight students and normal weight adults (p > 0.8). In conclusion, we identified four sequence variants in the coding region of the ghrelin gene in individuals belonging to different weight extremes. A frameshift mutation was detected in a normal weight individual. None of the variants seem to influence weight regulation.
Delaneau, Olivier; Marchini, Jonathan
2014-06-13
A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000 GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants.
Brady, Graham F; Kwan, Raymond; Ulintz, Peter J; Nguyen, Phirum; Bassirian, Shirin; Basrur, Venkatesha; Nesvizhskii, Alexey I; Loomba, Rohit; Omary, M Bishr
2018-05-01
Nonalcoholic fatty liver disease (NAFLD) is becoming the major chronic liver disease in many countries. Its pathogenesis is multifactorial, but twin and familial studies indicate significant heritability, which is not fully explained by currently known genetic susceptibility loci. Notably, mutations in genes encoding nuclear lamina proteins, including lamins, cause lipodystrophy syndromes that include NAFLD. We hypothesized that variants in lamina-associated proteins predispose to NAFLD and used a candidate gene-sequencing approach to test for variants in 10 nuclear lamina-related genes in a cohort of 37 twin and sibling pairs: 21 individuals with and 53 without NAFLD. Twelve heterozygous sequence variants were identified in four lamina-related genes (ZMPSTE24, TMPO, SREBF1, SREBF2). The majority of NAFLD patients (>90%) had at least one variant compared to <40% of controls (P < 0.0001). When only insertions/deletions and changes in conserved residues were considered, the difference between the groups was similarly striking (>80% versus <25%; P < 0.0001). Presence of a lamina variant segregated with NAFLD independently of the PNPLA3 I148M polymorphism. Several variants were found in TMPO, which encodes the lamina-associated polypeptide-2 (LAP2) that has not been associated with liver disease. One of these, a frameshift insertion that generates truncated LAP2, abrogated lamin-LAP2 binding, caused LAP2 mislocalization, altered endogenous lamin distribution, increased lipid droplet accumulation after oleic acid treatment in transfected cells, and led to cytoplasmic association with the ubiquitin-binding protein p62/SQSTM1. Several variants in nuclear lamina-related genes were identified in a cohort of twins and siblings with NAFLD; one such variant, which results in a truncated LAP2 protein and a dramatic phenotype in cell culture, represents an association of TMPO/LAP2 variants with NAFLD and underscores the potential importance of the nuclear lamina in NAFLD. (Hepatology 2018;67:1710-1725). © 2017 by the American Association for the Study of Liver Diseases.
Functional Consequences of a Novel Variant of PCSK1
Pickett, Lindsay A.; Yourshaw, Michael; Albornoz, Valeria; Chen, Zijun; Solorzano-Vargas, R. Sergio; Nelson, Stanley F.; Martín, Martín G.; Lindberg, Iris
2013-01-01
Background Common single nucleotide polymorphisms (SNPs) in proprotein convertase subtilisin/kexin type 1 with modest effects on PC1/3 in vitro have been associated with obesity in five genome-wide association studies and with diabetes in one genome-wide association study. We here present a novel SNP and compare its biosynthesis, secretion and catalytic activity to wild-type enzyme and to SNPs that have been linked to obesity. Methodology/Principal Findings A novel PC1/3 variant introducing an Arg to Gln amino acid substitution at residue 80 (within the secondary cleavage site of the prodomain) (rs1799904) was studied. This novel variant was selected for analysis from the 1000 Genomes sequencing project based on its predicted deleterious effect on enzyme function and its comparatively more frequent allele frequency. The actual existence of the R80Q (rs1799904) variant was verified by Sanger sequencing. The effects of this novel variant on the biosynthesis, secretion, and catalytic activity were determined; the previously-described obesity risk SNPs N221D (rs6232), Q665E/S690T (rs6234/rs6235), and the Q665E and S690T SNPs (analyzed separately) were included for comparative purposes. The novel R80Q (rs1799904) variant described in this study resulted in significantly detrimental effects on both the maturation and in vitro catalytic activity of PC1/3. Conclusion/Significance Our findings that this novel R80Q (rs1799904) variant both exhibits adverse effects on PC1/3 activity and is prevalent in the population suggests that further biochemical and genetic analysis to assess its contribution to the risk of metabolic disease within the general population is warranted. PMID:23383060
Functional consequences of a novel variant of PCSK1.
Pickett, Lindsay A; Yourshaw, Michael; Albornoz, Valeria; Chen, Zijun; Solorzano-Vargas, R Sergio; Nelson, Stanley F; Martín, Martín G; Lindberg, Iris
2013-01-01
Common single nucleotide polymorphisms (SNPs) in proprotein convertase subtilisin/kexin type 1 with modest effects on PC1/3 in vitro have been associated with obesity in five genome-wide association studies and with diabetes in one genome-wide association study. We here present a novel SNP and compare its biosynthesis, secretion and catalytic activity to wild-type enzyme and to SNPs that have been linked to obesity. A novel PC1/3 variant introducing an Arg to Gln amino acid substitution at residue 80 (within the secondary cleavage site of the prodomain) (rs1799904) was studied. This novel variant was selected for analysis from the 1000 Genomes sequencing project based on its predicted deleterious effect on enzyme function and its comparatively more frequent allele frequency. The actual existence of the R80Q (rs1799904) variant was verified by Sanger sequencing. The effects of this novel variant on the biosynthesis, secretion, and catalytic activity were determined; the previously-described obesity risk SNPs N221D (rs6232), Q665E/S690T (rs6234/rs6235), and the Q665E and S690T SNPs (analyzed separately) were included for comparative purposes. The novel R80Q (rs1799904) variant described in this study resulted in significantly detrimental effects on both the maturation and in vitro catalytic activity of PC1/3. Our findings that this novel R80Q (rs1799904) variant both exhibits adverse effects on PC1/3 activity and is prevalent in the population suggests that further biochemical and genetic analysis to assess its contribution to the risk of metabolic disease within the general population is warranted.
Is IGSF1 involved in human pituitary tumor formation?
Faucz, Fabio R; Horvath, Anelia D; Azevedo, Monalisa F; Levy, Isaac; Bak, Beata; Wang, Ying; Xekouki, Paraskevi; Szarek, Eva; Gourgari, Evgenia; Manning, Allison D; de Alexandre, Rodrigo Bertollo; Saloustros, Emmanouil; Trivellin, Giampaolo; Lodish, Maya; Hofman, Paul; Anderson, Yvonne C; Holdaway, Ian; Oldfield, Edward; Chittiboina, Prashant; Nesterova, Maria; Biermasz, Nienke R; Wit, Jan M; Bernard, Daniel J; Stratakis, Constantine A
2015-02-01
IGSF1 is a membrane glycoprotein highly expressed in the anterior pituitary. Pathogenic mutations in the IGSF1 gene (on Xq26.2) are associated with X-linked central hypothyroidism and testicular enlargement in males. In this study, we tested the hypothesis that IGSF1 is involved in the development of pituitary tumors, especially those that produce growth hormone (GH). IGSF1 was sequenced in 21 patients with gigantism or acromegaly and 92 healthy individuals. Expression studies with a candidate pathogenic IGSF1 variant were carried out in transfected cells and immunohistochemistry for IGSF1 was performed in the sections of GH-producing adenomas, familial somatomammotroph hyperplasia, and in normal pituitary. We identified the sequence variant p.N604T, which in silico analysis suggested could affect IGSF1 function, in two male patients and one female with somatomammotroph hyperplasia from the same family. Of 60 female controls, two carried the same variant and seven were heterozygous for other variants. Immunohistochemistry showed increased IGSF1 staining in the GH-producing tumor from the patient with the IGSF1 p.N604T variant compared with a GH-producing adenoma from a patient negative for any IGSF1 variants and with normal control pituitary tissue. The IGSF1 gene appears polymorphic in the general population. A potentially pathogenic variant identified in the germline of three patients with gigantism from the same family (segregating with the disease) was also detected in two healthy female controls. Variations in IGSF1 expression in pituitary tissue in patients with or without IGSF1 germline mutations point to the need for further studies of IGSF1 action in pituitary adenoma formation. © 2015 Society for Endocrinology.
Is IGSF1 involved in human pituitary tumor formation?
Faucz, Fabio R.; Horvath, Anelia D.; Azevedo, Monalisa F.; Levy, Isaac; Bak, Beata; Wang, Ying; Xekouki, Paraskevi; Szarek, Eva; Gourgari, Evgenia; Manning, Allison D.; de Alexandre, Rodrigo Bertollo; Saloustros, Emmanouil; Trivellin, Giampaolo; Lodish, Maya; Hofman, Paul; Anderson, Yvonne C; Holdaway, Ian; Oldfield, Edward; Chittiboina, Prashant; Nesterova, Maria; Biermasz, Nienke R.; Wit, Jan M.; Bernard, Daniel J.; Stratakis, Constantine A.
2014-01-01
IGSF1 is a membrane glycoprotein highly expressed in the anterior pituitary. Pathogenic mutations in the IGSF1 gene (on Xq26.2) are associated with X-linked central hypothyroidism and testicular enlargement in males. In this study we tested the hypothesis that IGSF1 is involved in the development of pituitary tumors, especially those that produce growth hormone (GH). IGSF1 was sequenced in 21 patients with gigantism or acromegaly and 92 healthy individuals. Expression studies with a candidate pathogenic IGSF1 variant were carried out in transfected cells and immunohistochemistry for IGSF1 was performed in sections from GH-producing adenomas, familial somatomammotroph hyperplasia and in normal pituitary. In two male patients, and in one female, with somatomammotroph hyperplasia from the same family, we identified the sequence variant p.N604T, which in silico analysis suggested could affect IGSF1 function. Of 60 female controls, two carried the same variant, and seven were heterozygous for other variants. Immunohistochemistry showed increase IGSF1 staining in the GH-producing tumor from the patient with the IGSF1 p.N604T variant compared to a GH-producing adenoma from a patient negative for any IGSF1 variants and to normal control pituitary tissue. The IGSF1 gene appears polymorphic in the general population. A potentially pathogenic variant identified in the germline of three patients with gigantism from the same family (segregating with the disease) was also detected in two healthy female controls. Variations in IGSF1 expression in pituitary tissue in patients with or without IGSF1 germline mutations point to the need for further studies of IGSF1 action in pituitary adenoma formation. PMID:25527509
Smoking, genes encoding dopamine pathway and risk for Parkinson's disease.
Gu, Zhuqin; Feng, Xiuli; Dong, Xiumin; Chan, Piu
2010-09-20
Smoking has been reported to be inversely associated with Parkinson's disease (PD) in many studies, but a recent study in China found that smoking increased the risk of PD. Variants in genes associated with dopamine metabolism found to increase the risk for PD have also been associated with smoking behavior. To investigate the association between smoking and PD in a Chinese population and determine whether the genetic variants of genes involved in dopamine metabolism influence the relationship between smoking and risk for PD. Chinese PD patients were recruited from Xuanwu Hospital. Controls were sampled from community. Detailed information on life-long smoking behavior was collected by face-to-face interview. Genotypes were determined for SLC6A3 VNTR, COMT Val108/158Met and MAO-B intron13 A/G polymorphisms by PCR-RFLP, DHPLC and sequencing. Chi-square and logistic regression model were used in the analysis. 176 PD cases and 354 controls were enrolled in this study. 23.9% cases are smokers, compared to 48.0% in controls. Ever smoking is inversely associated with PD (odds ratio=0.14, 95% CI 0.08-0.26, adjusted for age and gender). None of the above-mentioned genetic polymorphisms was associated with PD risk or smoking. When each variant was included in the logistic regression model, the inverse association between smoking and PD remained the same, and the interactions between smoking and variants were not significant in the model. Our data support a reduction of PD risk associated with smoking in a Chinese population. These variants of genes associated with DA uptake and metabolism do not affect the inverse association between smoking and PD. Copyright 2010 Elsevier Ireland Ltd. All rights reserved.
Novotny, Dalibor; Vaverkova, Helena; Karasek, David; Malina, Pavel
2014-08-01
The aim was to evaluate the relationships of the T-1131C (rs662799) polymorphism variants of apolipoprotein A5 (Apo A5) gene and variants of apolipoprotein E (Apo E) gene common polymorphism (rs429358, rs7412) to signs of metabolic syndrome (MetS). We examined 590 asymptomatic dyslipidemic patients divided into MetS+ (n=146) and MetS- (n=444) groups according to criteria of NCEP ATPIII Panel. We evaluated genotype frequencies and differences in MetS features between individual groups. Logistic regression analysis was used for the evaluation of Apo A5/Apo E variants as possible risk factors for MetS. We found no statistical differences between genotype and allele frequencies for both Apo A5 and Apo E polymorphisms between MetS+ and MetS- groups. In all subjects and MetS- group, we confirmed well-known association of the -1131C Apo A5 minor allele with elevated triglycerides (TG, p<0.001). The Apo E gene E2 and E4 variants were associated with higher levels of TG (p<0.01) in comparison to E33 common variant. However, no statistical differences were observed in MetS+ subjects, regardless of significantly higher TG levels in this group. Apo A5/Apo E variant analysis in all dyslipidemic patients revealed significant increase of TG levels in all subgroups in comparison to common -1131T/E3 variant carriers, the most in -1131C/E4 variant subgroup. Logistic regression analysis models showed no association of Apo A5, Apo E and all Apo A5/Apo E variants with metabolic syndrome, even after adjustment for age and sex. Our study refined the role of Apo A5 and Apo E genetic variants in the group of adult dyslipidemic patients. We demonstrate that except of TG, Apo A5 T-1131C (rs662799) and Apo E (rs429358, rs7412) polymorphisms have no remarkable effect on MetS characteristics. Copyright © 2014 The Canadian Society of Clinical Chemists. Published by Elsevier Inc. All rights reserved.
2010-01-01
Background Hypertriglyceridemia (HTG) is a well-established independent risk factor for cardiovascular disease and the influence of several genetic variants in genes related with triglyceride (TG) metabolism has been described, including LPL, APOA5 and APOE. The combined analysis of these polymorphisms could produce clinically meaningful complementary information. Methods A subgroup of the ICARIA study comprising 1825 Spanish subjects (80% men, mean age 36 years) was genotyped for the LPL-HindIII (rs320), S447X (rs328), D9N (rs1801177) and N291S (rs268) polymorphisms, the APOA5-S19W (rs3135506) and -1131T/C (rs662799) variants, and the APOE polymorphism (rs429358; rs7412) using PCR and restriction analysis and TaqMan assays. We used regression analyses to examine their combined effects on TG levels (with the log-transformed variable) and the association of variant combinations with TG levels and hypertriglyceridemia (TG ≥ 1.69 mmol/L), including the covariates: gender, age, waist circumference, blood glucose, blood pressure, smoking and alcohol consumption. Results We found a significant lowering effect of the LPL-HindIII and S447X polymorphisms (p < 0.0001). In addition, the D9N, N291S, S19W and -1131T/C variants and the APOE-ε4 allele were significantly associated with an independent additive TG-raising effect (p < 0.05, p < 0.01, p < 0.001, p < 0.0001 and p < 0.001, respectively). Grouping individuals according to the presence of TG-lowering or TG-raising polymorphisms showed significant differences in TG levels (p < 0.0001), with the lowest levels exhibited by carriers of two lowering variants (10.2% reduction in TG geometric mean with respect to individuals who were homozygous for the frequent alleles of all the variants), and the highest levels in carriers of raising combinations (25.1% mean TG increase). Thus, carrying two lowering variants was protective against HTG (OR = 0.62; 95% CI, 0.39-0.98; p = 0.042) and having one single raising polymorphism (OR = 1.20; 95% CI, 1.39-2.87; p < 0.001) or more (2 or 3 raising variants; OR = 2.90; 95% CI, 1.56-5.41; p < 0.001) were associated with HTG. Conclusion Our results showed a significant independent additive effect on TG levels of the LPL polymorphisms HindIII, S447X, D9N and N291S; the S19W and -1131T/C variants of APOA5, and the ε4 allele of APOE in our study population. Moreover, some of the variant combinations studied were significantly associated with the absence or the presence of hypertriglyceridemia. PMID:20429872
Migliore, Sergio; Agnello, Stefano; D'Avola, Salvatore; Goldmann, Wilfred; Di Marco Lo Presti, Vincenzo; Vitale, Maria
2017-06-01
Transmissible spongiform encephalopathies (TSEs) are a group of neurodegenerative diseases affecting humans and animals, and scrapie in small ruminants is considered the archetype of TSEs. Derivata di Siria is a native dairy goat of Sicily (south Italy), which is related to Syrian goat breeds. Scrapie disease is considered endemic in Sicily since 1997, following the administration of an infected vaccine.Derivata di Siria goatswere involved in six of 66 scrapie-infected flocks in Sicily. Prion protein gene (PRNP) analysis revealed that none of the scrapie cases carried the p.Gln222Lys variant. Sequencing of PRNP in this goat population showed a high frequency (15%) of p.Gln222Lys variant confirming its association with scrapie resistance. PRNP polymorphisms were also analysed in the population of Pantelleria, a small Sicilian Island, where scrapie has never been reported. The native goat breed 'Pantesca' was maintained up to almost 80 years and the size of the sheep population on this island has historically been very low. Currently, a crossbreed goat population of 253 heads is present on the island. PRNP genotyping of Pantelleria goats showed genetic variation, with low presence of wild-type goats and the lack of protective alleles. These data reinforce the association between PRNP polymorphisms in small ruminants and scrapie incidence.
[Association between APOC3 promoter region polymorphisms and non-alcoholic fatty liver disease].
Niu, Tonghong; Jiang, Man; Liu, Haogang; Jiang, Xiangjun; Lin, Zhonghua; Zhang, Mei; Wang, Jian; Geng, Ning; Xin, Yongning; Xuan, Shiying
2014-05-01
To investigate the association between two polymorphisms of the APOC3 gene (T-455C and C-482T) and hereditary risk of non-alcoholic fatty liver disease (NAFLD). A total of 287 patients with NAFLD and 310 control subjects were genotyped by PCR and direct sequencing. Serum lipid profiles were also detected by standard biochemical One-hundred-and-eighty of the study participants were used to measure the APOC3 content by enzyme-linked immunosorbent assay. Inter-group differences and associations were assessed statistically using Chi square and t tests and logistic and linear regression analyses. The frequencies of neither the genotypes or alleles were significantly different between the NAFLD cases and the controls. Compared with the most common genotypes-455TT or-482CC, none of the variants showed a significant increase in risk of NAFLD or for the clinical and biochemical parameters. The adjusted odds ratios (with 95% confidence intervals) of NAFLD were 1.25 (0.79-1.96) and 1.20 (0.76-1.89) for carriers of the APOC3-455C and-482 T variants respectively (P more than 0.05). The T-455C and C-482T polymorphisms of the APOC3 gene are not associated with risk of NAFLD, pathogenic changes in lipid profiles, or insulin resistance in Han Chinese.
Gao, Li; Rafaels, Nicholas M; Huang, Lili; Potee, Joseph; Ruczinski, Ingo; Beaty, Terri H.; Paller, Amy S.; Schneider, Lynda C.; Gallo, Rich; Hanifin, Jon M.; Beck, Lisa A.; Geha, Raif S.; Mathias, Rasika A.; Leung, Donald Y. M.
2015-01-01
Background A subset of atopic dermatitis (AD) is associated with increased susceptibility to eczema herpeticum (ADEH+). We previously reported that common single nucleotide polymorphisms (SNPs) in interferon-gamma (IFNG) and receptor 1 (IFNGR1) were associated with ADEH+ phenotype. Objective To interrogate the role of rare variants in IFN-pathway genes for risk of ADEH+. Methods We performed targeted sequencing of interferon-pathway genes (IFNG, IFNGR1, IFNAR1 and IL12RB1) in 228 European American (EA) AD patients selected according to their EH status and severity measured by Eczema Area and Severity Index (EASI). Replication genotyping was performed in independent samples of 219 EA and 333 African Americans (AA). Functional investigation of ‘loss-of-function’ variants was conducted using site-directed mutagenesis. Results We identified 494 single nucleotide variants (SNVs) encompassing 105kb of sequence, including 145 common, 349 (70.6%) rare (minor allele frequency (MAF) <5%) and 86 (17.4%) novel variants, of which 2.8% were coding-synonymous, 93.3% were non-coding (64.6% intronic), and 3.8% were missense. We identified six rare IFNGR1 missense including three damaging variants (Val14Met (V14M), Val61Ile and Tyr397Cys (Y397C)) conferring a higher risk for ADEH+ (P=0.031). Variants V14M and Y397C were confirmed to be deleterious leading to partial IFNGR1 deficiency. Seven common IFNGR1 SNPs, along with common protective haplotypes (2 to 7-SNPs) conferred a reduced risk of ADEH+ (P=0.015-0.002, P=0.0015-0.0004, respectively), and both SNP and haplotype associations were replicated in an independent AA sample (P=0.004-0.0001 and P=0.001-0.0001, respectively). Conclusion Our results provide evidence that both genetic variants in the gene encoding IFNGR1 are implicated in susceptibility to the ADEH+ phenotype. CAPSULE SUMMARY We provided the first evidence that rare functional IFNGR1 mutations contribute to a defective systemic IFN-γ immune response that accounts for the propensity of AD patients to disseminated viral skin infections. PMID:26343451
Two variants in the resistin gene and the response to long-term overfeeding.
Ukkola, O; Kesäniemi, Y Antero; Tremblay, A; Bouchard, C
2004-04-01
To investigate the role of resistin gene variants on the adiposity and metabolic changes observed in response to a 100-day overfeeding protocol conducted with 12 pairs of monozygotic twins. Body-fat measurements included hydrodensitometry and abdominal fat from computed tomography. Plasma glucose and insulin during fasting and in response to an oral glucose tolerance test (OGTT) were assayed. A 4.2 MJ test meal was consumed, after which calorimetric measurements were performed for 240 min. Respiratory quotient (RQ) decreased (P=0.001) more in AA/AG than in GG subjects of the IVS2+181G>A polymorphism after the caloric surplus and the significance persisted when correction for multiple testing was performed. Total abdominal (P=0.027) and visceral (P=0.004) fat increased more in TC than in TT subjects of the IVS2+39C>T polymorphism. In response to overfeeding, glucose area under the curve during the OGTT showed a slight decrease (P=0.031) in the TC while it increased in TT subjects. OGTT insulin area tended to increase less (P=0.055) in TC than in TT subjects. After overfeeding, fasting insulin was lower in TC than in TT subjects (P=0.010). In addition, TC subjects experienced more decrease in RQ than TT subjects (P=0.034). The IVS2+181G>A variant was associated with the changes in RQ in response to overfeeding. The IVS2+39C>T polymorphism was associated with overfeeding-induced changes in abdominal visceral fat, OGTT glucose area and RQ. The results suggest that sequence variation in the resistin gene is involved in the adaptation to chronic positive energy balance.
Protein-based forensic identification using genetically variant peptides in human bone.
Mason, Katelyn Elizabeth; Anex, Deon; Grey, Todd; Hart, Bradley; Parker, Glendon
2018-04-22
Bone tissue contains organic material that is useful for forensic investigations and may contain preserved endogenous protein that can persist in the environment for extended periods of time over a range of conditions. Single amino acid polymorphisms in these proteins reflect genetic information since they result from non-synonymous single nucleotide polymorphisms (SNPs) in DNA. Detection of genetically variant peptides (GVPs) - those peptides that contain amino acid polymorphisms - in digests of bone proteins allows for the corresponding SNP alleles to be inferred. Resulting genetic profiles can be used to calculate statistical measures of association between a bone sample and an individual. In this study proteomic analysis on rib cortical bone samples from 10 recently deceased individuals demonstrates this concept. A straight-forward acidic demineralization protocol yielded proteins that were digested with trypsin. Tryptic digests were analyzed by liquid chromatography mass spectrometry. A total of 1736 different proteins were identified across all resulting datasets. On average, individual samples contained 454±121 (x¯±σ) proteins. Thirty-five genetically variant peptides were identified from 15 observed proteins. Overall, 134 SNP inferences were made based on proteomically detected GVPs, which were confirmed by sequencing of subject DNA. Inferred individual SNP genetic profiles ranged in random match probability (RMP) from 1/6 to 1/42,472 when calculated with European population frequencies in the 1000 Genomes Project, Phase 3. Similarly, RMPs based on African population frequencies were calculated for each SNP genetic profile and likelihood ratios (LR) were obtained by dividing each European RMP by the corresponding African RMP. Resulting LR values ranged from 1.4 to 825 with a median value of 16. GVP markers offer a basis for the identification of compromised skeletal remains independent of the presence of DNA template. Published by Elsevier B.V.
Wujcicka, Wioletta Izabela; Wilczyński, Jan Szczęsny; Nowakowska, Dorota Ewa
2017-05-01
The study was aimed to estimate the role and prevalence rates of genotypes, haplotypes, and alleles, located within the single-nucleotide polymorphisms (SNPs) of interleukin (IL) 1A, IL1B, and IL6 genes, in the occurrence and development of human cytomegalovirus (HCMV) infection among pregnant women. A research was conducted in 129 pregnant women, out of whom, 65 were HCMV infected and 64 were age-matched control uninfected individuals. HCMV DNA was quantitated for UL55 gene by the real-time Q PCR in the body fluids. The genotypic statuses within the SNPs were determined by nested PCR-RFLP assays and confirmed, by sequencing for randomly selected representative PCR products. A relationship between the genotypes and alleles, as well as haplotypes and multiple variants in the studied polymorphisms, and the occurrence of HCMV infection in pregnant women, was determined using a logistic regression model. TT genotype within IL1A polymorphism significantly decreased the risk of HCMV infection (OR 0.32, 95% CI 0.09-1.05; p ≤ 0.050). Considering IL6 SNP, the prevalence rate of GC genotype was significantly decreased among the HCMV infected, compared to the uninfected control individuals (OR 0.45, 95% CI 0.21-0.99; p ≤ 0.050). Moreover, CC homozygotic status in IL6 SNP, found in pregnant women, significantly decreased the risk of congenital infection with HCMV in their offsprings (OR 0.12; p ≤ 0.050). In multiple SNP analysis, TC haplotype within the IL1 polymorphisms significantly decreased the risk of the infection in pregnant women (OR 0.38 95% CI 0.15-0.96; p ≤ 0.050). In addition, TTG complex variants for all the studied polymorphisms and TG variants for IL1B and IL6 SNPs were significantly more prevalent among the infected offsprings with symptomatic congenital cytomegaly than among the asymptomatic cases (p ≤ 0.050). In conclusion, the analyzed IL1A -889 C>T, IL1B +3954 C>T, and IL6 -174 G>C polymorphisms may be associated with the occurrence and development of HCMV infection among studied patients.
Detailed Investigation of the Role of Common and Low-Frequency WFS1 Variants in Type 2 Diabetes Risk
Fawcett, Katherine A.; Wheeler, Eleanor; Morris, Andrew P.; Ricketts, Sally L.; Hallmans, Göran; Rolandsson, Olov; Daly, Allan; Wasson, Jon; Permutt, Alan; Hattersley, Andrew T.; Glaser, Benjamin; Franks, Paul W.; McCarthy, Mark I.; Wareham, Nicholas J.; Sandhu, Manjinder S.; Barroso, Inês
2010-01-01
OBJECTIVE Wolfram syndrome 1 (WFS1) single nucleotide polymorphisms (SNPs) are associated with risk of type 2 diabetes. In this study we aimed to refine this association and investigate the role of low-frequency WFS1 variants in type 2 diabetes risk. RESEARCH DESIGN AND METHODS For fine-mapping, we sequenced WFS1 exons, splice junctions, and conserved noncoding sequences in samples from 24 type 2 diabetic case and 68 control subjects, selected tagging SNPs, and genotyped these in 959 U.K. type 2 diabetic case and 1,386 control subjects. The same genomic regions were sequenced in samples from 1,235 type 2 diabetic case and 1,668 control subjects to compare the frequency of rarer variants between case and control subjects. RESULTS Of 31 tagging SNPs, the strongest associated was the previously untested 3′ untranslated region rs1046320 (P = 0.008); odds ratio 0.84 and P = 6.59 × 10−7 on further replication in 3,753 case and 4,198 control subjects. High correlation between rs1046320 and the original strongest SNP (rs10010131) (r2 = 0.92) meant that we could not differentiate between their effects in our samples. There was no difference in the cumulative frequency of 82 rare (minor allele frequency [MAF] <0.01) nonsynonymous variants between type 2 diabetic case and control subjects (P = 0.79). Two intermediate frequency (MAF 0.01–0.05) nonsynonymous changes also showed no statistical association with type 2 diabetes. CONCLUSIONS We identified six highly correlated SNPs that show strong and comparable associations with risk of type 2 diabetes, but further refinement of these associations will require large sample sizes (>100,000) or studies in ethnically diverse populations. Low frequency variants in WFS1 are unlikely to have a large impact on type 2 diabetes risk in white U.K. populations, highlighting the complexities of undertaking association studies with low-frequency variants identified by resequencing. PMID:20028947
Chevalier, Christophe; Al Bazzal, Ali; Vidic, Jasmina; Février, Vincent; Bourdieu, Christiane; Bouguyon, Edwige; Le Goffic, Ronan; Vautherot, Jean-François; Bernard, Julie; Moudjou, Mohammed; Noinville, Sylvie; Chich, Jean-François; Da Costa, Bruno; Rezaei, Human; Delmas, Bernard
2010-01-01
The influenza A virus PB1-F2 protein, encoded by an alternative reading frame in the PB1 polymerase gene, displays a high sequence polymorphism and is reported to contribute to viral pathogenesis in a sequence-specific manner. To gain insights into the functions of PB1-F2, the molecular structure of several PB1-F2 variants produced in Escherichia coli was investigated in different environments. Circular dichroism spectroscopy shows that all variants have a random coil secondary structure in aqueous solution. When incubated in trifluoroethanol polar solvent, all PB1-F2 variants adopt an α-helix-rich structure, whereas incubated in acetonitrile, a solvent of medium polarity mimicking the membrane environment, they display β-sheet secondary structures. Incubated with asolectin liposomes and SDS micelles, PB1-F2 variants also acquire a β-sheet structure. Dynamic light scattering revealed that the presence of β-sheets is correlated with an oligomerization/aggregation of PB1-F2. Electron microscopy showed that PB1-F2 forms amorphous aggregates in acetonitrile. In contrast, at low concentrations of SDS, PB1-F2 variants exhibited various abilities to form fibers that were evidenced as amyloid fibers in a thioflavin T assay. Using a recombinant virus and its PB1-F2 knock-out mutant, we show that PB1-F2 also forms amyloid structures in infected cells. Functional membrane permeabilization assays revealed that the PB1-F2 variants can perforate membranes at nanomolar concentrations but with activities found to be sequence-dependent and not obviously correlated with their differential ability to form amyloid fibers. All of these observations suggest that PB1-F2 could be involved in physiological processes through different pathways, permeabilization of cellular membranes, and amyloid fiber formation. PMID:20172856
Chevalier, Christophe; Al Bazzal, Ali; Vidic, Jasmina; Février, Vincent; Bourdieu, Christiane; Bouguyon, Edwige; Le Goffic, Ronan; Vautherot, Jean-François; Bernard, Julie; Moudjou, Mohammed; Noinville, Sylvie; Chich, Jean-François; Da Costa, Bruno; Rezaei, Human; Delmas, Bernard
2010-04-23
The influenza A virus PB1-F2 protein, encoded by an alternative reading frame in the PB1 polymerase gene, displays a high sequence polymorphism and is reported to contribute to viral pathogenesis in a sequence-specific manner. To gain insights into the functions of PB1-F2, the molecular structure of several PB1-F2 variants produced in Escherichia coli was investigated in different environments. Circular dichroism spectroscopy shows that all variants have a random coil secondary structure in aqueous solution. When incubated in trifluoroethanol polar solvent, all PB1-F2 variants adopt an alpha-helix-rich structure, whereas incubated in acetonitrile, a solvent of medium polarity mimicking the membrane environment, they display beta-sheet secondary structures. Incubated with asolectin liposomes and SDS micelles, PB1-F2 variants also acquire a beta-sheet structure. Dynamic light scattering revealed that the presence of beta-sheets is correlated with an oligomerization/aggregation of PB1-F2. Electron microscopy showed that PB1-F2 forms amorphous aggregates in acetonitrile. In contrast, at low concentrations of SDS, PB1-F2 variants exhibited various abilities to form fibers that were evidenced as amyloid fibers in a thioflavin T assay. Using a recombinant virus and its PB1-F2 knock-out mutant, we show that PB1-F2 also forms amyloid structures in infected cells. Functional membrane permeabilization assays revealed that the PB1-F2 variants can perforate membranes at nanomolar concentrations but with activities found to be sequence-dependent and not obviously correlated with their differential ability to form amyloid fibers. All of these observations suggest that PB1-F2 could be involved in physiological processes through different pathways, permeabilization of cellular membranes, and amyloid fiber formation.
Jin, Tianbo; Yang, Hua; Zhang, Jiayi; Yunus, Zulfiya; Sun, Qiang; Geng, Tingting; Chen, Chao; Yang, Jie
2015-01-01
Genetic polymorphisms in CYP3A4 can change its activity to a certain degree, thus leading to differences among different populations in drug efficacy or adverse drug reactions. The study was intended to validate the genetic polymorphisms in CYP3A4 in Uygur Chinese population, we sequenced and screened for genetic variants including 5'UTR, promoters, exons, introns, and 3'UTR region of the whole CYP3A4 gene in 100 unrelated, healthy. Twenty-one genetic polymorphisms in CYP3A4, and nine of them were novel. We detected CYP3A4*8, a putative poor-metabolizer allele, with the frequency of 0.5% in Uygur population. Tfsitescan revealed that the density of transcription factor varied in the different promoter regions, among which some were key regions for transcription factor binding. our results provide basic information about CPY3A4 alleles in Uygur and suggest that the enzymatic activities of CPY3A4 may differ among the diverse ethnic populations of China.
Jin, Tianbo; Yang, Hua; Zhang, Jiayi; Yunus, Zulfiya; Sun, Qiang; Geng, Tingting; Chen, Chao; Yang, Jie
2015-01-01
Purpose: Genetic polymorphisms in CYP3A4 can change its activity to a certain degree, thus leading to differences among different populations in drug efficacy or adverse drug reactions. Methods: The study was intended to validate the genetic polymorphisms in CYP3A4 in Uygur Chinese population, we sequenced and screened for genetic variants including 5’UTR, promoters, exons, introns, and 3’UTR region of the whole CYP3A4 gene in 100 unrelated, healthy. Results: Twenty-one genetic polymorphisms in CYP3A4, and nine of them were novel. We detected CYP3A4*8, a putative poor-metabolizer allele, with the frequency of 0.5% in Uygur population. Tfsitescan revealed that the density of transcription factor varied in the different promoter regions, among which some were key regions for transcription factor binding. Conclusion: our results provide basic information about CPY3A4 alleles in Uygur and suggest that the enzymatic activities of CPY3A4 may differ among the diverse ethnic populations of China. PMID:26261601
Dengue Virus Type 3 Adaptive Changes during Epidemics in São Jose de Rio Preto, Brazil, 2006–2007
Bosch, Irene; Schimitt, Diane; Calzavara-Silva, Carlos E.; de A Zanotto, Paolo M.; Nogueira, Maurício L.
2013-01-01
Global dengue virus spread in tropical and sub-tropical regions has become a major international public health concern. It is evident that DENV genetic diversity plays a significant role in the immunopathology of the disease and that the identification of polymorphisms associated with adaptive responses is important for vaccine development. The investigation of naturally occurring genomic variants may play an important role in the comprehension of different adaptive strategies used by these mutants to evade the human immune system. In order to elucidate this role we sequenced the complete polyprotein-coding region of thirty-three DENV-3 isolates to characterize variants circulating under high endemicity in the city of São José de Rio Preto, Brazil, during the onset of the 2006-07 epidemic. By inferring the evolutionary history on a local-scale and estimating rates of synonymous (dS) and nonsynonimous (dN) substitutions, we have documented at least two different introductions of DENV-3 into the city and detected 10 polymorphic codon sites under significant positive selection (dN/dS > 1) and 8 under significant purifying selection (dN/dS < 1). We found several polymorphic amino acid coding sites in the envelope (15), NS1 (17), NS2A (11), and NS5 (24) genes, which suggests that these genes may be experiencing relatively recent adaptive changes. Furthermore, some polymorphisms correlated with changes in the immunogenicity of several epitopes. Our study highlights the existence of significant and informative DENV variability at the spatio-temporal scale of an urban outbreak. PMID:23667626
Polymorphism and methylation of the MC4R gene in obese and non-obese dogs.
Mankowska, Monika; Nowacka-Woszuk, Joanna; Graczyk, Aneta; Ciazynska, Paulina; Stachowiak, Monika; Switonski, Marek
2017-08-01
The dog is considered to be a useful biomedical model for human diseases and disorders, including obesity. One of the numerous genes associated with human polygenic obesity is MC4R, encoding the melanocortin 4 receptor. The aim of our study was to analyze polymorphisms and methylation of the canine MC4R in relation to adiposity. Altogether 270 dogs representing four breeds predisposed to obesity: Labrador Retriever (n = 187), Golden Retriever (n = 38), Beagle (n = 28) and Cocker Spaniel (n = 17), were studied. The dogs were classified into three groups: lean, overweight and obese, according to the 5-point Body Condition Score (BCS) scale. In the cohort of Labradors a complete phenotypic data (age, sex, neutering status, body weight and BCS) were collected for 127 dogs. The entire coding sequence as well as 5' and 3'-flanking regions of the studied gene were sequenced and six polymorphic sites were reported. Genotype frequencies differed considerably between breeds and Labrador Retrievers appeared to be the less polymorphic. Moreover, distribution of some polymorphic variants differed significantly (P < 0.05) between small cohorts with diverse BCS in Golden Retrievers (c.777T>C, c.868C>T and c.*33C>G) and Beagles (c.-435T>C and c.637G>T). On the contrary, in Labradors no association between the studied polymorphisms and BCS or body weight was observed. Methylation analysis, using bisulfite DNA conversion followed by Sanger sequencing, was carried out for 12 dogs with BCS = 3 and 12 dogs with BCS = 5. Two intragenic CpG islands, containing 19 cytosines, were analyzed and the methylation profile did not differ significantly between lean and obese animals. We conclude that an association of the MC4R gene polymorphism with dog obesity or body weight is unlikely, in spite of the fact that some associations were found in small cohorts of Beagles and Golden Retrievers. Also methylation level of this gene is not related with dog adiposity.
Zhang, X; Wang, C; Zhang, Y; Ju, Z; Qi, C; Wang, X; Huang, J; Zhang, S; Li, J; Zhong, J; Shi, F
2014-10-01
Katanin p60 subunit A-like 1 (KATNAL1) is an ATPase that regulates Sertoli cell microtubule dynamics and sperm retention. We evaluated one novel splice variant and characterized the promoter and a functional single nucleotide polymorphism (SNP) of the bovine KATNAL1 gene to explore its expression pattern, possible regulatory mechanism and relationship with semen traits in Chinese Holstein bulls. A novel splice variant, KATNAL1 transcript variant 2 (KATNAL1-TV2) of the retained 68 bp in intron 2, was identified by RT-PCR and compared with KATNAL1 transcript variant 1 (KATNAL1-TV1, NM 001192918.1) in various tissues. Bioinformatics analyses predicted that KATNAL1 transcription was regulated by two promoters: P1 in KATNAL1-TV1 and P2 in KATNAL1-TV2. Results of qRT-PCR revealed that KATNAL1-TV1 had higher expression than did KATNAL1-TV2 in testes of adult bulls (P < 0.05). Promoter luciferase activity analysis suggested that the core sequences of P1 and P2 were mapped to the region of c.-575˜c.-180 and c.163-40˜c.333+59 respectively. One novel SNP (c.163-210T>C, ss836312085) located in intron 1 was found using sequence alignment. The SNP in P2 resulted in the presence of the DeltaE binding site, improving its basal promoter activity (P < 0.05); and we observed a greater sperm deformity rate in bulls with the genotype CC than in those with the genotype TT (P < 0.05), which indicated that different genotypes were associated with the bovine semen traits. Bioinformatics analysis of the KATNAL1 protein sequence predicted that the loss of the MIT domain in the KATNAL1-TV2 transcript resulted in protein dysfunction. These findings help us to understand that a functional SNP in P2 and subsequent triggering of expression diversity of KATNAL1 transcripts are likely to play an important role with regard to semen traits in bull breeding programs. © 2014 Stichting International Foundation for Animal Genetics.
Lessons for livestock genomics from genome and transcriptome sequencing in cattle and other mammals.
Taylor, Jeremy F; Whitacre, Lynsey K; Hoff, Jesse L; Tizioto, Polyana C; Kim, JaeWoo; Decker, Jared E; Schnabel, Robert D
2016-08-17
Decreasing sequencing costs and development of new protocols for characterizing global methylation, gene expression patterns and regulatory regions have stimulated the generation of large livestock datasets. Here, we discuss experiences in the analysis of whole-genome and transcriptome sequence data. We analyzed whole-genome sequence (WGS) data from 132 individuals from five canid species (Canis familiaris, C. latrans, C. dingo, C. aureus and C. lupus) and 61 breeds, three bison (Bison bison), 64 water buffalo (Bubalus bubalis) and 297 bovines from 17 breeds. By individual, data vary in extent of reference genome depth of coverage from 4.9X to 64.0X. We have also analyzed RNA-seq data for 580 samples representing 159 Bos taurus and Rattus norvegicus animals and 98 tissues. By aligning reads to a reference assembly and calling variants, we assessed effects of average depth of coverage on the actual coverage and on the number of called variants. We examined the identity of unmapped reads by assembling them and querying produced contigs against the non-redundant nucleic acids database. By imputing high-density single nucleotide polymorphism data on 4010 US registered Angus animals to WGS using Run4 of the 1000 Bull Genomes Project and assessing the accuracy of imputation, we identified misassembled reference sequence regions. We estimate that a 24X depth of coverage is required to achieve 99.5 % coverage of the reference assembly and identify 95 % of the variants within an individual's genome. Genomes sequenced to low average coverage (e.g., <10X) may fail to cover 10 % of the reference genome and identify <75 % of variants. About 10 % of genomic DNA or transcriptome sequence reads fail to align to the reference assembly. These reads include loci missing from the reference assembly and misassembled genes and interesting symbionts, commensal and pathogenic organisms. Assembly errors and a lack of annotation of functional elements significantly limit the utility of the current draft livestock reference assemblies. The Functional Annotation of Animal Genomes initiative seeks to annotate functional elements, while a 70X Pac-Bio assembly for cow is underway and may result in a significantly improved reference assembly.
Kozlov, Konstantin N.; Kulakovskiy, Ivan V.; Zubair, Asif; Marjoram, Paul; Lawrie, David S.; Nuzhdin, Sergey V.; Samsonova, Maria G.
2017-01-01
Annotating the genotype-phenotype relationship, and developing a proper quantitative description of the relationship, requires understanding the impact of natural genomic variation on gene expression. We apply a sequence-level model of gap gene expression in the early development of Drosophila to analyze single nucleotide polymorphisms (SNPs) in a panel of natural sequenced D. melanogaster lines. Using a thermodynamic modeling framework, we provide both analytical and computational descriptions of how single-nucleotide variants affect gene expression. The analysis reveals that the sequence variants increase (decrease) gene expression if located within binding sites of repressors (activators). We show that the sign of SNP influence (activation or repression) may change in time and space and elucidate the origin of this change in specific examples. The thermodynamic modeling approach predicts non-local and non-linear effects arising from SNPs, and combinations of SNPs, in individual fly genotypes. Simulation of individual fly genotypes using our model reveals that this non-linearity reduces to almost additive inputs from multiple SNPs. Further, we see signatures of the action of purifying selection in the gap gene regulatory regions. To infer the specific targets of purifying selection, we analyze the patterns of polymorphism in the data at two phenotypic levels: the strengths of binding and expression. We find that combinations of SNPs show evidence of being under selective pressure, while individual SNPs do not. The model predicts that SNPs appear to accumulate in the genotypes of the natural population in a way biased towards small increases in activating action on the expression pattern. Taken together, these results provide a systems-level view of how genetic variation translates to the level of gene regulatory networks via combinatorial SNP effects. PMID:28898266
Jiang, Yue; Turinsky, Andrei L.; Brudno, Michael
2015-01-01
With the development of High-Throughput Sequencing (HTS) thousands of human genomes have now been sequenced. Whenever different studies analyze the same genome they usually agree on the amount of single-nucleotide polymorphisms, but differ dramatically on the number of insertion and deletion variants (indels). Furthermore, there is evidence that indels are often severely under-reported. In this manuscript we derive the total number of indel variants in a human genome by combining data from different sequencing technologies, while assessing the indel detection accuracy. Our estimate of approximately 1 million indels in a Yoruban genome is much higher than the results reported in several recent HTS studies. We identify two key sources of difficulties in indel detection: the insufficient coverage, read length or alignment quality; and the presence of repeats, including short interspersed elements and homopolymers/dimers. We quantify the effect of these factors on indel detection. The quality of sequencing data plays a major role in improving indel detection by HTS methods. However, many indels exist in long homopolymers and repeats, where their detection is severely impeded. The true number of indel events is likely even higher than our current estimates, and new techniques and technologies will be required to detect them. PMID:26130710
Jewell, Brittany E.; Versalovic, Erika M.; Olsen, Randall J.; Bachert, Beth A.; Lukomski, Slawomir; Musser, James M.
2015-01-01
Group A Streptococcus (GAS) predominantly exists as a colonizer of the human oropharynx that occasionally breaches epithelial barriers to cause invasive diseases. Despite the frequency of GAS carriage, few investigations into the contributory molecular mechanisms exist. To this end, we identified a naturally occurring polymorphism in the gene encoding the streptococcal collagen-like protein A (SclA) in GAS carrier strains. All previously sequenced invasive serotype M3 GAS possess a premature stop codon in the sclA gene truncating the protein. The carrier polymorphism is predicted to restore SclA function and was infrequently identified by targeted DNA sequencing in invasive strains of the same serotype. We demonstrate that a strain with the carrier sclA allele expressed a full-length SclA protein, while the strain with the invasive sclA allele expressed a truncated variant. An isoallelic mutant invasive strain with the carrier sclA allele exhibited decreased virulence in a mouse model of invasive disease and decreased multiplication in human blood. Further, the isoallelic invasive strain with the carrier sclA allele persisted in the mouse nasopharynx and had increased adherence to cultured epithelial cells. Repair of the premature stop codon in the invasive sclA allele restored the ability to bind the extracellular matrix proteins laminin and cellular fibronectin. These data demonstrate that a mutation in GAS carrier strains increases adherence and decreases virulence and suggest selection against increased adherence in GAS invasive isolates. PMID:25561712
Multi-species Identification of Polymorphic Peptide Variants via Propagation in Spectral Networks*
Bandeira, Nuno
2016-01-01
Peptide and protein identification remains challenging in organisms with poorly annotated or rapidly evolving genomes, as are commonly encountered in environmental or biofuels research. Such limitations render tandem mass spectrometry (MS/MS) database search algorithms ineffective as they lack corresponding sequences required for peptide-spectrum matching. We address this challenge with the spectral networks approach to (1) match spectra of orthologous peptides across multiple related species and then (2) propagate peptide annotations from identified to unidentified spectra. We here present algorithms to assess the statistical significance of spectral alignments (Align-GF), reduce the impurity in spectral networks, and accurately estimate the error rate in propagated identifications. Analyzing three related Cyanothece species, a model organism for biohydrogen production, spectral networks identified peptides from highly divergent sequences from networks with dozens of variant peptides, including thousands of peptides in species lacking a sequenced genome. Our analysis further detected the presence of many novel putative peptides even in genomically characterized species, thus suggesting the possibility of gaps in our understanding of their proteomic and genomic expression. A web-based pipeline for spectral networks analysis is available at http://proteomics.ucsd.edu/software. PMID:27609420
Lack of association between sigma receptor gene variants and schizophrenia.
Satoh, Fumiaki; Miyatake, Ryosuke; Furukawa, Aizo; Suwaki, Hiroshi
2004-08-01
Several pharmacological studies suggest the possible involvement of sigma(1) receptors in the pathogenesis of schizophrenia. An association has been reported between schizophrenia and two variants (GC-241-240TT and Gln2Pro) in the sigma(1) receptor gene (SIGMAR1). We also previously reported that, along with T-485 A, these two variants alter SIGMAR1 function. To investigate the role of SIGMAR1 in conveying susceptibility to schizophrenia, we performed a case-control study. We initially screened for polymorphisms in the SIGMAR1 coding region using PCR-single strand conformation polymorphism analysis. The distribution of SIGMAR1 polymorphisms was analyzed in 100 schizophrenic and 104 control subjects. A novel G620A variant was detected in exon4. G620A was predicted to alter the amino acid represented by codon 211 from arginine to glutamine. Our case-control study showed no significant association between the T-485 A, GC-241-240TT, Gln2Pro, and G620A (Arg211Gln) variants and schizophrenia and clinical characteristics. These findings suggest that these SIGMAR1 variants may not affect susceptibility to schizophrenia.
Characterization of the COL2A1 VNTR polymorphism
DOE Office of Scientific and Technical Information (OSTI.GOV)
Berg, E.S.; Olaisen, B.
1993-05-01
The variable number of tandem repeat (VNTR) region 3{prime} to the collagen type II gene (COL2A1) was amplified in vitro by the polymerase chain reaction. Subsequent high-resolution gel electrophoresis showed that the five earlier reported alleles could be further subtyped. A total of 17 allelic variants with a heterozygosity of 73.0% were found in 202 unrelated Norwegians. DNA sequencing of 19 COL2A1 alleles has been performed. The internal organization of the VNTR was common for all alleles, as previously shown for a few alleles. Moreover, the polymorphism in the COL2A1 locus is mainly due to variation in the numbers ofmore » copies of two repeat units, containing 34 and 31 bp, respectively, and/or to small deletions in either of the two units. DNA sequencing of alleles with the same electrophoretic size revealed no heterogeneity such as an alternating order of the different units, a feature that might have been expected to be the result of unequal crossing-over events. The observed ordered structure of the VNTR and the possibility of single-stranded DNA from the cores in the VNTR forming hairpins and loops suggest that the COL2A1 polymorphism may have evolved mainly by replication slippage mechanisms. 23 refs., 2 figs., 3 tabs.« less
Variation resources at UC Santa Cruz.
Thomas, Daryl J; Trumbower, Heather; Kern, Andrew D; Rhead, Brooke L; Kuhn, Robert M; Haussler, David; Kent, W James
2007-01-01
The variation resources within the University of California Santa Cruz Genome Browser include polymorphism data drawn from public collections and analyses of these data, along with their display in the context of other genomic annotations. Primary data from dbSNP is included for many organisms, with added information including genomic alleles and orthologous alleles for closely related organisms. Display filtering and coloring is available by variant type, functional class or other annotations. Annotation of potential errors is highlighted and a genomic alignment of the variant's flanking sequence is displayed. HapMap allele frequencies and linkage disequilibrium (LD) are available for each HapMap population, along with non-human primate alleles. The browsing and analysis tools, downloadable data files and links to documentation and other information can be found at http://genome.ucsc.edu/.
The role of ghrelin and ghrelin-receptor gene variants and promoter activity in type 2 diabetes.
Garcia, Edwin A; King, Peter; Sidhu, Kally; Ohgusu, Hideko; Walley, Andrew; Lecoeur, Cecile; Gueorguiev, Maria; Khalaf, Sahira; Davies, Derek; Grossman, Ashley B; Kojima, Masayasu; Petersenn, Stephan; Froguel, Phillipe; Korbonits, Márta
2009-08-01
Ghrelin and its receptor play an important role in glucose metabolism and energy homeostasis, and therefore they are functional candidates for genes carrying susceptibility alleles for type 2 diabetes. We assessed common genetic variation of the ghrelin (GHRL; five single nucleotide polymorphisms (SNP)) and the ghrelin-receptor (GHSR) genes (four SNPs) in 610 Caucasian patients with type 2 diabetes and 820 controls. In addition, promoter reporter assays were conducted to model the regulatory regions of both genes. Neither GHRL nor GHSR gene SNPs were associated with type 2 diabetes. One of the ghrelin haplotypes showed a marginal protective role in type 2 diabetes. We observed profound differences in the regulation of the GHRL gene according to promoter sequence variants. There are three different GHRL promoter haplotypes represented in the studied cohort causing up to 45% difference in the level of gene expression, while the promoter region of GHSR gene is primarily represented by a single haplotype. The GHRL and GHSR gene variants are not associated with type 2 diabetes, although GHRL promoter variants have significantly different activities.
Tang, Rongying; Prosser, Debra O.; Love, Donald R.
2016-01-01
The increasing diagnostic use of gene sequencing has led to an expanding dataset of novel variants that lie within consensus splice junctions. The challenge for diagnostic laboratories is the evaluation of these variants in order to determine if they affect splicing or are merely benign. A common evaluation strategy is to use in silico analysis, and it is here that a number of programmes are available online; however, currently, there are no consensus guidelines on the selection of programmes or protocols to interpret the prediction results. Using a collection of 222 pathogenic mutations and 50 benign polymorphisms, we evaluated the sensitivity and specificity of four in silico programmes in predicting the effect of each variant on splicing. The programmes comprised Human Splice Finder (HSF), Max Entropy Scan (MES), NNSplice, and ASSP. The MES and ASSP programmes gave the highest performance based on Receiver Operator Curve analysis, with an optimal cut-off of score reduction of 10%. The study also showed that the sensitivity of prediction is affected by the level of conservation of individual positions, with in silico predictions for variants at positions −4 and +7 within consensus splice sites being largely uninformative. PMID:27313609
Kowalczyk, Marek; Jakubczak, Andrzej; Horecka, Beata; Kostro, Krzysztof
2018-05-29
The Aleutian mink disease virus (AMDV) is one of the most serious threats to modern mink breeding. The disease can have various courses, from progressive to subclinical infections. The objective of the study was to provide a comparative molecular characterization of isolates of AMDV from farms with a clinical and subclinical course of the disease. The qPCR analysis showed a difference of two orders of magnitude between the number of copies of the viral DNA on the farm with the clinical course of the disease (10 5 ) and the farm with the subclinical course (10 3 ). The sequencing results confirm a high level of homogeneity within each farm and variation between them. The phylogenetic analysis indicates that the variants belonging to different farms are closely related and occupy different branches of the same clade. The in silico analysis of the effect of differences in the sequence encoding the VP2 protein between the farms revealed no effect of the polymorphism on its functionality. The close phylogenetic relationship between the isolates from the two farms, the synonymous nature of most of the polymorphisms and the potentially minor effect on the functionality of the protein indicate that the differences in the clinical picture may be due not only to polymorphisms in the nucleotide and amino acid sequences, but also to the stage of infection on the farm and the degree of stabilization of the pathogen-host relationship.
The sheep growth hormone gene polymorphism and its effects on milk traits.
Dettori, Maria Luisa; Pazzola, Michele; Pira, Emanuela; Paschino, Pietro; Vacca, Giuseppe Massimo
2015-05-01
Growth hormone (GH) is encoded by the GH gene, which may be single copy or duplicate in sheep. The two copies of the sheep GH gene (GH1/GH2-N and GH2-Z) were entirely sequenced in one 106 ewes of Sarda breed, in order to highlight sequence polymorphisms and investigate possible association between genetic variants and milk traits. Milk traits included milk yield, fat, protein, casein and lactose percentage. We evidenced 75 nucleotide changes. Transcription factor binding site prediction revealed two sequences potentially recognised by the pituitary-specific transcription factor POU1FI at the GH1/GH2-N gene, which were lost at the promoter of GH2-Z, which might explain the different tissues of expression of GH1/GH2-N (pituitary) and GH2-Z (placenta). Significant differences in milk traits were observed among genotypes at polymorphic loci only for the GH2-Z gene. Sheep with homozygote genotype ss748770547 CC had higher fat percentage (P < 0.01) than TT. SNP ss748770547 was part of a potential transcription factor binding site for C/EBP alpha (CCAAT/Enhancer Binding Protein), which is involved in the regulation of adipogenesis and adipoblast differentiation. SNP ss748770547, located in the GH2-Z gene 5' flanking region, may be a causal mutation affecting milk fat content. These findings might contribute to the knowledge of the sheep GH locus and might be useful in selection processes in sheep.
Tatonova, Yulia V; Chelomina, Galina N; Nguyen, Hung Manh
2017-11-01
Here we examined the intraspecific genetic variability of Clonorchis sinensis from Russia and Vietnam using nuclear DNA sequences (the 5.8S gene and two internal transcribed spacers of the ribosomal cluster). Despite the low level of variability in the ITS1 region, this marker has revealed some features of C. sinensis across multiple geographic regions. The genetic diversity levels for the Russian and Vietnamese populations were similar (0.1 and 0.09%, respectively) but were significantly lower than the C. sinensis from China (0.31%). About half of the sequences of the Chinese (53%) and Korean (47%) populations and about a tenth of the Vietnamese (12%) and Russian (8%) sequences included a 5bp insertion. No sequences with nucleotide substitutions both upstream and downstream of the 5bp insertion were found within the whole data set. The population of northern China had both sequence variants (with substitutions either upstream or downstream of the insertion), while only one of these variants was presented at the other localities. The Vietnamese population had a higher frequency of intragenomic polymorphism than the Russian population (69% vs. 46% and 23% vs. 3% at the 114bp and 339bp positions, respectively). These data are discussed in connection with parasite origin and adaptation, and also its invasive capacity and drug-resistance. Copyright © 2017 Elsevier B.V. All rights reserved.
Derkach, Andriy; Chiang, Theodore; Gong, Jiafen; Addis, Laura; Dobbins, Sara; Tomlinson, Ian; Houlston, Richard; Pal, Deb K.; Strug, Lisa J.
2014-01-01
Motivation: Sufficiently powered case–control studies with next-generation sequence (NGS) data remain prohibitively expensive for many investigators. If feasible, a more efficient strategy would be to include publicly available sequenced controls. However, these studies can be confounded by differences in sequencing platform; alignment, single nucleotide polymorphism and variant calling algorithms; read depth; and selection thresholds. Assuming one can match cases and controls on the basis of ethnicity and other potential confounding factors, and one has access to the aligned reads in both groups, we investigate the effect of systematic differences in read depth and selection threshold when comparing allele frequencies between cases and controls. We propose a novel likelihood-based method, the robust variance score (RVS), that substitutes genotype calls by their expected values given observed sequence data. Results: We show theoretically that the RVS eliminates read depth bias in the estimation of minor allele frequency. We also demonstrate that, using simulated and real NGS data, the RVS method controls Type I error and has comparable power to the ‘gold standard’ analysis with the true underlying genotypes for both common and rare variants. Availability and implementation: An RVS R script and instructions can be found at strug.research.sickkids.ca, and at https://github.com/strug-lab/RVS. Contact: lisa.strug@utoronto.ca Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24733292
RNF213 Rare Variants in Slovakian and Czech Moyamoya Disease Patients.
Kobayashi, Hatasu; Brozman, Miroslav; Kyselová, Kateřina; Viszlayová, Daša; Morimoto, Takaaki; Roubec, Martin; Školoudík, David; Petrovičová, Andrea; Juskanič, Dominik; Strauss, Jozef; Halaj, Marián; Kurray, Peter; Hranai, Marián; Harada, Kouji H; Inoue, Sumiko; Yoshida, Yukako; Habu, Toshiyuki; Herzig, Roman; Youssefian, Shohab; Koizumi, Akio
2016-01-01
RNF213/Mysterin has been identified as a susceptibility gene for moyamoya disease, a cerebrovascular disease characterized by occlusive lesions in the circle of Willis. The p.R4810K (rs112735431) variant is a founder polymorphism that is strongly associated with moyamoya disease in East Asia. Many non-p.R4810K rare variants of RNF213 have been identified in white moyamoya disease patients, although the ethnic mutations have not been investigated in this population. In the present study, we screened for RNF213 variants in 19 Slovakian and Czech moyamoya disease patients. A total of 69 RNF213 coding exons were directly sequenced in 18 probands and one relative who suffered from moyamoya disease in Slovakia and the Czech Republic. We previously reported one proband harboring RNF213 p.D4013N. Results from the present study identified four rare variants other than p.D4013N (p.R4019C, p.E4042K, p.V4146A, and p.W4677L) in four of the patients. P.V4146A was determined to be a novel de novo mutation, and p.R4019C and p.E4042K were identified as double mutations inherited on the same allele. P.W4677L, found in two moyamoya disease patients and an unaffected subject in the same pedigree, was a rare single nucleotide polymorphism. Functional analysis showed that RNF213 p.D4013N, p.R4019C and p.V4146A-transfected human umbilical vein endothelial cells displayed significant lowered migration, and RNF213 p.V4146A significantly reduced tube formation, indicating that these are disease-causing mutations. Results from the present study identified RNF213 rare variants in 22.2% (4/18 probands) of Slovakian and Czech moyamoya disease patients, confirming that RNF213 may also be a major causative gene in a relative large population of white patients.
RNF213 Rare Variants in Slovakian and Czech Moyamoya Disease Patients
Kyselová, Kateřina; Viszlayová, Daša; Morimoto, Takaaki; Roubec, Martin; Školoudík, David; Petrovičová, Andrea; Juskanič, Dominik; Strauss, Jozef; Halaj, Marián; Kurray, Peter; Hranai, Marián; Harada, Kouji H.; Inoue, Sumiko; Yoshida, Yukako; Habu, Toshiyuki; Herzig, Roman; Youssefian, Shohab; Koizumi, Akio
2016-01-01
RNF213/Mysterin has been identified as a susceptibility gene for moyamoya disease, a cerebrovascular disease characterized by occlusive lesions in the circle of Willis. The p.R4810K (rs112735431) variant is a founder polymorphism that is strongly associated with moyamoya disease in East Asia. Many non-p.R4810K rare variants of RNF213 have been identified in white moyamoya disease patients, although the ethnic mutations have not been investigated in this population. In the present study, we screened for RNF213 variants in 19 Slovakian and Czech moyamoya disease patients. A total of 69 RNF213 coding exons were directly sequenced in 18 probands and one relative who suffered from moyamoya disease in Slovakia and the Czech Republic. We previously reported one proband harboring RNF213 p.D4013N. Results from the present study identified four rare variants other than p.D4013N (p.R4019C, p.E4042K, p.V4146A, and p.W4677L) in four of the patients. P.V4146A was determined to be a novel de novo mutation, and p.R4019C and p.E4042K were identified as double mutations inherited on the same allele. P.W4677L, found in two moyamoya disease patients and an unaffected subject in the same pedigree, was a rare single nucleotide polymorphism. Functional analysis showed that RNF213 p.D4013N, p.R4019C and p.V4146A-transfected human umbilical vein endothelial cells displayed significant lowered migration, and RNF213 p.V4146A significantly reduced tube formation, indicating that these are disease-causing mutations. Results from the present study identified RNF213 rare variants in 22.2% (4/18 probands) of Slovakian and Czech moyamoya disease patients, confirming that RNF213 may also be a major causative gene in a relative large population of white patients. PMID:27736983
Espin‐Garcia, Osvaldo; Craiu, Radu V.
2017-01-01
ABSTRACT We evaluate two‐phase designs to follow‐up findings from genome‐wide association study (GWAS) when the cost of regional sequencing in the entire cohort is prohibitive. We develop novel expectation‐maximization‐based inference under a semiparametric maximum likelihood formulation tailored for post‐GWAS inference. A GWAS‐SNP (where SNP is single nucleotide polymorphism) serves as a surrogate covariate in inferring association between a sequence variant and a normally distributed quantitative trait (QT). We assess test validity and quantify efficiency and power of joint QT‐SNP‐dependent sampling and analysis under alternative sample allocations by simulations. Joint allocation balanced on SNP genotype and extreme‐QT strata yields significant power improvements compared to marginal QT‐ or SNP‐based allocations. We illustrate the proposed method and evaluate the sensitivity of sample allocation to sampling variation using data from a sequencing study of systolic blood pressure. PMID:29239496
Rybicka, Magda; Stalke, Piotr; Dreczewski, Marcin; Smiatacz, Tomasz; Bielawski, Krzysztof Piotr
2014-01-01
Long-term antiviral therapy of chronic hepatitis B virus (HBV) infection can lead to the selection of drug-resistant HBV variants and treatment failure. Moreover, these HBV strains are possibly present in treatment-naive patients. Currently available assays for the detection of HBV drug resistance can identify mutants that constitute ≥5% of the viral population. Furthermore, drug-resistant HBV variants can be detected when a viral load is >10(4) copies/ml (1,718 IU/ml). The aim of this study was to compare matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) and multitemperature single-strand conformation polymorphism (MSSCP) with commercially available assays for the detection of drug-resistant HBV strains. HBV DNA was extracted from 87 serum samples acquired from 45 chronic hepatitis B (CHB) patients. The 37 selected HBV variants were analyzed in 4 separate primer extension reactions on the MALDI-TOF MS. Moreover, MSSCP for identifying drug-resistant HBV YMDD variants was developed and turned out to be more sensitive than INNOLiPA HBV DR and direct sequencing. MALDI-TOF MS had the capability to detect mutant strains within a mixed viral population occurring with an allelic frequency of approximately 1% (with a specific value of ≥10(2) copies/ml, also expressed as ≥17.18 IU/ml). In our study, MSSCP detected 98% of the HBV YMDD variants among strains detected by the MALDI-TOF MS assay. The routine tests revealed results of 40% and 11%, respectively, for INNOLiPA and direct sequencing. The commonly available HBV tests are less sensitive than MALDI-TOF MS in the detection of HBV-resistant variants, including quasispecies.
Horvath, Anelia; Korde, Larissa; Greene, Mark H.; Libe, Rosella; Osorio, Paulo; Faucz, Fabio Rueda; Raffin-Sanson, Marie Laure; Tsang, Kit Man; Drori-Herishanu, Limor; Patronas, Yianna; Remmers, Elaine F; Nikita, Maria-Elena; Moran, Jason; Greene, Joseph; Nesterova, Maria; Merino, Maria; Bertherat, Jerome; Stratakis, Constantine A.
2009-01-01
Inactivating germline mutations in phosphodiesterase 11A (PDE11A) have been implicated in adrenal tumor susceptibility. PDE11A is highly-expressed in endocrine steroidogenic tissues, especially the testis, and mice with inactivated Pde11a exhibit male infertility, a known testicular germ cell tumor (TGCT) risk factor. We sequenced the PDE11A gene-coding region in 95 patients with TGCT from 64 unrelated kindreds. We identified 8 non-synonymous substitutions in 20 patients from 15 families: four (R52T; F258Y; G291R; V820M) were newly-recognized, three (R804H; R867G; M878V) were functional variants previously implicated in adrenal tumor predisposition, and one (Y727C) was a known polymorphism. We compared the frequency of these variants in our patients to unrelated controls that had been screened and found negative for any endocrine diseases: only the two previously-reported variants, R804H and R867G, known to be frequent in general population, were detected in these controls. The frequency of all PDE11A-gene variants (combined) was significantly higher among patients with TGCT (P=0.0002), present in 19% of the families of our cohort. Most variants were detected in the general population, but functional studies showed that all these mutations reduced PDE activity, and that PDE11A protein expression was decreased (or absent) in TGCT samples from carriers. This is the first demonstration of a PDE gene’s involvement in TGCT, although the cAMP signaling pathway has been investigated extensively in other reproductive organs and their diseases. In conclusion, we report that PDE11A-inactivating sequence variants may modify the risk of familial and bilateral TGCT. PMID:19549888
Imajoh, Masayuki; Hashida, Yumiko; Murakami, Masanao; Maeda, Akihiko; Sato, Tetsuya; Fujieda, Mikiya; Wakiguchi, Hiroshi; Daibata, Masanori
2012-06-01
Epstein-Barr virus (EBV) genotypes can be distinguished based on gene sequence differences in EBV nuclear antigens 2, 3A, 3B, and 3C, and the BZLF1 promoter zone (Zp). EBV subtypes and BZLF1 Zp variants were examined in Japanese patients with infectious mononucleosis, chronic active EBV infection, and EBV-associated hemophagocytic lymphohistiocytosis. The results of EBV typing showed that samples of infectious mononucleosis, chronic active EBV infection, and EBV-associated hemophagocytic lymphohistiocytosis all belonged to EBV type 1. However, sequencing analysis of BZLF1 Zp found three polymorphic Zp variants in the same samples. The Zp-P prototype and the Zp-V3 variant were both detected in infectious mononucleosis and chronic active EBV infection. Furthermore, a novel variant previously identified in Chinese children with infectious mononucleosis, Zp-V1, was also found in 3 of 18 samples of infectious mononucleosis, where it coexisted with the Zp-P prototype. This is the first evidence that the EBV variant distribution in Japanese patients resembles that found in other Asian patients. The expression levels of 29 chronic active EBV infection-associated cellular genes were also compared in the three EBV-related disorders, using quantitative real-time reverse transcription polymerase chain reaction analysis. Two upregulated genes, RIPK2 and CDH9, were identified as common specific markers for chronic active EBV infection in both in vitro and in vivo studies. RIPK2 activates apoptosis and autophagy, and could be responsible for the pathogenesis of chronic active EBV infection. Copyright © 2012 Wiley Periodicals, Inc.
Tohno, Masanori; Shinkai, Hiroki; Toki, Daisuke; Okumura, Naohiko; Tajima, Kiyoshi; Uenishi, Hirohide
2016-10-01
The nucleotide-binding domain, leucine-rich-containing family, pyrin-domain containing-3 (NLRP3) inflammasome comprises the major components caspase-1, apoptosis-associated speck-like protein containing a caspase recruitment domain (ASC), and NLRP3. NLRP3 plays important roles in maintaining immune homeostasis mediated by intestinal microorganisms and in the immunostimulatory properties of vaccine adjuvants used to induce an immune response. In the present study, we first cloned a complementary DNA (cDNA) encoding porcine ASC because its genomic sequence was not completely determined. The availability of the ASC cDNA enabled us to reconstitute porcine NLRP3 inflammasomes using an in vitro system that led to the identification of the immune functions of porcine NLRP3 and ASC based on the production of interleukin-1β (IL-1β). Further, we identified six synonymous and six nonsynonymous single-nucleotide polymorphisms (SNPs) in the coding sequence of NLRP3 of six breeds of pigs, including major commercial breeds. Among the nonsynonymous SNPs, the Q969R polymorphism is associated with an increased release of IL-1β compared with other porcine NLRP3 variants, indicating that this polymorphism represents a gain-of-function mutation. This allele was detected in 100 % of the analyzed Chinese Jinhua and Japanese wild boars, suggesting that the allele is maintained in the major commercial native European breeds Landrace, Large White, and Berkshire. These findings represent an important contribution to our knowledge of the diversity of NLRP3 nucleotide sequences among various pig populations. Moreover, efforts to exploit the gain of function induced by the Q969R polymorphism promise to improve pig breeding and husbandry by conferring enhanced resistance to pathogens as well as contributing to vaccine efficacy.
Xavier, Crislaine; Soares, Rógean Vinícius Santos; Amorim, Igor Costa; Cabral-de-Mello, Diogo Cavalcanti; de Cássia de Moura, Rita
2018-03-09
Euchroma Dejean, 1833 (Buprestidae: Coleoptera) is a monotypic genus comprising the species Euchroma gigantea, with populations presenting a degree of karyotypic variation/polymorphism rarely found within a single taxonomic (specific) unit, as well as drastically incompatible meiotic configurations in populations from extremes of the species range. To better understand the complex karyotypic evolution of E. gigantea, the karyotypes of specimens from five populations in Brazil were investigated using molecular cytogenetics and phylogenetic approaches. Herein, we used FISH with histone genes as well as sequencing of the COI to determine differential distribution of markers and relationships among populations. The analyses revealed new karyotypes, with variability for chromosome number and morphology of multiple sex chromosome mechanisms, occurrence of B chromosome variants (punctiform and large ones), and high dispersion of histone genes in different karyotypes. These data indicate that chromosomal polymorphism in E. gigantea is greater than previously reported, and that the species can be a valuable model for cytogenetic studies. The COI phylogenetic and haplotype analyses highlighted the formation of three groups with chromosomally polymorphic individuals. Finally, we compared the different karyotypes and proposed a model for the chromosomal evolution of this species. The species E. gigantea includes at least three cytogenetically polymorphic lineages. Moreover, in each of these lineages, different chromosomal rearrangements have been fixed. Dispersion of repetitive sequences may have favored the high frequency of these rearrangements, which could be related to both adaptation of the species to different habitats and the speciation process.
A novel germline PALB2 deletion in Polish breast and ovarian cancer patients.
Dansonka-Mieszkowska, Agnieszka; Kluska, Anna; Moes, Joanna; Dabrowska, Michalina; Nowakowska, Dorota; Niwinska, Anna; Derlatka, Pawel; Cendrowski, Krzysztof; Kupryjanczyk, Jolanta
2010-02-02
PALB2 protein was recently identified as a partner of BRCA1 and BRCA2 which determines their proper function in DNA repair. Initially, the entire coding sequence of the PALB2 gene with exon/intron boundaries was evaluated by the PCR-SSCP and direct sequencing methods on 70 ovarian carcinomas. Sequence variants of interest were further studied on enlarged groups of ovarian carcinomas (total 339 non-consecutive ovarian carcinomas), blood samples from 334 consecutive sporadic and 648 consecutive familial breast cancer patients, and 1310 healthy controls from central Poland. Ten types of sequence variants were detected, and among them four novel polymorphisms: c.2996+58T>C in intron 9; c.505C>A (p.L169I), c.618T>G (p.L206L), both in exon 4; and c.2135C>T (A712V) in exon 5 of the PALB2 gene. Another two polymorphisms, c.212-58A>C and c.2014G>C (E672Q) were always detected together, both in cancer (7.5% of patients) and control samples (4.9% of controls, p = 0.2). A novel germline truncating mutation, c.509_510delGA (p.R170fs) was found in exon 4: in 2 of 339 (0.6%) unrelated ovarian cancer patients, in 4 of 648 (0.6%) unrelated familial breast cancer patients, and in 1 of 1310 controls (0.08%, p = 0.1, p = 0.044, respectively). One ovarian cancer patient with the PALB2 mutation had also a germline nonsense mutation of the BRCA2 gene. The c.509_510delGA is a novel PALB2 mutation that increases the risk of familial breast cancer. Occurrence of the same PALB2 alteration in seven unrelated women suggests that c.509_510delGA (p.R170fs) is a recurrent mutation for Polish population.
Cridland, Julie M; Thornton, Kevin R
2010-01-13
Several recent studies have focused on the evolution of recently duplicated genes in Drosophila. Currently, however, little is known about the evolutionary forces acting upon duplications that are segregating in natural populations. We used a high-throughput, paired-end sequencing platform (Illumina) to identify structural variants in a population sample of African D. melanogaster. Polymerase chain reaction and sequencing confirmation of duplications detected by multiple, independent paired-ends showed that paired-end sequencing reliably uncovered the break points of structural rearrangements and allowed us to identify a number of tandem duplications segregating within a natural population. Our confirmation experiments show that rates of confirmation are very high, even at modest coverage. Our results also compare well with previous studies using microarrays (Emerson J, Cardoso-Moreira M, Borevitz JO, Long M. 2008. Natural selection shapes genome wide patterns of copy-number polymorphism in Drosophila melanogaster. Science. 320:1629-1631. and Dopman EB, Hartl DL. 2007. A portrait of copy-number polymorphism in Drosophila melanogaster. Proc Natl Acad Sci U S A. 104:19920-19925.), which both gives us confidence in the results of this study as well as confirms previous microarray results.We were also able to identify whole-gene duplications, such as a novel duplication of Or22a, an olfactory receptor, and identify copy-number differences in genes previously known to be under positive selection, like Cyp6g1, which confers resistance to dichlorodiphenyltrichloroethane. Several "hot spots" of duplications were detected in this study, which indicate that particular regions of the genome may be more prone to generating duplications. Finally, population frequency analysis of confirmed events also showed an excess of rare variants in our population, which indicates that duplications segregating in the population may be deleterious and ultimately destined to be lost from the population.
Bergfors, Assar; Leenheer, Daniël; Bergqvist, Anders; Ameur, Adam; Lennerstrand, Johan
2016-02-01
Development of Hepatitis C virus (HCV) resistance against direct-acting antivirals (DAAs), including NS5A inhibitors, is an obstacle to successful treatment of HCV when DAAs are used in sub-optimal combinations. Furthermore, it has been shown that baseline (pre-existing) resistance against DAAs is present in treatment naïve-patients and this will potentially complicate future treatment strategies in different HCV genotypes (GTs). Thus the aim was to detect low levels of NS5A resistant associated variants (RAVs) in a limited sample set of treatment-naïve patients of HCV GT1a and 3a, since such polymorphisms can display in vitro resistance as high as 60000 fold. Ultra-deep single molecule real time (SMRT) sequencing with the Pacific Biosciences (PacBio) RSII instrument was used to detect these RAVs. The SMRT sequencing was conducted on ten samples; three of them positive with Sanger sequencing (GT1a Q30H and Y93N, and GT3a Y93H), five GT1a samples, and two GT3a non-positive samples. The same methods were applied to the HCV GT1a H77-plasmid in a dilution series, in order to determine the error rates of replication, which in turn was used to determine the limit of detection (LOD), as defined by mean + 3SD, of minority variants down to 0.24%. We found important baseline NS5A RAVs at levels between 0.24 and 0.5%, which could potentially have clinical relevance. This new method with low level detection of baseline RAVs could be useful in predicting the most cost-efficient combination of DAA treatment, and reduce the treatment duration for an HCV infected individual. Copyright © 2015 Elsevier B.V. All rights reserved.
PERCH: A Unified Framework for Disease Gene Prioritization.
Feng, Bing-Jian
2017-03-01
To interpret genetic variants discovered from next-generation sequencing, integration of heterogeneous information is vital for success. This article describes a framework named PERCH (Polymorphism Evaluation, Ranking, and Classification for a Heritable trait), available at http://BJFengLab.org/. It can prioritize disease genes by quantitatively unifying a new deleteriousness measure called BayesDel, an improved assessment of the biological relevance of genes to the disease, a modified linkage analysis, a novel rare-variant association test, and a converted variant call quality score. It supports data that contain various combinations of extended pedigrees, trios, and case-controls, and allows for a reduced penetrance, an elevated phenocopy rate, liability classes, and covariates. BayesDel is more accurate than PolyPhen2, SIFT, FATHMM, LRT, Mutation Taster, Mutation Assessor, PhyloP, GERP++, SiPhy, CADD, MetaLR, and MetaSVM. The overall approach is faster and more powerful than the existing quantitative method pVAAST, as shown by the simulations of challenging situations in finding the missing heritability of a complex disease. This framework can also classify variants of unknown significance (variants of uncertain significance) by quantitatively integrating allele frequencies, deleteriousness, association, and co-segregation. PERCH is a versatile tool for gene prioritization in gene discovery research and variant classification in clinical genetic testing. © 2016 The Authors. **Human Mutation published by Wiley Periodicals, Inc.
Identification and validation of loss of function variants in clinical contexts.
Lescai, Francesco; Marasco, Elena; Bacchelli, Chiara; Stanier, Philip; Mantovani, Vilma; Beales, Philip
2014-01-01
The choice of an appropriate variant calling pipeline for exome sequencing data is becoming increasingly more important in translational medicine projects and clinical contexts. Within GOSgene, which facilitates genetic analysis as part of a joint effort of the University College London and the Great Ormond Street Hospital, we aimed to optimize a variant calling pipeline suitable for our clinical context. We implemented the GATK/Queue framework and evaluated the performance of its two callers: the classical UnifiedGenotyper and the new variant discovery tool HaplotypeCaller. We performed an experimental validation of the loss-of-function (LoF) variants called by the two methods using Sequenom technology. UnifiedGenotyper showed a total validation rate of 97.6% for LoF single-nucleotide polymorphisms (SNPs) and 92.0% for insertions or deletions (INDELs), whereas HaplotypeCaller was 91.7% for SNPs and 55.9% for INDELs. We confirm that GATK/Queue is a reliable pipeline in translational medicine and clinical context. We conclude that in our working environment, UnifiedGenotyper is the caller of choice, being an accurate method, with a high validation rate of error-prone calls like LoF variants. We finally highlight the importance of experimental validation, especially for INDELs, as part of a standard pipeline in clinical environments.
Chao, Yu-Kai; Schludi, Verena; Chen, Cheng-Chang; Butz, Elisabeth; Nguyen, O N Phuong; Müller, Martin; Krüger, Jens; Kammerbauer, Claudia; Ben-Johny, Manu; Vollmar, Angelika M; Berking, Carola; Biel, Martin; Wahl-Schott, Christian A; Grimm, Christian
2017-10-10
Two-pore channels (TPCs) are endolysosomal cation channels. Two members exist in humans, TPC1 and TPC2. Functional roles associated with the ubiquitously expressed TPCs include VEGF-induced neoangiogenesis, LDL-cholesterol trafficking and degradation, physical endurance under fasting conditions, autophagy regulation, the acrosome reaction in sperm, cancer cell migration, and intracellular trafficking of pathogens such as Ebola virus or bacterial toxins (e.g., cholera toxin). In a genome-wide association study for variants associated with human pigmentation characteristics two coding variants of TPC2, rs35264875 (encoding M484L) and rs3829241 (encoding G734E), have been found to be associated with a shift from brown to blond hair color. In two recent follow-up studies a role for TPC2 in pigmentation has been further confirmed. However, these human polymorphic variants have not been functionally characterized until now. The development of endolysosomal patch-clamp techniques has made it possible to investigate directly ion channel activities and characteristics in isolated endolysosomal organelles. We applied this technique here to scrutinize channel characteristics of the polymorphic TPC2 variants in direct comparison with WT. We found that both polymorphisms lead to a gain of channel function by independent mechanisms. We next conducted a clinical study with more than 100 blond- and brown/black-haired individuals. We performed a genotype/phenotype analysis and subsequently isolated fibroblasts from WT and polymorphic variant carriers for endolysosomal patch-clamp experimentation to confirm key in vitro findings.
2009-01-01
Background Expressed sequence tags (ESTs) are an important source of gene-based markers such as those based on insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). Several gel based methods have been reported for the detection of sequence variants, however they have not been widely exploited in common bean, an important legume crop of the developing world. The objectives of this project were to develop and map EST based markers using analysis of single strand conformation polymorphisms (SSCPs), to create a transcript map for common bean and to compare synteny of the common bean map with sequenced chromosomes of other legumes. Results A set of 418 EST based amplicons were evaluated for parental polymorphisms using the SSCP technique and 26% of these presented a clear conformational or size polymorphism between Andean and Mesoamerican genotypes. The amplicon based markers were then used for genetic mapping with segregation analysis performed in the DOR364 × G19833 recombinant inbred line (RIL) population. A total of 118 new marker loci were placed into an integrated molecular map for common bean consisting of 288 markers. Of these, 218 were used for synteny analysis and 186 presented homology with segments of the soybean genome with an e-value lower than 7 × 10-12. The synteny analysis with soybean showed a mosaic pattern of syntenic blocks with most segments of any one common bean linkage group associated with two soybean chromosomes. The analysis with Medicago truncatula and Lotus japonicus presented fewer syntenic regions consistent with the more distant phylogenetic relationship between the galegoid and phaseoloid legumes. Conclusion The SSCP technique is a useful and inexpensive alternative to other SNP or Indel detection techniques for saturating the common bean genetic map with functional markers that may be useful in marker assisted selection. In addition, the genetic markers based on ESTs allowed the construction of a transcript map and given their high conservation between species allowed synteny comparisons to be made to sequenced genomes. This synteny analysis may support positional cloning of target genes in common bean through the use of genomic information from these other legumes. PMID:20030833
Cherepkova, E V; Aftanas, L I; Maksimov, N; Menshanov, P N
2016-11-01
Predisposition to antisocial behavior can be related to the presence of certain polymorphic variants of genes encoding dopaminergic system proteins. We studied the frequencies of allele variants and genotypes of variable number tandem repeat polymorphism in 3' untranslated region (3' VTNR) of the dopaminergic transporter SLC6A3 gene in Caucasian men committed socially dangerous violent and non-violent crimes. Alleles with 9 and 10 repeats were most frequent in both the control group and group of men predisposed to antisocial behavior. At the same time, the 10/10 genotype was more frequently observed in the group of men prone to antisocial non-violent behavior. Hence, the presence of certain variants of 3' VTNR polymorphism of SLC6A3 gene in men is associated with predisposition to certain forms of antisocial behavior.
Mahurkar, Swapna; Bhaskar, Seema; Reddy, D Nageshwar; Prakash, Swami; Rao, G Venkat; Singh, Shivaram Prasad; Thomas, Varghese; Chandak, Giriraj Ratan
2008-08-16
Tropical calcific pancreatitis (TCP) is a type of chronic pancreatitis unique to developing countries in tropical regions and one of its important features is invariable progression to diabetes, a condition called fibro-calculous pancreatic diabetes (FCPD), but the nature of diabetes in TCP is controversial. We analysed the recently reported type 2 diabetes (T2D) associated polymorphisms in the TCF7L2 gene using a case-control approach, under the hypothesis that TCF7L2 variants should show similar association if diabetes in FCPD is similar to T2D. We also investigated the interaction between the TCF7L2 variants and N34S SPINK1 and L26V CTSB mutations, since they are strong predictors of risk for TCP. Two polymorphisms rs7903146 and rs12255372 in the TCF7L2 gene were analyzed by direct sequencing in 478 well-characterized TCP patients and 661 healthy controls of Dravidian and Indo-European ethnicities. Their association with TCP with diabetes (FCPD) and without diabetes was tested in both populations independently using chi-square test. Finally, a meta analysis was performed on all the cases and controls for assessing the overall significance irrespective of ethnicity. We dichotomized the whole cohort based on the presence or absence of N34S SPINK1 and L26V CTSB mutations and further subdivided them into TCP and FCPD patients and compared the distribution of TCF7L2 variants between them. The allelic and genotypic frequencies for both TCF7L2 polymorphisms, did not differ significantly between TCP patients and controls belonging to either of the ethnic groups or taken together. No statistically significant association of the SNPs was observed with TCP or FCPD or between carriers and non-carriers of N34S SPINK1 and L26V CTSB mutations. The minor allele frequency for rs7903146 was different between TCP and FCPD patients carrying the N34S SPINK1 variant but did not reach statistical significance (OR = 1.59, 95% CI = 0.93-2.70, P = 0.09), while, TCF7L2variant showed a statistically significant association between TCP and FCPD patients carrying the 26V allele (OR = 1.69, 95% CI = 1.11-2.56, P = 0.013). Type 2 diabetes associated TCF7L2 variants are not associated with diabetes in TCP. Since, TCF7L2 is a major susceptibility gene for T2D, it may be hypothesized that the diabetes in TCP patients may not be similar to T2D. Our data also suggests that co-existence of TCF7L2 variants and the SPINK1 and CTSB mutations, that predict susceptibility to exocrine damage, may interact to determine the onset of diabetes in TCP patients.
Barber, Lisa M; McGrath, Helen E N; Meyer, Stefan; Will, Andrew M; Birch, Jillian M; Eden, Osborn B; Taylor, G Malcolm
2003-04-01
The extent to which genetic susceptibility contributes to the causation of childhood acute myeloid leukaemia (AML) is not known. The inherited bone marrow failure disorder Fanconi anaemia (FA) carries a substantially increased risk of AML, raising the possibility that constitutional variation in the FA (FANC) genes is involved in the aetiology of childhood AML. We have screened genomic DNA extracted from remission blood samples of 97 children with sporadic AML and 91 children with sporadic acute lymphoblastic leukaemia (ALL), together with 104 cord blood DNA samples from newborn children, for variations in the Fanconi anaemia group C (FANCC) gene. We found no evidence of known FANCC pathogenic mutations in children with AML, ALL or in the cord blood samples. However, we detected 12 different FANCC sequence variants, of which five were novel to this study. Among six FANCC variants leading to amino-acid substitutions, one (S26F) was present at a fourfold greater frequency in children with AML than in the cord blood samples (odds ratio: 4.09, P = 0.047; 95% confidence interval 1.08-15.54). Our results thus do not exclude the possibility that this polymorphic variant contributes to the risk of a small proportion of childhood AML.
Epistasis analysis for quantitative traits by functional regression model.
Zhang, Futao; Boerwinkle, Eric; Xiong, Momiao
2014-06-01
The critical barrier in interaction analysis for rare variants is that most traditional statistical methods for testing interactions were originally designed for testing the interaction between common variants and are difficult to apply to rare variants because of their prohibitive computational time and poor ability. The great challenges for successful detection of interactions with next-generation sequencing (NGS) data are (1) lack of methods for interaction analysis with rare variants, (2) severe multiple testing, and (3) time-consuming computations. To meet these challenges, we shift the paradigm of interaction analysis between two loci to interaction analysis between two sets of loci or genomic regions and collectively test interactions between all possible pairs of SNPs within two genomic regions. In other words, we take a genome region as a basic unit of interaction analysis and use high-dimensional data reduction and functional data analysis techniques to develop a novel functional regression model to collectively test interactions between all possible pairs of single nucleotide polymorphisms (SNPs) within two genome regions. By intensive simulations, we demonstrate that the functional regression models for interaction analysis of the quantitative trait have the correct type 1 error rates and a much better ability to detect interactions than the current pairwise interaction analysis. The proposed method was applied to exome sequence data from the NHLBI's Exome Sequencing Project (ESP) and CHARGE-S study. We discovered 27 pairs of genes showing significant interactions after applying the Bonferroni correction (P-values < 4.58 × 10(-10)) in the ESP, and 11 were replicated in the CHARGE-S study. © 2014 Zhang et al.; Published by Cold Spring Harbor Laboratory Press.
de Bruin, Christiaan; Mericq, Verónica; Andrew, Shayne F.; van Duyvenvoorde, Hermine A.; Verkaik, Nicole S.; Losekoot, Monique; Porollo, Aleksey; Garcia, Hernán; Kuang, Yi; Hanson, Dan; Clayton, Peter; van Gent, Dik C.; Wit, Jan M.; Hwa, Vivian
2015-01-01
Context: Severe short stature can be caused by defects in numerous biological processes including defects in IGF-1 signaling, centromere function, cell cycle control, and DNA damage repair. Many syndromic causes of short stature are associated with medical comorbidities including hypogonadism and microcephaly. Objective: To identify an underlying genetic etiology in two siblings with severe short stature and gonadal failure. Design: Clinical phenotyping, genetic analysis, complemented by in vitro functional studies of the candidate gene. Setting: An academic pediatric endocrinology clinic. Patients or Other Participants: Two adult siblings (male patient [P1] and female patient 2 [P2]) presented with a history of severe postnatal growth failure (adult heights: P1, −6.8 SD score; P2, −4 SD score), microcephaly, primary gonadal failure, and early-onset metabolic syndrome in late adolescence. In addition, P2 developed a malignant gastrointestinal stromal tumor at age 28. Intervention(s): Single nucleotide polymorphism microarray and exome sequencing. Results: Combined microarray analysis and whole exome sequencing of the two affected siblings and one unaffected sister identified a homozygous variant in XRCC4 as the probable candidate variant. Sanger sequencing and mRNA studies revealed a splice variant resulting in an in-frame deletion of 23 amino acids. Primary fibroblasts (P1) showed a DNA damage repair defect. Conclusions: In this study we have identified a novel pathogenic variant in XRCC4, a gene that plays a critical role in non-homologous end-joining DNA repair. This finding expands the spectrum of DNA damage repair syndromes to include XRCC4 deficiency causing severe postnatal growth failure, microcephaly, gonadal failure, metabolic syndrome, and possibly tumor predisposition. PMID:25742519
Jackson, Robert; Rosa, Bruce A; Lameiras, Sonia; Cuninghame, Sean; Bernard, Josee; Floriano, Wely B; Lambert, Paul F; Nicolas, Alain; Zehbe, Ingeborg
2016-11-02
Human papillomaviruses (HPVs) are a worldwide burden as they are a widespread group of tumour viruses in humans. Having a tropism for mucosal tissues, high-risk HPVs are detected in nearly all cervical cancers. HPV16 is the most common high-risk type but not all women infected with high-risk HPV develop a malignant tumour. Likely relevant, HPV genomes are polymorphic and some HPV16 single nucleotide polymorphisms (SNPs) are under evolutionary constraint instigating variable oncogenicity and immunogenicity in the infected host. To investigate the tumourigenicity of two common HPV16 variants, we used our recently developed, three-dimensional organotypic model reminiscent of the natural HPV infectious cycle and conducted various "omics" and bioinformatics approaches. Based on epidemiological studies we chose to examine the HPV16 Asian-American (AA) and HPV16 European Prototype (EP) variants. They differ by three non-synonymous SNPs in the transforming and virus-encoded E6 oncogene where AAE6 is classified as a high- and EPE6 as a low-risk variant. Remarkably, the high-risk AAE6 variant genome integrated into the host DNA, while the low-risk EPE6 variant genome remained episomal as evidenced by highly sensitive Capt-HPV sequencing. RNA-seq experiments showed that the truncated form of AAE6, integrated in chromosome 5q32, produced a local gene over-expression and a large variety of viral-human fusion transcripts, including long distance spliced transcripts. In addition, differential enrichment of host cell pathways was observed between both HPV16 E6 variant-containing epithelia. Finally, in the high-risk variant, we detected a molecular signature of host chromosomal instability, a common property of cancer cells. We show how naturally occurring SNPs in the HPV16 E6 oncogene cause significant changes in the outcome of HPV infections and subsequent viral and host transcriptome alterations prone to drive carcinogenesis. Host genome instability is closely linked to viral integration into the host genome of HPV-infected cells, which is a key phenomenon for malignant cellular transformation and the reason for uncontrolled E6 oncogene expression. In particular, the finding of variant-specific integration potential represents a new paradigm in HPV variant biology.
Chopra, Ratan; Burow, Gloria; Farmer, Andrew; Mudge, Joann; Simpson, Charles E; Wilkins, Thea A; Baring, Michael R; Puppala, Naveen; Chamberlin, Kelly D; Burow, Mark D
2015-06-01
Single-nucleotide polymorphisms, which can be identified in the thousands or millions from comparisons of transcriptome or genome sequences, are ideally suited for making high-resolution genetic maps, investigating population evolutionary history, and discovering marker-trait linkages. Despite significant results from their use in human genetics, progress in identification and use in plants, and particularly polyploid plants, has lagged. As part of a long-term project to identify and use SNPs suitable for these purposes in cultivated peanut, which is tetraploid, we generated transcriptome sequences of four peanut cultivars, namely OLin, New Mexico Valencia C, Tamrun OL07 and Jupiter, which represent the four major market classes of peanut grown in the world, and which are important economically to the US southwest peanut growing region. CopyDNA libraries of each genotype were used to generate 2 × 54 paired-end reads using an Illumina GAIIx sequencer. Raw reads were mapped to a custom reference consisting of Tifrunner 454 sequences plus peanut ESTs in GenBank, compromising 43,108 contigs; 263,840 SNP and indel variants were identified among four genotypes compared to the reference. A subset of 6 variants was assayed across 24 genotypes representing four market types using KASP chemistry to assess the criteria for SNP selection. Results demonstrated that transcriptome sequencing can identify SNPs usable as selectable DNA-based markers in complex polyploid species such as peanut. Criteria for effective use of SNPs as markers are discussed in this context.
Kapplinger, Jamie D; Pundi, Krishna N; Larson, Nicholas B; Callis, Thomas E; Tester, David J; Bikker, Hennie; Wilde, Arthur A M; Ackerman, Michael J
2018-02-01
Pathogenic RYR2 variants account for ≈60% of clinically definite cases of catecholaminergic polymorphic ventricular tachycardia. However, the rate of rare benign RYR2 variants identified in the general population remains a challenge for genetic test interpretation. Therefore, we examined the results of the RYR2 genetic test among patients referred for commercial genetic testing and examined factors impacting variant interpretability. Frequency and location comparisons were made for RYR2 variants identified among 1355 total patients of varying clinical certainty and 60 706 Exome Aggregation Consortium controls. The impact of the clinical phenotype on the yield of RYR2 variants was examined. Six in silico tools were assessed using patient- and control-derived variants. A total of 18.2% (218/1200) of patients referred for commercial testing hosted rare RYR2 variants, statistically less than the 59% (46/78) yield among clinically definite cases, resulting in a much higher potential genetic false discovery rate among referrals considering the 3.2% background rate of rare, benign RYR2 variants. Exclusion of clearly putative pathogenic variants further complicates the interpretation of the next novel RYR2 variant. Exonic/topologic analyses revealed overrepresentation of patient variants in exons covering only one third of the protein. In silico tools largely failed to show evidence toward enhancement of variant interpretation. Current expert recommendations have resulted in increased use of RYR2 genetic testing in patients with questionable clinical phenotypes. Using the largest to date catecholaminergic polymorphic ventricular tachycardia patient versus control comparison, this study highlights important variables in the interpretation of variants to overcome the 3.2% background rate that confounds RYR2 variant interpretation. © 2018 American Heart Association, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Inayama, Y.; Yoneda, H.; Sakai, T.
Sixty-two patients with schizophrenia and 96 normal controls were investigated for genetic association with restriction fragment length polymorphisms (RFLPs) in the serotonin receptor genes. A positive association between the serotonin 2A receptor gene (HTR2A) and schizophrenia was found, but not between schizophrenia and the serotonin 1A receptor gene. The positive association we report here would suggest that the DNA region with susceptibility to schizophrenia lies in the HTR2A on the long arm of chromosome 13. 15 refs., 2 tabs.
An ATP2B4 polymorphism protects against malaria in pregnancy.
Bedu-Addo, George; Meese, Stefanie; Mockenhaupt, Frank P
2013-05-15
Polymorphisms of ATP2B4 encoding an ubiquitous Ca(2+) pump protect against severe childhood malaria. We assessed the influence of a main polymorphism (rs10900585) on malaria among 834 delivering Ghanaian women. In homozygous primiparae, the odds of placental Plasmodium falciparum infection were reduced by 64%. No influence of the polymorphism on parasite density, low birth weight, or preterm delivery was discernible. However, malarial anemia was greatly reduced in primiparous carriers of the variant allele, paralleling the reduced impact of malaria on hemoglobin levels in this group. A common ATP2B4 polymorphism protects against malaria in pregnancy and related maternal anemia, suggesting ATP2B4 variant associated protection not to be limited to severe childhood malaria.
McGregor, N W; Hemmings, S M J; Erdman, L; Calmarza-Font, I; Stein, D J; Lochner, C
2016-12-30
The monoamine oxidases (MAOA/B) and catechol-O-methyltransferase (COMT) enzymes break down regulatory components within serotonin and dopamine pathways, and polymorphisms within these genes are candidates for OCD susceptibility. Childhood trauma has been linked OCD psychopathology, but little attention has been paid to the interactions between genes and environment in OCD aetiology. This pilot study investigated gene-by-environment interactions between childhood trauma and polymorphisms in the MAOA, MAOB and COMT genes in OCD. Ten polymorphisms (MAOA: 3 variants, MAOB: 4 variants, COMT: 3 variants) were genotyped in a cohort of OCD patients and controls. Early-life trauma was assessed using the Childhood Trauma Questionnaire (CTQ). Gene-by-gene (GxG) and gene-by-environment interactions (GxE) of the variants and childhood trauma were assessed using logistic regression models. Significant GxG interactions were found between rs362204 (COMT) and two independent polymorphisms in the MAOB gene (rs1799836 and rs6651806). Haplotype associations for OCD susceptibility were found for MAOB. Investigation of GxE interactions indicated that the sexual abuse sub-category was significantly associated with all three genes in haplotype x environment interaction analyses. Preliminary findings indicate that polymorphisms within the MAOB and COMT genes interact resulting in risk for OCD. Childhood trauma interacts with haplotypes in COMT, MAOA and MAOB, increasing risk for OCD. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Kim, Minjoo; Kim, Minkyung; Yoo, Hye Jin; Lee, Eunji; Chae, Jey Sook; Lee, Sang-Hyun; Lee, Jong Ho
2017-01-01
Hypertriglyceridemia is recognized as an independent risk factor for coronary artery disease. The apolipoprotein A5 gene (APOA5) is a key regulator of triglyceride levels. We aimed to evaluate the associations of single nucleotide polymorphisms (SNPs) in APOA5, including -1131T>C and c.553G>T, with hypertriglyceridemia, apoA5 concentrations, atherogenic LDL cholesterol levels, and arterial stiffness in hypertriglyceridemic patients. The study population included 599 hypertriglyceridemic patients (case) and 1,549 untreated normotriglyceridemic subjects (control). We genotyped two APOA5 variants, -1131T>C (rs662799) and c.553G>T (rs2075291). The frequencies of the CC genotype of -1131T>C (0.165) and the T allele of c.553G>T (0.119) were significantly higher in hypertriglyceridemic patients than in normotriglyceridemic subjects (0.061 and 0.070, respectively; all p<0.001). In the control and case groups, both the -1131T>C and c.553G>T variants were associated with higher triglyceride and lower HDL cholesterol levels. Controls with the -1131CC variant had lower apoA5 concentrations than controls with the -1131TT variant. Similar effects of the -1131T>C variant on apoA5 were observed in the cases. In the hypertriglyceridemic group, the -1131T>C variant was associated with a smaller LDL particle size, higher levels of oxidized LDL and malondialdehyde, and higher brachial-ankle pulse wave velocity. The -1131T>C and c.553G>T polymorphisms were associated with hypertriglyceridemia in the study population, but only the -1131T>C polymorphism directly affected apoA5 concentrations. Hypertriglyceridemic patients carrying the APOA5 -1131T>C polymorphism exhibited increased atherogenic LDL levels and arterial stiffness, probably due to an effect of the -1131T>C polymorphism on apoA5 concentrations.
Mlakar, Simona Jurkovic; Ostanek, Barbara
2011-01-01
Gilbert's syndrome is the most common hereditary disorder of bilirubin metabolism. The causative mutation in Caucasians is almost exclusively a (TA) dinucleotide insertion in the UGT1A1 promoter. Affected individuals are homozygous for the variant promoter and have 7 TA repeats instead of 6. Promoters with 5 and 8 TA repeats also exist but are extremely rare in Caucasians. The aim of our study was to develop denaturing high-performance liquid chromatography (DHPLC) assay for genotyping UGT1A1(TA)n polymorphism and to compare it with a previously described single-strand conformation polymorphism (SSCP) assay. Fifty DNA samples with common genotypes ((TA)6/6, (TA)6/7, (TA)7/7) as well as 7 samples with one of the following rare genotypes- (TA)5/6, (TA)5/7, (TA)6/8 or (TA)7/8 were amplified by polymerase chain reaction (PCR) and genotyped by DHPLC using sizing mode. All samples were previously genotyped by SSCP assay which was validated by sequencing analysis. All samples with either common or rare genotypes showed completely concordant results between DHPLC and SSCP assays. Our results show that sizing DHPLC assay is more efficient compared to classical SSCP assay due to shorter time of genotyping analysis, ability of genotyping increased number of samples per day, higher robustness, reproducibility and cost-effectiveness with no loss of accuracy in detection of all UGT1A1(TA)n genotypes. We developed a new DHPLC assay which is suitable for accurate, automated, highthroughput, robust genotyping of all UGT1A1(TA)n polymorphism variants, compared to a labour intensive and time-consuming SSCP assay.
Human papillomavirus type 56 polymorphism in Canadian women with and without cervical lesions.
Rodrigues-Coutlée, Catherine; Archambault, Jacques; Money, Deborah; Ramanakumar, Agnihotram V; Raboud, Janet; Hankins, Catherine; Koushik, Anita; Richardson, Harriet; Brassard, Paul; Franco, Eduardo L; Coutlée, Francois
2013-12-01
The genomic diversity of high-risk human papillomaviruses (HPV) has been associated with viral persistence and HPV-induced lesions. Studies on HPV56 persistence are still pending. To assess the association between HPV56 polymorphism and HPV56 persistence and presence of high-grade cervical intraepithelial neoplasia (CIN2,3) or cancer. HPV56-positive cervical specimens from 204 women selected from a total of 4669 participants recruited in 5 epidemiological studies (parent studies) were further analyzed by PCR-sequencing of the long control region (LCR). Of the 81 women followed prospectively in cohort studies who could be classified, 34 had persistent and 47 had transient HPV56 infections. Variant HPV56-LCR-MTL-21 was detected more frequently in persistent infections (52.9%, 95% CI: 36.7-68.6%) than in transient infections (25.5%, 95% CI: 15.1-39.4). Considering only women recruited in a cohort of women infected or at high risk for HIV infection, infection with variant HPV56-LCR-MTL-21 (OR=4.4, 95% CI: 1.3-14.5) was significantly associated with HPV56 persistence controlling in multivariate analysis for high risk HPV detection and HIV infection. A variation at nucleotide 7800 in HPV56-LCR-MTL-21 resulted in the loss of a binding site for Elf-1 embedded in one of the E2 binding sites, a potential activator or repressor of expression of the HPV genome. HPV56 polymorphism was not associated with CIN2,3 or cancer in women enrolled in cross-sectional and case-control studies. Polymorphism in HPV56 may influence the risk that infections with this type will persist. Copyright © 2013 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Toye, P.G.; Metzelaar, M.J.; Wijngaard, P.L.J.
1995-08-01
Theileria parva, a tick-transmitted protozoan parasite related to Plasmodium spp., causes the disease East Coast fever, an acute and usually fatal lymphoproliferative disorder of cattle in Africa. Previous studies using sera from cattle that have survived infection identified a polymorphic immunodominant molecule (PIM) that is expressed by both the infective sporozoite stage of the parasite and the intracellular schizont. Here we show that mAb specific for the PIM Ag can inhibit sporozoite invasion of lymphocytes in vitro. A cDNA clone encoding the PIM Ag of the T. parva (Muguga) stock was obtained by using these mAb in a novel eukaryoticmore » expression cloning system that allows isolation of cDNA encoding cytoplasmic or surface Ags. To establish the molecular basis of the polymorphism of PIM, the cDNA of the PIM Ag from a buffalo-derived T. parva stock was isolated and its sequence was compared with that of the cattle-derived Muguga PIM. The two cDNAs showed considerable identity in both the 5{prime} and 3{prime} regions, but there was substantial sequence divergence in the central regions. Several types of repeated sequences were identified in the variant regions. In the Muguga form of the molecule, there were five tandem repeats of the tetrapeptide, QPEP, that were shown, by transfection of a deleted version of the PIM gene, not to react with several anti-PIM mAbs. By isolating and sequencing the genomic version of the gene, we identified two small introns in the 3{prime} region of the gene. Finally, we showed that polyclonal rat Abs against recombinant PIM neutralize sporozoite infectivity in vitro, suggesting that the PIM Ag should be evaluated for its capacity to immunize cattle against East Coast Fever.« less
Bandarian, Fatemeh; Daneshpour, Maryam Sadat; Hedayati, Mehdi; Naseri, Mohsen; Azizi, Fereidoun
2016-01-01
Apolipoprotein A2 (APOA2) is the second major apolipoprotein of the high-density lipoprotein cholesterol (HDL-C). The study aim was to identify APOA2 gene variation in individuals within two extreme tails of HDL-C levels and its relationship with HDL-C level. This cross-sectional survey was conducted on participants from Tehran Glucose and Lipid Study (TLGS) at Research Institute for Endocrine Sciences, Tehran, Iran from April 2012 to February 2013. In total, 79 individuals with extreme low HDL-C levels (≤5th percentile for age and gender) and 63 individuals with extreme high HDL-C levels (≥95th percentile for age and gender) were selected. Variants were identified using DNA amplification and direct sequencing. Screen of all exons and the core promoter region of APOA2 gene identified nine single nucleotide substitutions and one microsatellite; five of which were known and four were new variants. Of these nine variants, two were common tag single nucleotide polymorphisms (SNPs) and seven were rare SNPs. Both exonic substitutions were missense mutations and caused an amino acid change. There was a significant association between the new missense mutation (variant Chr.1:16119226, Ala98Pro) and HDL-C level. None of two common tag SNPs of rs6413453 and rs5082 contributes to the HDL-C trait in Iranian population, but a new missense mutation in APOA2 in our population has a significant association with HDL-C.
Green, Nancy S.; Ender, Katherine L.; Pashankar, Farzana; Driscoll, Catherine; Giardina, Patricia J.; Mullen, Craig A.; Clark, Lorraine N.; Manwani, Deepa; Crotty, Jennifer; Kisselev, Sergey; Neville, Kathleen A.; Hoppe, Carolyn; Barral, Sandra
2013-01-01
Background Fetal hemoglobin level is a heritable complex trait that strongly correlates swith the clinical severity of sickle cell disease. Only few genetic loci have been identified as robustly associated with fetal hemoglobin in patients with sickle cell disease, primarily adults. The sole approved pharmacologic therapy for this disease is hydroxyurea, with effects largely attributable to induction of fetal hemoglobin. Methodology/Principal Findings In a multi-site observational analysis of children with sickle cell disease, candidate single nucleotide polymorphisms associated with baseline fetal hemoglobin levels in adult sickle cell disease were examined in children at baseline and induced by hydroxyurea therapy. For baseline levels, single marker analysis demonstrated significant association with BCL11A and the beta and epsilon globin loci (HBB and HBE, respectively), with an additive attributable variance from these loci of 23%. Among a subset of children on hydroxyurea, baseline fetal hemoglobin levels explained 33% of the variance in induced levels. The variant in HBE accounted for an additional 13% of the variance in induced levels, while variants in the HBB and BCL11A loci did not contribute beyond baseline levels. Conclusions/Significance These findings clarify the overlap between baseline and hydroxyurea-induced fetal hemoglobin levels in pediatric disease. Studies assessing influences of specific sequence variants in these and other genetic loci in larger populations and in unusual hydroxyurea responders are needed to further understand the maintenance and therapeutic induction of fetal hemoglobin in pediatric sickle cell disease. PMID:23409025
Dong, Chengliang; Wei, Peng; Jian, Xueqiu; Gibbs, Richard; Boerwinkle, Eric; Wang, Kai; Liu, Xiaoming
2015-01-01
Accurate deleteriousness prediction for nonsynonymous variants is crucial for distinguishing pathogenic mutations from background polymorphisms in whole exome sequencing (WES) studies. Although many deleteriousness prediction methods have been developed, their prediction results are sometimes inconsistent with each other and their relative merits are still unclear in practical applications. To address these issues, we comprehensively evaluated the predictive performance of 18 current deleteriousness-scoring methods, including 11 function prediction scores (PolyPhen-2, SIFT, MutationTaster, Mutation Assessor, FATHMM, LRT, PANTHER, PhD-SNP, SNAP, SNPs&GO and MutPred), 3 conservation scores (GERP++, SiPhy and PhyloP) and 4 ensemble scores (CADD, PON-P, KGGSeq and CONDEL). We found that FATHMM and KGGSeq had the highest discriminative power among independent scores and ensemble scores, respectively. Moreover, to ensure unbiased performance evaluation of these prediction scores, we manually collected three distinct testing datasets, on which no current prediction scores were tuned. In addition, we developed two new ensemble scores that integrate nine independent scores and allele frequency. Our scores achieved the highest discriminative power compared with all the deleteriousness prediction scores tested and showed low false-positive prediction rate for benign yet rare nonsynonymous variants, which demonstrated the value of combining information from multiple orthologous approaches. Finally, to facilitate variant prioritization in WES studies, we have pre-computed our ensemble scores for 87 347 044 possible variants in the whole-exome and made them publicly available through the ANNOVAR software and the dbNSFP database. PMID:25552646
Yang, Yuhong; Mu, Yunxiang; Zhao, Yu; Liu, Xinyu; Zhao, Lili; Wang, Junmei; Xie, Yonghong
2007-05-01
To investigate the association between the mutations in lipoprotein lipase gene and hypertriglyceridemia (HTG). The lipoprotein lipase (LPL) gene was screened for mutations in 386 Chinese subjects with (108 cases in the HTG group) or without HTG (278 cases in the control group), by single-strand conformation polymorphism (SSCP) analysis and DNA sequencing. One novel silent mutation L103L, one missense mutation P207L, three splicing mutations Int3/3'-ass/C(-6) --> T, and the common S447X polymorphism has been identified in the whole coding region and exon-intron junctions of the LPL gene were examined. Heterozygous P207L found in the HTG group was the first case reported in Asia and subsequently another P207L heterozygote was found in the proband's family, all of which suggested that P207L was one of the causes of familial combined hyperlipidemia, but was not so prevalent as that in French Canadian. Int3/3'-ass/C(-6) --> T was found in both groups in the present study although it was regarded as a pathogenic variant to HTG earlier on. Moreover about the beneficial polymorphism S447X, there was also some supportive evidence that the levels of triglycerides (TG) in S447X carriers were significantly lower than noncarriers in the subjects without HTG. The association between the LPL variants and HTG is quite complicated and versatile, genotyping of LPL in a larger-scale screening should be necessary and justifiable.
Yang, Shao-Hua; Bi, Xiao-Jun; Xie, Yan; Li, Cong; Zhang, Sheng-Li; Zhang, Qin; Sun, Dong-Xiao
2015-11-05
Phosphodiesterase9A (PDE9A) is a cyclic guanosine monophosphate (cGMP)-specific enzyme widely expressed among the tissues, which is important in activating cGMP-dependent signaling pathways. In our previous genome-wide association study, a single nucleotide polymorphism (SNP) (BTA-55340-no-rs(b)) located in the intron 14 of PDE9A, was found to be significantly associated with protein yield. In addition, we found that PDE9A was highly expressed in mammary gland by analyzing its mRNA expression in different tissues. The objectives of this study were to identify genetic polymorphisms of PDE9A and to determine the effects of these variants on milk production traits in dairy cattle. DNA sequencing identified 11 single nucleotide polymorphisms (SNPs) and six SNPs in 5' regulatory region were genotyped to test for the subsequent association analyses. After Bonferroni correction for multiple testing, all these identified SNPs were statistically significant for one or more milk production traits (p < 0.0001~0.0077). Interestingly, haplotype-based association analysis revealed similar effects on milk production traits (p < 0.01). In follow-up RNA expression analyses, two SNPs (c.-1376 G>A, c.-724 A>G) were involved in the regulation of gene expression. Consequently, our findings provide confirmatory evidences for associations of PDE9A variants with milk production traits and these identified SNPs may serve as genetic markers to accelerate Chinese Holstein breeding program.
Balancing Selection and Its Effects on Sequences in Nearby Genome Regions
Charlesworth, Deborah
2006-01-01
Our understanding of balancing selection is currently becoming greatly clarified by new sequence data being gathered from genes in which polymorphisms are known to be maintained by selection. The data can be interpreted in conjunction with results from population genetics models that include recombination between selected sites and nearby neutral marker variants. This understanding is making possible tests for balancing selection using molecular evolutionary approaches. Such tests do not necessarily require knowledge of the functional types of the different alleles at a locus, but such information, as well as information about the geographic distribution of alleles and markers near the genes, can potentially help towards understanding what form of balancing selection is acting, and how long alleles have been maintained. PMID:16683038
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lumbroso, R.; Vasiliou, M.; Beitel, L.K.
1994-09-01
Exon 1 at the X-linked androgen receptor (AR) locus encodes an N-terminal modulatory domain that contains two large homopolyamino acid tracts: (CAG;glutamine;Gln){sub 11-33} and (GGN;Glycine;Cly){sub 15-27}. Certain AR mutations cause partial androgen insensitivity (PAI) with frank genital ambiguity that may engender appreciable parental anxiety and patient morbidity. If the AR mutation in a PAI family is unknown, the AR`s intragenic trinucleotide repeat polymorphisms may be used for prenatal diagnosis. However, intergenerational instability of repeat-size may be worrisome, particularly when the information alleles differ by only a few repeats. Here, we report the discovery of a codon-usage (silent substitution) variant inmore » the GGN repeat, and describe its use as a source of complementary information for prenatal diagnosis. The standard sense sequence of the (GGN){sub n} tract is (GGT){sub 3} GGG(GGT){sub 2} (GGC){sub 9-21}. On 4 of 27 X chromosomes we noted that the internal GGT sequence was expanded to 3 or 4 repeats. We used an internal (GGT){sub 4} repeat in a total (GGN){sub 24} tract together with a (CAG){sub 20} tract to distinguish an X chromosome with a mutant AR allele from another X chromosome, bearing a normal allele, that had an internal (GGT){sub 2} repeat in a total (GGN){sub 23} tract together with a (CAG){sub 21} tract. Subsequently, we found the base change leading to a pathogenic amino acid substitution (M779I) in codon 6 of the mutant AR gene in an affected maternal aunt and the fetus at risk. This confirmed the prenatal diagnosis based on the intragenic trinucleotide repeat polymorphisms, and it strengthened the prediction of external genital ambiguity using our previous experience with M779I in another family.« less
Wang, S; Wang, J; Fan, M-J; Li, T-Y; Pan, H; Wang, X; Liu, H-K; Lin, Q-F; Zhang, J-G; Guan, L-P; Zhernakova, D V; O'Brien, S J; Feng, Z-R; Chang, L; Dai, E-H; Lu, J-H; Xi, H-L; Zeng, Z; Yu, Y-Y; Wang, B-B
2018-03-27
The underlying mechanism of coexistence of hepatitis B surface antigen (HBsAg) and hepatitis B surface antigen antibody (anti-HBs) is still controversial. To identify the host genetic factors related to this unusual clinical phenomenon, a two-stage study was conducted in the Chinese Han population. In the first stage, we performed a case-control (1:1) age- and gender-matched study of 101 cases with concurrent HBsAg and anti-HBs and 102 controls with negative HBsAg and positive anti-HBs using whole exome sequencing. In the second validation stage, we directly sequence the 16 exons on the OAS3 gene in two dependent cohorts of 48 cases and 200 controls. Although, in the first stage, a genome-wide association study of 58,563 polymorphism variants in 101 cases and 102 controls found no significant loci (P-value ≤ .05/58563), and neither locus achieved a conservative genome-wide significance threshold (P-value ≤ 5e-08), gene-based burden analysis showed that OAS3 gene rare variants were associated with the coexistence of HBsAg and anti-HBs. (P-value = 4.127e-06 ≤ 0.05/6994). A total of 16 rare variants were screened out from 21 cases and 3 controls. In the second validation stage, one case with a stop-gained rare variant was identified. Fisher's exact test of all 149 cases and 302 controls showed that the rare coding sequence mutations were more frequent in cases vs controls (P-value = 7.299e-09, OR = 17.27, 95% CI [5.01-58.72]). Protein-coding rare variations on the OAS3 gene are associated with the coexistence of HBsAg and anti-HBs in patients with chronic HBV infection in Chinese Han population. © 2018 John Wiley & Sons Ltd.
Association between MTHFR variant and diabetic neuropathy.
Kakavand Hamidi, Armita; Radfar, Mania; Amoli, Mahsa M
2018-02-01
Methylene-tetrahydrofolate reductase (MTHFR) gene variant may play an important role in the pathophysiology of diabetes and its complications due to its influence on plasma homocysteine levels and also its effect on scavenging peroxynitrite radicals. Diabetic peripheral neuropathy (DPN) is one of the most common diabetic chronic complications. The aim of this study was to investigate the relationship between diabetic neuropathy and MTHFR gene C677T and 1298A ⁄C polymorphisms. Patients with type 2 diabetes N=248 were enrolled in the study, consisting of patients with neuropathy (N=141) and patients without neuropathy (N=107). MTHFR C677T polymorphism was analyzed using polymerase chain reaction followed by restriction fragment length polymorphism (PCR-RFLP) of genomic DNA for genotyping of samples. 1298A/C polymorphism was evaluated using ARMS-PCR. There was a significant difference in MTHFR polymorphism between the groups with and without neuropathy. Our results suggest that MTHFR 677 variant confer risk for diabetic neuropathy among Iranian patients with type 2 diabetes. Copyright © 2017 Institute of Pharmacology, Polish Academy of Sciences. Published by Elsevier B.V. All rights reserved.
Wen, Bo; Xu, Shaohang; Sheynkman, Gloria M; Feng, Qiang; Lin, Liang; Wang, Quanhui; Xu, Xun; Wang, Jun; Liu, Siqi
2014-11-01
Single nucleotide variations (SNVs) located within a reading frame can result in single amino acid polymorphisms (SAPs), leading to alteration of the corresponding amino acid sequence as well as function of a protein. Accurate detection of SAPs is an important issue in proteomic analysis at the experimental and bioinformatic level. Herein, we present sapFinder, an R software package, for detection of the variant peptides based on tandem mass spectrometry (MS/MS)-based proteomics data. This package automates the construction of variation-associated databases from public SNV repositories or sample-specific next-generation sequencing (NGS) data and the identification of SAPs through database searching, post-processing and generation of HTML-based report with visualized interface. sapFinder is implemented as a Bioconductor package in R. The package and the vignette can be downloaded at http://bioconductor.org/packages/devel/bioc/html/sapFinder.html and are provided under a GPL-2 license. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Xu, Yuejuan; Li, Tingting; Pu, Tian; Cao, Ruixue; Long, Fei; Chen, Sun; Sun, Kun; Xu, Rang
2017-12-01
Congenital heart disease (CHD) is one of the most common birth defects. More than 200 susceptibility loci have been identified for CHDs, yet a large part of the genetic risk factors remain unexplained. Monozygotic (MZ) twins are thought to be completely genetically identical; however, discordant phenotypes have been found in MZ twins. Recent studies have demonstrated genetic differences between MZ twins. We aimed to test whether copy number variants (CNVs) and/or genetic mutation differences play a role in the etiology of CHDs by using single nucleotide polymorphism (SNP) genotyping arrays and whole exome sequencing of twin pairs discordant for CHDs. Our goal was to identify mutations present only in the affected twins, which could identify novel candidates for CHD susceptibility loci. We present a comprehensive analysis for the CNVs and genetic mutation results of the selected individuals but detected no consistent differences within the twin pairs. Our study confirms that chromosomal structure or genetic mutation differences do not seem to play a role in the MZ twins discordant for CHD.
NASA Astrophysics Data System (ADS)
Abuzahra, M. A. M.; Jakaria; Listyarini, K.; Furqon, A.; Sumantri, C.; Uddin, M. J.; Gunawan, A.
2018-05-01
High-throughput RNA sequencing (RNA-Seq) reveals new challenges for the detection of transcriptome variants (SNPs) in different tissues and species. The aims of this study was to characterize a SNP discovery analysis in the sheep meat odour and flavour transcriptome using RNA-Seq. Six liver samples from divergent sheep meat odour and flavour were analyzed using the Illumina Genome Hiseq 2500 Analyzer. The SNP detection analysis revealed 142 SNPs in sheep meat samples, and a large number of those corresponded to differences between high and low sheep meat odour and flavour ovis genome assembly OAR v4.0. Among them, about 90.4% of genes had multiple polymorphisms within 12 genes (JAML, ANGPTL8, LOC101103463, SEPW1, SCN5A, LOC101113036, DOCK6, GTSE1, KIF12, KCTD17, KANK2, CYP2A6). Several of the SNPs (JAML, CYP2A6, SEPW1, and KIF12) found in this study could be included as suitable markers in genotyping platforms to perform association analyses in commercial populations and apply genomic selection protocols in the sheep meat production.
Muddana, Venkata; Park, James; Lamb, Janette; Yadav, Dhiraj; Papachristou, Georgios I; Hawes, Robert H; Brand, Randall; Slivka, Adam; Whitcomb, David C
2010-11-01
Platelet-derived growth factor [beta] (PDGF-[beta]) is a major signal in proliferation and matrix synthesis through activated pancreatic stellate cells, leading to fibrosis of the pancreas. Recurrent acute pancreatitis (RAP) seems to predispose to chronic pancreatitis (CP) in some patients but not others. We tested the hypothesis that 2 known PDGF-[beta] polymorphisms are associated with progression from RAP to CP. We also tested the hypothesis that PDGF-[beta] polymorphisms in combination with environmental risk factors such as alcohol and smoking are associated with CP. Three hundred eighty-two patients with CP (n = 176) and RAP (n = 206) and 251 controls were evaluated. Platelet-derived growth factor [beta] polymorphisms +286 A/G (rs#1800818) seen in 5'-UTR and +1135 A/C (rs#1800817) in first intron were genotyped using single-nucleotide polymorphism polymerase chain reaction approach and confirmed by DNA sequencing. The genotypic frequencies for PDGF-[beta] polymorphisms in positions +286 and +1135 were found to be similar in controls and patients with RAP and CP. There was no difference in genotypic frequencies among RAP, CP, and controls in subjects in the alcohol and smoking subgroups. Known variations in the PDGF-[beta] gene do not have a significant effect on promoting or preventing fibrogenesis in pancreatitis. Further evaluation of this important pathway is warranted.
Mankowska, M; Stachowiak, M; Graczyk, A; Ciazynska, P; Gogulski, M; Nizanski, W; Switonski, M
2016-04-01
Obesity is an emerging health problem in purebred dogs. Due to their crucial role in energy homeostasis control, genes encoding adipokines are considered candidate genes, and their variants may be associated with predisposition to obesity. Searching for polymorphism was carried out in three adipokine genes (TNF, RETN and IL6). The study was performed on 260 dogs, including lean (n = 109), overweight (n = 88) and obese (n = 63) dogs. The largest cohort was represented by Labrador Retrievers (n = 136). Altogether, 24 novel polymorphisms were identified: 12 in TNF (including one missense SNP), eight in RETN (including one missense SNP) and four in IL6. Distributions of five common SNPs (two in TNF, two in RETN and one in IL6) were further analyzed with regard to body condition score. Two SNPs in the non-coding parts of TNF (c.-40A>C and c.233+14G>A) were associated with obesity in Labrador dogs. The obtained results showed that the studied adipokine genes are highly polymorphic and two polymorphisms in the TNF gene may be considered as markers predisposing Labrador dogs to obesity. © 2015 Stichting International Foundation for Animal Genetics.
Plasmodium vivax rhomboid-like protease 1 gene diversity in Thailand.
Mataradchakul, Touchchapol; Uthaipibull, Chairat; Nosten, Francois; Vega-Rodriguez, Joel; Jacobs-Lorena, Marcelo; Lek-Uthai, Usa
2017-10-01
Plasmodium vivax infection remains a major public health problem, especially along the Thailand border regions. We examined the genetic diversity of this parasite by analyzing single-nucleotide polymorphisms (SNPs) of the P. vivax rhomboid-like protease 1 gene (Pvrom1) in parasites collected from western (Tak province, Thai-Myanmar border) and eastern (Chanthaburi province, Thai-Cambodia border) regions. Data were collected by a cross-sectional survey, consisting of 47 and 45 P. vivax-infected filter paper-spotted blood samples from the western and eastern regions of Thailand, respectively during September 2013 to May 2014. Extracted DNA was examined for presence of P. vivax using Plasmodium species-specific nested PCR. Pvrom1 gene was PCR amplified, sequenced and the SNP diversity was analyzed using F-STAT, DnaSP, MEGA and LIAN programs. Comparison of sequences of the 92 Pvrom1 831-base open reading frames with that of a reference sequence (GenBank acc. no. XM001615211) revealed 17 samples with a total of 8 polymorphic sites, consisting of singleton (exon 3, nt 645) and parsimony informative (exon 1, nt 22 and 39; exon 3, nt 336, 537 and 656; and exon 4, nt 719 and 748) sites, which resulted in six different deduced Pvrom1 variants. Non-synonymous to synonymous substitutions ratio estimated by the DnaSP program was 1.65 indicating positive selection, but the Z-tests of selection showed no significant deviations from neutrality for Pvrom1 samples from western region of Thailand. In addition McDonald Kreitman test (MK) showed not significant, and Fst values are not different between the two regions and the regions combined. Interestingly, only Pvrom1 exon 2 was the most conserved sequences among the four exons. The relatively high degree of Pvrom1 polymorphism suggests that the protein is important for parasite survival in face of changes in both insect vector and human populations. These polymorphisms could serve as a sensitive marker for studying plasmodial genetic diversity. The significance of Pvrom1 conserved exon 2 sequence remains to be investigated. Copyright © 2017 Mahidol University. Published by Elsevier Inc. All rights reserved.
Jo, Jihoon; Oh, Jooseong; Lee, Hyun-Gwan; Hong, Hyun-Hee; Lee, Sung-Gwon; Cheon, Seongmin; Kern, Elizabeth M A; Jin, Soyeong; Cho, Sung-Jin; Park, Joong-Ki; Park, Chungoo
2017-01-01
The Japanese sea cucumber (Apostichopus japonicus Selenka 1867) is an economically important species as a source of seafood and ingredient in traditional medicine. It is mainly found off the coasts of northeast Asia. Recently, substantial exploitation and widespread biotic diseases in A. japonicus have generated increasing conservation concern. However, the genomic knowledge base and resources available for researchers to use in managing this natural resource and to establish genetically based breeding systems for sea cucumber aquaculture are still in a nascent stage. A total of 312 Gb of raw sequences were generated using the Illumina HiSeq 2000 platform and assembled to a final size of 0.66 Gb, which is about 80.5% of the estimated genome size (0.82 Gb). We observed nucleotide-level heterozygosity within the assembled genome to be 0.986%. The resulting draft genome assembly comprising 132 607 scaffolds with an N50 value of 10.5 kb contains a total of 21 771 predicted protein-coding genes. We identified 6.6-14.5 million heterozygous single nucleotide polymorphisms in the assembled genome of the three natural color variants (green, red, and black), resulting in an estimated nucleotide diversity of 0.00146. We report the first draft genome of A. japonicus and provide a general overview of the genetic variation in the three major color variants of A. japonicus. These data will help provide a comprehensive view of the genetic, physiological, and evolutionary relationships among color variants in A. japonicus, and will be invaluable resources for sea cucumber genomic research. © The Author 2017. Published by Oxford University Press.
Shchepotina, E G; Vavilin, V A; Goreva, O B; Lyakhovich, V V
2006-06-01
Analysis of variants of exon 7 sequences in cytochrome P450 gene 3A4 in a sample of Caucasoid persons was carried out. The effect of these variants on activity of CYP3A was assessed by the level of cortisol 6beta-hydroxylation. Alleles CYP3A4*5 and *17 were not detected: probably, these mutations are rare and consequently they have little effect on the character of polymorphic distribution of CYP3A4 activity in this population. The incidence of CYP3A4*2 was 5.26%. The 6betaOH-cortisol/cortisol ratio in an individual with CYP3A4*2/*2 genotype was 7.408, which corresponded to "slow metabolizer" phenotype in this sample.
Mainardi-Novo, D T O; Santos, A S; Fukui, R T; Gamberini, M; Correia, M R S; Ruiz, M O; Mangueira, C L P; Matioli, S R; Vasconcelos, D M; Silva, M E R
2013-01-01
Interleukin (IL)-21 and protein tyrosine phosphatase non-receptor 22 (PTPN22) regulate lymphocyte function and have been implicated in the pathogenesis of autoimmune diabetes. We sequenced the proximal promoter of the IL-21 gene for the first time and analysed the PTPN22 1858T polymorphism in type 1A diabetes (T1AD) patients and healthy controls (HC). We correlated the frequencies of islet and extra-pancreatic autoantibodies with genotypes from both loci. The case series comprised 612 T1AD patients and 792 HC. Genotyping of PTPN22 C1858T was performed on 434 T1AD patients and 689 HC. The −448 to +83 base pairs (bp) region of the IL-21 gene was sequenced in 309 Brazilian T1AD and 189 HC subjects. We also evaluated human leucocyte antigen (HLA) DR3/DR4 alleles. The frequencies of glutamic acid decarboxylase (GAD65), tyrosine phosphatase-like protein (IA)-2, anti-nuclear antibody (ANA), thyroid peroxidase (TPO), thyroglobulin (TG), thyrotrophin receptor autoantibody (TRAb), anti-smooth muscle (ASM) and 21-hydroxylase (21-OH) autoantibodies were higher in T1AD patients than in HC. The PTPN22 1858T allele was associated with an increased risk for developing T1AD [odds ratio (OR) = 1·94; P < 0·001], particularly in patients of European ancestry, and with a higher frequency of GAD65 and TG autoantibodies. HLA-DR3/DR4 alleles predominated in T1AD patients. A heterozygous allelic IL-21 gene variant (g.-241 T > A) was found in only one patient. In conclusion, only PTPN22 C1858T polymorphism and HLA-DR3 and/or DR4 alleles, but not allelic variants in the 5′-proximal region of the IL-21 gene were associated with T1AD risk. Patients with T1AD had increased frequencies of anti-islet-cell, anti-thyroid, anti-nuclear, anti-smooth muscle and anti-21-OH autoantibodies. The C1858T PTPN22 polymorphism was also associated with a higher frequency of GAD65 and TG autoantibodies. PMID:23480181
Mainardi-Novo, D T O; Santos, A S; Fukui, R T; Gamberini, M; Correia, M R S; Ruiz, M O; Mangueira, C L P; Matioli, S R; Vasconcelos, D M; Silva, M E R
2013-04-01
Interleukin (IL)-21 and protein tyrosine phosphatase non-receptor 22 (PTPN22) regulate lymphocyte function and have been implicated in the pathogenesis of autoimmune diabetes. We sequenced the proximal promoter of the IL-21 gene for the first time and analysed the PTPN22 1858T polymorphism in type 1A diabetes (T1AD) patients and healthy controls (HC). We correlated the frequencies of islet and extra-pancreatic autoantibodies with genotypes from both loci. The case series comprised 612 T1AD patients and 792 HC. Genotyping of PTPN22 C1858T was performed on 434 T1AD patients and 689 HC. The -448 to +83 base pairs (bp) region of the IL-21 gene was sequenced in 309 Brazilian T1AD and 189 HC subjects. We also evaluated human leucocyte antigen (HLA) DR3/DR4 alleles. The frequencies of glutamic acid decarboxylase (GAD65), tyrosine phosphatase-like protein (IA)-2, anti-nuclear antibody (ANA), thyroid peroxidase (TPO), thyroglobulin (TG), thyrotrophin receptor autoantibody (TRAb), anti-smooth muscle (ASM) and 21-hydroxylase (21-OH) autoantibodies were higher in T1AD patients than in HC. The PTPN22 1858T allele was associated with an increased risk for developing T1AD [odds ratio (OR) = 1·94; P < 0·001], particularly in patients of European ancestry, and with a higher frequency of GAD65 and TG autoantibodies. HLA-DR3/DR4 alleles predominated in T1AD patients. A heterozygous allelic IL-21 gene variant (g.-241 T > A) was found in only one patient. In conclusion, only PTPN22 C1858T polymorphism and HLA-DR3 and/or DR4 alleles, but not allelic variants in the 5'-proximal region of the IL-21 gene were associated with T1AD risk. Patients with T1AD had increased frequencies of anti-islet-cell, anti-thyroid, anti-nuclear, anti-smooth muscle and anti-21-OH autoantibodies. The C1858T PTPN22 polymorphism was also associated with a higher frequency of GAD65 and TG autoantibodies. © 2012 British Society for Immunology.
2014-01-01
Background Drug metabolism via the cytochrome P450 (CYP450) system has emerged as an important determinant in the occurrence of several drug interactions (adverse drug reactions, reduced pharmacological effect, drug toxicities). In particular, CYP3A4 and CYP3A5 (interacting with more than 60% of licensed drugs) exhibit the most individual variations of gene expression, mostly caused by single nucleotide polymorphisms (SNPs) within the regulatory region of the CYP3A4 and CYP3A5 genes which might affect the level of enzyme production. In this study, we sought to improve the performance of sensitive screening for CYP3A polymorphism detection in twenty HIV-1 infected patients undergoing lopinavir/ritonavir (LPV/r) monotherapy. Methods The study was performed by an effective, easy and inexpensive home-made Polymerase Chain Reaction Direct Sequencing approach for analyzing CYP3A4 and CYP3A5 genes which can detect both reported and unreported genetic variants potentially associated with altered or decreased functions of CYP3A4 and CYP3A5 proteins. Proportions and tests of association were used. Results Among the genetic variants considered, CYP3A4*1B (expression of altered function) was only found in 3 patients (15%) and CYP3A5*3 (expression of splicing defect) in 3 other patients (15%). CYP3A5*3 did not appear to be associated with decreased efficacy of LPV/r in any patient, since none of the patients carrying this variant showed virological rebound during LPV/r treatment or low levels of TDM. In contrast, low-level virological rebound was observed in one patient and a low TDM level was found in another; both were carrying CYP3A4*1B. Conclusions Our method exhibited an overall efficiency of 100% (DNA amplification and sequencing in our group of patients). This may contribute to producing innovative results for better understanding the inter-genotypic variability in gene coding for CYP3A, and investigating SNPs as biological markers of individual response to drugs requiring metabolism via the cytochrome P450 system. PMID:24986243
Silvestre, Rafaele T; Delmonico, Lucas; Bravo, Maryah; Santiago, Fábio; Scherrer, Luciano R; Moreira, Aline Dos Santos; Tabalipa, Marianne; Otero, Ubirani; Ornellas, Maria Helena F; Alves, Gilda
2017-12-01
Gas station workers are exposed to chemicals known to be carcinogenic, especially benzene. The objective was to analyze the health problems of female gas station workers by means of sociodemographic and clinical questionnaires, and laboratorial exams. We performed the genotyping of the polymorphisms BRCA1/P871L and BRCA1/Q356R by Polymerase Chain Reaction-Restriction Fragment Length Polymorphism, and of variant allele BRCA2/N372H through direct sequencing. The female workers showed a higher concentration of monocytes (P = 0.039); a greater number of spontaneous abortions (P = 0.025, OR = 4.977, 95% CI = 1.135-30.669); higher tobacco consumption (P = 0.013); and higher alcohol consumption (P = 0.05). The statistical analysis of the polymorphisms associated with the variables monocyte concentration and miscarriage number did not reveal a significant relationship, and smoking and spontaneous abortion were not statistically associated either. Environ. Mol. Mutagen. 58:730-734, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Mismatch repair polymorphisms and the risk of colorectal cancer.
Berndt, Sonja I; Platz, Elizabeth A; Fallin, M Daniele; Thuita, Lucy W; Hoffman, Sandra C; Helzlsouer, Kathy J
2007-04-01
Rare germline variants in mismatch repair genes have been linked to hereditary nonpolyposis colorectal cancer; however, it is unknown whether common polymorphisms in these genes alter the risk of colorectal cancer. To examine the association between common variants in mismatch repair genes and colorectal cancer, we conducted a case-cohort study within the CLUE II cohort. Four single nucleotide polymorphisms in 3 mismatch repair genes (MSH3 R940Q, MSH3 T1036A, MSH6 G39E and MLH1 I219V) were genotyped in 237 colorectal cancer cases and a subcohort of 2,189 participants. Incidence rate ratios (RRs) and 95% confidence intervals (95% CIs) for each polymorphism were estimated. The MSH3 1036A variant was found to be associated with an increased risk of colorectal cancer (RR=1.28, 95% CI: 0.94-1.74 and RR=1.65, 95% CI: 1.01-2.70 for the AT and TT genotypes, respectively, with p(trend)=0.02), particularly proximal colon cancer. Although the MSH3 940Q variant was only weakly associated with colorectal cancer overall (p(trend)=0.07), it was associated with a significant increased risk of proximal colon cancer (RR=1.69, 95% CI: 1.10-2.61 and RR=2.68, 95% CI: 0.96-7.47 for the RQ and QQ genotypes, respectively with p(trend)=0.005). Processed meat intake appeared to modify the association between the MSH3 polymorphisms and colorectal cancer (p(interaction) < 0.10 for both). No association was observed with the MSH6 and MLH1 polymorphisms overall. This study suggests that common polymorphisms in the mismatch repair gene, MSH3, may increase the risk of colorectal cancer, especially proximal colon cancer. (c) 2006 Wiley-Liss, Inc.
How important are rare variants in common disease?
Saint Pierre, Aude; Génin, Emmanuelle
2014-09-01
Genome-wide association studies have uncovered hundreds of common genetic variants involved in complex diseases. However, for most complex diseases, these common genetic variants only marginally contribute to disease susceptibility. It is now argued that rare variants located in different genes could in fact play a more important role in disease susceptibility than common variants. These rare genetic variants were not captured by genome-wide association studies using single nucleotide polymorphism-chips but with the advent of next-generation sequencing technologies, they have become detectable. It is now possible to study their contribution to common disease by resequencing samples of cases and controls or by using new genotyping exome arrays that cover rare alleles. In this review, we address the question of the contribution of rare variants in common disease by taking the examples of different diseases for which some resequencing studies have already been performed, and by summarizing the results of simulation studies conducted so far to investigate the genetic architecture of complex traits in human. So far, empirical data have not allowed the exclusion of many models except the most extreme ones involving only a small number of rare variants with large effects contributing to complex disease. To unravel the genetic architecture of complex disease, case-control data will not be sufficient, and alternative study designs need to be proposed together with methodological developments. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Criscione, Andrea; Cunsolo, Vincenzo; Tumino, Serena; Di Francesco, Antonella; Bordonaro, Salvatore; Muccilli, Vera; Saletti, Rosaria; Marletta, Donata
2018-06-01
In the last years, donkey milk had evidenced a renewed interest as a potential functional food and a breast milk substitute. In this light, the study of the protein composition assumes an important role. In particular, β-lactoglobulin (β-LG), which is considered as one of the main allergenic milk protein, in donkey species consists of two molecular forms, namely β-LG I and β-LG II. In the present research, a genetic analysis coupled with a proteomic approach showed the presence of a new allele, here named F, which is apparently associated with a null or a severely reduced expression of β-LG II protein. The new β-LG II F genetic variant shows a theoretical average mass (M av ) of 18,310.64 Da, a value practically corresponding with that of the variant D (∆ mass < 0.07 Da), but differs from β-LG II D for two amino acid substitutions: Thr 100 (variant F) → Ala 100 (variant D) and Thr 118 (variant F) → Met 118 (variant D). Proteomic investigation of the whey protein fraction of an individual milk sample, homozygous FF at β-LG II locus, allowed to identify, as very minor component, the new β-LG II F genetic variant. By MS/MS analysis of enzymatic digests, the sequence of the β-LG II F was characterized, and the predicted genomic data confirmed.
Claudio-Campos, Karla; Labastida, Aurora; Ramos, Alga; Gaedigk, Andrea; Renta-Torres, Jessicca; Padilla, Dariana; Rivera-Miranda, Giselle; Scott, Stuart A; Ruaño, Gualberto; Cadilla, Carmen L; Duconge-Soler, Jorge
2017-01-01
Existing algorithms account for ~50% of observed variance in warfarin dose requirements after including common polymorphisms. However, they do not perform as well in populations other than Caucasians, in part because some ethno-specific genetic variants are overlooked. The objective of the present study was to identify genetic polymorphisms that can explain variability in warfarin dose requirements among Caribbean Hispanics of Puerto Rico. Next-Generation Sequencing of candidate genes CYP2C9 and VKORC1 and genotyping by DMET® Plus Assay of cardiovascular patients were performed. We also aimed at characterizing the genomic structure and admixture pattern of this study cohort. Our study used the Extreme Discordant Phenotype approach to perform a case-control association analysis. The CYP2C9 variant rs2860905, which was found in all the major haplotypes occurring in the Puerto Rican population, showed stronger association with warfarin sensitivity (<4 mg/day) than common variants CYP2C9 * 2 and CYP2C9 * 3 . Although, CYP2C9 * 2 and CYP2C9 * 3 are separately contained within two of the haplotypes, 10 subjects with the sensitive phenotype were carriers of only the CYP2C9 rs2860905 variant. Other polymorphisms in CES2 and ABCB1 were found to be associated with warfarin resistance. Incorporation of rs 2860905 in a regression model ( R 2 = 0.63, MSE = 0.37) that also includes additional genetics (i.e., VKORC1 -1639 G>A; CYP2C9 rs1856908; ABCB1 c.IVS9-44A>G/ rs10276036; CES2 c.269-965A>G/ rs4783745) and non-genetic factors (i.e., hypertension, diabetes and age) showed better prediction of warfarin dose requirements than CYP2C9 * 2 and CYP2C9 * 3 combined (partial R 2 = 0.132 vs. 0.023 and 0.007, respectively, p < 0.001). The genetic background of Puerto Ricans in the study cohort showed a tri-hybrid admixture pattern, with a slightly higher than expected contribution of Native American ancestry (25%). The genomic diversity of Puerto Ricans is highlighted by the presence of four different major haplotype blocks in the CYP2C9 locus. Although, our findings need further replication, this study contributes to the field by identifying novel genetic variants that increase predictability of stable warfarin dosing among Caribbean Hispanics.
Genotyping microarray (gene chip) for the ABCR (ABCA4) gene.
Jaakson, K; Zernant, J; Külm, M; Hutchinson, A; Tonisson, N; Glavac, D; Ravnik-Glavac, M; Hawlina, M; Meltzer, M R; Caruso, R C; Testa, F; Maugeri, A; Hoyng, C B; Gouras, P; Simonelli, F; Lewis, R A; Lupski, J R; Cremers, F P M; Allikmets, R
2003-11-01
Genetic variation in the ABCR (ABCA4) gene has been associated with five distinct retinal phenotypes, including Stargardt disease/fundus flavimaculatus (STGD/FFM), cone-rod dystrophy (CRD), and age-related macular degeneration (AMD). Comparative genetic analyses of ABCR variation and diagnostics have been complicated by substantial allelic heterogeneity and by differences in screening methods. To overcome these limitations, we designed a genotyping microarray (gene chip) for ABCR that includes all approximately 400 disease-associated and other variants currently described, enabling simultaneous detection of all known ABCR variants. The ABCR genotyping microarray (the ABCR400 chip) was constructed by the arrayed primer extension (APEX) technology. Each sequence change in ABCR was included on the chip by synthesis and application of sequence-specific oligonucleotides. We validated the chip by screening 136 confirmed STGD patients and 96 healthy controls, each of whom we had analyzed previously by single strand conformation polymorphism (SSCP) technology and/or heteroduplex analysis. The microarray was >98% effective in determining the existing genetic variation and was comparable to direct sequencing in that it yielded many sequence changes undetected by SSCP. In STGD patient cohorts, the efficiency of the array to detect disease-associated alleles was between 54% and 78%, depending on the ethnic composition and degree of clinical and molecular characterization of a cohort. In addition, chip analysis suggested a high carrier frequency (up to 1:10) of ABCR variants in the general population. The ABCR genotyping microarray is a robust, cost-effective, and comprehensive screening tool for variation in one gene in which mutations are responsible for a substantial fraction of retinal disease. The ABCR chip is a prototype for the next generation of screening and diagnostic tools in ophthalmic genetics, bridging clinical and scientific research. Copyright 2003 Wiley-Liss, Inc.
Cristina Kenney, M.; Chwa, Marilyn; Atilano, Shari R.; Falatoonzadeh, Payam; Ramirez, Claudio; Malik, Deepika; Tarek, Mohamed; Cáceres-del-Carpio, Javier; Nesburn, Anthony B.; Boyer, David S.; Kuppermann, Baruch D.; Vawter, Marquis; Michal Jazwinski, S.; Miceli, Michael; Wallace, Douglas C.; Udar, Nitin
2014-01-01
Age-related macular degeneration (AMD) is the leading cause of vision loss in developed countries. While linked to genetic polymorphisms in the complement pathway, there are many individuals with high risk alleles that do not develop AMD, suggesting that other ‘modifiers’ may be involved. Mitochondrial (mt) haplogroups, defined by accumulations of specific mtDNA single nucleotide polymorphisms (SNPs) which represent population origins, may be one such modifier. J haplogroup has been associated with high risk for AMD while the H haplogroup is protective. It has been difficult to assign biological consequences for haplogroups so we created human ARPE-19 cybrids (cytoplasmic hybrids), which have identical nuclei but mitochondria of either J or H haplogroups, to investigate their effects upon bioenergetics and molecular pathways. J cybrids have altered bioenergetic profiles compared with H cybrids. Q-PCR analyses show significantly lower expression levels for seven respiratory complex genes encoded by mtDNA. J and H cybrids have significantly altered expression of eight nuclear genes of the alternative complement, inflammation and apoptosis pathways. Sequencing of the entire mtDNA was carried out for all the cybrids to identify haplogroup and non-haplogroup defining SNPs. mtDNA can mediate cellular bioenergetics and expression levels of nuclear genes related to complement, inflammation and apoptosis. Sequencing data suggest that observed effects are not due to rare mtDNA variants but rather the combination of SNPs representing the J versus H haplogroups. These findings represent a paradigm shift in our concepts of mt–nuclear interactions. PMID:24584571
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mumbrekar, Kamalesh Dattaram; Bola Sadashiva, Satish Rao; Kabekkodu, Shama Prasada
Purpose: Heterogeneity in radiation therapy (RT)-induced normal tissue toxicity is observed in 10% of cancer patients, limiting the therapeutic outcomes. In addition to treatment-related factors, normal tissue adverse reactions also manifest from genetic alterations in distinct pathways majorly involving DNA damage–repair genes, inflammatory cytokine genes, cell cycle regulation, and antioxidant response. Therefore, the common sequence variants in these radioresponsive genes might modify the severity of normal tissue toxicity, and the identification of the same could have clinical relevance as a predictive biomarker. Methods and Materials: The present study was conducted in a cohort of patients with breast cancer to evaluatemore » the possible associations between genetic variants in radioresponsive genes described previously and the risk of developing RT-induced acute skin adverse reactions. We tested 22 genetic variants reported in 18 genes (ie, NFE2L2, OGG1, NEIL3, RAD17, PTTG1, REV3L, ALAD, CD44, RAD9A, TGFβR3, MAD2L2, MAP3K7, MAT1A, RPS6KB2, ZNF830, SH3GL1, BAX, and XRCC1) using TaqMan assay-based real-time polymerase chain reaction. At the end of RT, the severity of skin damage was scored, and the subjects were dichotomized as nonoverresponders (Radiation Therapy Oncology Group grade <2) and overresponders (Radiation Therapy Oncology Group grade ≥2) for analysis. Results: Of the 22 single nucleotide polymorphisms studied, the rs8193 polymorphism lying in the micro-RNA binding site of 3′-UTR of CD44 was significantly (P=.0270) associated with RT-induced adverse skin reactions. Generalized multifactor dimensionality reduction analysis showed significant (P=.0107) gene–gene interactions between MAT1A and CD44. Furthermore, an increase in the total number of risk alleles was associated with increasing occurrence of overresponses (P=.0302). Conclusions: The genetic polymorphisms in radioresponsive genes act as genetic modifiers of acute normal tissue toxicity outcomes after RT by acting individually (rs8193), by gene–gene interactions (MAT1A and CD44), and/or by the additive effects of risk alleles.« less
Identification of a new variant of Chlamydia trachomatis in Mexico.
Escobedo-Guerra, Marcos R; Katoku-Herrera, Mitzuko; Lopez-Hurtado, Marcela; Villagrana-Zesati, Jesus Roberto; de Haro-Cruz, María de J; Guerra-Infante, Fernando M
2018-04-07
Chlamydia trachomatis is one of the main etiological agents of sexually transmitted infections worldwide. In 2006, a Swedish variant of C. trachomatis (Swedish-nvCT), which has a deletion of 377bp in the plasmid, was reported. In Latin America, Swedish-nvCT infections have not been reported. We investigated the presence of Swedish-nvCT in women with infertility in Mexico. Swedish-nvCT was searched in 69C. trachomatis positive samples from 2339 endocervical specimens. We designed PCR primers to identify the deletion in the plasmid in the ORF1, and the presence of a repeated 44bp in the ORF3. The sample with the deletion was genotyped with the genes of the major outer membrane protein A (ompA) and the polymorphic membrane protein (pmpH). The deletion was detected in one of the 69 samples positive C. trachomatis of 2339 endocervical exudates. The nucleotide sequence analysis of the ompA shows a high degree of similarity with the Swedish nvCT (98%), however the variant found belongs to serovar D. The nucleotide sequence of the pmpH gene associates to the variant found in the genitourinary pathotype of the Swedish-nvCT but in different clusters. Our results revealed the presence of a new variant of C. trachomatis in Mexican patients. This variant found in Mexico belongs to serovar D based on the in silico analysis of the ompA and pmpH genes and differs to the Swedish-nvCT (serovars E). For these variants of C. trachomatis that have been found it is necessary to carry out a more detailed analysis, although the role of this mutation has not been demonstrated in the pathogenesis. Copyright © 2018 Elsevier España, S.L.U. and Sociedad Española de Enfermedades Infecciosas y Microbiología Clínica. All rights reserved.
Kikuchi, Naoki; Nakazato, Koichi
2015-01-01
Training variants (type, intensity, and duration of exercise) can be selected according to individual aims and fitness assessment. Recently, various methods of resistance and endurance training have been used for muscle hypertrophy and VO2max improvement. Although several genetic variants are associated with elite athletic performance and muscle phenotypes, genetic background has not been used as variant for physical training. ACTN3 R577X is a well-studied genetic polymorphism. It is the only genotype associated with elite athletic performance in multiple cohorts. This association is strongly supported by mechanistic data from an Actn3-knockout mouse model. In this review, possible guidelines are discussed for effective utilization of ACTN3 R577X polymorphism for physical training. PMID:26526670
Krawczyk, Marcin; Rau, Monika; Schattenberg, Jörn M.; Bantel, Heike; Pathil, Anita; Demir, Münevver; Kluwe, Johannes; Boettler, Tobias; Lammert, Frank; Geier, Andreas
2017-01-01
The PNPLA3 p.I148M, TM6SF2 p.E167K, and MBOAT7 rs641738 variants represent genetic risk factors for nonalcoholic fatty liver disease (NAFLD). Here we investigate if these polymorphisms modulate both steatosis and fibrosis in patients with NAFLD. We recruited 515 patients with NAFLD (age 16–88 years, 280 female patients). Liver biopsies were performed in 320 patients. PCR-based assays were used to genotype the PNPLA3, TM6SF2, and MBOAT7 variants. Carriers of the PNPLA3 and TM6SF2 risk alleles showed increased serum aspartate aminotransferase and alanine transaminase activities (P < 0.05). The PNPLA3 genotype was associated with steatosis grades S2–S3 (P < 0.001) and fibrosis stages F2–F4 (P < 0.001). The TM6SF2 genotype was associated with steatosis (P = 0.003) but not with fibrosis (P > 0.05). The MBOAT7 variant was solely associated with increased fibrosis (P = 0.046). In the multivariate model, variants PNPLA3 (P = 0.004) and TM6SF2 (P = 0.038) were associated with steatosis. Fibrosis stages were affected by the PNPLA3 (P = 0.042) and MBOAT7 (P = 0.021) but not by the TM6SF2 polymorphism (P > 0.05). The PNPLA3, TM6SF2, and MBOAT7 variants are associated with increased liver injury. The TM6SF2 variant seems to modulate predominantly hepatic fat accumulation, whereas the MBOAT7 polymorphism is linked to fibrosis. The PNPLA3 polymorphism confers risk of both increased steatosis and fibrosis. PMID:27836992
Krawczyk, Marcin; Rau, Monika; Schattenberg, Jörn M; Bantel, Heike; Pathil, Anita; Demir, Münevver; Kluwe, Johannes; Boettler, Tobias; Lammert, Frank; Geier, Andreas
2017-01-01
The PNPLA3 p.I148M, TM6SF2 p.E167K, and MBOAT7 rs641738 variants represent genetic risk factors for nonalcoholic fatty liver disease (NAFLD). Here we investigate if these polymorphisms modulate both steatosis and fibrosis in patients with NAFLD. We recruited 515 patients with NAFLD (age 16-88 years, 280 female patients). Liver biopsies were performed in 320 patients. PCR-based assays were used to genotype the PNPLA3, TM6SF2, and MBOAT7 variants. Carriers of the PNPLA3 and TM6SF2 risk alleles showed increased serum aspartate aminotransferase and alanine transaminase activities (P < 0.05). The PNPLA3 genotype was associated with steatosis grades S2-S3 (P < 0.001) and fibrosis stages F2-F4 (P < 0.001). The TM6SF2 genotype was associated with steatosis (P = 0.003) but not with fibrosis (P > 0.05). The MBOAT7 variant was solely associated with increased fibrosis (P = 0.046). In the multivariate model, variants PNPLA3 (P = 0.004) and TM6SF2 (P = 0.038) were associated with steatosis. Fibrosis stages were affected by the PNPLA3 (P = 0.042) and MBOAT7 (P = 0.021) but not by the TM6SF2 polymorphism (P > 0.05). The PNPLA3, TM6SF2, and MBOAT7 variants are associated with increased liver injury. The TM6SF2 variant seems to modulate predominantly hepatic fat accumulation, whereas the MBOAT7 polymorphism is linked to fibrosis. The PNPLA3 polymorphism confers risk of both increased steatosis and fibrosis. Copyright © 2017 by the American Society for Biochemistry and Molecular Biology, Inc.
Association of ghrelin polymorphisms with metabolic syndrome in Han Nationality Chinese.
Xu, Ling-Ling; Xiang, Hong-Ding; Qiu, Chang-Chun; Xu, Qun
2008-06-01
To investigate the association of ghrelin gene polymorphisms with metabolic syndrome in Han Nationality Chinese. A total of 240 patients with metabolic syndrome and 427 adults aged above forty years were recruited. Genotypes were determined by polymerase chain reaction and restriction fragment length polymorphism analysis. The allelic frequency of the Leu72Met polymorphism was 17.3% in the patient group and 11.9% in the control group (chi2 = 7.36, P = 0.007). Metabolic syndrome was more prevalent among carriers of the Met72 variant (43.8 vs 33.1%, age- and sex-adjusted odds ratio = 1.57, P = 0.01). No Arg51Gln variants were found in our study subjects. Rather than being associated with its individual components, Leu72Met polymorphism is associated with metabolic syndrome in the Han Nationality Chinese. Arg51Gln polymorphism is rare in the Han Nationality Chinese.
Population genetic implications from sequence variation in four Y chromosome genes.
Shen, P; Wang, F; Underhill, P A; Franco, C; Yang, W H; Roxas, A; Sung, R; Lin, A A; Hyman, R W; Vollrath, D; Davis, R W; Cavalli-Sforza, L L; Oefner, P J
2000-06-20
Some insight into human evolution has been gained from the sequencing of four Y chromosome genes. Primary genomic sequencing determined gene SMCY to be composed of 27 exons that comprise 4,620 bp of coding sequence. The unfinished sequencing of the 5' portion of gene UTY1 was completed by primer walking, and a total of 20 exons were found. By using denaturing HPLC, these two genes, as well as DBY and DFFRY, were screened for polymorphic sites in 53-72 representatives of the five continents. A total of 98 variants were found, yielding nucleotide diversity estimates of 2.45 x 10(-5), 5. 07 x 10(-5), and 8.54 x 10(-5) for the coding regions of SMCY, DFFRY, and UTY1, respectively, with no variant having been observed in DBY. In agreement with most autosomal genes, diversity estimates for the noncoding regions were about 2- to 3-fold higher and ranged from 9. 16 x 10(-5) to 14.2 x 10(-5) for the four genes. Analysis of the frequencies of derived alleles for all four genes showed that they more closely fit the expectation of a Luria-Delbrück distribution than a distribution expected under a constant population size model, providing evidence for exponential population growth. Pairwise nucleotide mismatch distributions date the occurrence of population expansion to approximately 28,000 years ago. This estimate is in accord with the spread of Aurignacian technology and the disappearance of the Neanderthals.
Derkach, Andriy; Chiang, Theodore; Gong, Jiafen; Addis, Laura; Dobbins, Sara; Tomlinson, Ian; Houlston, Richard; Pal, Deb K; Strug, Lisa J
2014-08-01
Sufficiently powered case-control studies with next-generation sequence (NGS) data remain prohibitively expensive for many investigators. If feasible, a more efficient strategy would be to include publicly available sequenced controls. However, these studies can be confounded by differences in sequencing platform; alignment, single nucleotide polymorphism and variant calling algorithms; read depth; and selection thresholds. Assuming one can match cases and controls on the basis of ethnicity and other potential confounding factors, and one has access to the aligned reads in both groups, we investigate the effect of systematic differences in read depth and selection threshold when comparing allele frequencies between cases and controls. We propose a novel likelihood-based method, the robust variance score (RVS), that substitutes genotype calls by their expected values given observed sequence data. We show theoretically that the RVS eliminates read depth bias in the estimation of minor allele frequency. We also demonstrate that, using simulated and real NGS data, the RVS method controls Type I error and has comparable power to the 'gold standard' analysis with the true underlying genotypes for both common and rare variants. An RVS R script and instructions can be found at strug.research.sickkids.ca, and at https://github.com/strug-lab/RVS. lisa.strug@utoronto.ca Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Choi, Hyung Jin; Cho, Young Min; Moon, Min Kyong; Choi, Hye Hun; Shin, Hyoung Doo; Jang, Hak Chul; Kim, Seong Yeon; Lee, Hong Kyu; Park, Kyong Soo
2006-11-01
Ghrelin is known to play a role in glucose metabolism and in beta-cell function. There are controversies regarding the role of ghrelin polymorphisms in diabetes and diabetes-related phenotypes. The objective of this study was to examine polymorphisms of the ghrelin gene in a Korean cohort and investigate associations between them and susceptibility to type 2 diabetes and its related phenotypes. The ghrelin gene was sequenced to identify polymorphisms in 24 DNA samples. Common variants were then genotyped in 760 type 2 diabetic patients and 641 nondiabetic subjects. Genetic associations with diabetes-related phenotypes were also analyzed. Nine polymorphisms were identified, and four common polymorphisms [g.-1500C>G, g.-1062G > C, g.-994C > T, g.+408C > A (Leu72Met)] were genotyped in a larger study. The genotype distributions of these four common polymorphisms in type 2 diabetes patients were similar to those of normal nondiabetic controls. However, these four common polymorphisms were variably associated with several diabetes-related phenotypes, such as high-density lipoprotein (HDL) cholesterol, fasting plasma glucose, and homeostasis model assessment of insulin resistance. In particular, subjects harboring g.-1062C were associated with a lower serum HDL cholesterol level after adjusting for other variables (P = 0.0004 or 0.01 after Bonferroni correction for 24 tests). The aforementioned four common polymorphisms in the ghrelin gene were not found to be significantly associated with susceptibility to type 2 diabetes mellitus in the Korean population. However, the common polymorphism g.-1062G > C in the promoter region of the ghrelin gene was found to be significantly associated with serum HDL cholesterol levels.
Epidemiological evolution of canine parvovirus in the Portuguese domestic dog population.
Miranda, Carla; Parrish, Colin R; Thompson, Gertrude
2016-02-01
Since its emergence, canine parvovirus type 2 (CPV-2) has caused disease pandemics with severe gastroenteritis signs, infecting especially puppies. As a consequence of CPV rapid evolution a variety of genetic and antigenic variants have been reported circulating worldwide. The detection of additional variants of CPV circulating in the dog population in Portugal suggests monitoring of the disease is useful. The objectives of this study were to further detect and characterize circulating field variants from suspected CPV diseased dogs that were admitted to veterinary clinics distributed throughout the country, during 2012-2014. Of the 260 fecal samples collected, 198 were CPV positive by PCR, and CPV antigen was detected in 61/109 samples by Immunochromatographic (IC) test. The restriction fragment length polymorphism (RFLP) analysis of 167 samples revealed that 86 were the CPV-2c. Sequence analysis of the 198 strains confirmed that CPV-2c were the dominant variant (51.5%), followed by CPV-2b (47.5%) and CPV-2a (1%). The variants were irregularly distributed throughout the country and some were detected with additional non-synonymous mutations in the VP2 gene. Phylogenetic analysis demonstrated that the isolates were similar to other European strains, and that this virus continues to evolve. Copyright © 2015 Elsevier B.V. All rights reserved.
Zhang, Ya-Jie; Li, Lei; Wang, Zhen-Jing; Zhang, Xiao-Jing; Zhao, Han; Zhao, Yan; Wang, Xie-Tong; Li, Chang-Zhong; Wan, Ji-Peng
2018-05-17
To evaluate the association between preeclampsia and three single nucleotide polymorphisms (rs13405728 in LHCGR gene; rs13429458 in THADA gene, and rs2479106 in DENND1A gene) which were identified to be genetic variants of polycystic ovary syndrome (PCOS) by genome-wide association study in Han Chinese populations. A total of 784 northern Han Chinese women (378 controls and 406 cases) were genotyped for the three genetic variants by polymerase chain reaction and direct sequencing. Unconditional logistic regression analysis was used to adjust the impact of prepregnancy body mass index, primiparas, and maternal age. No significant difference was found in the allele frequencies of the three genetic variants between cases and controls (p > .05), but genotype frequency of the SNP rs2479106 was significantly differ between cases and controls when analyzed under recessive models (p = .02). There was also a substantial difference in the genotype frequencies of the SNP rs13429458 between cases and controls under additive models (p = .01). Genetic variants of PCOS (rs13405728 in LHCGR gene; rs13429458 in THADA gene and rs2479106 in DENND1A gene) may not be involved in the development of preeclampsia in Han Chinese women.
Wilson, Kitchener D; Shen, Peidong; Fung, Eula; Karakikes, Ioannis; Zhang, Angela; InanlooRahatloo, Kolsoum; Odegaard, Justin; Sallam, Karim; Davis, Ronald W; Lui, George K; Ashley, Euan A; Scharfe, Curt; Wu, Joseph C
2015-09-11
Thousands of mutations across >50 genes have been implicated in inherited cardiomyopathies. However, options for sequencing this rapidly evolving gene set are limited because many sequencing services and off-the-shelf kits suffer from slow turnaround, inefficient capture of genomic DNA, and high cost. Furthermore, customization of these assays to cover emerging targets that suit individual needs is often expensive and time consuming. We sought to develop a custom high throughput, clinical-grade next-generation sequencing assay for detecting cardiac disease gene mutations with improved accuracy, flexibility, turnaround, and cost. We used double-stranded probes (complementary long padlock probes), an inexpensive and customizable capture technology, to efficiently capture and amplify the entire coding region and flanking intronic and regulatory sequences of 88 genes and 40 microRNAs associated with inherited cardiomyopathies, congenital heart disease, and cardiac development. Multiplexing 11 samples per sequencing run resulted in a mean base pair coverage of 420, of which 97% had >20× coverage and >99% were concordant with known heterozygous single nucleotide polymorphisms. The assay correctly detected germline variants in 24 individuals and revealed several polymorphic regions in miR-499. Total run time was 3 days at an approximate cost of $100 per sample. Accurate, high-throughput detection of mutations across numerous cardiac genes is achievable with complementary long padlock probe technology. Moreover, this format allows facile insertion of additional probes as more cardiomyopathy and congenital heart disease genes are discovered, giving researchers a powerful new tool for DNA mutation detection and discovery. © 2015 American Heart Association, Inc.
OryzaGenome: Genome Diversity Database of Wild Oryza Species.
Ohyanagi, Hajime; Ebata, Toshinobu; Huang, Xuehui; Gong, Hao; Fujita, Masahiro; Mochizuki, Takako; Toyoda, Atsushi; Fujiyama, Asao; Kaminuma, Eli; Nakamura, Yasukazu; Feng, Qi; Wang, Zi-Xuan; Han, Bin; Kurata, Nori
2016-01-01
The species in the genus Oryza, encompassing nine genome types and 23 species, are a rich genetic resource and may have applications in deeper genomic analyses aiming to understand the evolution of plant genomes. With the advancement of next-generation sequencing (NGS) technology, a flood of Oryza species reference genomes and genomic variation information has become available in recent years. This genomic information, combined with the comprehensive phenotypic information that we are accumulating in our Oryzabase, can serve as an excellent genotype-phenotype association resource for analyzing rice functional and structural evolution, and the associated diversity of the Oryza genus. Here we integrate our previous and future phenotypic/habitat information and newly determined genotype information into a united repository, named OryzaGenome, providing the variant information with hyperlinks to Oryzabase. The current version of OryzaGenome includes genotype information of 446 O. rufipogon accessions derived by imputation and of 17 accessions derived by imputation-free deep sequencing. Two variant viewers are implemented: SNP Viewer as a conventional genome browser interface and Variant Table as a text-based browser for precise inspection of each variant one by one. Portable VCF (variant call format) file or tab-delimited file download is also available. Following these SNP (single nucleotide polymorphism) data, reference pseudomolecules/scaffolds/contigs and genome-wide variation information for almost all of the closely and distantly related wild Oryza species from the NIG Wild Rice Collection will be available in future releases. All of the resources can be accessed through http://viewer.shigen.info/oryzagenome/. © The Author 2015. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists.
SNCA 3'UTR genetic variants in patients with Parkinson's disease and REM sleep behavior disorder.
Toffoli, M; Dreussi, E; Cecchin, E; Valente, M; Sanvilli, N; Montico, M; Gagno, S; Garziera, M; Polano, M; Savarese, M; Calandra-Buonaura, G; Placidi, F; Terzaghi, M; Toffoli, G; Gigli, G L
2017-07-01
REM sleep behavior disorder (RBD) is an early marker of Parkinson's disease (PD); however, it is still unclear which patients with RBD will eventually develop PD. Single nucleotide polymorphisms (SNPs) in the 3'untranslated region (3'UTR) of alpha-synuclein (SNCA) have been associated with PD, but at present, no data is available about RBD. The 3'UTR hosts regulatory regions involved in gene expression control, such as microRNA binding sites. The aim of this study was to determine RBD specific genetic features associated to an increased risk of progression to PD, by sequencing of the SNCA-3'UTR in patients with "idiopathic" RBD (iRBD) and in patients with PD. We recruited 113 consecutive patients with a diagnosis of iRBD (56 patients) or PD (with or without RBD, 57 patients). Sequencing of SNCA-3'UTR was performed on genomic DNA extracted from peripheral blood samples. Bioinformatic analyses were carried out to predict the potential effect of the identified genetic variants on microRNA binding. We found three SNCA-3'UTR SNPs (rs356165, rs3857053, rs1045722) to be more frequent in PD patients than in iRBD patients (p = 0.014, 0.008, and 0.008, respectively). Four new or previously reported but not annotated specific genetic variants (KP876057, KP876056, NM_000345.3:c*860T>A, NM_000345.3:c*2320A>T) have been observed in the RBD population. The in silico approach highlighted that these variants could affect microRNA-mediated gene expression control. Our data show specific SNPs in the SNCA-3'UTR that may bear a risk for RBD to be associated with PD. Moreover, new genetic variants were identified in patients with iRBD.
Thomson, P A; Parla, J S; McRae, A F; Kramer, M; Ramakrishnan, K; Yao, J; Soares, D C; McCarthy, S; Morris, S W; Cardone, L; Cass, S; Ghiban, E; Hennah, W; Evans, K L; Rebolini, D; Millar, J K; Harris, S E; Starr, J M; MacIntyre, D J; McIntosh, A M; Watson, J D; Deary, I J; Visscher, P M; Blackwood, D H; McCombie, W R; Porteous, D J
2014-06-01
A balanced t(1;11) translocation that transects the Disrupted in schizophrenia 1 (DISC1) gene shows genome-wide significant linkage for schizophrenia and recurrent major depressive disorder (rMDD) in a single large Scottish family, but genome-wide and exome sequencing-based association studies have not supported a role for DISC1 in psychiatric illness. To explore DISC1 in more detail, we sequenced 528 kb of the DISC1 locus in 653 cases and 889 controls. We report 2718 validated single-nucleotide polymorphisms (SNPs) of which 2010 have a minor allele frequency of <1%. Only 38% of these variants are reported in the 1000 Genomes Project European subset. This suggests that many DISC1 SNPs remain undiscovered and are essentially private. Rare coding variants identified exclusively in patients were found in likely functional protein domains. Significant region-wide association was observed between rs16856199 and rMDD (P=0.026, unadjusted P=6.3 × 10(-5), OR=3.48). This was not replicated in additional recurrent major depression samples (replication P=0.11). Combined analysis of both the original and replication set supported the original association (P=0.0058, OR=1.46). Evidence for segregation of this variant with disease in families was limited to those of rMDD individuals referred from primary care. Burden analysis for coding and non-coding variants gave nominal associations with diagnosis and measures of mood and cognition. Together, these observations are likely to generalise to other candidate genes for major mental illness and may thus provide guidelines for the design of future studies.
Polymorphisms in the prostaglandin receptor EP2 gene confers susceptibility to tuberculosis.
Liang, Li; Zhang, Qing; Luo, Liu-Lin; Yue, Jun; Zhao, Yan-Lin; Han, Min; Liu, Li-Rong; Xiao, He-Ping
2016-12-01
Prostaglandin E2 (PGE2) is an important lipid mediator of the inflammatory immune response during acute and chronic infections. PGE2 modulates a variety of immune functions via four receptors (EP1-EP4), which mediate distinct PGE2 effects. Mice lacking EP2 are more susceptible to infection by Mycobacterium tuberculosis (M.tb), have a higher bacterial load, and increase size and number of granulomatous lesions. Our aim was to assess whether single nucleotide polymorphisms (SNPs) in EP2 increase the risk of tuberculosis. DNA re-sequencing revealed five common EP2 variants in the Chinese Han population. We sequenced the EP2 gene from 600 patients and 572 healthy controls to measure SNP frequencies in association with tuberculosis infections (TB) within the population. The rs937337 polymorphism is associated with increased risk to tuberculosis (p=0.0044, odds ratio [OR], 1.67; 95% confidential interval,1.22-2.27). The rs937337 AA genotype and the rs1042618 CC genotype were significantly associated with TB. An estimation of the frequencies of haplotypes revealed a single protective haplotype GACGC for tuberculosis (p=0.00096, odds ratio [OR], 0.56; 95% confidential interval, 0.41-0.77). Furthermore, we determined that the remaining SNPs of EP2 were nominally associated with clinical patterns of disease. We identified genetic polymorphisms in EP2 associated with susceptibility to tuberculosis within a Chinese population. Our data support that EP2 SNPs are genetic predispositions of increased susceptibility to TB and to different clinical patterns of disease. Copyright © 2016 Elsevier B.V. All rights reserved.
Genotyping of Canine parvovirus in western Mexico.
Pedroza-Roldán, César; Páez-Magallan, Varinia; Charles-Niño, Claudia; Elizondo-Quiroga, Darwin; De Cervantes-Mireles, Raúl Leonel; López-Amezcua, Mario Alberto
2015-01-01
Canine parvovirus (CPV) is one of the most common infectious agents related to high morbidity rates in dogs. In addition, the virus is associated with severe gastroenteritis, diarrhea, and vomiting, resulting in high death rates, especially in puppies and nonvaccinated dogs. To date, there are 3 variants of the virus (CPV-2a, CPV-2b, and CPV-2c) circulating worldwide. In Mexico, reports describing the viral variants circulating in dog populations are lacking. In response to this deficiency, a total of 41 fecal samples of suspected dogs were collected from October 2013 through April 2014 in the Veterinary Hospital of the University of Guadalajara in western Mexico. From these, 24 samples resulted positive by polymerase chain reaction, and the viral variant was determined by restriction fragment length polymorphism. Five positive diagnosed samples were selected for partial sequencing of the vp2 gene and codon analysis. The results demonstrated that the current dominant viral variant in Mexico is CPV-2c. The current study describes the genotyping of CPV strains, providing valuable evidence of the dominant frequency of this virus in a dog population from western Mexico. © 2014 The Author(s).
Volaki, Konstantina; Pampanos, Andreas; Kitsiou-Tzeli, Sophia; Vrettou, Christina; Oikonomakis, Vasilis; Sofocleous, Christalena; Kanavakis, Emmanuel
2013-10-01
Molecular and neurobiological evidence for the involvement of neuroligins (particularly NLGN3 and NLGN4X genes) in autistic disorder is accumulating. However, previous mutation screening studies on these two genes have yielded controversial results. The present study explores, for the first time, the contribution of NLGN3 and NLGN4X genetic variants in Greek patients with autistic disorder. We analyzed the full exonic sequence of NLGN3 and NLGN4X genes in 40 patients strictly fulfilling the Diagnostic and Statistical Manual of Mental Disorders, 4th ed. criteria for autistic disorder. We identified nine nucleotide changes in NLGN4X--one probable causative mutation (p.K378R) previously reported by our research group, one novel variant (c.-206G>C), one nonvalidated single nucleotide polymorphism (SNP, rs111953947), and six known human SNPs reported in the SNP database--and one known human SNP in NLGN3 also reported in the SNP database. The variants identified are expected to be benign. However, they should be investigated in the context of variants in interacting cellular pathways to assess their contribution to the etiology of autism.
Haplotype Analysis in Multiple Crosses to Identify a QTL Gene
Wang, Xiaosong; Korstanje, Ron; Higgins, David; Paigen, Beverly
2004-01-01
Identifying quantitative trait locus (QTL) genes is a challenging task. Herein, we report using a two-step process to identify Apoa2 as the gene underlying Hdlq5, a QTL for plasma high-density lipoprotein cholesterol (HDL) levels on mouse chromosome 1. First, we performed a sequence analysis of the Apoa2 coding region in 46 genetically diverse mouse strains and found five different APOA2 protein variants, which we named APOA2a to APOA2e. Second, we conducted a haplotype analysis of the strains in 21 crosses that have so far detected HDL QTLs; we found that Hdlq5 was detected only in the nine crosses where one parent had the APOA2b protein variant characterized by an Ala61-to-Val61 substitution. We then found that strains with the APOA2b variant had significantly higher (P ≤ 0.002) plasma HDL levels than those with either the APOA2a or the APOA2c variant. These findings support Apoa2 as the underlying Hdlq5 gene and suggest the Apoa2 polymorphisms responsible for the Hdlq5 phenotype. Therefore, haplotype analysis in multiple crosses can be used to support a candidate QTL gene. PMID:15310659
Haplotype analysis in multiple crosses to identify a QTL gene.
Wang, Xiaosong; Korstanje, Ron; Higgins, David; Paigen, Beverly
2004-09-01
Identifying quantitative trait locus (QTL) genes is a challenging task. Herein, we report using a two-step process to identify Apoa2 as the gene underlying Hdlq5, a QTL for plasma high-density lipoprotein cholesterol (HDL) levels on mouse chromosome 1. First, we performed a sequence analysis of the Apoa2 coding region in 46 genetically diverse mouse strains and found five different APOA2 protein variants, which we named APOA2a to APOA2e. Second, we conducted a haplotype analysis of the strains in 21 crosses that have so far detected HDL QTLs; we found that Hdlq5 was detected only in the nine crosses where one parent had the APOA2b protein variant characterized by an Ala61-to-Val61 substitution. We then found that strains with the APOA2b variant had significantly higher (P < or = 0.002) plasma HDL levels than those with either the APOA2a or the APOA2c variant. These findings support Apoa2 as the underlying Hdlq5 gene and suggest the Apoa2 polymorphisms responsible for the Hdlq5 phenotype. Therefore, haplotype analysis in multiple crosses can be used to support a candidate QTL gene.
Mohana, Vamsi U; Swapna, N; Surender, Reddy S; Vishnupriya, S; Padma, Tirunilai
2012-01-01
The human angiotensinogen (AGT) is a promising candidate gene for evaluating susceptibility to essential hypertension (EH). We aimed to assess the association of the variants of AGT gene and the extent of risk involved in developing EH. A case-control study was designed to compare 279 hypertensive patients with 200 normotensive subjects. The frequency distribution of M235T and T174M polymorphisms of AGT gene was assessed by polymerase chain reaction (PCR)-restriction fragment length polymorphism (RFLP) method. A haplotype analysis was done to determine the risk conferred by the combination of alleles of the two polymorphisms for EH. The genotype distribution of the T174M variant differed significantly between hypertensives and normotensives, whereas genotypes of M235T variant did not show such difference. For M235T, MM genotype conferred an increase in risk for hypertension in women (odds ratios (OR) = 2.82; 95% confidence interval (CI) = 1.22-6.49). For the variant T174M, the TM genotype frequency was elevated in hypertensive females (36.5%) as compared to controls (18.8 %; P = .034). The 174M allele was more prevalent among female hypertensives than among female controls (0.20 vs. 0.12; P = .059). The haplotype analysis showed a significant association for the haplotypes of paired markers (M235 and 174M) with a χ(2) value of 8.037 (P = .045). Our findings suggest that the polymorphic variants of AGT gene-M235T and T174M-show association with hypertension.
Yazıcıoğlu, Burcu; Kaya, Zühre; Güntekin Ergun, Sezen; Perçin, Ferda; Koçak, Ülker; Yenicesu, İdil; Gürsel, Türkiz
2017-06-05
High-dose methotrexate (HD-MTX) is widely used in the consolidation phase of childhood acute lymphoblastic leukemia (ALL), but the roles that polymorphisms in folate-related genes (FRGs) play in HD-MTX toxicity and prognosis in children with ALL are not understood. The aims of this study were to investigate the frequencies of polymorphisms in the genes for thymidylate synthase (TS), methionine synthase reductase (MTRR), and methylene tetrahydrofolate reductase (MTHFR) in Turkish children with ALL and to assess associations between these polymorphisms and HD-MTX-related toxicity and leukemia prognosis in this patient group. FRG polymorphisms were assessed by real-time polymerase chain reaction. Survival status, MTX levels, and toxicity data were retrieved from 106 patients' charts. The allele frequencies for the FRG polymorphisms were as follows: TS 2R 41.0%, 3R 57.0%, and 4R 2.0%; MTRR 66A 42.4% and 66G 57.6%; MTHFR 677C 59.3% and 677T 40.7%; and MTHFR 1298A 58.1% and 1298C 41.9%. At the 48th hour of HD-MTX infusion, serum MTX was significantly higher in patients who had TS 2R/3R/4R variants as compared to those with wild-type TS (p<0.05). No significant differences were detected with respect to event-free survival or toxicity between wild-type and other FRG variants. The frequencies of FRG polymorphisms in Turkish children with ALL are similar to those reported in other Caucasian populations. This is the first published finding of the TS 3R/4R variant in the Turkish population. The results indicate that HD-MTX can be tolerated by leukemic children with some polymorphic variants of FRG; thus, it may prevent future risk of leukemic relapse.
Buchanan, Carrie C; Torstenson, Eric S; Bush, William S; Ritchie, Marylyn D
2012-01-01
Since publication of the human genome in 2003, geneticists have been interested in risk variant associations to resolve the etiology of traits and complex diseases. The International HapMap Consortium undertook an effort to catalog all common variation across the genome (variants with a minor allele frequency (MAF) of at least 5% in one or more ethnic groups). HapMap along with advances in genotyping technology led to genome-wide association studies which have identified common variants associated with many traits and diseases. In 2008 the 1000 Genomes Project aimed to sequence 2500 individuals and identify rare variants and 99% of variants with a MAF of <1%. To determine whether the 1000 Genomes Project includes all the variants in HapMap, we examined the overlap between single nucleotide polymorphisms (SNPs) genotyped in the two resources using merged phase II/III HapMap data and low coverage pilot data from 1000 Genomes. Comparison of the two data sets showed that approximately 72% of HapMap SNPs were also found in 1000 Genomes Project pilot data. After filtering out HapMap variants with a MAF of <5% (separately for each population), 99% of HapMap SNPs were found in 1000 Genomes data. Not all variants cataloged in HapMap are also cataloged in 1000 Genomes. This could affect decisions about which resource to use for SNP queries, rare variant validation, or imputation. Both the HapMap and 1000 Genomes Project databases are useful resources for human genetics, but it is important to understand the assumptions made and filtering strategies employed by these projects.
2014-01-01
Background The epidermal growth factor receptor (EGFR) is differently expressed in breast cancer, and its presence may favor cancer progression. We hypothesized that two EGFR functional polymorphisms, a (CA)n repeat in intron 1, and a single nucleotide polymorphism, R497K, may affect EGFR expression and breast cancer clinical profile. Methods The study population consisted of 508 Brazilian women with unilateral breast cancer, and no distant metastases. Patients were genotyped for the (CA)n and R497K polymorphisms, and the associations between (CA)n polymorphism and EGFR transcript levels (n = 129), or between either polymorphism and histopathological features (n = 505) were evaluated. The REMARK criteria of tumor marker evaluation were followed. Results (CA)n lengths ranged from 14 to 24 repeats, comprehending 11 alleles and 37 genotypes. The most frequent allele was (CA)16 (0.43; 95% CI = 0.40–0.46), which was set as the cut-off length to define the Short allele. Variant (CA)n genotypes had no significant effect in tumoral EGFR mRNA levels, but patients with two (CA)n Long alleles showed lower chances of being negative for progesterone receptor (ORadjusted = 0.42; 95% CI = 0.19–0.91). The evaluation of R497K polymorphism indicated a frequency of 0.21 (95% CI = 0.19 – 0.24) for the variant (Lys) allele. Patients with variant R497K genotypes presented lower proportion of worse lymph node status (pN2 or pN3) when compared to the reference genotype Arg/Arg (ORadjusted = 0.32; 95% CI = 0.17–0.59), which resulted in lower tumor staging (ORadjusted = 0.34; 95% CI = 0.19-0.63), and lower estimated recurrence risk (OR = 0.50; 95% CI = 0.30-0.81). The combined presence of both EGFR polymorphisms (Lys allele of R497K and Long/Long (CA)n) resulted in lower TNM status (ORadjusted = 0.22; 95% CI = 0.07-0.75) and lower ERR (OR = 0.25; 95% CI = 0.09-0.71). When tumors were stratified according to biological classification, the favorable effects of variant EGFR polymorphisms were preserved for luminal A tumors, but not for other subtypes. Conclusions The data suggest that the presence of the variant forms of EGFR polymorphisms may lead to better prognosis in breast cancer, especially in patients with luminal A tumors. PMID:24629097
Xu, Zhen-Hua; Thomae, Bianca A; Eckloff, Bruce W; Wieben, Eric D; Weinshilboum, Richard M
2003-06-01
3'-Phosphoadenosine 5'-phosphosulfate (PAPS) is the high-energy "sulfate donor" for reactions catalyzed by sulfotransferase (SULT) enzymes. The strict requirement of SULTs for PAPS suggests that PAPS synthesis might influence the rate of sulfate conjugation. In humans, PAPS is synthesized from ATP and SO(4)(2-) by two isoforms of PAPS synthetase (PAPSS): PAPSS1 and PAPSS2. As a step toward pharmacogenetic studies, we have resequenced the entire coding sequence of the human PAPSS1 gene, including exon-intron splice junctions, using DNA samples from 60 Caucasian-American and 58 African-American subjects. Twenty-one genetic polymorphisms were observed-1 insertion-deletion event and 20 single nucleotide polymorphisms (SNPs)-including two non-synonymous coding SNPs (cSNPs) that altered the following amino acids: Arg333Cys and Glu531Gln. Twelve pairs of these polymorphisms were tightly linked, and a total of twelve unequivocal haplotypes could be identified-two that were common to both ethnic groups and ten that were ethnic-specific. The Arg333Cys polymorphism, with an allele frequency of 2.5%, was observed only in DNA samples from Caucasian subjects. The Glu531Gln polymorphism was rare, with only a single copy of that allele in a DNA sample from an African-American subject. Transient expression in mammalian cells showed that neither of the non-synonymous cSNPs resulted in a change in the basal level of enzyme activity measured under optimal assay conditions. However, the Glu531Gln polymorphism altered the substrate kinetic properties of the enzyme. The Gln531 variant allozyme had a 5-fold higher K(m) value for SO(4)(2-) than did the wild-type allozyme and displayed monophasic kinetics for Na(2)SO(4). The wild-type allozyme (Glu531) showed biphasic kinetics for that substrate. These observations represent a step toward testing the hypothesis that genetic variation in PAPS synthesis catalyzed by PAPSS1 might alter in vivo sulfate conjugation.
Bosch, T M; Doodeman, V D; Smits, P H M; Meijerman, I; Schellens, J H M; Beijnen, J H
2006-01-01
A possible explanation for the wide interindividual variability in toxicity and efficacy of drug therapy is variation in genes encoding drug-metabolizing enzymes and drug transporters. The allelic frequency of these genetic variants, linkage disequilibrium (LD), and haplotype of these polymorphisms are important parameters in determining the genetic differences between patients. The aim of this study was to explore the frequencies of polymorphisms in drug-metabolizing enzymes (CYP1A1, CYP2C9, CYP2C19, CYP3A4, CYP2D6, CYP3A5, DPYD, UGT1A1, GSTM1, GSTP1, GSTT1) and drug transporters (ABCB1[MDR1] and ABCC2[MRP2]), and to investigate the LD and perform haplotype analysis of these polymorphisms in a Dutch population. Blood samples were obtained from 100 healthy volunteers and genomic DNA was isolated and amplified by PCR. The amplification products were sequenced and analyzed for the presence of polymorphisms by sequence alignment. In the study population, we identified 13 new single nucleotide polymorphisms (SNPs) in Caucasians and three new SNPs in non-Caucasians, in addition to previously recognized SNPs. Three of the new SNPs were found within exons, of which two resulted in amino acid changes (A428T in CYP2C9 resulting in the amino acid substitution D143V; and C4461T in ABCC2 in a non-Caucasian producing the amino acid change T1476M). Several LDs and haplotypes were found in the Caucasian individuals. In this Dutch population, the frequencies of 16 new SNPs and those of previously recognized SNPs were determined in genes coding for drug-metabolizing enzymes and drug transporters. Several LDs and haplotypes were also inferred. These data are important for further research to help explain the interindividual pharmacokinetic and pharmacodynamic variability in response to drug therapy.
Pelle, Roger; Mwacharo, Joram M.; Njahira, Moses N.; Marcellino, Wani L.; Kiara, Henry; Malak, Agol K.; EL Hussein, Abdel Rahim M.; Bishop, Richard; Skilton, Robert A.
2017-01-01
East Coast fever (ECF), caused by Theileria parva infection, is a frequently fatal disease of cattle in eastern, central and southern Africa, and an emerging disease in South Sudan. Immunization using the infection and treatment method (ITM) is increasingly being used for control in countries affected by ECF, but not yet in South Sudan. It has been reported that CD8+ T-cell lymphocytes specific for parasitized cells play a central role in the immunity induced by ITM and a number of T. parva antigens recognized by parasite-specific CD8+ T-cells have been identified. In this study we determined the sequence diversity among two of these antigens, Tp1 and Tp2, which are under evaluation as candidates for inclusion in a sub-unit vaccine. T. parva samples (n = 81) obtained from cattle in four geographical regions of South Sudan were studied for sequence polymorphism in partial sequences of the Tp1 and Tp2 genes. Eight positions (1.97%) in Tp1 and 78 positions (15.48%) in Tp2 were shown to be polymorphic, giving rise to four and 14 antigen variants in Tp1 and Tp2, respectively. The overall nucleotide diversity in the Tp1 and Tp2 genes was π = 1.65% and π = 4.76%, respectively. The parasites were sampled from regions approximately 300 km apart, but there was limited evidence for genetic differentiation between populations. Analyses of the sequences revealed limited numbers of amino acid polymorphisms both overall and in residues within the mapped CD8+ T-cell epitopes. Although novel epitopes were identified in the samples from South Sudan, a large number of the samples harboured several epitopes in both antigens that were similar to those in the T. parva Muguga reference stock, which is a key component in the widely used live vaccine cocktail. PMID:28231338
Pan, Wei; Song, Im-Sook; Shin, Ho-Jung; Kim, Min-Hye; Choi, Yeong-Lim; Lim, Su-Jeong; Kim, Woo-Young; Lee, Sang-Seop; Shin, Jae-Gook
2011-06-01
Genetic variants of Na(+)-taurocholate co-transporting polypeptide (NTCP; SLC10A1) and ileal apical sodium-dependent bile acid transporter (ASBT; SLC10A2), which greatly contribute to bile acid homeostasis, were extensively explored in the Korean population and functional variants of NTCP were compared among Asian populations. From direct DNA sequencing, six SNPs were identified in the SLC10A1 gene and 14 SNPs in the SLC10A2 gene. Three of seven coding variants were non-synonymous SNPs: two variants from SLC10A1 (A64T, S267F) and one from SLC10A2 (A171S). No linkage was analysed in the SLC10A1 gene because of low frequencies of genetic variants, and the SLC10A2 gene was composed of two separated linkage disequilibrium blocks contrary to the white population. The stably transfected NTCP-A64T variant showed significantly decreased uptakes of taurocholate and rosuvastatin compared with wild-type NTCP. The decreased taurocholate uptake and increased rosuvastatin uptake were shown in the NTCP-S267F variant. The allele frequencies of these functional variants were 1.0% and 3.1%, respectively, in a Korean population. However, NTCP-A64T was not found in Chinese and Vietnamese subjects. The frequency distribution of NTCP-S267F in Koreans was significantly lower than those in Chinese and Vietnamese populations. Our data suggest that NTCP-A64T and -S267F variants cause substrate-dependent functional change in vitro, and show ethnic difference in their allelic frequencies among Asian populations although the clinical relevance of these variants is remained to be evaluated.
Blue, Elizabeth Marchani; Sun, Lei; Tintle, Nathan L.; Wijsman, Ellen M.
2014-01-01
When analyzing family data, we dream of perfectly informative data, even whole genome sequences (WGS) for all family members. Reality intervenes, and we find next-generation sequence (NGS) data have error, and are often too expensive or impossible to collect on everyone. Genetic Analysis Workshop 18 groups “Quality Control” and “Dropping WGS through families using GWAS framework” focused on finding, correcting, and using errors within the available sequence and family data, developing methods to infer and analyze missing sequence data among relatives, and testing for linkage and association with simulated blood pressure. We found that single nucleotide polymorphisms, NGS, and imputed data are generally concordant, but that errors are particularly likely at rare variants, homozygous genotypes, within regions with repeated sequences or structural variants, and within sequence data imputed from unrelateds. Admixture complicated identification of cryptic relatedness, but information from Mendelian transmission improved error detection and provided an estimate of the de novo mutation rate. Both genotype and pedigree errors had an adverse effect on subsequent analyses. Computationally fast rules-based imputation was accurate, but could not cover as many loci or subjects as more computationally demanding probability-based methods. Incorporating population-level data into pedigree-based imputation methods improved results. Observed data outperformed imputed data in association testing, but imputed data were also useful. We discuss the strengths and weaknesses of existing methods, and suggest possible future directions. Topics include improving communication between those performing data collection and analysis, establishing thresholds for and improving imputation quality, and incorporating error into imputation and analytical models. PMID:25112184
Shen, Wei; Paxton, Christian N; Szankasi, Philippe; Longhurst, Maria; Schumacher, Jonathan A; Frizzell, Kimberly A; Sorrells, Shelly M; Clayton, Adam L; Jattani, Rakhi P; Patel, Jay L; Toydemir, Reha; Kelley, Todd W; Xu, Xinjie
2018-04-01
Genetic abnormalities, including copy number variants (CNV), copy number neutral loss of heterozygosity (CN-LOH) and gene mutations, underlie the pathogenesis of myeloid malignancies and serve as important diagnostic, prognostic and/or therapeutic markers. Currently, multiple testing strategies are required for comprehensive genetic testing in myeloid malignancies. The aim of this proof-of-principle study was to investigate the feasibility of combining detection of genome-wide large CNVs, CN-LOH and targeted gene mutations into a single assay using next-generation sequencing (NGS). For genome-wide CNV detection, we designed a single nucleotide polymorphism (SNP) sequencing backbone with 22 762 SNP regions evenly distributed across the entire genome. For targeted mutation detection, 62 frequently mutated genes in myeloid malignancies were targeted. We combined this SNP sequencing backbone with a targeted mutation panel, and sequenced 9 healthy individuals and 16 patients with myeloid malignancies using NGS. We detected 52 somatic CNVs, 11 instances of CN-LOH and 39 oncogenic mutations in the 16 patients with myeloid malignancies, and none in the 9 healthy individuals. All CNVs and CN-LOH were confirmed by SNP microarray analysis. We describe a genome-wide SNP sequencing backbone which allows for sensitive detection of genome-wide CNVs and CN-LOH using NGS. This proof-of-principle study has demonstrated that this strategy can provide more comprehensive genetic profiling for patients with myeloid malignancies using a single assay. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Hilton, Hugo G; Parham, Peter
2013-01-01
Monoclonal antibodies with specificity for HLA class I determinants of HLA were originally characterized using serological assays in which the targets were cells expressing 3-6 HLA class I variants. Because of this complexity, the specificities of the antibodies were defined indirectly by correlation. Here we use a direct binding assay, in which the targets are synthetic beads coated with one of 111 HLA class I variants, representing the full range of HLA-A, -B and -C variation. We studied one monoclonal antibody with monomorphic specificity (W6/32) and four with polymorphic specificity (MA2.1, PA2.1, BB7.2 and BB7.1) and compared the results with those obtained previously. W6/32 reacted with all HLA class I variants. MA2.1 exhibits high specificity for HLA-A*02, -B*57 and -B*58, but also exhibited cross-reactivity with HLA-A*11 and -B*15:16. At low concentration (1μg/ml) PA2.1 and BB7.2 were both specific for HLA-A*02 and -A*69, and at high concentration (50μg/ml) exhibited significant cross-reactions with HLA-A*68, -A*23, and -A*24. BB7.1 exhibits specificity for HLA-B*07 and -B*42, as previously described, but reacts equally well with HLA-B*81, a rare allotype defined some 16 years after the description of BB7.1. The results obtained with cell-based and bead-based assays are consistent and, in combination with amino acid sequence comparison, increase understanding of the polymorphic epitopes recognized by the MA2.1, PA2.1, BB7.2 and BB7.1 antibodies. Comparison of two overlapping but distinctive bead sets from two sources gave similar results, but the overall levels of binding were significantly different. Several weaker reactions were observed with only one of the bead sets. PMID:23510417
Mikstacki, Adam; Skrzypczak-Zielinska, Marzena; Zakerska-Banaszak, Oliwia; Tamowicz, Barbara; Skibinska, Maria; Molinska-Glura, Marta; Szalata, Marlena; Slomski, Ryszard
2016-05-14
The serum glutathione S-transferase alpha (α-GST) concentration has been used as a marker of hepatic condition. After sevoflurane anaesthesia a mild impairment of hepatocellular integrity was observed. Genetic polymorphisms in CYP2E1, GSTA1 and GSTP1 genes, affecting enzymes activity, may possibly influence the hepatotoxic effect of sevoflurane. The aim of this study was to assess the influence of genetic polymorphism of CYP2E1, GSTA1 and GSTP1 genes on serum α-GST level in 86 unrelated patients representing ASA physical status I-II, undergoing laryngological surgery under general anaesthesia with sevoflurane. The serum samples from three perioperative time points were analyzed using ELISA. Genetic variants were detected by pyrosequencing and sequencing. Finally, the statistical associations between serum α-GST concentration and analyzed alleles of CYP2E1, GSTP1 and GSTA1 genes were estimated. The allele GSTA1*B (-567G, -69T, -52A) frequency was 0.43, whereas the alleles c.313G and c.341T of GSTP1 were identified with frequencies of 0.28 and 0.1 respectively. The -1053T allele of the CYP2E1 gene was observed with 0.01 frequency. We found serum α-GST concentrations in homozygous changes c.313A>G and c.341C>T of the GSTP1 gene significantly higher at the end of anaesthesia as compared with the levels at pre-anaesthetic and 24 h post-anaesthetic time points. Moreover, GSTA1 wild type genotype was associated with increased α-GST concentration at 24 h after the end of anaesthesia. GSTP1 gene polymorphism has an impact on the perioperative serum α-GST concentration in patients undergoing sevoflurane anaesthesia. A similar association, although not statistically significant exists between GSTA1 gene variants and perioperative serum α-GST level.
Tavares, Rita C F; Feldner, Ana C C A; Pinho, João R R; Uehara, Silvia N O; Emori, Christini T; Carvalho-Filho, Roberto J; Silva, Ivonete S S; Santana, Rúbia A F; de Castro, Vanessa F D; Castoli, Gregório T F; Cristovão, Charliana U; Ferraz, Maria L C G
2017-07-01
Background NS3 protease inhibitors (PIs) were the first direct antiviral agents used for the treatment of hepatitis C virus. The combination of second-wave PIs with other direct antiviral agents enabled the use of interferon-free regimens for chronic kidney disease patients on dialysis and renal transplant (RTx) recipients, populations in which the use of interferon and ribavirin is limited. However, the occurrence of PI resistance-associated variants (RAVs), both baseline and induced by therapy, has resulted in the failure of many treatment strategies. Methods The aim of this study was to estimate the prevalence of PI RAVs and of the Q80K polymorphism in chronic kidney disease patients on hemodialysis and RTx recipients. Direct sequencing of the NS3 protease was performed in 67 patients (32 hemodialysis and 35 RTx).Results RAVs to PIs were detected in 18% of the patients: V55A (9%), V36L (1.5%), T54S (1.5%), S122N (1.5%), I170L (1.5%), and M175L (1.5%). Only 1.5% of the patients carried the Q80K polymorphism. The frequency of these mutations was more than two times higher in patients infected with GT1a (25%) than GT1b (9.7%) (P=0.1). The mutations were detected in 20% of treatment-naive patients and in 15.6% of peginterferon/ribavirin-experienced patients (P=0.64). Furthermore, no mutation that would confer high resistance to PIs was detected.Conclusion The Q80K polymorphism was rare in the population studied. The occurrence of RAVs was common, with predominance in GT1a. However, the variants observed were those associated with a low level of resistance to PIs, facilitating the use of these drugs in this special group of patients.
Zavarella, S; Petrone, A; Zampetti, S; Gueorguiev, M; Spoletini, M; Mein, C A; Leto, G; Korbonits, M; Buzzetti, R
2008-04-01
Previous studies suggested that polymorphisms in the coding region of the preproghrelin were involved in the etiology of obesity and might modulate glucose-induced insulin secretion. We evaluated the association of a new variation, -604C>T, in the promoter region of the ghrelin gene, of Leu72Met (247C>A) and of Gln90Leu (265A>T), all haplotype-tagging single nucleotide polymorphisms (SNPs), with measures of insulin sensitivity in 1420 adult individuals. The three SNPs were genotyped using ABI PRISM 7900 HT Sequence Detection System. We used multiple linear regression analysis for quantitative traits and THESIAS software for haplotype analysis. We observed a protective effect exerted by Met72 variant of Leu72Met SNP on insulin resistance parameters; a significant decreasing trend from Leu/Leu to Leu/Met and to Met/Met homozygous subjects in triglycerides, fasting insulin levels and HOMA-IR index (P=0.02, 0.01 and 0.003, respectively), and, consistently, an increase in ghrelin levels (P=0.003) was found. A significant decrease from CC to TC and to TT genotypes in insulin levels and HOMA-IR index was also detected (P=0.00l for both), but only in subjects homozygous for Leu72, where the protective effect of Met72 was not present. The haplotype analysis results supported the data obtained by the evaluation of each single SNP, showing the highest value of insulin levels and HOMA-IR index in the -604(c)247(c) haplotype intermediate value in -604(T)247(C) and lowest value in -604(C)247(A). Our observations suggest a protective role of the Met72 variant and of -604 T allele in modulating insulin resistance. These SNPs or an unknown functional variant in linkage disequilibrium could increase ghrelin levels and probably insulin sensitivity.
Kawasaki, Eiji; Awata, Takuya; Ikegami, Hiroshi; Kobayashi, Tetsuro; Maruyama, Taro; Nakanishi, Koji; Shimada, Akira; Uga, Miho; Uga, Mho; Kurihara, Susumu; Kawabata, Yumiko; Tanaka, Shoichiro; Kanazawa, Yasuhiko; Lee, Inkyu; Eguchi, Katsumi
2006-03-15
The protein tyrosine phosphatase, nonreceptor 22 gene (PTPN22) maps to human chromosome 1p13.3-p13.1 and encodes an important negative regulator of T-cell activation, lymphoid-specific phosphatase (Lyp). Recently, the minor allele of a single-nucleotide polymorphism (SNP) at nucleotide position 1858 (rs2476601, +1858C > T) was found to be associated with type 1 diabetes. However, the degree of the association is variable among ethnic populations, suggesting the presence of other disease-associated variants in PTPN22. To examine this possibility, we carried out a systemic search for PTPN22 using direct sequencing of PCR-amplified products in the Japanese population. Association and linkage studies were also conducted in 1,690 Japanese samples, 180 Korean samples, and 472 Caucasian samples from 95 nuclear families. We identified five novel SNPs, but not the +1858C > T SNP. Of these two frequent SNPs, -1123G > C, and +2740C > T were in strong linkage disequilibrium (LD), and the -1123G > C promoter SNP was associated with acute-onset but not slow-onset type 1 diabetes in the Japanese population (odds ratio [OR] = 1.42, 95% CI = 1.07-1.89, P = 0.015). This association was observed also in Korean patients with type 1 diabetes (Mantel-Haenszel chi2= 6.543, P = 0.0105, combined OR = 1.41 95% CI = 1.09-1.82). Furthermore, the affected family-based control (AFBAC) association test and the transmission disequilibrium analysis of multiplex families of European descent from the British Diabetes Association (BDA) Warren Repository indicated that the association was stronger in -1123G > C compared to +1858C > T. In conclusion, the type 1 diabetes association with PTPN22 is confirmed, but it cannot be attributed solely to the +1858C > T variant. The promoter -1123G > C SNP is a more likely causative variant in PTPN22. 2006 Wiley-Liss, Inc.
Human Chromosome Y and Haplogroups; introducing YDHS Database.
Tiirikka, Timo; Moilanen, Jukka S
2015-12-01
As the high throughput sequencing efforts generate more biological information, scientists from different disciplines are interpreting the polymorphisms that make us unique. In addition, there is an increasing trend in general public to research their own genealogy, find distant relatives and to know more about their biological background. Commercial vendors are providing analyses of mitochondrial and Y-chromosomal markers for such purposes. Clearly, an easy-to-use free interface to the existing data on the identified variants would be in the interest of general public and professionals less familiar with the field. Here we introduce a novel metadatabase YDHS that aims to provide such an interface for Y-chromosomal DNA (Y-DNA) haplogroups and sequence variants. The database uses ISOGG Y-DNA tree as the source of mutations and haplogroups and by using genomic positions of the mutations the database links them to genes and other biological entities. YDHS contains analysis tools for deeper Y-SNP analysis. YDHS addresses the shortage of Y-DNA related databases. We have tested our database using a set of different cases from literature ranging from infertility to autism. The database is at http://www.semanticgen.net/ydhs Y-chromosomal DNA (Y-DNA) haplogroups and sequence variants have not been in the scientific limelight, excluding certain specialized fields like forensics, mainly because there is not much freely available information or it is scattered in different sources. However, as we have demonstrated Y-SNPs do play a role in various cases on the haplogroup level and it is possible to create a free Y-DNA dedicated bioinformatics resource.
van Riet, Job; Krol, Niels M G; Atmodimedjo, Peggy N; Brosens, Erwin; van IJcken, Wilfred F J; Jansen, Maurice P H M; Martens, John W M; Looijenga, Leendert H; Jenster, Guido; Dubbink, Hendrikus J; Dinjens, Winand N M; van de Werken, Harmen J G
2018-03-01
Exploration and visualization of next-generation sequencing data are crucial for clinical diagnostics. Software allowing simultaneous visualization of multiple regions of interest coupled with dynamic heuristic filtering of genetic aberrations is, however, lacking. Therefore, the authors developed the web application SNPitty that allows interactive visualization and interrogation of variant call format files by using B-allele frequencies of single-nucleotide polymorphisms and single-nucleotide variants, coverage metrics, and copy numbers analysis results. SNPitty displays variant alleles and allelic imbalances with a focus on loss of heterozygosity and copy number variation using genome-wide heterozygous markers and somatic mutations. In addition, SNPitty is capable of generating predefined reports that summarize and highlight disease-specific targets of interest. SNPitty was validated for diagnostic interpretation of somatic events by showcasing a serial dilution series of glioma tissue. Additionally, SNPitty is demonstrated in four cancer-related scenarios encountered in daily clinical practice and on whole-exome sequencing data of peripheral blood from a Down syndrome patient. SNPitty allows detection of loss of heterozygosity, chromosomal and gene amplifications, homozygous or heterozygous deletions, somatic mutations, or any combination thereof in regions or genes of interest. Furthermore, SNPitty can be used to distinguish molecular relationships between multiple tumors from a single patient. On the basis of these data, the authors demonstrate that SNPitty is robust and user friendly in a wide range of diagnostic scenarios. Copyright © 2018 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Best Practices and Joint Calling of the HumanExome BeadChip: The CHARGE Consortium
Grove, Megan L.; Yu, Bing; Cochran, Barbara J.; Haritunians, Talin; Bis, Joshua C.; Taylor, Kent D.; Hansen, Mark; Borecki, Ingrid B.; Cupples, L. Adrienne; Fornage, Myriam; Gudnason, Vilmundur; Harris, Tamara B.; Kathiresan, Sekar; Kraaij, Robert; Launer, Lenore J.; Levy, Daniel; Liu, Yongmei; Mosley, Thomas; Peloso, Gina M.; Psaty, Bruce M.; Rich, Stephen S.; Rivadeneira, Fernando; Siscovick, David S.; Smith, Albert V.; Uitterlinden, Andre; van Duijn, Cornelia M.; Wilson, James G.; O’Donnell, Christopher J.; Rotter, Jerome I.; Boerwinkle, Eric
2013-01-01
Genotyping arrays are a cost effective approach when typing previously-identified genetic polymorphisms in large numbers of samples. One limitation of genotyping arrays with rare variants (e.g., minor allele frequency [MAF] <0.01) is the difficulty that automated clustering algorithms have to accurately detect and assign genotype calls. Combining intensity data from large numbers of samples may increase the ability to accurately call the genotypes of rare variants. Approximately 62,000 ethnically diverse samples from eleven Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium cohorts were genotyped with the Illumina HumanExome BeadChip across seven genotyping centers. The raw data files for the samples were assembled into a single project for joint calling. To assess the quality of the joint calling, concordance of genotypes in a subset of individuals having both exome chip and exome sequence data was analyzed. After exclusion of low performing SNPs on the exome chip and non-overlap of SNPs derived from sequence data, genotypes of 185,119 variants (11,356 were monomorphic) were compared in 530 individuals that had whole exome sequence data. A total of 98,113,070 pairs of genotypes were tested and 99.77% were concordant, 0.14% had missing data, and 0.09% were discordant. We report that joint calling allows the ability to accurately genotype rare variation using array technology when large sample sizes are available and best practices are followed. The cluster file from this experiment is available at www.chargeconsortium.com/main/exomechip. PMID:23874508
D'Avolio, Antonio; De Nicolò, Amedeo; Cusato, Jessica; Ciancio, Alessia; Boglione, Lucio; Strona, Silvia; Cariti, Giuseppe; Troshina, Giulia; Caviglia, Gian Paolo; Smedile, Antonina; Rizzetto, Mario; Di Perri, Giovanni
2013-10-01
Functional variants rs7270101 and rs1127354 of inosine triphosphatase (ITPA) were recently found to protect against ribavirin (RBV)-induced hemolytic anemia. However, no definitive data are yet available on the role of no functional rs6051702 polymorphism. Since a simultaneous evaluation of the three ITPA SNPs for hemolytic anemia has not yet been investigated, we aimed to understand the contribution of each SNPs and its potential clinical use to predict anemia in HCV treated patients. A retrospective analysis included 379 HCV treated patients. The ITPA variants rs6051702, rs7270101 and rs1127354 were genotyped and tested for association with achieving anemia at week 4. We also investigated, using multivariate logistic regression, the impact of each single and paired associated polymorphism on anemia onset. All SNPs were associated with Hb decrease. The carrier of at least one variant allele in the functional ITPA SNPs was associated with a lower decrement of Hb, as compared to patients without a variant allele. In multivariate logistic regression analyses the carrier of a variant allele in the rs6051702/rs1127354 association (OR=0.11, p=1.75×10(-5)) and Hb at baseline (OR=1.51, p=1.21×10(-4)) were independently associated with protection against clinically significant anemia at week 4. All ITPA polymorphisms considered were shown to be significantly associated with anemia onset. A multivariate regression model based on ITPA genetic polymorphisms was developed for predicting the risk of anemia. Considering the characterization of pre-therapy anemia predictors, rs6051702 SNP in association to rs1127354 is more informative in order to avoid this relevant adverse event. Copyright © 2013 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fahrenkrog, Annette M.; Neves, Leandro G.; Resende, Jr., Marcio F. R.
Genome-wide association studies (GWAS) have been used extensively to dissect the genetic regulation of complex traits in plants. These studies have focused largely on the analysis of common genetic variants despite the abundance of rare polymorphisms in several species, and their potential role in trait variation. Here, we conducted the first GWAS in Populus deltoides, a genetically diverse keystone forest species in North America and an important short rotation woody crop for the bioenergy industry. We searched for associations between eight growth and wood composition traits, and common and low-frequency single-nucleotide polymorphisms detected by targeted resequencing of 18 153 genesmore » in a population of 391 unrelated individuals. To increase power to detect associations with low-frequency variants, multiple-marker association tests were used in combination with single-marker association tests. Significant associations were discovered for all phenotypes and are indicative that low-frequency polymorphisms contribute to phenotypic variance of several bioenergy traits. Our results suggest that both common and low-frequency variants need to be considered for a comprehensive understanding of the genetic regulation of complex traits, particularly in species that carry large numbers of rare polymorphisms. Lastly, these polymorphisms may be critical for the development of specialized plant feedstocks for bioenergy.« less
Zheng, Miao-Miao; Yue, Li-Jie; Chen, Xiao-Wen; Wen, Fei-Qiu; Li, Chang-Gang; Yang, Chun-Lan; Xie, Cai; Ding, Hui
2013-03-01
To study the association between methylenetetrahydrofolate reductase (MTHFR) gene polymorphisms and toxicities after high-dose methotrexate (HD-MTX) infusion in children with acute lymphocytic leukemia (ALL). MTHFR variants in 52 children with ALL were determined by reverse transcriptase-polymerase chain reaction-denaturing gradient gel electrophoresis and sequencing. Toxicities of children who received HD-MTX chemotherapy were evaluated according to the National Cancer Institute-Common Toxicity Criteria (NCI-CTC). The children carrying MTHFR 1298AC had a higher risk of developing thrombocytopenia compared with the carriers of the 1298 AA genotype (OR=13.7, 95%CI=1.18-159.36, P=0.036). There was no significant difference in HD-MTX chemotherapy-related adverse effects between the patients with different MTHFR C677T or G1793A genotypes. MTHFR A1298C polymorohism may associate with the toxicity of HD-MTX chemotherapy in children with ALL.
Olteanu, Horatiu; Munson, Troy; Banerjee, Ruma
2002-11-12
Methionine synthase reductase (MSR) catalyzes the conversion of the inactive form of human methionine synthase to the active state of the enzyme. This reaction is of paramount physiological importance since methionine synthase is an essential enzyme that plays a key role in the methionine and folate cycles. A common polymorphism in human MSR has been identified (66A --> G) that leads to replacement of isoleucine with methionine at residue 22 and has an allele frequency of 0.5. Another polymorphism is 524C --> T, which leads to the substitution of serine 175 with leucine, but its allele frequency is not known. The I22M polymorphism is a genetic determinant for mild hyperhomocysteinemia, a risk factor for cardiovascular disease. In this study, we have examined the kinetic properties of the M22/S175 and I22/S175 and the I22/L175 and I22/S175 pairs of variants. EPR spectra of the semiquinone forms of variants I22/S175 and M22/S175 are indistinguishable and exhibit an isotropic signal at g = 2.00. In addition, the electronic absorption and reduction stoichiometries with NADPH are identical in these variants. Significantly, the variants activate methionine synthase with the same V(max); however, a 3-4-fold higher ratio of MSR to methionine synthase is required to elicit maximal activity with the M22/S175 and I22/L175 variant versus the I22/S175 enzyme. Differences are also observed between the variants in the efficacies of reduction of the artificial electron acceptors: ferricyanide, 2,6-dichloroindophenol, 3-acetylpyridine adenine dinucleotide phosphate, menadione, and the anticancer drug doxorubicin. These results reveal differences in the interactions between the natural and artificial electron acceptors and MSR variants in vitro, which are predicted to result in less efficient reductive repair of methionine synthase in vivo.
Mining sequence variations in representative polyploid sugarcane germplasm accessions
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Xiping; Song, Jian; You, Qian
Sugarcane (Saccharum spp.) is one of the most important economic crops because of its high sugar production and biofuel potential. Due to the high polyploid level and complex genome of sugarcane, it has been a huge challenge to investigate genomic sequence variations, which are critical for identifying alleles contributing to important agronomic traits. In order to mine the genetic variations in sugarcane, genotyping by sequencing (GBS), was used to genotype 14 representative Saccharum complex accessions. GBS is a method to generate a large number of markers, enabled by next generation sequencing (NGS) and the genome complexity reduction using restriction enzymes.more » To use GBS for high throughput genotyping highly polyploid sugarcane, the GBS analysis pipelines in 14 Saccharum complex accessions were established by evaluating different alignment methods, sequence variants callers, and sequence depth for single nucleotide polymorphism (SNP) filtering. By using the established pipeline, a total of 76,251 non-redundant SNPs, 5642 InDels, 6380 presence/absence variants (PAVs), and 826 copy number variations (CNVs) were detected among the 14 accessions. In addition, non-reference based universal network enabled analysis kit and Stacks de novo called 34,353 and 109,043 SNPs, respectively. In the 14 accessions, the percentages of single dose SNPs ranged from 38.3% to 62.3% with an average of 49.6%, much more than the portions of multiple dosage SNPs. Concordantly called SNPs were used to evaluate the phylogenetic relationship among the 14 accessions. The results showed that the divergence time between the Erianthus genus and the Saccharum genus was more than 10 million years ago (MYA). The Saccharum species separated from their common ancestors ranging from 0.19 to 1.65 MYA. The GBS pipelines including the reference sequences, alignment methods, sequence variant callers, and sequence depth were recommended and discussed for the Saccharum complex and other related species. A large number of sequence variations were discovered in the Saccharum complex, including SNPs, InDels, PAVs, and CNVs. Genome-wide SNPs were further used to illustrate sequence features of polyploid species and demonstrated the divergence of different species in the Saccharum complex. The results of this study showed that GBS was an effective NGS-based method to discover genomic sequence variations in highly polyploid and heterozygous species.« less
Mining sequence variations in representative polyploid sugarcane germplasm accessions
Yang, Xiping; Song, Jian; You, Qian; ...
2017-08-09
Sugarcane (Saccharum spp.) is one of the most important economic crops because of its high sugar production and biofuel potential. Due to the high polyploid level and complex genome of sugarcane, it has been a huge challenge to investigate genomic sequence variations, which are critical for identifying alleles contributing to important agronomic traits. In order to mine the genetic variations in sugarcane, genotyping by sequencing (GBS), was used to genotype 14 representative Saccharum complex accessions. GBS is a method to generate a large number of markers, enabled by next generation sequencing (NGS) and the genome complexity reduction using restriction enzymes.more » To use GBS for high throughput genotyping highly polyploid sugarcane, the GBS analysis pipelines in 14 Saccharum complex accessions were established by evaluating different alignment methods, sequence variants callers, and sequence depth for single nucleotide polymorphism (SNP) filtering. By using the established pipeline, a total of 76,251 non-redundant SNPs, 5642 InDels, 6380 presence/absence variants (PAVs), and 826 copy number variations (CNVs) were detected among the 14 accessions. In addition, non-reference based universal network enabled analysis kit and Stacks de novo called 34,353 and 109,043 SNPs, respectively. In the 14 accessions, the percentages of single dose SNPs ranged from 38.3% to 62.3% with an average of 49.6%, much more than the portions of multiple dosage SNPs. Concordantly called SNPs were used to evaluate the phylogenetic relationship among the 14 accessions. The results showed that the divergence time between the Erianthus genus and the Saccharum genus was more than 10 million years ago (MYA). The Saccharum species separated from their common ancestors ranging from 0.19 to 1.65 MYA. The GBS pipelines including the reference sequences, alignment methods, sequence variant callers, and sequence depth were recommended and discussed for the Saccharum complex and other related species. A large number of sequence variations were discovered in the Saccharum complex, including SNPs, InDels, PAVs, and CNVs. Genome-wide SNPs were further used to illustrate sequence features of polyploid species and demonstrated the divergence of different species in the Saccharum complex. The results of this study showed that GBS was an effective NGS-based method to discover genomic sequence variations in highly polyploid and heterozygous species.« less
Genetic diversity of Babesia bovis in virulent and attenuated strains.
Mazuz, M L; Molad, T; Fish, L; Leibovitz, B; Wolkomirsky, R; Fleiderovitz, L; Shkap, V
2012-03-01
The aim of this study was to compare the genetic diversity of the single copy Bv80 gene sequences of Babesia bovis in populations of attenuated and virulent parasites. PCR/ RT-PCR followed by cloning and sequence analyses of 4 attenuated and 4 virulent strains were performed. Multiple fragments in the range of 420 to 744 bp were amplified by PCR or RT-PCR. Cloning of the PCR fragments and sequence analyses revealed the presence of mixed subpopulations in either virulent or attenuated parasites with a total of 19 variants with 12 different sequences that differed in number and type of tandem repeats. High levels of intra- and inter-strain diversity of the Bv80 gene, with the presence of mixed populations of parasites were found in both the virulent field isolates and the attenuated vaccine strains. In addition, during the attenuation process, sequence analyses showed changes in the pattern of the parasite subpopulations. Despite high polymorphism found by sequence analyses, the patterns observed and the number of repeats, order, or motifs found could not discriminate between virulent field isolates and attenuated vaccine strains of the parasite.
Association of Amine-Receptor DNA Sequence Variants with Associative Learning in the Honeybee.
Lagisz, Malgorzata; Mercer, Alison R; de Mouzon, Charlotte; Santos, Luana L S; Nakagawa, Shinichi
2016-03-01
Octopamine- and dopamine-based neuromodulatory systems play a critical role in learning and learning-related behaviour in insects. To further our understanding of these systems and resulting phenotypes, we quantified DNA sequence variations at six loci coding octopamine-and dopamine-receptors and their association with aversive and appetitive learning traits in a population of honeybees. We identified 79 polymorphic sequence markers (mostly SNPs and a few insertions/deletions) located within or close to six candidate genes. Intriguingly, we found that levels of sequence variation in the protein-coding regions studied were low, indicating that sequence variation in the coding regions of receptor genes critical to learning and memory is strongly selected against. Non-coding and upstream regions of the same genes, however, were less conserved and sequence variations in these regions were weakly associated with between-individual differences in learning-related traits. While these associations do not directly imply a specific molecular mechanism, they suggest that the cross-talk between dopamine and octopamine signalling pathways may influence olfactory learning and memory in the honeybee.