predicting single nucleotide: Topics by Science.gov

Sample records for predicting single nucleotide

Improved prediction of biochemical recurrence after radical prostatectomy by genetic polymorphisms.

PubMed

Morote, Juan; Del Amo, Jokin; Borque, Angel; Ars, Elisabet; Hernández, Carlos; Herranz, Felipe; Arruza, Antonio; Llarena, Roberto; Planas, Jacques; Viso, María J; Palou, Joan; Raventós, Carles X; Tejedor, Diego; Artieda, Marta; Simón, Laureano; Martínez, Antonio; Rioja, Luis A

2010-08-01

Single nucleotide polymorphisms are inherited genetic variations that can predispose or protect individuals against clinical events. We hypothesized that single nucleotide polymorphism profiling may improve the prediction of biochemical recurrence after radical prostatectomy. We performed a retrospective, multi-institutional study of 703 patients treated with radical prostatectomy for clinically localized prostate cancer who had at least 5 years of followup after surgery. All patients were genotyped for 83 prostate cancer related single nucleotide polymorphisms using a low density oligonucleotide microarray. Baseline clinicopathological variables and single nucleotide polymorphisms were analyzed to predict biochemical recurrence within 5 years using stepwise logistic regression. Discrimination was measured by ROC curve AUC, specificity, sensitivity, predictive values, net reclassification improvement and integrated discrimination index. The overall biochemical recurrence rate was 35%. The model with the best fit combined 8 covariates, including the 5 clinicopathological variables prostate specific antigen, Gleason score, pathological stage, lymph node involvement and margin status, and 3 single nucleotide polymorphisms at the KLK2, SULT1A1 and TLR4 genes. Model predictive power was defined by 80% positive predictive value, 74% negative predictive value and an AUC of 0.78. The model based on clinicopathological variables plus single nucleotide polymorphisms showed significant improvement over the model without single nucleotide polymorphisms, as indicated by 23.3% net reclassification improvement (p = 0.003), integrated discrimination index (p <0.001) and likelihood ratio test (p <0.001). Internal validation proved model robustness (bootstrap corrected AUC 0.78, range 0.74 to 0.82). The calibration plot showed close agreement between biochemical recurrence observed and predicted probabilities. Predicting biochemical recurrence after radical prostatectomy based on clinicopathological data can be significantly improved by including patient genetic information. Copyright (c) 2010 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
Single Nucleotide Polymorphisms Predict Symptom Severity of Autism Spectrum Disorder

ERIC Educational Resources Information Center

Jiao, Yun; Chen, Rong; Ke, Xiaoyan; Cheng, Lu; Chu, Kangkang; Lu, Zuhong; Herskovits, Edward H.

2012-01-01

Autism is widely believed to be a heterogeneous disorder; diagnosis is currently based solely on clinical criteria, although genetic, as well as environmental, influences are thought to be prominent factors in the etiology of most forms of autism. Our goal is to determine whether a predictive model based on single-nucleotide polymorphisms (SNPs)…
Predicted stem-loop structures and variation in nucleotide sequence of 3' noncoding regions among animal calicivirus genomes.

PubMed

Seal, B S; Neill, J D; Ridpath, J F

1994-07-01

Caliciviruses are nonenveloped with a polyadenylated genome of approximately 7.6 kb and a single capsid protein. The "RNA Fold" computer program was used to analyze 3'-terminal noncoding sequences of five feline calicivirus (FCV), rabbit hemorrhagic disease virus (RHDV), and two San Miguel sea lion virus (SMSV) isolates. The FCV 3'-terminal sequences are 40-46 nucleotides in length and 72-91% similar. The FCV sequences were predicted to contain two possible duplex structures and one stem-loop structure with free energies of -2.1 to -18.2 kcal/mole. The RHDV genomic 3'-terminal RNA sequences are 54 nucleotides in length and share 49% sequence similarity to homologous regions of the FCV genome. The RHDV sequence was predicted to form two duplex structures in the 3'-terminal noncoding region with a single stem-loop structure, resembling that of FCV. In contrast, the SMSV 1 and 4 genomic 3'-terminal noncoding sequences were 185 and 182 nucleotides in length, respectively. Ten possible duplex structures were predicted with an average structural free energy of -35 kcal/mole. Sequence similarity between the two SMSV isolates was 75%. Furthermore, extensive cloverleaflike structures are predicted in the 3' noncoding region of the SMSV genome, in contrast to the predicted single stem-loop structures of FCV or RHDV.
Identification and Characterization of Novel Variations in Platelet G-Protein Coupled Receptor (GPCR) Genes in Patients Historically Diagnosed with Type 1 von Willebrand Disease.

PubMed

Stockley, Jacqueline; Nisar, Shaista P; Leo, Vincenzo C; Sabi, Essa; Cunningham, Margaret R; Eikenboom, Jeroen C; Lethagen, Stefan; Schneppenheim, Reinhard; Goodeve, Anne C; Watson, Steve P; Mundell, Stuart J; Daly, Martina E

2015-01-01

The clinical expression of type 1 von Willebrand disease may be modified by co-inheritance of other mild bleeding diatheses. We previously showed that mutations in the platelet P2Y12 ADP receptor gene (P2RY12) could contribute to the bleeding phenotype in patients with type 1 von Willebrand disease. Here we investigated whether variations in platelet G protein-coupled receptor genes other than P2RY12 also contributed to the bleeding phenotype. Platelet G protein-coupled receptor genes P2RY1, F2R, F2RL3, TBXA2R and PTGIR were sequenced in 146 index cases with type 1 von Willebrand disease and the potential effects of identified single nucleotide variations were assessed using in silico methods and heterologous expression analysis. Seven heterozygous single nucleotide variations were identified in 8 index cases. Two single nucleotide variations were detected in F2R; a novel c.-67G>C transversion which reduced F2R transcriptional activity and a rare c.1063C>T transition predicting a p.L355F substitution which did not interfere with PAR1 expression or signalling. Two synonymous single nucleotide variations were identified in F2RL3 (c.402C>G, p.A134 =; c.1029 G>C p.V343 =), both of which introduced less commonly used codons and were predicted to be deleterious, though neither of them affected PAR4 receptor expression. A third single nucleotide variation in F2RL3 (c.65 C>A; p.T22N) was co-inherited with a synonymous single nucleotide variation in TBXA2R (c.6680 C>T, p.S218 =). Expression and signalling of the p.T22N PAR4 variant was similar to wild-type, while the TBXA2R variation introduced a cryptic splice site that was predicted to cause premature termination of protein translation. The enrichment of single nucleotide variations in G protein-coupled receptor genes among type 1 von Willebrand disease patients supports the view of type 1 von Willebrand disease as a polygenic disorder.
Bitterness of the Non-nutritive Sweetener Acesulfame Potassium Varies With Polymorphisms in TAS2R9 and TAS2R31

PubMed Central

2013-01-01

Demand for nonnutritive sweeteners continues to increase due to their ability to provide desirable sweetness with minimal calories. Acesulfame potassium and saccharin are well-studied nonnutritive sweeteners commonly found in food products. Some individuals report aversive sensations from these sweeteners, such as bitter and metallic side tastes. Recent advances in molecular genetics have provided insight into the cause of perceptual differences across people. For example, common alleles for the genes TAS2R9 and TAS2R38 explain variable response to the bitter drugs ofloxacin in vitro and propylthiouracil in vivo. Here, we wanted to determine whether differences in the bitterness of acesulfame potassium could be predicted by common polymorphisms (genetic variants) in bitter taste receptor genes (TAS2Rs). We genotyped participants (n = 108) for putatively functional single nucleotide polymorphisms in 5 TAS2Rs and asked them to rate the bitterness of 25 mM acesulfame potassium on a general labeled magnitude scale. Consistent with prior reports, we found 2 single nucleotide polymorphisms in TAS2R31 were associated with acesulfame potassium bitterness. However, TAS2R9 alleles also predicted additional variation in acesulfame potassium bitterness. Conversely, single nucleotide polymorphisms in TAS2R4, TAS2R38, and near TAS2R16 were not significant predictors. Using 1 single nucleotide polymorphism each from TAS2R9 and TAS2R31, we modeled the simultaneous influence of these single nucleotide polymorphisms on acesulfame potassium bitterness; together, these 2 single nucleotide polymorphisms explained 13.4% of the variance in perceived bitterness. These data suggest multiple polymorphisms within TAS2Rs contribute to the ability to perceive the bitterness from acesulfame potassium. PMID:23599216
Theory of single-molecule controlled rotation experiments, predictions, tests, and comparison with stalling experiments in F1-ATPase.

PubMed

Volkán-Kacsó, Sándor; Marcus, Rudolph A

2016-10-25

A recently proposed chemomechanical group transfer theory of rotary biomolecular motors is applied to treat single-molecule controlled rotation experiments. In these experiments, single-molecule fluorescence is used to measure the binding and release rate constants of nucleotides by monitoring the occupancy of binding sites. It is shown how missed events of nucleotide binding and release in these experiments can be corrected using theory, with F 1 -ATP synthase as an example. The missed events are significant when the reverse rate is very fast. Using the theory the actual rate constants in the controlled rotation experiments and the corrections are predicted from independent data, including other single-molecule rotation and ensemble biochemical experiments. The effective torsional elastic constant is found to depend on the binding/releasing nucleotide, and it is smaller for ADP than for ATP. There is a good agreement, with no adjustable parameters, between the theoretical and experimental results of controlled rotation experiments and stalling experiments, for the range of angles where the data overlap. This agreement is perhaps all the more surprising because it occurs even though the binding and release of fluorescent nucleotides is monitored at single-site occupancy concentrations, whereas the stalling and free rotation experiments have multiple-site occupancy.
dbWGFP: a database and web server of human whole-genome single nucleotide variants and their functional predictions.

PubMed

Wu, Jiaxin; Wu, Mengmeng; Li, Lianshuo; Liu, Zhuo; Zeng, Wanwen; Jiang, Rui

2016-01-01

The recent advancement of the next generation sequencing technology has enabled the fast and low-cost detection of all genetic variants spreading across the entire human genome, making the application of whole-genome sequencing a tendency in the study of disease-causing genetic variants. Nevertheless, there still lacks a repository that collects predictions of functionally damaging effects of human genetic variants, though it has been well recognized that such predictions play a central role in the analysis of whole-genome sequencing data. To fill this gap, we developed a database named dbWGFP (a database and web server of human whole-genome single nucleotide variants and their functional predictions) that contains functional predictions and annotations of nearly 8.58 billion possible human whole-genome single nucleotide variants. Specifically, this database integrates 48 functional predictions calculated by 17 popular computational methods and 44 valuable annotations obtained from various data sources. Standalone software, user-friendly query services and free downloads of this database are available at http://bioinfo.au.tsinghua.edu.cn/dbwgfp. dbWGFP provides a valuable resource for the analysis of whole-genome sequencing, exome sequencing and SNP array data, thereby complementing existing data sources and computational resources in deciphering genetic bases of human inherited diseases. © The Author(s) 2016. Published by Oxford University Press.
Comprehensive thermodynamic analysis of 3′ double-nucleotide overhangs neighboring Watson–Crick terminal base pairs

PubMed Central

O'Toole, Amanda S.; Miller, Stacy; Haines, Nathan; Zink, M. Coleen; Serra, Martin J.

2006-01-01

Thermodynamic parameters are reported for duplex formation of 48 self-complementary RNA duplexes containing Watson–Crick terminal base pairs (GC, AU and UA) with all 16 possible 3′ double-nucleotide overhangs; mimicking the structures of short interfering RNAs (siRNA) and microRNAs (miRNA). Based on nearest-neighbor analysis, the addition of a second dangling nucleotide to a single 3′ dangling nucleotide increases stability of duplex formation up to 0.8 kcal/mol in a sequence dependent manner. Results from this study in conjunction with data from a previous study [A. S. O'Toole, S. Miller and M. J. Serra (2005) RNA, 11, 512.] allows for the development of a refined nearest-neighbor model to predict the influence of 3′ double-nucleotide overhangs on the stability of duplex formation. The model improves the prediction of free energy and melting temperature when tested against five oligomers with various core duplex sequences. Phylogenetic analysis of naturally occurring miRNAs was performed to support our results. Selection of the effector miR strand of the mature miRNA duplex appears to be dependent upon the identity of the 3′ double-nucleotide overhang. Thermodynamic parameters for 3′ single terminal overhangs adjacent to a UA pair are also presented. PMID:16820533
Electron attachment to DNA single strands: gas phase and aqueous solution.

PubMed

Gu, Jiande; Xie, Yaoming; Schaefer, Henry F

2007-01-01

The 2'-deoxyguanosine-3',5'-diphosphate, 2'-deoxyadenosine-3',5'-diphosphate, 2'-deoxycytidine-3',5'-diphosphate and 2'-deoxythymidine-3',5'-diphosphate systems are the smallest units of a DNA single strand. Exploring these comprehensive subunits with reliable density functional methods enables one to approach reasonable predictions of the properties of DNA single strands. With these models, DNA single strands are found to have a strong tendency to capture low-energy electrons. The vertical attachment energies (VEAs) predicted for 3',5'-dTDP (0.17 eV) and 3',5'-dGDP (0.14 eV) indicate that both the thymine-rich and the guanine-rich DNA single strands have the ability to capture electrons. The adiabatic electron affinities (AEAs) of the nucleotides considered here range from 0.22 to 0.52 eV and follow the order 3',5'-dTDP > 3',5'-dCDP > 3',5'-dGDP > 3',5'-dADP. A substantial increase in the AEA is observed compared to that of the corresponding nucleic acid bases and the corresponding nucleosides. Furthermore, aqueous solution simulations dramatically increase the electron attracting properties of the DNA single strands. The present investigation illustrates that in the gas phase, the excess electron is situated both on the nucleobase and on the phosphate moiety for DNA single strands. However, the distribution of the extra negative charge is uneven. The attached electron favors the base moiety for the pyrimidine, while it prefers the 3'-phosphate subunit for the purine DNA single strands. In contrast, the attached electron is tightly bound to the base fragment for the cytidine, thymidine and adenosine nucleotides, while it almost exclusively resides in the vicinity of the 3'-phosphate group for the guanosine nucleotides due to the solvent effects. The comparatively low vertical detachment energies (VDEs) predicted for 3',5'-dADP(-) (0.26 eV) and 3',5'-dGDP(-) (0.32 eV) indicate that electron detachment might compete with reactions having high activation barriers such as glycosidic bond breakage. However, the radical anions of the pyrimidine nucleotides with high VDE are expected to be electronically stable. Thus the base-centered radical anions of the pyrimidine nucleotides might be the possible intermediates for DNA single-strand breakage.
Genetic risk profiling and gene signature modeling to predict risk of complications after IPAA.

PubMed

Sehgal, Rishabh; Berg, Arthur; Polinski, Joseph I; Hegarty, John P; Lin, Zhenwu; McKenna, Kevin J; Stewart, David B; Poritz, Lisa S; Koltun, Walter A

2012-03-01

Severe pouchitis and Crohn's disease-like complications are 2 adverse postoperative complications that confound the success of the IPAA in patients with ulcerative colitis. To date, approximately 83 single nucleotide polymorphisms within 55 genes have been associated with IBD. The aim of this study was to identify single-nucleotide polymorphisms that correlate with complications after IPAA that could be utilized in a gene signature fashion to predict postoperative complications and aid in preoperative surgical decision making. One hundred forty-two IPAA patients were retrospectively classified as "asymptomatic" (n = 104, defined as no Crohn's disease-like complications or severe pouchitis for at least 2 years after IPAA) and compared with a "severe pouchitis" group (n = 12, ≥ 4 episodes pouchitis per year for 2 years including the need for long-term therapy to maintain remission) and a "Crohn's disease-like" group (n = 26, presence of fistulae, pouch inlet stricture, proximal small-bowel disease, or pouch granulomata, occurring at least 6 months after surgery). Genotyping for 83 single-nucleotide polymorphisms previously associated with Crohn's disease and/or ulcerative colitis was performed on a customized Illumina genotyping platform. The top 2 single-nucleotide polymorphisms statistically identified as being independently associated with each of Crohn's disease-like and severe pouchitis were used in a multivariate logistic regression model. These single-nucleotide polymorphisms were then used to create probability equations to predict overall chance of a positive or negative outcome for that complication. The top 2 single-nucleotide polymorphisms for Crohn's disease-like complications were in the 10q21 locus and the gene for PTGER4 (p = 0.006 and 0.007), whereas for severe pouchitis it was NOD2 and TNFSF15 (p = 0.003 and 0.011). Probability equations suggested that the risk of these 2 complications greatly increased with increasing number of risk alleles, going as high as 92% for severe pouchitis and 65% for Crohn's disease-like complications. In this IPAA patient cohort, mutations in the 10q21 locus and the PTGER4 gene were associated with Crohn's disease-like complications, whereas mutations in NOD2 and TNFSF15 correlated with severe pouchitis. Preoperative genetic analysis and use of such gene signatures hold promise for improved preoperative surgical patient selection to minimize these IPAA complications.
Detection of Strand Cleavage And Oxidation Damage Using Model DNA Molecules Captured in a Nanoscale Pore

NASA Technical Reports Server (NTRS)

Vercoutere, W.; Solbrig, A.; DeGuzman, V.; Deamer, D.; Akeson, M.

2003-01-01

We use a biological nano-scale pore to distinguish among individual DNA hairpins that differ by a single site of oxidation or a nick in the sugar-phosphate backbone. In earlier work we showed that the protein ion channel alpha-hemolysin can be used as a detector to distinguish single-stranded from double-stranded DNA, single base pair and single nucleotide differences. This resolution is in part a result of sensitivity to structural changes that influence the molecular dynamics of nucleotides within DNA. The strand cleavage products we examined here included a 5-base-pair (5-bp) hairpin with a 5-prime five-nucleotide overhang, and a complementary five-nucleotide oligomer. These produced predictable shoulder-spike and rapid near-full blockade signatures, respectively. When combined, strand annealing was monitored in real time. The residual current level dropped to a lower discrete level in the shoulder-spike blockade signatures, and the duration lengthened. However, these blockade signatures had a shorter duration than the unmodified l0bp hairpin. To test the pore sensitivity to nucleotide oxidation, we examined a 9-bp hairpin with a terminal 8-oxo-deoxyguanosine (8-oxo-dG), or a penultimate 8-oxo-dG. Each produced blockade signatures that differed from the otherwise identical control 9bp hairpins. This study showed that DNA structure is modified sufficiently by strand cleavage or oxidation damage at a single site to alter in a predictable manner the ionic current blockade signatures produced. This technique improves the ability to assess damage to DNA, and can provide a simple means to help characterize the risks of radiation exposure. It may also provide a method to test radiation protection.
Prediction of peripheral neuropathy in multiple myeloma patients receiving bortezomib and thalidomide: a genetic study based on a single nucleotide polymorphism array.

PubMed

García-Sanz, Ramón; Corchete, Luis Antonio; Alcoceba, Miguel; Chillon, María Carmen; Jiménez, Cristina; Prieto, Isabel; García-Álvarez, María; Puig, Noemi; Rapado, Immaculada; Barrio, Santiago; Oriol, Albert; Blanchard, María Jesús; de la Rubia, Javier; Martínez, Rafael; Lahuerta, Juan José; González Díaz, Marcos; Mateos, María Victoria; San Miguel, Jesús Fernando; Martínez-López, Joaquín; Sarasquete, María Eugenia

2017-12-01

Bortezomib- and thalidomide-based therapies have significantly contributed to improved survival of multiple myeloma (MM) patients. However, treatment-induced peripheral neuropathy (TiPN) is a common adverse event associated with them. Risk factors for TiPN in MM patients include advanced age, prior neuropathy, and other drugs, but there are conflicting results about the role of genetics in predicting the risk of TiPN. Thus, we carried out a genome-wide association study based on more than 300 000 exome single nucleotide polymorphisms in 172 MM patients receiving therapy involving bortezomib and thalidomide. We compared patients developing and not developing TiPN under similar treatment conditions (GEM05MAS65, NCT00443235). The highest-ranking single nucleotide polymorphism was rs45443101, located in the PLCG2 gene, but no significant differences were found after multiple comparison correction (adjusted P = .1708). Prediction analyses, cytoband enrichment, and pathway analyses were also performed, but none yielded any significant findings. A copy number approach was also explored, but this gave no significant results either. In summary, our study did not find a consistent genetic component associated with TiPN under bortezomib and thalidomide therapies that could be used for prediction, which makes clinical judgment essential in the practical management of MM treatment. Copyright © 2016 John Wiley & Sons, Ltd.
Lack of Association Between Polymorphisms in Dopa Decarboxylase and Dopamine Receptor-1 Genes With Childhood Autism in Chinese Han Population.

PubMed

Yu, Hong; Liu, Jun; Yang, Aiping; Yang, Guohui; Yang, Wenjun; Lei, Heyue; Quan, Jianjun; Zhang, Zengyu

2016-04-01

Genetic factors play an important role in childhood autism. This study is to determine the association of single-nucleotide polymorphisms in dopa decarboxylase (DDC) and dopamine receptor-1 (DRD1) genes with childhood autism, in a Chinese Han population. A total of 211 autistic children and 250 age- and gender-matched healthy controls were recruited. The severity of disease was determined by Children Autism Rating Scale scores. TaqMan Probe by real-time polymerase chain reaction was used to determine genotypes and allele frequencies of single-nucleotide polymorphism rs6592961 in DDC and rs251937 in DRD1. Case-control and case-only studies were respectively performed, to determine the contribution of both single-nucleotide polymorphisms to the predisposition of disease and its severity. Our results showed that there was no significant association of the genotypes and allele frequencies of both single-nucleotide polymorphisms concerning childhood autism and its severity. More studies with larger samples are needed to corroborate their predicting roles. © The Author(s) 2015.
Prediction of Adult Dyslipidemia Using Genetic and Childhood Clinical Risk Factors: The Cardiovascular Risk in Young Finns Study.

PubMed

Nuotio, Joel; Pitkänen, Niina; Magnussen, Costan G; Buscot, Marie-Jeanne; Venäläinen, Mikko S; Elo, Laura L; Jokinen, Eero; Laitinen, Tomi; Taittonen, Leena; Hutri-Kähönen, Nina; Lyytikäinen, Leo-Pekka; Lehtimäki, Terho; Viikari, Jorma S; Juonala, Markus; Raitakari, Olli T

2017-06-01

Dyslipidemia is a major modifiable risk factor for cardiovascular disease. We examined whether the addition of novel single-nucleotide polymorphisms for blood lipid levels enhances the prediction of adult dyslipidemia in comparison to childhood lipid measures. Two thousand four hundred and twenty-two participants of the Cardiovascular Risk in Young Finns Study who had participated in 2 surveys held during childhood (in 1980 when aged 3-18 years and in 1986) and at least once in a follow-up study in adulthood (2001, 2007, and 2011) were included. We examined whether inclusion of a lipid-specific weighted genetic risk score based on 58 single-nucleotide polymorphisms for low-density lipoprotein cholesterol, 71 single-nucleotide polymorphisms for high-density lipoprotein cholesterol, and 40 single-nucleotide polymorphisms for triglycerides improved the prediction of adult dyslipidemia compared with clinical childhood risk factors. Adjusting for age, sex, body mass index, physical activity, and smoking in childhood, childhood lipid levels, and weighted genetic risk scores were associated with an increased risk of adult dyslipidemia for all lipids. Risk assessment based on 2 childhood lipid measures and the lipid-specific weighted genetic risk scores improved the accuracy of predicting adult dyslipidemia compared with the approach using only childhood lipid measures for low-density lipoprotein cholesterol (area under the receiver-operating characteristic curve 0.806 versus 0.811; P =0.01) and triglycerides (area under the receiver-operating characteristic curve 0.740 versus area under the receiver-operating characteristic curve 0.758; P <0.01). The overall net reclassification improvement and integrated discrimination improvement were significant for all outcomes. The inclusion of weighted genetic risk scores to lipid-screening programs in childhood could modestly improve the identification of those at highest risk of dyslipidemia in adulthood. © 2017 American Heart Association, Inc.
SEAN: SNP prediction and display program utilizing EST sequence clusters.

PubMed

Huntley, Derek; Baldo, Angela; Johri, Saurabh; Sergot, Marek

2006-02-15

SEAN is an application that predicts single nucleotide polymorphisms (SNPs) using multiple sequence alignments produced from expressed sequence tag (EST) clusters. The algorithm uses rules of sequence identity and SNP abundance to determine the quality of the prediction. A Java viewer is provided to display the EST alignments and predicted SNPs.
Electron attachment to DNA single strands: gas phase and aqueous solution

PubMed Central

Gu, Jiande; Xie, Yaoming; Schaefer, Henry F.

2007-01-01

The 2′-deoxyguanosine-3′,5′-diphosphate, 2′-deoxyadenosine-3′,5′-diphosphate, 2′-deoxycytidine-3′,5′-diphosphate and 2′-deoxythymidine-3′,5′-diphosphate systems are the smallest units of a DNA single strand. Exploring these comprehensive subunits with reliable density functional methods enables one to approach reasonable predictions of the properties of DNA single strands. With these models, DNA single strands are found to have a strong tendency to capture low-energy electrons. The vertical attachment energies (VEAs) predicted for 3′,5′-dTDP (0.17 eV) and 3′,5′-dGDP (0.14 eV) indicate that both the thymine-rich and the guanine-rich DNA single strands have the ability to capture electrons. The adiabatic electron affinities (AEAs) of the nucleotides considered here range from 0.22 to 0.52 eV and follow the order 3′,5′-dTDP > 3′,5′-dCDP > 3′,5′-dGDP > 3′,5′-dADP. A substantial increase in the AEA is observed compared to that of the corresponding nucleic acid bases and the corresponding nucleosides. Furthermore, aqueous solution simulations dramatically increase the electron attracting properties of the DNA single strands. The present investigation illustrates that in the gas phase, the excess electron is situated both on the nucleobase and on the phosphate moiety for DNA single strands. However, the distribution of the extra negative charge is uneven. The attached electron favors the base moiety for the pyrimidine, while it prefers the 3′-phosphate subunit for the purine DNA single strands. In contrast, the attached electron is tightly bound to the base fragment for the cytidine, thymidine and adenosine nucleotides, while it almost exclusively resides in the vicinity of the 3′-phosphate group for the guanosine nucleotides due to the solvent effects. The comparatively low vertical detachment energies (VDEs) predicted for 3′,5′-dADP− (0.26 eV) and 3′,5′-dGDP− (0.32 eV) indicate that electron detachment might compete with reactions having high activation barriers such as glycosidic bond breakage. However, the radical anions of the pyrimidine nucleotides with high VDE are expected to be electronically stable. Thus the base-centered radical anions of the pyrimidine nucleotides might be the possible intermediates for DNA single-strand breakage. PMID:17660189
High-throughput discovery of rare human nucleotide polymorphisms by Ecotilling

PubMed Central

Till, Bradley J.; Zerr, Troy; Bowers, Elisabeth; Greene, Elizabeth A.; Comai, Luca; Henikoff, Steven

2006-01-01

Human individuals differ from one another at only ∼0.1% of nucleotide positions, but these single nucleotide differences account for most heritable phenotypic variation. Large-scale efforts to discover and genotype human variation have been limited to common polymorphisms. However, these efforts overlook rare nucleotide changes that may contribute to phenotypic diversity and genetic disorders, including cancer. Thus, there is an increasing need for high-throughput methods to robustly detect rare nucleotide differences. Toward this end, we have adapted the mismatch discovery method known as Ecotilling for the discovery of human single nucleotide polymorphisms. To increase throughput and reduce costs, we developed a universal primer strategy and implemented algorithms for automated band detection. Ecotilling was validated by screening 90 human DNA samples for nucleotide changes in 5 gene targets and by comparing results to public resequencing data. To increase throughput for discovery of rare alleles, we pooled samples 8-fold and found Ecotilling to be efficient relative to resequencing, with a false negative rate of 5% and a false discovery rate of 4%. We identified 28 new rare alleles, including some that are predicted to damage protein function. The detection of rare damaging mutations has implications for models of human disease. PMID:16893952
Evaluation of single nucleotide polymorphisms in chromosomal regions impacting pregnancy status in cattle

USDA-ARS?s Scientific Manuscript database

Reproductive success is an important component of commercial beef cattle production, and identification of DNA markers with predictive merit for reproductive success would facilitate accurate prediction of mean daughter pregnancy rate, enabling effective selection of bulls to improve female fertilit...
Computational Analysis of Single Nucleotide Polymorphisms Associated with Altered Drug Responsiveness in Type 2 Diabetes

PubMed Central

Costa, Valerio; Federico, Antonio; Pollastro, Carla; Ziviello, Carmela; Cataldi, Simona; Formisano, Pietro; Ciccodicola, Alfredo

2016-01-01

Type 2 diabetes (T2D) is one of the most frequent mortality causes in western countries, with rapidly increasing prevalence. Anti-diabetic drugs are the first therapeutic approach, although many patients develop drug resistance. Most drug responsiveness variability can be explained by genetic causes. Inter-individual variability is principally due to single nucleotide polymorphisms, and differential drug responsiveness has been correlated to alteration in genes involved in drug metabolism (CYP2C9) or insulin signaling (IRS1, ABCC8, KCNJ11 and PPARG). However, most genome-wide association studies did not provide clues about the contribution of DNA variations to impaired drug responsiveness. Thus, characterizing T2D drug responsiveness variants is needed to guide clinicians toward tailored therapeutic approaches. Here, we extensively investigated polymorphisms associated with altered drug response in T2D, predicting their effects in silico. Combining different computational approaches, we focused on the expression pattern of genes correlated to drug resistance and inferred evolutionary conservation of polymorphic residues, computationally predicting the biochemical properties of polymorphic proteins. Using RNA-Sequencing followed by targeted validation, we identified and experimentally confirmed that two nucleotide variations in the CAPN10 gene—currently annotated as intronic—fall within two new transcripts in this locus. Additionally, we found that a Single Nucleotide Polymorphism (SNP), currently reported as intergenic, maps to the intron of a new transcript, harboring CAPN10 and GPR35 genes, which undergoes non-sense mediated decay. Finally, we analyzed variants that fall into non-coding regulatory regions of yet underestimated functional significance, predicting that some of them can potentially affect gene expression and/or post-transcriptional regulation of mRNAs affecting the splicing. PMID:27347941
Optimal design of low-density SNP arrays for genomic prediction: algorithm and applications

USDA-ARS?s Scientific Manuscript database

Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for their optimal design. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optim...

Use of causative variants and SNP weighting in a single-step GBLUP context

USDA-ARS?s Scientific Manuscript database

Much effort has been recently put into identifying causative quantitative trait nucleotides (QTN) in animal breeding, aiming genomic prediction. Among the genomic methods available, single-step GBLUP (ssGBLUP) became the choice because of its simplicity and potentially higher accuracy. When QTN are ...
Predicting stroke through genetic risk functions: the CHARGE Risk Score Project.

PubMed

Ibrahim-Verbaas, Carla A; Fornage, Myriam; Bis, Joshua C; Choi, Seung Hoan; Psaty, Bruce M; Meigs, James B; Rao, Madhu; Nalls, Mike; Fontes, Joao D; O'Donnell, Christopher J; Kathiresan, Sekar; Ehret, Georg B; Fox, Caroline S; Malik, Rainer; Dichgans, Martin; Schmidt, Helena; Lahti, Jari; Heckbert, Susan R; Lumley, Thomas; Rice, Kenneth; Rotter, Jerome I; Taylor, Kent D; Folsom, Aaron R; Boerwinkle, Eric; Rosamond, Wayne D; Shahar, Eyal; Gottesman, Rebecca F; Koudstaal, Peter J; Amin, Najaf; Wieberdink, Renske G; Dehghan, Abbas; Hofman, Albert; Uitterlinden, André G; Destefano, Anita L; Debette, Stephanie; Xue, Luting; Beiser, Alexa; Wolf, Philip A; Decarli, Charles; Ikram, M Arfan; Seshadri, Sudha; Mosley, Thomas H; Longstreth, W T; van Duijn, Cornelia M; Launer, Lenore J

2014-02-01

Beyond the Framingham Stroke Risk Score, prediction of future stroke may improve with a genetic risk score (GRS) based on single-nucleotide polymorphisms associated with stroke and its risk factors. The study includes 4 population-based cohorts with 2047 first incident strokes from 22,720 initially stroke-free European origin participants aged ≥55 years, who were followed for up to 20 years. GRSs were constructed with 324 single-nucleotide polymorphisms implicated in stroke and 9 risk factors. The association of the GRS to first incident stroke was tested using Cox regression; the GRS predictive properties were assessed with area under the curve statistics comparing the GRS with age and sex, Framingham Stroke Risk Score models, and reclassification statistics. These analyses were performed per cohort and in a meta-analysis of pooled data. Replication was sought in a case-control study of ischemic stroke. In the meta-analysis, adding the GRS to the Framingham Stroke Risk Score, age and sex model resulted in a significant improvement in discrimination (all stroke: Δjoint area under the curve=0.016, P=2.3×10(-6); ischemic stroke: Δjoint area under the curve=0.021, P=3.7×10(-7)), although the overall area under the curve remained low. In all the studies, there was a highly significantly improved net reclassification index (P<10(-4)). The single-nucleotide polymorphisms associated with stroke and its risk factors result only in a small improvement in prediction of future stroke compared with the classical epidemiological risk factors for stroke.
OligArch: A software tool to allow artificially expanded genetic information systems (AEGIS) to guide the autonomous self-assembly of long DNA constructs from multiple DNA single strands.

PubMed

Bradley, Kevin M; Benner, Steven A

2014-01-01

Synthetic biologists wishing to self-assemble large DNA (L-DNA) constructs from small DNA fragments made by automated synthesis need fragments that hybridize predictably. Such predictability is difficult to obtain with nucleotides built from just the four standard nucleotides. Natural DNA's peculiar combination of strong and weak G:C and A:T pairs, the context-dependence of the strengths of those pairs, unimolecular strand folding that competes with desired interstrand hybridization, and non-Watson-Crick interactions available to standard DNA, all contribute to this unpredictability. In principle, adding extra nucleotides to the genetic alphabet can improve the predictability and reliability of autonomous DNA self-assembly, simply by increasing the information density of oligonucleotide sequences. These extra nucleotides are now available as parts of artificially expanded genetic information systems (AEGIS), and tools are now available to generate entirely standard DNA from AEGIS DNA during PCR amplification. Here, we describe the OligArch (for "oligonucleotide architecting") software, an application that permits synthetic biologists to engineer optimally self-assembling DNA constructs from both six- and eight-letter AEGIS alphabets. This software has been used to design oligonucleotides that self-assemble to form complete genes from 20 or more single-stranded synthetic oligonucleotides. OligArch is therefore a key element of a scalable and integrated infrastructure for the rapid and designed engineering of biology.
Single nucleotide variations: Biological impact and theoretical interpretation

PubMed Central

Katsonis, Panagiotis; Koire, Amanda; Wilson, Stephen Joseph; Hsu, Teng-Kuei; Lua, Rhonald C; Wilkins, Angela Dawn; Lichtarge, Olivier

2014-01-01

Genome-wide association studies (GWAS) and whole-exome sequencing (WES) generate massive amounts of genomic variant information, and a major challenge is to identify which variations drive disease or contribute to phenotypic traits. Because the majority of known disease-causing mutations are exonic non-synonymous single nucleotide variations (nsSNVs), most studies focus on whether these nsSNVs affect protein function. Computational studies show that the impact of nsSNVs on protein function reflects sequence homology and structural information and predict the impact through statistical methods, machine learning techniques, or models of protein evolution. Here, we review impact prediction methods and discuss their underlying principles, their advantages and limitations, and how they compare to and complement one another. Finally, we present current applications and future directions for these methods in biological research and medical genetics. PMID:25234433
A high-throughput approach to profile RNA structure.

PubMed

Delli Ponti, Riccardo; Marti, Stefanie; Armaos, Alexandros; Tartaglia, Gian Gaetano

2017-03-17

Here we introduce the Computational Recognition of Secondary Structure (CROSS) method to calculate the structural profile of an RNA sequence (single- or double-stranded state) at single-nucleotide resolution and without sequence length restrictions. We trained CROSS using data from high-throughput experiments such as Selective 2΄-Hydroxyl Acylation analyzed by Primer Extension (SHAPE; Mouse and HIV transcriptomes) and Parallel Analysis of RNA Structure (PARS; Human and Yeast transcriptomes) as well as high-quality NMR/X-ray structures (PDB database). The algorithm uses primary structure information alone to predict experimental structural profiles with >80% accuracy, showing high performances on large RNAs such as Xist (17 900 nucleotides; Area Under the ROC Curve AUC of 0.75 on dimethyl sulfate (DMS) experiments). We integrated CROSS in thermodynamics-based methods to predict secondary structure and observed an increase in their predictive power by up to 30%. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
How single nucleotide polymorphism chips will advance our knowledge of factors controlling puberty and aid in selecting replacement beef females

USDA-ARS?s Scientific Manuscript database

The promise of genomic selection is accurate prediction of animals' genetic potential from their genotypes. Simple DNA tests might replace low accuracy predictions for expensive or lowly heritable measures of puberty and fertility based on performance and pedigree. Knowing which DNA variants affec...
[The joint applications of DNA chips and single nucleotide polymorphisms in forensic science].

PubMed

Bai, Peng; Tian, Li; Zhou, Xue-ping

2005-05-01

DNA chip technology, being a new high-technology, shows its vigorous life and rapid growth. Single Nucleotide Polymorphisms (SNPs) is the most common diversity in the human genome. It provides suitable genetic markers which play a key role in disease linkage study, pharmacogenomics, forensic medicine, population evolution and immigration study. Their advantage such as being analyzed with DNA chips technology, is predicted to play an important role in the field of forensic medicine, especially in paternity test and individual identification. This report mainly reviews the characteristics of DNA chip and SNPs, and their joint applications in the practice of forensic medicine.
Prediction of Adulthood Obesity Using Genetic and Childhood Clinical Risk Factors in the Cardiovascular Risk in Young Finns Study.

PubMed

Seyednasrollah, Fatemeh; Mäkelä, Johanna; Pitkänen, Niina; Juonala, Markus; Hutri-Kähönen, Nina; Lehtimäki, Terho; Viikari, Jorma; Kelly, Tanika; Li, Changwei; Bazzano, Lydia; Elo, Laura L; Raitakari, Olli T

2017-06-01

Obesity is a known risk factor for cardiovascular disease. Early prediction of obesity is essential for prevention. The aim of this study is to assess the use of childhood clinical factors and the genetic risk factors in predicting adulthood obesity using machine learning methods. A total of 2262 participants from the Cardiovascular Risk in YFS (Young Finns Study) were followed up from childhood (age 3-18 years) to adulthood for 31 years. The data were divided into training (n=1625) and validation (n=637) set. The effect of known genetic risk factors (97 single-nucleotide polymorphisms) was investigated as a weighted genetic risk score of all 97 single-nucleotide polymorphisms (WGRS97) or a subset of 19 most significant single-nucleotide polymorphisms (WGRS19) using boosting machine learning technique. WGRS97 and WGRS19 were validated using external data (n=369) from BHS (Bogalusa Heart Study). WGRS19 improved the accuracy of predicting adulthood obesity in training (area under the curve [AUC=0.787 versus AUC=0.744, P <0.0001) and validation data (AUC=0.769 versus AUC=0.747, P =0.026). WGRS97 improved the accuracy in training (AUC=0.782 versus AUC=0.744, P <0.0001) but not in validation data (AUC=0.749 versus AUC=0.747, P =0.785). Higher WGRS19 associated with higher body mass index at 9 years and WGRS97 at 6 years. Replication in BHS confirmed our findings that WGRS19 and WGRS97 are associated with body mass index. WGRS19 improves prediction of adulthood obesity. Predictive accuracy is highest among young children (3-6 years), whereas among older children (9-18 years) the risk can be identified using childhood clinical factors. The model is helpful in screening children with high risk of developing obesity. © 2017 American Heart Association, Inc.
CHASM and SNVBox: toolkit for detecting biologically important single nucleotide mutations in cancer.

PubMed

Wong, Wing Chung; Kim, Dewey; Carter, Hannah; Diekhans, Mark; Ryan, Michael C; Karchin, Rachel

2011-08-01

Thousands of cancer exomes are currently being sequenced, yielding millions of non-synonymous single nucleotide variants (SNVs) of possible relevance to disease etiology. Here, we provide a software toolkit to prioritize SNVs based on their predicted contribution to tumorigenesis. It includes a database of precomputed, predictive features covering all positions in the annotated human exome and can be used either stand-alone or as part of a larger variant discovery pipeline. MySQL database, source code and binaries freely available for academic/government use at http://wiki.chasmsoftware.org, Source in Python and C++. Requires 32 or 64-bit Linux system (tested on Fedora Core 8,10,11 and Ubuntu 10), 2.5*≤ Python <3.0*, MySQL server >5.0, 60 GB available hard disk space (50 MB for software and data files, 40 GB for MySQL database dump when uncompressed), 2 GB of RAM.
Predicting the disease of Alzheimer with SNP biomarkers and clinical data using data mining classification approach: decision tree.

PubMed

Erdoğan, Onur; Aydin Son, Yeşim

2014-01-01

Single Nucleotide Polymorphisms (SNPs) are the most common genomic variations where only a single nucleotide differs between individuals. Individual SNPs and SNP profiles associated with diseases can be utilized as biological markers. But there is a need to determine the SNP subsets and patients' clinical data which is informative for the diagnosis. Data mining approaches have the highest potential for extracting the knowledge from genomic datasets and selecting the representative SNPs as well as most effective and informative clinical features for the clinical diagnosis of the diseases. In this study, we have applied one of the widely used data mining classification methodology: "decision tree" for associating the SNP biomarkers and significant clinical data with the Alzheimer's disease (AD), which is the most common form of "dementia". Different tree construction parameters have been compared for the optimization, and the most accurate tree for predicting the AD is presented.
Normalization of Complete Genome Characteristics: Application to Evolution from Primitive Organisms to Homo sapiens.

PubMed

Sorimachi, Kenji; Okayasu, Teiji; Ohhira, Shuji

2015-04-01

Normalized nucleotide and amino acid contents of complete genome sequences can be visualized as radar charts. The shapes of these charts depict the characteristics of an organism's genome. The normalized values calculated from the genome sequence theoretically exclude experimental errors. Further, because normalization is independent of both target size and kind, this procedure is applicable not only to single genes but also to whole genomes, which consist of a huge number of different genes. In this review, we discuss the applications of the normalization of the nucleotide and predicted amino acid contents of complete genomes to the investigation of genome structure and to evolutionary research from primitive organisms to Homo sapiens. Some of the results could never have been obtained from the analysis of individual nucleotide or amino acid sequences but were revealed only after the normalization of nucleotide and amino acid contents was applied to genome research. The discovery that genome structure was homogeneous was obtained only after normalization methods were applied to the nucleotide or predicted amino acid contents of genome sequences. Normalization procedures are also applicable to evolutionary research. Thus, normalization of the contents of whole genomes is a useful procedure that can help to characterize organisms.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Chen, Pei-Chun; Chen, Yen-Ching; Research Center for Gene, Environment, and Human Health, College of Public Health, National Taiwan University, Taiwan

Purpose: To identify germline polymorphisms to predict concurrent chemoradiation therapy (CCRT) response in esophageal cancer patients. Materials and Methods: A total of 139 esophageal cancer patients treated with CCRT (cisplatin-based chemotherapy combined with 40 Gy of irradiation) and subsequent esophagectomy were recruited at the National Taiwan University Hospital between 1997 and 2008. After excluding confounding factors (i.e., females and patients aged {>=}70 years), 116 patients were enrolled to identify single nucleotide polymorphisms (SNPs) associated with specific CCRT responses. Genotyping arrays and mass spectrometry were used sequentially to determine germline polymorphisms from blood samples. These polymorphisms remain stable throughout disease progression,more » unlike somatic mutations from tumor tissues. Two-stage design and additive genetic models were adopted in this study. Results: From the 26 SNPs identified in the first stage, 2 SNPs were found to be significantly associated with CCRT response in the second stage. Single nucleotide polymorphism rs16863886, located between SGPP2 and FARSB on chromosome 2q36.1, was significantly associated with a 3.93-fold increase in pathologic complete response to CCRT (95% confidence interval 1.62-10.30) under additive models. Single nucleotide polymorphism rs4954256, located in ZRANB3 on chromosome 2q21.3, was associated with a 3.93-fold increase in pathologic complete response to CCRT (95% confidence interval 1.57-10.87). The predictive accuracy for CCRT response was 71.59% with these two SNPs combined. Conclusions: This is the first study to identify germline polymorphisms with a high accuracy for predicting CCRT response in the treatment of esophageal cancer.« less
In silico prediction of splice-altering single nucleotide variants in the human genome.

PubMed

Jian, Xueqiu; Boerwinkle, Eric; Liu, Xiaoming

2014-12-16

In silico tools have been developed to predict variants that may have an impact on pre-mRNA splicing. The major limitation of the application of these tools to basic research and clinical practice is the difficulty in interpreting the output. Most tools only predict potential splice sites given a DNA sequence without measuring splicing signal changes caused by a variant. Another limitation is the lack of large-scale evaluation studies of these tools. We compared eight in silico tools on 2959 single nucleotide variants within splicing consensus regions (scSNVs) using receiver operating characteristic analysis. The Position Weight Matrix model and MaxEntScan outperformed other methods. Two ensemble learning methods, adaptive boosting and random forests, were used to construct models that take advantage of individual methods. Both models further improved prediction, with outputs of directly interpretable prediction scores. We applied our ensemble scores to scSNVs from the Catalogue of Somatic Mutations in Cancer database. Analysis showed that predicted splice-altering scSNVs are enriched in recurrent scSNVs and known cancer genes. We pre-computed our ensemble scores for all potential scSNVs across the human genome, providing a whole genome level resource for identifying splice-altering scSNVs discovered from large-scale sequencing studies.
Single nucleotide polymorphisms in multiple sclerosis: disease susceptibility and treatment response biomarkers.

PubMed

Pravica, Vera; Popadic, Dusan; Savic, Emina; Markovic, Milos; Drulovic, Jelena; Mostarica-Stojkovic, Marija

2012-04-01

Multiple sclerosis (MS) is a chronic inflammatory demyelinating and neurodegenerative disease of the central nervous system characterized by unpredictable and variable clinical course. Etiology of MS involves both genetic and environmental factors. New technologies identified genetic polymorphisms associated with MS susceptibility among which immunologically relevant genes are significantly overrepresented. Although individual genes contribute only a small part to MS susceptibility, they might be used as biomarkers, thus helping to identify accurate diagnosis, predict clinical disease course and response to therapy. This review focuses on recent progress in research on MS genetics with special emphasis on the possibility to use single nucleotide polymorphism of candidate genes as biomarkers of susceptibility to disease and response to therapy.
Nucleotide Sequences and Comparison of Two Large Conjugative Plasmids from Different Campylobacter species

DTIC Science & Technology

2004-01-01

alleles have different predicted lengths, e.g. in pCC31, cpp46 starts with ATGATG whereas in pTet this gene starts with only one ATG; in ssb1 , cmgB7 and...homologues in plasmid pVT745 from Actinobacillus actinomycetemcomitans, and a single-stranded DNA-binding protein ssb1 that may coat the single-stranded
SNPs in DNA repair or oxidative stress genes and late subcutaneous fibrosis in patients following single shot partial breast irradiation

PubMed Central

2012-01-01

Background The aim of this study was to evaluate the potential association between single nucleotide polymorphisms related response to radiotherapy injury, such as genes related to DNA repair or enzymes involved in anti-oxidative activities. The paper aims to identify marker genes able to predict an increased risk of late toxicity studying our group of patients who underwent a Single Shot 3D-CRT PBI (SSPBI) after BCS (breast conserving surgery). Methods A total of 57 breast cancer patients who underwent SSPBI were genotyped for SNPs (single nucleotide polymorphisms) in XRCC1, XRCC3, GST and RAD51 by Pyrosequencing technology. Univariate analysis (ORs and 95% CI) was performed to correlate SNPs with the risk of developing ≥ G2 fibrosis or fat necrosis. Results A higher significant risk of developing ≥ G2 fibrosis or fat necrosis in patients with: polymorphic variant GSTP1 (Ile105Val) (OR = 2.9; 95%CI, 0.88-10.14, p = 0.047). Conclusions The presence of some SNPs involved in DNA repair or response to oxidative stress seem to be able to predict late toxicity. Trial Registration ClinicalTrials.gov: NCT01316328 PMID:22272830
Genome-environment associations in sorghum landraces predict adaptive traits

USDA-ARS?s Scientific Manuscript database

Improving environmental adaptation in crops is essential for food security under global change, but phenotyping adaptive traits remains a major bottleneck. If associations between single-nucleotide polymorphism (SNP) alleles and environment of origin in crop landraces reflect adaptation, then these ...
Genome-wide association and genomic prediction identifies associated loci and predicts the sensitivity of Tobacco ringspot virus in soybean plant introduction

USDA-ARS?s Scientific Manuscript database

The genome-wide association study (GWAS) is a useful tool for detecting and characterizing traits of interest including those associated with disease resistance in soybean. The availability of 50,000 single nucleotide polymorphism (SNP) markers (SoySNP50K iSelect BeadChip; www.soybase.org) on 19,652...
Use of single nucleotide polymorphisms in candidate genes associated with daughter pregnancy rate for prediction of genetic merit for reproduction in Holstein cows

USDA-ARS?s Scientific Manuscript database

We evaluated 69 SNPs in genes previously related to fertility and production traits for relationship to daughter pregnancy rate (DPR), cow conception rate (CCR) and heifer conception rate (HCR) in a separate population of Holstein cows grouped according to their predicted transmitting ability for DP...
PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations

PubMed Central

Bendl, Jaroslav; Stourac, Jan; Salanda, Ondrej; Pavelka, Antonin; Wieben, Eric D.; Zendulka, Jaroslav; Brezovsky, Jan; Damborsky, Jiri

2014-01-01

Single nucleotide variants represent a prevalent form of genetic variation. Mutations in the coding regions are frequently associated with the development of various genetic diseases. Computational tools for the prediction of the effects of mutations on protein function are very important for analysis of single nucleotide variants and their prioritization for experimental characterization. Many computational tools are already widely employed for this purpose. Unfortunately, their comparison and further improvement is hindered by large overlaps between the training datasets and benchmark datasets, which lead to biased and overly optimistic reported performances. In this study, we have constructed three independent datasets by removing all duplicities, inconsistencies and mutations previously used in the training of evaluated tools. The benchmark dataset containing over 43,000 mutations was employed for the unbiased evaluation of eight established prediction tools: MAPP, nsSNPAnalyzer, PANTHER, PhD-SNP, PolyPhen-1, PolyPhen-2, SIFT and SNAP. The six best performing tools were combined into a consensus classifier PredictSNP, resulting into significantly improved prediction performance, and at the same time returned results for all mutations, confirming that consensus prediction represents an accurate and robust alternative to the predictions delivered by individual tools. A user-friendly web interface enables easy access to all eight prediction tools, the consensus classifier PredictSNP and annotations from the Protein Mutant Database and the UniProt database. The web server and the datasets are freely available to the academic community at http://loschmidt.chemi.muni.cz/predictsnp. PMID:24453961

The complete genome sequence and genetic analysis of ΦCA82 a novel uncultured microphage from the turkey gastrointestinal system

PubMed Central

2011-01-01

The genomic DNA sequence of a novel enteric uncultured microphage, ΦCA82 from a turkey gastrointestinal system was determined utilizing metagenomics techniques. The entire circular, single-stranded nucleotide sequence of the genome was 5,514 nucleotides. The ΦCA82 genome is quite different from other microviruses as indicated by comparisons of nucleotide similarity, predicted protein similarity, and functional classifications. Only three genes showed significant similarity to microviral proteins as determined by local alignments using BLAST analysis. ORF1 encoded a predicted phage F capsid protein that was phylogenetically most similar to the Microviridae ΦMH2K member's major coat protein. The ΦCA82 genome also encoded a predicted minor capsid protein (ORF2) and putative replication initiation protein (ORF3) most similar to the microviral bacteriophage SpV4. The distant evolutionary relationship of ΦCA82 suggests that the divergence of this novel turkey microvirus from other microviruses may reflect unique evolutionary pressures encountered within the turkey gastrointestinal system. PMID:21714899
Single Nucleotide Polymorphisms of Stemness Genes Predicted to Regulate RNA Splicing, microRNA and Oncogenic Signaling are Associated with Prostate Cancer Survival.

PubMed

Freedman, Jennifer A; Wang, Yanru; Li, Xuechan; Liu, Hongliang; Moorman, Patricia G; George, Daniel J; Lee, Norman H; Hyslop, Terry; Wei, Qingyi; Patierno, Steven R

2018-05-03

Prostate cancer is a clinically and molecularly heterogeneous disease, with variation in outcomes only partially predicted by grade and stage. Additional tools to distinguish indolent from aggressive disease are needed. Phenotypic characteristics of stemness correlate with poor cancer prognosis. Given this correlation, we identified single nucleotide polymorphisms (SNPs) of stemness-related genes and examined their associations with prostate cancer survival. SNPs within stemness-related genes were analyzed for association with overall survival of prostate cancer in the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial. Significant SNPs predicted to be functional were selected for linkage disequilibrium analysis and combined and stratified analyses. Identified SNPs were evaluated for association with gene expression. SNPs of CD44 (rs9666607), ABCC1 (rs35605 and rs212091) and GDF15 (rs1058587) were associated with prostate cancer survival and predicted to be functional. A role for rs9666607 of CD44 and rs35605 of ABCC1 in RNA splicing regulation, rs212091 of ABCC1 in miRNA binding site activity and rs1058587 of GDF15 in causing an amino acid change was predicted. These SNPs represent potential novel prognostic markers for overall survival of prostate cancer and support a contribution of the stemness pathway to prostate cancer patient outcome.
DNA Sequence-Dependent Ionic Currents in Ultra-Small Solid-State Nanopores†

PubMed Central

Comer, Jeffrey

2016-01-01

Measurements of ionic currents through nanopores partially blocked by DNA have emerged as a powerful method for characterization of the DNA nucleotide sequence. Although the effect of the nucleotide sequence on the nanopore blockade current has been experimentally demonstrated, prediction and interpretation of such measurements remain a formidable challenge. Using atomic resolution computational approaches, here we show how the sequence, molecular conformation, and pore geometry affect the blockade ionic current in model solid-state nanopores. We demonstrate that the blockade current from a DNA molecule is determined by the chemical identities and conformations of at least three consecutive nucleotides. We find the blockade currents produced by the nucleotide triplets to vary considerably with their nucleotide sequence despite having nearly identical molecular conformations. Encouragingly, we find blockade current differences as large as 25% for single-base substitutions in ultra small (1.6 nm × 1.1 nm cross section; 2 nm length) solid-state nanopores. Despite the complex dependence of the blockade current on the sequence and conformation of the DNA triplets, we find that, under many conditions, the number of thymine bases is positively correlated with the current, whereas the number of purine bases and the presence of both purine and pyrimidines in the triplet are negatively correlated with the current. Based on these observations, we construct a simple theoretical model that relates the ion current to the base content of a solid-state nanopore. Furthermore, we show that compact conformations of DNA in narrow pores provide the greatest signal-to-noise ratio for single base detection, whereas reduction of the nanopore length increases the ionic current noise. Thus, the sequence dependence of nanopore blockade current can be theoretically rationalized, although the predictions will likely need to be customized for each nanopore type. PMID:27103233
Integrated Cox's model for predicting survival time of glioblastoma multiforme.

PubMed

Ai, Zhibing; Li, Longti; Fu, Rui; Lu, Jing-Min; He, Jing-Dong; Li, Sen

2017-04-01

Glioblastoma multiforme is the most common primary brain tumor and is highly lethal. This study aims to figure out signatures for predicting the survival time of patients with glioblastoma multiforme. Clinical information, messenger RNA expression, microRNA expression, and single-nucleotide polymorphism array data of patients with glioblastoma multiforme were retrieved from The Cancer Genome Atlas. Patients were separated into two groups by using 1 year as a cutoff, and a logistic regression model was used to figure out any variables that can predict whether the patient was able to live longer than 1 year. Furthermore, Cox's model was used to find out features that were correlated with the survival time. Finally, a Cox model integrated the significant clinical variables, messenger RNA expression, microRNA expression, and single-nucleotide polymorphism was built. Although the classification method failed, signatures of clinical features, messenger RNA expression levels, and microRNA expression levels were figured out by using Cox's model. However, no single-nucleotide polymorphisms related to prognosis were found. The selected clinical features were age at initial diagnosis, Karnofsky score, and race, all of which had been suggested to correlate with survival time. Both of the two significant microRNAs, microRNA-221 and microRNA-222, were targeted to p27 Kip1 protein, which implied the important role of p27 Kip1 on the prognosis of glioblastoma multiforme patients. Our results suggested that survival modeling was more suitable than classification to figure out prognostic biomarkers for patients with glioblastoma multiforme. An integrated model containing clinical features, messenger RNA levels, and microRNA expression levels was built, which has the potential to be used in clinics and thus to improve the survival status of glioblastoma multiforme patients.
CHASM and SNVBox: toolkit for detecting biologically important single nucleotide mutations in cancer

PubMed Central

Carter, Hannah; Diekhans, Mark; Ryan, Michael C.; Karchin, Rachel

2011-01-01

Summary: Thousands of cancer exomes are currently being sequenced, yielding millions of non-synonymous single nucleotide variants (SNVs) of possible relevance to disease etiology. Here, we provide a software toolkit to prioritize SNVs based on their predicted contribution to tumorigenesis. It includes a database of precomputed, predictive features covering all positions in the annotated human exome and can be used either stand-alone or as part of a larger variant discovery pipeline. Availability and Implementation: MySQL database, source code and binaries freely available for academic/government use at http://wiki.chasmsoftware.org, Source in Python and C++. Requires 32 or 64-bit Linux system (tested on Fedora Core 8,10,11 and Ubuntu 10), 2.5*≤ Python <3.0*, MySQL server >5.0, 60 GB available hard disk space (50 MB for software and data files, 40 GB for MySQL database dump when uncompressed), 2 GB of RAM. Contact: karchin@jhu.edu Supplementary Information: Supplementary data are available at Bioinformatics online. PMID:21685053
Single nucleotide editing without DNA cleavage using CRISPR/Cas9-deaminase in the sea urchin embryo.

PubMed

Shevidi, Saba; Uchida, Alicia; Schudrowitz, Natalie; Wessel, Gary M; Yajima, Mamiko

2017-12-01

A single base pair mutation in the genome can result in many congenital disorders in humans. The recent gene editing approach using CRISPR/Cas9 has rapidly become a powerful tool to replicate or repair such mutations in the genome. These approaches rely on cleaving DNA, while presenting unexpected risks. In this study, we demonstrate a modified CRISPR/Cas9 system fused to cytosine deaminase (Cas9-DA), which induces a single nucleotide conversion in the genome. Cas9-DA was introduced into sea urchin eggs with sgRNAs targeted for SpAlx1, SpDsh, or SpPks, each of which is critical for skeletogenesis, embryonic axis formation, or pigment formation, respectively. We found that both Cas9 and Cas9-DA edit the genome, and cause predicted phenotypic changes at a similar efficiency. Cas9, however, resulted in significant deletions in the genome centered on the gRNA target sequence, whereas Cas9-DA resulted in single or double nucleotide editing of C to T conversions within the gRNA target sequence. These results suggest that the Cas9-DA approach may be useful for manipulating gene activity with decreased risks of genomic aberrations. Developmental Dynamics 246:1036-1046, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Quantum Point Contact Single-Nucleotide Conductance for DNA and RNA Sequence Identification.

PubMed

Afsari, Sepideh; Korshoj, Lee E; Abel, Gary R; Khan, Sajida; Chatterjee, Anushree; Nagpal, Prashant

2017-11-28

Several nanoscale electronic methods have been proposed for high-throughput single-molecule nucleic acid sequence identification. While many studies display a large ensemble of measurements as "electronic fingerprints" with some promise for distinguishing the DNA and RNA nucleobases (adenine, guanine, cytosine, thymine, and uracil), important metrics such as accuracy and confidence of base calling fall well below the current genomic methods. Issues such as unreliable metal-molecule junction formation, variation of nucleotide conformations, insufficient differences between the molecular orbitals responsible for single-nucleotide conduction, and lack of rigorous base calling algorithms lead to overlapping nanoelectronic measurements and poor nucleotide discrimination, especially at low coverage on single molecules. Here, we demonstrate a technique for reproducible conductance measurements on conformation-constrained single nucleotides and an advanced algorithmic approach for distinguishing the nucleobases. Our quantum point contact single-nucleotide conductance sequencing (QPICS) method uses combed and electrostatically bound single DNA and RNA nucleotides on a self-assembled monolayer of cysteamine molecules. We demonstrate that by varying the applied bias and pH conditions, molecular conductance can be switched ON and OFF, leading to reversible nucleotide perturbation for electronic recognition (NPER). We utilize NPER as a method to achieve >99.7% accuracy for DNA and RNA base calling at low molecular coverage (∼12×) using unbiased single measurements on DNA/RNA nucleotides, which represents a significant advance compared to existing sequencing methods. These results demonstrate the potential for utilizing simple surface modifications and existing biochemical moieties in individual nucleobases for a reliable, direct, single-molecule, nanoelectronic DNA and RNA nucleotide identification method for sequencing.
Single Color Multiplexed ddPCR Copy Number Measurements and Single Nucleotide Variant Genotyping.

PubMed

Wood-Bouwens, Christina M; Ji, Hanlee P

2018-01-01

Droplet digital PCR (ddPCR) allows for accurate quantification of genetic events such as copy number variation and single nucleotide variants. Probe-based assays represent the current "gold-standard" for detection and quantification of these genetic events. Here, we introduce a cost-effective single color ddPCR assay that allows for single genome resolution quantification of copy number and single nucleotide variation.
Genome-wide patterns of recombination, linkage disequilibrium and nucleotide diversity from pooled resequencing and single nucleotide polymorphism genotyping unlock the evolutionary history of Eucalyptus grandis.

PubMed

Silva-Junior, Orzenil B; Grattapaglia, Dario

2015-11-01

We used high-density single nucleotide polymorphism (SNP) data and whole-genome pooled resequencing to examine the landscape of population recombination (ρ) and nucleotide diversity (ϴw ), assess the extent of linkage disequilibrium (r(2) ) and build the highest density linkage maps for Eucalyptus. At the genome-wide level, linkage disequilibrium (LD) decayed within c. 4-6 kb, slower than previously reported from candidate gene studies, but showing considerable variation from absence to complete LD up to 50 kb. A sharp decrease in the estimate of ρ was seen when going from short to genome-wide inter-SNP distances, highlighting the dependence of this parameter on the scale of observation adopted. Recombination was correlated with nucleotide diversity, gene density and distance from the centromere, with hotspots of recombination enriched for genes involved in chemical reactions and pathways of the normal metabolic processes. The high nucleotide diversity (ϴw = 0.022) of E. grandis revealed that mutation is more important than recombination in shaping its genomic diversity (ρ/ϴw = 0.645). Chromosome-wide ancestral recombination graphs allowed us to date the split of E. grandis (1.7-4.8 million yr ago) and identify a scenario for the recent demographic history of the species. Our results have considerable practical importance to Genome Wide Association Studies (GWAS), while indicating bright prospects for genomic prediction of complex phenotypes in eucalypt breeding. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Single nucleotide polymorphisms in candidate genes related to daughter pregnancy rate in Holstein cows

USDA-ARS?s Scientific Manuscript database

ABSTRACT: Previously, a candidate gene approach identified 40 SNPs associated with daughter pregnancy rate (DPR) in dairy bulls. We evaluated 39 of these SNPs for relationship to DPR in a separate population of Holstein cows grouped on their predicted transmitting ability for DPR: <= -1 (n=1266) a...
Prediction of maize phenotype based on whole-genome single nucleotide polymorphisms using deep belief networks

NASA Astrophysics Data System (ADS)

Rachmatia, H.; Kusuma, W. A.; Hasibuan, L. S.

2017-05-01

Selection in plant breeding could be more effective and more efficient if it is based on genomic data. Genomic selection (GS) is a new approach for plant-breeding selection that exploits genomic data through a mechanism called genomic prediction (GP). Most of GP models used linear methods that ignore effects of interaction among genes and effects of higher order nonlinearities. Deep belief network (DBN), one of the architectural in deep learning methods, is able to model data in high level of abstraction that involves nonlinearities effects of the data. This study implemented DBN for developing a GP model utilizing whole-genome Single Nucleotide Polymorphisms (SNPs) as data for training and testing. The case study was a set of traits in maize. The maize dataset was acquisitioned from CIMMYT’s (International Maize and Wheat Improvement Center) Global Maize program. Based on Pearson correlation, DBN is outperformed than other methods, kernel Hilbert space (RKHS) regression, Bayesian LASSO (BL), best linear unbiased predictor (BLUP), in case allegedly non-additive traits. DBN achieves correlation of 0.579 within -1 to 1 range.
Cumulative Risk on the Oxytocin Receptor Gene (OXTR) Predicts Empathic Communication by Physician Assistant Students.

PubMed

Floyd, Kory; Generous, Mark Alan; Clark, Lou; McLeod, Ian; Simon, Albert

2017-10-01

In the relationship between patients and health care providers, few communicative features are as significant as the providers' ability to express empathy. A robust empirical literature describes the importance of physician communication skills-particularly those that convey empathy-yet few studies have examined empathic communication by physician assistants, who provide primary care for an increasing number of Americans. The present study examines the empathic communication of physician assistant students in interactions with standardized patients. Over a 6-month period, each student conducted three clinical interviews, each of which was evaluated for empathic communication by the patients, the students' clinical instructors, and third-party observers. Students also provided saliva samples for genotyping six single-nucleotide polymorphisms on the oxytocin receptor gene (OXTR) that are linked empirically to empathic behavior. Consistent with recent research, this study adopted a cumulative risk approach wherein students were scored for their number of risky alleles on the single-nucleotide polymorphisms. Results indicated that cumulative risk on OXTR receptor gene predicted lower patient empathy scores as rated by instructors and observers, but not by standardized patients.
Germline contamination and leakage in whole genome somatic single nucleotide variant detection.

PubMed

Sendorek, Dorota H; Caloian, Cristian; Ellrott, Kyle; Bare, J Christopher; Yamaguchi, Takafumi N; Ewing, Adam D; Houlahan, Kathleen E; Norman, Thea C; Margolin, Adam A; Stuart, Joshua M; Boutros, Paul C

2018-01-31

The clinical sequencing of cancer genomes to personalize therapy is becoming routine across the world. However, concerns over patient re-identification from these data lead to questions about how tightly access should be controlled. It is not thought to be possible to re-identify patients from somatic variant data. However, somatic variant detection pipelines can mistakenly identify germline variants as somatic ones, a process called "germline leakage". The rate of germline leakage across different somatic variant detection pipelines is not well-understood, and it is uncertain whether or not somatic variant calls should be considered re-identifiable. To fill this gap, we quantified germline leakage across 259 sets of whole-genome somatic single nucleotide variant (SNVs) predictions made by 21 teams as part of the ICGC-TCGA DREAM Somatic Mutation Calling Challenge. The median somatic SNV prediction set contained 4325 somatic SNVs and leaked one germline polymorphism. The level of germline leakage was inversely correlated with somatic SNV prediction accuracy and positively correlated with the amount of infiltrating normal cells. The specific germline variants leaked differed by tumour and algorithm. To aid in quantitation and correction of leakage, we created a tool, called GermlineFilter, for use in public-facing somatic SNV databases. The potential for patient re-identification from leaked germline variants in somatic SNV predictions has led to divergent open data access policies, based on different assessments of the risks. Indeed, a single, well-publicized re-identification event could reshape public perceptions of the values of genomic data sharing. We find that modern somatic SNV prediction pipelines have low germline-leakage rates, which can be further reduced, especially for cloud-sharing, using pre-filtering software.
Increased genomic prediction accuracy in wheat breeding through spatial adjustment of field trial data.

PubMed

Lado, Bettina; Matus, Ivan; Rodríguez, Alejandra; Inostroza, Luis; Poland, Jesse; Belzile, François; del Pozo, Alejandro; Quincke, Martín; Castro, Marina; von Zitzewitz, Jarislav

2013-12-09

In crop breeding, the interest of predicting the performance of candidate cultivars in the field has increased due to recent advances in molecular breeding technologies. However, the complexity of the wheat genome presents some challenges for applying new technologies in molecular marker identification with next-generation sequencing. We applied genotyping-by-sequencing, a recently developed method to identify single-nucleotide polymorphisms, in the genomes of 384 wheat (Triticum aestivum) genotypes that were field tested under three different water regimes in Mediterranean climatic conditions: rain-fed only, mild water stress, and fully irrigated. We identified 102,324 single-nucleotide polymorphisms in these genotypes, and the phenotypic data were used to train and test genomic selection models intended to predict yield, thousand-kernel weight, number of kernels per spike, and heading date. Phenotypic data showed marked spatial variation. Therefore, different models were tested to correct the trends observed in the field. A mixed-model using moving-means as a covariate was found to best fit the data. When we applied the genomic selection models, the accuracy of predicted traits increased with spatial adjustment. Multiple genomic selection models were tested, and a Gaussian kernel model was determined to give the highest accuracy. The best predictions between environments were obtained when data from different years were used to train the model. Our results confirm that genotyping-by-sequencing is an effective tool to obtain genome-wide information for crops with complex genomes, that these data are efficient for predicting traits, and that correction of spatial variation is a crucial ingredient to increase prediction accuracy in genomic selection models.
Prediction for Intravenous Immunoglobulin Resistance by Using Weighted Genetic Risk Score Identified From Genome-Wide Association Study in Kawasaki Disease.

PubMed

Kuo, Ho-Chang; Wong, Henry Sung-Ching; Chang, Wei-Pin; Chen, Ben-Kuen; Wu, Mei-Shin; Yang, Kuender D; Hsieh, Kai-Sheng; Hsu, Yu-Wen; Liu, Shih-Feng; Liu, Xiao; Chang, Wei-Chiao

2017-10-01

Intravenous immunoglobulin (IVIG) is the treatment of choice in Kawasaki disease (KD). IVIG is used to prevent cardiovascular complications related to KD. However, a proportion of KD patients have persistent fever after IVIG treatment and are defined as IVIG resistant. To develop a risk scoring system based on genetic markers to predict IVIG responsiveness in KD patients, a total of 150 KD patients (126 IVIG responders and 24 IVIG nonresponders) were recruited for this study. A genome-wide association analysis was performed to compare the 2 groups and identified risk alleles for IVIG resistance. A weighted genetic risk score was calculated by the natural log of the odds ratio multiplied by the number of risk alleles. Eleven single-nucleotide polymorphisms were identified by genome-wide association study. The KD patients were categorized into 3 groups based on their calculated weighted genetic risk score. Results indicated a significant association between weighted genetic risk score (groups 3 and 4 versus group 1) and the response to IVIG (Fisher's exact P value 4.518×10 - 03 and 8.224×10 - 10 , respectively). This is the first weighted genetic risk score study based on a genome-wide association study in KD. The predictive model integrated the additive effects of all 11 single-nucleotide polymorphisms to provide a prediction of the responsiveness to IVIG. © 2017 The Authors.
Genomewide predictions from maize single-cross data.

PubMed

Massman, Jon M; Gordillo, Andres; Lorenzana, Robenzon E; Bernardo, Rex

2013-01-01

Maize (Zea mays L.) breeders evaluate many single-cross hybrids each year in multiple environments. Our objective was to determine the usefulness of genomewide predictions, based on marker effects from maize single-cross data, for identifying the best untested single crosses and the best inbreds within a biparental cross. We considered 479 experimental maize single crosses between 59 Iowa Stiff Stalk Synthetic (BSSS) inbreds and 44 non-BSSS inbreds. The single crosses were evaluated in multilocation experiments from 2001 to 2009 and the BSSS and non-BSSS inbreds had genotypic data for 669 single nucleotide polymorphism (SNP) markers. Single-cross performance was predicted by a previous best linear unbiased prediction (BLUP) approach that utilized marker-based relatedness and information on relatives, and from genomewide marker effects calculated by ridge-regression BLUP (RR-BLUP). With BLUP, the mean prediction accuracy (r(MG)) of single-cross performance was 0.87 for grain yield, 0.90 for grain moisture, 0.69 for stalk lodging, and 0.84 for root lodging. The BLUP and RR-BLUP models did not lead to r(MG) values that differed significantly. We then used the RR-BLUP model, developed from single-cross data, to predict the performance of testcrosses within 14 biparental populations. The r(MG) values within each testcross population were generally low and were often negative. These results were obtained despite the above-average level of linkage disequilibrium, i.e., r(2) between adjacent markers of 0.35 in the BSSS inbreds and 0.26 in the non-BSSS inbreds. Overall, our results suggested that genomewide marker effects estimated from maize single crosses are not advantageous (cofmpared with BLUP) for predicting single-cross performance and have erratic usefulness for predicting testcross performance within a biparental cross.
Oxytocin Receptor (OXTR) Single Nucleotide Polymorphisms Indirectly Predict Prosocial Behavior Through Perspective Taking and Empathic Concern.

PubMed

Christ, Christa C; Carlo, Gustavo; Stoltenberg, Scott F

2016-04-01

Engaging in prosocial behavior can provide positive outcomes for self and others. Prosocial tendencies contribute to the propensity to engage in prosocial behavior. The oxytocin receptor gene (OXTR) has also been associated with prosocial tendencies and behaviors. There has been little research, however, investigating whether the relationship between OXTR and prosocial behaviors is mediated by prosocial tendencies. This relationship may also vary among different types of prosocial behavior. The current study examines the relationship between OXTR, gender, prosocial tendencies, and both altruistic and public prosocial behavior endorsement. Students at a midwestern university (N = 398; 89.2% Caucasian; Mage = 20.76; 26.6% male) provided self-report measures of prosocial tendencies and behaviors and buccal cells for genotyping OXTR polymorphisms. Results indicated that OXTR single nucleotide polymorphism (SNP) rs2268498 genotype significantly predicted empathic concern, whereas gender moderated the association between several other OXTR SNPs and prosocial tendencies. Increased prosocial tendencies predicted increased altruistic prosocial behavior endorsement and decreased public prosocial behavior endorsement. Our findings suggest an association between genetic variation in OXTR and endorsement of prosocial behavior indirectly through prosocial tendencies, and that the pathway is dependent on the type of prosocial behavior and gender. © 2014 Wiley Periodicals, Inc.
Initial evidence that polymorphisms in neurotransmitter-regulating genes contribute to being born small for gestational age

PubMed Central

Morgan, Angharad R.; Thompson, John M.D.; Waldie, Karen E.; Cornforth, Christine M.; Turic, Darko; Sonuga-Barke, Edmund J.S.; Lam, Wen-Jiun; Ferguson, Lynnette R.; Mitchell, Edwin A.

2012-01-01

Being born small for gestational age (SGA) is a putative risk factor for the development of later cognitive and psychiatric health problems. While the inter-uterine environment has been shown to play an important role in predicting birth weight, little is known about the genetic factors that might be important. Here we test the hypothesis that neurotransmitter-regulating genes implicated in psychiatric disorders previously shown to be associated with SGA (such as attention-deficit hyperactivity disorder) are themselves predictive of SGA. DNA was collected from 227 SGA and 319 appropriate for gestational age children taking part in the Auckland Birthweight Collaborative Study. Candidate single nucleotide polymorphisms in genes regulating activity within dopamine, serotonin, glutamate and gamma-aminobutyric acid pathways were genotyped. Multiple regression analysis, controlling for potentially confounding factors, supported nominally significant associations between SGA and single nucleotide polymorphisms in COMT, HTR2A, SLC1A1 and SLC6A1. This is the first evidence that genes implicated in psychiatric disorders previously linked to SGA status themselves predict SGA. This highlights the possibility that the link between SGA and psychiatric disorders such as attention-deficit hyperactivity disorder may in part be genetically determined – that SGA marks pre-existing genetic risk for later problems. PMID:27625810
TP53 and MDM2 single nucleotide polymorphisms influence survival in non-del(5q) myelodysplastic syndromes

PubMed Central

Sallman, David A.; Basiorka, Ashley A.; Irvine, Brittany A.; Zhang, Ling; Epling-Burnette, P.K.; Rollison, Dana E.; Mallo, Mar; Sokol, Lubomir; Solé, Francesc; Maciejewski, Jaroslaw; List, Alan F.

2015-01-01

P53 is a key regulator of many cellular processes and is negatively regulated by the human homolog of murine double minute-2 (MDM2) E3 ubiquitin ligase. Single nucleotide polymorphisms (SNPs) of either gene alone, and in combination, are linked to cancer susceptibility, disease progression, and therapy response. We analyzed the interaction of TP53 R72P and MDM2 SNP309 SNPs in relationship to outcome in patients with myelodysplastic syndromes (MDS). Sanger sequencing was performed on DNA isolated from 208 MDS cases. Utilizing a novel functional SNP scoring system ranging from +2 to −2 based on predicted p53 activity, we found statistically significant differences in overall survival (OS) (p = 0.02) and progression-free survival (PFS) (p = 0.02) in non-del(5q) MDS patients with low functional scores. In univariate analysis, only IPSS and the functional SNP score predicted OS and PFS in non-del(5q) patients. In multivariate analysis, the functional SNP score was independent of IPSS for OS and PFS. These data underscore the importance of TP53 R72P and MDM2 SNP309 SNPs in MDS, and provide a novel scoring system independent of IPSS that is predictive for disease outcome. PMID:26416416
Mismatch repair factor MSH2-MSH3 binds and alters the conformation of branched DNA structures predicted to form during genetic recombination.

PubMed

Surtees, Jennifer A; Alani, Eric

2006-07-14

Genetic studies in Saccharomyces cerevisiae predict that the mismatch repair (MMR) factor MSH2-MSH3 binds and stabilizes branched recombination intermediates that form during single strand annealing and gene conversion. To test this model, we constructed a series of DNA substrates that are predicted to form during these recombination events. We show in an electrophoretic mobility shift assay that S. cerevisiae MSH2-MSH3 specifically binds branched DNA substrates containing 3' single-stranded DNA and that ATP stimulates its release from these substrates. Chemical footprinting analyses indicate that MSH2-MSH3 specifically binds at the double-strand/single-strand junction of branched substrates, alters its conformation and opens up the junction. Therefore, MSH2-MSH3 binding to its substrates creates a unique nucleoprotein structure that may signal downstream steps in repair that include interactions with MMR and nucleotide excision repair factors.

Quantitative Understanding of SHAPE Mechanism from RNA Structure and Dynamics Analysis.

PubMed

Hurst, Travis; Xu, Xiaojun; Zhao, Peinan; Chen, Shi-Jie

2018-05-10

The selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) method probes RNA local structural and dynamic information at single nucleotide resolution. To gain quantitative insights into the relationship between nucleotide flexibility, RNA 3D structure, and SHAPE reactivity, we develop a 3D Structure-SHAPE Relationship model (3DSSR) to rebuild SHAPE profiles from 3D structures. The model starts from RNA structures and combines nucleotide interaction strength and conformational propensity, ligand (SHAPE reagent) accessibility, and base-pairing pattern through a composite function to quantify the correlation between SHAPE reactivity and nucleotide conformational stability. The 3DSSR model shows the relationship between SHAPE reactivity and RNA structure and energetics. Comparisons between the 3DSSR-predicted SHAPE profile and the experimental SHAPE data show correlation, suggesting that the extracted analytical function may have captured the key factors that determine the SHAPE reactivity profile. Furthermore, the theory offers an effective method to sieve RNA 3D models and exclude models that are incompatible with experimental SHAPE data.
Hop stunt viroid: molecular cloning and nucleotide sequence of the complete cDNA copy.

PubMed Central

Ohno, T; Takamatsu, N; Meshi, T; Okada, Y

1983-01-01

The complete cDNA of hop stunt viroid (HSV) has been cloned by the method of Okayama and Berg (Mol.Cell.Biol.2,161-170. (1982] and the complete nucleotide sequence has been established. The covalently closed circular single-stranded HSV RNA consists of 297 nucleotides. The secondary structure predicted for HSV contains 67% of its residues base-paired. The native HSV can possess an extended rod-like structure characteristic of viroids previously established. The central region of the native HSV has a similar structure to the conserved region found in all viroids sequenced so far except for avocado sunblotch viroid. The sequence homologous to the 5'-end of U1a RNA is also found in the sequence of HSV but not in the central conserved region. Images PMID:6312412
Oxytocin and Parent-Child Interaction in the Development of Empathy among Children at Risk for Autism

ERIC Educational Resources Information Center

McDonald, Nicole M.; Baker, Jason K.; Messinger, Daniel S.

2016-01-01

This longitudinal study investigated whether variation in the oxytocin receptor gene (OXTR) and early parent-child interactions predicted later empathic behavior in 84 toddlers at high or low familial risk for autism spectrum disorder. Two well-studied OXTR single-nucleotide polymorphisms, rs53576 and rs2254298, were examined. Parent-child…
GESPA: classifying nsSNPs to predict disease association.

PubMed

Khurana, Jay K; Reeder, Jay E; Shrimpton, Antony E; Thakar, Juilee

2015-07-25

Non-synonymous single nucleotide polymorphisms (nsSNPs) are the most common DNA sequence variation associated with disease in humans. Thus determining the clinical significance of each nsSNP is of great importance. Potential detrimental nsSNPs may be identified by genetic association studies or by functional analysis in the laboratory, both of which are expensive and time consuming. Existing computational methods lack accuracy and features to facilitate nsSNP classification for clinical use. We developed the GESPA (GEnomic Single nucleotide Polymorphism Analyzer) program to predict the pathogenicity and disease phenotype of nsSNPs. GESPA is a user-friendly software package for classifying disease association of nsSNPs. It allows flexibility in acceptable input formats and predicts the pathogenicity of a given nsSNP by assessing the conservation of amino acids in orthologs and paralogs and supplementing this information with data from medical literature. The development and testing of GESPA was performed using the humsavar, ClinVar and humvar datasets. Additionally, GESPA also predicts the disease phenotype associated with a nsSNP with high accuracy, a feature unavailable in existing software. GESPA's overall accuracy exceeds existing computational methods for predicting nsSNP pathogenicity. The usability of GESPA is enhanced by fast SQL-based cloud storage and retrieval of data. GESPA is a novel bioinformatics tool to determine the pathogenicity and phenotypes of nsSNPs. We anticipate that GESPA will become a useful clinical framework for predicting the disease association of nsSNPs. The program, executable jar file, source code, GPL 3.0 license, user guide, and test data with instructions are available at http://sourceforge.net/projects/gespa.
Validation and Interrogation of Differentially Expressed and Alternatively Spliced Genes in African American Prostate Cancer

DTIC Science & Technology

2016-10-01

These analyses have led to two submitted manuscripts. The first manuscript, “Variants of stemness -related genes predicted to regulate RNA splicing...and Table 1-3 at the end of this progress report. The second manuscript, “Single nucleotide polymorphisms of stemness pathway genes predicted to...cancer and support a contribution of the stemness pathway to prostate cancer patient outcome. Please see Figure 5-7 and Table 4-6 at the end of this
Single nucleotide polymorphisms associated with coronary heart disease predict incident ischemic stroke in the atherosclerosis risk in communities study.

PubMed

Morrison, Alanna C; Bare, Lance A; Luke, May M; Pankow, James S; Mosley, Thomas H; Devlin, James J; Willerson, James T; Boerwinkle, Eric

2008-01-01

Ischemic stroke and coronary heart disease (CHD) may share genetic factors contributing to a common etiology. This study investigates whether 51 single nucleotide polymorphisms (SNPs) associated with CHD in multiple antecedent studies are associated with incident ischemic stroke in the Atherosclerosis Risk in Communities (ARIC) study. From the multiethnic ARIC cohort of 14,215 individuals, 495 validated ischemic strokes were identified. Cox proportional hazards models, adjusted for age and gender, identified three SNPs in Whites and two SNPs in Blacks associated with incident stroke (p
Association between Single Nucleotide Polymorphisms of the Major Histocompatibility Complex Class II Gene and Newcastle Disease Virus Titre and Body Weight in Leung Hang Khao Chickens

PubMed Central

Molee, A.; Kongroi, K.; Kuadsantia, P.; Poompramun, C.; Likitdecharote, B.

2016-01-01

The aim of the present study was to investigate the effect of single nucleotide polymorphisms in the major histocompatibility complex (MHC) class II gene on resistance to Newcastle disease virus and body weight of the Thai indigenous chicken, Leung Hang Khao (Gallus gallus domesticus). Blood samples were collected for single nucleotide polymorphism analysis from 485 chickens. Polymerase chain reaction sequencing was used to classify single nucleotide polymorphisms of class II MHC. Body weights were measured at the ages of 3, 4, 5, and 7 months. Titres of Newcastle disease virus at 2 weeks to 7 months were determined and the correlation between body weight and titre was analysed. The association between single nucleotide polymorphisms and body weight and titre were analysed by a generalized linear model. Seven single nucleotide polymorphisms were identified: C125T, A126T, C209G, C242T, A243T, C244T, and A254T. Significant correlations between log titre and body weight were found at 2 and 4 weeks. Associations between single nucleotide polymorphisms and titre were found for C209G and A254T, and between all single nucleotide polymorphisms (except A243T) and body weight. The results showed that class II MHC is associated with both titre of Newcastle disease virus and body weight in Leung Hang Khao chickens. This is of concern because improved growth traits are the main goal of breeding selection. Moreover, the results suggested that MHC has a pleiotropic effect on the titre and growth performance. This mechanism should be investigated in a future study. PMID:26732325
Increased Genomic Prediction Accuracy in Wheat Breeding Through Spatial Adjustment of Field Trial Data

PubMed Central

Lado, Bettina; Matus, Ivan; Rodríguez, Alejandra; Inostroza, Luis; Poland, Jesse; Belzile, François; del Pozo, Alejandro; Quincke, Martín; Castro, Marina; von Zitzewitz, Jarislav

2013-01-01

In crop breeding, the interest of predicting the performance of candidate cultivars in the field has increased due to recent advances in molecular breeding technologies. However, the complexity of the wheat genome presents some challenges for applying new technologies in molecular marker identification with next-generation sequencing. We applied genotyping-by-sequencing, a recently developed method to identify single-nucleotide polymorphisms, in the genomes of 384 wheat (Triticum aestivum) genotypes that were field tested under three different water regimes in Mediterranean climatic conditions: rain-fed only, mild water stress, and fully irrigated. We identified 102,324 single-nucleotide polymorphisms in these genotypes, and the phenotypic data were used to train and test genomic selection models intended to predict yield, thousand-kernel weight, number of kernels per spike, and heading date. Phenotypic data showed marked spatial variation. Therefore, different models were tested to correct the trends observed in the field. A mixed-model using moving-means as a covariate was found to best fit the data. When we applied the genomic selection models, the accuracy of predicted traits increased with spatial adjustment. Multiple genomic selection models were tested, and a Gaussian kernel model was determined to give the highest accuracy. The best predictions between environments were obtained when data from different years were used to train the model. Our results confirm that genotyping-by-sequencing is an effective tool to obtain genome-wide information for crops with complex genomes, that these data are efficient for predicting traits, and that correction of spatial variation is a crucial ingredient to increase prediction accuracy in genomic selection models. PMID:24082033
Bootstrap study of genome-enabled prediction reliabilities using haplotype blocks across Nordic Red cattle breeds.

PubMed

Cuyabano, B C D; Su, G; Rosa, G J M; Lund, M S; Gianola, D

2015-10-01

This study compared the accuracy of genome-enabled prediction models using individual single nucleotide polymorphisms (SNP) or haplotype blocks as covariates when using either a single breed or a combined population of Nordic Red cattle. The main objective was to compare predictions of breeding values of complex traits using a combined training population with haplotype blocks, with predictions using a single breed as training population and individual SNP as predictors. To compare the prediction reliabilities, bootstrap samples were taken from the test data set. With the bootstrapped samples of prediction reliabilities, we built and graphed confidence ellipses to allow comparisons. Finally, measures of statistical distances were used to calculate the gain in predictive ability. Our analyses are innovative in the context of assessment of predictive models, allowing a better understanding of prediction reliabilities and providing a statistical basis to effectively calibrate whether one prediction scenario is indeed more accurate than another. An ANOVA indicated that use of haplotype blocks produced significant gains mainly when Bayesian mixture models were used but not when Bayesian BLUP was fitted to the data. Furthermore, when haplotype blocks were used to train prediction models in a combined Nordic Red cattle population, we obtained up to a statistically significant 5.5% average gain in prediction accuracy, over predictions using individual SNP and training the model with a single breed. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
The pharmacogenetics of body odor: as easy as ABCC?

PubMed

Brown, Sara

2013-07-01

ABCC11 genotype affects apocrine secretory cell function and determines individual body odor phenotype. Rodriguez et al. have applied genetic epidemiology using predetermined phenotype data to demonstrate an association between a single-nucleotide polymorphism (rs17822931) and the human behavior of deodorant application. Individuals with the ABCC11 genotype predicting a nonodorous phenotype report a significantly lower frequency of deodorant use.
A genome-wide association study of calf birth weight in Holstein cattle using single nucleotide polymorphisms and phenotypes predicted from auxiliary traits

USDA-ARS?s Scientific Manuscript database

Previous research has found that there is a QTL affecting calving and conformation traits on Bos taurus (BTA) autosome 18 that may be related to increased calf birth weights, which are not routinely recorded in the US. Birth weight (BW) data from large, intensively managed dairies in eastern German...
Using variable importance measures to identify a small set of single nucleotide polymorphisms capable of predicting heading date in perennial ryegrass

USDA-ARS?s Scientific Manuscript database

Prior knowledge on heading date enables the selection of parents for synthetic cultivars that are well-matched with respect to heading date, which is necessary to ensure plants put together will successfully cross with each other. Heading date of individual plants can be determined directly, which h...
regSNPs-splicing: a tool for prioritizing synonymous single-nucleotide substitution.

PubMed

Zhang, Xinjun; Li, Meng; Lin, Hai; Rao, Xi; Feng, Weixing; Yang, Yuedong; Mort, Matthew; Cooper, David N; Wang, Yue; Wang, Yadong; Wells, Clark; Zhou, Yaoqi; Liu, Yunlong

2017-09-01

While synonymous single-nucleotide variants (sSNVs) have largely been unstudied, since they do not alter protein sequence, mounting evidence suggests that they may affect RNA conformation, splicing, and the stability of nascent-mRNAs to promote various diseases. Accurately prioritizing deleterious sSNVs from a pool of neutral ones can significantly improve our ability of selecting functional genetic variants identified from various genome-sequencing projects, and, therefore, advance our understanding of disease etiology. In this study, we develop a computational algorithm to prioritize sSNVs based on their impact on mRNA splicing and protein function. In addition to genomic features that potentially affect splicing regulation, our proposed algorithm also includes dozens structural features that characterize the functions of alternatively spliced exons on protein function. Our systematical evaluation on thousands of sSNVs suggests that several structural features, including intrinsic disorder protein scores, solvent accessible surface areas, protein secondary structures, and known and predicted protein family domains, show significant differences between disease-causing and neutral sSNVs. Our result suggests that the protein structure features offer an added dimension of information while distinguishing disease-causing and neutral synonymous variants. The inclusion of structural features increases the predictive accuracy for functional sSNV prioritization.
GAPIT: genome association and prediction integrated tool.

PubMed

Lipka, Alexander E; Tian, Feng; Wang, Qishan; Peiffer, Jason; Li, Meng; Bradbury, Peter J; Gore, Michael A; Buckler, Edward S; Zhang, Zhiwu

2012-09-15

Software programs that conduct genome-wide association studies and genomic prediction and selection need to use methodologies that maximize statistical power, provide high prediction accuracy and run in a computationally efficient manner. We developed an R package called Genome Association and Prediction Integrated Tool (GAPIT) that implements advanced statistical methods including the compressed mixed linear model (CMLM) and CMLM-based genomic prediction and selection. The GAPIT package can handle large datasets in excess of 10 000 individuals and 1 million single-nucleotide polymorphisms with minimal computational time, while providing user-friendly access and concise tables and graphs to interpret results. http://www.maizegenetics.net/GAPIT. zhiwu.zhang@cornell.edu Supplementary data are available at Bioinformatics online.
Identification of largemouth bass virus in the introduced Northern Snakehead inhabiting the Chesapeake Bay watershed.

PubMed

Iwanowicz, L; Densmore, C; Hahn, C; McAllister, P; Odenkirk, J

2013-09-01

The Northern Snakehead Channa argus is an introduced species that now inhabits the Chesapeake Bay. During a preliminary survey for introduced pathogens possibly harbored by these fish in Virginia waters, a filterable agent was isolated from five specimens that produced cytopathic effects in BF-2 cells. Based on PCR amplification and partial sequencing of the major capsid protein (MCP), DNA polymerase (DNApol), and DNA methyltransferase (Mtase) genes, the isolates were identified as Largemouth Bass virus (LMBV). Nucleotide sequences of the MCP (492 bp) and DNApol (419 pb) genes were 100% identical to those of LMBV. The nucleotide sequence of the Mtase (206 bp) gene was 99.5% identical to that of LMBV, and the single nucleotide substitution did not lead to a predicted amino acid coding change. This is the first report of LMBV from the Northern Snakehead, and provides evidence that noncentrarchid fishes may be susceptible to this virus.
Identification of largemouth bass virus in the introduced Northern snakehead inhabiting the Cheasapeake Bay watershed

USGS Publications Warehouse

Iwanowicz, Luke R.; Densmore, Christine L.; Hahn, Cassidy M.; McAllister, Phillip; Odenkirk, John

2013-01-01

The Northern Snakehead Channa argus is an introduced species that now inhabits the Chesapeake Bay. During a preliminary survey for introduced pathogens possibly harbored by these fish in Virginia waters, a filterable agent was isolated from five specimens that produced cytopathic effects in BF-2 cells. Based on PCR amplification and partial sequencing of the major capsid protein (MCP), DNA polymerase (DNApol), and DNA methyltransferase (Mtase) genes, the isolates were identified as Largemouth Bass virus (LMBV). Nucleotide sequences of the MCP (492 bp) and DNApol (419 pb) genes were 100% identical to those of LMBV. The nucleotide sequence of the Mtase (206 bp) gene was 99.5% identical to that of LMBV, and the single nucleotide substitution did not lead to a predicted amino acid coding change. This is the first report of LMBV from the Northern Snakehead, and provides evidence that noncentrarchid fishes may be susceptible to this virus.
Functional analysis of regulatory single-nucleotide polymorphisms.

PubMed

Pampín, Sandra; Rodríguez-Rey, José C

2007-04-01

The identification of regulatory polymorphisms has become a key problem in human genetics. In the past few years there has been a conceptual change in the way in which regulatory single-nucleotide polymorphisms are studied. We revise the new approaches and discuss how gene expression studies can contribute to a better knowledge of the genetics of common diseases. New techniques for the association of single-nucleotide polymorphisms with changes in gene expression have been recently developed. This, together with a more comprehensive use of the old in-vitro methods, has produced a great amount of genetic information. When added to current databases, it will help to design better tools for the detection of regulatory single-nucleotide polymorphisms. The identification of functional regulatory single-nucleotide polymorphisms cannot be done by the simple inspection of DNA sequence. In-vivo techniques, based on primer-extension, and the more recently developed 'haploChIP' allow the association of gene variants to changes in gene expression. Gene expression analysis by conventional in-vitro techniques is the only way to identify the functional consequences of regulatory single-nucleotide polymorphisms. The amount of information produced in the last few years will help to refine the tools for the future analysis of regulatory gene variants.
Associations between single nucleotide polymorphisms in multiple candidate genes and body weight in rabbits

PubMed Central

El-Sabrout, Karim; Aggag, Sarah A.

2017-01-01

Aim: In this study, we examined parts of six growth genes (growth hormone [GH], melanocortin 4 receptor [MC4R], growth hormone receptor [GHR], phosphorglycerate mutase [PGAM], myostatin [MSTN], and fibroblast growth factor [FGF]) as specific primers for two rabbit lines (V-line, Alexandria) using nucleotide sequence analysis, to investigate association between detecting single nucleotide polymorphism (SNP) of these genes and body weight (BW) at market. Materials and Methods: Each line kits were grouped into high and low weight rabbits to identify DNA markers useful for association studies with high BW. DNA from blood samples of each group was extracted to amplify the six growth genes. SNP technique was used to study the associate polymorphism in the six growth genes and marketing BW (at 63 days) in the two rabbit lines. The purified polymerase chain reaction products were sequenced in those had the highest and lowest BW in each line. Results: Alignment of sequence data from each group revealed the following SNPs: At nucleotide 23 (A-C) and nucleotide 35 (T-G) in MC4R gene (sense mutation) of Alexandria and V-line high BW. Furthermore, we detected the following SNPs variation between the two lines: A SNP (T-C) at nucleotide 27 was identified by MC4R gene (sense mutation) and another one (A-C) at nucleotide 14 was identified by GHR gene (nonsense mutation) of Alexandria line. The results of individual BW at market (63 days) indicated that Alexandria rabbits had significantly higher BW compared with V-line rabbits. MC4R polymorphism showed significant association with high BW in rabbits. Conclusion: The results of polymorphism demonstrate the possibility to detect an association between BW in rabbits and the efficiency of the used primers to predict through the genetic specificity using the SNP of MC4R. PMID:28246458
Plasmid-encoded hygromycin B resistance: the sequence of hygromycin B phosphotransferase gene and its expression in Escherichia coli and Saccharomyces cerevisiae.

PubMed

Gritz, L; Davies, J

1983-11-01

The plasmid-borne gene hph coding for hygromycin B phosphotransferase (HPH) in Escherichia coli has been identified and its nucleotide sequence determined. The hph gene is 1026 nucleotides long, coding for a protein with a predicted Mr of 39 000. The hph gene was placed in a shuttle plasmid vector, downstream from the promoter region of the cyc 1 gene of Saccharomyces cerevisiae, and an hph construction containing a single AUG in the 5' noncoding region allowed direct selection following transformation in yeast and in E. coli. Thus the hph gene can be used in cloning vectors for both pro- and eukaryotes.
Detecting associated single-nucleotide polymorphisms on the X chromosome in case control genome-wide association studies.

PubMed

Chen, Zhongxue; Ng, Hon Keung Tony; Li, Jing; Liu, Qingzhong; Huang, Hanwen

2017-04-01

In the past decade, hundreds of genome-wide association studies have been conducted to detect the significant single-nucleotide polymorphisms that are associated with certain diseases. However, most of the data from the X chromosome were not analyzed and only a few significant associated single-nucleotide polymorphisms from the X chromosome have been identified from genome-wide association studies. This is mainly due to the lack of powerful statistical tests. In this paper, we propose a novel statistical approach that combines the information of single-nucleotide polymorphisms on the X chromosome from both males and females in an efficient way. The proposed approach avoids the need of making strong assumptions about the underlying genetic models. Our proposed statistical test is a robust method that only makes the assumption that the risk allele is the same for both females and males if the single-nucleotide polymorphism is associated with the disease for both genders. Through simulation study and a real data application, we show that the proposed procedure is robust and have excellent performance compared to existing methods. We expect that many more associated single-nucleotide polymorphisms on the X chromosome will be identified if the proposed approach is applied to current available genome-wide association studies data.

Genome-wide divergence and linkage disequilibrium analyses for Capsicum baccatum revealed by genome-anchored single nucleotide polymorphisms

USDA-ARS?s Scientific Manuscript database

Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to show the distribution of these 2 important incompatible cultivated pepper species. Estimated mean nucleotide...
Genetic polymorphisms in ESR1 and ESR2 genes, and risk of hypospadias in a multiethnic study population.

PubMed

Choudhry, Shweta; Baskin, Laurence S; Lammer, Edward J; Witte, John S; Dasgupta, Sudeshna; Ma, Chen; Surampalli, Abhilasha; Shen, Joel; Shaw, Gary M; Carmichael, Suzan L

2015-05-01

Estrogenic endocrine disruptors acting via estrogen receptors α (ESR1) and β (ESR2) have been implicated in the etiology of hypospadias, a common congenital malformation of the male external genitalia. We determined the association of single nucleotide polymorphisms in ESR1 and ESR2 genes with hypospadias in a racially/ethnically diverse study population of California births. We investigated the relationship between hypospadias and 108 ESR1 and 36 ESR2 single nucleotide polymorphisms in 647 cases and 877 population based nonmalformed controls among infants born in selected California counties from 1990 to 2003. Subgroup analyses were performed by race/ethnicity (nonHispanic white and Hispanic subjects) and by hypospadias severity (mild to moderate and severe). Odds ratios for 33 of the 108 ESR1 single nucleotide polymorphisms had p values less than 0.05 (p = 0.05 to 0.007) for risk of hypospadias. However, none of the 36 ESR2 single nucleotide polymorphisms was significantly associated. In stratified analyses the association results were consistent by disease severity but different sets of single nucleotide polymorphisms were significantly associated with hypospadias in nonHispanic white and Hispanic subjects. Due to high linkage disequilibrium across the single nucleotide polymorphisms, haplotype analyses were conducted and identified 6 haplotype blocks in ESR1 gene that had haplotypes significantly associated with an increased risk of hypospadias (OR 1.3 to 1.8, p = 0.04 to 0.00001). Similar to single nucleotide polymorphism analysis, different ESR1 haplotypes were associated with risk of hypospadias in nonHispanic white and Hispanic subjects. No significant haplotype association was observed for ESR2. The data provide evidence that ESR1 single nucleotide polymorphisms and haplotypes influence the risk of hypospadias in white and Hispanic subjects, and warrant further examination in other study populations. Copyright © 2015 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
Genetic Factors Influencing Coagulation Factor XIII B-Subunit Contribute to Risk of Ischemic Stroke.

PubMed

Hanscombe, Ken B; Traylor, Matthew; Hysi, Pirro G; Bevan, Stephen; Dichgans, Martin; Rothwell, Peter M; Worrall, Bradford B; Seshadri, Sudha; Sudlow, Cathie; Williams, Frances M K; Markus, Hugh S; Lewis, Cathryn M

2015-08-01

Abnormal coagulation has been implicated in the pathogenesis of ischemic stroke, but how this association is mediated and whether it differs between ischemic stroke subtypes is unknown. We determined the shared genetic risk between 14 coagulation factors and ischemic stroke and its subtypes. Using genome-wide association study results for 14 coagulation factors from the population-based TwinsUK sample (N≈2000 for each factor), meta-analysis results from the METASTROKE consortium ischemic stroke genome-wide association study (12 389 cases, 62 004 controls), and genotype data for 9520 individuals from the WTCCC2 ischemic stroke study (3548 cases, 5972 controls-the largest METASTROKE subsample), we explored shared genetic risk for coagulation and stroke. We performed three analyses: (1) a test for excess concordance (or discordance) in single nucleotide polymorphism effect direction across coagulation and stroke, (2) an estimation of the joint effect of multiple coagulation-associated single nucleotide polymorphisms in stroke, and (3) an evaluation of common genetic risk between coagulation and stroke. One coagulation factor, factor XIII subunit B (FXIIIB), showed consistent effects in the concordance analysis, the estimation of polygenic risk, and the validation with genotype data, with associations specific to the cardioembolic stroke subtype. Effect directions for FXIIIB-associated single nucleotide polymorphisms were significantly discordant with cardioembolic disease (smallest P=5.7×10(-04)); the joint effect of FXIIIB-associated single nucleotide polymorphisms was significantly predictive of ischemic stroke (smallest P=1.8×10(-04)) and the cardioembolic subtype (smallest P=1.7×10(-04)). We found substantial negative genetic covariation between FXIIIB and ischemic stroke (rG=-0.71, P=0.01) and the cardioembolic subtype (rG=-0.80, P=0.03). Genetic markers associated with low FXIIIB levels increase risk of ischemic stroke cardioembolic subtype. © 2015 The Authors.
Compositions and methods for detecting single nucleotide polymorphisms

DOEpatents

Yeh, Hsin-Chih; Werner, James; Martinez, Jennifer S.

2016-11-22

Described herein are nucleic acid based probes and methods for discriminating and detecting single nucleotide variants in nucleic acid molecules (e.g., DNA). The methods include use of a pair of probes can be used to detect and identify polymorphisms, for example single nucleotide polymorphism in DNA. The pair of probes emit a different fluorescent wavelength of light depending on the association and alignment of the probes when hybridized to a target nucleic acid molecule. Each pair of probes is capable of discriminating at least two different nucleic acid molecules that differ by at least a single nucleotide difference. The methods can probes can be used, for example, for detection of DNA polymorphisms that are indicative of a particular disease or condition.
Lack of Association Between Toll-like Receptor 2 Polymorphisms (R753Q and A-16934T) and Atopic Dermatitis in Children from Thrace Region of Turkey

PubMed Central

Can, Ceren; Yazıcıoğlu, Mehtap; Gürkan, Hakan; Tozkır, Hilmi; Görgülü, Adnan; Süt, Necdet Hilmi

2017-01-01

Background: Atopic dermatitis is the most common chronic inflammatory skin disease. A complex interaction of both genetic and environmental factors is thought to contribute to the disease. Aims: To evaluate whether single nucleotide polymorphisms in the TLR2 gene c.2258C>T (R753Q) (rs5743708) and TLR2 c.-148+1614T>A (A-16934T) (rs4696480) (NM_0032643) are associated with atopic dermatitis in Turkish children. Study Design: Case-control study. Methods: The study was conducted on 70 Turkish children with atopic dermatitis aged 0.5-18 years. The clinical severity of atopic dermatitis was evaluated by the severity scoring of atopic dermatitis index. Serum total IgE levels, specific IgE antibodies to inhalant and food allergens were measured in both atopic dermatitis patients and controls, skin prick tests were done on 70 children with atopic dermatitis. Genotyping for TLR2 (R753Q and A-16934T) single nucleotide polymorphisms was performed in both atopic dermatitis patients and controls. Results: Cytosine-cytosine and cytosin-thymine genotype frequencies of the TLR2 R753Q single nucleotide polymorphism in the atopic dermatitis group were determined as being 98.6% and 1.4%, cytosine allele frequency for TLR2 R753Q single nucleotide polymorphism was determined as 99.29% and the thymine allele frequency was 0.71%, thymine-thymine, thymine-adenine, and adenine-adenine genotype frequencies of the TLR2 A-16934T single nucleotide polymorphism were 24.3%, 44.3%, and 31.4%. The thymine allele frequency for the TLR2 A-16934T single nucleotide polymorphism in the atopic dermatitis group was 46.43%, and the adenine allele frequency was 53.57%, respectively. There was not statistically significant difference between the groups for all investigated polymorphisms (p>0.05). For all single nucleotide polymorphisms studied, allelic distribution was analogous among atopic dermatitis patients and controls, and no significant statistical difference was observed. No homozygous carriers of the TLR2 R753Q single nucleotide polymorphism were found in the atopic dermatitis and control groups. Conclusion: The TLR2 (R753Q and A-16934T) single nucleotide polymorphisms are not associated with atopic dermatitis in a group of Turkish patients. PMID:28443596
Lack of Association Between Toll-like Receptor 2 Polymorphisms (R753Q and A-16934T) and Atopic Dermatitis in Children from Thrace Region of Turkey.

PubMed

Can, Ceren; Yazıcıoğlu, Mehtap; Gürkan, Hakan; Tozkır, Hilmi; Görgülü, Adnan; Süt, Necdet Hilmi

2017-05-05

Atopic dermatitis is the most common chronic inflammatory skin disease. A complex interaction of both genetic and environmental factors is thought to contribute to the disease. To evaluate whether single nucleotide polymorphisms in the TLR2 gene c.2258C>T (R753Q) (rs5743708) and TLR2 c.-148+1614T>A (A-16934T) (rs4696480) (NM_0032643) are associated with atopic dermatitis in Turkish children. Case-control study. The study was conducted on 70 Turkish children with atopic dermatitis aged 0.5-18 years. The clinical severity of atopic dermatitis was evaluated by the severity scoring of atopic dermatitis index. Serum total IgE levels, specific IgE antibodies to inhalant and food allergens were measured in both atopic dermatitis patients and controls, skin prick tests were done on 70 children with atopic dermatitis. Genotyping for TLR2 (R753Q and A-16934T) single nucleotide polymorphisms was performed in both atopic dermatitis patients and controls. Cytosine-cytosine and cytosin-thymine genotype frequencies of the TLR2 R753Q single nucleotide polymorphism in the atopic dermatitis group were determined as being 98.6% and 1.4%, cytosine allele frequency for TLR2 R753Q single nucleotide polymorphism was determined as 99.29% and the thymine allele frequency was 0.71%, thymine-thymine, thymine-adenine, and adenine-adenine genotype frequencies of the TLR2 A-16934T single nucleotide polymorphism were 24.3%, 44.3%, and 31.4%. The thymine allele frequency for the TLR2 A-16934T single nucleotide polymorphism in the atopic dermatitis group was 46.43%, and the adenine allele frequency was 53.57%, respectively. There was not statistically significant difference between the groups for all investigated polymorphisms (p>0.05). For all single nucleotide polymorphisms studied, allelic distribution was analogous among atopic dermatitis patients and controls, and no significant statistical difference was observed. No homozygous carriers of the TLR2 R753Q single nucleotide polymorphism were found in the atopic dermatitis and control groups. The TLR2 (R753Q and A-16934T) single nucleotide polymorphisms are not associated with atopic dermatitis in a group of Turkish patients.
Evidence of pervasive biologically functional secondary structures within the genomes of eukaryotic single-stranded DNA viruses.

PubMed

Muhire, Brejnev Muhizi; Golden, Michael; Murrell, Ben; Lefeuvre, Pierre; Lett, Jean-Michel; Gray, Alistair; Poon, Art Y F; Ngandu, Nobubelo Kwanele; Semegni, Yves; Tanov, Emil Pavlov; Monjane, Adérito Luis; Harkins, Gordon William; Varsani, Arvind; Shepherd, Dionne Natalie; Martin, Darren Patrick

2014-02-01

Single-stranded DNA (ssDNA) viruses have genomes that are potentially capable of forming complex secondary structures through Watson-Crick base pairing between their constituent nucleotides. A few of the structural elements formed by such base pairings are, in fact, known to have important functions during the replication of many ssDNA viruses. Unknown, however, are (i) whether numerous additional ssDNA virus genomic structural elements predicted to exist by computational DNA folding methods actually exist and (ii) whether those structures that do exist have any biological relevance. We therefore computationally inferred lists of the most evolutionarily conserved structures within a diverse selection of animal- and plant-infecting ssDNA viruses drawn from the families Circoviridae, Anelloviridae, Parvoviridae, Nanoviridae, and Geminiviridae and analyzed these for evidence of natural selection favoring the maintenance of these structures. While we find evidence that is consistent with purifying selection being stronger at nucleotide sites that are predicted to be base paired than at sites predicted to be unpaired, we also find strong associations between sites that are predicted to pair with one another and site pairs that are apparently coevolving in a complementary fashion. Collectively, these results indicate that natural selection actively preserves much of the pervasive secondary structure that is evident within eukaryote-infecting ssDNA virus genomes and, therefore, that much of this structure is biologically functional. Lastly, we provide examples of various highly conserved but completely uncharacterized structural elements that likely have important functions within some of the ssDNA virus genomes analyzed here.
Evidence of Pervasive Biologically Functional Secondary Structures within the Genomes of Eukaryotic Single-Stranded DNA Viruses

PubMed Central

Muhire, Brejnev Muhizi; Golden, Michael; Murrell, Ben; Lefeuvre, Pierre; Lett, Jean-Michel; Gray, Alistair; Poon, Art Y. F.; Ngandu, Nobubelo Kwanele; Semegni, Yves; Tanov, Emil Pavlov; Monjane, Adérito Luis; Harkins, Gordon William; Varsani, Arvind; Shepherd, Dionne Natalie

2014-01-01

Single-stranded DNA (ssDNA) viruses have genomes that are potentially capable of forming complex secondary structures through Watson-Crick base pairing between their constituent nucleotides. A few of the structural elements formed by such base pairings are, in fact, known to have important functions during the replication of many ssDNA viruses. Unknown, however, are (i) whether numerous additional ssDNA virus genomic structural elements predicted to exist by computational DNA folding methods actually exist and (ii) whether those structures that do exist have any biological relevance. We therefore computationally inferred lists of the most evolutionarily conserved structures within a diverse selection of animal- and plant-infecting ssDNA viruses drawn from the families Circoviridae, Anelloviridae, Parvoviridae, Nanoviridae, and Geminiviridae and analyzed these for evidence of natural selection favoring the maintenance of these structures. While we find evidence that is consistent with purifying selection being stronger at nucleotide sites that are predicted to be base paired than at sites predicted to be unpaired, we also find strong associations between sites that are predicted to pair with one another and site pairs that are apparently coevolving in a complementary fashion. Collectively, these results indicate that natural selection actively preserves much of the pervasive secondary structure that is evident within eukaryote-infecting ssDNA virus genomes and, therefore, that much of this structure is biologically functional. Lastly, we provide examples of various highly conserved but completely uncharacterized structural elements that likely have important functions within some of the ssDNA virus genomes analyzed here. PMID:24284329
Single-Nucleotide Polymorphisms Within the Thrombomodulin Gene (THBD) Predict Mortality in Patients With Graft-Versus-Host Disease.

PubMed

Rachakonda, Sivaramakrishna P; Penack, Olaf; Dietrich, Sascha; Blau, Olga; Blau, Igor Wolfgang; Radujkovic, Aleksandar; Isermann, Berend; Ho, Anthony D; Uharek, Lutz; Dreger, Peter; Kumar, Rajiv; Luft, Thomas

2014-10-20

Steroid-refractory graft-versus-host disease (GVHD) is a major and often fatal complication after allogeneic stem-cell transplantation (alloSCT). Although the pathophysiology of steroid refractoriness is not fully understood, evidence is accumulating that endothelial cell stress is involved, and endothelial thrombomodulin (THBD) plays a role in this process. Here we assess whether single-nucleotide polymorphisms (SNPs) within the THBD gene predict outcome after alloSCT. Seven SNPs within the THBD gene were studied (rs1962, rs1042579, rs1042580, rs3176123, rs3176124, rs3176126, and rs3176134) in a training cohort of 306 patients. The relevant genotypes were then validated in an independent cohort (n = 321). In the training cohort, an increased risk of nonrelapse mortality (NRM) was associated with three of seven SNPs tested: rs1962, rs1042579 (in linkage disequilibrium with rs3176123), and rs1042580. When patients were divided into risk groups (one v no high-risk SNP), a strong correlation with NRM was observed (hazard ratio [HR], 2.31; 95% CI, 1.36 to 3.95; P = .002). More specifically, NRM was predicted by THBD SNPs in patients who later developed GVHD (HR, 3.03; 95% CI, 1.61 to 5.68; P < .001) but not in patients without GVHD. In contrast, THBD SNPs did not predict incidence of acute GVHD. Multivariable analyses adjusting for clinical variables confirmed the independent effect of THBD SNPs on NRM. All findings could be reproduced in the validation cohort. THBD SNPs predict mortality of manifest GVHD but not the risk of acquiring GVHD, supporting the hypothesis that endothelial vulnerability contributes to GVHD refractoriness. © 2014 by American Society of Clinical Oncology.
Evaluation and identification of damaged single nucleotide polymorphisms in COL1A1 gene involved in osteoporosis

PubMed Central

Alsaif, Mohammed A.; Al Shammari, Sulaiman A.; Alhamdan, Adel A.

2012-01-01

Introduction Single-nucleotide polymorphisms (SNPs) are biomarkers for exploring the genetic basis of many complex human diseases. The prediction of SNPs is promising in modern genetic analysis but it is still a great challenge to identify the functional SNPs in a disease-related gene. The computational approach has overcome this challenge and an increase in the successful rate of genetic association studies and reduced cost of genotyping have been achieved. The objective of this study is to identify deleterious non-synonymous SNPs (nsSNPs) associated with the COL1A1 gene. Material and methods The SNPs were retrieved from the Single Nucleotide Polymorphism Database (dbSNP). Using I-Mutant, protein stability change was calculated. The potentially functional nsSNPs and their effect on proteins were predicted by PolyPhen and SIFT respectively. FASTSNP was used for estimation of risk score. Results Our analysis revealed 247 SNPs as non-synonymous, out of which 5 nsSNPs were found to be least stable by I-Mutant 2.0 with a DDG value of > –1.0. Four nsSNPs, namely rs17853657, rs17857117, rs57377812 and rs1059454, showed a highly deleterious tolerance index score of 0.00 with a change in their physicochemical properties by the SIFT server. Seven nsSNPs, namely rs1059454, rs8179178, rs17853657, rs17857117, rs72656340, rs72656344 and rs72656351, were found to be probably damaging with a PSIC score difference between 2.0 and 3.5 by the PolyPhen server. Three nsSNPs, namely rs1059454, rs17853657 and rs17857117, were found to be highly polymorphic with a risk score of 3-4 with a possible effect of non-conservative change and splicing regulation by FASTSNP. Conclusions Three nsSNPs, namely rs1059454, rs17853657 and rs17857117, are potential functional polymorphisms that are likely to have a functional impact on the COL1A1 gene. PMID:24273577
Cassava Brown Streak Virus (Potyviridae) Encodes a Putative Maf/HAM1 Pyrophosphatase Implicated in Reduction of Mutations and a P1 Proteinase That Suppresses RNA Silencing but Contains No HC-Pro ▿

PubMed Central

Mbanzibwa, Deusdedith R.; Tian, Yanping; Mukasa, Settumba B.; Valkonen, Jari P. T.

2009-01-01

The complete positive-sense single-stranded RNA genome of Cassava brown streak virus (CBSV; genus Ipomovirus; Potyviridae) was found to consist of 9,069 nucleotides and predicted to produce a polyprotein of 2,902 amino acids. It was lacking helper-component proteinase but contained a single P1 serine proteinase that strongly suppressed RNA silencing. Besides the exceptional structure of the 5′-proximal part of the genome, CBSV also contained a Maf/HAM1-like sequence (678 nucleotides, 226 amino acids) recombined between the replicase and coat protein domains in the 3′-proximal part of the genome, which is highly conserved in Potyviridae. HAM1 was flanked by consensus proteolytic cleavage sites for ipomovirus NIaPro cysteine proteinase. Homology of CBSV HAM1 with cellular Maf/HAM1 pyrophosphatases suggests that it may intercept noncanonical nucleoside triphosphates to reduce mutagenesis of viral RNA. PMID:19386713
[Meta-analysis on relationship between single nucleotide polymorphism of rs2231142 in ABCG2 gene and gout in East Asian population].

PubMed

Wu, Lei; He, Yao; Zhang, Di

2015-11-01

To systematically evaluate the association between single nucleotide polymorphism of rs2231142 genetic susceptibility and gout in East Asian population. The literature retrieval was conducted by using English databases (Medline, EMbase), Chinese databases (CNKI, Vip, Wanfang, SinaMed) and others to collect the published papers on the association between single nucleotide polymorphism of rs2231142 genetic susceptibility and gout by the end of December 2014. Meta-analysis was performed with software Stata 12.0. Nine studies were included. There were significant associations between increased risk of gout and single nucleotide polymorphism of rs2231142, the combined OR was 2.04 (95%CI: 1.82-2.28) for A allele and C allele, 1.97 (95%CI: 1.57-2.48) for CA and CC, 3.71 (95%CI: 3.07-4.47) for AA and CC. Sex and region specific subgroup analysis showed less heterogeneity. There is significant association between gout and single nucleotide polymorphism of rs2231142 in East Asian population, and A allele is a high risk gene for gout.
CNTNAP2 Is Significantly Associated With Speech Sound Disorder in the Chinese Han Population.

PubMed

Zhao, Yun-Jing; Wang, Yue-Ping; Yang, Wen-Zhu; Sun, Hong-Wei; Ma, Hong-Wei; Zhao, Ya-Ru

2015-11-01

Speech sound disorder is the most common communication disorder. Some investigations support the possibility that the CNTNAP2 gene might be involved in the pathogenesis of speech-related diseases. To investigate single-nucleotide polymorphisms in the CNTNAP2 gene, 300 unrelated speech sound disorder patients and 200 normal controls were included in the study. Five single-nucleotide polymorphisms were amplified and directly sequenced. Significant differences were found in the genotype (P = .0003) and allele (P = .0056) frequencies of rs2538976 between patients and controls. The excess frequency of the A allele in the patient group remained significant after Bonferroni correction (P = .0280). A significant haplotype association with rs2710102T/+rs17236239A/+2538976A/+2710117A (P = 4.10e-006) was identified. A neighboring single-nucleotide polymorphism, rs10608123, was found in complete linkage disequilibrium with rs2538976, and the genotypes exactly corresponded to each other. The authors propose that these CNTNAP2 variants increase the susceptibility to speech sound disorder. The single-nucleotide polymorphisms rs10608123 and rs2538976 may merge into one single-nucleotide polymorphism. © The Author(s) 2015.
Cacao single-nucleotide polymorphism (SNP) markers: A discovery strategy to identify SNPs for genotyping, genetic mapping and genome wide association studies (GWAS)

USDA-ARS?s Scientific Manuscript database

Single-nucleotide polymorphisms (SNPs) are the most common genetic markers in Theobroma cacao, occurring approximately once in every 200 nucleotides. SNPs, like microsatellites, are co-dominant and PCR-based, but they have several advantages over microsatellites. They are unambiguous, so that a SN...
Molecular cloning of an inducible serine esterase gene from human cytotoxic lymphocytes.

PubMed Central

Trapani, J A; Klein, J L; White, P C; Dupont, B

1988-01-01

A cDNA clone encoding a human serine esterase gene was isolated from a library constructed from poly(A)+ RNA of allogeneically stimulated, interleukin 2-expanded peripheral blood mononuclear cells. The clone, designated HSE26.1, represents a full-length copy of a 0.9-kilobase mRNA present in human cytotoxic cells but absent from a wide variety of noncytotoxic cell lines. Clone HSE26.1 contains an 892-base-pair sequence, including a single 741-base-pair open reading frame encoding a putative 247-residue polypeptide. The first 20 amino acids of the polypeptide form a leader sequence. The mature protein is predicted to have an unglycosylated Mr of approximately equal to 26,000 and contains a single potential site for N-linked glycosylation. The nucleotide and predicted amino acid sequences of clone HSE26.1 are homologous with all murine and human serine esterases cloned thus far but are most similar to mouse granzyme B (70% nucleotide and 68% amino acid identity). HSE26.1 protein is expressed weakly in unstimulated peripheral blood mononuclear cells but is strongly induced within 6-hr incubation in medium containing phytohemagglutinin. The data suggest that the protein encoded by HSE26.1 plays a role in cell-mediated cytotoxicity. Images PMID:3261871
Detection and quantitation of single nucleotide polymorphisms, DNA sequence variations, DNA mutations, DNA damage and DNA mismatches

DOEpatents

McCutchen-Maloney, Sandra L.

2002-01-01

DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dips sticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such at TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera which gives a decrease in signal.
ADEPT, a dynamic next generation sequencing data error-detection program with trimming

DOE Office of Scientific and Technical Information (OSTI.GOV)

Feng, Shihai; Lo, Chien-Chi; Li, Po-E

Illumina is the most widely used next generation sequencing technology and produces millions of short reads that contain errors. These sequencing errors constitute a major problem in applications such as de novo genome assembly, metagenomics analysis and single nucleotide polymorphism discovery. In this study, we present ADEPT, a dynamic error detection method, based on the quality scores of each nucleotide and its neighboring nucleotides, together with their positions within the read and compares this to the position-specific quality score distribution of all bases within the sequencing run. This method greatly improves upon other available methods in terms of the truemore » positive rate of error discovery without affecting the false positive rate, particularly within the middle of reads. We conclude that ADEPT is the only tool to date that dynamically assesses errors within reads by comparing position-specific and neighboring base quality scores with the distribution of quality scores for the dataset being analyzed. The result is a method that is less prone to position-dependent under-prediction, which is one of the most prominent issues in error prediction. The outcome is that ADEPT improves upon prior efforts in identifying true errors, primarily within the middle of reads, while reducing the false positive rate.« less
ADEPT, a dynamic next generation sequencing data error-detection program with trimming

DOE PAGES

Feng, Shihai; Lo, Chien-Chi; Li, Po-E; ...

2016-02-29

Illumina is the most widely used next generation sequencing technology and produces millions of short reads that contain errors. These sequencing errors constitute a major problem in applications such as de novo genome assembly, metagenomics analysis and single nucleotide polymorphism discovery. In this study, we present ADEPT, a dynamic error detection method, based on the quality scores of each nucleotide and its neighboring nucleotides, together with their positions within the read and compares this to the position-specific quality score distribution of all bases within the sequencing run. This method greatly improves upon other available methods in terms of the truemore » positive rate of error discovery without affecting the false positive rate, particularly within the middle of reads. We conclude that ADEPT is the only tool to date that dynamically assesses errors within reads by comparing position-specific and neighboring base quality scores with the distribution of quality scores for the dataset being analyzed. The result is a method that is less prone to position-dependent under-prediction, which is one of the most prominent issues in error prediction. The outcome is that ADEPT improves upon prior efforts in identifying true errors, primarily within the middle of reads, while reducing the false positive rate.« less
The association of single-nucleotide polymorphisms in the oxytocin receptor and G protein-coupled receptor kinase 6 (GRK6) genes with oxytocin dosing requirements and labor outcomes.

PubMed

Grotegut, Chad A; Ngan, Emily; Garrett, Melanie E; Miranda, Marie Lynn; Ashley-Koch, Allison E; Swamy, Geeta K

2017-09-01

Oxytocin is a potent uterotonic agent that is widely used for induction and augmentation of labor. Oxytocin has a narrow therapeutic index and the optimal dosing for any individual woman varies widely. The objective of this study was to determine whether genetic variation in the oxytocin receptor (OXTR) or in the gene encoding G protein-coupled receptor kinase 6 (GRK6), which regulates desensitization of the oxytocin receptor, could explain variation in oxytocin dosing and labor outcomes among women being induced near term. Pregnant women with a singleton gestation residing in Durham County, NC, were prospectively enrolled as part of the Healthy Pregnancy, Healthy Baby cohort study. Those women undergoing an induction of labor at 36 weeks or greater were genotyped for 18 haplotype-tagging single-nucleotide polymorphisms in OXTR and 7 haplotype-tagging single-nucleotide polymorphisms in GRK6 using TaqMan assays. Linear regression was used to examine the relationship between maternal genotype and maximal oxytocin infusion rate, total oxytocin dose received, and duration of labor. Logistic regression was used to test for the association of maternal genotype with mode of delivery. For each outcome, backward selection techniques were utilized to control for important confounding variables and additive genetic models were used. Race/ethnicity was included in all models because of differences in allele frequencies across populations, and Bonferroni correction for multiple testing was used. DNA was available from 482 women undergoing induction of labor at 36 weeks or greater. Eighteen haplotype-tagging single-nucleotide polymorphisms within OXTR and 7 haplotype-tagging single-nucleotide polymorphisms within GRK6 were examined. Five single-nucleotide polymorphisms in OXTR showed nominal significance with maximal infusion rate of oxytocin, and two single-nucleotide polymorphisms in OXTR were associated with total oxytocin dose received. One single-nucleotide polymorphism in OXTR and two single-nucleotide polymorphisms in GRK6 were associated with duration of labor, one of which met the multiple testing threshold (P = .0014, rs2731664 [GRK6], mean duration of labor, 17.7 hours vs 20.2 hours vs 23.5 hours for AA, AC, and CC genotypes, respectively). Three single-nucleotide polymorphisms, two in OXTR and one in GRK6, showed nominal significance with mode of delivery. Genetic variation in OXTR and GRK6 is associated with the amount of oxytocin required as well as the duration of labor and risk for cesarean delivery among women undergoing induction of labor near term. With further research, pharmacogenomic approaches may potentially be utilized to develop personalized treatment to improve safety and efficacy outcomes among women undergoing induction of labor. Copyright © 2017 Elsevier Inc. All rights reserved.
Polymerization of non-complementary RNA: systematic symmetric nucleotide exchanges mainly involving uracil produce mitochondrial RNA transcripts coding for cryptic overlapping genes.

PubMed

Seligmann, Hervé

2013-03-01

Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, self-hybridization increases with length, suggesting endoribonuclease-limited elongation. Blast detects stop codon depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA coding potential. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

Significant SNPs have limited prediction ability for thyroid cancer

PubMed Central

Guo, Shicheng; Wang, Yu-Long; Li, Yi; Jin, Li; Xiong, Momiao; Ji, Qing-Hai; Wang, Jiucun

2014-01-01

Recently, five thyroid cancer significantly associated genetic variants (rs965513, rs944289, rs116909374, rs966423, and rs2439302) have been discovered and validated in two independent GWAS and numerous case–control studies, which were conducted in different populations. We genotyped the above five single nucleotide polymorphisms (SNPs) in Han Chinese populations and performed thyroid cancer-risk predictions with nine machine learning methods. We found that four SNPs were significantly associated with thyroid cancer in Han Chinese population, while no polymorphism was observed for rs116909374. Small familial relative risks (1.02–1.05) and limited power to predict thyroid cancer (AUCs: 0.54–0.60) indicate limited clinical potential. Four significant SNPs have limited prediction ability for thyroid cancer. PMID:24591304
Single nucleotide polymorphisms in the mitochondrial displacement loop and outcome of esophageal squamous cell carcinoma.

PubMed

Zhang, Ruixing; Wang, Rui; Zhang, Fengbin; Wu, Chensi; Fan, Haiyan; Li, Yan; Wang, Cuiju; Guo, Zhanjun

2010-11-26

Accumulation of single nucleotide polymorphisms (SNPs) in the displacement loop (D-loop) of mitochondrial DNA (mtDNA) has been described for different types of cancers and might be associated with cancer risk and disease outcome. We used a population-based series of esophageal squamous cell carcinoma (ESCC) patients for investigating the prediction power of SNPs in mitochondrial D-loop. The D-loop region of mtDNA was sequenced for 60 ESCC patients recorded in the Fourth Hospital of Hebei Medical University between 2003 and 2004. The 5 year survival curve were calculated with the Kaplan-Meier method and compared by the log-rank test at each SNP site, a multivariate survival analysis was also performed with the Cox proportional hazards method. The SNP sites of nucleotides 16274G/A, 16278C/T and 16399A/G were identified for prediction of post-operational survival by the log-rank test. In an overall multivariate analysis, the 16278 and 16399 alleles were identified as independent predictors of ESCC outcome. The length of survival of patients with the minor allele 16278T genotype was significantly shorter than that of patients with 16278C at the 16278 site (relative risk, 3.001; 95% CI, 1.029 - 8.756; p = 0.044). The length of survival of patients with the minor allele 16399G genotype was significantly shorter than that of patients with the more frequent allele 16399A at the 16399 site in ESCC patients (relative risk, 3.483; 95% CI, 1.068 - 11.359; p = 0.039). Genetic polymorphisms in the D-loop are independent prognostic markers for patients with ESCC. Accordingly, the analysis of genetic polymorphisms in the mitochondrial D-loop can help identify patient subgroups at high risk of a poor disease outcome.
KBG syndrome involving a single-nucleotide duplication in ANKRD11

PubMed Central

Kleyner, Robert; Malcolmson, Janet; Tegay, David; Ward, Kenneth; Maughan, Annette; Maughan, Glenn; Nelson, Lesa; Wang, Kai; Robison, Reid; Lyon, Gholson J.

2016-01-01

KBG syndrome is a rare autosomal dominant genetic condition characterized by neurological involvement and distinct facial, hand, and skeletal features. More than 70 cases have been reported; however, it is likely that KBG syndrome is underdiagnosed because of lack of comprehensive characterization of the heterogeneous phenotypic features. We describe the clinical manifestations in a male currently 13 years of age, who exhibited symptoms including epilepsy, severe developmental delay, distinct facial features, and hand anomalies, without a positive genetic diagnosis. Subsequent exome sequencing identified a novel de novo heterozygous single base pair duplication (c.6015dupA) in ANKRD11, which was validated by Sanger sequencing. This single-nucleotide duplication is predicted to lead to a premature stop codon and loss of function in ANKRD11, thereby implicating it as contributing to the proband's symptoms and yielding a molecular diagnosis of KBG syndrome. Before molecular diagnosis, this syndrome was not recognized in the proband, as several key features of the disorder were mild and were not recognized by clinicians, further supporting the concept of variable expressivity in many disorders. Although a diagnosis of cerebral folate deficiency has also been given, its significance for the proband's condition remains uncertain. PMID:27900361
Infectious mononucleosis-linked HLA class I single nucleotide polymorphism is associated with multiple sclerosis.

PubMed

Jafari, Naghmeh; Broer, Linda; Hoppenbrouwers, Ilse A; van Duijn, Cornelia M; Hintzen, Rogier Q

2010-11-01

Multiple sclerosis is a presumed autoimmune disease associated with genetic and environmental risk factors such as infectious mononucleosis. Recent research has shown infectious mononucleosis to be associated with a specific HLA class I polymorphism. Our aim was to test if the infectious mononucleosis-linked HLA class I single nucleotide polymorphism (rs6457110) is also associated with multiple sclerosis. Genotyping of the HLA-A single nucleotide polymorphism rs6457110 using TaqMan was performed in 591 multiple sclerosis cases and 600 controls. The association of multiple sclerosis with the HLA-A single nucleotide polymorphism was tested using logistic regression adjusted for age, sex and HLA-DRB1*1501. HLA-A minor allele (A) is associated with multiple sclerosis (OR = 0.68; p = 4.08 × 10( -5)). After stratification for HLA-DRB1*1501 risk allele (T) carrier we showed a significant OR of 0.70 (p = 0.003) for HLA-A. HLA class I single nucleotide polymorphism rs6457110 is associated with infectious mononucleosis and multiple sclerosis, independent of the major class II allele, supporting the hypothesis that shared genetics may contribute to the association between infectious mononucleosis and multiple sclerosis.
Homology modeling and in silico prediction of Ulcerative colitis associated polymorphisms of NOD1.

PubMed

Majumdar, Ishani; Nagpal, Isha; Paul, Jaishree

2017-10-01

Cytosolic pattern recognition receptors play key roles in innate immune response. Nucleotide binding and oligomerisation domain containing protein 1 (NOD1) belonging to the Nod-like receptor C (NLRC) sub-family of Nod-like receptors (NLRs) is important for detection and clearance of intra-cellular Gram negative bacteria. NOD1 is involved in activation of pro-inflammatory pathways. Limited structural data is available for NOD1. Using different templates for each domain of NOD1, we determined the full-length homology model of NOD1. ADP binding amino acids within the nucleotide binding domain (NBD) of NOD1 were also predicted. Key residues in inter-domain interaction were identified by sequence comparison with Oryctolagus cuniculus NOD2, a related protein. Interactions between NBD and winged helix domain (WHD) were found to be conserved in NOD1. Functional and structural effect of single nucleotide polymorphisms within the NOD1 NBD domain associated with susceptibility risk to Ulcerative colitis (UC), an inflammatory disorder of the colon was evaluated by in silico studies. Mutations W219R and L349P were predicted to be damaging and disease associated by prediction programs SIFT, PolyPhen2, PANTHER, SNP&GO, PhD SNP and SNAP2. We further validated the effect of W219R and L349P mutation on NOD1 function in vitro. Elevated mRNA expression of pro-inflammatory cytokines IL8 and IL-1β was seen as compared to the wild type NOD1 in intestinal epithelial cell line HT29 when stimulated with NOD1 ligand. Thus, these mutations may indeed have a bearing on pathogenesis of inflammation during UC. Copyright © 2017 Elsevier Ltd. All rights reserved.
Electrical detection and quantification of single and mixed DNA nucleotides in suspension

NASA Astrophysics Data System (ADS)

Ahmad, Mahmoud Al; Panicker, Neena G.; Rizvi, Tahir A.; Mustafa, Farah

2016-09-01

High speed sequential identification of the building blocks of DNA, (deoxyribonucleotides or nucleotides for short) without labeling or processing in long reads of DNA is the need of the hour. This can be accomplished through exploiting their unique electrical properties. In this study, the four different types of nucleotides that constitute a DNA molecule were suspended in a buffer followed by performing several types of electrical measurements. These electrical parameters were then used to quantify the suspended DNA nucleotides. Thus, we present a purely electrical counting scheme based on the semiconductor theory that allows one to determine the number of nucleotides in a solution by measuring their capacitance-voltage dependency. The nucleotide count was observed to be similar to the multiplication of the corresponding dopant concentration and debye volume after de-embedding the buffer contribution. The presented approach allows for a fast and label-free quantification of single and mixed nucleotides in a solution.
Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.

PubMed

Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook

2014-11-01

As many structures of protein-DNA complexes have been known in the past years, several computational methods have been developed to predict DNA-binding sites in proteins. However, its inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure 66.3% and a MCC of 0.324. Another SVM model that uses both DNA and protein sequences achieved an accuracy of 69.6%, an F-measure of 69.6%, and a MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5% and a MCC of 0.329. Both in cross-validation and independent testing, the second SVM model that used both DNA and protein sequence data showed better performance than the first model that used DNA sequence data. To the best of our knowledge, this is the first attempt to predict protein-binding nucleotides in a given DNA sequence from the sequence data alone. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Compilation of mRNA Polyadenylation Signals in Arabidopsis Revealed a New Signal Element and Potential Secondary Structures1[w

PubMed Central

Loke, Johnny C.; Stahlberg, Eric A.; Strenski, David G.; Haas, Brian J.; Wood, Paul Chris; Li, Qingshun Quinn

2005-01-01

Using a novel program, SignalSleuth, and a database containing authenticated polyadenylation [poly(A)] sites, we analyzed the composition of mRNA poly(A) signals in Arabidopsis (Arabidopsis thaliana), and reevaluated previously described cis-elements within the 3′-untranslated (UTR) regions, including near upstream elements and far upstream elements. As predicted, there are absences of high-consensus signal patterns. The AAUAAA signal topped the near upstream elements patterns and was found within the predicted location to only approximately 10% of 3′-UTRs. More importantly, we identified a new set, named cleavage elements, of poly(A) signals flanking both sides of the cleavage site. These cis-elements were not previously revealed by conventional mutagenesis and are contemplated as a cluster of signals for cleavage site recognition. Moreover, a single-nucleotide profile scan on the 3′-UTR regions unveiled a distinct arrangement of alternate stretches of U and A nucleotides, which led to a prediction of the formation of secondary structures. Using an RNA secondary structure prediction program, mFold, we identified three main types of secondary structures on the sequences analyzed. Surprisingly, these observed secondary structures were all interrupted in previously constructed mutations in these regions. These results will enable us to revise the current model of plant poly(A) signals and to develop tools to predict 3′-ends for gene annotation. PMID:15965016
Single nucleotide polymorphism coverage and inference of N-acetyltransferase-2 acetylator phenotypes in wordwide population groups.

PubMed

Suarez-Kurtz, Guilherme; Fuchshuber-Moraes, Mateus; Struchiner, Claudio J; Parra, Esteban J

2016-08-01

Several algorithms have been proposed to reduce the genotyping effort and cost, while retaining the accuracy of N-acetyltransferase-2 (NAT2) phenotype prediction. Data from the 1000 Genomes (1KG) project and an admixed cohort of Black Brazilians were used to assess the accuracy of NAT2 phenotype prediction using algorithms based on paired single nucleotide polymorphisms (SNPs) (rs1041983 and rs1801280) or a tag SNP (rs1495741). NAT2 haplotypes comprising SNPs rs1801279, rs1041983, rs1801280, rs1799929, rs1799930, rs1208 and rs1799931 were assigned according to the arylamine N-acetyltransferases database. Contingency tables were used to visualize the agreement between the NAT2 acetylator phenotypes on the basis of these haplotypes versus phenotypes inferred by the prediction algorithms. The paired and tag SNP algorithms provided more than 96% agreement with the 7-SNP derived phenotypes in Europeans, East Asians, South Asians and Admixed Americans, but discordance of phenotype prediction occurred in 30.2 and 24.8% 1KG Africans and in 14.4 and 18.6% Black Brazilians, respectively. Paired SNP panel misclassification occurs in carriers of NATs haplotypes *13A (282T alone), *12B (282T and 803G), *6B (590A alone) and *14A (191A alone), whereas haplotype *14, defined by the 191A allele, is the major culprit of misclassification by the tag allele. Both the paired SNP and the tag SNP algorithms may be used, with economy of scale, to infer NAT2 acetylator phenotypes, including the ultra-slow phenotype, in European, East Asian, South Asian and American populations represented in the 1KG cohort. Both algorithms, however, perform poorly in populations of predominant African descent, including admixed African-Americans, African Caribbeans and Black Brazilians.
Prediction of functionally significant single nucleotide polymorphisms in PTEN tumor suppressor gene: An in silico approach.

PubMed

Khan, Imran; Ansari, Irfan A; Singh, Pratichi; Dass J, Febin Prabhu

2017-09-01

The phosphatase and tensin homolog (PTEN) gene plays a crucial role in signal transduction by negatively regulating the PI3K signaling pathway. It is the most frequent mutated gene in many human-related cancers. Considering its critical role, a functional analysis of missense mutations of PTEN gene was undertaken in this study. Thirty five nonsynonymous single nucleotide polymorphisms (nsSNPs) within the coding region of the PTEN gene were selected for our in silico investigation, and five nsSNPs (G129E, C124R, D252G, H61D, and R130G) were found to be deleterious based on combinatorial predictions of different computational tools. Moreover, molecular dynamics (MD) simulation was performed to investigate the conformational variation between native and all the five mutant PTEN proteins having predicted deleterious nsSNPs. The results of MD simulation of all mutant models illustrated variation in structural attributes such as root-mean-square deviation, root-mean-square fluctuation, radius of gyration, and total energy; which depicts the structural stability of PTEN protein. Furthermore, mutant PTEN protein structures also showed a significant variation in the solvent accessible surface area and hydrogen bond frequencies from the native PTEN structure. In conclusion, results of this study have established the deleterious effect of the all the five predicted nsSNPs on the PTEN protein structure. Thus, results of the current study can pave a new platform to sort out nsSNPs that can be undertaken for the confirmation of their phenotype and their correlation with diseased status in case of control studies. © 2016 International Union of Biochemistry and Molecular Biology, Inc.
Markers predicting response to bacillus Calmette-Guérin immunotherapy in high-risk bladder cancer patients: a systematic review.

PubMed

Zuiverloon, Tahlita C M; Nieuweboer, Annemieke J M; Vékony, Hedvig; Kirkels, Wim J; Bangma, Chris H; Zwarthoff, Ellen C

2012-01-01

Currently, bacillus Calmette-Guérin (BCG) intravesical instillations are standard treatment for patients with high-grade non-muscle-invasive bladder cancer; however, no markers are available to predict BCG response. To review the contemporary literature on markers predicting BCG response, to discuss the key issues concerning the identification of predictive markers, and to provide recommendations for further research studies. We performed a systematic review of the literature using PubMed and Embase databases in the period 1996-2010. The free-text search was extended by adding the following keywords: recurrence, progression, survival, molecular marker, prognosis, TP53, Ki-67, RB, fibronectin, immunotherapy, cytokine, interleukin, natural killer, macrophage, PMN, polymorphism, SNP, single nucleotide polymorphism, and gene signature. If thresholds for the detection of urinary interleukin (IL)-8, IL-18, and tumour necrosis factor apoptosis-inducing ligand levels are standardised, measurement of these cytokines holds promise in the assessment of BCG therapy outcome. Studies on immunohistochemical markers (ie, TP53, Ki-67, and retinoblastoma) display contradictory results, probably because of the small patient groups that were used and seem unsuitable to predict BCG response. Exploring combinations of protein levels might prove to be more helpful to establish the effect of BCG therapy. Single nucleotide polymorphisms, either in cytokines or in genes involved in DNA repair, need to be investigated in different ethnicities before their clinical relevance can be determined. Measurement of urinary IL-2 levels seems to be the most potent marker of all the clinical parameters reviewed. IL-2 levels are currently the most promising predictive markers of BCG response. For future studies focusing on new biomarkers, it is essential to make more use of new biomedical techniques such as microRNA profiling and genomewide sequencing. Copyright © 2011 European Association of Urology. Published by Elsevier B.V. All rights reserved.
Makeup of the genetic correlation between milk production traits using genome-wide single nucleotide polymorphism information.

PubMed

van Binsbergen, R; Veerkamp, R F; Calus, M P L

2012-04-01

The correlated responses between traits may differ depending on the makeup of genetic covariances, and may differ from the predictions of polygenic covariances. Therefore, the objective of the present study was to investigate the makeup of the genetic covariances between the well-studied traits: milk yield, fat yield, protein yield, and their percentages in more detail. Phenotypic records of 1,737 heifers of research farms in 4 different countries were used after homogenizing and adjusting for management effects. All cows had a genotype for 37,590 single nucleotide polymorphisms (SNP). A bayesian stochastic search variable selection model was used to estimate the SNP effects for each trait. About 0.5 to 1.0% of the SNP had a significant effect on 1 or more traits; however, the SNP without a significant effect explained most of the genetic variances and covariances of the traits. Single nucleotide polymorphism correlations differed from the polygenic correlations, but only 10 regions were found with an effect on multiple traits; in 1 of these regions the DGAT1 gene was previously reported with an effect on multiple traits. This region explained up to 41% of the variances of 4 traits and explained a major part of the correlation between fat yield and fat percentage and contributes to asymmetry in correlated response between fat yield and fat percentage. Overall, for the traits in this study, the infinitesimal model is expected to be sufficient for the estimation of the variances and covariances. Copyright © 2012 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Seqping: gene prediction pipeline for plant genomes using self-training gene models and transcriptomic data.

PubMed

Chan, Kuang-Lim; Rosli, Rozana; Tatarinova, Tatiana V; Hogan, Michael; Firdaus-Raih, Mohd; Low, Eng-Ti Leslie

2017-01-27

Gene prediction is one of the most important steps in the genome annotation process. A large number of software tools and pipelines developed by various computing techniques are available for gene prediction. However, these systems have yet to accurately predict all or even most of the protein-coding regions. Furthermore, none of the currently available gene-finders has a universal Hidden Markov Model (HMM) that can perform gene prediction for all organisms equally well in an automatic fashion. We present an automated gene prediction pipeline, Seqping that uses self-training HMM models and transcriptomic data. The pipeline processes the genome and transcriptome sequences of the target species using GlimmerHMM, SNAP, and AUGUSTUS pipelines, followed by MAKER2 program to combine predictions from the three tools in association with the transcriptomic evidence. Seqping generates species-specific HMMs that are able to offer unbiased gene predictions. The pipeline was evaluated using the Oryza sativa and Arabidopsis thaliana genomes. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis showed that the pipeline was able to identify at least 95% of BUSCO's plantae dataset. Our evaluation shows that Seqping was able to generate better gene predictions compared to three HMM-based programs (MAKER2, GlimmerHMM and AUGUSTUS) using their respective available HMMs. Seqping had the highest accuracy in rice (0.5648 for CDS, 0.4468 for exon, and 0.6695 nucleotide structure) and A. thaliana (0.5808 for CDS, 0.5955 for exon, and 0.8839 nucleotide structure). Seqping provides researchers a seamless pipeline to train species-specific HMMs and predict genes in newly sequenced or less-studied genomes. We conclude that the Seqping pipeline predictions are more accurate than gene predictions using the other three approaches with the default or available HMMs.
Functional Analysis of a Novel Genome-Wide Association Study Signal in SMAD3 That Confers Protection From Coronary Artery Disease.

PubMed

Turner, Adam W; Martinuk, Amy; Silva, Anada; Lau, Paulina; Nikpay, Majid; Eriksson, Per; Folkersen, Lasse; Perisic, Ljubica; Hedin, Ulf; Soubeyrand, Sebastien; McPherson, Ruth

2016-05-01

A recent genome-wide association study meta-analysis identified an intronic single nucleotide polymorphism in SMAD3, rs56062135C>T, the minor allele (T) which associates with protection from coronary artery disease. Relevant to atherosclerosis, SMAD3 is a key contributor to transforming growth factor-β pathway signaling. Here, we seek to identify ≥1 causal coronary artery disease-associated single nucleotide polymorphisms at the SMAD3 locus and characterize mechanisms whereby the risk allele(s) contribute to coronary artery disease risk. By genetic and epigenetic fine mapping, we identified a candidate causal single nucleotide polymorphism rs17293632C>T (D', 0.97; r(2), 0.94 with rs56062135) in intron 1 of SMAD3 with predicted functional effects. We show that the sequence encompassing rs17293632 acts as a strong enhancer in human arterial smooth muscle cells. The common allele (C) preserves an activator protein (AP)-1 site and enhancer function, whereas the protective (T) allele disrupts the AP-1 site and significantly reduces enhancer activity (P<0.001). Pharmacological inhibition of AP-1 activity upstream demonstrates that this allele-specific enhancer effect is AP-1 dependent (P<0.001). Chromatin immunoprecipitation experiments reveal binding of several AP-1 component proteins with preferential binding to the (C) allele. We show that rs17293632 is an expression quantitative trait locus for SMAD3 in blood and atherosclerotic plaque with reduced expression of SMAD3 in carriers of the protective allele. Finally, siRNA knockdown of SMAD3 in human arterial smooth muscle cells increases cell viability, consistent with an antiproliferative role. The coronary artery disease-associated rs17293632C>T single nucleotide polymorphism represents a novel functional cis-acting element at the SMAD3 locus. The protective (T) allele of rs17293632 disrupts a consensus AP-1 binding site in a SMAD3 intron 1 enhancer, reduces enhancer activity and SMAD3 expression, altering human arterial smooth muscle cell proliferation. © 2016 American Heart Association, Inc.
E6 and E7 Gene Polymorphisms in Human Papillomavirus Types-58 and 33 Identified in Southwest China

PubMed Central

Wen, Qiang; Wang, Tao; Mu, Xuemei; Chenzhang, Yuwei; Cao, Man

2017-01-01

Cancer of the cervix is associated with infection by certain types of human papillomavirus (HPV). The gene variants differ in immune responses and oncogenic potential. The E6 and E7 proteins encoded by high-risk HPV play a key role in cellular transformation. HPV-33 and HPV-58 types are highly prevalent among Chinese women. To study the gene intratypic variations, polymorphisms and positive selections of HPV-33 and HPV-58 E6/E7 in southwest China, HPV-33 (E6, E7: n = 216) and HPV-58 (E6, E7: n = 405) E6 and E7 genes were sequenced and compared to others submitted to GenBank. Phylogenetic trees were constructed by Maximum-likelihood and the Kimura 2-parameters methods by MEGA 6 (Molecular Evolutionary Genetics Analysis version 6.0). The diversity of secondary structure was analyzed by PSIPred software. The selection pressures acting on the E6/E7 genes were estimated by PAML 4.8 (Phylogenetic Analyses by Maximun Likelihood version4.8) software. The positive sites of HPV-33 and HPV-58 E6/E7 were contrasted by ClustalX 2.1. Among 216 HPV-33 E6 sequences, 8 single nucleotide mutations were observed with 6/8 non-synonymous and 2/8 synonymous mutations. The 216 HPV-33 E7 sequences showed 3 single nucleotide mutations that were non-synonymous. The 405 HPV-58 E6 sequences revealed 8 single nucleotide mutations with 4/8 non-synonymous and 4/8 synonymous mutations. Among 405 HPV-58 E7 sequences, 13 single nucleotide mutations were observed with 10/13 non-synonymous mutations and 3/13 synonymous mutations. The selective pressure analysis showed that all HPV-33 and 4/6 HPV-58 E6/E7 major non-synonymous mutations were sites of positive selection. All variations were observed in sites belonging to major histocompatibility complex and/or B-cell predicted epitopes. K93N and R145 (I/N) were observed in both HPV-33 and HPV-58 E6. PMID:28141822
Single Endemic Genotype of Measles Virus Continuously Circulating in China for at Least 16 Years

PubMed Central

Wang, Huiling; Zhu, Zhen; Ji, Yixin; Liu, Chunyu; Zhang, Xiaojie; Sun, Liwei; Zhou, Jianhui; Lu, Peishan; Hu, Ying; Feng, Daxing; Zhang, Zhenying; Wang, Changyin; Fang, Xueqiang; Zheng, Huanying; Liu, Leng; Sun, Xiaodong; Tang, Wei; Wang, Yan; Liu, Yan; Gao, Hui; Tian, Hong; Ma, Jiangtao; Gu, Suyi; Wang, Shuang; Feng, Yan; Bo, Fang; Liu, Jianfeng; Si, Yuan; Zhou, Shujie; Ma, Yuyan; Wu, Shengwei; Zhou, Shunde; Li, Fangcai; Ding, Zhengrong; Yang, Zhaohui; Rota, Paul A.; Featherstone, David; Jee, Youngmee; Bellini, William J.; Xu, Wenbo

2012-01-01

The incidence of measles in China from 1991 to 2008 was reviewed, and the nucleotide sequences from 1507 measles viruses (MeV) isolated during 1993 to 2008 were phylogenetically analyzed. The results showed that measles epidemics peaked approximately every 3 to 5 years with the range of measles cases detected between 56,850 and 140,048 per year. The Chinese MeV strains represented three genotypes; 1501 H1, 1 H2 and 5 A. Genotype H1 was the predominant genotype throughout China continuously circulating for at least 16 years. Genotype H1 sequences could be divided into two distinct clusters, H1a and H1b. A 4.2% average nucleotide divergence was found between the H1a and H1b clusters, and the nucleotide sequence and predicted amino acid homologies of H1a viruses were 92.3%–100% and 84.7%–100%, H1b were 97.1%–100% and 95.3%–100%, respectively. Viruses from both clusters were distributed throughout China with no apparent geographic restriction and multiple co-circulating lineages were present in many provinces. Cluster H1a and H1b viruses were co-circulating during 1993 to 2005, while no H1b viruses were detected after 2005 and the transmission of that cluster has presumably been interrupted. Analysis of the nucleotide and predicted amino acid changes in the N proteins of H1a and H1b viruses showed no evidence of selective pressure. This study investigated the genotype and cluster distribution of MeV in China over a 16-year period to establish a genetic baseline before MeV elimination in Western Pacific Region (WPR). Continuous and extensive MeV surveillance and the ability to quickly identify imported cases of measles will become more critical as measles elimination goals are achieved in China in the near future. This is the first report that a single endemic genotype of measles virus has been found to be continuously circulating in one country for at least 16 years. PMID:22532829
Single endemic genotype of measles virus continuously circulating in China for at least 16 years.

PubMed

Zhang, Yan; Xu, Songtao; Wang, Huiling; Zhu, Zhen; Ji, Yixin; Liu, Chunyu; Zhang, Xiaojie; Sun, Liwei; Zhou, Jianhui; Lu, Peishan; Hu, Ying; Feng, Daxing; Zhang, Zhenying; Wang, Changyin; Fang, Xueqiang; Zheng, Huanying; Liu, Leng; Sun, Xiaodong; Tang, Wei; Wang, Yan; Liu, Yan; Gao, Hui; Tian, Hong; Ma, Jiangtao; Gu, Suyi; Wang, Shuang; Feng, Yan; Bo, Fang; Liu, Jianfeng; Si, Yuan; Zhou, Shujie; Ma, Yuyan; Wu, Shengwei; Zhou, Shunde; Li, Fangcai; Ding, Zhengrong; Yang, Zhaohui; Rota, Paul A; Featherstone, David; Jee, Youngmee; Bellini, William J; Xu, Wenbo

2012-01-01

The incidence of measles in China from 1991 to 2008 was reviewed, and the nucleotide sequences from 1507 measles viruses (MeV) isolated during 1993 to 2008 were phylogenetically analyzed. The results showed that measles epidemics peaked approximately every 3 to 5 years with the range of measles cases detected between 56,850 and 140,048 per year. The Chinese MeV strains represented three genotypes; 1501 H1, 1 H2 and 5 A. Genotype H1 was the predominant genotype throughout China continuously circulating for at least 16 years. Genotype H1 sequences could be divided into two distinct clusters, H1a and H1b. A 4.2% average nucleotide divergence was found between the H1a and H1b clusters, and the nucleotide sequence and predicted amino acid homologies of H1a viruses were 92.3%-100% and 84.7%-100%, H1b were 97.1%-100% and 95.3%-100%, respectively. Viruses from both clusters were distributed throughout China with no apparent geographic restriction and multiple co-circulating lineages were present in many provinces. Cluster H1a and H1b viruses were co-circulating during 1993 to 2005, while no H1b viruses were detected after 2005 and the transmission of that cluster has presumably been interrupted. Analysis of the nucleotide and predicted amino acid changes in the N proteins of H1a and H1b viruses showed no evidence of selective pressure. This study investigated the genotype and cluster distribution of MeV in China over a 16-year period to establish a genetic baseline before MeV elimination in Western Pacific Region (WPR). Continuous and extensive MeV surveillance and the ability to quickly identify imported cases of measles will become more critical as measles elimination goals are achieved in China in the near future. This is the first report that a single endemic genotype of measles virus has been found to be continuously circulating in one country for at least 16 years.
Resolving incomplete single nucleotide polymorphism tagging of HLA-DQ2.2 for coeliac disease genotyping using digital droplet PCR.

PubMed

Hardy, M Y; Ontiveros, N; Varney, M D; Tye-Din, J A

2018-04-01

A hallmark of coeliac disease (CD) is the exceptionally strong genetic association with HLA-DQ2.5, DQ8, and DQ2.2. HLA typing provides information on CD risk important to both clinicians and researchers. A method that enables simple and fast detection of all CD risk genotypes is particularly desirable for the study of large populations. Single nucleotide polymorphism (SNP)-based HLA typing can detect the CD risk genotypes by detecting a combination of six SNPs but this approach can struggle to resolve HLA-DQ2.2, seen in 4% of European CD patients, because of the low resolution of one negatively predicting SNP. We sought to optimise SNP-based HLA typing by harnessing the additional resolution of digital droplet PCR to resolve HLA-DQ2.2. Here we test this two-step approach in an unselected sample of Mexican DNA and compare its accuracy to DNA typed using traditional exon detection. The addition of digital droplet PCR for samples requiring negative prediction of HLA-DQ2.2 enabled HLA-DQ2.2 to be accurately typed. This technique is a simple addition to a SNP-based typing strategy and enables comprehensive definition of all at-risk HLA genotypes in CD in a timely and cost-effective manner. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
PCR/LDR/capillary electrophoresis for detection of single-nucleotide differences between fetal and maternal DNA in maternal plasma.

PubMed

Yi, Ping; Chen, Zhuqin; Zhao, Yan; Guo, Jianxin; Fu, Huabin; Zhou, Yuanguo; Yu, Lili; Li, Li

2009-03-01

The discovery of fetal DNA in maternal plasma has opened up an approach for noninvasive diagnosis. We have now assessed the possibility of detecting single-nucleotide differences between fetal and maternal DNA in maternal plasma by polymerase chain reaction (PCR)/ligase detection reaction((LDR)/capillary electrophoresis. PCR/LDR/capillary electrophoresis was applied to detect the genotype of c.454-397T>gene (ESR1) from experimental DNA models of maternal plasma at different sensitivity levels and 13 maternal plasma samples.alphaC in estrogen receptor. (1) Our results demonstrated that the technique could discriminate low abundance single-nucleotide mutation with a mutant/normal allele ratio up to 1:10 000. (2) Examination of ESR1 c.454-397T>C genotypes by using the method of restriction fragment length analysis was performed in 25 pregnant women, of whom 13 pregnant women had homozygous genotypes. The c.454-397T>C genotypes of paternally inherited fetal DNA in maternal plasma of these 13 women were detected by PCR/LDR/capillary electrophoresis, which were accordant with the results of umbilical cord blood. PCR/LDR/capillary electrophoresis has very high sensitivity to distinguish low abundance single nucleotide differences and can discriminate point mutations and single-nucleotide polymorphisms(SNPs) of paternally inherited fetal DNA in maternal plasma.
The Role of Constitutional Copy Number Variants in Breast Cancer

PubMed Central

Walker, Logan C.; Wiggins, George A.R.; Pearson, John F.

2015-01-01

Constitutional copy number variants (CNVs) include inherited and de novo deviations from a diploid state at a defined genomic region. These variants contribute significantly to genetic variation and disease in humans, including breast cancer susceptibility. Identification of genetic risk factors for breast cancer in recent years has been dominated by the use of genome-wide technologies, such as single nucleotide polymorphism (SNP)-arrays, with a significant focus on single nucleotide variants. To date, these large datasets have been underutilised for generating genome-wide CNV profiles despite offering a massive resource for assessing the contribution of these structural variants to breast cancer risk. Technical challenges remain in determining the location and distribution of CNVs across the human genome due to the accuracy of computational prediction algorithms and resolution of the array data. Moreover, better methods are required for interpreting the functional effect of newly discovered CNVs. In this review, we explore current and future application of SNP array technology to assess rare and common CNVs in association with breast cancer risk in humans. PMID:27600231

Aspergillus and Penicillium identification using DNA sequences: Barcode or MLST?

USDA-ARS?s Scientific Manuscript database

Current methods in DNA technology can detect single nucleotide polymorphisms with measurable accuracy using several different approaches appropriate for different uses. If there are even single nucleotide differences that are invariant markers of the species, we can accomplish identification through...
Nucleotide cleaving agents and method

DOEpatents

Que, Jr., Lawrence; Hanson, Richard S.; Schnaith, Leah M. T.

2000-01-01

The present invention provides a unique series of nucleotide cleaving agents and a method for cleaving a nucleotide sequence, whether single-stranded or double-stranded DNA or RNA, using and a cationic metal complex having at least one polydentate ligand to cleave the nucleotide sequence phosphate backbone to yield a hydroxyl end and a phosphate end.
Terminal structures of West Nile virus genomic RNA and their interactions with viral NS5 protein

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dong Hongping; Zhang Bo; Shi Peiyong

2008-11-10

Genome cyclization is essential for flavivirus replication. We used RNases to probe the structures formed by the 5'-terminal 190 nucleotides and the 3'-terminal 111 nucleotides of the West Nile virus (WNV) genomic RNA. When analyzed individually, the two RNAs adopt stem-loop structures as predicted by the thermodynamic-folding program. However, when mixed together, the two RNAs form a duplex that is mediated through base-pairings of two sets of RNA elements (5'CS/3'CSI and 5'UAR/3'UAR). Formation of the RNA duplex facilitates a conformational change that leaves the 3'-terminal nucleotides of the genome (position - 8 to - 16) to be single-stranded. Viral NS5more » binds specifically to the 5'-terminal stem-loop (SL1) of the genomic RNA. The 5'SL1 RNA structure is essential for WNV replication. The study has provided further evidence to suggest that flavivirus genome cyclization and NS5/5'SL1 RNA interaction facilitate NS5 binding to the 3' end of the genome for the initiation of viral minus-strand RNA synthesis.« less
Geographically Distinct and Domain-Specific Sequence Variations in the Alleles of Rice Blast Resistance Gene Pib

PubMed Central

Vasudevan, Kumar; Vera Cruz, Casiana M.; Gruissem, Wilhelm; Bhullar, Navreet K.

2016-01-01

Rice blast is caused by Magnaporthe oryzae, which is the most destructive fungal pathogen affecting rice growing regions worldwide. The rice blast resistance gene Pib confers broad-spectrum resistance against Southeast Asian M. oryzae races. We investigated the allelic diversity of Pib in rice germplasm originating from 12 major rice growing countries. Twenty-five new Pib alleles were identified that have unique single nucleotide polymorphisms (SNPs), insertions and/or deletions, in addition to the polymorphic nucleotides that are shared between the different alleles. These partially or completely shared polymorphic nucleotides indicate frequent sequence exchange events between the Pib alleles. In some of the new Pib alleles, nucleotide diversity is high in the LRR domain, whereas, in others it is distributed among the NB-ARC and LRR domains. Most of the polymorphic amino acids in LRR and NB-ARC2 domains are predicted as solvent-exposed. Several of the alleles and the unique SNPs are country specific, suggesting a diversifying selection of alleles in various geographical locations in response to the locally prevalent M. oryzae population. Together, the new Pib alleles are an important genetic resource for rice blast resistance breeding programs and provide new information on rice-M. oryzae interactions at the molecular level. PMID:27446145
Genetic characterization of Measles Viruses in China, 2004

PubMed Central

Zhang, Yan; Ji, Yixin; Jiang, Xiaohong; Xu, Songtao; Zhu, Zhen; Zheng, Lei; He, Jilan; Ling, Hua; Wang, Yan; Liu, Yang; Du, Wen; Yang, Xuelei; Mao, Naiying; Xu, Wenbo

2008-01-01

Genetic characterization of wild-type measles virus was studied using nucleotide sequencing of the C-terminal region of the N protein gene and phylogenetic analysis on 59 isolates from 16 provinces of China in 2004. The results showed that all of the isolates belonged to genotype H1. 51 isolates were belonged to cluster 1 and 8 isolates were cluster 2 and Viruses from both clusters were distributed throughout China without distinct geographic pattern. The nucleotide sequence and predicted amino acid homologies of the 59 H1 strains were 96.5%–100% and 95.7%–100%, respectively. The report showed that the transmission pattern of genotype H1 viruses in China in 2004 was consistent with ongoing endemic transmission of multiple lineages of a single, endemic genotype. Multiple transmission pathways leaded to multiple lineages within endemic genotype. PMID:18928575
Molecular characterization and expression of the M6 gene of grass carp hemorrhage virus (GCHV), an aquareovirus.

PubMed

Qiu, T; Lu, R H; Zhang, J; Zhu, Z Y

2001-07-01

The complete nucleotide sequence of M6 gene of grass carp hemorrhage virus (GCHV) was determined. It is 2039 nucleotides in length and contains a single large open reading frame that could encode a protein of 648 amino acids with predicted molecular mass of 68.7 kDa. Amino acid sequence comparison revealed that the protein encoded by GCHV M6 is closely related to the protein mu1 of mammalian reovirus. The M6 gene, encoding the major outer-capsid protein, was expressed using the pET fusion protein vector in Escherichia coli and detected by Western blotting using chicken anti-GCHV immunoglobulin (IgY). The result indicates that the protein encoded by M6 may share a putative Asn-42-Pro-43 proteolytic cleavage site with mu1.
An Engineered Kinetic Amplification Mechanism for Single Nucleotide Variant Discrimination by DNA Hybridization Probes.

PubMed

Chen, Sherry Xi; Seelig, Georg

2016-04-20

Even a single-nucleotide difference between the sequences of two otherwise identical biological nucleic acids can have dramatic functional consequences. Here, we use model-guided reaction pathway engineering to quantitatively improve the performance of selective hybridization probes in recognizing single nucleotide variants (SNVs). Specifically, we build a detection system that combines discrimination by competition with DNA strand displacement-based catalytic amplification. We show, both mathematically and experimentally, that the single nucleotide selectivity of such a system in binding to single-stranded DNA and RNA is quadratically better than discrimination due to competitive hybridization alone. As an additional benefit the integrated circuit inherits the property of amplification and provides at least 10-fold better sensitivity than standard hybridization probes. Moreover, we demonstrate how the detection mechanism can be tuned such that the detection reaction is agnostic to the position of the SNV within the target sequence. in contrast, prior strand displacement-based probes designed for kinetic discrimination are highly sensitive to position effects. We apply our system to reliably discriminate between different members of the let-7 microRNA family that differ in only a single base position. Our results demonstrate the power of systematic reaction network design to quantitatively improve biotechnology.
Association of single nucleotide polymorphism in CD28(C/T-I3 + 17) and CD40 (C/T-1) genes with the Graves' disease.

PubMed

Mustafa, Saima; Fatima, Hira; Fatima, Sadia; Khosa, Tafheem; Akbar, Atif; Shaikh, Rehan Sadiq; Iqbal, Furhan

2018-01-01

To find out a correlation between the single nucleotide polymorphisms in cluster of differentiation 28 and cluster of differentiation 40 genes with Graves' disease, if any. This case-control study was conducted at the Multan Institute of Nuclear Medicine and Radiotherapy, Multan, Pakistan, and comprised blood samples of Graves' disease patients and controls. Various risk factors were also correlated either with the genotype at each single-nucleotide polymorphism or with various combinations of genotypes studied during present investigation. Of the 160 samples, there were 80(50%) each from patients and controls. Risk factor analysis revealed that gender (p=0.008), marital status (p<0.001), education (p<0.001), smoking (p<0.001), tri-iodothyronine (P <0.001), thyroxin (p<0.001) and thyroid-stimulating hormone (p<0.000) levels in blood were associated with Graves' disease. Both single-nucleotide polymorphisms in both genes were not associated with Graves' disease, either individually or in any combined form.
MicroRNA-196a2 Biomarker and Targetome Network Analysis in Solid Tumors.

PubMed

Toraih, Eman A; Fawzy, Manal S; Mohammed, Eman A; Hussein, Mohammad H; El-Labban, Mohamad M

2016-12-01

MicroRNAs (miRNAs) have been linked to cancer development and progression. The molecular mechanisms underlying the genetic associations of the miRNA single nucleotide polymorphism with cancer vary by cancer site. As there are no previous studies on the miR-196a2 variant or expression in any type of cancer among our population, we aimed to determine the expression profile of mature miR-196a2 in various types of solid tumors and to analyze the impact of its polymorphism (rs11614913; C/T) on the expression levels. The study included 230 cancer patients (including 17 types of cancer), 26 patients with pre-cancer lesions, and 100 unrelated controls. Archived formalin-fixed, paraffin-embedded specimens (n = 197) were available for both miRNA expression analysis and single nucleotide polymorphism identification. Venous blood was collected from 59 histologically confirmed sporadic cancer patients and the study controls for single nucleotide polymorphism identification. Real-time polymerase chain reaction analysis was performed for allelic discrimination and relative quantification of miR-196a2 in the study samples. In silico target gene prediction and network analysis was performed. We found that individuals with the T variant were associated with cancer risk under all genetic association models, especially in colorectal, esophageal, skin, lung, thyroid, and renal cancer. Overall and stratified analysis showed miR-196a2 over-expression in most of the current malignant tumor samples relative to their corresponding cancer-free tissues. Carriers of the C allele had significantly higher expression levels of miR-196a2. Correlation with the clinicopathological features of cancer showed organ-specific effects. Gene enrichment analysis of predicted and validated targets speculated the putative role of miR-196a2 in cancer-associated biology. We highlighted cancer-type specific expression profiles of miR-196a2, which was correlated with the clinicopathological features in various types of cancer. Taken together, our results suggest that the miRNA signature could have promising diagnostic and prognostic significance.
Understanding pathogenic single-nucleotide polymorphisms in multidomain proteins – studies of isolated domains are not enough

PubMed Central

Randles, Lucy G; Dawes, Gwen J S; Wensley, Beth G; Steward, Annette; Nickson, Adrian A; Clarke, Jane

2013-01-01

Studying the effects of pathogenic mutations is more complex in multidomain proteins when compared with single domains: mutations occurring at domain boundaries may have a large effect on a neighbouring domain that will not be detected in a single-domain system. To demonstrate this, we present a study that utilizes well-characterized model protein domains from human spectrin to investigate the effect of disease-and non-disease-causing single point mutations occurring at the boundaries of human spectrin repeats. Our results show that mutations in the single domains have no clear correlation with stability and disease; however, when studied in a tandem model system, the disease-causing mutations are shown to disrupt stabilizing interactions that exist between domains. This results in a much larger decrease in stability than would otherwise have been predicted, and demonstrates the importance of studying such mutations in the correct protein context. PMID:23241237
Real-time single-molecule electronic DNA sequencing by synthesis using polymer-tagged nucleotides on a nanopore array

PubMed Central

Fuller, Carl W.; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Bibillo, Arek; Stranges, P. Benjamin; Dorwart, Michael; Tao, Chuanjuan; Li, Zengmin; Guo, Wenjing; Shi, Shundi; Korenblum, Daniel; Trans, Andrew; Aguirre, Anne; Liu, Edward; Harada, Eric T.; Pollard, James; Bhat, Ashwini; Cech, Cynthia; Yang, Alexander; Arnold, Cleoma; Palla, Mirkó; Hovis, Jennifer; Chen, Roger; Morozova, Irina; Kalachikov, Sergey; Russo, James J.; Kasianowicz, John J.; Davis, Randy; Roever, Stefan; Church, George M.; Ju, Jingyue

2016-01-01

DNA sequencing by synthesis (SBS) offers a robust platform to decipher nucleic acid sequences. Recently, we reported a single-molecule nanopore-based SBS strategy that accurately distinguishes four bases by electronically detecting and differentiating four different polymer tags attached to the 5′-phosphate of the nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. Further developing this approach, we report here the use of nucleotides tagged at the terminal phosphate with oligonucleotide-based polymers to perform nanopore SBS on an α-hemolysin nanopore array platform. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primer/template and polymerase, the tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. The use of these polymer-tagged nucleotides, combined with polymerase tethering to nanopores and multiplexed nanopore sensors, should lead to new high-throughput sequencing methods. PMID:27091962
Incorporation of causative quantitative trait nucleotides in single-step GBLUP.

PubMed

Fragomeni, Breno O; Lourenco, Daniela A L; Masuda, Yutaka; Legarra, Andres; Misztal, Ignacy

2017-07-26

Much effort is put into identifying causative quantitative trait nucleotides (QTN) in animal breeding, empowered by the availability of dense single nucleotide polymorphism (SNP) information. Genomic selection using traditional SNP information is easily implemented for any number of genotyped individuals using single-step genomic best linear unbiased predictor (ssGBLUP) with the algorithm for proven and young (APY). Our aim was to investigate whether ssGBLUP is useful for genomic prediction when some or all QTN are known. Simulations included 180,000 animals across 11 generations. Phenotypes were available for all animals in generations 6 to 10. Genotypes for 60,000 SNPs across 10 chromosomes were available for 29,000 individuals. The genetic variance was fully accounted for by 100 or 1000 biallelic QTN. Raw genomic relationship matrices (GRM) were computed from (a) unweighted SNPs, (b) unweighted SNPs and causative QTN, (c) SNPs and causative QTN weighted with results obtained with genome-wide association studies, (d) unweighted SNPs and causative QTN with simulated weights, (e) only unweighted causative QTN, (f-h) as in (b-d) but using only the top 10% causative QTN, and (i) using only causative QTN with simulated weight. Predictions were computed by pedigree-based BLUP (PBLUP) and ssGBLUP. Raw GRM were blended with 1 or 5% of the numerator relationship matrix, or 1% of the identity matrix. Inverses of GRM were obtained directly or with APY. Accuracy of breeding values for 5000 genotyped animals in the last generation with PBLUP was 0.32, and for ssGBLUP it increased to 0.49 with an unweighted GRM, 0.53 after adding unweighted QTN, 0.63 when QTN weights were estimated, and 0.89 when QTN weights were based on true effects known from the simulation. When the GRM was constructed from causative QTN only, accuracy was 0.95 and 0.99 with blending at 5 and 1%, respectively. Accuracies simulating 1000 QTN were generally lower, with a similar trend. Accuracies using the APY inverse were equal or higher than those with a regular inverse. Single-step GBLUP can account for causative QTN via a weighted GRM. Accuracy gains are maximum when variances of causative QTN are known and blending is at 1%.
Estimating Additive and Non-Additive Genetic Variances and Predicting Genetic Merits Using Genome-Wide Dense Single Nucleotide Polymorphism Markers

PubMed Central

Su, Guosheng; Christensen, Ole F.; Ostersen, Tage; Henryon, Mark; Lund, Mogens S.

2012-01-01

Non-additive genetic variation is usually ignored when genome-wide markers are used to study the genetic architecture and genomic prediction of complex traits in human, wild life, model organisms or farm animals. However, non-additive genetic effects may have an important contribution to total genetic variation of complex traits. This study presented a genomic BLUP model including additive and non-additive genetic effects, in which additive and non-additive genetic relation matrices were constructed from information of genome-wide dense single nucleotide polymorphism (SNP) markers. In addition, this study for the first time proposed a method to construct dominance relationship matrix using SNP markers and demonstrated it in detail. The proposed model was implemented to investigate the amounts of additive genetic, dominance and epistatic variations, and assessed the accuracy and unbiasedness of genomic predictions for daily gain in pigs. In the analysis of daily gain, four linear models were used: 1) a simple additive genetic model (MA), 2) a model including both additive and additive by additive epistatic genetic effects (MAE), 3) a model including both additive and dominance genetic effects (MAD), and 4) a full model including all three genetic components (MAED). Estimates of narrow-sense heritability were 0.397, 0.373, 0.379 and 0.357 for models MA, MAE, MAD and MAED, respectively. Estimated dominance variance and additive by additive epistatic variance accounted for 5.6% and 9.5% of the total phenotypic variance, respectively. Based on model MAED, the estimate of broad-sense heritability was 0.506. Reliabilities of genomic predicted breeding values for the animals without performance records were 28.5%, 28.8%, 29.2% and 29.5% for models MA, MAE, MAD and MAED, respectively. In addition, models including non-additive genetic effects improved unbiasedness of genomic predictions. PMID:23028912
Effective detection of human leukocyte antigen risk alleles in celiac disease using tag single nucleotide polymorphisms.

PubMed

Monsuur, Alienke J; de Bakker, Paul I W; Zhernakova, Alexandra; Pinto, Dalila; Verduijn, Willem; Romanos, Jihane; Auricchio, Renata; Lopez, Ana; van Heel, David A; Crusius, J Bart A; Wijmenga, Cisca

2008-05-28

The HLA genes, located in the MHC region on chromosome 6p21.3, play an important role in many autoimmune disorders, such as celiac disease (CD), type 1 diabetes (T1D), rheumatoid arthritis, multiple sclerosis, psoriasis and others. Known HLA variants that confer risk to CD, for example, include DQA1*05/DQB1*02 (DQ2.5) and DQA1*03/DQB1*0302 (DQ8). To diagnose the majority of CD patients and to study disease susceptibility and progression, typing these strongly associated HLA risk factors is of utmost importance. However, current genotyping methods for HLA risk factors involve many reactions, and are complicated and expensive. We sought a simple experimental approach using tagging SNPs that predict the CD-associated HLA risk factors. Our tagging approach exploits linkage disequilibrium between single nucleotide polymorphism (SNPs) and the CD-associated HLA risk factors DQ2.5 and DQ8 that indicate direct risk, and DQA1*0201/DQB1*0202 (DQ2.2) and DQA1*0505/DQB1*0301 (DQ7) that attribute to the risk of DQ2.5 to CD. To evaluate the predictive power of this approach, we performed an empirical comparison of the predicted DQ types, based on these six tag SNPs, with those executed with current validated laboratory typing methods of the HLA-DQA1 and -DQB1 genes in three large cohorts. The results were validated in three European celiac populations. Using this method, only six SNPs were needed to predict the risk types carried by >95% of CD patients. We determined that for this tagging approach the sensitivity was >0.991, specificity >0.996 and the predictive value >0.948. Our results show that this tag SNP method is very accurate and provides an excellent basis for population screening for CD. This method is broadly applicable in European populations.
Detecting Single-Nucleotides by Tunneling Current Measurements at Sub-MHz Temporal Resolution.

PubMed

Morikawa, Takanori; Yokota, Kazumichi; Tanimoto, Sachie; Tsutsui, Makusu; Taniguchi, Masateru

2017-04-18

Label-free detection of single-nucleotides was performed by fast tunneling current measurements in a polar solvent at 1 MHz sampling rate using SiO₂-protected Au nanoprobes. Short current spikes were observed, suggestive of trapping/detrapping of individual nucleotides between the nanoelectrodes. The fall and rise features of the electrical signatures indicated signal retardation by capacitance effects with a time constant of about 10 microseconds. The high temporal resolution revealed current fluctuations, reflecting the molecular conformation degrees of freedom in the electrode gap. The method presented in this work may enable direct characterizations of dynamic changes in single-molecule conformations in an electrode gap in liquid.
[Single nucleotide polymorphism and its application in allogeneic hematopoietic stem cell transplantation--review].

PubMed

Li, Su-Xia

2004-12-01

Single nucleotide polymorphism (SNP) is the third genetic marker after restriction fragment length polymorphism (RFLP) and short tandem repeat. It represents the most density genetic variability in the human genome and has been widely used in gene location, cloning, and research of heredity variation, as well as parenthood identification in forensic medicine. As steady heredity polymorphism, single nucleotide polymorphism is becoming the focus of attention in monitoring chimerism and minimal residual disease in the patients after allogeneic hematopoietic stem cell transplantation. The article reviews SNP heredity characterization, analysis techniques and its applications in allogeneic stem cell transplantation and other fields.
Detection and interrogation of biomolecules via nanoscale probes: From fundamental physics to DNA sequencing

NASA Astrophysics Data System (ADS)

Zwolak, Michael

2013-03-01

A rapid and low-cost method to sequence DNA would revolutionize personalized medicine, where genetic information is used to diagnose, treat, and prevent diseases. There is a longstanding interest in nanopores as a platform for rapid interrogation of single DNA molecules. I will discuss a sequencing protocol based on the measurement of transverse electronic currents during the translocation of single-stranded DNA through nanopores. Using molecular dynamics simulations coupled to quantum mechanical calculations of the tunneling current, I will show that the DNA nucleotides are predicted to have distinguishable electronic signatures in experimentally realizable systems. Several recent experiments support our theoretical predictions. In addition to their possible impact in medicine and biology, the above methods offer ideal test beds to study open scientific issues in the relatively unexplored area at the interface between solids, liquids, and biomolecules at the nanometer length scale. http://mike.zwolak.org
A new single-nucleotide polymorphism database for rainbow trout generated through whole genome re-sequencing

USDA-ARS?s Scientific Manuscript database

Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout, SNP discovery has been done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL), RNA sequencing, and whole...
Characterization of polyploid wheat genomic diversity using a high-density 90 000 single nucleotide polymorphism array

USDA-ARS?s Scientific Manuscript database

High-density single nucleotide polymorphism (SNP) genotyping chips are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships among individuals in populations and studying marker-trait associations in mapping experiments. We developed a genotyping array includ...
Novel high-speed droplet-allele specific-polymerase chain reaction: application in the rapid genotyping of single nucleotide polymorphisms.

PubMed

Taira, Chiaki; Matsuda, Kazuyuki; Yamaguchi, Akemi; Sueki, Akane; Koeda, Hiroshi; Takagi, Fumio; Kobayashi, Yukihiro; Sugano, Mitsutoshi; Honda, Takayuki

2013-09-23

Single nucleotide alterations such as single nucleotide polymorphisms (SNP) and single nucleotide mutations are associated with responses to drugs and predisposition to several diseases, and they contribute to the pathogenesis of malignancies. We developed a rapid genotyping assay based on the allele-specific polymerase chain reaction (AS-PCR) with our droplet-PCR machine (droplet-AS-PCR). Using 8 SNP loci, we evaluated the specificity and sensitivity of droplet-AS-PCR. Buccal cells were pretreated with proteinase K and subjected directly to the droplet-AS-PCR without DNA extraction. The genotypes determined using the droplet-AS-PCR were then compared with those obtained by direct sequencing. Specific PCR amplifications for the 8 SNP loci were detected, and the detection limit of the droplet-AS-PCR was found to be 0.1-5.0% by dilution experiments. Droplet-AS-PCR provided specific amplification when using buccal cells, and all the genotypes determined within 9 min were consistent with those obtained by direct sequencing. Our novel droplet-AS-PCR assay enabled high-speed amplification retaining specificity and sensitivity and provided ultra-rapid genotyping. Crude samples such as buccal cells were available for the droplet-AS-PCR assay, resulting in the reduction of the total analysis time. Droplet-AS-PCR may therefore be useful for genotyping or the detection of single nucleotide alterations. Copyright © 2013 Elsevier B.V. All rights reserved.

BDNF Polymorphism Predicts General Intelligence after Penetrating Traumatic Brain Injury

PubMed Central

Rostami, Elham; Krueger, Frank; Zoubak, Serguei; Dal Monte, Olga; Raymont, Vanessa; Pardini, Matteo; Hodgkinson, Colin A.; Goldman, David; Risling, Mårten; Grafman, Jordan

2011-01-01

Neuronal plasticity is a fundamental factor in cognitive outcome following traumatic brain injury. Brain-derived neurotrophic factor (BDNF), a member of the neurotrophin family, plays an important role in this process. While there are many ways to measure cognitive outcome, general cognitive intelligence is a strong predictor of everyday decision-making, occupational attainment, social mobility and job performance. Thus it is an excellent measure of cognitive outcome following traumatic brain injury (TBI). Although the importance of the single-nucleotide polymorphisms polymorphism on cognitive function has been previously addressed, its role in recovery of general intelligence following TBI is unknown. We genotyped male Caucasian Vietnam combat veterans with focal penetrating TBI (pTBI) (n = 109) and non-head injured controls (n = 38) for 7 BDNF single-nucleotide polymorphisms. Subjects were administrated the Armed Forces Qualification Test (AFQT) at three different time periods: pre-injury on induction into the military, Phase II (10–15 years post-injury, and Phase III (30–35 years post-injury). Two single-nucleotide polymorphisms, rs7124442 and rs1519480, were significantly associated with post-injury recovery of general cognitive intelligence with the most pronounced effect at the Phase II time point, indicating lesion-induced plasticity. The genotypes accounted for 5% of the variance of the AFQT scores, independently of other significant predictors such as pre-injury intelligence and percentage of brain volume loss. These data indicate that genetic variations in BDNF play a significant role in lesion-induced recovery following pTBI. Identifying the underlying mechanism of this brain-derived neurotrophic factor effect could provide insight into an important aspect of post-traumatic cognitive recovery. PMID:22087305
Effects of the BDNF Val66Met Polymorphism on Anxiety-Like Behavior Following Nicotine Withdrawal in Mice

PubMed Central

Lee, Bridgin G.; Anastasia, Agustin; Hempstead, Barbara L.; Lee, Francis S.

2015-01-01

Introduction: Nicotine withdrawal is characterized by both affective and cognitive symptoms. Identifying genetic polymorphisms that could affect the symptoms associated with nicotine withdrawal are important in predicting withdrawal sensitivity and identifying personalized cessation therapies. In the current study we used a mouse model of a non-synonymous single nucleotide polymorphism in the translated region of the brain-derived neurotrophic factor (BDNF) gene that substitutes a valine (Val) for a methionine (Met) amino acid (Val66Met) to examine the relationship between the Val66Met single nucleotide polymorphism and nicotine dependence. Methods: This study measured proBDNF and the BDNF prodomain levels following nicotine and nicotine withdrawal and examined a mouse model of a common polymorphism in this protein (BDNFMet/Met) in three behavioral paradigms: novelty-induced hypophagia, marble burying, and the open-field test. Results: Using the BDNF knock-in mouse containing the BDNF Val66Met polymorphism we found: (1) blunted anxiety-like behavior in BDNFMet/Met mice following withdrawal in three behavioral paradigms: novelty-induced hypophagia, marble burying, and the open-field test; (2) the anxiolytic effects of chronic nicotine are absent in BDNFMet/Met mice; and (3) an increase in BDNF prodomain in BDNFMet/Met mice following nicotine withdrawal. Conclusions: Our study is the first to examine the effect of the BDNF Val66Met polymorphism on the affective symptoms of withdrawal from nicotine in mice. In these mice, a single-nucleotide polymorphism in the translated region of the BDNF gene can result in a blunted withdrawal, as measured by decreased anxiety-like behavior. The significant increase in the BDNF prodomain in BDNFMet/Met mice following nicotine cessation suggests a possible role of this ligand in the circuitry remodeling after withdrawal. PMID:25744957
Identification of Critical Residues for the Tight Binding of Both Correct and Incorrect Nucleotides to Human DNA Polymerase λ

PubMed Central

Brown, Jessica A.; Pack, Lindsey R.; Sherrer, Shanen M.; Kshetry, Ajay K.; Newmister, Sean A.; Fowler, Jason D.; Taylor, John-Stephen; Suo, Zucai

2010-01-01

DNA polymerase λ (Pol λ) is a novel X-family DNA polymerase that shares 34% sequence identity with DNA polymerase β (Pol β). Pre-steady state kinetic studies have shown that the Pol λ•DNA complex binds both correct and incorrect nucleotides 130-fold tighter on average than the Pol β•DNA complex, although, the base substitution fidelity of both polymerases is 10−4 to 10−5. To better understand Pol λ’s tight nucleotide binding affinity, we created single- and double-substitution mutants of Pol λ to disrupt interactions between active site residues and an incoming nucleotide or a template base. Single-turnover kinetic assays showed that Pol λ binds to an incoming nucleotide via cooperative interactions with active site residues (R386, R420, K422, Y505, F506, A510, and R514). Disrupting protein interactions with an incoming correct or incorrect nucleotide impacted binding with each of the common structural moieties in the following order: triphosphate ≫ base > ribose. In addition, the loss of Watson-Crick hydrogen bonding between the nucleotide and template base led to a moderate increase in the Kd. The fidelity of Pol λ was maintained predominantly by a single residue, R517, which has minor groove interactions with the DNA template. PMID:20851705
Genome-wide association study of fertility traits in dairy cattle using high-density single nucleotide polymorphism marker panels

USDA-ARS?s Scientific Manuscript database

Unfavorable genetic correlations between production and fertility traits are well documented. Genetic selection for fertility traits is slow, however, due to low heritabilities. Identification of single nucleotide polymorphisms (SNP) involved in reproduction could improve reliability of genomic esti...
Discovery, Validation and Characterization of 1039 Cattle Single Nucleotide Polymorphisms

USDA-ARS?s Scientific Manuscript database

We identified approximately 13000 putative single nucleotide polymorphisms (SNPs) by comparison of repeat-masked BAC-end sequences from the cattle RPCI-42 BAC library with whole-genome shotgun contigs of cattle genome assembly Btau 1.0. Genotyping of a subset of these SNPs was performed on a panel ...
High-throughput single nucleotide polymorphism genotyping for breeding applications in rice using the BeadXpress platform

USDA-ARS?s Scientific Manuscript database

Multiplexed single nucleotide polymorphism (SNP) markers have the potential to increase the speed and cost-effectiveness of genotyping, provided that an optimal SNP density is used for each application. To test the efficiency of multiplexed SNP genotyping for diversity, mapping and breeding applicat...
Developing Single Nucleotide Polymorphism (SNP) markers from transcriptome sequences for the identification of longan (Dimocarpus longan) germplasm

USDA-ARS?s Scientific Manuscript database

Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in...
Informativeness of single nucleotide polymorphisms and relationships among onion populations from important world production regions

USDA-ARS?s Scientific Manuscript database

Single nucleotide polymorphisms (SNPs) were genotyped using a high-density array and DNAs from individual plants from important onion populations from major production regions world-wide and the likely progenitor of onion, Allium vavilovii. Genotypes at 1226 SNPs were used to estimate genetic relati...
Relationships among calpastatin single nucleotide polymorphisms, calpastatin expression and tenderness in pork longissimus

USDA-ARS?s Scientific Manuscript database

Genome scans in the pig have identified a region on chromosome 2 (SSC2) associated with tenderness. Calpastatin is a likely positional candidate gene in this region because of its inhibitory role in the calpain system that is involved in postmortem tenderization. Novel single nucleotide polymorphism...
Lineage and genogroup-defining single nucleotide polymorphisms of Escherichia coli 0157:H7

USDA-ARS?s Scientific Manuscript database

Escherichia coli O157:H7 is a zoonotic human pathogen for which cattle are an important reservoir host. Using both previously published and new sequencing data, a 48-locus single nucleotide polymorphism (SNP) based typing panel was developed that redundantly identified eleven genogroups that span ...
A new single-nucleotide polymorphisms database for rainbow trout generated through whole genome resequencing of selected samples

USDA-ARS?s Scientific Manuscript database

Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout, SNP discovery has been done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL), RNA sequencing, and whole...
The value of genetic information for diabetes risk prediction - differences according to sex, age, family history and obesity.

PubMed

Mühlenbruch, Kristin; Jeppesen, Charlotte; Joost, Hans-Georg; Boeing, Heiner; Schulze, Matthias B

2013-01-01

Genome-wide association studies have identified numerous single nucleotide polymorphisms associated with type 2 diabetes through the past years. In previous studies, the usefulness of these genetic markers for prediction of diabetes was found to be limited. However, differences may exist between substrata of the population according to the presence of major diabetes risk factors. This study aimed to investigate the added predictive value of genetic information (42 single nucleotide polymorphisms) in subgroups of sex, age, family history of diabetes, and obesity. A case-cohort study (random subcohort N = 1,968; incident cases: N = 578) within the European Prospective Investigation into Cancer and Nutrition Potsdam study was used. Prediction models without and with genetic information were evaluated in terms of the area under the receiver operating characteristic curve and the integrated discrimination improvement. Stratified analyses included subgroups of sex, age (<50 or ≥50 years), family history (positive if either father or mother or a sibling has/had diabetes), and obesity (BMI< or ≥30 kg/m(2)). A genetic risk score did not improve prediction above classic and metabolic markers, but - compared to a non-invasive prediction model - genetic information slightly improved the area under the receiver operating characteristic curve (difference [95%-CI]: 0.007 [0.002-0.011]). Stratified analyses showed stronger improvement in the older age group (0.010 [0.002-0.018]), the group with a positive family history (0.012 [0.000-0.023]) and among obese participants (0.015 [-0.005-0.034]) compared to the younger participants (0.005 [-0.004-0.014]), participants with a negative family history (0.003 [-0.001-0.008]) and non-obese (0.007 [0.000-0.014]), respectively. No difference was found between men and women. There was no incremental value of genetic information compared to standard non-invasive and metabolic markers. Our study suggests that inclusion of genetic variants in diabetes risk prediction might be useful for subgroups with already manifest risk factors such as older age, a positive family history and obesity.
A novel MALDI–TOF based methodology for genotyping single nucleotide polymorphisms

PubMed Central

Blondal, Thorarinn; Waage, Benedikt G.; Smarason, Sigurdur V.; Jonsson, Frosti; Fjalldal, Sigridur B.; Stefansson, Kari; Gulcher, Jeffery; Smith, Albert V.

2003-01-01

A new MALDI–TOF based detection assay was developed for analysis of single nucleotide polymorphisms (SNPs). It is a significant modification on the classic three-step minisequencing method, which includes a polymerase chain reaction (PCR), removal of excess nucleotides and primers, followed by primer extension in the presence of dideoxynucleotides using modified thermostable DNA polymerase. The key feature of this novel assay is reliance upon deoxynucleotide mixes, lacking one of the nucleotides at the polymorphic position. During primer extension in the presence of depleted nucleotide mixes, standard thermostable DNA polymerases dissociate from the template at positions requiring a depleted nucleotide; this principal was harnessed to create a genotyping assay. The assay design requires a primer- extension primer having its 3′-end one nucleotide upstream from the interrogated site. The assay further utilizes the same DNA polymerase in both PCR and the primer extension step. This not only simplifies the assay but also greatly reduces the cost per genotype compared to minisequencing methodology. We demonstrate accurate genotyping using this methodology for two SNPs run in both singleplex and duplex reactions. We term this assay nucleotide depletion genotyping (NUDGE). Nucleotide depletion genotyping could be extended to other genotyping assays based on primer extension such as detection by gel or capillary electrophoresis. PMID:14654708
Biological nanopore MspA for DNA sequencing

NASA Astrophysics Data System (ADS)

Manrao, Elizabeth A.

Unlocking the information hidden in the human genome provides insight into the inner workings of complex biological systems and can be used to greatly improve health-care. In order to allow for widespread sequencing, new technologies are required that provide fast and inexpensive readings of DNA. Nanopore sequencing is a third generation DNA sequencing technology that is currently being developed to fulfill this need. In nanopore sequencing, a voltage is applied across a small pore in an electrolyte solution and the resulting ionic current is recorded. When DNA passes through the channel, the ionic current is partially blocked. If the DNA bases uniquely modulate the ionic current flowing through the channel, the time trace of the current can be related to the sequence of DNA passing through the pore. There are two main challenges to realizing nanopore sequencing: identifying a pore with sensitivity to single nucleotides and controlling the translocation of DNA through the pore so that the small single nucleotide current signatures are distinguishable from background noise. In this dissertation, I explore the use of Mycobacterium smegmatis porin A (MspA) for nanopore sequencing. In order to determine MspA's sensitivity to single nucleotides, DNA strands of various compositions are held in the pore as the resulting ionic current is measured. DNA is immobilized in MspA by attaching it to a large molecule which acts as an anchor. This technique confirms the single nucleotide resolution of the pore and additionally shows that MspA is sensitive to epigenetic modifications and single nucleotide polymorphisms. The forces from the electric field within MspA, the effective charge of nucleotides, and elasticity of DNA are estimated using a Freely Jointed Chain model of single stranded DNA. These results offer insight into the interactions of DNA within the pore. With the nucleotide sensitivity of MspA confirmed, a method is introduced to controllably pass DNA through the pore. Using a DNA polymerase, DNA strands are stepped through MspA one nucleotide at a time. The steps are observable as distinct levels on the ionic-current time-trace and are related to the DNA sequence. These experiments overcome the two fundamental challenges to realizing MspA nanopore sequencing and pave the way to the development of a commercial technology.
Advantages and limitations of multiple-trait genomic prediction for Fusarium head blight severity in hybrid wheat (Triticum aestivum L.).

PubMed

Schulthess, Albert W; Zhao, Yusheng; Longin, C Friedrich H; Reif, Jochen C

2018-03-01

Predictabilities for wheat hybrids less related to the estimation set were improved by shifting from single- to multiple-trait genomic prediction of Fusarium head blight severity. Breeding for improved Fusarium head blight resistance (FHBr) of wheat is a very laborious and expensive task. FHBr complexity is mainly due to its highly polygenic nature and because FHB severity (FHBs) is greatly influenced by the environment. Associated traits plant height and heading date may provide additional information related to FHBr, but this is ignored in single-trait genomic prediction (STGP). The aim of our study was to explore the benefits in predictabilities of multiple-trait genomic prediction (MTGP) over STGP of target trait FHBs in a population of 1604 wheat hybrids using information on 17,372 single nucleotide polymorphism markers along with indicator traits plant height and heading date. The additive inheritance of FHBs allowed accurate hybrid performance predictions using information on general combining abilities or average performance of both parents without the need of markers. Information on molecular markers and indicator trait(s) improved FHBs predictabilities for hybrids less related to the estimation set. Indicator traits must be observed on the predicted individuals to benefit from MTGP. Magnitudes of genetic and phenotypic correlations along with improvements in predictabilities made plant height a better indicator trait for FHBs than heading date. Thus, MTGP having only plant height as indicator trait already maximized FHBs predictabilities. Provided a good indicator trait was available, MTGP could reduce the impacts of genotype environment [Formula: see text] interaction on STGP for hybrids less related to the estimation set.
Enabling multiplexed testing of pooled donor cells through whole-genome sequencing.

PubMed

Chan, Yingleong; Chan, Ying Kai; Goodman, Daniel B; Guo, Xiaoge; Chavez, Alejandro; Lim, Elaine T; Church, George M

2018-04-19

We describe a method that enables the multiplex screening of a pool of many different donor cell lines. Our method accurately predicts each donor proportion from the pool without requiring the use of unique DNA barcodes as markers of donor identity. Instead, we take advantage of common single nucleotide polymorphisms, whole-genome sequencing, and an algorithm to calculate the proportions from the sequencing data. By testing using simulated and real data, we showed that our method robustly predicts the individual proportions from a mixed-pool of numerous donors, thus enabling the multiplexed testing of diverse donor cells en masse.More information is available at https://pgpresearch.med.harvard.edu/poolseq/.
Base-By-Base: single nucleotide-level analysis of whole viral genome alignments.

PubMed

Brodie, Ryan; Smith, Alex J; Roper, Rachel L; Tcherepanov, Vasily; Upton, Chris

2004-07-14

With ever increasing numbers of closely related virus genomes being sequenced, it has become desirable to be able to compare two genomes at a level more detailed than gene content because two strains of an organism may share the same set of predicted genes but still differ in their pathogenicity profiles. For example, detailed comparison of multiple isolates of the smallpox virus genome (each approximately 200 kb, with 200 genes) is not feasible without new bioinformatics tools. A software package, Base-By-Base, has been developed that provides visualization tools to enable researchers to 1) rapidly identify and correct alignment errors in large, multiple genome alignments; and 2) generate tabular and graphical output of differences between the genomes at the nucleotide level. Base-By-Base uses detailed annotation information about the aligned genomes and can list each predicted gene with nucleotide differences, display whether variations occur within promoter regions or coding regions and whether these changes result in amino acid substitutions. Base-By-Base can connect to our mySQL database (Virus Orthologous Clusters; VOCs) to retrieve detailed annotation information about the aligned genomes or use information from text files. Base-By-Base enables users to quickly and easily compare large viral genomes; it highlights small differences that may be responsible for important phenotypic differences such as virulence. It is available via the Internet using Java Web Start and runs on Macintosh, PC and Linux operating systems with the Java 1.4 virtual machine.
Variation of Cats under Domestication: Genetic Assignment of Domestic Cats to Breeds and Worldwide Random Bred Populations

PubMed Central

Kurushima, J. D.; Lipinski, M. J.; Gandolfi, B.; Froenicke, L.; Grahn, J. C.; Grahn, R. A.; Lyons, L. A.

2012-01-01

Summary Both cat breeders and the lay public have interests in the origins of their pets, not only in the genetic identity of the purebred individuals, but also the historical origins of common household cats. The cat fancy is a relatively new institution with over 85% of its 40–50 breeds arising only in the past 75 years, primarily through selection on single-gene aesthetic traits. The short, yet intense cat breed history poses a significant challenge to the development of a genetic marker-based breed identification strategy. Using different breed assignment strategies and methods, 477 cats representing 29 fancy breeds were analysed with 38 short tandem repeats, 148 intergenic and five phenotypic single nucleotide polymorphisms. Results suggest the frequentist method of Paetkau (accuracy single nucleotide polymorphisms = 0.78, short tandem repeats = 0.88) surpasses the Bayesian method of Rannala and Mountain (single nucleotide polymorphisms = 0.56, short tandem repeats = 0.83) for accurate assignment of individuals to the correct breed. Additionally, a post-assignment verification step with the five phenotypic single nucleotide polymorphisms accurately identified between 0.31 and 0.58 of the mis-assigned individuals raising the sensitivity of assignment with the frequentist method to 0.89 and 0.92 single nucleotide polymorphisms and short tandem repeats respectively. This study provides a novel multi-step assignment strategy and suggests that, despite their short breed history and breed family groupings, a majority of cats can be assigned to their proper breed or population of origin, i.e. race. PMID:23171373
Imputation of single nucleotide polymorhpism genotypes of Hereford cattle: reference panel size, family relationship and population structure

USDA-ARS?s Scientific Manuscript database

The objective of this study is to investigate single nucleotide polymorphism (SNP) genotypes imputation of Hereford cattle. Purebred Herefords were from two sources, Line 1 Hereford (N=240) and representatives of Industry Herefords (N=311). Using different reference panels of 62 and 494 males with 1...
A resource of single-nucleotide polymorphisms for rainbow trout generated by restriction-site associated DNA sequencing of doubled haploids

USDA-ARS?s Scientific Manuscript database

Salmonid genomes are considered to be in a pseudo-tetraploid state as a result of an evolutionarily recent genome duplication event. This situation complicates single nucleotide polymorphism (SNP) discovery in rainbow trout as many putative SNPs are actually paralogous sequence variants (PSVs) and ...

Single nucleotide polymorphisms in candidate genes associated with fertilizing ability of sperm and subsequent embryonic development in cattle

USDA-ARS?s Scientific Manuscript database

Fertilization and development of the preimplantation embryo is under genetic control. The goal of the current study was to test 434 single nucleotide polymorphisms (SNPs) for association with genetic variation in fertilization and early embryonic development. The approach was to produce embryos from...
Prospects for inferring pairwise relationships with single nucleotide polymorphisms

Treesearch

Jeffery C. Glaubitz; O. Eugene, Jr. Rhodes; J. Andrew DeWoody

2003-01-01

An extraordinarily large number of single nucleotide polymorphisms (SNPs) are now available in humans as well as in other model organisms. Technological advancements may soon make it feasible to assay hundreds of SNPs in virtually any organism of interest. One potential application of SNPs is the determination of pairwise genetic relationships in populations without...
Short communication: Relationship of call rate and accuracy of single nucleotide polymorphism genotypes in dairy cattle

USDA-ARS?s Scientific Manuscript database

Call rate has been used as a measure of quality on both a single nucleotide polymorphism (SNP) and animal basis since SNP genotypes were first used in genomic evaluation of dairy cattle. The genotyping laboratories perform initial quality control screening and genotypes that fail are usually exclude...
Single nucleotide polymorphisms generated by genotyping by sequencing to characterize genome-wide diversity, linkage disequilibrium, and selective sweeps in cultivated watermelon

USDA-ARS?s Scientific Manuscript database

Large datasets containing single nucleotide polymorphisms (SNPs) are used to analyze genome-wide diversity in a robust collection of cultivars from representative accessions, across the world. The extent of linkage disequilibrium (LD) within a population determines the number of markers required fo...
Single-molecule comparison of DNA Pol I activity with native and analog nucleotides

NASA Astrophysics Data System (ADS)

Gul, Osman; Olsen, Tivoli; Choi, Yongki; Corso, Brad; Weiss, Gregory; Collins, Philip

2014-03-01

DNA polymerases are critical enzymes for DNA replication, and because of their complex catalytic cycle they are excellent targets for investigation by single-molecule experimental techniques. Recently, we studied the Klenow fragment (KF) of DNA polymerase I using a label-free, electronic technique involving single KF molecules attached to carbon nanotube transistors. The electronic technique allowed long-duration monitoring of a single KF molecule while processing thousands of template strands. Processivity of up to 42 nucleotide bases was directly observed, and statistical analysis of the recordings determined key kinetic parameters for the enzyme's open and closed conformations. Subsequently, we have used the same technique to compare the incorporation of canonical nucleotides like dATP to analogs like 1-thio-2'-dATP. The analog had almost no affect on duration of the closed conformation, during which the nucleotide is incorporated. On the other hand, the analog increased the rate-limiting duration of the open conformation by almost 40%. We propose that the thiolated analog interferes with KF's recognition and binding, two key steps that determine its ensemble turnover rate.
Transcript-specific, single-nucleotide polymorphism discovery and linkage analysis in hexaploid bread wheat (Triticum aestivum L.).

PubMed

Allen, Alexandra M; Barker, Gary L A; Berry, Simon T; Coghill, Jane A; Gwilliam, Rhian; Kirby, Susan; Robinson, Phil; Brenchley, Rachel C; D'Amore, Rosalinda; McKenzie, Neil; Waite, Darren; Hall, Anthony; Bevan, Michael; Hall, Neil; Edwards, Keith J

2011-12-01

Food security is a global concern and substantial yield increases in cereal crops are required to feed the growing world population. Wheat is one of the three most important crops for human and livestock feed. However, the complexity of the genome coupled with a decline in genetic diversity within modern elite cultivars has hindered the application of marker-assisted selection (MAS) in breeding programmes. A crucial step in the successful application of MAS in breeding programmes is the development of cheap and easy to use molecular markers, such as single-nucleotide polymorphisms. To mine selected elite wheat germplasm for intervarietal single-nucleotide polymorphisms, we have used expressed sequence tags derived from public sequencing programmes and next-generation sequencing of normalized wheat complementary DNA libraries, in combination with a novel sequence alignment and assembly approach. Here, we describe the development and validation of a panel of 1114 single-nucleotide polymorphisms in hexaploid bread wheat using competitive allele-specific polymerase chain reaction genotyping technology. We report the genotyping results of these markers on 23 wheat varieties, selected to represent a broad cross-section of wheat germplasm including a number of elite UK varieties. Finally, we show that, using relatively simple technology, it is possible to rapidly generate a linkage map containing several hundred single-nucleotide polymorphism markers in the doubled haploid mapping population of Avalon × Cadenza. © 2011 The Authors. Plant Biotechnology Journal © 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd.
TPH-2 Polymorphisms Interact with Early Life Stress to Influence Response to Treatment with Antidepressant Drugs.

PubMed

Xu, Zhi; Reynolds, Gavin P; Yuan, Yonggui; Shi, Yanyan; Pu, Mengjia; Zhang, Zhijun

2016-11-01

Variation in genes implicated in monoamine neurotransmission may interact with environmental factors to influence antidepressant response. We aimed to determine how a range of single nucleotide polymorphisms in monoaminergic genes influence this response to treatment and how they interact with childhood trauma and recent life stress in a Chinese sample. An initial study of monoaminergic coding region single nucleotide polymorphisms identified significant associations of TPH2 and HTR1B single nucleotide polymorphisms with treatment response that showed interactions with childhood and recent life stress, respectively (Xu et al., 2012). A total of 47 further single nucleotide polymorphisms in 17 candidate monoaminergic genes were genotyped in 281 Chinese Han patients with major depressive disorder. Response to 6 weeks' antidepressant treatment was determined by change in the 17-item Hamilton Depression Rating Scale score, and previous stressful events were evaluated by the Life Events Scale and Childhood Trauma Questionnaire-Short Form. Three TPH2 single nucleotide polymorphisms (rs11178998, rs7963717, and rs2171363) were significantly associated with antidepressant response in this Chinese sample, as was a haplotype in TPH2 (rs2171363 and rs1487278). One of these, rs2171363, showed a significant interaction with childhood adversity in its association with antidepressant response. These findings provide further evidence that variation in TPH2 is associated with antidepressant response and may also interact with childhood trauma to influence outcome of antidepressant treatment. © The Author 2016. Published by Oxford University Press on behalf of CINP.
TPH-2 Polymorphisms Interact with Early Life Stress to Influence Response to Treatment with Antidepressant Drugs

PubMed Central

Reynolds, Gavin P.; Yuan, Yonggui; Shi, Yanyan; Pu, Mengjia; Zhang, Zhijun

2016-01-01

Background: Variation in genes implicated in monoamine neurotransmission may interact with environmental factors to influence antidepressant response. We aimed to determine how a range of single nucleotide polymorphisms in monoaminergic genes influence this response to treatment and how they interact with childhood trauma and recent life stress in a Chinese sample. An initial study of monoaminergic coding region single nucleotide polymorphisms identified significant associations of TPH2 and HTR1B single nucleotide polymorphisms with treatment response that showed interactions with childhood and recent life stress, respectively (Xu et al., 2012). Methods: A total of 47 further single nucleotide polymorphisms in 17 candidate monoaminergic genes were genotyped in 281 Chinese Han patients with major depressive disorder. Response to 6 weeks’ antidepressant treatment was determined by change in the 17-item Hamilton Depression Rating Scale score, and previous stressful events were evaluated by the Life Events Scale and Childhood Trauma Questionnaire-Short Form. Results: Three TPH2 single nucleotide polymorphisms (rs11178998, rs7963717, and rs2171363) were significantly associated with antidepressant response in this Chinese sample, as was a haplotype in TPH2 (rs2171363 and rs1487278). One of these, rs2171363, showed a significant interaction with childhood adversity in its association with antidepressant response. Conclusions: These findings provide further evidence that variation in TPH2 is associated with antidepressant response and may also interact with childhood trauma to influence outcome of antidepressant treatment. PMID:27521242
Combined use of a new SNP-based assay and multilocus SSR markers to assess genetic diversity of Xylella fastidiosa subsp. pauca infecting citrus and coffee plants.

PubMed

Montes-Borrego, Miguel; Lopes, Joao R S; Jiménez-Díaz, Rafael M; Landa, Blanca B

2015-03-01

Two haplotypes of Xylella fastidiosa subsp. pauca (Xfp) that correlated with their host of origin were identified in a collection of 90 isolates infecting citrus and coffee plants in Brazil, based on a single-nucleotide polymorphism in the gyrB sequence. A new single-nucleotide primer extension (SNuPE) protocol was designed for rapid identification of Xfp according to the host source. The protocol proved to be robust for the prediction of the Xfp host source in blind tests using DNA from cultures of the bacterium, infected plants, and insect vectors allowed to feed on Xfp-infected citrus plants. AMOVA and STRUCTURE analyses of microsatellite data separated most Xfp populations on the basis of their host source, indicating that they were genetically distinct. The combined use of the SNaPshot protocol and three previously developed multilocus SSR markers showed that two haplotypes and distinct isolates of Xfp infect citrus and coffee in Brazil and that multiple, genetically different isolates can be present in a single orchard or infect a single tree. This combined approach will be very useful in studies of the epidemiology of Xfp-induced diseases, host specificity of bacterial genotypes, the occurrence of Xfp host jumping, vector feeding habits, etc., in economically important cultivated plants or weed host reservoirs of Xfp in Brazil and elsewhere. Copyright© by the Spanish Society for Microbiology and Institute for Catalan Studies.
Inherited variations in the SOD and GPX gene families and cancer risk.

PubMed

Yuzhalin, Arseniy E; Kutikhin, Anton G

2012-05-01

Antioxidant defence enzymes are essential protectors of living organisms against oxidative stress. These enzymes are involved in the detoxification and decomposition of harmful chemical compounds called reactive oxygen species (ROS), which are, first and foremost, a source of intracellular oxidative stress. ROS directly promote the oxidative damage of genes resulting in aberrant regulation of many vital cell processes. As a consequence, the presence of ROS can lead to genomic instability, deregulation of transcription, induction of mitogenic signal transduction pathways and replication errors, all of which may increase the risk of cancer development. Single nucleotide polymorphisms of antioxidant defence genes may significantly modify the functional activity of the encoded proteins; therefore, certain alleles can be established as risk factors for particular cancer types. In the future, these risk alleles may be utilized as genomic markers of cancer predisposition to allow for early prevention measures among carriers of these alleles. The review is devoted to common single nucleotide polymorphisms of the superoxide dismutase (SOD) and glutathione peroxidase (GPX) gene families and their impact on carcinogenesis. The predictive significance of several polymorphisms was determined, and these polymorphisms were recommended for further in-depth research.
Identification of a psoriasis susceptibility candidate gene by linkage disequilibrium mapping with a localized single nucleotide polymorphism map.

PubMed

Hewett, Duncan; Samuelsson, Lena; Polding, Joanne; Enlund, Fredrik; Smart, Devi; Cantone, Kathryn; See, Chee Gee; Chadha, Sapna; Inerot, Annica; Enerback, Charlotta; Montgomery, Doug; Christodolou, Chris; Robinson, Phil; Matthews, Paul; Plumpton, Mary; Wahlstrom, Jan; Swanbeck, Gunnar; Martinsson, Tommy; Roses, Allen; Riley, John; Purvis, Ian

2002-03-01

Psoriasis is a chronic inflammatory disease of the skin with both genetic and environmental risk factors. Here we describe the creation of a single-nucleotide polymorphism (SNP) map spanning 900-1200 kb of chromosome 3q21, which had been previously recognized as containing a psoriasis susceptibility locus, PSORS5. We genotyped 644 individuals, from 195 Swedish psoriatic families, for 19 polymorphisms. Linkage disequilibrium (LD) between marker and disease was assessed using the transmission/disequilibrium test (TDT). In the TDT analysis, alleles of three of these SNPs showed significant association with disease (P<0.05). A 160-kb interval encompassing these three SNPs was sequenced, and a coding sequence consisting of 13 exons was identified. The predicted protein shares 30-40% homology with the family of cation/chloride cotransporters. A five-marker haplotype spanning the 3' half of this gene is associated with psoriasis to a P value of 3.8<10(-5). We have called this gene SLC12A8, coding for a member of the solute carrier family 12 proteins. It belongs to a class of genes that were previously unrecognized as playing a role in psoriasis pathogenesis.
Association analysis identifies 65 new breast cancer risk loci.

PubMed

Michailidou, Kyriaki; Lindström, Sara; Dennis, Joe; Beesley, Jonathan; Hui, Shirley; Kar, Siddhartha; Lemaçon, Audrey; Soucy, Penny; Glubb, Dylan; Rostamianfar, Asha; Bolla, Manjeet K; Wang, Qin; Tyrer, Jonathan; Dicks, Ed; Lee, Andrew; Wang, Zhaoming; Allen, Jamie; Keeman, Renske; Eilber, Ursula; French, Juliet D; Qing Chen, Xiao; Fachal, Laura; McCue, Karen; McCart Reed, Amy E; Ghoussaini, Maya; Carroll, Jason S; Jiang, Xia; Finucane, Hilary; Adams, Marcia; Adank, Muriel A; Ahsan, Habibul; Aittomäki, Kristiina; Anton-Culver, Hoda; Antonenkova, Natalia N; Arndt, Volker; Aronson, Kristan J; Arun, Banu; Auer, Paul L; Bacot, François; Barrdahl, Myrto; Baynes, Caroline; Beckmann, Matthias W; Behrens, Sabine; Benitez, Javier; Bermisheva, Marina; Bernstein, Leslie; Blomqvist, Carl; Bogdanova, Natalia V; Bojesen, Stig E; Bonanni, Bernardo; Børresen-Dale, Anne-Lise; Brand, Judith S; Brauch, Hiltrud; Brennan, Paul; Brenner, Hermann; Brinton, Louise; Broberg, Per; Brock, Ian W; Broeks, Annegien; Brooks-Wilson, Angela; Brucker, Sara Y; Brüning, Thomas; Burwinkel, Barbara; Butterbach, Katja; Cai, Qiuyin; Cai, Hui; Caldés, Trinidad; Canzian, Federico; Carracedo, Angel; Carter, Brian D; Castelao, Jose E; Chan, Tsun L; David Cheng, Ting-Yuan; Seng Chia, Kee; Choi, Ji-Yeob; Christiansen, Hans; Clarke, Christine L; Collée, Margriet; Conroy, Don M; Cordina-Duverger, Emilie; Cornelissen, Sten; Cox, David G; Cox, Angela; Cross, Simon S; Cunningham, Julie M; Czene, Kamila; Daly, Mary B; Devilee, Peter; Doheny, Kimberly F; Dörk, Thilo; Dos-Santos-Silva, Isabel; Dumont, Martine; Durcan, Lorraine; Dwek, Miriam; Eccles, Diana M; Ekici, Arif B; Eliassen, A Heather; Ellberg, Carolina; Elvira, Mingajeva; Engel, Christoph; Eriksson, Mikael; Fasching, Peter A; Figueroa, Jonine; Flesch-Janys, Dieter; Fletcher, Olivia; Flyger, Henrik; Fritschi, Lin; Gaborieau, Valerie; Gabrielson, Marike; Gago-Dominguez, Manuela; Gao, Yu-Tang; Gapstur, Susan M; García-Sáenz, José A; Gaudet, Mia M; Georgoulias, Vassilios; Giles, Graham G; Glendon, Gord; Goldberg, Mark S; Goldgar, David E; González-Neira, Anna; Grenaker Alnæs, Grethe I; Grip, Mervi; Gronwald, Jacek; Grundy, Anne; Guénel, Pascal; Haeberle, Lothar; Hahnen, Eric; Haiman, Christopher A; Håkansson, Niclas; Hamann, Ute; Hamel, Nathalie; Hankinson, Susan; Harrington, Patricia; Hart, Steven N; Hartikainen, Jaana M; Hartman, Mikael; Hein, Alexander; Heyworth, Jane; Hicks, Belynda; Hillemanns, Peter; Ho, Dona N; Hollestelle, Antoinette; Hooning, Maartje J; Hoover, Robert N; Hopper, John L; Hou, Ming-Feng; Hsiung, Chia-Ni; Huang, Guanmengqian; Humphreys, Keith; Ishiguro, Junko; Ito, Hidemi; Iwasaki, Motoki; Iwata, Hiroji; Jakubowska, Anna; Janni, Wolfgang; John, Esther M; Johnson, Nichola; Jones, Kristine; Jones, Michael; Jukkola-Vuorinen, Arja; Kaaks, Rudolf; Kabisch, Maria; Kaczmarek, Katarzyna; Kang, Daehee; Kasuga, Yoshio; Kerin, Michael J; Khan, Sofia; Khusnutdinova, Elza; Kiiski, Johanna I; Kim, Sung-Won; Knight, Julia A; Kosma, Veli-Matti; Kristensen, Vessela N; Krüger, Ute; Kwong, Ava; Lambrechts, Diether; Le Marchand, Loic; Lee, Eunjung; Lee, Min Hyuk; Lee, Jong Won; Neng Lee, Chuen; Lejbkowicz, Flavio; Li, Jingmei; Lilyquist, Jenna; Lindblom, Annika; Lissowska, Jolanta; Lo, Wing-Yee; Loibl, Sibylle; Long, Jirong; Lophatananon, Artitaya; Lubinski, Jan; Luccarini, Craig; Lux, Michael P; Ma, Edmond S K; MacInnis, Robert J; Maishman, Tom; Makalic, Enes; Malone, Kathleen E; Kostovska, Ivana Maleva; Mannermaa, Arto; Manoukian, Siranoush; Manson, JoAnn E; Margolin, Sara; Mariapun, Shivaani; Martinez, Maria Elena; Matsuo, Keitaro; Mavroudis, Dimitrios; McKay, James; McLean, Catriona; Meijers-Heijboer, Hanne; Meindl, Alfons; Menéndez, Primitiva; Menon, Usha; Meyer, Jeffery; Miao, Hui; Miller, Nicola; Taib, Nur Aishah Mohd; Muir, Kenneth; Mulligan, Anna Marie; Mulot, Claire; Neuhausen, Susan L; Nevanlinna, Heli; Neven, Patrick; Nielsen, Sune F; Noh, Dong-Young; Nordestgaard, Børge G; Norman, Aaron; Olopade, Olufunmilayo I; Olson, Janet E; Olsson, Håkan; Olswold, Curtis; Orr, Nick; Pankratz, V Shane; Park, Sue K; Park-Simon, Tjoung-Won; Lloyd, Rachel; Perez, Jose I A; Peterlongo, Paolo; Peto, Julian; Phillips, Kelly-Anne; Pinchev, Mila; Plaseska-Karanfilska, Dijana; Prentice, Ross; Presneau, Nadege; Prokofyeva, Darya; Pugh, Elizabeth; Pylkäs, Katri; Rack, Brigitte; Radice, Paolo; Rahman, Nazneen; Rennert, Gadi; Rennert, Hedy S; Rhenius, Valerie; Romero, Atocha; Romm, Jane; Ruddy, Kathryn J; Rüdiger, Thomas; Rudolph, Anja; Ruebner, Matthias; Rutgers, Emiel J T; Saloustros, Emmanouil; Sandler, Dale P; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Daniel F; Schmutzler, Rita K; Schneeweiss, Andreas; Schoemaker, Minouk J; Schumacher, Fredrick; Schürmann, Peter; Scott, Rodney J; Scott, Christopher; Seal, Sheila; Seynaeve, Caroline; Shah, Mitul; Sharma, Priyanka; Shen, Chen-Yang; Sheng, Grace; Sherman, Mark E; Shrubsole, Martha J; Shu, Xiao-Ou; Smeets, Ann; Sohn, Christof; Southey, Melissa C; Spinelli, John J; Stegmaier, Christa; Stewart-Brown, Sarah; Stone, Jennifer; Stram, Daniel O; Surowy, Harald; Swerdlow, Anthony; Tamimi, Rulla; Taylor, Jack A; Tengström, Maria; Teo, Soo H; Beth Terry, Mary; Tessier, Daniel C; Thanasitthichai, Somchai; Thöne, Kathrin; Tollenaar, Rob A E M; Tomlinson, Ian; Tong, Ling; Torres, Diana; Truong, Thérèse; Tseng, Chiu-Chen; Tsugane, Shoichiro; Ulmer, Hans-Ulrich; Ursin, Giske; Untch, Michael; Vachon, Celine; van Asperen, Christi J; Van Den Berg, David; van den Ouweland, Ans M W; van der Kolk, Lizet; van der Luijt, Rob B; Vincent, Daniel; Vollenweider, Jason; Waisfisz, Quinten; Wang-Gohrke, Shan; Weinberg, Clarice R; Wendt, Camilla; Whittemore, Alice S; Wildiers, Hans; Willett, Walter; Winqvist, Robert; Wolk, Alicja; Wu, Anna H; Xia, Lucy; Yamaji, Taiki; Yang, Xiaohong R; Har Yip, Cheng; Yoo, Keun-Young; Yu, Jyh-Cherng; Zheng, Wei; Zheng, Ying; Zhu, Bin; Ziogas, Argyrios; Ziv, Elad; Lakhani, Sunil R; Antoniou, Antonis C; Droit, Arnaud; Andrulis, Irene L; Amos, Christopher I; Couch, Fergus J; Pharoah, Paul D P; Chang-Claude, Jenny; Hall, Per; Hunter, David J; Milne, Roger L; García-Closas, Montserrat; Schmidt, Marjanka K; Chanock, Stephen J; Dunning, Alison M; Edwards, Stacey L; Bader, Gary D; Chenevix-Trench, Georgia; Simard, Jacques; Kraft, Peter; Easton, Douglas F

2017-11-02

Breast cancer risk is influenced by rare coding variants in susceptibility genes, such as BRCA1, and many common, mostly non-coding variants. However, much of the genetic contribution to breast cancer risk remains unknown. Here we report the results of a genome-wide association study of breast cancer in 122,977 cases and 105,974 controls of European ancestry and 14,068 cases and 13,104 controls of East Asian ancestry. We identified 65 new loci that are associated with overall breast cancer risk at P < 5 × 10 -8 . The majority of credible risk single-nucleotide polymorphisms in these loci fall in distal regulatory elements, and by integrating in silico data to predict target genes in breast cells at each locus, we demonstrate a strong overlap between candidate target genes and somatic driver genes in breast tumours. We also find that heritability of breast cancer due to all single-nucleotide polymorphisms in regulatory features was 2-5-fold enriched relative to the genome-wide average, with strong enrichment for particular transcription factor binding sites. These results provide further insight into genetic susceptibility to breast cancer and will improve the use of genetic risk scores for individualized screening and prevention.
Role of genetic variation in docetaxel-induced neutropenia and pharmacokinetics.

PubMed

Nieuweboer, A J M; Smid, M; de Graan, A-J M; Elbouazzaoui, S; de Bruijn, P; Eskens, F A L M; Hamberg, P; Martens, J W M; Sparreboom, A; de Wit, R; van Schaik, R H N; Mathijssen, R H J

2016-11-01

Docetaxel is used for treatment of several solid malignancies. In this study, we aimed for predicting docetaxel clearance and docetaxel-induced neutropenia by developing several genetic models. Therefore, pharmacokinetic data and absolute neutrophil counts (ANCs) of 213 docetaxel-treated cancer patients were collected. Next, patients were genotyped for 1936 single nucleotide polymorphisms (SNPs) in 225 genes using the drug-metabolizing enzymes and transporters platform and thereafter split into two cohorts. The combination of SNPs that best predicted severe neutropenia or low clearance was selected in one cohort and validated in the other. Patients with severe neutropenia had lower docetaxel clearance than patients with ANCs in the normal range (P=0.01). Severe neutropenia was predicted with 70% sensitivity. True low clearance (1 s.d.
MACARON: A python framework to identify and re-annotate multi-base affected codons in whole genome/exome sequence data.

PubMed

Khan, Waqasuddin; Saripella, Ganapathi Varma-; Ludwig, Thomas; Cuppens, Tania; Thibord, Florian; Génin, Emmanuelle; Deleuze, Jean-Francois; Trégouët, David-Alexandre

2018-05-03

Predicted deleteriousness of coding variants is a frequently used criterion to filter out variants detected in next-generation sequencing projects and to select candidates impacting on the risk of human diseases. Most available dedicated tools implement a base-to-base annotation approach that could be biased in presence of several variants in the same genetic codon. We here proposed the MACARON program that, from a standard VCF file, identifies, re-annotates and predicts the amino acid change resulting from multiple single nucleotide variants (SNVs) within the same genetic codon. Applied to the whole exome dataset of 573 individuals, MACARON identifies 114 situations where multiple SNVs within a genetic codon induce an amino acid change that is different from those predicted by standard single SNV annotation tool. Such events are not uncommon and deserve to be studied in sequencing projects with inconclusive findings. MACARON is written in python with codes available on the GENMED website (www.genmed.fr). david-alexandre.tregouet@inserm.fr. Supplementary data are available at Bioinformatics online.
Single nucleotide polymorphisms in uracil-processing genes, intake of one-carbon nutrients and breast cancer risk

USDA-ARS?s Scientific Manuscript database

Background/Objectives: The misincorporation of uracil into DNA leads to genomic instability. In a previous study, some of us identified four common single nucleotide polymorphisms (SNPs) in uracil-processing genes (rs2029166 and rs7296239 in SMUG1, rs34259 in UNG and rs4775748 in DUT) that were asso...
Single nucleotide polymorphisms in common bean: their discovery and genotyping using a multiplex detection system

USDA-ARS?s Scientific Manuscript database

Single-nucleotide Polymorphism (SNP) markers are by far the most common form of DNA polymorphism in a genome. The objectives of this study were to discover SNPs in common bean comparing sequences from coding and non-coding regions obtained from Genbank and genomic DNA and to compare sequencing resu...
Single nucleotide polymorphisms in specific candidate genes are associated with phenotypic differences in days open for first lactation in Holstein cows

USDA-ARS?s Scientific Manuscript database

Previously, a candidate gene approach identified 51 single nucleotide polymorphisms (SNP) associated with genetic merit for reproductive traits and 26 associated with genetic merit for production in dairy bulls. We evaluated association of the 77 SNPs with days open (DO) for first lactation in a pop...
An integrated genetic linkage map of watermelon and genetic diversity based on single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers

USDA-ARS?s Scientific Manuscript database

Watermelon (Citrullus lanatus var. lanatus) is an important vegetable fruit throughout the world. A high number of single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers should provide large coverage of the watermelon genome and high phylogenetic resolution of germplasm acces...
Identification and characterization of single nucleotide polymorphisms (SNPs) in Culex theileri (Diptera: Culicidae).

PubMed

Demirci, Berna; Lee, Yoosook; Lanzaro, Gregory C; Alten, Bulent

2012-05-01

Culex theileri Theobald (Diptera: Culicidae) is one of the most common mosquito species in northeastern Turkey and serves as a vector for various zoonotic diseases including West Nile virus. Although there have been some studies on the ecology of Cx. theileri, very little genetic data has been made available. We successfully sequenced 11 gene fragments from Cx. theileri specimens collected from the northeastern part of Turkey. On average, we found a Single nucleotide polymorphism every 45 bp. Transitions outnumbered transversions, at a ratio of 2:1. This is the first report of genetic polymorphisms in Cx. theileri and Single nucleotide polymorphism discovered from this study can be used to investigate population structure and gene-environmental interactions.
AUTO-MUTE 2.0: A Portable Framework with Enhanced Capabilities for Predicting Protein Functional Consequences upon Mutation.

PubMed

Masso, Majid; Vaisman, Iosif I

2014-01-01

The AUTO-MUTE 2.0 stand-alone software package includes a collection of programs for predicting functional changes to proteins upon single residue substitutions, developed by combining structure-based features with trained statistical learning models. Three of the predictors evaluate changes to protein stability upon mutation, each complementing a distinct experimental approach. Two additional classifiers are available, one for predicting activity changes due to residue replacements and the other for determining the disease potential of mutations associated with nonsynonymous single nucleotide polymorphisms (nsSNPs) in human proteins. These five command-line driven tools, as well as all the supporting programs, complement those that run our AUTO-MUTE web-based server. Nevertheless, all the codes have been rewritten and substantially altered for the new portable software, and they incorporate several new features based on user feedback. Included among these upgrades is the ability to perform three highly requested tasks: to run "big data" batch jobs; to generate predictions using modified protein data bank (PDB) structures, and unpublished personal models prepared using standard PDB file formatting; and to utilize NMR structure files that contain multiple models.

Homology between DNA polymerases of poxviruses, herpesviruses, and adenoviruses: nucleotide sequence of the vaccinia virus DNA polymerase gene.

PubMed Central

Earl, P L; Jones, E V; Moss, B

1986-01-01

A 5400-base-pair segment of the vaccinia virus genome was sequenced and an open reading frame of 938 codons was found precisely where the DNA polymerase had been mapped by transfer of a phosphonoacetate-resistance marker. A single nucleotide substitution changing glycine at position 347 to aspartic acid accounts for the drug resistance of the mutant vaccinia virus. The 5' end of the DNA polymerase mRNA was located 80 base pairs before the methionine codon initiating the open reading frame. Correspondence between the predicted Mr 108,577 polypeptide and the 110,000 purified enzyme indicates that little or no proteolytic processing occurs. Extensive homology, extending over 435 amino acids, was found upon comparing the DNA polymerase of vaccinia virus and DNA polymerase of Epstein-Barr virus. A highly conserved sequence of 14 amino acids in the carboxyl-terminal regions of the above DNA polymerases is also present at a similar location in adenovirus DNA polymerase. This structure, which is predicted to form a turn flanked by beta-pleated sheets, may form part of an essential binding or catalytic site that accounts for its presence in DNA polymerases of poxviruses, herpesviruses, and adenoviruses. Images PMID:3012524
Genome-Wide SNP Genotyping to Infer the Effects on Gene Functions in Tomato

PubMed Central

Hirakawa, Hideki; Shirasawa, Kenta; Ohyama, Akio; Fukuoka, Hiroyuki; Aoki, Koh; Rothan, Christophe; Sato, Shusei; Isobe, Sachiko; Tabata, Satoshi

2013-01-01

The genotype data of 7054 single nucleotide polymorphism (SNP) loci in 40 tomato lines, including inbred lines, F1 hybrids, and wild relatives, were collected using Illumina's Infinium and GoldenGate assay platforms, the latter of which was utilized in our previous study. The dendrogram based on the genotype data corresponded well to the breeding types of tomato and wild relatives. The SNPs were classified into six categories according to their positions in the genes predicted on the tomato genome sequence. The genes with SNPs were annotated by homology searches against the nucleotide and protein databases, as well as by domain searches, and they were classified into the functional categories defined by the NCBI's eukaryotic orthologous groups (KOG). To infer the SNPs' effects on the gene functions, the three-dimensional structures of the 843 proteins that were encoded by the genes with SNPs causing missense mutations were constructed by homology modelling, and 200 of these proteins were considered to carry non-synonymous amino acid substitutions in the predicted functional sites. The SNP information obtained in this study is available at the Kazusa Tomato Genomics Database (http://plant1.kazusa.or.jp/tomato/). PMID:23482505
Distinctive features of single nucleotide alterations in induced pluripotent stem cells with different types of DNA repair deficiency disorders

PubMed Central

Okamura, Kohji; Sakaguchi, Hironari; Sakamoto-Abutani, Rie; Nakanishi, Mahito; Nishimura, Ken; Yamazaki-Inoue, Mayu; Ohtaka, Manami; Periasamy, Vaiyapuri Subbarayan; Alshatwi, Ali Abdullah; Higuchi, Akon; Hanaoka, Kazunori; Nakabayashi, Kazuhiko; Takada, Shuji; Hata, Kenichiro; Toyoda, Masashi; Umezawa, Akihiro

2016-01-01

Disease-specific induced pluripotent stem cells (iPSCs) have been used as a model to analyze pathogenesis of disease. In this study, we generated iPSCs derived from a fibroblastic cell line of xeroderma pigmentosum (XP) group A (XPA-iPSCs), a rare autosomal recessive hereditary disease in which patients develop skin cancer in the areas of skin exposed to sunlight. XPA-iPSCs exhibited hypersensitivity to ultraviolet exposure and accumulation of single-nucleotide substitutions when compared with ataxia telangiectasia-derived iPSCs that were established in a previous study. However, XPA-iPSCs did not show any chromosomal instability in vitro, i.e. intact chromosomes were maintained. The results were mutually compensating for examining two major sources of mutations, nucleotide excision repair deficiency and double-strand break repair deficiency. Like XP patients, XPA-iPSCs accumulated single-nucleotide substitutions that are associated with malignant melanoma, a manifestation of XP. These results indicate that XPA-iPSCs may serve a monitoring tool (analogous to the Ames test but using mammalian cells) to measure single-nucleotide alterations, and may be a good model to clarify pathogenesis of XP. In addition, XPA-iPSCs may allow us to facilitate development of drugs that delay genetic alteration and decrease hypersensitivity to ultraviolet for therapeutic applications. PMID:27197874
Neutral changes during divergent evolution of hemoglobins

NASA Technical Reports Server (NTRS)

Jukes, T. H.

1978-01-01

A comparison of the mRNAs for rabbit and human beta-hemoglobins shows that synonymous changes in codons have accumulated three times as rapidly as nucleotide replacements that produced changes in amino acids. This agrees with predictions based on the so-called neutral theory. In addition, seven codon changes that appear to be single-base changes (according to maximum parsimony) are actually two-base changes. This indicates that the construction of primordial sequences is of limited significance when based on inferences that assume minimum base changes for amino acid replacements.
Systematic assessment of the performance of whole-genome amplification for SNP/CNV detection and β-thalassemia genotyping.

PubMed

He, Fei; Zhou, Wanjun; Cai, Ren; Yan, Tizhen; Xu, Xiangmin

2018-04-01

In this study, we aimed to assess the performance of two whole-genome amplification methods, multiple displacement amplification (MDA), and multiple annealing and looping-based amplification cycle (MALBAC), for β-thalassemia genotyping and single-nucleotide polymorphism (SNP)/copy-number variant (CNV) detection using two DNA sequencing assays. We collected peripheral blood, cell lines, and discarded embryos, and carried out MALBAC and MDA on single-cell and five-cell samples. We detected and statistically analyzed differences in the amplification efficiency, positive predictive value, sensitivity, allele dropout (ADO) rate, SNPs, and CV values between the two methods. Through Sanger sequencing at the single-cell and five-cell levels, we showed that both the amplification rate and ADO rate of MDA were better than those using MALBAC, and the sensitivity and positive predictive value obtained from MDA were higher than those from MALBAC for β-thalassemia genotyping. Using next-generation sequencing (NGS) at the single-cell level, we confirmed that MDA has better properties than MALBAC for SNP detection. However, MALBAC was more stable and homogeneous than MDA using low-depth NGS at the single-cell level for CNV detection. We conclude that MALBAC is the better option for CNV detection, while MDA is better suited for SNV detection.
The Single Nucleotide Polymorphism Consortium

NASA Technical Reports Server (NTRS)

Morgan, Michael

2003-01-01

I want to discuss both the Single Nucleotide Polymorphism (SNP) Consortium and the Human Genome Project. I am afraid most of my presentation will be thin on law and possibly too high on rhetoric. Having been engaged in a personal and direct way with these issues as a trained scientist, I find it quite difficult to be always as objective as I ought to be.
Analysis of single nucleotide polymorphisms in case-control studies.

PubMed

Li, Yonghong; Shiffman, Dov; Oberbauer, Rainer

2011-01-01

Single nucleotide polymorphisms (SNPs) are the most common type of genetic variants in the human genome. SNPs are known to modify susceptibility to complex diseases. We describe and discuss methods used to identify SNPs associated with disease in case-control studies. An outline on study population selection, sample collection and genotyping platforms is presented, complemented by SNP selection, data preprocessing and analysis.
A lateral flow biosensor for detection of single nucleotide polymorphism by circular strand displacement reaction.

PubMed

Xiao, Zhuo; Lie, Puchang; Fang, Zhiyuan; Yu, Luxin; Chen, Junhua; Liu, Jie; Ge, Chenchen; Zhou, Xuemeng; Zeng, Lingwen

2012-09-04

A lateral flow biosensor for detection of single nucleotide polymorphism based on circular strand displacement reaction (CSDPR) has been developed. Taking advantage of high fidelity of T4 DNA ligase, signal amplification by CSDPR, and the optical properties of gold nanoparticles, this assay has reached a detection limit of 0.01 fM.
A Laboratory Exercise for Genotyping Two Human Single Nucleotide Polymorphisms

ERIC Educational Resources Information Center

Fernando, James; Carlson, Bradley; LeBard, Timothy; McCarthy, Michael; Umali, Finianne; Ashton, Bryce; Rose, Ferrill F., Jr.

2016-01-01

The dramatic decrease in the cost of sequencing a human genome is leading to an era in which a wide range of students will benefit from having an understanding of human genetic variation. Since over 90% of sequence variation between humans is in the form of single nucleotide polymorphisms (SNPs), a laboratory exercise has been devised in order to…
The effects of single nucleotide polymorphisms (SNPs) of calpastatin (CAST) gene on meat tenderness of yak.

USDA-ARS?s Scientific Manuscript database

The association of single nucleotide polymorphisms (SNPs) of calpastatin (CAST) gene with shear force of 2.54 cm steaks from M. longissimus dorsi from Gannan yaks (Bos grunniens, n=181) was studied. Yaks were harvested at 2, 3, and 4 yr of age (n=51, 59, and 71, respectively), and samples of each ya...
Single nucleotide polymorphism analysis reveals heterogeneity within a seedling tree population of a polyembryonic mango cultivar.

PubMed

Winterhagen, Patrick; Wünsche, Jens-Norbert

2016-05-01

Within a polyembryonic mango seedling tree population, the genetic background of individuals should be identical because vigorous plants for cultivation are expected to develop from nucellar embryos representing maternal clones. Due to the fact that the mango cultivar 'Hôi' is assigned to the polyembryonic ecotype, an intra-cultivar variability of ethylene receptor genes was unexpected. Ethylene receptors in plants are conserved, but the number of receptors or receptor isoforms is variable regarding different plant species. However, it is shown here that the ethylene receptor MiETR1 is present in various isoforms within the mango cultivar 'Hôi'. The investigation of single nucleotide polymorphisms revealed that different MiETR1 isoforms can not be discriminated simply by individual single nucleotide exchanges but by the specific arrangement of single nucleotide polymorphisms at certain positions in the exons of MiETR1. Furthermore, an MiETR1 isoform devoid of introns in the genomic sequence was identified. The investigation demonstrates some limitations of high resolution melting and ScreenClust analysis and points out the necessity of sequencing to identify individual isoforms and to determine the variability within the tree population.
Protected DNA strand displacement for enhanced single nucleotide discrimination in double-stranded DNA.

PubMed

Khodakov, Dmitriy A; Khodakova, Anastasia S; Huang, David M; Linacre, Adrian; Ellis, Amanda V

2015-03-04

Single nucleotide polymorphisms (SNPs) are a prime source of genetic diversity. Discriminating between different SNPs provides an enormous leap towards the better understanding of the uniqueness of biological systems. Here we report on a new approach for SNP discrimination using toehold-mediated DNA strand displacement. The distinctiveness of the approach is based on the combination of both 3- and 4-way branch migration mechanisms, which allows for reliable discrimination of SNPs within double-stranded DNA generated from real-life human mitochondrial DNA samples. Aside from the potential diagnostic value, the current study represents an additional way to control the strand displacement reaction rate without altering other reaction parameters and provides new insights into the influence of single nucleotide substitutions on 3- and 4-way branch migration efficiency and kinetics.
Single nucleotide polymorphism analysis using different colored dye dimer probes

NASA Astrophysics Data System (ADS)

Marmé, Nicole; Friedrich, Achim; Denapaite, Dalia; Hakenbeck, Regine; Knemeyer, Jens-Peter

2006-09-01

Fluorescence quenching by dye dimer formation has been utilized to develop hairpin-structured DNA probes for the detection of a single nucleotide polymorphism (SNP) in the penicillin target gene pbp2x, which is implicated in the penicillin resistance of Streptococcus pneumoniae. We designed two specific DNA probes for the identification of the pbp2x genes from a penicillin susceptible strain R6 and a resistant strain Streptococcus mitis 661 using green-fluorescent tetramethylrhodamine (TMR) and red-fluorescent DY-636, respectively. Hybridization of each of the probes to its respective target DNA sequence opened the DNA hairpin probes, consequently breaking the nonfluorescent dye dimers into fluorescent species. This hybridization of the target with the hairpin probe achieved single nucleotide specific detection at nanomolar concentrations via increased fluorescence.
Single nucleotide polymorphisms and haplotypes associated with feed efficiency in beef cattle

PubMed Central

2013-01-01

Background General, breed- and diet-dependent associations between feed efficiency in beef cattle and single nucleotide polymorphisms (SNPs) or haplotypes were identified on a population of 1321 steers using a 50 K SNP panel. Genomic associations with traditional two-step indicators of feed efficiency – residual feed intake (RFI), residual average daily gain (RADG), and residual intake gain (RIG) – were compared to associations with two complementary one-step indicators of feed efficiency: efficiency of intake (EI) and efficiency of gain (EG). Associations uncovered in a training data set were evaluated on independent validation data set. A multi-SNP model was developed to predict feed efficiency. Functional analysis of genes harboring SNPs significantly associated with feed efficiency and network visualization aided in the interpretation of the results. Results For the five feed efficiency indicators, the numbers of general, breed-dependent, and diet-dependent associations with SNPs (P-value < 0.0001) were 31, 40, and 25, and with haplotypes were six, ten, and nine, respectively. Of these, 20 SNP and six haplotype associations overlapped between RFI and EI, and five SNP and one haplotype associations overlapped between RADG and EG. This result confirms the complementary value of the one and two-step indicators. The multi-SNP models included 89 SNPs and offered a precise prediction of the five feed efficiency indicators. The associations of 17 SNPs and 7 haplotypes with feed efficiency were confirmed on the validation data set. Nine clusters of Gene Ontology and KEGG pathway categories (mean P-value < 0.001) including, 9nucleotide binding; ion transport, phosphorous metabolic process, and the MAPK signaling pathway were overrepresented among the genes harboring the SNPs associated with feed efficiency. Conclusions The general SNP associations suggest that a single panel of genomic variants can be used regardless of breed and diet. The breed- and diet-dependent associations between SNPs and feed efficiency suggest that further refinement of variant panels require the consideration of the breed and management practices. The unique genomic variants associated with the one- and two-step indicators suggest that both types of indicators offer complementary description of feed efficiency that can be exploited for genome-enabled selection purposes. PMID:24066663
Expression of the Pasteurella haemolytica leukotoxin is inhibited by a locus that encodes an ATP-binding cassette homolog.

PubMed Central

Highlander, S K; Wickersham, E A; Garza, O; Weinstock, G M

1993-01-01

Multicopy and single-copy chromosomal fusions between the Pasteurella haemolytica leukotoxin regulatory region and the Escherichia coli beta-galactosidase gene have been constructed. These fusions were used as reporters to identify and isolate regulators of leukotoxin expression from a P. haemolytica cosmid library. A cosmid clone, which inhibited leukotoxin expression from multicopy and single-copy protein fusions, was isolated and found to contain the complete leukotoxin gene cluster plus additional upstream sequences. The locus responsible for inhibition of expression from leukotoxin-beta-galactosidase fusions was mapped within these upstream sequences, by transposon mutagenesis with Tn5, and its DNA sequence was determined. The inhibitory activity was found to be associated with a predicted 440-amino-acid reading frame (lapA) that lies within a four-gene arginine transport locus. LapA is predicted to be the nucleotide-binding component of this transport system and shares homology with the Clp family of proteases. Images PMID:8359916
Intracellular nucleotide and nucleotide sugar contents of cultured CHO cells determined by a fast, sensitive, and high-resolution ion-pair RP-HPLC.

PubMed

Kochanowski, N; Blanchard, F; Cacan, R; Chirat, F; Guedon, E; Marc, A; Goergen, J-L

2006-01-15

Analysis of intracellular nucleotide and nucleotide sugar contents is essential in studying protein glycosylation of mammalian cells. Nucleotides and nucleotide sugars are the donor substrates of glycosyltransferases, and nucleotides are involved in cellular energy metabolism and its regulation. A sensitive and reproducible ion-pair reverse-phase high-performance liquid chromatography (RP-HPLC) method has been developed, allowing the direct and simultaneous detection and quantification of some essential nucleotides and nucleotide sugars. After a perchloric acid extraction, 13 molecules (8 nucleotides and 5 nucleotide sugars) were separated, including activated sugars such as UDP-glucose, UDP-galactose, GDP-mannose, UDP-N-acetylglucosamine, and UDP-N-acetylgalactosamine. To validate the analytical parameters, the reproducibility, linearity of calibration curves, detection limits, and recovery were evaluated for standard mixtures and cell extracts. The developed method is capable of resolving picomolar quantities of nucleotides and nucleotide sugars in a single chromatographic run. The HPLC method was then applied to quantify intracellular levels of nucleotides and nucleotide sugars of Chinese hamster ovary (CHO) cells cultivated in a bioreactor batch process. Evolutions of the titers of nucleotides and nucleotide sugars during the batch process are discussed.
Correlation approach to identify coding regions in DNA sequences

NASA Technical Reports Server (NTRS)

Ossadnik, S. M.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Peng, C. K.; Simons, M.; Stanley, H. E.

1994-01-01

Recently, it was observed that noncoding regions of DNA sequences possess long-range power-law correlations, whereas coding regions typically display only short-range correlations. We develop an algorithm based on this finding that enables investigators to perform a statistical analysis on long DNA sequences to locate possible coding regions. The algorithm is particularly successful in predicting the location of lengthy coding regions. For example, for the complete genome of yeast chromosome III (315,344 nucleotides), at least 82% of the predictions correspond to putative coding regions; the algorithm correctly identified all coding regions larger than 3000 nucleotides, 92% of coding regions between 2000 and 3000 nucleotides long, and 79% of coding regions between 1000 and 2000 nucleotides. The predictive ability of this new algorithm supports the claim that there is a fundamental difference in the correlation property between coding and noncoding sequences. This algorithm, which is not species-dependent, can be implemented with other techniques for rapidly and accurately locating relatively long coding regions in genomic sequences.
Human leukocyte antigen (HLA) pharmacogenomic tests: potential and pitfalls.

PubMed

Daly, Ann K

2014-02-01

Adverse drug reactions involving a range of prescribed drugs and affecting the skin, liver and other organs show strong associations with particular HLA alleles. For some reactions, HLA typing prior to prescription, so that those positive for the risk allele are not given the drug associated with the reaction, shows high positive and negative predictive values. The best example of clinical implementation relates to the hypersensitivity reaction induced by the anti-HIV drug abacavir. When this reaction is phenotyped accurately, 100% of those who develop it are positive for HLA-B*57:01. Drug regulators worldwide now recommend genotyping for HLA-B*57:01 before abacavir is prescribed. Serious skin rashes including Stevens-Johnson syndrome and toxic epidermal necrosis can be induced by carbamazepine and other anticonvulsant drugs. In certain East Asians, these reactions are significantly associated with HLA-B*15:02, and typing for this allele is now recommended prior to carbamazepine prescription in these populations. Other HLA associations have been described for skin rash induced by carbamazepine, allopurinol and nevirapine and for liver injury induced by flucloxacillin, amoxicillin-clavulanate, lapatanib, lumiracoxib and ticlopidine. However, the predictive values for typing HLA alleles associated with these adverse reactions are lower. Clinical implementation therefore seems unlikely. Performing HLA typing is relatively complex compared with genotyping assays for single nucleotide polymorphisms. With emphasis on HLA-B*57:01, the approaches used commonly, including use of sequence-specific oligonucleotide PCR primers and DNA sequencing are considered, together with their successful implementation. Genotyping single nucleotide polymorphisms tagging HLA alleles is a simpler alternative to HLA typing but appears insufficiently accurate for clinical use.
XPF expression correlates with clinical outcome in squamous cell carcinoma of the head and neck

PubMed Central

Vaezi, Alec; Wang, XiaoZhe; Buch, Shama; Gooding, William; Wang, Lin; Seethala, Raja R.; Weaver, David T.; D’Andrea, Alan D.; Argiris, Athanassios; Romkes, Marjorie; Niedernhofer, Laura J.; Grandis, Jennifer R.

2011-01-01

Purpose Tumor-specific biomarkers that predict resistance to DNA damaging agents may improve therapeutic outcomes by guiding the selection of effective therapies and limiting morbidity related to ineffective approaches. XPF (ERCC4) is an essential component of several DNA repair pathways and XPF-deficient cells are exquisitely sensitive to DNA damaging agents. The purpose of this study was to determine whether XPF expression levels predict clinical response to DNA damaging agents in head and neck squamous cell carcinoma (HNSCC). Experimental Design Quantitative immunohistochemistry was used to measure XPF expression in tumors from a cohort of 80 patients with newly diagnosed HNSCC treated with radiation therapy with or without platinum-based chemotherapy; samples were collected prospectively. Genomic DNA isolated from blood samples was analyzed for nine single nucleotide polymorphisms in the XPF gene using a custom array. The primary endpoint was progression-free survival (PFS). Results XPF expression was higher in tumors from the oral cavity than from the other sites (p<0.01). High XPF expression correlated with early time to progression both by univariate (HR =1.87, p=0.03) and multivariate analysis (HR =1.83, p=0.05). The one year PFS for high expressers was 47% (95% CI = 31% – 62%) compared to 72% (95% CI = 55% – 83%) for low expressers. In addition, we identified four XPF single nucleotide polymorphisms (SNPs) that demonstrated marginal association with treatment failure. Conclusions Expression level of XPF in HNSCC tumors correlates with clinical response to DNA damaging agents. XPF has potential to guide next-generation personalized cancer therapy. PMID:21737503
Implication of common and disease specific variants in CLU, CR1, and PICALM.

PubMed

Ferrari, Raffaele; Moreno, Jorge H; Minhajuddin, Abu T; O'Bryant, Sid E; Reisch, Joan S; Barber, Robert C; Momeni, Parastoo

2012-08-01

Two recent genome-wide association studies (GWAS) for late onset Alzheimer's disease (LOAD) revealed 3 new genes: clusterin (CLU), phosphatidylinositol binding clathrin assembly protein (PICALM), and complement receptor 1 (CR1). In order to evaluate association with these genome-wide association study-identified genes and to isolate the variants contributing to the pathogenesis of LOAD, we genotyped the top single nucleotide polymorphisms (SNPs), rs11136000 (CLU), rs3818361 (CR1), and rs3851179 (PICALM), and sequenced the entire coding regions of these genes in our cohort of 342 LOAD patients and 277 control subjects. We confirmed the association of rs3851179 (PICALM) (p = 7.4 × 10(-3)) with the disease status. Through sequencing we identified 18 variants in CLU, 3 of which were found exclusively in patients; 8 variants (out of 65) in CR1 gene were only found in patients and the 16 variants identified in PICALM gene were present in both patients and controls. In silico analysis of the variants in PICALM did not predict any damaging effect on the protein. The haplotype analysis of the variants in each gene predicted a common haplotype when the 3 single nucleotide polymorphisms rs11136000 (CLU), rs3818361 (CR1), and rs3851179 (PICALM), respectively, were included. For each gene the haplotype structure and size differed between patients and controls. In conclusion, we confirmed association of CLU, CR1, and PICALM genes with the disease status in our cohort through identification of a number of disease-specific variants among patients through the sequencing of the coding region of these genes. Published by Elsevier Inc.

Gender and single nucleotide polymorphisms in MTHFR, BHMT, SPTLC1, CRBP2R, and SCARB1 are significant predictors of plasma homocysteine normalized by RBC folate in healthy adults.

USDA-ARS?s Scientific Manuscript database

Using linear regression models, we studied the main and two-way interaction effects of the predictor variables gender, age, BMI, and 64 folate/vitamin B-12/homocysteine/lipid/cholesterol-related single nucleotide polymorphisms (SNP) on log-transformed plasma homocysteine normalized by red blood cell...
Brief Report: Glutamate Transporter Gene ("SLC1A1") Single Nucleotide Polymorphism (rs301430) and Repetitive Behaviors and Anxiety in Children with Autism Spectrum Disorder

ERIC Educational Resources Information Center

Gadow, Kenneth D.; Roohi, Jasmin; DeVincent, Carla J.; Kirsch, Sarah; Hatchwell, Eli

2010-01-01

Investigated association of single nucleotide polymorphism (SNP) rs301430 in glutamate transporter gene ("SLC1A1") with severity of repetitive behaviors (obsessive-compulsive behaviors, tics) and anxiety in children with autism spectrum disorder (ASD). Mothers and/or teachers completed a validated DSM-IV-referenced rating scale for 67 children…
Effect of increasing the number of single-nucleotide polymorphisms from 60,000 to 85,000 in genomic evaluation of Holsteins

USDA-ARS?s Scientific Manuscript database

The periodic need to restock reagent pools for genotyping chips provides an opportunity to increase the number of single-nucleotide polymorphisms (SNP) on a chip at no increase in cost. A high-density chip with >140,000 SNP has been developed by GeneSeek Inc. (Lincoln, NE) to increase accuracy of ge...
Development of single-nucleotide polymorphism markers for Bromus tectorum (Poaceae) from a partially sequenced transcriptome

Treesearch

Keith R. Merrill; Craig E. Coleman; Susan E. Meyer; Elizabeth A. Leger; Katherine A. Collins

2016-01-01

Premise of the study: Bromus tectorum (Poaceae) is an annual grass species that is invasive in many areas of the world but most especially in the U.S. Intermountain West. Single-nucleotide polymorphism (SNP) markers were developed for use in investigating the geospatial and ecological diversity of B. tectorum in the Intermountain West to better understand the...
A Comprehensive Experiment for Molecular Biology: Determination of Single Nucleotide Polymorphism in Human REV3 Gene Using PCR-RFLP

ERIC Educational Resources Information Center

Zhang, Xu; Shao, Meng; Gao, Lu; Zhao, Yuanyuan; Sun, Zixuan; Zhou, Liping; Yan, Yongmin; Shao, Qixiang; Xu, Wenrong; Qian, Hui

2017-01-01

Laboratory exercise is helpful for medical students to understand the basic principles of molecular biology and to learn about the practical applications of molecular biology. We have designed a lab course on molecular biology about the determination of single nucleotide polymorphism (SNP) in human REV3 gene, the product of which is a subunit of…
Antibiotic Resistance and Single-Nucleotide Polymorphism Cluster Grouping Type in a Multinational Sample of Resistant Mycobacterium tuberculosis Isolates▿

PubMed Central

Brimacombe, M.; Hazbon, M.; Motiwala, A. S.; Alland, D.

2007-01-01

A single-nucleotide polymorphism-based cluster grouping (SCG) classification system for Mycobacterium tuberculosis was used to examine antibiotic resistance type and resistance mutations in relationship to specific evolutionary lineages. Drug resistance and resistance mutations were seen across all SCGs. SCG-2 had higher proportions of katG codon 315 mutations and resistance to four drugs. PMID:17846140
Computed Energetics of Nucleotides in Spatial Ribozyme Structures: An Accurate Identification of Functional Regions from Structure

PubMed Central

Torshin, Ivan Y.

2004-01-01

Ribozymes are functionally diverse RNA molecules with intrinsic catalytic activity. Multiple structural and biochemical studies are required to establish which nucleotide bases are involved in the catalysis. The relative energetic properties of the nucleotide bases have been analyzed in a set of the known ribozyme structures. It was found that many of the known catalytic nucleotides can be identified using only the structure without any additional biochemical data. The results of the calculations compare well with the available biochemical data on RNA stability. Extensive in silico mutagenesis suggests that most of the nucleotides in ribozymes stabilize the RNA. The calculations show that relative contribution of the catalytic bases to RNA stability observably differs from contributions of the noncatalytic bases. Distinction between the concepts of “relative stability” and “mutational stability” is suggested. As results of prediction for several models of ribozymes appear to be in agreement with the published data on the potential active site regions, the method can potentially be used for prediction of functional nucleotides from nucleic sequence. PMID:15105962
Design and characterization of a nanopore-coupled polymerase for single-molecule DNA sequencing by synthesis on an electrode array

PubMed Central

Stranges, P. Benjamin; Palla, Mirkó; Kalachikov, Sergey; Nivala, Jeff; Dorwart, Michael; Trans, Andrew; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Tao, Chuanjuan; Morozova, Irina; Li, Zengmin; Shi, Shundi; Aberra, Aman; Arnold, Cleoma; Yang, Alexander; Aguirre, Anne; Harada, Eric T.; Korenblum, Daniel; Pollard, James; Bhat, Ashwini; Gremyachinskiy, Dmitriy; Bibillo, Arek; Chen, Roger; Davis, Randy; Russo, James J.; Fuller, Carl W.; Roever, Stefan; Ju, Jingyue; Church, George M.

2016-01-01

Scalable, high-throughput DNA sequencing is a prerequisite for precision medicine and biomedical research. Recently, we presented a nanopore-based sequencing-by-synthesis (Nanopore-SBS) approach, which used a set of nucleotides with polymer tags that allow discrimination of the nucleotides in a biological nanopore. Here, we designed and covalently coupled a DNA polymerase to an α-hemolysin (αHL) heptamer using the SpyCatcher/SpyTag conjugation approach. These porin–polymerase conjugates were inserted into lipid bilayers on a complementary metal oxide semiconductor (CMOS)-based electrode array for high-throughput electrical recording of DNA synthesis. The designed nanopore construct successfully detected the capture of tagged nucleotides complementary to a DNA base on a provided template. We measured over 200 tagged-nucleotide signals for each of the four bases and developed a classification method to uniquely distinguish them from each other and background signals. The probability of falsely identifying a background event as a true capture event was less than 1.2%. In the presence of all four tagged nucleotides, we observed sequential additions in real time during polymerase-catalyzed DNA synthesis. Single-polymerase coupling to a nanopore, in combination with the Nanopore-SBS approach, can provide the foundation for a low-cost, single-molecule, electronic DNA-sequencing platform. PMID:27729524
Heated oligonucleotide ligation assay (HOLA): an affordable single nucleotide polymorphism assay.

PubMed

Black, W C; Gorrochotegui-Escalante, N; Duteau, N M

2006-03-01

Most single nucleotide polymorphism (SNP) detection requires expensive equipment and reagents. The oligonucleotide ligation assay (OLA) is an inexpensive SNP assay that detects ligation between a biotinylated "allele-specific detector" and a 3' fluorescein-labeled "reporter" oligonucleotide. No ligation occurs unless the 3' detector nucleotide is complementary to the SNP nucleotide. The original OLA used chemical denaturation and neutralization. Heated OLA (HOLA) instead uses a thermal stable ligase and cycles of denaturing and hybridization for ligation and SNP detection. The cost per genotype is approximately US$1.25 with two-allele SNPs or approximately US$1.75 with three-allele SNPs. We illustrate the development of HOLA for SNP detection in the Early Trypsin and Abundant Trypsin loci in the mosquito Aedes aegypti (L.) and at the a-glycerophosphate dehydrogenase locus in the mosquito Anopheles gambiae s.s.
Identification of rs7350481 at chromosome 11q23.3 as a novel susceptibility locus for metabolic syndrome in Japanese individuals by an exome-wide association study.

PubMed

Yamada, Yoshiji; Sakuma, Jun; Takeuchi, Ichiro; Yasukochi, Yoshiki; Kato, Kimihiko; Oguri, Mitsutoshi; Fujimaki, Tetsuo; Horibe, Hideki; Muramatsu, Masaaki; Sawabe, Motoji; Fujiwara, Yoshinori; Taniguchi, Yu; Obuchi, Shuichi; Kawai, Hisashi; Shinkai, Shoji; Mori, Seijiro; Arai, Tomio; Tanaka, Masashi

2017-06-13

We have performed exome-wide association studies to identify genetic variants that influence body mass index or confer susceptibility to obesity or metabolic syndrome in Japanese. The exome-wide association study for body mass index included 12,890 subjects, and those for obesity and metabolic syndrome included 12,968 subjects (3954 individuals with obesity, 9014 controls) and 6817 subjects (3998 individuals with MetS, 2819 controls), respectively. Exome-wide association studies were performed with Illumina HumanExome-12 DNA Analysis BeadChip or Infinium Exome-24 BeadChip arrays. The relation of genotypes of single nucleotide polymorphisms to body mass index was examined by linear regression analysis, and that of allele frequencies of single nucleotide polymorphisms to obesity or metabolic syndrome was evaluated with Fisher's exact test. The exome-wide association studies identified six, 11, and 40 single nucleotide polymorphisms as being significantly associated with body mass index, obesity (P <1.21 × 10-6), or metabolic syndrome (P <1.20 × 10-6), respectively. Subsequent multivariable logistic regression analysis with adjustment for age and sex revealed that three and five single nucleotide polymorphisms were related (P < 0.05) to obesity or metabolic syndrome, respectively, with one of these latter polymorphisms-rs7350481 (C/T) at chromosome 11q23.3-also being significantly (P < 3.13 × 10-4) associated with metabolic syndrome. The polymorphism rs7350481 may thus be a novel susceptibility locus for metabolic syndrome in Japanese. In addition, single nucleotide polymorphisms in three genes (CROT, TSC1, RIN3) and at four loci (ANKK1, ZNF804B, CSRNP3, 17p11.2) were implicated as candidate determinants of obesity and metabolic syndrome, respectively.
Meta-analysis of the relationship between single nucleotide polymorphism of IL-10-1082G/A and rheumatic heart disease.

PubMed

Dai, Weiran; Ye, Ziliang; Lu, Haili; Su, Qiang; Li, Hui; Li, Lang

2018-02-23

The results showed that there was a certain correlation between the single nucleotide polymorphism of IL-10-1082G/A and rheumatic heart disease, but there was no systematic study to verify this conclusion. Systematic review of the association between single nucleotide polymorphism of IL-10-1082G/A locus and rheumatic heart disease. Computer retrieval PubMed, EMbase, Cochrane Library, CBM, CNKI, VIP and Data WanFang, the retrieval time limit from inception to June 2017. A case control study of single nucleotide polymorphisms and rheumatic heart disease in patients with rheumatic heart disease in the IL-10-1082G/A was collected. Two researchers independently screened the literature, extracted data and evaluated the risk of bias in the study, and using RevMan5.3 software for data analysis. A total of 3 case control studies were included, including 318 patients with rheumatic heart disease and 502 controls. Meta-analysis showed that there was no correlation between IL-10-1082G/A gene polymorphism and rheumatic heart disease [AA+AG VS GG: OR = 0.62, 95% CI (0.28, 1.39), P = 0.25; AA VS AG+GG: OR = 0.73, 95% CI (0.54, 1.00), P = 0.05; AA VS GG: OR = 0.70, 95% CI(0.47, 1.05), P = 0.08; AG VS GG: OR = 0.65, 95% CI (0.22, 1.92), P = 0.43; A VS G: OR = 0.87, 95% CI (0.71, 1.06), P = 0.17]. When AA is a recessive gene, the single nucleotide polymorphism of IL-10-1082G/A is associated with the presence of rheumatic heart disease. Due to the limitations of the quantity and quality of the included literatures, the further research results were still needed.
3D RNA and functional interactions from evolutionary couplings

PubMed Central

Weinreb, Caleb; Riesselman, Adam; Ingraham, John B.; Gross, Torsten; Sander, Chris; Marks, Debora S.

2016-01-01

Summary Non-coding RNAs are ubiquitous, but the discovery of new RNA gene sequences far outpaces research on their structure and functional interactions. We mine the evolutionary sequence record to derive precise information about function and structure of RNAs and RNA-protein complexes. As in protein structure prediction, we use maximum entropy global probability models of sequence co-variation to infer evolutionarily constrained nucleotide-nucleotide interactions within RNA molecules, and nucleotide-amino acid interactions in RNA-protein complexes. The predicted contacts allow all-atom blinded 3D structure prediction at good accuracy for several known RNA structures and RNA-protein complexes. For unknown structures, we predict contacts in 160 non-coding RNA families. Beyond 3D structure prediction, evolutionary couplings help identify important functional interactions, e.g., at switch points in riboswitches and at a complex nucleation site in HIV. Aided by accelerating sequence accumulation, evolutionary coupling analysis can accelerate the discovery of functional interactions and 3D structures involving RNA. PMID:27087444
Bison PRNP genotyping and potential association with Brucella spp. seroprevalence

USGS Publications Warehouse

Seabury, C.M.; Halbert, N.D.; Gogan, P.J.P.; Templeton, J.W.; Derr, J.N.

2005-01-01

The implication that host cellular prion protein (PrPC) may function as a cell surface receptor and/or portal protein for Brucella abortus in mice prompted an evaluation of nucleotide and amino acid variation within exon 3 of the prion protein gene (PRNP) for six US bison populations. A non-synonymous single nucleotide polymorphism (T50C), resulting in the predicted amino acid replacement M17T (Met ??? Thr), was identified in each population. To date, no variation (T50: Met) has been detected at the corresponding exon 3 nucleotide and/or amino acid position for domestic cattle. Notably, 80% (20 of 25) of the Yellowstone National Park bison possessing the C/C genotype were Brucella spp. seropositive, representing a significant (P = 0.021) association between seropositivity and the C/C genotypic class. Moreover, significant differences in the distribution of PRNP exon 3 alleles and genotypes were detected between Yellowstone National Park bison and three bison populations that were either founded from seronegative stock or previously subjected to test-and-slaughter management to eradicate brucellosis. Unlike domestic cattle, no indel polymorphisms were detected within the corresponding regions of the putative bison PRNP promoter, intron 1, octapeptide repeat region or 3???-untranslated region for any population examined. This study provides the first evidence of a potential association between nucleotide variation within PRNP exon 3 and the presence of Brucella spp. antibodies in bison, implicating PrPC in the natural resistance of bison to brucellosis infection. ?? 2005 International Society for Animal Genetics.
OmpF, a nucleotide-sensing nanoprobe, computational evaluation of single channel activities

NASA Astrophysics Data System (ADS)

Abdolvahab, R. H.; Mobasheri, H.; Nikouee, A.; Ejtehadi, M. R.

2016-09-01

The results of highthroughput practical single channel experiments should be formulated and validated by signal analysis approaches to increase the recognition precision of translocating molecules. For this purpose, the activities of the single nano-pore forming protein, OmpF, in the presence of nucleotides were recorded in real time by the voltage clamp technique and used as a means for nucleotide recognition. The results were analyzed based on the permutation entropy of current Time Series (TS), fractality, autocorrelation, structure function, spectral density, and peak fraction to recognize each nucleotide, based on its signature effect on the conductance, gating frequency and voltage sensitivity of channel at different concentrations and membrane potentials. The amplitude and frequency of ion current fluctuation increased in the presence of Adenine more than Cytosine and Thymine in milli-molar (0.5 mM) concentrations. The variance of the current TS at various applied voltages showed a non-monotonic trend whose initial increasing slope in the presence of Thymine changed to a decreasing one in the second phase and was different from that of Adenine and Cytosine; e.g., by increasing the voltage from 40 to 140 mV in the 0.5 mM concentration of Adenine or Cytosine, the variance decreased by one third while for the case of Thymine it was doubled. Moreover, according to the structure function of TS, the fractality of current TS differed as a function of varying membrane potentials (pd) and nucleotide concentrations. Accordingly, the calculated permutation entropy of the TS, validated the biophysical approach defined for the recognition of different nucleotides at various concentrations, pd's and polarities. Thus, the promising outcomes of the combined experimental and theoretical methodologies presented here can be implemented as a complementary means in pore-based nucleotide recognition approaches.
Systematic asymmetric nucleotide exchanges produce human mitochondrial RNAs cryptically encoding for overlapping protein coding genes.

PubMed

Seligmann, Hervé

2013-05-07

GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges. Copyright © 2013 Elsevier Ltd. All rights reserved.
Conserved features of eukaryotic hsp70 genes revealed by comparison with the nucleotide sequence of human hsp70.

PubMed Central

Hunt, C; Morimoto, R I

1985-01-01

We have determined the nucleotide sequence of the human hsp70 gene and 5' flanking region. The hsp70 gene is transcribed as an uninterrupted primary transcript of 2440 nucleotides composed of a 5' noncoding leader sequence of 212 nucleotides, a 3' noncoding region of 242 nucleotides, and a continuous open reading frame of 1986 nucleotides that encodes a protein with predicted molecular mass of 69,800 daltons. Upstream of the 5' terminus are the canonical TATAAA box, the sequence ATTGG that corresponds in the inverted orientation to the CCAAT motif, and the dyad sequence CTGGAAT/ATTCCCG that shares homology in 12 of 14 positions with the consensus transcription regulatory sequence common to Drosophila heat shock genes. Comparison of the predicted amino acid sequences of human hsp70 with the published sequences of Drosophila hsp70 and Escherichia coli dnaK reveals that human hsp70 is 73% identical to Drosophila hsp70 and 47% identical to E. coli dnaK. Surprisingly, the nucleotide sequences of the human and Drosophila genes are 72% identical and human and E. coli genes are 50% identical, which is more highly conserved than necessary given the degeneracy of the genetic code. The lack of accumulated silent nucleotide substitutions leads us to propose that there may be additional information in the nucleotide sequence of the hsp70 gene or the corresponding mRNA that precludes the maximum divergence allowed in the silent codon positions. PMID:3931075
A Single Nucleotide Polymorphism in 3′-Untranslated Region Contributes to the Regulation of Toll-like Receptor 4 Translation*

PubMed Central

Sato, Kayo; Yoshimura, Atsutoshi; Kaneko, Takashi; Ukai, Takashi; Ozaki, Yukio; Nakamura, Hirotaka; Li, Xinyue; Matsumura, Hiroyoshi; Hara, Yoshitaka; Ogata, Yorimasa

2012-01-01

We have previously shown that a single nucleotide polymorphism rs11536889 in the 3′-untranslated region (UTR) of TLR4 was associated with periodontitis. In this study the effects of this single nucleotide polymorphism on Toll-like receptor (TLR) 4 expression were investigated. Monocytes from subjects with the C/C genotype expressed higher levels of TLR4 on their surfaces than those from subjects with the other genotypes. Peripheral blood mononuclear cells (PBMCs) from the C/C and G/C subjects secreted higher levels of IL-8 in response to lipopolysaccharide (LPS), a TLR4 ligand, than the cells from the G/G subjects. However, there was no significant difference in TLR4 mRNA levels in PBMCs from the subjects with each genotype. After stimulation with tripalmitoylated CSK4 (Pam3CSK4), TLR4 mRNA levels increased in PBMCs from both the C/C and G/G subjects, whereas TLR4 protein levels increased in PBMCs from the C/C but not G/G subjects. Transient transfection of a series of chimeric luciferase constructs revealed that a fragment of 3′-UTR containing rs11536889 G allele, but not C allele, suppressed luciferase activity induced by LPS or IL-6. Two microRNAs, hsa-miR-1236 and hsa-miR-642a, were predicted to bind to rs11536889 G allele. Inhibition of these microRNAs reversed the suppressed luciferase activity. These microRNA inhibitors also up-regulated endogenous TLR4 protein on THP-1 cells (the G/G genotype) after LPS stimulation. Furthermore, mutant microRNAs that bind to the C allele inhibited the luciferase activity of the construct containing the C allele. These results indicate that genetic variation of rs11536889 contributes to translational regulation of TLR4, possibly by binding to microRNAs. PMID:22661708
Effects of the BDNF Val66Met Polymorphism on Anxiety-Like Behavior Following Nicotine Withdrawal in Mice.

PubMed

Lee, Bridgin G; Anastasia, Agustin; Hempstead, Barbara L; Lee, Francis S; Blendy, Julie A

2015-12-01

Nicotine withdrawal is characterized by both affective and cognitive symptoms. Identifying genetic polymorphisms that could affect the symptoms associated with nicotine withdrawal are important in predicting withdrawal sensitivity and identifying personalized cessation therapies. In the current study we used a mouse model of a non-synonymous single nucleotide polymorphism in the translated region of the brain-derived neurotrophic factor (BDNF) gene that substitutes a valine (Val) for a methionine (Met) amino acid (Val66Met) to examine the relationship between the Val66Met single nucleotide polymorphism and nicotine dependence. This study measured proBDNF and the BDNF prodomain levels following nicotine and nicotine withdrawal and examined a mouse model of a common polymorphism in this protein (BDNF(Met/Met)) in three behavioral paradigms: novelty-induced hypophagia, marble burying, and the open-field test. Using the BDNF knock-in mouse containing the BDNF Val66Met polymorphism we found: (1) blunted anxiety-like behavior in BDNF(Met/Met) mice following withdrawal in three behavioral paradigms: novelty-induced hypophagia, marble burying, and the open-field test; (2) the anxiolytic effects of chronic nicotine are absent in BDNF(Met/Met) mice; and (3) an increase in BDNF prodomain in BDNF(Met/Met) mice following nicotine withdrawal. Our study is the first to examine the effect of the BDNF Val66Met polymorphism on the affective symptoms of withdrawal from nicotine in mice. In these mice, a single-nucleotide polymorphism in the translated region of the BDNF gene can result in a blunted withdrawal, as measured by decreased anxiety-like behavior. The significant increase in the BDNF prodomain in BDNF(Met/Met) mice following nicotine cessation suggests a possible role of this ligand in the circuitry remodeling after withdrawal. © The Author 2015. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Single-Molecule Counting of Point Mutations by Transient DNA Binding

NASA Astrophysics Data System (ADS)

Su, Xin; Li, Lidan; Wang, Shanshan; Hao, Dandan; Wang, Lei; Yu, Changyuan

2017-03-01

High-confidence detection of point mutations is important for disease diagnosis and clinical practice. Hybridization probes are extensively used, but are hindered by their poor single-nucleotide selectivity. Shortening the length of DNA hybridization probes weakens the stability of the probe-target duplex, leading to transient binding between complementary sequences. The kinetics of probe-target binding events are highly dependent on the number of complementary base pairs. Here, we present a single-molecule assay for point mutation detection based on transient DNA binding and use of total internal reflection fluorescence microscopy. Statistical analysis of single-molecule kinetics enabled us to effectively discriminate between wild type DNA sequences and single-nucleotide variants at the single-molecule level. A higher single-nucleotide discrimination is achieved than in our previous work by optimizing the assay conditions, which is guided by statistical modeling of kinetics with a gamma distribution. The KRAS c.34 A mutation can be clearly differentiated from the wild type sequence (KRAS c.34 G) at a relative abundance as low as 0.01% mutant to WT. To demonstrate the feasibility of this method for analysis of clinically relevant biological samples, we used this technology to detect mutations in single-stranded DNA generated from asymmetric RT-PCR of mRNA from two cancer cell lines.
Genome-environment associations in sorghum landraces predict adaptive traits

PubMed Central

Lasky, Jesse R.; Upadhyaya, Hari D.; Ramu, Punna; Deshpande, Santosh; Hash, C. Tom; Bonnette, Jason; Juenger, Thomas E.; Hyma, Katie; Acharya, Charlotte; Mitchell, Sharon E.; Buckler, Edward S.; Brenton, Zachary; Kresovich, Stephen; Morris, Geoffrey P.

2015-01-01

Improving environmental adaptation in crops is essential for food security under global change, but phenotyping adaptive traits remains a major bottleneck. If associations between single-nucleotide polymorphism (SNP) alleles and environment of origin in crop landraces reflect adaptation, then these could be used to predict phenotypic variation for adaptive traits. We tested this proposition in the global food crop Sorghum bicolor, characterizing 1943 georeferenced landraces at 404,627 SNPs and quantifying allelic associations with bioclimatic and soil gradients. Environment explained a substantial portion of SNP variation, independent of geographical distance, and genic SNPs were enriched for environmental associations. Further, environment-associated SNPs predicted genotype-by-environment interactions under experimental drought stress and aluminum toxicity. Our results suggest that genomic signatures of environmental adaptation may be useful for crop improvement, enhancing germplasm identification and marker-assisted selection. Together, genome-environment associations and phenotypic analyses may reveal the basis of environmental adaptation. PMID:26601206

Methods and kits for nucleic acid analysis using fluorescence resonance energy transfer

DOEpatents

Kwok, Pui-Yan; Chen, Xiangning

1999-01-01

A method for detecting the presence of a target nucleotide or sequence of nucleotides in a nucleic acid is disclosed. The method is comprised of forming an oligonucleotide labeled with two fluorophores on the nucleic acid target site. The doubly labeled oligonucleotide is formed by addition of a singly labeled dideoxynucleoside triphosphate to a singly labeled polynucleotide or by ligation of two singly labeled polynucleotides. Detection of fluorescence resonance energy transfer upon denaturation indicates the presence of the target. Kits are also provided. The method is particularly applicable to genotyping.
Single nucleotide polymorphisms in CETP, SLC46A1, SLC19A1, CD36, BCOM1, APOA5, and ABCA1 are significant predictors of plasma HDL in healthy adults

USDA-ARS?s Scientific Manuscript database

In a marker-trait association study we estimated the statistical significance of 65 single nucleotide polymorphisms (SNP) in 23 candidate genes on HDL levels of two independent Caucasian populations. Each population consisted of men and women and their HDL levels were adjusted for gender and body we...
Rhabdomyolysis After Out-of-Water Exercise in an Elite Adolescent Water Polo Player Carrying the IL-6 174C Allele Single-Nucleotide Polymorphism.

PubMed

Eliakim, Alon; Ben Zaken, Sigal; Meckel, Yoav; Yamin, Chen; Dror, Nitzan; Nemet, Dan

2015-12-01

We present an adolescent elite water polo player who despite a genetic predisposition to develop exercise-induced severe muscle damage due to carrying the IL-6 174C allele single-nucleotide polymorphism, developed acute rhabdomyolysis only after a vigorous out-of-water training, suggesting that water polo training may be more suitable for genetically predisposed athletes.
Decreased necrotizing fasciitis capacity caused by a single nucleotide mutation that alters a multiple gene virulence axis

PubMed Central

Olsen, Randall J.; Sitkiewicz, Izabela; Ayeras, Ara A.; Gonulal, Vedia E.; Cantu, Concepcion; Beres, Stephen B.; Green, Nicole M.; Lei, Benfang; Humbird, Tammy; Greaver, Jamieson; Chang, Ellen; Ragasa, Willie P.; Montgomery, Charles A.; Cartwright, Joiner; McGeer, Allison; Low, Donald E.; Whitney, Adeline R.; Cagle, Philip T.; Blasdel, Terry L.; DeLeo, Frank R.; Musser, James M.

2010-01-01

Single-nucleotide changes are the most common cause of natural genetic variation among members of the same species, but there is remarkably little information bearing on how they alter bacterial virulence. We recently discovered a single-nucleotide mutation in the group A Streptococcus genome that is epidemiologically associated with decreased human necrotizing fasciitis (“flesh-eating disease”). Working from this clinical observation, we find that wild-type mtsR function is required for group A Streptococcus to cause necrotizing fasciitis in mice and nonhuman primates. Expression microarray analysis revealed that mtsR inactivation results in overexpression of PrsA, a chaperonin involved in posttranslational maturation of SpeB, an extracellular cysteine protease. Isogenic mutant strains that overexpress prsA or lack speB had decreased secreted protease activity in vivo and recapitulated the necrotizing fasciitis-negative phenotype of the ΔmtsR mutant strain in mice and monkeys. mtsR inactivation results in increased PrsA expression, which in turn causes decreased SpeB secreted protease activity and reduced necrotizing fasciitis capacity. Thus, a naturally occurring single-nucleotide mutation dramatically alters virulence by dysregulating a multiple gene virulence axis. Our discovery has broad implications for the confluence of population genomics and molecular pathogenesis research. PMID:20080771
Decreased necrotizing fasciitis capacity caused by a single nucleotide mutation that alters a multiple gene virulence axis.

PubMed

Olsen, Randall J; Sitkiewicz, Izabela; Ayeras, Ara A; Gonulal, Vedia E; Cantu, Concepcion; Beres, Stephen B; Green, Nicole M; Lei, Benfang; Humbird, Tammy; Greaver, Jamieson; Chang, Ellen; Ragasa, Willie P; Montgomery, Charles A; Cartwright, Joiner; McGeer, Allison; Low, Donald E; Whitney, Adeline R; Cagle, Philip T; Blasdel, Terry L; DeLeo, Frank R; Musser, James M

2010-01-12

Single-nucleotide changes are the most common cause of natural genetic variation among members of the same species, but there is remarkably little information bearing on how they alter bacterial virulence. We recently discovered a single-nucleotide mutation in the group A Streptococcus genome that is epidemiologically associated with decreased human necrotizing fasciitis ("flesh-eating disease"). Working from this clinical observation, we find that wild-type mtsR function is required for group A Streptococcus to cause necrotizing fasciitis in mice and nonhuman primates. Expression microarray analysis revealed that mtsR inactivation results in overexpression of PrsA, a chaperonin involved in posttranslational maturation of SpeB, an extracellular cysteine protease. Isogenic mutant strains that overexpress prsA or lack speB had decreased secreted protease activity in vivo and recapitulated the necrotizing fasciitis-negative phenotype of the DeltamtsR mutant strain in mice and monkeys. mtsR inactivation results in increased PrsA expression, which in turn causes decreased SpeB secreted protease activity and reduced necrotizing fasciitis capacity. Thus, a naturally occurring single-nucleotide mutation dramatically alters virulence by dysregulating a multiple gene virulence axis. Our discovery has broad implications for the confluence of population genomics and molecular pathogenesis research.
Genetic variants associated with the root system architecture of oilseed rape (Brassica napus L.) under contrasting phosphate supply.

PubMed

Wang, Xiaohua; Chen, Yanling; Thomas, Catherine L; Ding, Guangda; Xu, Ping; Shi, Dexu; Grandke, Fabian; Jin, Kemo; Cai, Hongmei; Xu, Fangsen; Yi, Bin; Broadley, Martin R; Shi, Lei

2017-08-01

Breeding crops with ideal root system architecture for efficient absorption of phosphorus is an important strategy to reduce the use of phosphate fertilizers. To investigate genetic variants leading to changes in root system architecture, 405 oilseed rape cultivars were genotyped with a 60K Brassica Infinium SNP array in low and high P environments. A total of 285 single-nucleotide polymorphisms were associated with root system architecture traits at varying phosphorus levels. Nine single-nucleotide polymorphisms corroborate a previous linkage analysis of root system architecture quantitative trait loci in the BnaTNDH population. One peak single-nucleotide polymorphism region on A3 was associated with all root system architecture traits and co-localized with a quantitative trait locus for primary root length at low phosphorus. Two more single-nucleotide polymorphism peaks on A5 for root dry weight at low phosphorus were detected in both growth systems and co-localized with a quantitative trait locus for the same trait. The candidate genes identified on A3 form a haplotype 'BnA3Hap', that will be important for understanding the phosphorus/root system interaction and for the incorporation into Brassica napus breeding programs. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Exonic Splicing Mutations Are More Prevalent than Currently Estimated and Can Be Predicted by Using In Silico Tools

PubMed Central

Soukarieh, Omar; Gaildrat, Pascaline; Hamieh, Mohamad; Drouet, Aurélie; Baert-Desurmont, Stéphanie; Frébourg, Thierry; Tosi, Mario; Martins, Alexandra

2016-01-01

The identification of a causal mutation is essential for molecular diagnosis and clinical management of many genetic disorders. However, even if next-generation exome sequencing has greatly improved the detection of nucleotide changes, the biological interpretation of most exonic variants remains challenging. Moreover, particular attention is typically given to protein-coding changes often neglecting the potential impact of exonic variants on RNA splicing. Here, we used the exon 10 of MLH1, a gene implicated in hereditary cancer, as a model system to assess the prevalence of RNA splicing mutations among all single-nucleotide variants identified in a given exon. We performed comprehensive minigene assays and analyzed patient’s RNA when available. Our study revealed a staggering number of splicing mutations in MLH1 exon 10 (77% of the 22 analyzed variants), including mutations directly affecting splice sites and, particularly, mutations altering potential splicing regulatory elements (ESRs). We then used this thoroughly characterized dataset, together with experimental data derived from previous studies on BRCA1, BRCA2, CFTR and NF1, to evaluate the predictive power of 3 in silico approaches recently described as promising tools for pinpointing ESR-mutations. Our results indicate that ΔtESRseq and ΔHZEI-based approaches not only discriminate which variants affect splicing, but also predict the direction and severity of the induced splicing defects. In contrast, the ΔΨ-based approach did not show a compelling predictive power. Our data indicates that exonic splicing mutations are more prevalent than currently appreciated and that they can now be predicted by using bioinformatics methods. These findings have implications for all genetically-caused diseases. PMID:26761715
Genome-scale characterization of RNA tertiary structures and their functional impact by RNA solvent accessibility prediction.

PubMed

Yang, Yuedong; Li, Xiaomei; Zhao, Huiying; Zhan, Jian; Wang, Jihua; Zhou, Yaoqi

2017-01-01

As most RNA structures are elusive to structure determination, obtaining solvent accessible surface areas (ASAs) of nucleotides in an RNA structure is an important first step to characterize potential functional sites and core structural regions. Here, we developed RNAsnap, the first machine-learning method trained on protein-bound RNA structures for solvent accessibility prediction. Built on sequence profiles from multiple sequence alignment (RNAsnap-prof), the method provided robust prediction in fivefold cross-validation and an independent test (Pearson correlation coefficients, r, between predicted and actual ASA values are 0.66 and 0.63, respectively). Application of the method to 6178 mRNAs revealed its positive correlation to mRNA accessibility by dimethyl sulphate (DMS) experimentally measured in vivo (r = 0.37) but not in vitro (r = 0.07), despite the lack of training on mRNAs and the fact that DMS accessibility is only an approximation to solvent accessibility. We further found strong association across coding and noncoding regions between predicted solvent accessibility of the mutation site of a single nucleotide variant (SNV) and the frequency of that variant in the population for 2.2 million SNVs obtained in the 1000 Genomes Project. Moreover, mapping solvent accessibility of RNAs to the human genome indicated that introns, 5' cap of 5' and 3' cap of 3' untranslated regions, are more solvent accessible, consistent with their respective functional roles. These results support conformational selections as the mechanism for the formation of RNA-protein complexes and highlight the utility of genome-scale characterization of RNA tertiary structures by RNAsnap. The server and its stand-alone downloadable version are available at http://sparks-lab.org. © 2016 Yang et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Genetic diversity based on 28S rDNA sequences among populations of Culex quinquefasciatus collected at different locations in Tamil Nadu, India.

PubMed

Sakthivelkumar, S; Ramaraj, P; Veeramani, V; Janarthanan, S

2015-09-01

The basis of the present study was to distinguish the existence of any genetic variability among populations of Culex quinquefasciatus which would be a valuable tool in the management of mosquito control programmes. In the present study, population of Cx. quinquefasciatus collected at different locations in Tamil Nadu were analyzed for their genetic variation based on 28S rDNA D2 region nucleotide sequences. A high degree of genetic polymorphism was detected in the sequences of D2 region of 28S rDNA on the predicted secondary structures in spite of high nucleotide sequence similarity. The findings based on secondary structure using rDNA sequences suggested the existence of a complex genotypic diversity of Cx. quinquefasciatus population collected at different locations of Tamil Nadu, India. This complexity in genetic diversity in a single mosquito population collected at different locations is considered an important issue towards their influence and nature of vector potential of these mosquitoes.
Nucleotide Substitution in 3' Arm of Bovine MIR-2467 in Five Cattle Breeds.

PubMed

Łukaszewicz, Aneta; Basiak, Szymon; Proskura, Witold Stanisław; Dybus, Andrzej

2015-01-01

The T > C single nucleotide polymorphism (SNP) in the MIR2467 gene was investigated in order to confirm its presence in cattle genome and to check for possible differences in its genotype distribution among different breeds. Additional purpose of the study was to investigate in silico potential effect of that substitution on the structure and stability of precursor mir-2467. The study involved 634 individuals of five cattle breeds: Angus, Hereford, Holstein-Friesian, Jersey, and Limousin, which were genotyped using PCR-RFLP assay. In this study, the presence of T > C polymorphism at position 24 was observed in all the cattle breeds excepting Hereford. In addition, the differences in the genotype distribution among analyzed breeds were indicated. On the basis of minimum free energy structure prediction, the C allele was indicated to have possible impact on decreasing the stability of the pre-mir-2467, thus altering its ability to regulate target genes expression.
Effective oligonucleotide-mediated gene disruption in ES cells lacking the mismatch repair protein MSH3.

PubMed

Dekker, M; Brouwers, C; Aarts, M; van der Torre, J; de Vries, S; van de Vrugt, H; te Riele, H

2006-04-01

We have previously demonstrated that site-specific insertion, deletion or substitution of one or two nucleotides in mouse embryonic stem cells (ES cells) by single-stranded deoxyribo-oligonucleotides is several hundred-fold suppressed by DNA mismatch repair (MMR) activity. Here, we have investigated whether compound mismatches and larger insertions escape detection by the MMR machinery and can be effectively introduced in MMR-proficient cells. We identified several compound mismatches that escaped detection by the MMR machinery to some extent, but could not define general rules predicting the efficacy of complex base-pair substitutions. In contrast, we found that four-nucleotide insertions were largely subject to suppression by the MSH2/MSH3 branch of MMR and could be effectively introduced in Msh3-deficient cells. As these cells have no overt mutator phenotype and Msh3-deficient mice do not develop cancer, Msh3-deficient ES cells can be used for oligonucleotide-mediated gene disruption. As an example, we present disruption of the Fanconi anemia gene Fancf.
Nucleotide sequence of the Saccharomyces cerevisiae PUT4 proline-permease-encoding gene: similarities between CAN1, HIP1 and PUT4 permeases.

PubMed

Vandenbol, M; Jauniaux, J C; Grenson, M

1989-11-15

The complete nucleotide (nt) sequence of the PUT4 gene, whose product is required for high-affinity proline active transport in the yeast Saccharomyces cerevisiae, is presented. The sequence contains a single long open reading frame of 1881 nt, encoding a polypeptide with a calculated Mr of 68,795. The predicted protein is strongly hydrophobic and exhibits six potential glycosylation sites. Its hydropathy profile suggests the presence of twelve membrane-spanning regions flanked by hydrophilic N- and C-terminal domains. The N terminus does not resemble signal sequences found in secreted proteins. These features are characteristic of integral membrane proteins catalyzing translocation of ligands across cellular membranes. Protein sequence comparisons indicate strong resemblance to the arginine and histidine permeases of S. cerevisiae, but no marked sequence similarity to the proline permease of Escherichia coli or to other known prokaryotic or eukaryotic transport proteins. The strong similarity between the three yeast amino acid permeases suggests a common ancestor for the three proteins.
Molecular epidemiology of measles viruses in China, 1995–2003

PubMed Central

Zhang, Yan; Zhu, Zhen; Rota, Paul A; Jiang, Xiaohong; Hu, Jiayu; Wang, Jianguo; Tang, Wei; Zhang, Zhenying; Li, Congyong; Wang, Changyin; Wang, Tongzhan; Zheng, Lei; Tian, Hong; Ling, Hua; Zhao, Chunfang; Ma, Yan; Lin, Chunyan; He, Jilan; Tian, Jiang; Ma, Yan; Li, Ping; Guan, Ronghui; He, Weikuan; Zhou, Jianhui; Liu, Guiyan; Zhang, Hong; Yan, Xinge; Yang, Xuelei; Zhang, Jinlin; Lu, Yiyu; Zhou, Shunde; Ba, Zhuoma; Liu, Wei; Yang , Xiuhui; Ma, Yujie; Liang, Yong; Li, Yeqiang; Ji, Yixin; Featherstone, David; Bellini, William J; Xu, Songtao; Liang, Guodong; Xu, Wenbo

2007-01-01

This report describes the genetic characterization of 297 wild-type measles viruses that were isolated in 24 provinces of China between 1995 and 2003. Phylogenetic analysis of the N gene sequences showed that all of the isolates belonged to genotype H1 except 3 isolates, which were genotype A. The nucleotide sequence and predicted amino acid homologies of the 294-genotype H1 strains were 94.7%–100% and 93.3%–100%, respectively. The genotype H1 isolates were divided into 2 clusters, which differed by approximately 2.9% at the nucleotide level. Viruses from both clusters were distributed throughout China with no apparent geographic restriction and multiple co-circulating lineages were present in many provinces. Even though other measles genotypes have been detected in countries that border China, this report shows that genotype H1 is widely distributed throughout the country and that China has a single, endemic genotype. This important baseline data will help to monitor the progress of measles control in China. PMID:17280609
Clonal architecture of secondary acute myeloid leukemia defined by single-cell sequencing.

PubMed

Hughes, Andrew E O; Magrini, Vincent; Demeter, Ryan; Miller, Christopher A; Fulton, Robert; Fulton, Lucinda L; Eades, William C; Elliott, Kevin; Heath, Sharon; Westervelt, Peter; Ding, Li; Conrad, Donald F; White, Brian S; Shao, Jin; Link, Daniel C; DiPersio, John F; Mardis, Elaine R; Wilson, Richard K; Ley, Timothy J; Walter, Matthew J; Graubert, Timothy A

2014-07-01

Next-generation sequencing has been used to infer the clonality of heterogeneous tumor samples. These analyses yield specific predictions-the population frequency of individual clones, their genetic composition, and their evolutionary relationships-which we set out to test by sequencing individual cells from three subjects diagnosed with secondary acute myeloid leukemia, each of whom had been previously characterized by whole genome sequencing of unfractionated tumor samples. Single-cell mutation profiling strongly supported the clonal architecture implied by the analysis of bulk material. In addition, it resolved the clonal assignment of single nucleotide variants that had been initially ambiguous and identified areas of previously unappreciated complexity. Accordingly, we find that many of the key assumptions underlying the analysis of tumor clonality by deep sequencing of unfractionated material are valid. Furthermore, we illustrate a single-cell sequencing strategy for interrogating the clonal relationships among known variants that is cost-effective, scalable, and adaptable to the analysis of both hematopoietic and solid tumors, or any heterogeneous population of cells.
Development of 101 novel EST-derived single nucleotide polymorphism markers for Zhikong scallop ( Chlamys farreri)

NASA Astrophysics Data System (ADS)

Li, Jiqin; Bao, Zhenmin; Li, Ling; Wang, Xiaojian; Wang, Shi; Hu, Xiaoli

2013-09-01

Zhikong scallop ( Chlamys farreri) is an important maricultured species in China. Many researches on this species, such as population genetics and QTL fine-mapping, need a large number of molecular markers. In this study, based on the expressed sequence tags (EST), a total of 300 putative single nucleotide polymorphisms (SNPs) were selected and validated using high resolution melting (HRM) technology with unlabeled probe. Of them, 101 (33.7%) were found to be polymorphic in 48 individuals from 4 populations. Further evaluation with 48 individuals from Qingdao population showed that all the polymorphic loci had two alleles with the minor allele frequency ranged from 0.046 to 0.500. The observed and expected heterozygosities ranged from 0.000 to 0.925 and from 0.089 to 0.505, respectively. Fifteen loci deviated significantly from Hardy-Weinberg equilibrium and significant linkage disequilibrate was detected in one pair of markers. BLASTx gave significant hits for 72 of the 101 polymorphic SNP-containing ESTs. Thirty four polymorphic SNP loci were predicted to be non-synonymous substitutions as they caused either the change of codons (33 SNPs) or pretermination of translation (1 SNP). The markers developed can be used for the population studies and genetic improvement on Zhikong scallop.
Pharmacogenetics of asthma

PubMed Central

Lima, John J.; Blake, Kathryn V.; Tantisira, Kelan G.; Weiss, Scott T.

2009-01-01

Purpose of review Patient response to the asthma drug classes, bronchodilators, inhaled corticosteroids and leukotriene modifiers, are characterized by a large degree of heterogeneity, which is attributable in part to genetic variation. Herein, we review and update the pharmacogenetics and pharmaogenomics of common asthma drugs. Recent findings Early studies suggest that bronchodilator reversibility and asthma worsening in patients on continuous short-acting and long-acting β-agonists are related to the Gly16Arg genotype for the ADRB2. More recent studies including genome-wide association studies implicate variants in other genes contribute to bronchodilator response heterogeneity and fail to replicate asthma worsening associated with continuous β-agonist use. Genetic determinants of the safety of long-acting β-agonist require further study. Variants in CRHR1, TBX21, and FCER2 contribute to variability in response for lung function, airways responsiveness, and exacerbations in patients taking inhaled corticosteroids. Variants in ALOX5, LTA4H, LTC4S, ABCC1, CYSLTR2, and SLCO2B1 contribute to variability in response to leukotriene modifiers. Summary Identification of novel variants that contribute to response heterogeneity supports future studies of single nucleotide polymorphism discovery and include gene expression and genome-wide association studies. Statistical models that predict the genomics of response to asthma drugs will complement single nucleotide polymorphism discovery in moving toward personalized medicine. PMID:19077707
Substitution scanning identifies a novel, catalytically active ibrutinib-resistant BTK cysteine 481 to threonine (C481T) variant

PubMed Central

Hamasy, A; Wang, Q; Blomberg, K E M; Mohammad, D K; Yu, L; Vihinen, M; Berglöf, A; Smith, C I E

2017-01-01

Irreversible Bruton tyrosine kinase (BTK) inhibitors, ibrutinib and acalabrutinib have demonstrated remarkable clinical responses in multiple B-cell malignancies. Acquired resistance has been identified in a sub-population of patients in which mutations affecting BTK predominantly substitute cysteine 481 in the kinase domain for catalytically active serine, thereby ablating covalent binding of inhibitors. Activating substitutions in the BTK substrate phospholipase Cγ2 (PLCγ2) instead confers resistance independent of BTK. Herein, we generated all six possible amino acid substitutions due to single nucleotide alterations for the cysteine 481 codon, in addition to threonine, requiring two nucleotide substitutions, and performed functional analysis. Replacement by arginine, phenylalanine, tryptophan or tyrosine completely inactivated the catalytic activity, whereas substitution with glycine caused severe impairment. BTK with threonine replacement was catalytically active, similar to substitution with serine. We identify three potential ibrutinib resistance scenarios for cysteine 481 replacement: (1) Serine, being catalytically active and therefore predominating among patients. (2) Threonine, also being catalytically active, but predicted to be scarce, because two nucleotide changes are needed. (3) As BTK variants replaced with other residues are catalytically inactive, they presumably need compensatory mutations, therefore being very scarce. Glycine and tryptophan variants were not yet reported but likely also provide resistance. PMID:27282255
Identification of two allelic IgG1 C(H) coding regions (Cgamma1) of cat.

PubMed

Kanai, T H; Ueda, S; Nakamura, T

2000-01-31

Two types of cDNA encoding IgG1 heavy chain (gamma1) were isolated from a single domestic short-hair cat. Sequence analysis indicated a higher level of similarity of these Cgamma1 sequences to human Cgamma1 sequence (76.9 and 77.0%) than to mouse sequence (70.0 and 69.7%) at the nucleotide level. Predicted primary structures of both the feline Cgamma1 genes, designated as Cgamma1a and Cgamma1b, were similar to that of human Cgamma1 gene, for instance, as to the size of constant domains, the presence of six conserved cysteine residues involved in formation of the domain structure, and the location of a conserved N-linked glycosylation site. Sequence comparison between the two alleles showed that 7 out of 10 nucleotide differences were within the C(H)3 domain coding region, all leading to nonsynonymous changes in amino acid residues. Partial sequence analysis of genomic clones showed three nucleotide substitutions between the two Cgamma1 alleles in the intron between the CH2 and C(H)3 domain coding regions. In 12 domestic short-hair cats used in this study, the frequency of Cgamma1a allele (62.5%) was higher than that of the Cgamma1b allele (37.5%).
Molecular Characterization of Bombyx mori Cytoplasmic Polyhedrosis Virus Genome Segment 4

PubMed Central

Ikeda, Keiko; Nagaoka, Sumiharu; Winkler, Stefan; Kotani, Kumiko; Yagi, Hiroaki; Nakanishi, Kae; Miyajima, Shigetoshi; Kobayashi, Jun; Mori, Hajime

2001-01-01

The complete nucleotide sequence of the genome segment 4 (S4) of Bombyx mori cytoplasmic polyhedrosis virus (BmCPV) was determined. The 3,259-nucleotide sequence contains a single long open reading frame which spans nucleotides 14 to 3187 and which is predicted to encode a protein with a molecular mass of about 130 kDa. Western blot analysis showed that S4 encodes BmCPV protein VP3, which is one of the outer components of the BmCPV virion. Sequence analysis of the deduced amino acid sequence of BmCPV VP3 revealed possible sequence homology with proteins from rice ragged stunt virus (RRSV) S2, Nilaparvata lugens reovirus S4, and Fiji disease fijivirus S4. This may suggest that plant reoviruses originated from insect viruses and that RRSV emerged more recently than other plant reoviruses. A chimeric protein consisting of BmCPV VP3 and green fluorescent protein (GFP) was constructed and expressed with BmCPV polyhedrin using a baculovirus expression vector. The VP3-GFP chimera was incorporated into BmCPV polyhedra and released under alkaline conditions. The results indicate that specific interactions occur between BmCPV polyhedrin and VP3 which might facilitate BmCPV virion occlusion into the polyhedra. PMID:11134312
Integration of Structural Dynamics and Molecular Evolution via Protein Interaction Networks: A New Era in Genomic Medicine

PubMed Central

Kumar, Avishek; Butler, Brandon M.; Kumar, Sudhir; Ozkan, S. Banu

2016-01-01

Summary Sequencing technologies are revealing many new non-synonymous single nucleotide variants (nsSNVs) in each personal exome. To assess their functional impacts, comparative genomics is frequently employed to predict if they are benign or not. However, evolutionary analysis alone is insufficient, because it misdiagnoses many disease-associated nsSNVs, such as those at positions involved in protein interfaces, and because evolutionary predictions do not provide mechanistic insights into functional change or loss. Structural analyses can aid in overcoming both of these problems by incorporating conformational dynamics and allostery in nSNV diagnosis. Finally, protein-protein interaction networks using systems-level methodologies shed light onto disease etiology and pathogenesis. Bridging these network approaches with structurally resolved protein interactions and dynamics will advance genomic medicine. PMID:26684487

Prediction of Nucleotide Binding Peptides Using Star Graph Topological Indices.

PubMed

Liu, Yong; Munteanu, Cristian R; Fernández Blanco, Enrique; Tan, Zhiliang; Santos Del Riego, Antonino; Pazos, Alejandro

2015-11-01

The nucleotide binding proteins are involved in many important cellular processes, such as transmission of genetic information or energy transfer and storage. Therefore, the screening of new peptides for this biological function is an important research topic. The current study proposes a mixed methodology to obtain the first classification model that is able to predict new nucleotide binding peptides, using only the amino acid sequence. Thus, the methodology uses a Star graph molecular descriptor of the peptide sequences and the Machine Learning technique for the best classifier. The best model represents a Random Forest classifier based on two features of the embedded and non-embedded graphs. The performance of the model is excellent, considering similar models in the field, with an Area Under the Receiver Operating Characteristic Curve (AUROC) value of 0.938 and true positive rate (TPR) of 0.886 (test subset). The prediction of new nucleotide binding peptides with this model could be useful for drug target studies in drug development. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Association of Cytokine Candidate Genes with Severity of Pain and Co-Occurring Symptoms in Breast Cancer Patients Receiving Chemotherapy

DTIC Science & Technology

2013-10-01

identify common genetic variations (i.e., single nucleotide polymorphisms [ SNPs ] and haplotypes) in cytokine genes, as well demographic, clinical, and...Center. The purpose of the proposed project is to identify common genetic variations (i.e., single nucleotide polymorphisms [ SNPs ] and haplotypes) in...research team continues to meet monthly to discuss progress with regards to recruitment, enrollment, and data collection. Training in Genetics In year
Single-cell analysis of intercellular heteroplasmy of mtDNA in Leber hereditary optic neuropathy

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kobayashi, Y.; Sharpe, H.; Brown, N.

1994-07-01

The authors have investigated the distribution of mutant mtDNA molecules in single cells from a patient with Leber hereditary optic neuropathy (LHON). LHON is a maternally inherited disease that is characterized by a sudden-onset bilateral loss of central vision, which typically occurs in early adulthood. More than 50% of all LHON patients carry an mtDNA mutation at nucleotide position 11778. This nucleotide change converts a highly conserved arginine residue to histidine at codon 340 in the NADH-ubiquinone oxidoreductase subunit 4 (ND4) gene of mtDNA. In the present study, the authors used PCR amplification of mtDNA from lymphocytes to investigate mtDNAmore » heteroplasmy at the single-cell level in a LHON patient. They found that most cells were either homoplasmic normal or homoplasmic mutant at nucleotide position 11778. Some (16%) cells contained both mutant and normal mtDNA.« less
Single Locked Nucleic Acid-Enhanced Nanopore Genetic Discrimination of Pathogenic Serotypes and Cancer Driver Mutations.

PubMed

Tian, Kai; Chen, Xiaowei; Luan, Binquan; Singh, Prashant; Yang, Zhiyu; Gates, Kent S; Lin, Mengshi; Mustapha, Azlin; Gu, Li-Qun

2018-05-22

Accurate and rapid detection of single-nucleotide polymorphism (SNP) in pathogenic mutants is crucial for many fields such as food safety regulation and disease diagnostics. Current detection methods involve laborious sample preparations and expensive characterizations. Here, we investigated a single locked nucleic acid (LNA) approach, facilitated by a nanopore single-molecule sensor, to accurately determine SNPs for detection of Shiga toxin producing Escherichia coli (STEC) serotype O157:H7, and cancer-derived EGFR L858R and KRAS G12D driver mutations. Current LNA applications that require incorporation and optimization of multiple LNA nucleotides. But we found that in the nanopore system, a single LNA introduced in the probe is sufficient to enhance the SNP discrimination capability by over 10-fold, allowing accurate detection of the pathogenic mutant DNA mixed in a large amount of the wild-type DNA. Importantly, the molecular mechanistic study suggests that such a significant improvement is due to the effect of the single-LNA that both stabilizes the fully matched base-pair and destabilizes the mismatched base-pair. This sensitive method, with a simplified, low cost, easy-to-operate LNA design, could be generalized for various applications that need rapid and accurate identification of single-nucleotide variations.
Genetic risk of prediabetes and diabetes development in chronic myeloid leukemia patients treated with nilotinib.

PubMed

Martino, Bruno; Mammì, Corrado; Labate, Claudia; Rodi, Silvia; Ielo, Domenica; Priolo, Manuela; Postorino, Maurizio; Tripepi, Giovanni; Ronco, Francesca; Laganà, Carmelo; Musolino, Caterina; Greco, Marianna; La Nasa, Giorgio; Caocci, Giovanni

2017-11-01

Impaired fasting glucose and type 2 diabetes represent adverse events in patients with chronic myeloid leukemia (CML) treated with the second generation tyrosine kinase inhibitor nilotinib. An unweighted genetic risk score (uGRS) for the prediction of insulin resistance, consisting of 10 multiple single-nucleotide polymorphisms, has been proposed. We evaluated uGRS predictivity in 61 CML patients treated with nilotinib. Patients were genotyped for IRS1, GRB14, ARL15, PPARG, PEPD, ANKRD55/MAP3K1, PDGFC, LYPLAL1, RSPO3, and FAM13A1 genes. The uGRS was based on the sum of the risk alleles within the set of selected single-nucleotide polymorphisms. Molecular response (MR) 3.0 and MR 4.0 were achieved in 90% and 79% of patients, respectively. Before treatment, none of the patients had abnormal blood glucose. During treatment and subsequent follow-up at 80.2 months (range: 1-298), seven patients (11.5%) had developed diabetes that required oral treatment, a median of 14 months (range: 3-98) after starting nilotinib treatment. Twelve patients (19.7%) had developed prediabetes. Prediabetes/diabetes-free survival was significantly higher in patients with a uGRS <10 than in those with higher scores (100% vs. 22.8 ± 12.4%, p <0.001). Each increment of one unit in the uGRS caused a 42% increase in the prediabetes/diabetes risk (hazard ratio = 1.42, confidence interval: 1.04-1.94, p = 0.026). The presence of more than 10 allelic variants associated with insulin secretion, processing, sensitivity, and clearance is predictive of prediabetes/diabetes development in CML patients treated with nilotinib. In clinical practice, uGRS could help tailor the best tyrosine kinase inhibitor therapy. Copyright © 2017 ISEH – Society for Hematology and Stem Cells. Published by Elsevier Inc. All rights reserved.
Association of Allelic Interaction of Single Nucleotide Polymorphisms of Influx and Efflux Transporters Genes With Nonhematologic Adverse Events of Docetaxel in Breast Cancer Patients.

PubMed

Jabir, Rafid Salim; Ho, Gwo Fuang; Annuar, Muhammad Azrif Bin Ahmad; Stanslas, Johnson

2018-05-04

Nonhematologic adverse events (AEs) of docetaxel constitute an extra burden in the treatment of cancer patients and necessitate either a dose reduction or an outright switch of docetaxel for other regimens. These AEs are frequently associated with genetic polymorphisms of genes encoding for proteins involved docetaxel disposition. Therefore, we investigated that association in Malaysian breast cancer patients. A total of 110 Malaysian breast cancer patients were enrolled in the present study, and their blood samples were investigated for different single nucleotide polymorphisms using polymerase chain reaction restriction fragment length polymorphism. AEs were evaluated using the Common Terminology Criteria for Adverse Events, version 4.0. Fatigue, nausea, oral mucositis, and vomiting were the most common nonhematologic AEs. Rash was associated with heterozygous and mutant genotypes of ABCB1 3435C>T (P < .05). Moreover, patients carrying the GG genotype of ABCB1 2677G>A/T reported more fatigue than those carrying the heterozygous genotype GA (P < .05). The presence of ABCB1 3435-T, ABCC2 3972-C, ABCC2 1249-G, and ABCB1 2677-G alleles was significantly associated with nausea and oral mucositis. The coexistence of ABCB1 3435-C, ABCC2 3972-C, ABCC2 1249-G, and ABCB1 2677-A was significantly associated with vomiting (P < .05). The prevalence of nonhematologic AEs in breast cancer patients treated with docetaxel has been relatively high. The variant allele of ABCB1 3435C>T polymorphism could be a potential predictive biomarker of docetaxel-induced rash, and homozygous wild-type ABCB1 2677G>A/T might predict for a greater risk of fatigue. In addition, the concurrent presence of specific alleles could be predictive of vomiting, nausea, and oral mucositis. Copyright © 2018 Elsevier Inc. All rights reserved.
Single-Nucleotide Polymorphisms Reveal Spatial Diversity Among Clones of Yersinia pestis During Plague Outbreaks in Colorado and the Western United States.

PubMed

Lowell, Jennifer L; Antolin, Michael F; Andersen, Gary L; Hu, Ping; Stokowski, Renee P; Gage, Kenneth L

2015-05-01

In western North America, plague epizootics caused by Yersinia pestis appear to sweep across landscapes, primarily infecting and killing rodents, especially ground squirrels and prairie dogs. During these epizootics, the risk of Y. pestis transmission to humans is highest. While empirical models that include climatic conditions and densities of rodent hosts and fleas can predict when epizootics are triggered, bacterial transmission patterns across landscapes, and the scale at which Y. pestis is maintained in nature during inter-epizootic periods, are poorly defined. Elucidating the spatial extent of Y. pestis clones during epizootics can determine whether bacteria are propagated across landscapes or arise independently from local inter-epizootic maintenance reservoirs. We used DNA microarray technology to identify single-nucleotide polymorphisms (SNPs) in 34 Y. pestis isolates collected in the western United States from 1980 to 2006, 21 of which were collected during plague epizootics in Colorado. Phylogenetic comparisons were used to elucidate the hypothesized spread of Y. pestis between the mountainous Front Range and the eastern plains of northern Colorado during epizootics. Isolates collected from across the western United States were included for regional comparisons. By identifying SNPs that mark individual clones, our results strongly suggest that Y. pestis is maintained locally and that widespread epizootic activity is caused by multiple clones arising independently at small geographic scales. This is in contrast to propagation of individual clones being transported widely across landscapes. Regionally, our data are consistent with the notion that Y. pestis diversifies at relatively local scales following long-range translocation events. We recommend that surveillance and prediction by public health and wildlife management professionals focus more on models of local or regional weather patterns and ecological factors that may increase risk of widespread epizootics, rather than predicting or attempting to explain epizootics on the basis of movement of host species that may transport plague.
EnsembleGASVR: a novel ensemble method for classifying missense single nucleotide polymorphisms.

PubMed

Rapakoulia, Trisevgeni; Theofilatos, Konstantinos; Kleftogiannis, Dimitrios; Likothanasis, Spiros; Tsakalidis, Athanasios; Mavroudi, Seferina

2014-08-15

Single nucleotide polymorphisms (SNPs) are considered the most frequently occurring DNA sequence variations. Several computational methods have been proposed for the classification of missense SNPs to neutral and disease associated. However, existing computational approaches fail to select relevant features by choosing them arbitrarily without sufficient documentation. Moreover, they are limited to the problem of missing values, imbalance between the learning datasets and most of them do not support their predictions with confidence scores. To overcome these limitations, a novel ensemble computational methodology is proposed. EnsembleGASVR facilitates a two-step algorithm, which in its first step applies a novel evolutionary embedded algorithm to locate close to optimal Support Vector Regression models. In its second step, these models are combined to extract a universal predictor, which is less prone to overfitting issues, systematizes the rebalancing of the learning sets and uses an internal approach for solving the missing values problem without loss of information. Confidence scores support all the predictions and the model becomes tunable by modifying the classification thresholds. An extensive study was performed for collecting the most relevant features for the problem of classifying SNPs, and a superset of 88 features was constructed. Experimental results show that the proposed framework outperforms well-known algorithms in terms of classification performance in the examined datasets. Finally, the proposed algorithmic framework was able to uncover the significant role of certain features such as the solvent accessibility feature, and the top-scored predictions were further validated by linking them with disease phenotypes. Datasets and codes are freely available on the Web at http://prlab.ceid.upatras.gr/EnsembleGASVR/dataset-codes.zip. All the required information about the article is available through http://prlab.ceid.upatras.gr/EnsembleGASVR/site.html. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Polygenic hazard score to guide screening for aggressive prostate cancer: development and validation in large scale cohorts.

PubMed

Seibert, Tyler M; Fan, Chun Chieh; Wang, Yunpeng; Zuber, Verena; Karunamuni, Roshan; Parsons, J Kellogg; Eeles, Rosalind A; Easton, Douglas F; Kote-Jarai, ZSofia; Al Olama, Ali Amin; Garcia, Sara Benlloch; Muir, Kenneth; Grönberg, Henrik; Wiklund, Fredrik; Aly, Markus; Schleutker, Johanna; Sipeky, Csilla; Tammela, Teuvo Lj; Nordestgaard, Børge G; Nielsen, Sune F; Weischer, Maren; Bisbjerg, Rasmus; Røder, M Andreas; Iversen, Peter; Key, Tim J; Travis, Ruth C; Neal, David E; Donovan, Jenny L; Hamdy, Freddie C; Pharoah, Paul; Pashayan, Nora; Khaw, Kay-Tee; Maier, Christiane; Vogel, Walther; Luedeke, Manuel; Herkommer, Kathleen; Kibel, Adam S; Cybulski, Cezary; Wokolorczyk, Dominika; Kluzniak, Wojciech; Cannon-Albright, Lisa; Brenner, Hermann; Cuk, Katarina; Saum, Kai-Uwe; Park, Jong Y; Sellers, Thomas A; Slavov, Chavdar; Kaneva, Radka; Mitev, Vanio; Batra, Jyotsna; Clements, Judith A; Spurdle, Amanda; Teixeira, Manuel R; Paulo, Paula; Maia, Sofia; Pandha, Hardev; Michael, Agnieszka; Kierzek, Andrzej; Karow, David S; Mills, Ian G; Andreassen, Ole A; Dale, Anders M

2018-01-10

To develop and validate a genetic tool to predict age of onset of aggressive prostate cancer (PCa) and to guide decisions of who to screen and at what age. Analysis of genotype, PCa status, and age to select single nucleotide polymorphisms (SNPs) associated with diagnosis. These polymorphisms were incorporated into a survival analysis to estimate their effects on age at diagnosis of aggressive PCa (that is, not eligible for surveillance according to National Comprehensive Cancer Network guidelines; any of Gleason score ≥7, stage T3-T4, PSA (prostate specific antigen) concentration ≥10 ng/L, nodal metastasis, distant metastasis). The resulting polygenic hazard score is an assessment of individual genetic risk. The final model was applied to an independent dataset containing genotype and PSA screening data. The hazard score was calculated for these men to test prediction of survival free from PCa. Multiple institutions that were members of international PRACTICAL consortium. All consortium participants of European ancestry with known age, PCa status, and quality assured custom (iCOGS) array genotype data. The development dataset comprised 31 747 men; the validation dataset comprised 6411 men. Prediction with hazard score of age of onset of aggressive cancer in validation set. In the independent validation set, the hazard score calculated from 54 single nucleotide polymorphisms was a highly significant predictor of age at diagnosis of aggressive cancer (z=11.2, P<10 -16 ). When men in the validation set with high scores (>98th centile) were compared with those with average scores (30th-70th centile), the hazard ratio for aggressive cancer was 2.9 (95% confidence interval 2.4 to 3.4). Inclusion of family history in a combined model did not improve prediction of onset of aggressive PCa (P=0.59), and polygenic hazard score performance remained high when family history was accounted for. Additionally, the positive predictive value of PSA screening for aggressive PCa was increased with increasing polygenic hazard score. Polygenic hazard scores can be used for personalised genetic risk estimates that can predict for age at onset of aggressive PCa. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Draft genome sequence of Cicer reticulatum L., the wild progenitor of chickpea provides a resource for agronomic trait improvement.

PubMed

Gupta, Sonal; Nawaz, Kashif; Parween, Sabiha; Roy, Riti; Sahu, Kamlesh; Kumar Pole, Anil; Khandal, Hitaishi; Srivastava, Rishi; Kumar Parida, Swarup; Chattopadhyay, Debasis

2017-02-01

Cicer reticulatum L. is the wild progenitor of the fourth most important legume crop chickpea (C. arietinum L.). We assembled short-read sequences into 416 Mb draft genome of C. reticulatum and anchored 78% (327 Mb) of this assembly to eight linkage groups. Genome annotation predicted 25,680 protein-coding genes covering more than 90% of predicted gene space. The genome assembly shared a substantial synteny and conservation of gene orders with the genome of the model legume Medicago truncatula. Resistance gene homologs of wild and domesticated chickpeas showed high sequence homology and conserved synteny. Comparison of gene sequences and nucleotide diversity using 66 wild and domesticated chickpea accessions suggested that the desi type chickpea was genetically closer to the wild species than the kabuli type. Comparative analyses predicted gene flow between the wild and the cultivated species during domestication. Molecular diversity and population genetic structure determination using 15,096 genome-wide single nucleotide polymorphisms revealed an admixed domestication pattern among cultivated (desi and kabuli) and wild chickpea accessions belonging to three population groups reflecting significant influence of parentage or geographical origin for their cultivar-specific population classification. The assembly and the polymorphic sequence resources presented here would facilitate the study of chickpea domestication and targeted use of wild Cicer germplasms for agronomic trait improvement in chickpea. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Single-nucleotide polymorphism in the human mu opioid receptor gene alters beta-endorphin binding and activity: possible implications for opiate addiction.

PubMed

Bond, C; LaForge, K S; Tian, M; Melia, D; Zhang, S; Borg, L; Gong, J; Schluger, J; Strong, J A; Leal, S M; Tischfield, J A; Kreek, M J; Yu, L

1998-08-04

Opioid drugs play important roles in the clinical management of pain, as well as in the development and treatment of drug abuse. The mu opioid receptor is the primary site of action for the most commonly used opioids, including morphine, heroin, fentanyl, and methadone. By sequencing DNA from 113 former heroin addicts in methadone maintenance and 39 individuals with no history of drug or alcohol abuse or dependence, we have identified five different single-nucleotide polymorphisms (SNPs) in the coding region of the mu opioid receptor gene. The most prevalent SNP is a nucleotide substitution at position 118 (A118G), predicting an amino acid change at a putative N-glycosylation site. This SNP displays an allelic frequency of approximately 10% in our study population. Significant differences in allele distribution were observed among ethnic groups studied. The variant receptor resulting from the A118G SNP did not show altered binding affinities for most opioid peptides and alkaloids tested. However, the A118G variant receptor binds beta-endorphin, an endogenous opioid that activates the mu opioid receptor, approximately three times more tightly than the most common allelic form of the receptor. Furthermore, beta-endorphin is approximately three times more potent at the A118G variant receptor than at the most common allelic form in agonist-induced activation of G protein-coupled potassium channels. These results show that SNPs in the mu opioid receptor gene can alter binding and signal transduction in the resulting receptor and may have implications for normal physiology, therapeutics, and vulnerability to develop or protection from diverse diseases including the addictive diseases.
Structural Basis for Nucleotide Exchange in Heterotrimeric G Proteins

PubMed Central

Dror, Ron O.; Mildorf, Thomas J.; Hilger, Daniel; Manglik, Aashish; Borhani, David W.; Arlow, Daniel H.; Philippsen, Ansgar; Villanueva, Nicolas; Yang, Zhongyu; Lerch, Michael T.; Hubbell, Wayne L.; Kobilka, Brian K.; Sunahara, Roger K.; Shaw, David E.

2016-01-01

G protein–coupled receptors (GPCRs) relay diverse extracellular signals into cells by catalyzing nucleotide release from heterotrimeric G proteins, but the mechanism underlying this quintessential molecular signaling event has remained unclear. Here we use atomic-level simulations to elucidate the nucleotide-release mechanism. We find that the G protein α subunit Ras and helical domains—previously observed to separate widely upon receptor binding to expose the nucleotide-binding site—separate spontaneously and frequently even in the absence of a receptor. Domain separation is necessary but not sufficient for rapid nucleotide release. Rather, receptors catalyze nucleotide release by favoring an internal structural rearrangement of the Ras domain that weakens its nucleotide affinity. We use double electron-electron resonance spectroscopy and protein engineering to confirm predictions of our computationally determined mechanism. PMID:26089515
Predicting protein-binding regions in RNA using nucleotide profiles and compositions.

PubMed

Choi, Daesik; Park, Byungkyu; Chae, Hanju; Lee, Wook; Han, Kyungsook

2017-03-14

Motivated by the increased amount of data on protein-RNA interactions and the availability of complete genome sequences of several organisms, many computational methods have been proposed to predict binding sites in protein-RNA interactions. However, most computational methods are limited to finding RNA-binding sites in proteins instead of protein-binding sites in RNAs. Predicting protein-binding sites in RNA is more challenging than predicting RNA-binding sites in proteins. Recent computational methods for finding protein-binding sites in RNAs have several drawbacks for practical use. We developed a new support vector machine (SVM) model for predicting protein-binding regions in mRNA sequences. The model uses sequence profiles constructed from log-odds scores of mono- and di-nucleotides and nucleotide compositions. The model was evaluated by standard 10-fold cross validation, leave-one-protein-out (LOPO) cross validation and independent testing. Since actual mRNA sequences have more non-binding regions than protein-binding regions, we tested the model on several datasets with different ratios of protein-binding regions to non-binding regions. The best performance of the model was obtained in a balanced dataset of positive and negative instances. 10-fold cross validation with a balanced dataset achieved a sensitivity of 91.6%, a specificity of 92.4%, an accuracy of 92.0%, a positive predictive value (PPV) of 91.7%, a negative predictive value (NPV) of 92.3% and a Matthews correlation coefficient (MCC) of 0.840. LOPO cross validation showed a lower performance than the 10-fold cross validation, but the performance remains high (87.6% accuracy and 0.752 MCC). In testing the model on independent datasets, it achieved an accuracy of 82.2% and an MCC of 0.656. Testing of our model and other state-of-the-art methods on a same dataset showed that our model is better than the others. Sequence profiles of log-odds scores of mono- and di-nucleotides were much more powerful features than nucleotide compositions in finding protein-binding regions in RNA sequences. But, a slight performance gain was obtained when using the sequence profiles along with nucleotide compositions. These are preliminary results of ongoing research, but demonstrate the potential of our approach as a powerful predictor of protein-binding regions in RNA. The program and supporting data are available at http://bclab.inha.ac.kr/RBPbinding .
Distinct requirements within the Msh3 nucleotide binding pocket for mismatch and double-strand break repair.

PubMed

Kumar, Charanya; Williams, Gregory M; Havens, Brett; Dinicola, Michelle K; Surtees, Jennifer A

2013-06-12

In Saccharomyces cerevisiae, repair of insertion/deletion loops is carried out by Msh2-Msh3-mediated mismatch repair (MMR). Msh2-Msh3 is also required for 3' non-homologous tail removal (3' NHTR) in double-strand break repair. In both pathways, Msh2-Msh3 binds double-strand/single-strand junctions and initiates repair in an ATP-dependent manner. However, the kinetics of the two processes appear different; MMR is likely rapid in order to coordinate with the replication fork, whereas 3' NHTR has been shown to be a slower process. To understand the molecular requirements in both repair pathways, we performed an in vivo analysis of well-conserved residues in Msh3 that are hypothesized to be required for MMR and/or 3' NHTR. These residues are predicted to be involved in either communication between the DNA-binding and ATPase domains within the complex or nucleotide binding and/or exchange within Msh2-Msh3. We identified a set of aromatic residues within the FLY motif of the predicted Msh3 nucleotide binding pocket that are essential for Msh2-Msh3-mediated MMR but are largely dispensable for 3' NHTR. In contrast, mutations in other regions gave similar phenotypes in both assays. Based on these results, we suggest that the two pathways have distinct requirements with respect to the position of the bound ATP within Msh3. We propose that the differences are related, at least in part, to the kinetics of each pathway. Proper binding and positioning of ATP is required to induce rapid conformational changes at the replication fork, but is less important when more time is available for repair, as in 3' NHTR. Copyright © 2013 Elsevier Ltd. All rights reserved.
Distinct requirements within the Msh3 nucleotide binding pocket for mismatch and double-strand break repair

PubMed Central

Kumar, Charanya; Williams, Gregory M.; Havens, Brett; Dinicola, Michelle; Surtees, Jennifer A.

2013-01-01

In Saccharomyces cerevisiae, repair of insertion/deletion loops is carried out by Msh2-Msh3-mediated mismatch repair (MMR). Msh2-Msh3 is also required for 3’ non-homologous tail removal (3’NHTR) in double-strand break repair. In both pathways, Msh2-Msh3 binds double-strand/single-strand junctions and initiates repair in an ATP-dependent manner. However, the kinetics of the two processes appear different; MMR is likely rapid in order to coordinate with the replication fork, whereas 3’ NHTR has been shown to be a slower process. To understand the molecular requirements in both repair pathways, we performed an in vivo analysis of well conserved residues in Msh3 that are hypothesized to be required for MMR and/or 3’NHTR. These residues are predicted to be involved in either communication between the DNA-binding and ATPase domains within the complex or nucleotide binding and/or exchange within Msh2-Msh3. We identified a set of aromatic residues within the FLY motif of the predicted Msh3 nucleotide binding pocket that are essential for Msh2-Msh3-mediated MMR but are largely dispensable for 3’NHTR. In contrast, mutations in other regions gave similar phenotypes in both assays. Based on these results, we suggest the two pathways have distinct requirements with respect to the position of the bound ATP within Msh3. We propose that the differences are related, at least in part, to the kinetics of each pathway. Proper binding and positioning of ATP is required to induce rapid conformational changes at the replication fork, but is less important when more time is available for repair, as in 3’ NHTR. PMID:23458407
In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features.

PubMed

Ding, Yiliang; Tang, Yin; Kwok, Chun Kit; Zhang, Yu; Bevilacqua, Philip C; Assmann, Sarah M

2014-01-30

RNA structure has critical roles in processes ranging from ligand sensing to the regulation of translation, polyadenylation and splicing. However, a lack of genome-wide in vivo RNA structural data has limited our understanding of how RNA structure regulates gene expression in living cells. Here we present a high-throughput, genome-wide in vivo RNA structure probing method, structure-seq, in which dimethyl sulphate methylation of unprotected adenines and cytosines is identified by next-generation sequencing. Application of this method to Arabidopsis thaliana seedlings yielded the first in vivo genome-wide RNA structure map at nucleotide resolution for any organism, with quantitative structural information across more than 10,000 transcripts. Our analysis reveals a three-nucleotide periodic repeat pattern in the structure of coding regions, as well as a less-structured region immediately upstream of the start codon, and shows that these features are strongly correlated with translation efficiency. We also find patterns of strong and weak secondary structure at sites of alternative polyadenylation, as well as strong secondary structure at 5' splice sites that correlates with unspliced events. Notably, in vivo structures of messenger RNAs annotated for stress responses are poorly predicted in silico, whereas mRNA structures of genes related to cell function maintenance are well predicted. Global comparison of several structural features between these two categories shows that the mRNAs associated with stress responses tend to have more single-strandedness, longer maximal loop length and higher free energy per nucleotide, features that may allow these RNAs to undergo conformational changes in response to environmental conditions. Structure-seq allows the RNA structurome and its biological roles to be interrogated on a genome-wide scale and should be applicable to any organism.
Mutations that Cause Human Disease: A Computational/Experimental Approach

DOE Office of Scientific and Technical Information (OSTI.GOV)

Beernink, P; Barsky, D; Pesavento, B

International genome sequencing projects have produced billions of nucleotides (letters) of DNA sequence data, including the complete genome sequences of 74 organisms. These genome sequences have created many new scientific opportunities, including the ability to identify sequence variations among individuals within a species. These genetic differences, which are known as single nucleotide polymorphisms (SNPs), are particularly important in understanding the genetic basis for disease susceptibility. Since the report of the complete human genome sequence, over two million human SNPs have been identified, including a large-scale comparison of an entire chromosome from twenty individuals. Of the protein coding SNPs (cSNPs), approximatelymore » half leads to a single amino acid change in the encoded protein (non-synonymous coding SNPs). Most of these changes are functionally silent, while the remainder negatively impact the protein and sometimes cause human disease. To date, over 550 SNPs have been found to cause single locus (monogenic) diseases and many others have been associated with polygenic diseases. SNPs have been linked to specific human diseases, including late-onset Parkinson disease, autism, rheumatoid arthritis and cancer. The ability to predict accurately the effects of these SNPs on protein function would represent a major advance toward understanding these diseases. To date several attempts have been made toward predicting the effects of such mutations. The most successful of these is a computational approach called ''Sorting Intolerant From Tolerant'' (SIFT). This method uses sequence conservation among many similar proteins to predict which residues in a protein are functionally important. However, this method suffers from several limitations. First, a query sequence must have a sufficient number of relatives to infer sequence conservation. Second, this method does not make use of or provide any information on protein structure, which can be used to understand how an amino acid change affects the protein. The experimental methods that provide the most detailed structural information on proteins are X-ray crystallography and NMR spectroscopy. However, these methods are labor intensive and currently cannot be carried out on a genomic scale. Nonetheless, Structural Genomics projects are being pursued by more than a dozen groups and consortia worldwide and as a result the number of experimentally determined structures is rising exponentially. Based on the expectation that protein structures will continue to be determined at an ever-increasing rate, reliable structure prediction schemes will become increasingly valuable, leading to information on protein function and disease for many different proteins. Given known genetic variability and experimentally determined protein structures, can we accurately predict the effects of single amino acid substitutions? An objective assessment of this question would involve comparing predicted and experimentally determined structures, which thus far has not been rigorously performed. The completed research leveraged existing expertise at LLNL in computational and structural biology, as well as significant computing resources, to address this question.« less
Pre-steady-state Kinetic Analysis of a Family D DNA Polymerase from Thermococcus sp. 9°N Reveals Mechanisms for Archaeal Genomic Replication and Maintenance*

PubMed Central

Schermerhorn, Kelly M.; Gardner, Andrew F.

2015-01-01

Family D DNA polymerases (polDs) have been implicated as the major replicative polymerase in archaea, excluding the Crenarchaeota branch, and bear little sequence homology to other DNA polymerase families. Here we report a detailed kinetic analysis of nucleotide incorporation and exonuclease activity for a Family D DNA polymerase from Thermococcus sp. 9°N. Pre-steady-state single-turnover nucleotide incorporation assays were performed to obtain the kinetic parameters, kpol and Kd, for correct nucleotide incorporation, incorrect nucleotide incorporation, and ribonucleotide incorporation by exonuclease-deficient polD. Correct nucleotide incorporation kinetics revealed a relatively slow maximal rate of polymerization (kpol ∼2.5 s−1) and especially tight nucleotide binding (Kd(dNTP) ∼1.7 μm), compared with DNA polymerases from Families A, B, C, X, and Y. Furthermore, pre-steady-state nucleotide incorporation assays revealed that polD prevents the incorporation of incorrect nucleotides and ribonucleotides primarily through reduced nucleotide binding affinity. Pre-steady-state single-turnover assays on wild-type 9°N polD were used to examine 3′-5′ exonuclease hydrolysis activity in the presence of Mg2+ and Mn2+. Interestingly, substituting Mn2+ for Mg2+ accelerated hydrolysis rates >40-fold (kexo ≥110 s−1 versus ≥2.5 s−1). Preference for Mn2+ over Mg2+ in exonuclease hydrolysis activity is a property unique to the polD family. The kinetic assays performed in this work provide critical insight into the mechanisms that polD employs to accurately and efficiently replicate the archaeal genome. Furthermore, despite the unique properties of polD, this work suggests that a conserved polymerase kinetic pathway is present in all known DNA polymerase families. PMID:26160179
Predictors of Outcome in Ulcerative Colitis.

PubMed

Waterman, Matti; Knight, Jo; Dinani, Amreen; Xu, Wei; Stempak, Joanne M; Croitoru, Kenneth; Nguyen, Geoffrey C; Cohen, Zane; McLeod, Robin S; Greenberg, Gordon R; Steinhart, A Hillary; Silverberg, Mark S

2015-09-01

Approximately 80% of patients with ulcerative colitis (UC) have intermittently active disease and up to 20% will require a colectomy, but little data available on predictors of poor disease course. The aim of this study was to identify clinical and genetic markers that can predict prognosis. Medical records of patients with UC with ≥5 years of follow-up and available DNA and serum were retrospectively assessed. Immunochip was used to genotype loci associated with immune mediated inflammatory disorders (IMIDs), inflammatory bowel diseases, and other single nucleotide polypmorphisms previously associated with disease severity. Serum levels of pANCA, ASCA, CBir1, and OmpC were also evaluated. Requirement for colectomy, medication, and hospitalization were used to group patients into 3 prognostic groups. Six hundred one patients with UC were classified as mild (n = 78), moderate (n = 273), or severe disease (n = 250). Proximal disease location frequencies at diagnosis were 13%, 21%, and 30% for mild, moderate, and severe UC, respectively (P = 0.001). Disease severity was associated with greater proximal extension rates on follow-up (P < 0.0001) and with shorter time to extension (P = 0.03) and to prednisone initiation (P = 0.0004). When comparing severe UC with mild and moderate UC together, diagnosis age >40 and proximal disease location were associated with severe UC (odds ratios = 1.94 and 2.12, respectively). None of the single nucleotide polypmorphisms or serum markers tested was associated with severe UC, proximal disease extension or colectomy. Older age and proximal disease location at diagnosis, but not genetic and serum markers, were associated with a more severe course. Further work is required to identify biomarkers that will predict outcomes in UC.
Integrating multiple genomic data to predict disease-causing nonsynonymous single nucleotide variants in exome sequencing studies.

PubMed

Wu, Jiaxin; Li, Yanda; Jiang, Rui

2014-03-01

Exome sequencing has been widely used in detecting pathogenic nonsynonymous single nucleotide variants (SNVs) for human inherited diseases. However, traditional statistical genetics methods are ineffective in analyzing exome sequencing data, due to such facts as the large number of sequenced variants, the presence of non-negligible fraction of pathogenic rare variants or de novo mutations, and the limited size of affected and normal populations. Indeed, prevalent applications of exome sequencing have been appealing for an effective computational method for identifying causative nonsynonymous SNVs from a large number of sequenced variants. Here, we propose a bioinformatics approach called SPRING (Snv PRioritization via the INtegration of Genomic data) for identifying pathogenic nonsynonymous SNVs for a given query disease. Based on six functional effect scores calculated by existing methods (SIFT, PolyPhen2, LRT, MutationTaster, GERP and PhyloP) and five association scores derived from a variety of genomic data sources (gene ontology, protein-protein interactions, protein sequences, protein domain annotations and gene pathway annotations), SPRING calculates the statistical significance that an SNV is causative for a query disease and hence provides a means of prioritizing candidate SNVs. With a series of comprehensive validation experiments, we demonstrate that SPRING is valid for diseases whose genetic bases are either partly known or completely unknown and effective for diseases with a variety of inheritance styles. In applications of our method to real exome sequencing data sets, we show the capability of SPRING in detecting causative de novo mutations for autism, epileptic encephalopathies and intellectual disability. We further provide an online service, the standalone software and genome-wide predictions of causative SNVs for 5,080 diseases at http://bioinfo.au.tsinghua.edu.cn/spring.

Polygenic risk score in postmortem diagnosed sporadic early-onset Alzheimer's disease.

PubMed

Chaudhury, Sultan; Patel, Tulsi; Barber, Imelda S; Guetta-Baranes, Tamar; Brookes, Keeley J; Chappell, Sally; Turton, James; Guerreiro, Rita; Bras, Jose; Hernandez, Dena; Singleton, Andrew; Hardy, John; Mann, David; Morgan, Kevin

2018-02-01

Sporadic early-onset Alzheimer's disease (sEOAD) exhibits the symptoms of late-onset Alzheimer's disease but lacks the familial aspect of the early-onset familial form. The genetics of Alzheimer's disease (AD) identifies APOEε4 to be the greatest risk factor; however, it is a complex disease involving both environmental risk factors and multiple genetic loci. Polygenic risk scores (PRSs) accumulate the total risk of a phenotype in an individual based on variants present in their genome. We determined whether sEOAD cases had a higher PRS compared to controls. A cohort of sEOAD cases was genotyped on the NeuroX array, and PRSs were generated using PRSice. The target data set consisted of 408 sEOAD cases and 436 controls. The base data set was collated by the International Genomics of Alzheimer's Project consortium, with association data from 17,008 late-onset Alzheimer's disease cases and 37,154 controls, which can be used for identifying sEOAD cases due to having shared phenotype. PRSs were generated using all common single nucleotide polymorphisms between the base and target data set, PRS were also generated using only single nucleotide polymorphisms within a 500 kb region surrounding the APOE gene. Sex and number of APOE ε2 or ε4 alleles were used as variables for logistic regression and combined with PRS. The results show that PRS is higher on average in sEOAD cases than controls, although there is still overlap among the whole cohort. Predictive ability of identifying cases and controls using PRSice was calculated with 72.9% accuracy, greater than the APOE locus alone (65.2%). Predictive ability was further improved with logistic regression, identifying cases and controls with 75.5% accuracy. Copyright © 2017 Elsevier Inc. All rights reserved.
FSR: feature set reduction for scalable and accurate multi-class cancer subtype classification based on copy number.

PubMed

Wong, Gerard; Leckie, Christopher; Kowalczyk, Adam

2012-01-15

Feature selection is a key concept in machine learning for microarray datasets, where features represented by probesets are typically several orders of magnitude larger than the available sample size. Computational tractability is a key challenge for feature selection algorithms in handling very high-dimensional datasets beyond a hundred thousand features, such as in datasets produced on single nucleotide polymorphism microarrays. In this article, we present a novel feature set reduction approach that enables scalable feature selection on datasets with hundreds of thousands of features and beyond. Our approach enables more efficient handling of higher resolution datasets to achieve better disease subtype classification of samples for potentially more accurate diagnosis and prognosis, which allows clinicians to make more informed decisions in regards to patient treatment options. We applied our feature set reduction approach to several publicly available cancer single nucleotide polymorphism (SNP) array datasets and evaluated its performance in terms of its multiclass predictive classification accuracy over different cancer subtypes, its speedup in execution as well as its scalability with respect to sample size and array resolution. Feature Set Reduction (FSR) was able to reduce the dimensions of an SNP array dataset by more than two orders of magnitude while achieving at least equal, and in most cases superior predictive classification performance over that achieved on features selected by existing feature selection methods alone. An examination of the biological relevance of frequently selected features from FSR-reduced feature sets revealed strong enrichment in association with cancer. FSR was implemented in MATLAB R2010b and is available at http://ww2.cs.mu.oz.au/~gwong/FSR.
Unique CD44 intronic SNP is associated with tumor grade in breast cancer: a case control study and in silico analysis.

PubMed

Esmaeili, Rezvan; Abdoli, Nasrin; Yadegari, Fatemeh; Neishaboury, Mohamadreza; Farahmand, Leila; Kaviani, Ahmad; Majidzadeh-A, Keivan

2018-01-01

CD44 encoded by a single gene is a cell surface transmembrane glycoprotein. Exon 2 is one of the important exons to bind CD44 protein to hyaluronan. Experimental evidences show that hyaluronan-CD44 interaction intensifies the proliferation, migration, and invasion of breast cancer cells. Therefore, the current study aimed at investigating the association between specific polymorphisms in exon 2 and its flanking region of CD44 with predisposition to breast cancer. In the current study, 175 Iranian female patients with breast cancer and 175 age-matched healthy controls were recruited in biobank, Breast Cancer Research Center, Tehran, Iran. Single nucleotide polymorphisms of CD44 exon 2 and its flanking were analyzed via polymerase chain reaction and gene sequencing techniques. Association between the observed variation with breast cancer risk and clinico-pathological characteristics were studied. Subsequently, bioinformatics analysis was conducted to predict potential exonic splicing enhancer (ESE) motifs changed as the result of a mutation. A unique polymorphism of the gene encoding CD44 was identified at position 14 nucleotide upstream of exon 2 (A37692→G) by the sequencing method. The A > G polymorphism exhibited a significant association with higher-grades of breast cancer, although no significant relation was found between this polymorphism and breast cancer risk. Finally, computational analysis revealed that the intronic mutation generated a new consensus-binding motif for the splicing factor, SC35, within intron 1. The current study results indicated that A > G polymorphism was associated with breast cancer development; in addition, in silico analysis with ESE finder prediction software showed that the change created a new SC35 binding site.
Translating natural genetic variation to gene expression in a computational model of the Drosophila gap gene regulatory network

PubMed Central

Kozlov, Konstantin N.; Kulakovskiy, Ivan V.; Zubair, Asif; Marjoram, Paul; Lawrie, David S.; Nuzhdin, Sergey V.; Samsonova, Maria G.

2017-01-01

Annotating the genotype-phenotype relationship, and developing a proper quantitative description of the relationship, requires understanding the impact of natural genomic variation on gene expression. We apply a sequence-level model of gap gene expression in the early development of Drosophila to analyze single nucleotide polymorphisms (SNPs) in a panel of natural sequenced D. melanogaster lines. Using a thermodynamic modeling framework, we provide both analytical and computational descriptions of how single-nucleotide variants affect gene expression. The analysis reveals that the sequence variants increase (decrease) gene expression if located within binding sites of repressors (activators). We show that the sign of SNP influence (activation or repression) may change in time and space and elucidate the origin of this change in specific examples. The thermodynamic modeling approach predicts non-local and non-linear effects arising from SNPs, and combinations of SNPs, in individual fly genotypes. Simulation of individual fly genotypes using our model reveals that this non-linearity reduces to almost additive inputs from multiple SNPs. Further, we see signatures of the action of purifying selection in the gap gene regulatory regions. To infer the specific targets of purifying selection, we analyze the patterns of polymorphism in the data at two phenotypic levels: the strengths of binding and expression. We find that combinations of SNPs show evidence of being under selective pressure, while individual SNPs do not. The model predicts that SNPs appear to accumulate in the genotypes of the natural population in a way biased towards small increases in activating action on the expression pattern. Taken together, these results provide a systems-level view of how genetic variation translates to the level of gene regulatory networks via combinatorial SNP effects. PMID:28898266
Prospective assessment of XRCC3, XPD and Aurora kinase A single-nucleotide polymorphisms in advanced lung cancer.

PubMed

Provencio, M; Camps, C; Cobo, M; De las Peñas, R; Massuti, B; Blanco, R; Alberola, V; Jimenez, U; Delgado, J R; Cardenal, F; Tarón, M; Ramírez, J L; Sanchez, A; Rosell, R

2012-12-01

New therapeutic approaches are being developed based on findings that several genetic abnormalities underlying non-small-cell lung cancer (NSCLC) can influence chemosensitivity. The identification of molecular markers, useful for therapeutic decisions in lung cancer, is thus crucial for disease management. The present study evaluated single-nucleotide polymorphisms (SNPs) in XRCC3, XPD and Aurora kinase A in NSCLC patients in order to assess whether these biomarkers were able to predict the outcomes of the patients. The Spanish Lung Cancer Group prospectively assessed this clinical study. Eligible patients had histologically confirmed stage IV or IIIB (with malignant pleural effusion) NSCLC, which had not previously been treated with chemotherapy, and a World Health Organization performance status (PS) of 0-1. Patients received intravenous doses of vinorelbine 25 mg/m(2) on days 1 and 8, and cisplatin 75 mg/m(2) on day 1, every 21 days for a maximum of 6 cycles. Venous blood was collected from each, and genomic DNA was isolated. SNPs in XRCC3 T241M, XPD K751Q, XPD D312N, AURORA 91, AURORA 169 were assessed. The study included 180 patients. Median age was 62 years; 87 % were male; 34 % had PS 0; and 83 % had stage IV disease. The median number of cycles was 4. Time to progression was 5.1 months (95 % CI, 4.2-5.9). Overall median survival was 8.6 months (95 % CI, 7.1-10.1). There was no significant association between SNPs in XRCC3 T241M, XPD K751Q, XPD D312N, AURORA 91, AURORA 169 in outcome or toxicity. Our findings indicate that SNPs in XRCC3, XPD or Aurora kinase A cannot predict outcomes in advanced NSCLC patients treated with platinum-based chemotherapy.
Combined Medical and Surgical Approach Improves Healing of Septic Perianal Crohn's Disease.

PubMed

Choi, Christine S; Berg, Arthur S; Sangster, William; Schieffer, Kathleen M; Harris, Leonard R; Deiling, Sue M; Koltun, Walter A

2016-09-01

Septic perianal Crohn's disease (SPCD) is a treatment challenge in spite of tumor necrosis factor antagonists (anti-TNF). Our aim was to define the success of SPCD management with a combined medical and surgical approach and to identify clinical and genetic factors predictive of healing. A retrospective chart review of patients with SPCD treated at the Penn State Milton S Hershey Medical Center was done. Primary end point was complete healing (ie normal clinical exam and no pain for at least 6 months). Genetic analysis of 185 single nucleotide polymorphisms associated with Crohn's disease was performed in 78 patients. One hundred and thirty-five episodes of SPCD were identified in 114 patients with a mean follow-up of 77 ± 7.4 months. Overall, 80 of 135 episodes healed (59.3%) and did not differ between those receiving anti-TNF and not (60.4% vs 56.8%). There appeared to be a consistent improved heal rate in each subcategory of surgically managed patients that received anti-TNF. Female sex was significantly predictive of healing in only those receiving anti-TNF agents (63.6% vs 25.0%; p = 0.0005). Twenty-two (19.3%) patients ultimately received a permanent diversion with either a total proctocolectomy or completion proctectomy. Multivariate analysis suggested several single nucleotide polymorphisms in Crohn's disease-associated genes to be possibly associated with healing, but lost significance after Bonferroni correction. Overall, there is an approximate 60% rate of healing SPCD using a combined medical and surgical approach. About 20% of SPCD patients will require a permanent stoma. There were no clear genetic predictors of healing SPCD. Copyright © 2016 American College of Surgeons. Published by Elsevier Inc. All rights reserved.
Genome-scale engineering of Saccharomyces cerevisiae with single-nucleotide precision.

PubMed

Bao, Zehua; HamediRad, Mohammad; Xue, Pu; Xiao, Han; Tasan, Ipek; Chao, Ran; Liang, Jing; Zhao, Huimin

2018-07-01

We developed a CRISPR-Cas9- and homology-directed-repair-assisted genome-scale engineering method named CHAnGE that can rapidly output tens of thousands of specific genetic variants in yeast. More than 98% of target sequences were efficiently edited with an average frequency of 82%. We validate the single-nucleotide resolution genome-editing capability of this technology by creating a genome-wide gene disruption collection and apply our method to improve tolerance to growth inhibitors.
Predicting genotypes environmental range from genome-environment associations.

PubMed

Manel, Stéphanie; Andrello, Marco; Henry, Karine; Verdelet, Daphné; Darracq, Aude; Guerin, Pierre-Edouard; Desprez, Bruno; Devaux, Pierre

2018-05-17

Genome-environment association methods aim to detect genetic markers associated with environmental variables. The detected associations are usually analysed separately to identify the genomic regions involved in local adaptation. However, a recent study suggests that single-locus associations can be combined and used in a predictive way to estimate environmental variables for new individuals on the basis of their genotypes. Here, we introduce an original approach to predict the environmental range (values and upper and lower limits) of species genotypes from the genetic markers significantly associated with those environmental variables in an independent set of individuals. We illustrate this approach to predict aridity in a database constituted of 950 individuals of wild beets and 299 individuals of cultivated beets genotyped at 14,409 random Single Nucleotide Polymorphisms (SNPs). We detected 66 alleles associated with aridity and used them to calculate the fraction (I) of aridity-associated alleles in each individual. The fraction I correctly predicted the values of aridity in an independent validation set of wild individuals and was then used to predict aridity in the 299 cultivated individuals. Wild individuals had higher median values and a wider range of values of aridity than the cultivated individuals, suggesting that wild individuals have higher ability to resist to stress-aridity conditions and could be used to improve the resistance of cultivated varieties to aridity. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants

PubMed Central

Knecht, Carolin; Mort, Matthew; Junge, Olaf; Cooper, David N.; Krawczak, Michael

2017-01-01

Abstract The in silico prediction of the functional consequences of mutations is an important goal of human pathogenetics. However, bioinformatic tools that classify mutations according to their functionality employ different algorithms so that predictions may vary markedly between tools. We therefore integrated nine popular prediction tools (PolyPhen-2, SNPs&GO, MutPred, SIFT, MutationTaster2, Mutation Assessor and FATHMM as well as conservation-based Grantham Score and PhyloP) into a single predictor. The optimal combination of these tools was selected by means of a wide range of statistical modeling techniques, drawing upon 10 029 disease-causing single nucleotide variants (SNVs) from Human Gene Mutation Database and 10 002 putatively ‘benign’ non-synonymous SNVs from UCSC. Predictive performance was found to be markedly improved by model-based integration, whilst maximum predictive capability was obtained with either random forest, decision tree or logistic regression analysis. A combination of PolyPhen-2, SNPs&GO, MutPred, MutationTaster2 and FATHMM was found to perform as well as all tools combined. Comparison of our approach with other integrative approaches such as Condel, CoVEC, CAROL, CADD, MetaSVM and MetaLR using an independent validation dataset, revealed the superiority of our newly proposed integrative approach. An online implementation of this approach, IMHOTEP (‘Integrating Molecular Heuristics and Other Tools for Effect Prediction’), is provided at http://www.uni-kiel.de/medinfo/cgi-bin/predictor/. PMID:28180317
Genomic prediction of piglet response to infection with one of two porcine reproductive and respiratory syndrome virus isolates.

PubMed

Waide, Emily H; Tuggle, Christopher K; Serão, Nick V L; Schroyen, Martine; Hess, Andrew; Rowland, Raymond R R; Lunney, Joan K; Plastow, Graham; Dekkers, Jack C M

2018-02-01

Genomic prediction of the pig's response to the porcine reproductive and respiratory syndrome (PRRS) virus (PRRSV) would be a useful tool in the swine industry. This study investigated the accuracy of genomic prediction based on porcine SNP60 Beadchip data using training and validation datasets from populations with different genetic backgrounds that were challenged with different PRRSV isolates. Genomic prediction accuracy averaged 0.34 for viral load (VL) and 0.23 for weight gain (WG) following experimental PRRSV challenge, which demonstrates that genomic selection could be used to improve response to PRRSV infection. Training on WG data during infection with a less virulent PRRSV, KS06, resulted in poor accuracy of prediction for WG during infection with a more virulent PRRSV, NVSL. Inclusion of single nucleotide polymorphisms (SNPs) that are in linkage disequilibrium with a major quantitative trait locus (QTL) on chromosome 4 was vital for accurate prediction of VL. Overall, SNPs that were significantly associated with either trait in single SNP genome-wide association analysis were unable to predict the phenotypes with an accuracy as high as that obtained by using all genotyped SNPs across the genome. Inclusion of data from close relatives into the training population increased whole genome prediction accuracy by 33% for VL and by 37% for WG but did not affect the accuracy of prediction when using only SNPs in the major QTL region. Results show that genomic prediction of response to PRRSV infection is moderately accurate and, when using all SNPs on the porcine SNP60 Beadchip, is not very sensitive to differences in virulence of the PRRSV in training and validation populations. Including close relatives in the training population increased prediction accuracy when using the whole genome or SNPs other than those near a major QTL.
Structural and Biochemical Determinants of Ligand Binding by the c-di-GMP Riboswitch

DOE Office of Scientific and Technical Information (OSTI.GOV)

Smith, K.; Lipchock, S; Livingston,

2010-01-01

The bacterial second messenger c-di-GMP is used in many species to control essential processes that allow the organism to adapt to its environment. The c-di-GMP riboswitch (GEMM) is an important downstream target in this signaling pathway and alters gene expression in response to changing concentrations of c-di-GMP. The riboswitch selectively recognizes its second messenger ligand primarily through contacts with two critical nucleotides. However, these two nucleotides are not the most highly conserved residues within the riboswitch sequence. Instead, nucleotides that stack with c-di-GMP and that form tertiary RNA contacts are the most invariant. Biochemical and structural evidence reveals that themore » most common natural variants are able to make alternative pairing interactions with both guanine bases of the ligand. Additionally, a high-resolution (2.3 {angstrom}) crystal structure of the native complex reveals that a single metal coordinates the c-di-GMP backbone. Evidence is also provided that after transcription of the first nucleotide on the 3{prime}-side of the P1 helix, which is predicted to be the molecular switch, the aptamer is functional for ligand binding. Although large energetic effects occur when several residues in the RNA are altered, mutations at the most conserved positions, rather than at positions that base pair with c-di-GMP, have the most detrimental effects on binding. Many mutants retain sufficient c-di-GMP affinity for the RNA to remain biologically relevant, which suggests that this motif is quite resilient to mutation.« less
MethSMRT: an integrative database for DNA N6-methyladenine and N4-methylcytosine generated by single-molecular real-time sequencing.

PubMed

Ye, Pohao; Luan, Yizhao; Chen, Kaining; Liu, Yizhi; Xiao, Chuanle; Xie, Zhi

2017-01-04

DNA methylation is an important type of epigenetic modifications, where 5- methylcytosine (5mC), 6-methyadenine (6mA) and 4-methylcytosine (4mC) are the most common types. Previous efforts have been largely focused on 5mC, providing invaluable insights into epigenetic regulation through DNA methylation. Recently developed single-molecule real-time (SMRT) sequencing technology provides a unique opportunity to detect the less studied DNA 6mA and 4mC modifications at single-nucleotide resolution. With a rapidly increased amount of SMRT sequencing data generated, there is an emerging demand to systematically explore DNA 6mA and 4mC modifications from these data sets. MethSMRT is the first resource hosting DNA 6mA and 4mC methylomes. All the data sets were processed using the same analysis pipeline with the same quality control. The current version of the database provides a platform to store, browse, search and download epigenome-wide methylation profiles of 156 species, including seven eukaryotes such as Arabidopsis, C. elegans, Drosophila, mouse and yeast, as well as 149 prokaryotes. It also offers a genome browser to visualize the methylation sites and related information such as single nucleotide polymorphisms (SNP) and genomic annotation. Furthermore, the database provides a quick summary of statistics of methylome of 6mA and 4mC and predicted methylation motifs for each species. MethSMRT is publicly available at http://sysbio.sysu.edu.cn/methsmrt/ without use restriction. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Population Structure Shapes Copy Number Variation in Malaria Parasites.

PubMed

Cheeseman, Ian H; Miller, Becky; Tan, John C; Tan, Asako; Nair, Shalini; Nkhoma, Standwell C; De Donato, Marcos; Rodulfo, Hectorina; Dondorp, Arjen; Branch, Oralee H; Mesia, Lastenia Ruiz; Newton, Paul; Mayxay, Mayfong; Amambua-Ngwa, Alfred; Conway, David J; Nosten, François; Ferdig, Michael T; Anderson, Tim J C

2016-03-01

If copy number variants (CNVs) are predominantly deleterious, we would expect them to be more efficiently purged from populations with a large effective population size (Ne) than from populations with a small Ne. Malaria parasites (Plasmodium falciparum) provide an excellent organism to examine this prediction, because this protozoan shows a broad spectrum of population structures within a single species, with large, stable, outbred populations in Africa, small unstable inbred populations in South America and with intermediate population characteristics in South East Asia. We characterized 122 single-clone parasites, without prior laboratory culture, from malaria-infected patients in seven countries in Africa, South East Asia and South America using a high-density single-nucleotide polymorphism/CNV microarray. We scored 134 high-confidence CNVs across the parasite exome, including 33 deletions and 102 amplifications, which ranged in size from <500 bp to 59 kb, as well as 10,107 flanking, biallelic single-nucleotide polymorphisms. Overall, CNVs were rare, small, and skewed toward low frequency variants, consistent with the deleterious model. Relative to African and South East Asian populations, CNVs were significantly more common in South America, showed significantly less skew in allele frequencies, and were significantly larger. On this background of low frequency CNV, we also identified several high-frequency CNVs under putative positive selection using an FST outlier analysis. These included known adaptive CNVs containing rh2b and pfmdr1, and several other CNVs (e.g., DNA helicase and three conserved proteins) that require further investigation. Our data are consistent with a significant impact of genetic structure on CNV burden in an important human pathogen. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
In silico prediction of a disease-associated STIL mutant and its affect on the recruitment of centromere protein J (CENPJ).

PubMed

Kumar, Ambuj; Rajendran, Vidya; Sethumadhavan, Rao; Purohit, Rituraj

2012-01-01

Human STIL (SCL/TAL1 interrupting locus) protein maintains centriole stability and spindle pole localisation. It helps in recruitment of CENPJ (Centromere protein J)/CPAP (centrosomal P4.1-associated protein) and other centrosomal proteins. Mutations in STIL protein are reported in several disorders, especially in deregulation of cell cycle cascades. In this work, we examined the non-synonymous single nucleotide polymorphisms (nsSNPs) reported in STIL protein for their disease association. Different SNP prediction tools were used to predict disease-associated nsSNPs. Our evaluation technique predicted rs147744459 (R242C) as a highly deleterious disease-associated nsSNP and its interaction behaviour with CENPJ protein. Molecular modelling, docking and molecular dynamics simulation were conducted to examine the structural consequences of the predicted disease-associated mutation. By molecular dynamic simulation we observed structural consequences of R242C mutation which affects interaction of STIL and CENPJ functional domains. The result obtained in this study will provide a biophysical insight into future investigations of pathological nsSNPs using a computational platform.
SNPdbe: constructing an nsSNP functional impacts database.

PubMed

Schaefer, Christian; Meier, Alice; Rost, Burkhard; Bromberg, Yana

2012-02-15

Many existing databases annotate experimentally characterized single nucleotide polymorphisms (SNPs). Each non-synonymous SNP (nsSNP) changes one amino acid in the gene product (single amino acid substitution;SAAS). This change can either affect protein function or be neutral in that respect. Most polymorphisms lack experimental annotation of their functional impact. Here, we introduce SNPdbe-SNP database of effects, with predictions of computationally annotated functional impacts of SNPs. Database entries represent nsSNPs in dbSNP and 1000 Genomes collection, as well as variants from UniProt and PMD. SAASs come from >2600 organisms; 'human' being the most prevalent. The impact of each SAAS on protein function is predicted using the SNAP and SIFT algorithms and augmented with experimentally derived function/structure information and disease associations from PMD, OMIM and UniProt. SNPdbe is consistently updated and easily augmented with new sources of information. The database is available as an MySQL dump and via a web front end that allows searches with any combination of organism names, sequences and mutation IDs. http://www.rostlab.org/services/snpdbe.
Nucleotide sequence and genetic organization of barley stripe mosaic virus RNA gamma.

PubMed

Gustafson, G; Hunter, B; Hanau, R; Armour, S L; Jackson, A O

1987-06-01

The complete nucleotide sequences of RNA gamma from the Type and ND18 strains of barley stripe mosaic virus (BSMV) have been determined. The sequences are 3164 (Type) and 2791 (ND18) nucleotides in length. Both sequences contain a 5'-noncoding region (87 or 88 nucleotides) which is followed by a long open reading frame (ORF1). A 42-nucleotide intercistronic region separates ORF1 from a second, shorter open reading frame (ORF2) located near the 3'-end of the RNA. There is a high degree of homology between the Type and ND18 strains in the nucleotide sequence of ORF1. However, the Type strain contains a 366 nucleotide direct tandem repeat within ORF1 which is absent in the ND18 strain. Consequently, the predicted translation product of Type RNA gamma ORF1 (mol wt 87,312) is significantly larger than that of ND18 RNA gamma ORF1 (mol wt 74,011). The amino acid sequence of the ORF1 polypeptide contains homologies with putative RNA polymerases from other RNA viruses, suggesting that this protein may function in replication of the BSMV genome. The nucleotide sequence of RNA gamma ORF2 is nearly identical in the Type and ND18 strains. ORF2 codes for a polypeptide with a predicted molecular weight of 17,209 (Type) or 17,074 (ND18) which is known to be translated from a subgenomic (sg) RNA. The initiation point of this sgRNA has been mapped to a location 27 nucleotides upstream of the ORF2 initiation codon in the intercistronic region between ORF1 and ORF2. The sgRNA is not coterminal with the 3'-end of the genomic RNA, but instead contains heterogeneous poly(A) termini up to 150 nucleotides long (J. Stanley, R. Hanau, and A. O. Jackson, 1984, Virology 139, 375-383). In the genomic RNA gamma, ORF2 is followed by a short poly(A) tract and a 238-nucleotide tRNA-like structure.
Comparative genomic analyses reveal a vast, novel network of nucleotide-centric systems in biological conflicts, immunity and signaling

PubMed Central

Burroughs, A. Maxwell; Zhang, Dapeng; Schäffer, Daniel E.; Iyer, Lakshminarayan M.; Aravind, L.

2015-01-01

Cyclic di- and linear oligo-nucleotide signals activate defenses against invasive nucleic acids in animal immunity; however, their evolutionary antecedents are poorly understood. Using comparative genomics, sequence and structure analysis, we uncovered a vast network of systems defined by conserved prokaryotic gene-neighborhoods, which encode enzymes generating such nucleotides or alternatively processing them to yield potential signaling molecules. The nucleotide-generating enzymes include several clades of the DNA-polymerase β-like superfamily (including Vibrio cholerae DncV), a minimal version of the CRISPR polymerase and DisA-like cyclic-di-AMP synthetases. Nucleotide-binding/processing domains include TIR domains and members of a superfamily prototyped by Smf/DprA proteins and base (cytokinin)-releasing LOG enzymes. They are combined in conserved gene-neighborhoods with genes for a plethora of protein superfamilies, which we predict to function as nucleotide-sensors and effectors targeting nucleic acids, proteins or membranes (pore-forming agents). These systems are sometimes combined with other biological conflict-systems such as restriction-modification and CRISPR/Cas. Interestingly, several are coupled in mutually exclusive neighborhoods with either a prokaryotic ubiquitin-system or a HORMA domain-PCH2-like AAA+ ATPase dyad. The latter are potential precursors of equivalent proteins in eukaryotic chromosome dynamics. Further, components from these nucleotide-centric systems have been utilized in several other systems including a novel diversity-generating system with a reverse transcriptase. We also found the Smf/DprA/LOG domain from these systems to be recruited as a predicted nucleotide-binding domain in eukaryotic TRPM channels. These findings point to evolutionary and mechanistic links, which bring together CRISPR/Cas, animal interferon-induced immunity, and several other systems that combine nucleic-acid-sensing and nucleotide-dependent signaling. PMID:26590262
Analysis of in vivo correction of defined mismatches in the DNA mismatch repair mutants msh2, msh3 and msh6 of Saccharomyces cerevisiae.

PubMed

Lühr, B; Scheller, J; Meyer, P; Kramer, W

1998-02-01

We have analysed the correction of defined mismatches in wild-type and msh2, msh3, msh6 and msh3 msh6 mutants of Saccharomyces cerevisiae in two different yeast strain backgrounds by transformation with plasmid heteroduplex DNA constructs. Ten different base/base mismatches, two single-nucleotide loops and a 38-nucleotide loop were tested. Repair of all types of mismatches was severely impaired in msh2 and msh3 msh6 mutants. In msh6 mutants, repair efficiency of most base/base mismatches was reduced to a similar extent as in msh3 msh6 double mutants. G/T and A/C mismatches, however, displayed residual repair in msh6 mutants in one strain background, implying a role for Msh3p in recognition of base/base mismatches. Furthermore, the efficiency of repair of base/base mismatches was considerably reduced in msh3 mutants in one strain background, indicating a requirement for MSH3 for fully efficient mismatch correction. Also the efficiency of repair of the 38-nucleotide loop was reduced in msh3 mutants, and to a lesser extent in msh6 mutants. The single-nucleotide loop with an unpaired A was less efficiently repaired in msh3 mutants and that with an unpaired T was less efficiently corrected in msh6 mutants, indicating non-redundant functions for the two proteins in the recognition of single-nucleotide loops.
Demonstration of Protein-Based Human Identification Using the Hair Shaft Proteome

PubMed Central

Leppert, Tami; Anex, Deon S.; Hilmer, Jonathan K.; Matsunami, Nori; Baird, Lisa; Stevens, Jeffery; Parsawar, Krishna; Durbin-Johnson, Blythe P.; Rocke, David M.; Nelson, Chad; Fairbanks, Daniel J.; Wilson, Andrew S.; Rice, Robert H.; Woodward, Scott R.; Bothner, Brian; Hart, Bradley R.; Leppert, Mark

2016-01-01

Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 single nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). This study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts. PMID:27603779
Recombination-dependent replication and gene conversion homogenize repeat sequences and diversify plastid genome structure.

PubMed

Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K

2017-04-01

There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.

Integration of structural dynamics and molecular evolution via protein interaction networks: a new era in genomic medicine.

PubMed

Kumar, Avishek; Butler, Brandon M; Kumar, Sudhir; Ozkan, S Banu

2015-12-01

Sequencing technologies are revealing many new non-synonymous single nucleotide variants (nsSNVs) in each personal exome. To assess their functional impacts, comparative genomics is frequently employed to predict if they are benign or not. However, evolutionary analysis alone is insufficient, because it misdiagnoses many disease-associated nsSNVs, such as those at positions involved in protein interfaces, and because evolutionary predictions do not provide mechanistic insights into functional change or loss. Structural analyses can aid in overcoming both of these problems by incorporating conformational dynamics and allostery in nSNV diagnosis. Finally, protein-protein interaction networks using systems-level methodologies shed light onto disease etiology and pathogenesis. Bridging these network approaches with structurally resolved protein interactions and dynamics will advance genomic medicine. Copyright © 2015 Elsevier Ltd. All rights reserved.
Sequence of a cDNA encoding pancreatic preprosomatostatin-22.

PubMed Central

Magazin, M; Minth, C D; Funckes, C L; Deschenes, R; Tavianini, M A; Dixon, J E

1982-01-01

We report the nucleotide sequence of a precursor to somatostatin that upon proteolytic processing may give rise to a hormone of 22 amino acids. The nucleotide sequence of a cDNA from the channel catfish (Ictalurus punctatus) encodes a precursor to somatostatin that is 105 amino acids (Mr, 11,500). The cDNA coding for somatostatin-22 consists of 36 nucleotides in the 5' untranslated region, 315 nucleotides that code for the precursor to somatostatin-22, 269 nucleotides at the 3' untranslated region, and a variable length of poly(A). The putative preprohormone contains a sequence of hydrophobic amino acids at the amino terminus that has the properties of a "signal" peptide. A connecting sequence of approximately 57 amino acids is followed by a single Arg-Arg sequence, which immediately precedes the hormone. Somatostatin-22 is homologous to somatostatin-14 in 7 of the 14 amino acids, including the Phe-Trp-Lys sequence. Hybridization selection of mRNA, followed by its translation in a wheat germ cell-free system, resulted in the synthesis of a single polypeptide having a molecular weight of approximately 10,000 as estimated on Na-DodSO4/polyacrylamide gels. Images PMID:6127673
Simultaneous determination of nucleotide sugars with ion-pair reversed-phase HPLC.

PubMed

Nakajima, Kazuki; Kitazume, Shinobu; Angata, Takashi; Fujinawa, Reiko; Ohtsubo, Kazuaki; Miyoshi, Eiji; Taniguchi, Naoyuki

2010-07-01

Nucleotide sugars are important in determining cell surface glycoprotein glycosylation, which can modulate cellular properties such as growth and arrest. We have developed a conventional HPLC method for simultaneous determination of nucleotide sugars. A mixture of nucleotide sugars (CMP-NeuAc, UDP-Gal, UDP-Glc, UDP-GalNAc, UDP-GlcNAc, GDP-Man, GDP-Fuc and UDP-GlcUA) and relevant nucleotides were perfectly separated in an optimized ion-pair reversed-phase mode using Inertsil ODS-4 and ODS-3 columns. The newly developed method enabled us to determine the nucleotide sugars in cellular extracts from 1 x 10(6) cells in a single run. We applied this method to characterize nucleotide sugar levels in breast and pancreatic cancer cell lines and revealed that the abundance of UDP-GlcNAc, UDP-GalNAc, UDP-GlcUA and GDP-Fuc were a cell-type-specific feature. To determine the physiological significance of changes in nucleotide sugar levels, we analyzed their changes by glucose deprivation and found that the determination of nucleotide sugar levels provided us with valuable information with respect to studying the overview of cellular glycosylation status.
Metastatic phaeochromocytoma in a 23-year-old woman with an unclassified variant in the von Hippel Lindau disease gene: how can the pathogenicity of this variant be determined?

PubMed

Russell, Nicholas; Delatycki, Martin; Grossmann, Mathis

2015-07-01

A 23-year-old woman with metastatic phaeochromocytoma was found to have a previously unclassified variant in the von Hippel Lindau disease gene (c.361G>C). We use this case to highlight the issue of unclassified single nucleotide variants and the approaches to help predict whether they are disease causing or neutral. With increasing use of genetic testing, and widespread clinical use of next-generation sequencing around the corner, this issue is likely to become more prominent. © 2015 John Wiley & Sons Ltd.
Optimizing complex phenotypes through model-guided multiplex genome engineering

DOE PAGES

Kuznetsov, Gleb; Goodman, Daniel B.; Filsinger, Gabriel T.; ...

2017-05-25

Here, we present a method for identifying genomic modifications that optimize a complex phenotype through multiplex genome engineering and predictive modeling. We apply our method to identify six single nucleotide mutations that recover 59% of the fitness defect exhibited by the 63-codon E. coli strain C321.ΔA. By introducing targeted combinations of changes in multiplex we generate rich genotypic and phenotypic diversity and characterize clones using whole-genome sequencing and doubling time measurements. Regularized multivariate linear regression accurately quantifies individual allelic effects and overcomes bias from hitchhiking mutations and context-dependence of genome editing efficiency that would confound other strategies.
Optimizing complex phenotypes through model-guided multiplex genome engineering

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kuznetsov, Gleb; Goodman, Daniel B.; Filsinger, Gabriel T.

Here, we present a method for identifying genomic modifications that optimize a complex phenotype through multiplex genome engineering and predictive modeling. We apply our method to identify six single nucleotide mutations that recover 59% of the fitness defect exhibited by the 63-codon E. coli strain C321.ΔA. By introducing targeted combinations of changes in multiplex we generate rich genotypic and phenotypic diversity and characterize clones using whole-genome sequencing and doubling time measurements. Regularized multivariate linear regression accurately quantifies individual allelic effects and overcomes bias from hitchhiking mutations and context-dependence of genome editing efficiency that would confound other strategies.
Generalization of Associations of Kidney-Related Genetic Loci to American Indians

PubMed Central

Haack, Karin; Almasy, Laura; Laston, Sandra; Lee, Elisa T.; Best, Lyle G.; Fabsitz, Richard R.; MacCluer, Jean W.; Howard, Barbara V.; Umans, Jason G.; Cole, Shelley A.

2014-01-01

Summary Background and objectives CKD disproportionally affects American Indians, who similar to other populations, show genetic susceptibility to kidney outcomes. Recent studies have identified several loci associated with kidney traits, but their relevance in American Indians is unknown. Design, setting, participants, & measurements This study used data from a large, family-based genetic study of American Indians (the Strong Heart Family Study), which includes 94 multigenerational families enrolled from communities located in Oklahoma, the Dakotas, and Arizona. Individuals were recruited from the Strong Heart Study, a population-based study of cardiovascular disease in American Indians. This study selected 25 single nucleotide polymorphisms in 23 loci identified from recently published kidney-related genome-wide association studies in individuals of European ancestry to evaluate their associations with kidney function (estimated GFR; individuals 18 years or older, up to 3282 individuals) and albuminuria (urinary albumin to creatinine ratio; n=3552) in the Strong Heart Family Study. This study also examined the association of single nucleotide polymorphisms in the APOL1 region with estimated GFR in 1121 Strong Heart Family Study participants. GFR was estimated using the abbreviated Modification of Diet in Renal Disease Equation. Additive genetic models adjusted for age and sex were used. Results This study identified significant associations of single nucleotide polymorphisms with estimated GFR in or nearby PRKAG2, SLC6A13, UBE2Q2, PIP5K1B, and WDR72 (P<2.1 × 10-3 to account for multiple testing). Single nucleotide polymorphisms in these loci explained 2.2% of the estimated GFR total variance and 2.9% of its heritability. An intronic variant of BCAS3 was significantly associated with urinary albumin to creatinine ratio. APOL1 single nucleotide polymorphisms were not associated with estimated GFR in a single variant test or haplotype analyses, and the at-risk variants identified in individuals with African ancestry were not detected in DNA sequencing of American Indians. Conclusion This study extends the genetic associations of loci affecting kidney function to American Indians, a population at high risk of kidney disease, and provides additional support for a potential biologic relevance of these loci across ancestries. PMID:24311711
Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign

PubMed Central

2007-01-01

Background Joint alignment and secondary structure prediction of two RNA sequences can significantly improve the accuracy of the structural predictions. Methods addressing this problem, however, are forced to employ constraints that reduce computation by restricting the alignments and/or structures (i.e. folds) that are permissible. In this paper, a new methodology is presented for the purpose of establishing alignment constraints based on nucleotide alignment and insertion posterior probabilities. Using a hidden Markov model, posterior probabilities of alignment and insertion are computed for all possible pairings of nucleotide positions from the two sequences. These alignment and insertion posterior probabilities are additively combined to obtain probabilities of co-incidence for nucleotide position pairs. A suitable alignment constraint is obtained by thresholding the co-incidence probabilities. The constraint is integrated with Dynalign, a free energy minimization algorithm for joint alignment and secondary structure prediction. The resulting method is benchmarked against the previous version of Dynalign and against other programs for pairwise RNA structure prediction. Results The proposed technique eliminates manual parameter selection in Dynalign and provides significant computational time savings in comparison to prior constraints in Dynalign while simultaneously providing a small improvement in the structural prediction accuracy. Savings are also realized in memory. In experiments over a 5S RNA dataset with average sequence length of approximately 120 nucleotides, the method reduces computation by a factor of 2. The method performs favorably in comparison to other programs for pairwise RNA structure prediction: yielding better accuracy, on average, and requiring significantly lesser computational resources. Conclusion Probabilistic analysis can be utilized in order to automate the determination of alignment constraints for pairwise RNA structure prediction methods in a principled fashion. These constraints can reduce the computational and memory requirements of these methods while maintaining or improving their accuracy of structural prediction. This extends the practical reach of these methods to longer length sequences. The revised Dynalign code is freely available for download. PMID:17445273
Regionally clustered ABCC8 polymorphisms in a prospective cohort predict cerebral oedema and outcome in severe traumatic brain injury.

PubMed

Jha, Ruchira Menka; Koleck, Theresa A; Puccio, Ava M; Okonkwo, David O; Park, Seo-Young; Zusman, Benjamin E; Clark, Robert S B; Shutter, Lori A; Wallisch, Jessica S; Empey, Philip E; Kochanek, Patrick M; Conley, Yvette P

2018-04-19

ABCC8 encodes sulfonylurea receptor 1, a key regulatory protein of cerebral oedema in many neurological disorders including traumatic brain injury (TBI). Sulfonylurea-receptor-1 inhibition has been promising in ameliorating cerebral oedema in clinical trials. We evaluated whether ABCC8 tag single-nucleotide polymorphisms predicted oedema and outcome in TBI. DNA was extracted from 485 prospectively enrolled patients with severe TBI. 410 were analysed after quality control. ABCC8 tag single-nucleotide polymorphisms (SNPs) were identified (Hapmap, r 2 >0.8, minor-allele frequency >0.20) and sequenced (iPlex-Gold, MassArray). Outcomes included radiographic oedema, intracranial pressure (ICP) and 3-month Glasgow Outcome Scale (GOS) score. Proxy SNPs, spatial modelling, amino acid topology and functional predictions were determined using established software programs. Wild-type rs7105832 and rs2237982 alleles and genotypes were associated with lower average ICP (β=-2.91, p=0.001; β=-2.28, p=0.003) and decreased radiographic oedema (OR 0.42, p=0.012; OR 0.52, p=0.017). Wild-type rs2237982 also increased favourable 3-month GOS (OR 2.45, p=0.006); this was partially mediated by oedema (p=0.03). Different polymorphisms predicted 3-month outcome: variant rs11024286 increased (OR 1.84, p=0.006) and wild-type rs4148622 decreased (OR 0.40, p=0.01) the odds of favourable outcome. Significant tag and concordant proxy SNPs regionally span introns/exons 2-15 of the 39-exon gene. This study identifies four ABCC8 tag SNPs associated with cerebral oedema and/or outcome in TBI, tagging a region including 33 polymorphisms. In polymorphisms predictive of oedema, variant alleles/genotypes confer increased risk. Different variant polymorphisms were associated with favourable outcome, potentially suggesting distinct mechanisms. Significant polymorphisms spatially clustered flanking exons encoding the sulfonylurea receptor site and transmembrane domain 0/loop 0 (juxtaposing the channel pore/binding site). This, if validated, may help build a foundation for developing future strategies that may guide individualised care, treatment response, prognosis and patient selection for clinical trials. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Predictive ability of direct genomic values for lifetime net merit of Holstein sires using selected subsets of single nucleotide polymorphism markers.

PubMed

Weigel, K A; de los Campos, G; González-Recio, O; Naya, H; Wu, X L; Long, N; Rosa, G J M; Gianola, D

2009-10-01

The objective of the present study was to assess the predictive ability of subsets of single nucleotide polymorphism (SNP) markers for development of low-cost, low-density genotyping assays in dairy cattle. Dense SNP genotypes of 4,703 Holstein bulls were provided by the USDA Agricultural Research Service. A subset of 3,305 bulls born from 1952 to 1998 was used to fit various models (training set), and a subset of 1,398 bulls born from 1999 to 2002 was used to evaluate their predictive ability (testing set). After editing, data included genotypes for 32,518 SNP and August 2003 and April 2008 predicted transmitting abilities (PTA) for lifetime net merit (LNM$), the latter resulting from progeny testing. The Bayesian least absolute shrinkage and selection operator method was used to regress August 2003 PTA on marker covariates in the training set to arrive at estimates of marker effects and direct genomic PTA. The coefficient of determination (R(2)) from regressing the April 2008 progeny test PTA of bulls in the testing set on their August 2003 direct genomic PTA was 0.375. Subsets of 300, 500, 750, 1,000, 1,250, 1,500, and 2,000 SNP were created by choosing equally spaced and highly ranked SNP, with the latter based on the absolute value of their estimated effects obtained from the training set. The SNP effects were re-estimated from the training set for each subset of SNP, and the 2008 progeny test PTA of bulls in the testing set were regressed on corresponding direct genomic PTA. The R(2) values for subsets of 300, 500, 750, 1,000, 1,250, 1,500, and 2,000 SNP with largest effects (evenly spaced SNP) were 0.184 (0.064), 0.236 (0.111), 0.269 (0.190), 0.289 (0.179), 0.307 (0.228), 0.313 (0.268), and 0.322 (0.291), respectively. These results indicate that a low-density assay comprising selected SNP could be a cost-effective alternative for selection decisions and that significant gains in predictive ability may be achieved by increasing the number of SNP allocated to such an assay from 300 or fewer to 1,000 or more.
Single-Nucleotide Polymorphisms Associated with Skin Naphthyl–Keratin Adduct Levels in Workers Exposed to Naphthalene

PubMed Central

Jiang, Rong; French, John E.; Stober, Vandy P.; Kang-Sickel, Juei-Chuan C.; Zou, Fei

2012-01-01

Background: Individual genetic variation that results in differences in systemic response to xenobiotic exposure is not accounted for as a predictor of outcome in current exposure assessment models. Objective: We developed a strategy to investigate individual differences in single-nucleotide polymorphisms (SNPs) as genetic markers associated with naphthyl–keratin adduct (NKA) levels measured in the skin of workers exposed to naphthalene. Methods: The SNP-association analysis was conducted in PLINK using candidate-gene analysis and genome-wide analysis. We identified significant SNP–NKA associations and investigated the potential impact of these SNPs along with personal and workplace factors on NKA levels using a multiple linear regression model and the Pratt index. Results: In candidate-gene analysis, a SNP (rs4852279) located near the CYP26B1 gene contributed to the 2-naphthyl–keratin adduct (2NKA) level. In the multiple linear regression model, the SNP rs4852279, dermal exposure, exposure time, task replacing foam, age, and ethnicity all were significant predictors of 2NKA level. In genome-wide analysis, no single SNP reached genome-wide significance for NKA levels (all p ≥ 1.05 × 10–5). Pathway and network analyses of SNPs associated with NKA levels were predicted to be involved in the regulation of cellular processes and homeostasis. Conclusions: These results provide evidence that a quantitative biomarker can be used as an intermediate phenotype when investigating the association between genetic markers and exposure–dose relationship in a small, well-characterized exposed worker population. PMID:22391508
Warfarin pharmacogenetics: a single VKORC1 polymorphism is predictive of dose across 3 racial groups.

PubMed

Limdi, Nita A; Wadelius, Mia; Cavallari, Larisa; Eriksson, Niclas; Crawford, Dana C; Lee, Ming-Ta M; Chen, Chien-Hsiun; Motsinger-Reif, Alison; Sagreiya, Hersh; Liu, Nianjun; Wu, Alan H B; Gage, Brian F; Jorgensen, Andrea; Pirmohamed, Munir; Shin, Jae-Gook; Suarez-Kurtz, Guilherme; Kimmel, Stephen E; Johnson, Julie A; Klein, Teri E; Wagner, Michael J

2010-05-06

Warfarin-dosing algorithms incorporating CYP2C9 and VKORC1 -1639G>A improve dose prediction compared with algorithms based solely on clinical and demographic factors. However, these algorithms better capture dose variability among whites than Asians or blacks. Herein, we evaluate whether other VKORC1 polymorphisms and haplotypes explain additional variation in warfarin dose beyond that explained by VKORC1 -1639G>A among Asians (n = 1103), blacks (n = 670), and whites (n = 3113). Participants were recruited from 11 countries as part of the International Warfarin Pharmacogenetics Consortium effort. Evaluation of the effects of individual VKORC1 single nucleotide polymorphisms (SNPs) and haplotypes on warfarin dose used both univariate and multi variable linear regression. VKORC1 -1639G>A and 1173C>T individually explained the greatest variance in dose in all 3 racial groups. Incorporation of additional VKORC1 SNPs or haplotypes did not further improve dose prediction. VKORC1 explained greater variability in dose among whites than blacks and Asians. Differences in the percentage of variance in dose explained by VKORC1 across race were largely accounted for by the frequency of the -1639A (or 1173T) allele. Thus, clinicians should recognize that, although at a population level, the contribution of VKORC1 toward dose requirements is higher in whites than in nonwhites; genotype predicts similar dose requirements across racial groups.
Adolescent idiopathic scoliosis and the single-nucleotide polymorphism of the growth hormone receptor and IGF-1 genes.

PubMed

Yang, Yong; Wu, Zhihong; Zhao, Taimao; Wang, Hai; Zhao, Dong; Zhang, Jianguo; Wang, Yipeng; Ding, Yaozhong; Qiu, Guixing

2009-06-01

The etiology of adolescent idiopathic scoliosis is undetermined despite years of research. A number of hypotheses have been postulated to explain its development, including growth abnormalities. The irregular expression of growth hormone and insulin-like growth factor-1 (IGF-1) may disturb hormone metabolism, result in a gross asymmetry, and promote the progress of adolescent idiopathic scoliosis. Initial association studies in complex diseases have demonstrated the power of candidate gene association. Prior to our study, 1 study in this field had a negative result. A replicable study is vital for reliability. To determine the relationship of growth hormone receptor and IGF-1 genes with adolescent idiopathic scoliosis, a population-based association study was performed. Single nucleotide polymorphisms with potential function were selected from candidate genes and a distribution analysis was performed. A conclusion was made confirming the insufficiency of an association between adolescent idiopathic scoliosis and the single-nucleotide polymorphism of the growth hormone receptor and IGF-1 genes in Han Chinese.
A single splice site mutation in human-specific ARHGAP11B causes basal progenitor amplification

PubMed Central

Florio, Marta; Namba, Takashi; Pääbo, Svante; Hiller, Michael; Huttner, Wieland B.

2016-01-01

The gene ARHGAP11B promotes basal progenitor amplification and is implicated in neocortex expansion. It arose on the human evolutionary lineage by partial duplication of ARHGAP11A, which encodes a Rho guanosine triphosphatase–activating protein (RhoGAP). However, a lack of 55 nucleotides in ARHGAP11B mRNA leads to loss of RhoGAP activity by GAP domain truncation and addition of a human-specific carboxy-terminal amino acid sequence. We show that these 55 nucleotides are deleted by mRNA splicing due to a single C→G substitution that creates a novel splice donor site. We reconstructed an ancestral ARHGAP11B complementary DNA without this substitution. Ancestral ARHGAP11B exhibits RhoGAP activity but has no ability to increase basal progenitors during neocortex development. Hence, a single nucleotide substitution underlies the specific properties of ARHGAP11B that likely contributed to the evolutionary expansion of the human neocortex. PMID:27957544
Failure of replicating the association between hippocampal volume and 3 single-nucleotide polymorphisms identified from the European genome-wide association study in Asian populations.

PubMed

Li, Ming; Ohi, Kazutaka; Chen, Chunhui; He, Qinghua; Liu, Jie-Wei; Chen, Chuansheng; Luo, Xiong-Jian; Dong, Qi; Hashimoto, Ryota; Su, Bing

2014-12-01

Hippocampal volume is a key brain structure for learning ability and memory process, and hippocampal atrophy is a recognized biological marker of Alzheimer's disease. However, the genetic bases of hippocampal volume are still unclear although it is a heritable trait. Genome-wide association studies (GWASs) on hippocampal volume have implicated several significantly associated genetic variants in Europeans. Here, to test the contributions of these GWASs identified genetic variants to hippocampal volume in different ethnic populations, we screened the GWAS-identified candidate single-nucleotide polymorphisms in 3 independent healthy Asian brain imaging samples (a total of 990 subjects). The results showed that none of these single-nucleotide polymorphisms were associated with hippocampal volume in either individual or combined Asian samples. The replication results suggested a complexity of genetic architecture for hippocampal volume and potential genetic heterogeneity between different ethnic populations. Copyright © 2014 Elsevier Inc. All rights reserved.
Detecting Single-Nucleotide Substitutions Induced by Genome Editing.

PubMed

Miyaoka, Yuichiro; Chan, Amanda H; Conklin, Bruce R

2016-08-01

The detection of genome editing is critical in evaluating genome-editing tools or conditions, but it is not an easy task to detect genome-editing events-especially single-nucleotide substitutions-without a surrogate marker. Here we introduce a procedure that significantly contributes to the advancement of genome-editing technologies. It uses droplet digital polymerase chain reaction (ddPCR) and allele-specific hydrolysis probes to detect single-nucleotide substitutions generated by genome editing (via homology-directed repair, or HDR). HDR events that introduce substitutions using donor DNA are generally infrequent, even with genome-editing tools, and the outcome is only one base pair difference in 3 billion base pairs of the human genome. This task is particularly difficult in induced pluripotent stem (iPS) cells, in which editing events can be very rare. Therefore, the technological advances described here have implications for therapeutic genome editing and experimental approaches to disease modeling with iPS cells. © 2016 Cold Spring Harbor Laboratory Press.
ESR1 single nucleotide polymorphisms predict breast cancer susceptibility in the central European Caucasian population.

PubMed

Lipphardt, Mark F; Deryal, Mustafa; Ong, Mei Fang; Schmidt, Werner; Mahlknecht, Ulrich

2013-01-01

Estrogen and progesterone hormones are key regulators of a wide variety of biological processes. In addition to their influence on reproduction, cell differentiation and apoptosis, they affect inflammatory response, cell metabolism and most importantly, they regulate physiological breast tissue proliferation and differentiation as well as the development and progression of breast cancer. In order to assess whether genetic variants in the steroid hormone receptor gene ESR1 (estrogen receptor alpha) had an effect on sporadic breast cancer susceptibility, we assessed 7 ESR1 single nucleotide polymorphisms (SNPs) for associations with breast cancer susceptibility and clinical parameters in 221 breast cancer patients and 221 controls, respectively. We identified ESR1 intron SNP +2464 C/T (rs3020314) and ESR1 intron SNP -4576 A/C (rs1514348) to correlate with breast cancer susceptibility and progesterone receptor expression status. Patients genotyped CT for ESR1 intron SNP +2464 (rs3020314) (p ≤ 0.045) or genotyped AC for ESR1 intron SNP -4576 (rs1514348) (p ≤ 0.000026) were identified to carry a significant risk as to the development of breast cancer in the Central European Caucasian population (both together: p ≤ 0.000488). Our study could confirm previous associations and revealed new associations of SNP rs1514348 with susceptibility to breast cancer and clinical outcome, which might be used as new additional SNP markers.
Multiple thrombophilic single nucleotide polymorphisms lack a significant effect on outcomes in fresh IVF cycles: an analysis of 1717 patients.

PubMed

Patounakis, George; Bergh, Eric; Forman, Eric J; Tao, Xin; Lonczak, Agnieszka; Franasiak, Jason M; Treff, Nathan; Scott, Richard T

2016-01-01

The aim of the study is to determine if thrombophilic single nucleotide polymorphisms (SNPs) affect outcomes in fresh in vitro fertilization (IVF) cycles in a large general infertility population. A prospective cohort analysis was performed at a university-affiliated private IVF center of female patients undergoing fresh non-donor IVF cycles. The effect of the following thrombophilic SNPs on IVF outcomes were explored: factor V (Leiden and H1299R), prothrombin (G20210A), factor XIII (V34L), β-fibrinogen (-455G → A), plasminogen activator inhibitor-1 (4G/5G), human platelet antigen-1 (a/b9L33P), and methylenetetrahydrofolate reductase (C677T and A1298C). The main outcome measures included positive pregnancy test, clinical pregnancy, embryo implantation, live birth, and pregnancy loss. Patients (1717) were enrolled in the study, and a total of 4169 embryos were transferred. There were no statistically significant differences in positive pregnancy test, clinical pregnancy, embryo implantation, live birth, or pregnancy loss in the analysis of 1717 patients attempting their first cycle of IVF. Receiver operator characteristics and logistic regression analyses showed that outcomes cannot be predicted by the cumulative number of thrombophilic mutations present in the patient. Individual and cumulative thrombophilic SNPs do not affect IVF outcomes. Therefore, initial screening for these SNPs is not indicated.
A noncoding melanophilin gene (MLPH) SNP at the splice donor of exon 1 represents a candidate causal mutation for coat color dilution in dogs.

PubMed

Drögemüller, Cord; Philipp, Ute; Haase, Bianca; Günzel-Apel, Anne-Rose; Leeb, Tosso

2007-01-01

Coat color dilution in several breeds of dog is characterized by a specific pigmentation phenotype and sometimes accompanied by hair loss and recurrent skin inflammation, the so-called color dilution alopecia or black hair follicular dysplasia. Coat color dilution (d) is inherited as a Mendelian autosomal recessive trait. In a previous study, MLPH polymorphisms showed perfect cosegregation with the dilute phenotype within breeds. However, different dilute haplotypes were found in different breeds, and no single polymorphism was identified in the coding sequence that was likely to be causative for the dilute phenotype. We resequenced the 5'-region of the canine MLPH gene and identified a strong candidate single nucleotide polymorphism within the nontranslated exon 1, which showed perfect association to the dilute phenotype in 65 dilute dogs from 7 different breeds. The A/G polymorphism is located at the last nucleotide of exon 1 and the mutant A-allele is predicted to reduce splicing efficiency 8-fold. An MLPH mRNA expression study using quantitative reverse transcriptase-polymerase chain reaction confirmed that dd animals had only about approximately 25% of the MLPH transcript compared with DD animals. These results provide preliminary evidence that the reported regulatory MLPH mutation might represent a causal mutation for coat color dilution in dogs.
Prenatal detection of fetal triploidy from cell-free DNA testing in maternal blood.

PubMed

Nicolaides, Kypros H; Syngelaki, Argyro; del Mar Gil, Maria; Quezada, Maria Soledad; Zinevich, Yana

2014-01-01

To investigate potential performance of cell-free DNA (cfDNA) testing in maternal blood in detecting fetal triploidy. Plasma and buffy coat samples obtained at 11-13 weeks' gestation from singleton pregnancies with diandric triploidy (n=4), digynic triploidy (n=4), euploid fetuses (n=48) were sent to Natera, Inc. (San Carlos, Calif., USA) for cfDNA testing. Multiplex polymerase chain reaction amplification of cfDNA followed by sequencing of single nucleotide polymorphic loci covering chromosomes 13, 18, 21, X, and Y was performed. Sequencing data were analyzed using the NATUS algorithm which identifies copy number for each of the five chromosomes. cfDNA testing provided a result in 44 (91.7%) of the 48 euploid cases and correctly predicted the fetal sex and the presence of two copies each of chromosome 21, 18 and 13. In diandric triploidy, cfDNA testing identified multiple paternal haplotypes (indicating fetal trisomy 21, trisomy 18 and trisomy 13) suggesting the presence of either triploidy or dizygotic twins. In digynic triploidy the fetal fraction corrected for maternal weight and gestational age was below the 0.5th percentile. cfDNA testing by targeted sequencing and allelic ratio analysis of single nucleotide polymorphisms covering chromosomes 21, 18, 13, X, and Y can detect diandric triploidy and raise the suspicion of digynic triploidy. © 2013 S. Karger AG, Basel.

Joint Identification of Genetic Variants for Physical Activity in Korean Population

PubMed Central

Kim, Jayoun; Kim, Jaehee; Min, Haesook; Oh, Sohee; Kim, Yeonjung; Lee, Andy H.; Park, Taesung

2014-01-01

There has been limited research on genome-wide association with physical activity (PA). This study ascertained genetic associations between PA and 344,893 single nucleotide polymorphism (SNP) markers in 8842 Korean samples. PA data were obtained from a validated questionnaire that included information on PA intensity and duration. Metabolic equivalent of tasks were calculated to estimate the total daily PA level for each individual. In addition to single- and multiple-SNP association tests, a pathway enrichment analysis was performed to identify the biological significance of SNP markers. Although no significant SNP was found at genome-wide significance level via single-SNP association tests, 59 genetic variants mapped to 76 genes were identified via a multiple SNP approach using a bootstrap selection stability measure. Pathway analysis for these 59 variants showed that maturity onset diabetes of the young (MODY) was enriched. Joint identification of SNPs could enable the identification of multiple SNPs with good predictive power for PA and a pathway enriched for PA. PMID:25026172
HRM and SNaPshot as alternative forensic SNP genotyping methods.

PubMed

Mehta, Bhavik; Daniel, Runa; McNevin, Dennis

2017-09-01

Single nucleotide polymorphisms (SNPs) have been widely used in forensics for prediction of identity, biogeographical ancestry (BGA) and externally visible characteristics (EVCs). Single base extension (SBE) assays, most notably SNaPshot® (Thermo Fisher Scientific), are commonly used for forensic SNP genotyping as they can be employed on standard instrumentation in forensic laboratories (e.g. capillary electrophoresis). High resolution melt (HRM) analysis is an alternative method and is a simple, fast, single tube assay for low throughput SNP typing. This study compares HRM and SNaPshot®. HRM produced reproducible and concordant genotypes at 500 pg, however, difficulties were encountered when genotyping SNPs with high GC content in flanking regions and differentiating variants of symmetrical SNPs. SNaPshot® was reproducible at 100 pg and is less dependent on SNP choice. HRM has a shorter processing time in comparison to SNaPshot®, avoids post PCR contamination risk and has potential as a screening tool for many forensic applications.
Genomic prediction of the polled and horned phenotypes in Merino sheep.

PubMed

Duijvesteijn, Naomi; Bolormaa, Sunduimijid; Daetwyler, Hans D; van der Werf, Julius H J

2018-05-22

In horned sheep breeds, breeding for polledness has been of interest for decades. The objective of this study was to improve prediction of the horned and polled phenotypes using horn scores classified as polled, scurs, knobs or horns. Derived phenotypes polled/non-polled (P/NP) and horned/non-horned (H/NH) were used to test four different strategies for prediction in 4001 purebred Merino sheep. These strategies include the use of single 'single nucleotide polymorphism' (SNP) genotypes, multiple-SNP haplotypes, genome-wide and chromosome-wide genomic best linear unbiased prediction and information from imputed sequence variants from the region including the RXFP2 gene. Low-density genotypes of these animals were imputed to the Illumina Ovine high-density (600k) chip and the 1.78-kb insertion polymorphism in RXFP2 was included in the imputation process to whole-genome sequence. We evaluated the mode of inheritance and validated models by a fivefold cross-validation and across- and between-family prediction. The most significant SNPs for prediction of P/NP and H/NH were OAR10_29546872.1 and OAR10_29458450, respectively, located on chromosome 10 close to the 1.78-kb insertion at 29.5 Mb. The mode of inheritance included an additive effect and a sex-dependent effect for dominance for P/NP and a sex-dependent additive and dominance effect for H/NH. Models with the highest prediction accuracies for H/NH used either single SNPs or 3-SNP haplotypes and included a polygenic effect estimated based on traditional pedigree relationships. Prediction accuracies for H/NH were 0.323 for females and 0.725 for males. For predicting P/NP, the best models were the same as for H/NH but included a genomic relationship matrix with accuracies of 0.713 for females and 0.620 for males. Our results show that prediction accuracy is high using a single SNP, but does not reach 1 since the causative mutation is not genotyped. Incomplete penetrance or allelic heterogeneity, which can influence expression of the phenotype, may explain why prediction accuracy did not approach 1 with any of the genetic models tested here. Nevertheless, a breeding program to eradicate horns from Merino sheep can be effective by selecting genotypes GG of SNP OAR10_29458450 or TT of SNP OAR10_29546872.1 since all sheep with these genotypes will be non-horned.
SNPGenie: estimating evolutionary parameters to detect natural selection using pooled next-generation sequencing data.

PubMed

Nelson, Chase W; Moncla, Louise H; Hughes, Austin L

2015-11-15

New applications of next-generation sequencing technologies use pools of DNA from multiple individuals to estimate population genetic parameters. However, no publicly available tools exist to analyse single-nucleotide polymorphism (SNP) calling results directly for evolutionary parameters important in detecting natural selection, including nucleotide diversity and gene diversity. We have developed SNPGenie to fill this gap. The user submits a FASTA reference sequence(s), a Gene Transfer Format (.GTF) file with CDS information and a SNP report(s) in an increasing selection of formats. The program estimates nucleotide diversity, distance from the reference and gene diversity. Sites are flagged for multiple overlapping reading frames, and are categorized by polymorphism type: nonsynonymous, synonymous, or ambiguous. The results allow single nucleotide, single codon, sliding window, whole gene and whole genome/population analyses that aid in the detection of positive and purifying natural selection in the source population. SNPGenie version 1.2 is a Perl program with no additional dependencies. It is free, open-source, and available for download at https://github.com/hugheslab/snpgenie. nelsoncw@email.sc.edu or austin@biol.sc.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Base Preferences in Non-Templated Nucleotide Incorporation by MMLV-Derived Reverse Transcriptases

PubMed Central

Zajac, Pawel; Islam, Saiful; Hochgerner, Hannah; Lönnerberg, Peter; Linnarsson, Sten

2013-01-01

Reverse transcriptases derived from Moloney Murine Leukemia Virus (MMLV) have an intrinsic terminal transferase activity, which causes the addition of a few non-templated nucleotides at the 3´ end of cDNA, with a preference for cytosine. This mechanism can be exploited to make the reverse transcriptase switch template from the RNA molecule to a secondary oligonucleotide during first-strand cDNA synthesis, and thereby to introduce arbitrary barcode or adaptor sequences in the cDNA. Because the mechanism is relatively efficient and occurs in a single reaction, it has recently found use in several protocols for single-cell RNA sequencing. However, the base preference of the terminal transferase activity is not known in detail, which may lead to inefficiencies in template switching when starting from tiny amounts of mRNA. Here, we used fully degenerate oligos to determine the exact base preference at the template switching site up to a distance of ten nucleotides. We found a strong preference for guanosine at the first non-templated nucleotide, with a greatly reduced bias at progressively more distant positions. Based on this result, and a number of careful optimizations, we report conditions for efficient template switching for cDNA amplification from single cells. PMID:24392002
Assessment of genetic and nongenetic interactions for the prediction of depressive symptomatology: an analysis of the Wisconsin Longitudinal Study using machine learning algorithms.

PubMed

Roetker, Nicholas S; Page, C David; Yonker, James A; Chang, Vicky; Roan, Carol L; Herd, Pamela; Hauser, Taissa S; Hauser, Robert M; Atwood, Craig S

2013-10-01

We examined depression within a multidimensional framework consisting of genetic, environmental, and sociobehavioral factors and, using machine learning algorithms, explored interactions among these factors that might better explain the etiology of depressive symptoms. We measured current depressive symptoms using the Center for Epidemiologic Studies Depression Scale (n = 6378 participants in the Wisconsin Longitudinal Study). Genetic factors were 78 single nucleotide polymorphisms (SNPs); environmental factors-13 stressful life events (SLEs), plus a composite proportion of SLEs index; and sociobehavioral factors-18 personality, intelligence, and other health or behavioral measures. We performed traditional SNP associations via logistic regression likelihood ratio testing and explored interactions with support vector machines and Bayesian networks. After correction for multiple testing, we found no significant single genotypic associations with depressive symptoms. Machine learning algorithms showed no evidence of interactions. Naïve Bayes produced the best models in both subsets and included only environmental and sociobehavioral factors. We found no single or interactive associations with genetic factors and depressive symptoms. Various environmental and sociobehavioral factors were more predictive of depressive symptoms, yet their impacts were independent of one another. A genome-wide analysis of genetic alterations using machine learning methodologies will provide a framework for identifying genetic-environmental-sociobehavioral interactions in depressive symptoms.
The complete nucleotide sequence of the glnALG operon of Escherichia coli K12.

PubMed Central

Miranda-Ríos, J; Sánchez-Pescador, R; Urdea, M; Covarrubias, A A

1987-01-01

The nucleotide sequence of the E. coli glnALG operon has been determined. The glnL (ntrB) and glnG (ntrC) genes present a high homology, at the nucleotide and aminoacid levels, with the corresponding genes of Klebsiella pneumoniae. The predicted aminoacid sequence for glutamine synthetase allowed us to locate some of the enzyme domains. The structure of this operon is discussed. PMID:2882477
A Polymorphism in the Retinol Binding Protein 4 Gene is Not Associated with Gestational Diabetes Mellitus in Several Different Ethnic Groups

PubMed Central

Urschitz, Johann; Sultan, Omar; Ward, Kenneth

2011-01-01

Objective Various Asian and Pacifific Islander groups have higher prevalence rates of type 2 diabetes and gestational diabetes. This increased incidence is likely to include genetic factors. Single nucleotide polymorphisms in the retinol binding protein 4 gene have been linked to the occurrence of type 2 diabetes. Hypothesizing a link between retinol binding protein 4 and gestational diabetes, we performed a candidate gene study to look for an association between an important retinol binding protein gene polymorphism (rs3758539) and gestational diabetes. Study Design Blood was collected from Caucasian, Asian, and Pacific Islander women diagnosed with gestational diabetes and from ethnically matched non-diabetic controls. DNA was extracted and real time PCR technology (TaqMan, Applied Biosystems) used to screen for the rs3758539 single nucleotide polymorphism located 5′ of exon 1 of the retinol binding protein 4 gene. Results Genotype and allele frequencies in the controls and gestational diabetes cases were tested using chi-square contingency tests. Genotype frequencies were in Hardy-Weinberg equilibrium. There was no association between the rs3758539 retinol binding protein 4 single nucleotide polymorphism and gestational diabetes in the Caucasian, Filipino, or Pacific Islander groups. Conclusion Interestingly, the rs3758539 retinol binding protein 4 single nucleotide polymorphism was not found to be associated with gestational diabetes. The absence of association suggests that gestational and type 2 diabetes may have more divergent molecular pathophysiology than previously suspected. PMID:21886308
Pooled genome wide association detects association upstream of FCRL3 with Graves' disease.

PubMed

Khong, Jwu Jin; Burdon, Kathryn P; Lu, Yi; Laurie, Kate; Leonardos, Lefta; Baird, Paul N; Sahebjada, Srujana; Walsh, John P; Gajdatsy, Adam; Ebeling, Peter R; Hamblin, Peter Shane; Wong, Rosemary; Forehan, Simon P; Fourlanos, Spiros; Roberts, Anthony P; Doogue, Matthew; Selva, Dinesh; Montgomery, Grant W; Macgregor, Stuart; Craig, Jamie E

2016-11-18

Graves' disease is an autoimmune thyroid disease of complex inheritance. Multiple genetic susceptibility loci are thought to be involved in Graves' disease and it is therefore likely that these can be identified by genome wide association studies. This study aimed to determine if a genome wide association study, using a pooling methodology, could detect genomic loci associated with Graves' disease. Nineteen of the top ranking single nucleotide polymorphisms including HLA-DQA1 and C6orf10, were clustered within the Major Histo-compatibility Complex region on chromosome 6p21, with rs1613056 reaching genome wide significance (p = 5 × 10 -8 ). Technical validation of top ranking non-Major Histo-compatablity complex single nucleotide polymorphisms with individual genotyping in the discovery cohort revealed four single nucleotide polymorphisms with p ≤ 10 -4 . Rs17676303 on chromosome 1q23.1, located upstream of FCRL3, showed evidence of association with Graves' disease across the discovery, replication and combined cohorts. A second single nucleotide polymorphism rs9644119 downstream of DPYSL2 showed some evidence of association supported by finding in the replication cohort that warrants further study. Pooled genome wide association study identified a genetic variant upstream of FCRL3 as a susceptibility locus for Graves' disease in addition to those identified in the Major Histo-compatibility Complex. A second locus downstream of DPYSL2 is potentially a novel genetic variant in Graves' disease that requires further confirmation.
Association of glutathione S-transferase pi isoform single-nucleotide polymorphisms with exudative age-related macular degeneration in a Chinese population.

PubMed

Gu, Hong; Sun, Erdan; Cui, Lei; Yang, Xiufen; Lim, Apiradee; Xu, Jun; Snellingen, Torkel; Liu, Xipu; Wang, Ningli; Liu, Ningpu

2012-10-01

To investigate the association between single-nucleotide polymorphisms in the pi isoform of glutathione S-transferase (GSTP1) gene and the risk of exudative age-related macular degeneration (AMD) in a Chinese case-control cohort. A total of 131 Chinese patients with exudative AMD and 138 control individuals were recruited. Genomic DNA was extracted from venous blood leukocytes. Two common nonsynonymous single-nucleotide polymorphisms in GSTP1 (rs1695 and rs1138272) were genotyped by polymerase chain reaction followed by allele-specific restriction enzyme digestion and direct sequencing. Significant association with exudative AMD was detected for single-nucleotide polymorphism, rs1695 (P = 0.019). The risk G allele frequencies were 21.8% in AMD patients and 12.7% in control subjects (P = 0.007). Compared with the wild-type AA genotype, odds ratio for the risk of AMD was 1.91 (95% confidence interval, 1.09-3.35) for the heterozygous AG genotype and 2.52 (95% confidence interval, 0.6-10.61) for the homozygous GG genotype. In contrast, rs1138272 was not associated with exudative AMD (P = 1.00). The risk G allele frequencies of rs1138272 were 0.4% in AMD patients and 0.4% in control subjects (P = 1.00). Our data suggest that the GSTP1 variant rs1695 moderately increases the risk of exudative AMD. The variant rs1138272 was rare and was not associated with exudative AMD in this Chinese cohort.
Contribution of 20 single nucleotide polymorphisms of 13 genes to dyslipidemia associated with antiretroviral therapy.

PubMed

Arnedo, Mireia; Taffé, Patrick; Sahli, Roland; Furrer, Hansjakob; Hirschel, Bernard; Elzi, Luigia; Weber, Rainer; Vernazza, Pietro; Bernasconi, Enos; Darioli, Roger; Bergmann, Sven; Beckmann, Jacques S; Telenti, Amalio; Tarr, Philip E

2007-09-01

HIV-1 infected individuals have an increased cardiovascular risk which is partially mediated by dyslipidemia. Single nucleotide polymorphisms in multiple genes involved in lipid transport and metabolism are presumed to modulate the risk of dyslipidemia in response to antiretroviral therapy. The contribution to dyslipidemia of 20 selected single nucleotide polymorphisms of 13 genes reported in the literature to be associated with plasma lipid levels (ABCA1, ADRB2, APOA5, APOC3, APOE, CETP, LIPC, LIPG, LPL, MDR1, MTP, SCARB1, and TNF) was assessed by longitudinally modeling more than 4400 plasma lipid determinations in 438 antiretroviral therapy-treated participants during a median period of 4.8 years. An exploratory genetic score was tested that takes into account the cumulative contribution of multiple gene variants to plasma lipids. Variants of ABCA1, APOA5, APOC3, APOE, and CETP contributed to plasma triglyceride levels, particularly in the setting of ritonavir-containing antiretroviral therapy. Variants of APOA5 and CETP contributed to high-density lipoprotein-cholesterol levels. Variants of CETP and LIPG contributed to non-high-density lipoprotein-cholesterol levels, a finding not reported previously. Sustained hypertriglyceridemia and low high-density lipoprotein-cholesterol during the study period was significantly associated with the genetic score. Single nucleotide polymorphisms of ABCA1, APOA5, APOC3, APOE, and CETP contribute to plasma triglyceride and high-density lipoprotein-cholesterol levels during antiretroviral therapy exposure. Genetic profiling may contribute to the identification of patients at risk for antiretroviral therapy-related dyslipidemia.
Computational Prediction of miRNA Genes from Small RNA Sequencing Data

PubMed Central

Kang, Wenjing; Friedländer, Marc R.

2015-01-01

Next-generation sequencing now for the first time allows researchers to gage the depth and variation of entire transcriptomes. However, now as rare transcripts can be detected that are present in cells at single copies, more advanced computational tools are needed to accurately annotate and profile them. microRNAs (miRNAs) are 22 nucleotide small RNAs (sRNAs) that post-transcriptionally reduce the output of protein coding genes. They have established roles in numerous biological processes, including cancers and other diseases. During miRNA biogenesis, the sRNAs are sequentially cleaved from precursor molecules that have a characteristic hairpin RNA structure. The vast majority of new miRNA genes that are discovered are mined from small RNA sequencing (sRNA-seq), which can detect more than a billion RNAs in a single run. However, given that many of the detected RNAs are degradation products from all types of transcripts, the accurate identification of miRNAs remain a non-trivial computational problem. Here, we review the tools available to predict animal miRNAs from sRNA sequencing data. We present tools for generalist and specialist use cases, including prediction from massively pooled data or in species without reference genome. We also present wet-lab methods used to validate predicted miRNAs, and approaches to computationally benchmark prediction accuracy. For each tool, we reference validation experiments and benchmarking efforts. Last, we discuss the future of the field. PMID:25674563
Hybridization properties of long nucleic acid probes for detection of variable target sequences, and development of a hybridization prediction algorithm

PubMed Central

Öhrmalm, Christina; Jobs, Magnus; Eriksson, Ronnie; Golbob, Sultan; Elfaitouri, Amal; Benachenhou, Farid; Strømme, Maria; Blomberg, Jonas

2010-01-01

One of the main problems in nucleic acid-based techniques for detection of infectious agents, such as influenza viruses, is that of nucleic acid sequence variation. DNA probes, 70-nt long, some including the nucleotide analog deoxyribose-Inosine (dInosine), were analyzed for hybridization tolerance to different amounts and distributions of mismatching bases, e.g. synonymous mutations, in target DNA. Microsphere-linked 70-mer probes were hybridized in 3M TMAC buffer to biotinylated single-stranded (ss) DNA for subsequent analysis in a Luminex® system. When mismatches interrupted contiguous matching stretches of 6 nt or longer, it had a strong impact on hybridization. Contiguous matching stretches are more important than the same number of matching nucleotides separated by mismatches into several regions. dInosine, but not 5-nitroindole, substitutions at mismatching positions stabilized hybridization remarkably well, comparable to N (4-fold) wobbles in the same positions. In contrast to shorter probes, 70-nt probes with judiciously placed dInosine substitutions and/or wobble positions were remarkably mismatch tolerant, with preserved specificity. An algorithm, NucZip, was constructed to model the nucleation and zipping phases of hybridization, integrating both local and distant binding contributions. It predicted hybridization more exactly than previous algorithms, and has the potential to guide the design of variation-tolerant yet specific probes. PMID:20864443
Can mutational GC-pressure create new linear B-cell epitopes in herpes simplex virus type 1 glycoprotein B?

PubMed

Khrustalev, Vladislav Victorovich

2009-01-01

We showed that GC-content of nucleotide sequences coding for linear B-cell epitopes of herpes simplex virus type 1 (HSV1) glycoprotein B (gB) is higher than GC-content of sequences coding for epitope-free regions of this glycoprotein (G + C = 73 and 64%, respectively). Linear B-cell epitopes have been predicted in HSV1 gB by BepiPred algorithm ( www.cbs.dtu.dk/services/BepiPred ). Proline is an acrophilic amino acid residue (it is usually situated on the surface of protein globules, and so included in linear B-cell epitopes). Indeed, the level of proline is much higher in predicted epitopes of gB than in epitope-free regions (17.8% versus 1.8%). This amino acid is coded by GC-rich codons (CCX) that can be produced due to nucleotide substitutions caused by mutational GC-pressure. GC-pressure will also lead to disappearance of acrophobic phenylalanine, isoleucine, methionine and tyrosine coded by GC-poor codons. Results of our "in-silico directed mutagenesis" showed that single nonsynonymous substitutions in AT to GC direction in two long epitope-free regions of gB will cause formation of new linear epitopes or elongation of previously existing epitopes flanking these regions in 25% of 539 possible cases. The calculations of GC-content and amino acid content have been performed by CodonChanges algorithm ( www.barkovsky.hotmail.ru ).
Genomic Prediction and Association Mapping of Curd-Related Traits in Gene Bank Accessions of Cauliflower.

PubMed

Thorwarth, Patrick; Yousef, Eltohamy A A; Schmid, Karl J

2018-02-02

Genetic resources are an important source of genetic variation for plant breeding. Genome-wide association studies (GWAS) and genomic prediction greatly facilitate the analysis and utilization of useful genetic diversity for improving complex phenotypic traits in crop plants. We explored the potential of GWAS and genomic prediction for improving curd-related traits in cauliflower ( Brassica oleracea var. botrytis ) by combining 174 randomly selected cauliflower gene bank accessions from two different gene banks. The collection was genotyped with genotyping-by-sequencing (GBS) and phenotyped for six curd-related traits at two locations and three growing seasons. A GWAS analysis based on 120,693 single-nucleotide polymorphisms identified a total of 24 significant associations for curd-related traits. The potential for genomic prediction was assessed with a genomic best linear unbiased prediction model and BayesB. Prediction abilities ranged from 0.10 to 0.66 for different traits and did not differ between prediction methods. Imputation of missing genotypes only slightly improved prediction ability. Our results demonstrate that GWAS and genomic prediction in combination with GBS and phenotyping of highly heritable traits can be used to identify useful quantitative trait loci and genotypes among genetically diverse gene bank material for subsequent utilization as genetic resources in cauliflower breeding. Copyright © 2018 Thorwarth et al.
Predictions for Proteins, RNAs and DNAs with the Gaussian Dielectric Function Using DelPhiPKa

PubMed Central

Wang, Lin; Li, Lin; Alexov, Emil

2015-01-01

We developed a Poisson-Boltzmann based approach to calculate the PKa values of protein ionizable residues (Glu, Asp, His, Lys and Arg), nucleotides of RNA and single stranded DNA. Two novel features were utilized: the dielectric properties of the macromolecules and water phase were modeled via the smooth Gaussian-based dielectric function in DelPhi and the corresponding electrostatic energies were calculated without defining the molecular surface. We tested the algorithm by calculating PKa values for more than 300 residues from 32 proteins from the PPD dataset and achieved an overall RMSD of 0.77. Particularly, the RMSD of 0.55 was achieved for surface residues, while the RMSD of 1.1 for buried residues. The approach was also found capable of capturing the large PKa shifts of various single point mutations in staphylococcal nuclease (SNase) from PKa -cooperative dataset, resulting in an overall RMSD of 1.6 for this set of pKa’s. Investigations showed that predictions for most of buried mutant residues of SNase could be improved by using higher dielectric constant values. Furthermore, an option to generate different hydrogen positions also improves PKa predictions for buried carboxyl residues. Finally, the PKa calculations on two RNAs demonstrated the capability of this approach for other types of biomolecules. PMID:26408449
Oxytocin receptor gene variations predict neural and behavioral response to oxytocin in autism

PubMed Central

Watanabe, Takamitsu; Otowa, Takeshi; Abe, Osamu; Kuwabara, Hitoshi; Aoki, Yuta; Natsubori, Tatsunobu; Takao, Hidemasa; Kakiuchi, Chihiro; Kondo, Kenji; Ikeda, Masashi; Iwata, Nakao; Kasai, Kiyoto; Sasaki, Tsukasa

2017-01-01

Abstract Oxytocin appears beneficial for autism spectrum disorder (ASD), and more than 20 single-nucleotide polymorphisms (SNPs) in oxytocin receptor (OXTR) are relevant to ASD. However, neither biological functions of OXTR SNPs in ASD nor critical OXTR SNPs that determine oxytocin’s effects on ASD remains known. Here, using a machine-learning algorithm that was designed to evaluate collective effects of multiple SNPs and automatically identify most informative SNPs, we examined relationships between 27 representative OXTR SNPs and six types of behavioral/neural response to oxytocin in ASD individuals. The oxytocin effects were extracted from our previous placebo-controlled within-participant clinical trial administering single-dose intranasal oxytocin to 38 high-functioning adult Japanese ASD males. Consequently, we identified six different SNP sets that could accurately predict the six different oxytocin efficacies, and confirmed the robustness of these SNP selections against variations of the datasets and analysis parameters. Moreover, major alleles of several prominent OXTR SNPs—including rs53576 and rs2254298—were found to have dissociable effects on the oxytocin efficacies. These findings suggest biological functions of the OXTR SNP variants on autistic oxytocin responses, and implied that clinical oxytocin efficacy may be genetically predicted before its actual administration, which would contribute to establishment of future precision medicines for ASD. PMID:27798253
Sasquatch: predicting the impact of regulatory SNPs on transcription factor binding from cell- and tissue-specific DNase footprints

PubMed Central

Suciu, Maria C.; Telenius, Jelena

2017-01-01

In the era of genome-wide association studies (GWAS) and personalized medicine, predicting the impact of single nucleotide polymorphisms (SNPs) in regulatory elements is an important goal. Current approaches to determine the potential of regulatory SNPs depend on inadequate knowledge of cell-specific DNA binding motifs. Here, we present Sasquatch, a new computational approach that uses DNase footprint data to estimate and visualize the effects of noncoding variants on transcription factor binding. Sasquatch performs a comprehensive k-mer-based analysis of DNase footprints to determine any k-mer's potential for protein binding in a specific cell type and how this may be changed by sequence variants. Therefore, Sasquatch uses an unbiased approach, independent of known transcription factor binding sites and motifs. Sasquatch only requires a single DNase-seq data set per cell type, from any genotype, and produces consistent predictions from data generated by different experimental procedures and at different sequence depths. Here we demonstrate the effectiveness of Sasquatch using previously validated functional SNPs and benchmark its performance against existing approaches. Sasquatch is available as a versatile webtool incorporating publicly available data, including the human ENCODE collection. Thus, Sasquatch provides a powerful tool and repository for prioritizing likely regulatory SNPs in the noncoding genome. PMID:28904015
Associations between single nucleotide polymorphisms in folate uptake and metabolizing genes with blood folate, homocysteine and DNA uracil concentrations

USDA-ARS?s Scientific Manuscript database

Background: Folate is an essential nutrient which supports nucleotide synthesis and biological methylation reactions. Diminished folate status results in chromosome breakage and is associated with several diseases including colorectal cancer. Folate status is also inversely related to plasma homocys...
Genetic diversity of tyrosine hydroxylase (TH) and dopamine β-hydroxylase (DBH) genes in cattle breeds

PubMed Central

Lourenco-Jaramillo, Diana Lelidett; Sifuentes-Rincón, Ana María; Parra-Bracamonte, Gaspar Manuel; de la Rosa-Reyna, Xochitl Fabiola; Segura-Cabrera, Aldo; Arellano-Vera, Williams

2012-01-01

DNA from four cattle breeds was used to re-sequence all of the exons and 56% of the introns of the bovine tyrosine hydroxylase (TH) gene and 97% and 13% of the bovine dopamine β-hydroxylase (DBH) coding and non-coding sequences, respectively. Two novel single nucleotide polymorphisms (SNPs) and a microsatellite motif were found in the TH sequences. The DBH sequences contained 62 nucleotide changes, including eight non-synonymous SNPs (nsSNPs) that are of particular interest because they may alter protein function and therefore affect the phenotype. These DBH nsSNPs resulted in amino acid substitutions that were predicted to destabilize the protein structure. Six SNPs (one from TH and five from DBH non-synonymous SNPs) were genotyped in 140 animals; all of them were polymorphic and had a minor allele frequency of > 9%. There were significant differences in the intra- and inter-population haplotype distributions. The haplotype differences between Brahman cattle and the three B. t. taurus breeds (Charolais, Holstein and Lidia) were interesting from a behavioural point of view because of the differences in temperament between these breeds. PMID:22888292

Improving accuracies of genomic predictions for drought tolerance in maize by joint modeling of additive and dominance effects in multi-environment trials.

PubMed

Dias, Kaio Olímpio Das Graças; Gezan, Salvador Alejandro; Guimarães, Claudia Teixeira; Nazarian, Alireza; da Costa E Silva, Luciano; Parentoni, Sidney Netto; de Oliveira Guimarães, Paulo Evaristo; de Oliveira Anoni, Carina; Pádua, José Maria Villela; de Oliveira Pinto, Marcos; Noda, Roberto Willians; Ribeiro, Carlos Alexandre Gomes; de Magalhães, Jurandir Vieira; Garcia, Antonio Augusto Franco; de Souza, João Cândido; Guimarães, Lauro José Moreira; Pastina, Maria Marta

2018-07-01

Breeding for drought tolerance is a challenging task that requires costly, extensive, and precise phenotyping. Genomic selection (GS) can be used to maximize selection efficiency and the genetic gains in maize (Zea mays L.) breeding programs for drought tolerance. Here, we evaluated the accuracy of genomic selection (GS) using additive (A) and additive + dominance (AD) models to predict the performance of untested maize single-cross hybrids for drought tolerance in multi-environment trials. Phenotypic data of five drought tolerance traits were measured in 308 hybrids along eight trials under water-stressed (WS) and well-watered (WW) conditions over two years and two locations in Brazil. Hybrids' genotypes were inferred based on their parents' genotypes (inbred lines) using single-nucleotide polymorphism markers obtained via genotyping-by-sequencing. GS analyses were performed using genomic best linear unbiased prediction by fitting a factor analytic (FA) multiplicative mixed model. Two cross-validation (CV) schemes were tested: CV1 and CV2. The FA framework allowed for investigating the stability of additive and dominance effects across environments, as well as the additive-by-environment and the dominance-by-environment interactions, with interesting applications for parental and hybrid selection. Results showed differences in the predictive accuracy between A and AD models, using both CV1 and CV2, for the five traits in both water conditions. For grain yield (GY) under WS and using CV1, the AD model doubled the predictive accuracy in comparison to the A model. Through CV2, GS models benefit from borrowing information of correlated trials, resulting in an increase of 40% and 9% in the predictive accuracy of GY under WS for A and AD models, respectively. These results highlight the importance of multi-environment trial analyses using GS models that incorporate additive and dominance effects for genomic predictions of GY under drought in maize single-cross hybrids.
An integrative approach to predicting the functional effects of small indels in non-coding regions of the human genome

PubMed Central

Ferlaino, Michael; Rogers, Mark F.; Shihab, Hashem A.; Mort, Matthew; Cooper, David N.; Gaunt, Tom R.; Campbell, Colin

2018-01-01

Background Small insertions and deletions (indels) have a significant influence in human disease and, in terms of frequency, they are second only to single nucleotide variants as pathogenic mutations. As the majority of mutations associated with complex traits are located outside the exome, it is crucial to investigate the potential pathogenic impact of indels in non-coding regions of the human genome. Results We present FATHMM-indel, an integrative approach to predict the functional effect, pathogenic or neutral, of indels in non-coding regions of the human genome. Our method exploits various genomic annotations in addition to sequence data. When validated on benchmark data, FATHMM-indel significantly outperforms CADD and GAVIN, state of the art models in assessing the pathogenic impact of non-coding variants. FATHMM-indel is available via a web server at indels.biocompute.org.uk. Conclusions FATHMM-indel can accurately predict the functional impact and prioritise small indels throughout the whole non-coding genome. PMID:28985712
An integrative approach to predicting the functional effects of small indels in non-coding regions of the human genome.

PubMed

Ferlaino, Michael; Rogers, Mark F; Shihab, Hashem A; Mort, Matthew; Cooper, David N; Gaunt, Tom R; Campbell, Colin

2017-10-06

Small insertions and deletions (indels) have a significant influence in human disease and, in terms of frequency, they are second only to single nucleotide variants as pathogenic mutations. As the majority of mutations associated with complex traits are located outside the exome, it is crucial to investigate the potential pathogenic impact of indels in non-coding regions of the human genome. We present FATHMM-indel, an integrative approach to predict the functional effect, pathogenic or neutral, of indels in non-coding regions of the human genome. Our method exploits various genomic annotations in addition to sequence data. When validated on benchmark data, FATHMM-indel significantly outperforms CADD and GAVIN, state of the art models in assessing the pathogenic impact of non-coding variants. FATHMM-indel is available via a web server at indels.biocompute.org.uk. FATHMM-indel can accurately predict the functional impact and prioritise small indels throughout the whole non-coding genome.
Observational study to calculate addictive risk to opioids: a validation study of a predictive algorithm to evaluate opioid use disorder.

PubMed

Brenton, Ashley; Richeimer, Steven; Sharma, Maneesh; Lee, Chee; Kantorovich, Svetlana; Blanchard, John; Meshkin, Brian

2017-01-01

Opioid abuse in chronic pain patients is a major public health issue, with rapidly increasing addiction rates and deaths from unintentional overdose more than quadrupling since 1999. This study seeks to determine the predictability of aberrant behavior to opioids using a comprehensive scoring algorithm incorporating phenotypic risk factors and neuroscience-associated single-nucleotide polymorphisms (SNPs). The Proove Opioid Risk (POR) algorithm determines the predictability of aberrant behavior to opioids using a comprehensive scoring algorithm incorporating phenotypic risk factors and neuroscience-associated SNPs. In a validation study with 258 subjects with diagnosed opioid use disorder (OUD) and 650 controls who reported using opioids, the POR successfully categorized patients at high and moderate risks of opioid misuse or abuse with 95.7% sensitivity. Regardless of changes in the prevalence of opioid misuse or abuse, the sensitivity of POR remained >95%. The POR correctly stratifies patients into low-, moderate-, and high-risk categories to appropriately identify patients at need for additional guidance, monitoring, or treatment changes.
Identification of protein-interacting nucleotides in a RNA sequence using composition profile of tri-nucleotides.

PubMed

Panwar, Bharat; Raghava, Gajendra P S

2015-04-01

The RNA-protein interactions play a diverse role in the cells, thus identification of RNA-protein interface is essential for the biologist to understand their function. In the past, several methods have been developed for predicting RNA interacting residues in proteins, but limited efforts have been made for the identification of protein-interacting nucleotides in RNAs. In order to discriminate protein-interacting and non-interacting nucleotides, we used various classifiers (NaiveBayes, NaiveBayesMultinomial, BayesNet, ComplementNaiveBayes, MultilayerPerceptron, J48, SMO, RandomForest, SMO and SVM(light)) for prediction model development using various features and achieved highest 83.92% sensitivity, 84.82 specificity, 84.62% accuracy and 0.62 Matthew's correlation coefficient by SVM(light) based models. We observed that certain tri-nucleotides like ACA, ACC, AGA, CAC, CCA, GAG, UGA, and UUU preferred in protein-interaction. All the models have been developed using a non-redundant dataset and are evaluated using five-fold cross validation technique. A web-server called RNApin has been developed for the scientific community (http://crdd.osdd.net/raghava/rnapin/). Copyright © 2015 Elsevier Inc. All rights reserved.
Universal digital high-resolution melt: a novel approach to broad-based profiling of heterogeneous biological samples.

PubMed

Fraley, Stephanie I; Hardick, Justin; Masek, Billie J; Jo Masek, Billie; Athamanolap, Pornpat; Rothman, Richard E; Gaydos, Charlotte A; Carroll, Karen C; Wakefield, Teresa; Wang, Tza-Huei; Yang, Samuel

2013-10-01

Comprehensive profiling of nucleic acids in genetically heterogeneous samples is important for clinical and basic research applications. Universal digital high-resolution melt (U-dHRM) is a new approach to broad-based PCR diagnostics and profiling technologies that can overcome issues of poor sensitivity due to contaminating nucleic acids and poor specificity due to primer or probe hybridization inaccuracies for single nucleotide variations. The U-dHRM approach uses broad-based primers or ligated adapter sequences to universally amplify all nucleic acid molecules in a heterogeneous sample, which have been partitioned, as in digital PCR. Extensive assay optimization enables direct sequence identification by algorithm-based matching of melt curve shape and Tm to a database of known sequence-specific melt curves. We show that single-molecule detection and single nucleotide sensitivity is possible. The feasibility and utility of U-dHRM is demonstrated through detection of bacteria associated with polymicrobial blood infection and microRNAs (miRNAs) associated with host response to infection. U-dHRM using broad-based 16S rRNA gene primers demonstrates universal single cell detection of bacterial pathogens, even in the presence of larger amounts of contaminating bacteria; U-dHRM using universally adapted Lethal-7 miRNAs in a heterogeneous mixture showcases the single copy sensitivity and single nucleotide specificity of this approach.
3'-End labeling of nucleic acids by a polymerase ribozyme.

PubMed

Samanta, Biswajit; Horning, David P; Joyce, Gerald F

2018-06-13

A polymerase ribozyme can be used to label the 3' end of RNA or DNA molecules by incorporating a variety of functionalized nucleotide analogs. Guided by a complementary template, the ribozyme adds a single nucleotide that may contain a fluorophore, biotin, azide or alkyne moiety, thus enabling the detection and/or capture of selectively labeled materials. Employing a variety of commercially available nucleotide analogs, efficient labeling was demonstrated for model RNAs and DNAs, human microRNAs and natural tRNA.
Efficiency and Fidelity of Human DNA Polymerases λ and β during Gap-Filling DNA Synthesis

PubMed Central

Brown, Jessica A.; Pack, Lindsey R.; Sanman, Laura E.; Suo, Zucai

2010-01-01

The base excision repair (BER) pathway coordinates the replacement of 1 to 10 nucleotides at sites of single-base lesions. This process generates DNA substrates with various gap sizes which can alter the catalytic efficiency and fidelity of a DNA polymerase during gap-filling DNA synthesis. Here, we quantitatively determined the substrate specificity and base substitution fidelity of human DNA polymerase λ (Pol λ), an enzyme proposed to support the known BER DNA polymerase β (Pol β), as it filled 1- to 10-nucleotide gaps at 1-nucleotide intervals. Pol λ incorporated a correct nucleotide with relatively high efficiency until the gap size exceeded 9 nucleotides. Unlike Pol λ, Pol β did not have an absolute threshold on gap size as the catalytic efficiency for a correct dNTP gradually decreased as the gap size increased from 2 to 10 nucleotides and then recovered for non-gapped DNA. Surprisingly, an increase in gap size resulted in lower polymerase fidelity for Pol λ, and this downregulation of fidelity was controlled by its non-enzymatic N-terminal domains. Overall, Pol λ was up to 160-fold more error-prone than Pol β, thereby suggesting Pol λ would be more mutagenic during long gap-filling DNA synthesis. In addition, dCTP was the preferred misincorporation for Pol λ and its N-terminal domain truncation mutants. This nucleotide preference was shown to be dependent upon the identity of the adjacent 5′-template base. Our results suggested that both Pol λ and Pol β would catalyze nucleotide incorporation with the highest combination of efficiency and accuracy when the DNA substrate contains a single-nucleotide gap. Thus, Pol λ, like Pol β, is better suited to catalyze gap-filling DNA synthesis during short-patch BER in vivo, although, Pol λ may play a role in long-patch BER. PMID:20961817
DOE Office of Scientific and Technical Information (OSTI.GOV)

Mangoni, Monica; Bisanzi, Simonetta; Carozzi, Francesca

Purpose: Clinical radiosensitivity varies considerably among patients, and radiation-induced side effects developing in normal tissue can be therapy limiting. Some single nucleotide polymorphisms (SNPs) have been shown to correlate with hypersensitivity to radiotherapy. We conducted a prospective study of 87 female patients with breast cancer who received radiotherapy after breast surgery. We evaluated the association between acute skin reaction following radiotherapy and 11 genetic polymorphisms in DNA repair genes: XRCC1 (Arg399Gln and Arg194Trp), XRCC3 (Thr241Met), XPD (Asp312Asn and Lys751Gln), MSH2 (gIVS12-6T>C), MLH1 (Ile219Val), MSH3 (Ala1045Thr), MGMT (Leu84Phe), and in damage-detoxification GSTM1 and GSTT1 genes (allele deletion). Methods and Materials: Individualmore » genetic polymorphisms were determined by polymerase chain reaction and single nucleotide primer extension for single nucleotide polymorphisms or by a multiplex polymerase chain reaction assay for deletion polymorphisms. The development of severe acute skin reaction (moist desquamation or interruption of radiotherapy due to toxicity) associated with genetic polymorphisms was modeled using Cox proportional hazards, accounting for cumulative biologically effective radiation dose. Results: Radiosensitivity developed in eight patients and was increased in carriers of variants XRCC3-241Met allele (hazard ratio [HR] unquantifiably high), MSH2 gIVS12-6nt-C allele (HR = 53.36; 95% confidence intervals [95% CI], 3.56-798.98), and MSH3-1045Ala allele (HR unquantifiably high). Carriers of XRCC1-Arg194Trp variant allele in combination with XRCC1-Arg399Gln wild-type allele had a significant risk of radiosensitivity (HR = 38.26; 95% CI, 1.19-1232.52). Conclusions: To our knowledge, this is the first report to find an association between MSH2 and MSH3 genetic variants and the development of radiosensitivity in breast cancer patients. Our findings suggest the hypothesis that mismatch repair mechanisms may be involved in cellular response to radiotherapy. Genetic polymorphisms may be promising candidates for predicting acute radiosensitivity, but further studies are necessary to confirm our findings.« less
ATP binding and hydrolysis by Saccharomyces cerevisiae Msh2-Msh3 are differentially modulated by mismatch and double-strand break repair DNA substrates.

PubMed

Kumar, Charanya; Eichmiller, Robin; Wang, Bangchen; Williams, Gregory M; Bianco, Piero R; Surtees, Jennifer A

2014-06-01

In Saccharomyces cerevisiae, Msh2-Msh3-mediated mismatch repair (MMR) recognizes and targets insertion/deletion loops for repair. Msh2-Msh3 is also required for 3' non-homologous tail removal (3'NHTR) in double-strand break repair. In both pathways, Msh2-Msh3 binds double-strand/single-strand junctions and initiates repair in an ATP-dependent manner. However, we recently demonstrated that the two pathways have distinct requirements with respect to Msh2-Msh3 activities. We identified a set of aromatic residues in the nucleotide binding pocket (FLY motif) of Msh3 that, when mutated, disrupted MMR, but left 3'NHTR largely intact. One of these mutations, msh3Y942A, was predicted to disrupt the nucleotide sandwich and allow altered positioning of ATP within the pocket. To develop a mechanistic understanding of the differential requirements for ATP binding and/or hydrolysis in the two pathways, we characterized Msh2-Msh3 and Msh2-msh3Y942A ATP binding and hydrolysis activities in the presence of MMR and 3'NHTR DNA substrates. We observed distinct, substrate-dependent ATP hydrolysis and nucleotide turnover by Msh2-Msh3, indicating that the MMR and 3'NHTR DNA substrates differentially modify the ATP binding/hydrolysis activities of Msh2-Msh3. Msh2-msh3Y942A retained the ability to bind DNA and ATP but exhibited altered ATP hydrolysis and nucleotide turnover. We propose that both ATP and structure-specific repair substrates cooperate to direct Msh2-Msh3-mediated repair and suggest an explanation for the msh3Y942A separation-of-function phenotype. Copyright © 2014 Elsevier B.V. All rights reserved.
ATP binding and hydrolysis by Saccharomyces cerevisiae Msh2-Msh3 are differentially modulated by Mismatch and Double-strand Break Repair DNA substrates

PubMed Central

Kumar, Charanya; Eichmiller, Robin; Wang, Bangchen; Williams, Gregory M.; Bianco, Piero R.; Surtees, Jennifer A.

2014-01-01

In Saccharomyces cerevisiae, Msh2-Msh3-mediated mismatch repair (MMR) recognizes and targets insertion/deletion loops for repair. Msh2-Msh3 is also required for 3′ non-homologous tail removal (3′NHTR) in double-strand break repair. In both pathways, Msh2-Msh3 binds double-strand/single-strand junctions and initiates repair in an ATP-dependent manner. However, we recently demonstrated that the two pathways have distinct requirements with respect to Msh2-Msh3 activities. We identified a set of aromatic residues in the nucleotide binding pocket (FLY motif) of Msh3 that, when mutated, disrupted MMR, but left 3′ NHTR largely intact. One of these mutations, msh3Y942A, was predicted to disrupt the nucleotide sandwich and allow altered positioning of ATP within the pocket. To develop a mechanistic understanding of the differential requirements for ATP binding and/or hydrolysis in the two pathways, we characterized Msh2-Msh3 and Msh2-msh3Y942A ATP binding and hydrolysis activities in the presence of MMR and 3′ NHTR DNA substrates. We observed distinct, substrate-dependent ATP hydrolysis and nucleotide turnover by Msh2-Msh3, indicating that the MMR and 3′ NHTR DNA substrates differentially modify the ATP binding/hydrolysis activities of Msh2-Msh3. Msh2-msh3Y942A retained the ability to bind DNA and ATP but exhibited altered ATP hydrolysis and nucleotide turnover. We propose that both ATP and structure-specific repair substrates cooperate to direct Msh2-Msh3-mediated repair and suggest an explanation for the msh3Y942A separation-of-function phenotype. PMID:24746922
Improving Disease Prediction by Incorporating Family Disease History in Risk Prediction Models with Large-Scale Genetic Data.

PubMed

Gim, Jungsoo; Kim, Wonji; Kwak, Soo Heon; Choi, Hosik; Park, Changyi; Park, Kyong Soo; Kwon, Sunghoon; Park, Taesung; Won, Sungho

2017-11-01

Despite the many successes of genome-wide association studies (GWAS), the known susceptibility variants identified by GWAS have modest effect sizes, leading to notable skepticism about the effectiveness of building a risk prediction model from large-scale genetic data. However, in contrast to genetic variants, the family history of diseases has been largely accepted as an important risk factor in clinical diagnosis and risk prediction. Nevertheless, the complicated structures of the family history of diseases have limited their application in clinical practice. Here, we developed a new method that enables incorporation of the general family history of diseases with a liability threshold model, and propose a new analysis strategy for risk prediction with penalized regression analysis that incorporates both large numbers of genetic variants and clinical risk factors. Application of our model to type 2 diabetes in the Korean population (1846 cases and 1846 controls) demonstrated that single-nucleotide polymorphisms accounted for 32.5% of the variation explained by the predicted risk scores in the test data set, and incorporation of family history led to an additional 6.3% improvement in prediction. Our results illustrate that family medical history provides valuable information on the variation of complex diseases and improves prediction performance. Copyright © 2017 by the Genetics Society of America.
Detecting and Analyzing Genetic Recombination Using RDP4.

PubMed

Martin, Darren P; Murrell, Ben; Khoosal, Arjun; Muhire, Brejnev

2017-01-01

Recombination between nucleotide sequences is a major process influencing the evolution of most species on Earth. The evolutionary value of recombination has been widely debated and so too has its influence on evolutionary analysis methods that assume nucleotide sequences replicate without recombining. When nucleic acids recombine, the evolution of the daughter or recombinant molecule cannot be accurately described by a single phylogeny. This simple fact can seriously undermine the accuracy of any phylogenetics-based analytical approach which assumes that the evolutionary history of a set of recombining sequences can be adequately described by a single phylogenetic tree. There are presently a large number of available methods and associated computer programs for analyzing and characterizing recombination in various classes of nucleotide sequence datasets. Here we examine the use of some of these methods to derive and test recombination hypotheses using multiple sequence alignments.
Cy3 and Cy5 dyes attached to oligonucleotide terminus stabilize DNA duplexes: predictive thermodynamic model.

PubMed

Moreira, Bernardo G; You, Yong; Owczarzy, Richard

2015-03-01

Cyanine dyes are important chemical modifications of oligonucleotides exhibiting intensive and stable fluorescence at visible light wavelengths. When Cy3 or Cy5 dye is attached to 5' end of a DNA duplex, the dye stacks on the terminal base pair and stabilizes the duplex. Using optical melting experiments, we have determined thermodynamic parameters that can predict the effects of the dyes on duplex stability quantitatively (ΔG°, Tm). Both Cy dyes enhance duplex formation by 1.2 kcal/mol on average, however, this Gibbs energy contribution is sequence-dependent. If the Cy5 is attached to a pyrimidine nucleotide of pyrimidine-purine base pair, the stabilization is larger compared to the attachment to a purine nucleotide. This is likely due to increased stacking interactions of the dye to the purine of the complementary strand. Dangling (unpaired) nucleotides at duplex terminus are also known to enhance duplex stability. Stabilization originated from the Cy dyes is significantly larger than the stabilization due to the presence of dangling nucleotides. If both the dangling base and Cy3 are present, their thermodynamic contributions are approximately additive. New thermodynamic parameters improve predictions of duplex folding, which will help design oligonucleotide sequences for biophysical, biological, engineering, and nanotechnology applications. Copyright © 2015. Published by Elsevier B.V.
Genomics DNA Profiling in Elite Professional Soccer Players: A Pilot Study

PubMed Central

Kambouris, M; Del Buono, A; Maffulli, N

2014-01-01

Functional variants in exonic regions have been associated with development of cardiovascular disease, diabetes and cancer. Athletic performance can be considered a multi-factorial complex phenotype. Genomic DNA was extracted from buccal swabs of seven soccer players from the Fulham football team. Single nucleotide polymorphism (SNPs) genotyping was undertaken. To achieve optimal athletic performance, predictive genomics DNA profiling for sports performance can be used to aid in sport selection and elaboration of personalized training and nutrition programs. Predictive DNA profiling may be able to detect athletes with potential or frank injuries, or screening and selection of future athletes, and can help them to maximize utilization of their potential and improve performance in sports. The aim of this study is to provide a wide scenario of specific genomic variants that an athlete carries, to implement which measures should be taken to maximize the athlete’s potential. PMID:24809029
Computational and Experimental Approaches to Reveal the Effects of Single Nucleotide Polymorphisms with Respect to Disease Diagnostics

PubMed Central

Kucukkal, Tugba G.; Yang, Ye; Chapman, Susan C.; Cao, Weiguo; Alexov, Emil

2014-01-01

DNA mutations are the cause of many human diseases and they are the reason for natural differences among individuals by affecting the structure, function, interactions, and other properties of DNA and expressed proteins. The ability to predict whether a given mutation is disease-causing or harmless is of great importance for the early detection of patients with a high risk of developing a particular disease and would pave the way for personalized medicine and diagnostics. Here we review existing methods and techniques to study and predict the effects of DNA mutations from three different perspectives: in silico, in vitro and in vivo. It is emphasized that the problem is complicated and successful detection of a pathogenic mutation frequently requires a combination of several methods and a knowledge of the biological phenomena associated with the corresponding macromolecules. PMID:24886813
Psychological Distress Following Marital Separation Interacts with a Polymorphism in the Serotonin Transporter Gene to Predict Cardiac Vagal Control in the Laboratory

PubMed Central

Hasselmo, Karen; Sbarra, David A.; O'Connor, Mary-Frances; Moreno, Francisco A.

2015-01-01

Marital separation is linked to negative mental and physical health; however, the strength of this link may vary across people. This study examined changes in respiratory sinus arrhythmia (RSA), used to assess cardiac vagal control, in recently separated adults (N = 79; M time since separation = 3.5 months). When reflecting over the separation, self-reported psychological distress following the separation interacted with a polymorphism in the serotonin transporter gene (5-HTTLPR) and a relevant single nucleotide polymorphism (SNP), rs25531, to predict RSA. Among people reporting emotional difficulties after the separation, those who were homozygous for the short allele had lower RSA levels while reflecting on their relationship than other genotypes. The findings, although limited by the relatively small sample size, are discussed in terms of how higher-sensitivity genotypes may interact with psychological responses to stress to alter physiology. PMID:25630596
Exploring the Potential of Direct-To-Consumer Genomic Test Data for Predicting Adverse Drug Events.

PubMed

Zhang, Patrick M; Sarkar, Indra Neil

2018-01-01

Recent technological advancements in genetic testing and the growing accessibility of public genomic data provide researchers with a unique avenue to approach personalized medicine. This feasibility study examined the potential of direct-to-consumer (DTC) genomic tests (focusing on 23andMe) in research and clinical applications. In particular, we combined population genetics information from the Personal Genome Project with adverse event reports from AEOLUS and pharmacogenetic information from PharmGKB. Primarily, associations between drugs based on co-occurring genetic variations and associations between variants and adverse events were used to assess the potential for leveraging single nucleotide polymorphism information from 23andMe. The results of this study suggest potential clinical uses of DTC tests in light of potential drug interactions. Furthermore, the results suggest great potential for analyzing associations at a population level to facilitate knowledge discovery in the realm of predicting adverse drug events.
Extension of the COG and arCOG databases by amino acid and nucleotide sequences

PubMed Central

Meereis, Florian; Kaufmann, Michael

2008-01-01

Background The current versions of the COG and arCOG databases, both excellent frameworks for studies in comparative and functional genomics, do not contain the nucleotide sequences corresponding to their protein or protein domain entries. Results Using sequence information obtained from GenBank flat files covering the completely sequenced genomes of the COG and arCOG databases, we constructed NUCOCOG (nucleotide sequences containing COG databases) as an extended version including all nucleotide sequences and in addition the amino acid sequences originally utilized to construct the current COG and arCOG databases. We make available three comprehensive single XML files containing the complete databases including all sequence information. In addition, we provide a web interface as a utility suitable to browse the NUCOCOG database for sequence retrieval. The database is accessible at . Conclusion NUCOCOG offers the possibility to analyze any sequence related property in the context of the COG and arCOG framework simply by using script languages such as PERL applied to a large but single XML document. PMID:19014535
Molecular Characterization of an Avian Astrovirus

PubMed Central

Koci, Matthew D.; Seal, Bruce S.; Schultz-Cherry, Stacey

2000-01-01

Astroviruses are known to cause enteric disease in several animal species, including turkeys. However, only human astroviruses have been well characterized at the nucleotide level. Herein we report the nucleotide sequence, genomic organization, and predicted amino acid sequence of a turkey astrovirus isolated from poults with an emerging enteric disease. PMID:10846102

Obesity-Related Genomic Loci Are Associated with Type 2 Diabetes in a Han Chinese Population

PubMed Central

Zhao, Qi; He, Jiang; Chen, Li; Zhao, Zhigang; Li, Qiang; Ge, Jiapu; Chen, Gang; Guo, Xiaohui; Lu, Juming; Weng, Jianping; Jia, Weiping; Ji, Linong; Xiao, Jianzhong; Shan, Zhongyan; Liu, Jie; Tian, Haoming; Ji, Qiuhe; Zhu, Dalong; Zhou, Zhiguang; Shan, Guangliang; Yang, Wenying

2014-01-01

Background and Aims Obesity is a well-known risk factor for type 2 diabetes. Genome-wide association studies have identified a number of genetic loci associated with obesity. The aim of this study is to examine the contribution of obesity-related genomic loci to type 2 diabetes in a Chinese population. Methods We successfully genotyped 18 obesity-related single nucleotide polymorphisms among 5338 type 2 diabetic patients and 4663 controls. Both individual and joint effects of these single nucleotide polymorphisms on type 2 diabetes and quantitative glycemic traits (assessing β-cell function and insulin resistance) were analyzed using logistic and linear regression models, respectively. Results Two single nucleotide polymorphisms near MC4R and GNPDA2 genes were significantly associated with type 2 diabetes before adjusting for body mass index and waist circumference (OR (95% CI) = 1.14 (1.06, 1.22) for the A allele of rs12970134, P = 4.75×10−4; OR (95% CI) = 1.10 (1.03, 1.17) for the G allele of rs10938397, P = 4.54×10−3). When body mass index and waist circumference were further adjusted, the association of MC4R with type 2 diabetes remained significant (P = 1.81×10−2) and that of GNPDA2 was attenuated (P = 1.26×10−1), suggesting the effect of the locus including GNPDA2 on type 2 diabetes may be mediated through obesity. Single nucleotide polymorphism rs2260000 within BAT2 was significantly associated with type 2 diabetes after adjusting for body mass index and waist circumference (P = 1.04×10−2). In addition, four single nucleotide polymorphisms (near or within SEC16B, BDNF, MAF and PRL genes) showed significant associations with quantitative glycemic traits in controls even after adjusting for body mass index and waist circumference (all P values<0.05). Conclusions This study indicates that obesity-related genomic loci were associated with type 2 diabetes and glycemic traits in the Han Chinese population. PMID:25093408
SRD5A1 and SRD5A2 are associated with treatment for benign prostatic hyperplasia with the combination of 5α-reductase inhibitors and α-adrenergic receptor antagonists.

PubMed

Gu, Xin; Na, Rong; Huang, Tao; Wang, Li; Tao, Sha; Tian, Lu; Chen, Zhuo; Jiao, Yang; Kang, Jian; Zheng, Siqun; Xu, Jianfeng; Sun, Jielin; Qi, Jun

2013-08-01

Common treatments for benign prostatic hyperplasia include 5α-reductase inhibitors and α-adrenergic receptor antagonists. However, these treatments can only partially decrease the risk of benign prostatic hyperplasia progression. SRD5A1 and SRD5A2 are 5α-reductase inhibitor targets. We investigated the association between drug efficacy and single nucleotide polymorphisms in the SRD5A1 and SRD5A2 genes in a Chinese population. We genotyped 11 tagging single nucleotide polymorphisms in the SRD5A1 and SRD5A2 genes in a total of 426 benign prostatic hyperplasia cases and 1,008 controls from Xinhua Hospital, Shanghai, People's Republic of China. Cases were treated with type II 5α-reductase inhibitors and α-adrenergic receptor antagonists. We tested the association of tagging single nucleotide polymorphisms with benign prostatic hyperplasia risk/progression, clinical characteristics at baseline, including the I-PSS (International Prostate Symptom Score) and total prostate volume, and changes in clinical characteristics after treatment. The 11 tagging single nucleotide polymorphisms were not significantly associated with benign prostatic hyperplasia risk or progression (each p >0.05). In the SRD5A1 gene rs6884552 and rs3797177 were significantly associated with baseline I-PSS (p = 0.04 and 0.003, respectively). In the SRD5A2 gene rs523349 (V89L) and rs9332975 were significantly associated with baseline total prostate volume (p = 0.01 and 0.001, respectively). In SRD5A1 rs166050 was significantly associated with the posttreatment change in total prostate volume (p = 0.04). In SRD5A2 rs523349 and rs612224 were significantly associated with the posttreatment I-PSS change (p = 0.03 and 0.009, respectively). SRD5A1 and SRD5A2 single nucleotide polymorphisms are significantly associated with the clinical characteristics of benign prostatic hyperplasia and the efficacy of benign prostatic hyperplasia treatment. Copyright © 2013 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
Clinical evaluation, biochemistry and genetic polymorphism analysis for the diagnosis of lactose intolerance in a population from northeastern Brazil.

PubMed

Ponte, Paulo Roberto Lins; de Medeiros, Pedro Henrique Quintela Soares; Havt, Alexandre; Caetano, Joselany Afio; Cid, David A C; Prata, Mara de Moura Gondim; Soares, Alberto Melo; Guerrant, Richard L; Mychaleckyj, Josyf; Lima, Aldo Ângelo Moreira

2016-02-01

This work aimed to evaluate and correlate symptoms, biochemical blood test results and single nucleotide polymorphisms for lactose intolerance diagnosis. A cross-sectional study was conducted in Fortaleza, Ceará, Brazil, with a total of 119 patients, 54 of whom were lactose intolerant. Clinical evaluation and biochemical blood tests were conducted after lactose ingestion and blood samples were collected for genotyping evaluation. In particular, the single nucleotide polymorphisms C>T-13910 and G>A-22018 were analyzed by restriction fragment length polymorphism/polymerase chain reaction and validated by DNA sequencing. Lactose-intolerant patients presented with more symptoms of flatulence (81.4%), bloating (68.5%), borborygmus (59.3%) and diarrhea (46.3%) compared with non-lactose-intolerant patients (p<0.05). We observed a significant association between the presence of the alleles T-13910 and A-22018 and the lactose-tolerant phenotype (p<0.05). After evaluation of the biochemical blood test results for lactose, we found that the most effective cutoff for glucose levels obtained for lactose malabsorbers was <15 mg/dL, presenting an area under the receiver operating characteristic curve greater than 80.3%, with satisfactory values for sensitivity and specificity. These data corroborate the association of these single nucleotide polymorphisms (C>T-13910 and G>A-22018) with lactose tolerance in this population and suggest clinical management for patients with lactose intolerance that considers single nucleotide polymorphism detection and a change in the biochemical blood test cutoff from <25 mg/dL to <15 mg/dL.
17q25 Locus Is Associated With White Matter Hyperintensity Volume in Ischemic Stroke, But Not With Lacunar Stroke Status

PubMed Central

Adib-Samii, Poneh; Rost, Natalia; Traylor, Matthew; Devan, William; Biffi, Alessandro; Lanfranconi, Silvia; Fitzpatrick, Kaitlin; Bevan, Steve; Kanakis, Allison; Valant, Valerie; Gschwendtner, Andreas; Malik, Rainer; Richie, Alexa; Gamble, Dale; Segal, Helen; Parati, Eugenio A.; Ciusani, Emilio; Holliday, Elizabeth G.; Maguire, Jane; Wardlaw, Joanna; Worrall, Bradford; Bis, Joshua; Wiggins, Kerri L.; Longstreth, Will; Kittner, Steve J.; Cheng, Yu-Ching; Mosley, Thomas; Falcone, Guido J.; Furie, Karen L.; Leiva-Salinas, Carlos; Lau, Benison C.; Khan, Muhammed Saleem; Sharma, Pankaj; Fornage, Myriam; Mitchell, Braxton D.; Psaty, Bruce M.; Sudlow, Cathie; Levi, Christopher; Boncoraglio, Giorgio B.; Rothwell, Peter M.; Meschia, James; Dichgans, Martin; Rosand, Jonathan; Markus, Hugh S.

2013-01-01

Background and Purpose Recently, a novel locus at 17q25 was associated with white matter hyperintensities (WMH) on MRI in stroke-free individuals. We aimed to replicate the association with WMH volume (WMHV) in patients with ischemic stroke. If the association acts by promoting a small vessel arteriopathy, it might be expected to also associate with lacunar stroke. Methods We quantified WMH on MRI in the stroke-free hemisphere of 2588 ischemic stroke cases. Association between WMHV and 6 single-nucleotide polymorphisms at chromosome 17q25 was assessed by linear regression. These single-nucleotide polymorphisms were also investigated for association with lacunar stroke in 1854 cases and 51 939 stroke-free controls from METASTROKE. Meta-analyses with previous reports and a genetic risk score approach were applied to identify other novel WMHV risk variants and uncover shared genetic contributions to WMHV in community participants without stroke and ischemic stroke. Results Single-nucleotide polymorphisms at 17q25 were associated with WMHV in ischemic stroke, the most significant being rs9894383 (P=0.0006). In contrast, there was no association between any single-nucleotide polymorphism and lacunar stroke. A genetic risk score analysis revealed further genetic components to WMHV shared between community participants without stroke and ischemic stroke. Conclusions This study provides support for an association between the 17q25 locus and WMH. In contrast, it is not associated with lacunar stroke, suggesting that the association does not act by promoting small-vessel arteriopathy or the same arteriopathy responsible for lacunar infarction. PMID:23674528
Association of Nitric Oxide Synthase and Matrix Metalloprotease Single Nucleotide Polymorphisms with Preeclampsia and Its Complications

PubMed Central

Leonardo, Daniela P.; Albuquerque, Dulcinéia M.; Lanaro, Carolina; Baptista, Letícia C.; Cecatti, José G.; Surita, Fernanda G.; Parpinelli, Mary A.; Costa, Fernando F.; Franco-Penteado, Carla F.; Fertrin, Kleber Y.; Costa, Maria Laura

2015-01-01

Background Preeclampsia is one of the leading causes of maternal and neonatal morbidity and mortality in the world, but its appearance is still unpredictable and its pathophysiology has not been entirely elucidated. Genetic studies have associated single nucleotide polymorphisms in genes encoding nitric oxide synthase and matrix metalloproteases with preeclampsia, but the results are largely inconclusive across different populations. Objectives To investigate the association of single nucleotide polymorphisms (SNPs) in NOS3 (G894T, T-786C, and a variable number of tandem repetitions VNTR in intron 4), MMP2 (C-1306T), and MMP9 (C-1562T) genes with preeclampsia in patients from Southeastern Brazil. Methods This prospective case-control study enrolled 77 women with preeclampsia and 266 control pregnant women. Clinical data were collected to assess risk factors and the presence of severe complications, such as eclampsia and HELLP (hemolysis, elevated liver enzymes, and low platelets) syndrome. Results We found a significant association between the single nucleotide polymorphism NOS3 T-786C and preeclampsia, independently from age, height, weight, or the other SNPs studied, and no association was found with the other polymorphisms. Age and history of preeclampsia were also identified as risk factors. The presence of at least one polymorphic allele for NOS3 T-786C was also associated with the occurrence of eclampsia or HELLP syndrome among preeclamptic women. Conclusions Our data support that the NOS3 T-786C SNP is associated with preeclampsia and the severity of its complications. PMID:26317342
Clinical evaluation, biochemistry and genetic polymorphism analysis for the diagnosis of lactose intolerance in a population from northeastern Brazil

PubMed Central

Ponte, Paulo Roberto Lins; de Medeiros, Pedro Henrique Quintela Soares; Havt, Alexandre; Caetano, Joselany Afio; Cid, David A C; de Moura Gondim Prata, Mara; Soares, Alberto Melo; Guerrant, Richard L; Mychaleckyj, Josyf; Lima, Aldo Ângelo Moreira

2016-01-01

OBJECTIVE: This work aimed to evaluate and correlate symptoms, biochemical blood test results and single nucleotide polymorphisms for lactose intolerance diagnosis. METHOD: A cross-sectional study was conducted in Fortaleza, Ceará, Brazil, with a total of 119 patients, 54 of whom were lactose intolerant. Clinical evaluation and biochemical blood tests were conducted after lactose ingestion and blood samples were collected for genotyping evaluation. In particular, the single nucleotide polymorphisms C>T-13910 and G>A-22018 were analyzed by restriction fragment length polymorphism/polymerase chain reaction and validated by DNA sequencing. RESULTS: Lactose-intolerant patients presented with more symptoms of flatulence (81.4%), bloating (68.5%), borborygmus (59.3%) and diarrhea (46.3%) compared with non-lactose-intolerant patients (p<0.05). We observed a significant association between the presence of the alleles T-13910 and A-22018 and the lactose-tolerant phenotype (p<0.05). After evaluation of the biochemical blood test results for lactose, we found that the most effective cutoff for glucose levels obtained for lactose malabsorbers was <15 mg/dL, presenting an area under the receiver operating characteristic curve greater than 80.3%, with satisfactory values for sensitivity and specificity. CONCLUSIONS: These data corroborate the association of these single nucleotide polymorphisms (C>T-13910 and G>A-22018) with lactose tolerance in this population and suggest clinical management for patients with lactose intolerance that considers single nucleotide polymorphism detection and a change in the biochemical blood test cutoff from <25 mg/dL to <15 mg/dL. PMID:26934237
Contribution of domestic production records, Interbull estimated breeding values, and single nucleotide polymorphism genetic markers to the single-step genomic evaluation of milk production.

PubMed

Přibyl, J; Madsen, P; Bauer, J; Přibylová, J; Simečková, M; Vostrý, L; Zavadilová, L

2013-03-01

Estimated breeding values (EBV) for first-lactation milk production of Holstein cattle in the Czech Republic were calculated using a conventional animal model and by single-step prediction of the genomic enhanced breeding value. Two overlapping data sets of milk production data were evaluated: (1) calving years 1991 to 2006, with 861,429 lactations and 1,918,901 animals in the pedigree and (2) calving years 1991 to 2010, with 1,097,319 lactations and 1,906,576 animals in the pedigree. Global Interbull (Uppsala, Sweden) deregressed proofs of 114,189 bulls were used in the analyses. Reliabilities of Interbull values were equivalent to an average of 8.53 effective records, which were used in a weighted analysis. A total of 1,341 bulls were genotyped using the Illumina BovineSNP50 BeadChip V2 (Illumina Inc., San Diego, CA). Among the genotyped bulls were 332 young bulls with no daughters in the first data set but more than 50 daughters (88.41, on average) with performance records in the second data set. For young bulls, correlations of EBV and genomic enhanced breeding value before and after progeny testing, corresponding average expected reliabilities, and effective daughter contributions (EDC) were calculated. The reliability of prediction pedigree EBV of young bulls was 0.41, corresponding to EDC=10.6. Including Interbull deregressed proofs improved the reliability of prediction by EDC=13.4 and including genotyping improved prediction reliability by EDC=6.2. Total average expected reliability of prediction reached 0.67, corresponding to EDC=30.2. The combination of domestic and Interbull sources for both genotyped and nongenotyped animals is valuable for improving the accuracy of genetic prediction in small populations of dairy cattle. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Predicting Hybrid Performances for Quality Traits through Genomic-Assisted Approaches in Central European Wheat

PubMed Central

Liu, Guozheng; Zhao, Yusheng; Gowda, Manje; Longin, C. Friedrich H.; Reif, Jochen C.; Mette, Michael F.

2016-01-01

Bread-making quality traits are central targets for wheat breeding. The objectives of our study were to (1) examine the presence of major effect QTLs for quality traits in a Central European elite wheat population, (2) explore the optimal strategy for predicting the hybrid performance for wheat quality traits, and (3) investigate the effects of marker density and the composition and size of the training population on the accuracy of prediction of hybrid performance. In total 135 inbred lines of Central European bread wheat (Triticum aestivum L.) and 1,604 hybrids derived from them were evaluated for seven quality traits in up to six environments. The 135 parental lines were genotyped using a 90k single-nucleotide polymorphism array. Genome-wide association mapping initially suggested presence of several quantitative trait loci (QTLs), but cross-validation rather indicated the absence of major effect QTLs for all quality traits except of 1000-kernel weight. Genomic selection substantially outperformed marker-assisted selection in predicting hybrid performance. A resampling study revealed that increasing the effective population size in the estimation set of hybrids is relevant to boost the accuracy of prediction for an unrelated test population. PMID:27383841
Bridging the gap between marker-assisted and genomic selection of heading time and plant height in hybrid wheat.

PubMed

Zhao, Y; Mette, M F; Gowda, M; Longin, C F H; Reif, J C

2014-06-01

Based on data from field trials with a large collection of 135 elite winter wheat inbred lines and 1604 F1 hybrids derived from them, we compared the accuracy of prediction of marker-assisted selection and current genomic selection approaches for the model traits heading time and plant height in a cross-validation approach. For heading time, the high accuracy seen with marker-assisted selection severely dropped with genomic selection approaches RR-BLUP (ridge regression best linear unbiased prediction) and BayesCπ, whereas for plant height, accuracy was low with marker-assisted selection as well as RR-BLUP and BayesCπ. Differences in the linkage disequilibrium structure of the functional and single-nucleotide polymorphism markers relevant for the two traits were identified in a simulation study as a likely explanation for the different trends in accuracies of prediction. A new genomic selection approach, weighted best linear unbiased prediction (W-BLUP), designed to treat the effects of known functional markers more appropriately, proved to increase the accuracy of prediction for both traits and thus closes the gap between marker-assisted and genomic selection.
Bridging the gap between marker-assisted and genomic selection of heading time and plant height in hybrid wheat

PubMed Central

Zhao, Y; Mette, M F; Gowda, M; Longin, C F H; Reif, J C

2014-01-01

Based on data from field trials with a large collection of 135 elite winter wheat inbred lines and 1604 F1 hybrids derived from them, we compared the accuracy of prediction of marker-assisted selection and current genomic selection approaches for the model traits heading time and plant height in a cross-validation approach. For heading time, the high accuracy seen with marker-assisted selection severely dropped with genomic selection approaches RR-BLUP (ridge regression best linear unbiased prediction) and BayesCπ, whereas for plant height, accuracy was low with marker-assisted selection as well as RR-BLUP and BayesCπ. Differences in the linkage disequilibrium structure of the functional and single-nucleotide polymorphism markers relevant for the two traits were identified in a simulation study as a likely explanation for the different trends in accuracies of prediction. A new genomic selection approach, weighted best linear unbiased prediction (W-BLUP), designed to treat the effects of known functional markers more appropriately, proved to increase the accuracy of prediction for both traits and thus closes the gap between marker-assisted and genomic selection. PMID:24518889
Short double-stranded RNAs with an overhanging 5' ppp-nucleotide, as found in arenavirus genomes, act as RIG-I decoys.

PubMed

Marq, Jean-Baptiste; Hausmann, Stéphane; Veillard, Nicolas; Kolakofsky, Daniel; Garcin, Dominique

2011-02-25

Arenavirus RNA genomes are initiated by a "prime and realign" mechanism, such that the initiating GTP is found as a single unpaired (overhanging) nucleotide when the complementary genome ends anneal to form double-stranded (ds) RNA panhandle structures. dsRNAs modeled on these structures do not induce interferon (IFN), as opposed to blunt-ended (5' ppp)dsRNA. This study examines whether these viral structures can also act as decoys, by trapping RIG-I in inactive dsRNA complexes. We examined the ability of various dsRNAs to activate the RIG-I ATPase (presumably a measure of helicase translocation on dsRNA) relative to their ability to induce IFN. We found that there is no simple relationship between these two properties, as if RIG-I can translocate on short dsRNAs without inducing IFN. Moreover, we found that (5' ppp)dsRNAs with a single unpaired 5' ppp-nucleotide can in fact competitively inhibit the ability of blunt-ended (5' ppp)dsRNAs to induce IFN when co-transfected into cells and that this inhibition is strongly dependent on the presence of the 5' ppp. In contrast, (5' ppp)dsRNAs with a single unpaired 5' ppp-nucleotide does not inhibit poly(I-C)-induced IFN activation, which is independent of the presence of a 5' ppp group.
Four Linked Genes Participate in Controlling Sporulation Efficiency in Budding Yeast

PubMed Central

Ben-Ari, Giora; Zenvirth, Drora; Sherman, Amir; David, Lior; Klutstein, Michael; Lavi, Uri; Hillel, Jossi; Simchen, Giora

2006-01-01

Quantitative traits are conditioned by several genetic determinants. Since such genes influence many important complex traits in various organisms, the identification of quantitative trait loci (QTLs) is of major interest, but still encounters serious difficulties. We detected four linked genes within one QTL, which participate in controlling sporulation efficiency in Saccharomyces cerevisiae. Following the identification of single nucleotide polymorphisms by comparing the sequences of 145 genes between the parental strains SK1 and S288c, we analyzed the segregating progeny of the cross between them. Through reciprocal hemizygosity analysis, four genes, RAS2, PMS1, SWS2, and FKH2, located in a region of 60 kilobases on Chromosome 14, were found to be associated with sporulation efficiency. Three of the four “high” sporulation alleles are derived from the “low” sporulating strain. Two of these sporulation-related genes were verified through allele replacements. For RAS2, the causative variation was suggested to be a single nucleotide difference in the upstream region of the gene. This quantitative trait nucleotide accounts for sporulation variability among a set of ten closely related winery yeast strains. Our results provide a detailed view of genetic complexity in one “QTL region” that controls a quantitative trait and reports a single nucleotide polymorphism-trait association in wild strains. Moreover, these findings have implications on QTL identification in higher eukaryotes. PMID:17112318
Genetics of Oxidative Stress in Obesity

PubMed Central

Rupérez, Azahara I.; Gil, Angel; Aguilera, Concepción M.

2014-01-01

Obesity is a multifactorial disease characterized by the excessive accumulation of fat in adipose tissue and peripheral organs. Its derived metabolic complications are mediated by the associated oxidative stress, inflammation and hypoxia. Oxidative stress is due to the excessive production of reactive oxygen species or diminished antioxidant defenses. Genetic variants, such as single nucleotide polymorphisms in antioxidant defense system genes, could alter the efficacy of these enzymes and, ultimately, the risk of obesity; thus, studies investigating the role of genetic variations in genes related to oxidative stress could be useful for better understanding the etiology of obesity and its metabolic complications. The lack of existing literature reviews in this field encouraged us to gather the findings from studies focusing on the impact of single nucleotide polymorphisms in antioxidant enzymes, oxidative stress-producing systems and transcription factor genes concerning their association with obesity risk and its phenotypes. In the future, the characterization of these single nucleotide polymorphisms (SNPs) in obese patients could contribute to the development of controlled antioxidant therapies potentially beneficial for the treatment of obesity-derived metabolic complications. PMID:24562334
Genetics of oxidative stress in obesity.

PubMed

Rupérez, Azahara I; Gil, Angel; Aguilera, Concepción M

2014-02-20

Obesity is a multifactorial disease characterized by the excessive accumulation of fat in adipose tissue and peripheral organs. Its derived metabolic complications are mediated by the associated oxidative stress, inflammation and hypoxia. Oxidative stress is due to the excessive production of reactive oxygen species or diminished antioxidant defenses. Genetic variants, such as single nucleotide polymorphisms in antioxidant defense system genes, could alter the efficacy of these enzymes and, ultimately, the risk of obesity; thus, studies investigating the role of genetic variations in genes related to oxidative stress could be useful for better understanding the etiology of obesity and its metabolic complications. The lack of existing literature reviews in this field encouraged us to gather the findings from studies focusing on the impact of single nucleotide polymorphisms in antioxidant enzymes, oxidative stress-producing systems and transcription factor genes concerning their association with obesity risk and its phenotypes. In the future, the characterization of these single nucleotide polymorphisms (SNPs) in obese patients could contribute to the development of controlled antioxidant therapies potentially beneficial for the treatment of obesity-derived metabolic complications.
A graphene-based platform for single nucleotide polymorphism (SNP) genotyping.

PubMed

Liu, Meng; Zhao, Huimin; Chen, Shuo; Yu, Hongtao; Zhang, Yaobin; Quan, Xie

2011-06-15

A facile, rapid, stable and sensitive approach for fluorescent detection of single nucleotide polymorphism (SNP) is designed based on DNA ligase reaction and π-stacking between the graphene and the nucleotide bases. In the presence of perfectly matched DNA, DNA ligase can catalyze the linkage of fluorescein amidite-labeled single-stranded DNA (ssDNA) and a phosphorylated ssDNA, and thus the formation of a stable duplex in high yield. However, the catalytic reaction cannot effectively carry out with one-base mismatched DNA target. In this case, we add graphene to the system in order to produce different quenching signals due to its different adsorption affinity for ssDNA and double-stranded DNA. Taking advantage of the unique surface property of graphene and the high discriminability of DNA ligase, the proposed protocol exhibits good performance in SNP genotyping. The results indicate that it is possible to accurately determine SNP with frequency as low as 2.6% within 40 min. Furthermore, the presented flexible strategy facilitates the development of other biosensing applications in the future. Copyright © 2011 Elsevier B.V. All rights reserved.
Converging evidence for the association of functional genetic variation in the serotonin receptor 2a gene with prefrontal function and olanzapine treatment.

PubMed

Blasi, Giuseppe; De Virgilio, Caterina; Papazacharias, Apostolos; Taurisano, Paolo; Gelao, Barbara; Fazio, Leonardo; Ursini, Gianluca; Sinibaldi, Lorenzo; Andriola, Ileana; Masellis, Rita; Romano, Raffaella; Rampino, Antonio; Di Giorgio, Annabella; Lo Bianco, Luciana; Caforio, Grazia; Piva, Francesco; Popolizio, Teresa; Bellantuono, Cesario; Todarello, Orlando; Kleinman, Joel E; Gadaleta, Gemma; Weinberger, Daniel R; Bertolino, Alessandro

2013-09-01

Serotonin (5-hydroxytryptamine) receptor 2a (5-HT2AR) signaling is important for modulation of corticostriatal pathways and prefrontal activity during cognition. Furthermore, newer antipsychotic drugs target 5-HT2AR. A single-nucleotide polymorphism in the 5-HT2AR gene (HTR2A rs6314, C>T; OMIM 182135) has been weakly associated with differential 5-HT2AR signaling and with physiologic as well as behavioral effects. To use a hierarchical approach to determine the functional effects of this single-nucleotide polymorphism on 5-HT2AR messenger RNA and protein expression, on prefrontal phenotypes linked with genetic risk for schizophrenia, and on treatment with olanzapine. In silico predictions, in vitro, and case-control investigations. Academic and clinical facilities. The postmortem study included 112 brains from healthy individuals; the in vivo investigation included a total sample of 371 healthy individuals and patients with schizophrenia. EXPOSURES Patients received olanzapine monotherapy for 8 weeks. In silico predictions, messenger RNA, and protein expression in postmortem human prefrontal cortex and HeLa cells, functional magnetic resonance imaging prefrontal activity and behavior during working memory and attention in healthy individuals, and response to an 8-week trial of olanzapine treatment in patients with schizophrenia. Bioinformatic analysis predicted that rs6314 alters patterns of splicing, with possible effects on HTR2A expression. Moreover, the T allele was associated with reduced prefrontal messenger RNA expression in postmortem prefrontal cortex, with reduced protein expression in vitro, inefficient prefrontal blood oxygen level-dependent functional magnetic resonance imaging response during working memory and attentional control processing, and impaired working memory and attention behavior, as well as with attenuated improvement in negative symptoms after olanzapine treatment. Our results suggest that HTR2A rs6314 affects 5-HT2AR expression and functionally contributes to genetic modulation of known endophenotypes of schizophrenia-like higher-level cognitive behaviors and related prefrontal activity, as well as response to treatment with olanzapine.
WEB-server for search of a periodicity in amino acid and nucleotide sequences

NASA Astrophysics Data System (ADS)

E Frenkel, F.; Skryabin, K. G.; Korotkov, E. V.

2017-12-01

A new web server (http://victoria.biengi.ac.ru/splinter/login.php) was designed and developed to search for periodicity in nucleotide and amino acid sequences. The web server operation is based upon a new mathematical method of searching for multiple alignments, which is founded on the position weight matrices optimization, as well as on implementation of the two-dimensional dynamic programming. This approach allows the construction of multiple alignments of the indistinctly similar amino acid and nucleotide sequences that accumulated more than 1.5 substitutions per a single amino acid or a nucleotide without performing the sequences paired comparisons. The article examines the principles of the web server operation and two examples of studying amino acid and nucleotide sequences, as well as information that could be obtained using the web server.
Nucleotide Selectivity in Abiotic RNA Polymerization Reactions.

PubMed

Coari, Kristin M; Martin, Rebecca C; Jain, Kopal; McGown, Linda B

2017-09-01

In order to establish an RNA world on early Earth, the nucleotides must form polymers through chemical rather than biochemical reactions. The polymerization products must be long enough to perform catalytic functions, including self-replication, and to preserve genetic information. These functions depend not only on the length of the polymers, but also on their sequences. To date, studies of abiotic RNA polymerization generally have focused on routes to polymerization of a single nucleotide and lengths of the homopolymer products. Less work has been done the selectivity of the reaction toward incorporation of some nucleotides over others in nucleotide mixtures. Such information is an essential step toward understanding the chemical evolution of RNA. To address this question, in the present work RNA polymerization reactions were performed in the presence of montmorillonite clay catalyst. The nucleotides included the monophosphates of adenosine, cytosine, guanosine, uridine and inosine. Experiments included reactions of mixtures of an imidazole-activated nucleotide (ImpX) with one or more unactivated nucleotides (XMP), of two or more ImpX, and of XMP that were activated in situ in the polymerization reaction itself. The reaction products were analyzed using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) to identify the lengths and nucleotide compositions of the polymerization products. The results show that the extent of polymerization, the degree of heteropolymerization vs. homopolymerization, and the composition of the polymeric products all vary among the different nucleotides and depend upon which nucleotides and how many different nucleotides are present in the mixture.
Nucleotide Selectivity in Abiotic RNA Polymerization Reactions

NASA Astrophysics Data System (ADS)

Coari, Kristin M.; Martin, Rebecca C.; Jain, Kopal; McGown, Linda B.

2017-09-01

In order to establish an RNA world on early Earth, the nucleotides must form polymers through chemical rather than biochemical reactions. The polymerization products must be long enough to perform catalytic functions, including self-replication, and to preserve genetic information. These functions depend not only on the length of the polymers, but also on their sequences. To date, studies of abiotic RNA polymerization generally have focused on routes to polymerization of a single nucleotide and lengths of the homopolymer products. Less work has been done the selectivity of the reaction toward incorporation of some nucleotides over others in nucleotide mixtures. Such information is an essential step toward understanding the chemical evolution of RNA. To address this question, in the present work RNA polymerization reactions were performed in the presence of montmorillonite clay catalyst. The nucleotides included the monophosphates of adenosine, cytosine, guanosine, uridine and inosine. Experiments included reactions of mixtures of an imidazole-activated nucleotide (ImpX) with one or more unactivated nucleotides (XMP), of two or more ImpX, and of XMP that were activated in situ in the polymerization reaction itself. The reaction products were analyzed using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) to identify the lengths and nucleotide compositions of the polymerization products. The results show that the extent of polymerization, the degree of heteropolymerization vs. homopolymerization, and the composition of the polymeric products all vary among the different nucleotides and depend upon which nucleotides and how many different nucleotides are present in the mixture.
Intraspecific Variation and Phylogenetic Relationships Are Revealed by ITS1 Secondary Structure Analysis and Single-Nucleotide Polymorphism in Ganoderma lucidum

PubMed Central

Pei, Haisheng; Chen, Zhou; Tan, Xiaoyan; Hu, Jing; Yang, Bin; Sun, Junshe

2017-01-01

Ganoderma lucidum is a typical polypore fungus used for traditional Chinese medical purposes. The taxonomic delimitation of Ganoderma lucidum is still debated. In this study, we sequenced seven internal transcribed spacer (ITS) sequences of Ganoderma lucidum strains and annotated the ITS1 and ITS2 regions. Phylogenetic analysis of ITS1 differentiated the strains into three geographic groups. Groups 1–3 were originated from Europe, tropical Asia, and eastern Asia, respectively. While ITS2 could only differentiate the strains into two groups in which Group 2 originated from tropical Asia gathered with Groups 1 and 3 originated from Europe and eastern Asia. By determining the secondary structures of the ITS1 sequences, these three groups exhibited similar structures with a conserved central core and differed helices. While compared to Group 2, Groups 1 and 3 of ITS2 sequences shared similar structures with the difference in helix 4. Large-scale evaluation of ITS1 and ITS2 both exhibited that the majority of subgroups in the same group shared the similar structures. Further Weblogo analysis of ITS1 sequences revealed two main variable regions located in helix 2 in which C/T or A/G substitutions frequently occurred and ITS1 exhibited more nucleotide variances compared to ITS2. ITS1 multi-alignment of seven spawn strains and culture tests indicated that a single-nucleotide polymorphism (SNP) site at position 180 correlated with strain antagonism. The HZ, TK and 203 fusion strains of Ganoderma lucidum had a T at position 180, whereas other strains exhibiting antagonism, including DB, RB, JQ, and YS, had a C. Taken together, compared to ITS2 region, ITS1 region could differentiated Ganoderma lucidum into three geographic originations based on phylogenetic analysis and secondary structure prediction. Besides, a SNP in ITS 1 could delineate Ganoderma lucidum strains at the intraspecific level. These findings will be implemented to improve species quality control in the Ganoderma industry. PMID:28056060

An intergenic non-coding rRNA correlated with expression of the rRNA and frequency of an rRNA single nucleotide polymorphism in lung cancer cells.

PubMed

Shiao, Yih-Horng; Lupascu, Sorin T; Gu, Yuhan D; Kasprzak, Wojciech; Hwang, Christopher J; Fields, Janet R; Leighty, Robert M; Quiñones, Octavio; Shapiro, Bruce A; Alvord, W Gregory; Anderson, Lucy M

2009-10-19

Ribosomal RNA (rRNA) is a central regulator of cell growth and may control cancer development. A cis noncoding rRNA (nc-rRNA) upstream from the 45S rRNA transcription start site has recently been implicated in control of rRNA transcription in mouse fibroblasts. We investigated whether a similar nc-rRNA might be expressed in human cancer epithelial cells, and related to any genomic characteristics. Using quantitative rRNA measurement, we demonstrated that a nc-rRNA is transcribed in human lung epithelial and lung cancer cells, starting from approximately -1000 nucleotides upstream of the rRNA transcription start site (+1) and extending at least to +203. This nc-rRNA was significantly more abundant in the majority of lung cancer cell lines, relative to a nontransformed lung epithelial cell line. Its abundance correlated negatively with total 45S rRNA in 12 of 13 cell lines (P = 0.014). During sequence analysis from -388 to +306, we observed diverse, frequent intercopy single nucleotide polymorphisms (SNPs) in rRNA, with a frequency greater than predicted by chance at 12 sites. A SNP at +139 (U/C) in the 5' leader sequence varied among the cell lines and correlated negatively with level of the nc-rRNA (P = 0.014). Modelling of the secondary structure of the rRNA 5'-leader sequence indicated a small increase in structural stability due to the +139 U/C SNP and a minor shift in local configuration occurrences. The results demonstrate occurrence of a sense nc-rRNA in human lung epithelial and cancer cells, and imply a role in regulation of the rRNA gene, which may be affected by a +139 SNP in the 5' leader sequence of the primary rRNA transcript.
Intraspecific Variation and Phylogenetic Relationships Are Revealed by ITS1 Secondary Structure Analysis and Single-Nucleotide Polymorphism in Ganoderma lucidum.

PubMed

Zhang, Xiuqing; Xu, Zhangyang; Pei, Haisheng; Chen, Zhou; Tan, Xiaoyan; Hu, Jing; Yang, Bin; Sun, Junshe

2017-01-01

Ganoderma lucidum is a typical polypore fungus used for traditional Chinese medical purposes. The taxonomic delimitation of Ganoderma lucidum is still debated. In this study, we sequenced seven internal transcribed spacer (ITS) sequences of Ganoderma lucidum strains and annotated the ITS1 and ITS2 regions. Phylogenetic analysis of ITS1 differentiated the strains into three geographic groups. Groups 1-3 were originated from Europe, tropical Asia, and eastern Asia, respectively. While ITS2 could only differentiate the strains into two groups in which Group 2 originated from tropical Asia gathered with Groups 1 and 3 originated from Europe and eastern Asia. By determining the secondary structures of the ITS1 sequences, these three groups exhibited similar structures with a conserved central core and differed helices. While compared to Group 2, Groups 1 and 3 of ITS2 sequences shared similar structures with the difference in helix 4. Large-scale evaluation of ITS1 and ITS2 both exhibited that the majority of subgroups in the same group shared the similar structures. Further Weblogo analysis of ITS1 sequences revealed two main variable regions located in helix 2 in which C/T or A/G substitutions frequently occurred and ITS1 exhibited more nucleotide variances compared to ITS2. ITS1 multi-alignment of seven spawn strains and culture tests indicated that a single-nucleotide polymorphism (SNP) site at position 180 correlated with strain antagonism. The HZ, TK and 203 fusion strains of Ganoderma lucidum had a T at position 180, whereas other strains exhibiting antagonism, including DB, RB, JQ, and YS, had a C. Taken together, compared to ITS2 region, ITS1 region could differentiated Ganoderma lucidum into three geographic originations based on phylogenetic analysis and secondary structure prediction. Besides, a SNP in ITS 1 could delineate Ganoderma lucidum strains at the intraspecific level. These findings will be implemented to improve species quality control in the Ganoderma industry.
Quantum-Sequencing: Fast electronic single DNA molecule sequencing

NASA Astrophysics Data System (ADS)

Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

2014-03-01

A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free, high-throughput and cost-effective, single-molecule sequencing method. Here, we present the first demonstration of unique ``electronic fingerprint'' of all nucleotides (A, G, T, C), with single-molecule DNA sequencing, using Quantum-tunneling Sequencing (Q-Seq) at room temperature. We show that the electronic state of the nucleobases shift depending on the pH, with most distinct states identified at acidic pH. We also demonstrate identification of single nucleotide modifications (methylation here). Using these unique electronic fingerprints (or tunneling data), we report a partial sequence of beta lactamase (bla) gene, which encodes resistance to beta-lactam antibiotics, with over 95% success rate. These results highlight the potential of Q-Seq as a robust technique for next-generation sequencing.
Conformational Smear Characterization and Binning of Single-Molecule Conductance Measurements for Enhanced Molecular Recognition.

PubMed

Korshoj, Lee E; Afsari, Sepideh; Chatterjee, Anushree; Nagpal, Prashant

2017-11-01

Electronic conduction or charge transport through single molecules depends primarily on molecular structure and anchoring groups and forms the basis for a wide range of studies from molecular electronics to DNA sequencing. Several high-throughput nanoelectronic methods such as mechanical break junctions, nanopores, conductive atomic force microscopy, scanning tunneling break junctions, and static nanoscale electrodes are often used for measuring single-molecule conductance. In these measurements, "smearing" due to conformational changes and other entropic factors leads to large variances in the observed molecular conductance, especially in individual measurements. Here, we show a method for characterizing smear in single-molecule conductance measurements and demonstrate how binning measurements according to smear can significantly enhance the use of individual conductance measurements for molecular recognition. Using quantum point contact measurements on single nucleotides within DNA macromolecules, we demonstrate that the distance over which molecular junctions are maintained is a measure of smear, and the resulting variance in unbiased single measurements depends on this smear parameter. Our ability to identify individual DNA nucleotides at 20× coverage increases from 81.3% accuracy without smear analysis to 93.9% with smear characterization and binning (SCRIB). Furthermore, merely 7 conductance measurements (7× coverage) are needed to achieve 97.8% accuracy for DNA nucleotide recognition when only low molecular smear measurements are used, which represents a significant improvement over contemporary sequencing methods. These results have important implications in a broad range of molecular electronics applications from designing robust molecular switches to nanoelectronic DNA sequencing.
Yeast ribonuclease III uses a network of multiple hydrogen bonds for RNA binding and cleavage.

PubMed

Lavoie, Mathieu; Abou Elela, Sherif

2008-08-19

Members of the bacterial RNase III family recognize a variety of short structured RNAs with few common features. It is not clear how this group of enzymes supports high cleavage fidelity while maintaining a broad base of substrates. Here we show that the yeast orthologue of RNase III (Rnt1p) uses a network of 2'-OH-dependent interactions to recognize substrates with different structures. We designed a series of bipartite substrates permitting the distinction between binding and cleavage defects. Each substrate was engineered to carry a single or multiple 2'- O-methyl or 2'-fluoro ribonucleotide substitutions to prevent the formation of hydrogen bonds with a specific nucleotide or group of nucleotides. Interestingly, introduction of 2'- O-methyl ribonucleotides near the cleavage site increased the rate of catalysis, indicating that 2'-OH are not required for cleavage. Substitution of nucleotides in known Rnt1p binding site with 2'- O-methyl ribonucleotides inhibited cleavage while single 2'-fluoro ribonucleotide substitutions did not. This indicates that while no single 2'-OH is essential for Rnt1p cleavage, small changes in the substrate structure are not tolerated. Strikingly, several nucleotide substitutions greatly increased the substrate dissociation constant with little or no effect on the Michaelis-Menten constant or rate of catalysis. Together, the results indicate that Rnt1p uses a network of nucleotide interactions to identify its substrate and support two distinct modes of binding. One mode is primarily mediated by the dsRNA binding domain and leads to the formation of stable RNA/protein complex, while the other requires the presence of the nuclease and N-terminal domains and leads to RNA cleavage.
Translational genomics for abiotic stress in sorghum: transcriptional profiling and validation of SNP markers between germplasm with differential cold tolerance

USDA-ARS?s Scientific Manuscript database

One focus of the Sorghum Translational Genomics Lab (part of sorghum CRIS, PSGD, CSRL, USDA-ARS, Lubbock TX) is to utilize nucleotide variation between sorghum germplasm such as those derived from RNA seq for translation and validation of Single Nucleotide Polymorphism (SNP) into easy access DNA m...
A genetic variation map for chicken with 2.8 million single nucleotide polymorphisms

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wong, G K; Hillier, L; Brandstrom, M

2005-02-20

We describe a genetic variation map for the chicken genome containing 2.8 million single nucleotide polymorphisms (SNPs), based on a comparison of the sequences of 3 domestic chickens (broiler, layer, Silkie) to their wild ancestor Red Jungle Fowl (RJF). Subsequent experiments indicate that at least 90% are true SNPs, and at least 70% are common SNPs that segregate in many domestic breeds. Mean nucleotide diversity is about 5 SNP/kb for almost every possible comparison between RJF and domestic lines, between two different domestic lines, and within domestic lines--contrary to the idea that domestic animals are highly inbred relative to theirmore » wild ancestors. In fact, most of the SNPs originated prior to domestication, and there is little to no evidence of selective sweeps for adaptive alleles on length scales of greater than 100 kb.« less
A Lateral Flow Biosensor for the Detection of Single Nucleotide Polymorphisms.

PubMed

Zeng, Lingwen; Xiao, Zhuo

2017-01-01

A lateral flow biosensor (LFB) is introduced for the detection of single nucleotide polymorphisms (SNPs). The assay is composed of two steps: circular strand displacement reaction and lateral flow biosensor detection. In step 1, the nucleotide at SNP site is recognized by T4 DNA ligase and the signal is amplified by strand displacement DNA polymerase, which can be accomplished at a constant temperature. In step 2, the reaction product of step 1 is detected by a lateral flow biosensor, which is a rapid and cost effective tool for nuclei acid detection. Comparing with conventional methods, it requires no complicated machines. It is suitable for the use of point of care diagnostics. Therefore, this simple, cost effective, robust, and promising LFB detection method of SNP has great potential for the detection of genetic diseases, personalized medicine, cancer related mutations, and drug-resistant mutations of infectious agents.
Detecting and Removing Ascertainment Bias in Microsatellites from the HGDP-CEPH Panel

PubMed Central

Eriksson, Anders; Manica, Andrea

2011-01-01

Although ascertainment bias in single nucleotide polymorphisms is a well-known problem, it is generally accepted that microsatellites have mutation rates too high for bias to be a concern. Here, we analyze in detail the large set of microsatellites typed for the Human Genetic Diversity Panel (HGDP)-CEPH panel. We develop a novel framework based on rarefaction to compare heterozygosity across markers with different mutation rates. We find that, whereas di- and tri-nucleotides show similar patterns of within- and between-population heterozygosity, tetra-nucleotides are inconsistent with the other two motifs. In addition, di- and tri-nucleotides are consistent with 16 unbiased tetra-nucleotide markers, whereas the HPGP-CEPH tetra-nucleotides are significantly different. This discrepancy is due to the HGDP-CEPH tetra-nucleotides being too homogeneous across Eurasia, even after their slower mutation rate is taken into account by rarefying the other markers. The most likely explanation for this pattern is ascertainment bias. We strongly advocate the exclusion of tetra-nucleotides from future population genetics analysis of this dataset, and we argue that other microsatellite datasets should be investigated for the presence of bias using the approach outlined in this article. PMID:22384358
Microarray study of single nucleotide polymorphisms and expression of ATP-binding cassette genes in breast tumors

NASA Astrophysics Data System (ADS)

Tsyganov, M. M.; Ibragimova, M. K.; Karabut, I. V.; Freydin, M. B.; Choinzonov, E. L.; Litvyakov, N. V.

2015-11-01

Our previous research establishes that changes of expression of the ATP-binding cassette genes family is connected with the neoadjuvant chemotherapy effect. However, the mechanism of regulation of resistance gene expression remains unclear. As many researchers believe, single nucleotide polymorphisms can be involved in this process. Thereupon, microarray analysis is used to study polymorphisms in ATP-binding cassette genes. It is thus found that MDR gene expression is connected with 5 polymorphisms, i.e. rs241432, rs241429, rs241430, rs3784867, rs59409230, which participate in the regulation of expression of own genes.
Trichomonas vaginalis Metronidazole Resistance Is Associated with Single Nucleotide Polymorphisms in the Nitroreductase Genes ntr4Tv and ntr6Tv

PubMed Central

Paulish-Miller, Teresa E.; Augostini, Peter; Schuyler, Jessica A.; Smith, William L.; Mordechai, Eli; Adelson, Martin E.; Gygax, Scott E.; Secor, William E.

2014-01-01

Metronidazole resistance in the sexually transmitted parasite Trichomonas vaginalis is a problematic public health issue. We have identified single nucleotide polymorphisms (SNPs) in two nitroreductase genes (ntr4Tv and ntr6Tv) associated with resistance. These SNPs were associated with one of two distinct T. vaginalis populations identified by multilocus sequence typing, yet one SNP (ntr6Tv A238T), which results in a premature stop codon, was associated with resistance independent of population structure and may be of diagnostic value. PMID:24550324
Polymorphisms in TS, MTHFR and ERCC1 genes as predictive markers in first-line platinum and pemetrexed therapy in NSCLC patients.

PubMed

Krawczyk, Paweł; Kucharczyk, Tomasz; Kowalski, Dariusz M; Powrózek, Tomasz; Ramlau, Rodryg; Kalinka-Warzocha, Ewa; Winiarczyk, Kinga; Knetki-Wróblewska, Magdalena; Wojas-Krawczyk, Kamila; Kałakucka, Katarzyna; Dyszkiewicz, Wojciech; Krzakowski, Maciej; Milanowski, Janusz

2014-12-01

We presented retrospective analysis of up to five polymorphisms in TS, MTHFR and ERCC1 genes as molecular predictive markers for homogeneous Caucasian, non-squamous NSCLC patients treated with pemetrexed and platinum front-line chemotherapy. The following polymorphisms in DNA isolated from 115 patients were analyzed: various number of 28-bp tandem repeats in 5'-UTR region of TS gene, single nucleotide polymorphism (SNP) within the second tandem repeat of TS gene (G>C); 6-bp deletion in 3'-UTR region of the TS (1494del6); 677C>T SNP in MTHFR; 19007C>T SNP in ERCC1. Molecular examinations' results were correlated with disease control rate, progression-free survival (PFS) and overall survival. Polymorphic tandem repeat sequence (2R, 3R) in the enhancer region of TS gene and G>C SNP within the second repeat of 3R allele seem to be important for the effectiveness of platinum and pemetrexed in first-line chemotherapy. The insignificant shortening of PFS in 3R/3R homozygotes as compared to 2R/2R and 2R/3R genotypes were observed, while it was significantly shorter in patients carrying synchronous 3R allele and G nucleotide. The combined analysis of TS VNTR and MTHFR 677C>T SNP revealed shortening of PFS in synchronous carriers of 3R allele in TS and two C alleles in MTHFR. The strongest factors increased the risk of progression were poor PS, weight loss, anemia and synchronous presence of 3R allele and G nucleotide in the second repeat of 3R allele in TS. Moreover, lack of application of second-line chemotherapy, weight loss and poor performance status and above-mentioned genotype of TS gene increased risk of early mortality. The examined polymorphisms should be accounted as molecular predictor factors for pemetrexed- and platinum-based front-line chemotherapy in non-squamous NSCLC patients.
Distinguishing functional polymorphism from random variation in the sequences of >10,000 HLA-A, -B and -C alleles.

PubMed

Robinson, James; Guethlein, Lisbeth A; Cereb, Nezih; Yang, Soo Young; Norman, Paul J; Marsh, Steven G E; Parham, Peter

2017-06-01

HLA class I glycoproteins contain the functional sites that bind peptide antigens and engage lymphocyte receptors. Recently, clinical application of sequence-based HLA typing has uncovered an unprecedented number of novel HLA class I alleles. Here we define the nature and extent of the variation in 3,489 HLA-A, 4,356 HLA-B and 3,111 HLA-C alleles. This analysis required development of suites of methods, having general applicability, for comparing and analyzing large numbers of homologous sequences. At least three amino-acid substitutions are present at every position in the polymorphic α1 and α2 domains of HLA-A, -B and -C. A minority of positions have an incidence >1% for the 'second' most frequent nucleotide, comprising 70 positions in HLA-A, 85 in HLA-B and 54 in HLA-C. The majority of these positions have three or four alternative nucleotides. These positions were subject to positive selection and correspond to binding sites for peptides and receptors. Most alleles of HLA class I (>80%) are very rare, often identified in one person or family, and they differ by point mutation from older, more common alleles. These alleles with single nucleotide polymorphisms reflect the germ-line mutation rate. Their frequency predicts the human population harbors 8-9 million HLA class I variants. The common alleles of human populations comprise 42 core alleles, which represent all selected polymorphism, and recombinants that have assorted this polymorphism.
Distinguishing functional polymorphism from random variation in the sequences of >10,000 HLA-A, -B and -C alleles

PubMed Central

Cereb, Nezih; Yang, Soo Young; Marsh, Steven G. E.; Parham, Peter

2017-01-01

HLA class I glycoproteins contain the functional sites that bind peptide antigens and engage lymphocyte receptors. Recently, clinical application of sequence-based HLA typing has uncovered an unprecedented number of novel HLA class I alleles. Here we define the nature and extent of the variation in 3,489 HLA-A, 4,356 HLA-B and 3,111 HLA-C alleles. This analysis required development of suites of methods, having general applicability, for comparing and analyzing large numbers of homologous sequences. At least three amino-acid substitutions are present at every position in the polymorphic α1 and α2 domains of HLA-A, -B and -C. A minority of positions have an incidence >1% for the ‘second’ most frequent nucleotide, comprising 70 positions in HLA-A, 85 in HLA-B and 54 in HLA-C. The majority of these positions have three or four alternative nucleotides. These positions were subject to positive selection and correspond to binding sites for peptides and receptors. Most alleles of HLA class I (>80%) are very rare, often identified in one person or family, and they differ by point mutation from older, more common alleles. These alleles with single nucleotide polymorphisms reflect the germ-line mutation rate. Their frequency predicts the human population harbors 8–9 million HLA class I variants. The common alleles of human populations comprise 42 core alleles, which represent all selected polymorphism, and recombinants that have assorted this polymorphism. PMID:28650991
Classification of pseudo pairs between nucleotide bases and amino acids by analysis of nucleotide-protein complexes.

PubMed

Kondo, Jiro; Westhof, Eric

2011-10-01

Nucleotide bases are recognized by amino acid residues in a variety of DNA/RNA binding and nucleotide binding proteins. In this study, a total of 446 crystal structures of nucleotide-protein complexes are analyzed manually and pseudo pairs together with single and bifurcated hydrogen bonds observed between bases and amino acids are classified and annotated. Only 5 of the 20 usual amino acid residues, Asn, Gln, Asp, Glu and Arg, are able to orient in a coplanar fashion in order to form pseudo pairs with nucleotide bases through two hydrogen bonds. The peptide backbone can also form pseudo pairs with nucleotide bases and presents a strong bias for binding to the adenine base. The Watson-Crick side of the nucleotide bases is the major interaction edge participating in such pseudo pairs. Pseudo pairs between the Watson-Crick edge of guanine and Asp are frequently observed. The Hoogsteen edge of the purine bases is a good discriminatory element in recognition of nucleotide bases by protein side chains through the pseudo pairing: the Hoogsteen edge of adenine is recognized by various amino acids while the Hoogsteen edge of guanine is only recognized by Arg. The sugar edge is rarely recognized by either the side-chain or peptide backbone of amino acid residues.
Effect of a single polymorphism in the Japanese quail NK-lysin gene on antimicrobial activity.

PubMed

Ishige, Taichiro; Hara, Hiromi; Hirano, Takashi; Kono, Tomohiro; Hanzawa, Kei

2016-01-01

NK-lysins are cationic peptides that play important roles in host protection, and are an important constituent of innate immunity. We identified nine single-nucleotide polymorphisms (SNPs) in the NK-lysin open reading frame (ORF) from 32 Japanese quails in six strains: A, B, ND, K, P, and Y. The G to A substitution at nucleotide position 272 in the ORF resulted in a Gly (G) to Asp (D) amino acid substitution (Cj31G and Cj31D alleles). The Cj31D allele was detected in P (frequency 0.76) and Y (frequency 0.03) strains. We compared the antimicrobial activities of four synthetic peptides from the helix 2-loop-helix 3 region of avian NK-lysins against Escherichia coli: Cj31G and Cj31D from quail and Gg29N and Gg29D from chicken. The antimicrobial activities of the four peptides decreased in the following order: Gg29N > Cj31G > Gg29D > Cj31D (P < 0.05). Although there were no differences in the predicted secondary structure of the Cj31G and Cj31D, the net charge of the Cj31G was higher than that of Cj31D. These data indicated that the antimicrobial activity of CjNKL is influenced by net charge, similar to that which has been observed in chicken. © 2015 Japanese Society of Animal Science.
Development of solution-gated graphene transistor model for biosensors

NASA Astrophysics Data System (ADS)

Karimi, Hediyeh; Yusof, Rubiyah; Rahmani, Rasoul; Hosseinpour, Hoda; Ahmadi, Mohammad T.

2014-02-01

The distinctive properties of graphene, characterized by its high carrier mobility and biocompatibility, have stimulated extreme scientific interest as a promising nanomaterial for future nanoelectronic applications. In particular, graphene-based transistors have been developed rapidly and are considered as an option for DNA sensing applications. Recent findings in the field of DNA biosensors have led to a renewed interest in the identification of genetic risk factors associated with complex human diseases for diagnosis of cancers or hereditary diseases. In this paper, an analytical model of graphene-based solution gated field effect transistors (SGFET) is proposed to constitute an important step towards development of DNA biosensors with high sensitivity and selectivity. Inspired by this fact, a novel strategy for a DNA sensor model with capability of single-nucleotide polymorphism detection is proposed and extensively explained. First of all, graphene-based DNA sensor model is optimized using particle swarm optimization algorithm. Based on the sensing mechanism of DNA sensors, detective parameters ( I ds and V gmin) are suggested to facilitate the decision making process. Finally, the behaviour of graphene-based SGFET is predicted in the presence of single-nucleotide polymorphism with an accuracy of more than 98% which guarantees the reliability of the optimized model for any application of the graphene-based DNA sensor. It is expected to achieve the rapid, quick and economical detection of DNA hybridization which could speed up the realization of the next generation of the homecare sensor system.
A Caenorhabditis elegans Wild Type Defies the Temperature–Size Rule Owing to a Single Nucleotide Polymorphism in tra-3

PubMed Central

Kammenga, Jan E; Doroszuk, Agnieszka; Riksen, Joost A. G; Hazendonk, Esther; Spiridon, Laurentiu; Petrescu, Andrei-Jose; Tijsterman, Marcel; Plasterk, Ronald H. A; Bakker, Jaap

2007-01-01

Ectotherms rely for their body heat on surrounding temperatures. A key question in biology is why most ectotherms mature at a larger size at lower temperatures, a phenomenon known as the temperature–size rule. Since temperature affects virtually all processes in a living organism, current theories to explain this phenomenon are diverse and complex and assert often from opposing assumptions. Although widely studied, the molecular genetic control of the temperature–size rule is unknown. We found that the Caenorhabditis elegans wild-type N2 complied with the temperature–size rule, whereas wild-type CB4856 defied it. Using a candidate gene approach based on an N2 × CB4856 recombinant inbred panel in combination with mutant analysis, complementation, and transgenic studies, we show that a single nucleotide polymorphism in tra-3 leads to mutation F96L in the encoded calpain-like protease. This mutation attenuates the ability of CB4856 to grow larger at low temperature. Homology modelling predicts that F96L reduces TRA-3 activity by destabilizing the DII-A domain. The data show that size adaptation of ectotherms to temperature changes may be less complex than previously thought because a subtle wild-type polymorphism modulates the temperature responsiveness of body size. These findings provide a novel step toward the molecular understanding of the temperature–size rule, which has puzzled biologists for decades. PMID:17335351
Genetic and epigenetic transgenerational implications related to omega-3 fatty acids. Part II: maternal FADS2 rs174575 genotype and DNA methylation predict toddler cognitive performance.

PubMed

Cheatham, Carol L; Lupu, Daniel S; Niculescu, Mihai D

2015-11-01

Maternal transfer of fatty acids is important to fetal brain development. The prenatal environment may differentially affect the substrates supporting declarative memory abilities, as the level of fatty acids transferred across the placenta may be affected by the maternal fatty acid desaturase 2 (FADS2) rs174575 single nucleotide polymorphism. In this study, we hypothesized that toddler and maternal rs174575 genotype and FADS2 promoter methylation would be related to the toddlers' declarative memory performance. Seventy-one 16-month-old toddlers participated in an imitation paradigm designed to test immediate and long-term declarative memory abilities. FADS2 rs174575 genotype was determined and FADS2 promoter methylation was quantified from blood by bisulfite pyrosequencing for the toddlers and their natural mothers. Toddlers of GG mothers at the FADS2 rs174575 single nucleotide polymorphism did not perform as well on memory assessments as toddlers of CC or CG mothers when controlling for plasma α-linolenic acid and child genotype. Toddler methylation status was related to immediate memory performance, whereas maternal methylation status was related to delayed memory performance. Thus, prenatal experience and maternal FADS2 status have a pervasive, long-lasting influence on the brain development of the offspring, but as the postnatal environment becomes more primary, the offsprings' own biology begins to have an effect. Copyright © 2015 Elsevier Inc. All rights reserved.
Development of solution-gated graphene transistor model for biosensors

PubMed Central

2014-01-01

The distinctive properties of graphene, characterized by its high carrier mobility and biocompatibility, have stimulated extreme scientific interest as a promising nanomaterial for future nanoelectronic applications. In particular, graphene-based transistors have been developed rapidly and are considered as an option for DNA sensing applications. Recent findings in the field of DNA biosensors have led to a renewed interest in the identification of genetic risk factors associated with complex human diseases for diagnosis of cancers or hereditary diseases. In this paper, an analytical model of graphene-based solution gated field effect transistors (SGFET) is proposed to constitute an important step towards development of DNA biosensors with high sensitivity and selectivity. Inspired by this fact, a novel strategy for a DNA sensor model with capability of single-nucleotide polymorphism detection is proposed and extensively explained. First of all, graphene-based DNA sensor model is optimized using particle swarm optimization algorithm. Based on the sensing mechanism of DNA sensors, detective parameters (Ids and Vgmin) are suggested to facilitate the decision making process. Finally, the behaviour of graphene-based SGFET is predicted in the presence of single-nucleotide polymorphism with an accuracy of more than 98% which guarantees the reliability of the optimized model for any application of the graphene-based DNA sensor. It is expected to achieve the rapid, quick and economical detection of DNA hybridization which could speed up the realization of the next generation of the homecare sensor system. PMID:24517158

Hyperglycemia and a common variant of GCKR are associated with the levels of eight amino acids in 9,369 Finnish men.

PubMed

Stancáková, Alena; Civelek, Mete; Saleem, Niyas K; Soininen, Pasi; Kangas, Antti J; Cederberg, Henna; Paananen, Jussi; Pihlajamäki, Jussi; Bonnycastle, Lori L; Morken, Mario A; Boehnke, Michael; Pajukanta, Päivi; Lusis, Aldons J; Collins, Francis S; Kuusisto, Johanna; Ala-Korpela, Mika; Laakso, Markku

2012-07-01

We investigated the association of glycemia and 43 genetic risk variants for hyperglycemia/type 2 diabetes with amino acid levels in the population-based Metabolic Syndrome in Men (METSIM) Study, including 9,369 nondiabetic or newly diagnosed type 2 diabetic Finnish men. Plasma levels of eight amino acids were measured with proton nuclear magnetic resonance spectroscopy. Increasing fasting and 2-h plasma glucose levels were associated with increasing levels of several amino acids and decreasing levels of histidine and glutamine. Alanine, leucine, isoleucine, tyrosine, and glutamine predicted incident type 2 diabetes in a 4.7-year follow-up of the METSIM Study, and their effects were largely mediated by insulin resistance (except for glutamine). We also found significant correlations between insulin sensitivity (Matsuda insulin sensitivity index) and mRNA expression of genes regulating amino acid degradation in 200 subcutaneous adipose tissue samples. Only 1 of 43 risk single nucleotide polymorphisms for type 2 diabetes or hyperglycemia, the glucose-increasing major C allele of rs780094 of GCKR, was significantly associated with decreased levels of alanine and isoleucine and elevated levels of glutamine. In conclusion, the levels of branched-chain, aromatic amino acids and alanine increased and the levels of glutamine and histidine decreased with increasing glycemia, reflecting, at least in part, insulin resistance. Only one single nucleotide polymorphism regulating hyperglycemia was significantly associated with amino acid levels.
Hyperglycemia and a Common Variant of GCKR Are Associated With the Levels of Eight Amino Acids in 9,369 Finnish Men

PubMed Central

Stančáková, Alena; Civelek, Mete; Saleem, Niyas K.; Soininen, Pasi; Kangas, Antti J.; Cederberg, Henna; Paananen, Jussi; Pihlajamäki, Jussi; Bonnycastle, Lori L.; Morken, Mario A.; Boehnke, Michael; Pajukanta, Päivi; Lusis, Aldons J.; Collins, Francis S.; Kuusisto, Johanna; Ala-Korpela, Mika; Laakso, Markku

2012-01-01

We investigated the association of glycemia and 43 genetic risk variants for hyperglycemia/type 2 diabetes with amino acid levels in the population-based Metabolic Syndrome in Men (METSIM) Study, including 9,369 nondiabetic or newly diagnosed type 2 diabetic Finnish men. Plasma levels of eight amino acids were measured with proton nuclear magnetic resonance spectroscopy. Increasing fasting and 2-h plasma glucose levels were associated with increasing levels of several amino acids and decreasing levels of histidine and glutamine. Alanine, leucine, isoleucine, tyrosine, and glutamine predicted incident type 2 diabetes in a 4.7-year follow-up of the METSIM Study, and their effects were largely mediated by insulin resistance (except for glutamine). We also found significant correlations between insulin sensitivity (Matsuda insulin sensitivity index) and mRNA expression of genes regulating amino acid degradation in 200 subcutaneous adipose tissue samples. Only 1 of 43 risk single nucleotide polymorphisms for type 2 diabetes or hyperglycemia, the glucose-increasing major C allele of rs780094 of GCKR, was significantly associated with decreased levels of alanine and isoleucine and elevated levels of glutamine. In conclusion, the levels of branched-chain, aromatic amino acids and alanine increased and the levels of glutamine and histidine decreased with increasing glycemia, reflecting, at least in part, insulin resistance. Only one single nucleotide polymorphism regulating hyperglycemia was significantly associated with amino acid levels. PMID:22553379
Rapid single nucleotide polymorphism based method for hematopoietic chimerism analysis and monitoring using high-speed droplet allele-specific PCR and allele-specific quantitative PCR.

PubMed

Taira, Chiaki; Matsuda, Kazuyuki; Yamaguchi, Akemi; Uehara, Masayuki; Sugano, Mitsutoshi; Okumura, Nobuo; Honda, Takayuki

2015-05-20

Chimerism analysis is important for the evaluation of engraftment and predicting relapse following hematopoietic stem cell transplantation (HSCT). We developed a chimerism analysis for single nucleotide polymorphisms (SNPs), including rapid screening of the discriminable donor/recipient alleles using droplet allele-specific PCR (droplet-AS-PCR) pre-HSCT and quantitation of recipient DNA using AS-quantitative PCR (AS-qPCR) following HSCT. SNP genotyping of 20 donor/recipient pairs via droplet-AS-PCR and the evaluation of the informativity of 5 SNP markers for chimerism analysis were performed. Samples from six follow-up patients were analyzed to assess the chimerism via AS-qPCR. These results were compared with that determined by short tandem repeat PCR (STR-PCR). Droplet-AS-PCR could determine genotypes within 8min. The total informativity using all 5 loci was 95% (19/20). AS-qPCR provided the percentage of recipient DNA in all 6 follow-up patients without influence of the stutter peak or the amplification efficacy, which affected the STR-PCR results. The droplet-AS-PCR had an advantage over STR-PCR in terms of rapidity and simplicity for screening before HSCT. Furthermore, AS-qPCR had better accuracy than STR-PCR for quantification of recipient DNA following HSCT. The present chimerism assay compensates for the disadvantages of STR-PCR and is readily performable in clinical laboratories. Copyright © 2015 Elsevier B.V. All rights reserved.
Impact of EZH2 polymorphisms on urothelial cell carcinoma susceptibility and clinicopathologic features.

PubMed

Yu, Yung-Luen; Su, Kuo-Jung; Hsieh, Ming-Ju; Wang, Shian-Shiang; Wang, Po-Hui; Weng, Wei-Chun; Yang, Shun-Fa

2014-01-01

The gene EZH2, the polycomb group protein enhancer of zeste 2, encodes a transcriptional repressor that also serves as a histone methyltransferase that is associated with progression to more advanced disease in a variety of malignancies. EZH2 expression level in urothelial cell carcinoma (UCC) is highly correlated with tumor aggressiveness, but it has not been determined if specific EZH2 genetic variants are associated with UCC risk. This study investigated the potential associations of EZH2 single-nucleotide polymorphisms with UCC susceptibility and its clinicopathologic characteristics. A total of 233 UCC patients and 552 cancer-free controls, all of whom were from Taiwan, were analyzed for four EZH2 single-nucleotide polymorphisms (rs6950683, rs2302427, rs3757441, and rs41277434) using real-time PCR genotyping. After adjusting for other co-variants, we found that individuals carrying at least one C allele at EZH2 rs6950683 had a lower risk of developing UCC than did major allele carriers. The CCCA or TGTA haplotype among the four EZH2 sites was also associated with a reduced risk of UCC. Furthermore, UCC patients who carried at least one G allele at rs2302427 had a lower invasive tumor stage than did patients carrying the major allele. The rs6950683 SNPs of EZH2 might contribute to the prediction of UCC susceptibility. This is the first study to provide insight into risk factors associated with EZH2 variants in carcinogenesis of UCC in Taiwan.
Whole-exome sequencing and digital PCR identified a novel compound heterozygous mutation in the NPHP1 gene in a case of Joubert syndrome and related disorders.

PubMed

Koyama, Shingo; Sato, Hidenori; Wada, Manabu; Kawanami, Toru; Emi, Mitsuru; Kato, Takeo

2017-03-27

Joubert syndrome and related disorders (JSRD) is a clinically and genetically heterogeneous condition with autosomal recessive or X-linked inheritance, which share a distinctive neuroradiological hallmark, the so-called molar tooth sign. JSRD is classified into six clinical subtypes based on associated variable multiorgan involvement. To date, 21 causative genes have been identified in JSRD, which makes genetic diagnosis difficult. We report here a case of a 28-year-old Japanese woman diagnosed with JS with oculorenal defects with a novel compound heterozygous mutation (p.Ser219*/deletion) in the NPHP1 gene. Whole-exome sequencing (WES) of the patient identified the novel nonsense mutation in an apparently homozygous state. However, it was absent in her mother and heterozygous in her father. A read depth-based copy number variation (CNV) detection algorithm using WES data of the family predicted a large heterozygous deletion mutation in the patient and her mother, which was validated by digital polymerase chain reaction, indicating that the patient was compound heterozygous for the paternal nonsense mutation and the maternal deletion mutation spanning the site of the single nucleotide change. It should be noted that analytical pipelines that focus purely on sequence information cannot distinguish homozygosity from hemizygosity because of its inability to detect large deletions. The ability to detect CNVs in addition to single nucleotide variants and small insertion/deletions makes WES an attractive diagnostic tool for genetically heterogeneous disorders.
High-resolution melting genotyping of Enterococcus faecium based on multilocus sequence typing derived single nucleotide polymorphisms.

PubMed

Tong, Steven Y C; Xie, Shirley; Richardson, Leisha J; Ballard, Susan A; Dakh, Farshid; Grabsch, Elizabeth A; Grayson, M Lindsay; Howden, Benjamin P; Johnson, Paul D R; Giffard, Philip M

2011-01-01

We have developed a single nucleotide polymorphism (SNP) nucleated high-resolution melting (HRM) technique to genotype Enterococcus faecium. Eight SNPs were derived from the E. faecium multilocus sequence typing (MLST) database and amplified fragments containing these SNPs were interrogated by HRM. We tested the HRM genotyping scheme on 85 E. faecium bloodstream isolates and compared the results with MLST, pulsed-field gel electrophoresis (PFGE) and an allele specific real-time PCR (AS kinetic PCR) SNP typing method. In silico analysis based on predicted HRM curves according to the G+C content of each fragment for all 567 sequence types (STs) in the MLST database together with empiric data from the 85 isolates demonstrated that HRM analysis resolves E. faecium into 231 "melting types" (MelTs) and provides a Simpson's Index of Diversity (D) of 0.991 with respect to MLST. This is a significant improvement on the AS kinetic PCR SNP typing scheme that resolves 61 SNP types with D of 0.95. The MelTs were concordant with the known ST of the isolates. For the 85 isolates, there were 13 PFGE patterns, 17 STs, 14 MelTs and eight SNP types. There was excellent concordance between PFGE, MLST and MelTs with Adjusted Rand Indices of PFGE to MelT 0.936 and ST to MelT 0.973. In conclusion, this HRM based method appears rapid and reproducible. The results are concordant with MLST and the MLST based population structure.
CNG and HCN channels: two peas, one pod.

PubMed

Craven, Kimberley B; Zagotta, William N

2006-01-01

Cyclic nucleotide-activated ion channels play a fundamental role in a variety of physiological processes. By opening in response to intracellular cyclic nucleotides, they translate changes in concentrations of signaling molecules to changes in membrane potential. These channels belong to two families: the cyclic nucleotide-gated (CNG) channels and the hyperpolarization-activated cyclic nucleotide-modulated (HCN) channels. The two families exhibit high sequence similarity and belong to the superfamily of voltage-gated potassium channels. Whereas HCN channels are activated by voltage and CNG channels are virtually voltage independent, both channels are activated by cyclic nucleotide binding. Furthermore, the channels are thought to have similar channel structures, leading to similar mechanisms of activation by cyclic nucleotides. However, although these channels are structurally and behaviorally similar, they have evolved to perform distinct physiological functions. This review describes the physiological roles and biophysical behavior of CNG and HCN channels. We focus on how similarities in structure and activation mechanisms result in common biophysical models, allowing CNG and HCN channels to be viewed as a single genre.
Structure of a eukaryotic cyclic nucleotide-gated channel

PubMed Central

Li, Minghui; Zhou, Xiaoyuan; Wang, Shu; Michailidis, Ioannis; Gong, Ye; Su, Deyuan; Li, Huan; Li, Xueming; Yang, Jian

2018-01-01

Summary Cyclic nucleotide-gated (CNG) channels are essential for vision and olfaction. They belong to the voltage-gated ion channel superfamily but their activities are controlled by intracellular cyclic nucleotides instead of transmembrane voltage. Here we report a 3.5 Å-resolution single-particle electron cryomicroscopy structure of a CNG channel from C. elegans in the cGMP-bound open state. The channel has an unusual voltage-sensor-like domain (VSLD), accounting for its deficient voltage dependence. A C-terminal linker connecting S6 and the cyclic nucleotide-binding domain interacts directly with both the VSLD and pore domain, forming a gating ring that couples conformational changes triggered by cyclic nucleotide binding to the gate. The selectivity filter is lined by the carboxylate side chains of a functionally important glutamate and three rings of backbone carbonyls. This structure provides a new framework for understanding mechanisms of ion permeation, gating and channelopathy of CNG channels and cyclic nucleotide modulation of related channels. PMID:28099415
Nucleotide polymorphisms in a pine ortholog of the Arabidopsis degrading enzyme cellulase KORRIGAN are associated with early growth performance in Pinus pinaster.

PubMed

Cabezas, José Antonio; González-Martínez, Santiago C; Collada, Carmen; Guevara, María Angeles; Boury, Christophe; de María, Nuria; Eveno, Emmanuelle; Aranda, Ismael; Garnier-Géré, Pauline H; Brach, Jean; Alía, Ricardo; Plomion, Christophe; Cervera, María Teresa

2015-09-01

We have carried out a candidate-gene-based association genetic study in Pinus pinaster Aiton and evaluated the predictive performance for genetic merit gain of the most significantly associated genes and single nucleotide polymorphisms (SNPs). We used a second generation 384-SNP array enriched with candidate genes for growth and wood properties to genotype mother trees collected in 20 natural populations covering most of the European distribution of the species. Phenotypic data for total height, polycyclism, root-collar diameter and biomass were obtained from a replicated provenance-progeny trial located in two sites with contrasting environments (Atlantic vs Mediterranean climate). General linear models identified strong associations between growth traits (total height and polycyclism) and four SNPs from the korrigan candidate gene, after multiple testing corrections using false discovery rate. The combined genomic breeding value predictions assessed for the four associated korrigan SNPs by ridge regression-best linear unbiased prediction (RR-BLUP) and cross-validation accounted for up to 8 and 15% of the phenotypic variance for height and polycyclic growth, respectively, and did not improve adding SNPs from other growth-related candidate genes. For root-collar diameter and total biomass, they accounted for 1.6 and 1.1% of the phenotypic variance, respectively, but increased to 15 and 4.1% when other SNPs from lp3.1, lp3.3 and cad were included in RR-BLUP models. These results point towards a desirable integration of candidate-gene studies as a means to pre-select relevant markers, and aid genomic selection in maritime pine breeding programs. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Aquaporin-4 polymorphisms and brain/body weight ratio in sudden infant death syndrome (SIDS).

PubMed

Studer, Jacqueline; Bartsch, Christine; Haas, Cordula

2014-07-01

Failure in the regulation of homeostatic water balance in the brain is associated with severe cerebral edema and increased brain weights and may also play an important role in the pathogenesis of sudden infant death syndrome (SIDS). We genotyped three single-nucleotide polymorphisms in the aquaporin-4 water channel-encoding gene (AQP4), which were previously shown to be associated with (i) SIDS in Norwegian infants (rs2075575), (ii) severe brain edema (rs9951307), and (iii) increased brain water permeability (rs3906956). We also determined whether the brain/body weight ratio is increased in SIDS infants compared with sex- and age-matched controls. Genotyping of the three AQP4 single-nucleotide polymorphisms was performed in 160 Caucasian SIDS infants and 181 healthy Swiss adults using a single-base extension method. Brain and body weights were measured during autopsy in 157 SIDS and 59 non-SIDS infants. No differences were detected in the allelic frequencies of the three AQP4 single-nucleotide polymorphisms between SIDS and adult controls. The brain/body weight ratio was similarly distributed in SIDS and non-SIDS infants. Variations in the AQP4 gene seem of limited significance as predisposing factors in Caucasian SIDS infants. Increased brain weights may only become evident in conjunction with environmental or other genetic risk factors.
Validation of Skeletal Muscle cis-Regulatory Module Predictions Reveals Nucleotide Composition Bias in Functional Enhancers

PubMed Central

Kwon, Andrew T.; Chou, Alice Yi; Arenillas, David J.; Wasserman, Wyeth W.

2011-01-01

We performed a genome-wide scan for muscle-specific cis-regulatory modules (CRMs) using three computational prediction programs. Based on the predictions, 339 candidate CRMs were tested in cell culture with NIH3T3 fibroblasts and C2C12 myoblasts for capacity to direct selective reporter gene expression to differentiated C2C12 myotubes. A subset of 19 CRMs validated as functional in the assay. The rate of predictive success reveals striking limitations of computational regulatory sequence analysis methods for CRM discovery. Motif-based methods performed no better than predictions based only on sequence conservation. Analysis of the properties of the functional sequences relative to inactive sequences identifies nucleotide sequence composition can be an important characteristic to incorporate in future methods for improved predictive specificity. Muscle-related TFBSs predicted within the functional sequences display greater sequence conservation than non-TFBS flanking regions. Comparison with recent MyoD and histone modification ChIP-Seq data supports the validity of the functional regions. PMID:22144875
Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome.

PubMed

Dresch, Jacqueline M; Zellers, Rowan G; Bork, Daniel K; Drewell, Robert A

2016-01-01

A long-standing objective in modern biology is to characterize the molecular components that drive the development of an organism. At the heart of eukaryotic development lies gene regulation. On the molecular level, much of the research in this field has focused on the binding of transcription factors (TFs) to regulatory regions in the genome known as cis-regulatory modules (CRMs). However, relatively little is known about the sequence-specific binding preferences of many TFs, especially with respect to the possible interdependencies between the nucleotides that make up binding sites. A particular limitation of many existing algorithms that aim to predict binding site sequences is that they do not allow for dependencies between nonadjacent nucleotides. In this study, we use a recently developed computational algorithm, MARZ, to compare binding site sequences using 32 distinct models in a systematic and unbiased approach to explore nucleotide dependencies within binding sites for 15 distinct TFs known to be critical to Drosophila development. Our results indicate that many of these proteins have varying levels of nucleotide interdependencies within their DNA recognition sequences, and that, in some cases, models that account for these dependencies greatly outperform traditional models that are used to predict binding sites. We also directly compare the ability of different models to identify the known KRUPPEL TF binding sites in CRMs and demonstrate that a more complex model that accounts for nucleotide interdependencies performs better when compared with simple models. This ability to identify TFs with critical nucleotide interdependencies in their binding sites will lead to a deeper understanding of how these molecular characteristics contribute to the architecture of CRMs and the precise regulation of transcription during organismal development.
Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome

PubMed Central

Dresch, Jacqueline M.; Zellers, Rowan G.; Bork, Daniel K.; Drewell, Robert A.

2016-01-01

A long-standing objective in modern biology is to characterize the molecular components that drive the development of an organism. At the heart of eukaryotic development lies gene regulation. On the molecular level, much of the research in this field has focused on the binding of transcription factors (TFs) to regulatory regions in the genome known as cis-regulatory modules (CRMs). However, relatively little is known about the sequence-specific binding preferences of many TFs, especially with respect to the possible interdependencies between the nucleotides that make up binding sites. A particular limitation of many existing algorithms that aim to predict binding site sequences is that they do not allow for dependencies between nonadjacent nucleotides. In this study, we use a recently developed computational algorithm, MARZ, to compare binding site sequences using 32 distinct models in a systematic and unbiased approach to explore nucleotide dependencies within binding sites for 15 distinct TFs known to be critical to Drosophila development. Our results indicate that many of these proteins have varying levels of nucleotide interdependencies within their DNA recognition sequences, and that, in some cases, models that account for these dependencies greatly outperform traditional models that are used to predict binding sites. We also directly compare the ability of different models to identify the known KRUPPEL TF binding sites in CRMs and demonstrate that a more complex model that accounts for nucleotide interdependencies performs better when compared with simple models. This ability to identify TFs with critical nucleotide interdependencies in their binding sites will lead to a deeper understanding of how these molecular characteristics contribute to the architecture of CRMs and the precise regulation of transcription during organismal development. PMID:27330274
Characterization of genomic sequence showing strong association with polyembryony among diverse Citrus species and cultivars, and its synteny with Vitis and Populus.

PubMed

Nakano, Michiharu; Shimada, Takehiko; Endo, Tomoko; Fujii, Hiroshi; Nesumi, Hirohisa; Kita, Masayuki; Ebina, Masumi; Shimizu, Tokurou; Omura, Mitsuo

2012-02-01

Polyembryony, in which multiple somatic nucellar cell-derived embryos develop in addition to the zygotic embryo in a seed, is common in the genus Citrus. Previous genetic studies indicated polyembryony is mainly determined by a single locus, but the underlying molecular mechanism is still unclear. As a step towards identification and characterization of the gene or genes responsible for nucellar embryogenesis in Citrus, haplotype-specific physical maps around the polyembryony locus were constructed. By sequencing three BAC clones aligned on the polyembryony haplotype, a single contiguous draft sequence consisting of 380 kb containing 70 predicted open reading frames (ORFs) was reconstructed. Single nucleotide polymorphism genotypes detected in the sequenced genomic region showed strong association with embryo type in Citrus, indicating a common polyembryony locus is shared among widely diverse Citrus cultivars and species. The arrangement of the predicted ORFs in the characterized genomic region showed high collinearity to the genomic sequence of chromosome 4 of Vitis vinifera and linkage group VI of Populus trichocarpa, suggesting that the syntenic relationship among these species is conserved even though V. vinifera and P. trichocarpa are non-apomictic species. This is the first study to characterize in detail the genomic structure of an apomixis locus determining adventitious embryony. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Assessment of Genetic and Nongenetic Interactions for the Prediction of Depressive Symptomatology: An Analysis of the Wisconsin Longitudinal Study Using Machine Learning Algorithms

PubMed Central

Roetker, Nicholas S.; Yonker, James A.; Chang, Vicky; Roan, Carol L.; Herd, Pamela; Hauser, Taissa S.; Hauser, Robert M.

2013-01-01

Objectives. We examined depression within a multidimensional framework consisting of genetic, environmental, and sociobehavioral factors and, using machine learning algorithms, explored interactions among these factors that might better explain the etiology of depressive symptoms. Methods. We measured current depressive symptoms using the Center for Epidemiologic Studies Depression Scale (n = 6378 participants in the Wisconsin Longitudinal Study). Genetic factors were 78 single nucleotide polymorphisms (SNPs); environmental factors—13 stressful life events (SLEs), plus a composite proportion of SLEs index; and sociobehavioral factors—18 personality, intelligence, and other health or behavioral measures. We performed traditional SNP associations via logistic regression likelihood ratio testing and explored interactions with support vector machines and Bayesian networks. Results. After correction for multiple testing, we found no significant single genotypic associations with depressive symptoms. Machine learning algorithms showed no evidence of interactions. Naïve Bayes produced the best models in both subsets and included only environmental and sociobehavioral factors. Conclusions. We found no single or interactive associations with genetic factors and depressive symptoms. Various environmental and sociobehavioral factors were more predictive of depressive symptoms, yet their impacts were independent of one another. A genome-wide analysis of genetic alterations using machine learning methodologies will provide a framework for identifying genetic–environmental–sociobehavioral interactions in depressive symptoms. PMID:23927508
Analysis of a minimal Rho-GTPase circuit regulating cell shape

NASA Astrophysics Data System (ADS)

Holmes, William R.; Edelstein-Keshet, Leah

2016-08-01

Networks of Rho-family GTPases regulate eukaryotic cell polarization and motility by controlling assembly and contraction of the cytoskeleton. The mutually inhibitory Rac-Rho circuit is emerging as a central, regulatory hub that can affect the shape and motility phenotype of eukaryotic cells. Recent experimental manipulation of the amounts of Rac and Rho or their regulators (guanine nucleotide-exchange factors, GTPase-activating proteins, guanine nucleotide dissociation inhibitors) have been shown to bias the prevalence of these different states and promote transitions between them. Here we show that part of this data can be understood in terms of inherent Rac-Rho mutually inhibitory dynamics. We analyze a spatio-temporal mathematical model of Rac-Rho dynamics to produce a detailed set of predictions of how parameters such as GTPase rates of activation and total amounts affect cell decisions (such as Rho-dominated contraction, Rac-dominated spreading, and spatially segregated Rac-Rho polarization). We find that in some parameter regimes, a cell can take on any of these three fates depending on its environment or stimuli. We also predict how experimental manipulations (corresponding to parameter variations) can affect cell shapes observed. Our methods are based on local perturbation analysis (a kind of nonlinear stability analysis), and an approximation of nonlinear feedback by sharp switches. We compare the Rac-Rho model to an even simpler single-GTPase (‘wave-pinning’) model and demonstrate that the overall behavior is inherent to GTPase properties, rather than stemming solely from network topology.
Positive selection in the SLC11A1 gene in the family Equidae.

PubMed

Bayerova, Zuzana; Janova, Eva; Matiasovic, Jan; Orlando, Ludovic; Horin, Petr

2016-05-01

Immunity-related genes are a suitable model for studying effects of selection at the genomic level. Some of them are highly conserved due to functional constraints and purifying selection, while others are variable and change quickly to cope with the variation of pathogens. The SLC11A1 gene encodes a transporter protein mediating antimicrobial activity of macrophages. Little is known about the patterns of selection shaping this gene during evolution. Although it is a typical evolutionarily conserved gene, functionally important polymorphisms associated with various diseases were identified in humans and other species. We analyzed the genomic organization, genetic variation, and evolution of the SLC11A1 gene in the family Equidae to identify patterns of selection within this important gene. Nucleotide SLC11A1 sequences were shown to be highly conserved in ten equid species, with more than 97 % sequence identity across the family. Single nucleotide polymorphisms (SNPs) were found in the coding and noncoding regions of the gene. Seven codon sites were identified to be under strong purifying selection. Codons located in three regions, including the glycosylated extracellular loop, were shown to be under diversifying selection. A 3-bp indel resulting in a deletion of the amino acid 321 in the predicted protein was observed in all horses, while it has been maintained in all other equid species. This codon comprised in an N-glycosylation site was found to be under positive selection. Interspecific variation in the presence of predicted N-glycosylation sites was observed.
Molecular identification and functional characterisation of uncoupling protein 4 in larva and pupa fat body mitochondria from the beetle Zophobas atratus.

PubMed

Slocinska, Malgorzata; Antos-Krzeminska, Nina; Rosinski, Grzegorz; Jarmuszkiewicz, Wieslawa

2012-08-01

Uncoupling protein 4 (UCP4) is a member of the UCP subfamily that mediates mitochondrial uncoupling, and sequence alignment predicts the existence of UCP4 in several insects. The present study demonstrates the first molecular identification of a partial Zophobas atratus UCP4-coding sequence and the functional characterisation of ZaUCP4 in the mitochondria of larval and pupal fat bodies of the beetle. ZaUCP4 shows a high similarity to predicted insect UCP4 isoforms and known mammalian UCP4s, both at the nucleotide and amino acid sequence levels. Bioenergetic studies clearly demonstrate UCP function in mitochondria from larval and pupal fat bodies. In non-phosphorylating mitochondria, ZaUCP activity was stimulated by palmitic acid and inhibited by the purine nucleotide GTP. In phosphorylating mitochondria, ZaUCP4 activity decreased the yield of oxidative phosphorylation. ZaUCP4 was immunodetected with antibodies raised against human UCP4 as a single 36-kDa band. A lower expression of ZaUCP4 at the level of mRNA and protein and a decreased ZaUCP4 activity were observed in the Z. atratus pupal fat body compared with the larval fat body. The different expression patterns and activity of ZaUCP4 during the larval-pupal transformation indicates an important physiological role for UCP4 in insect fat body development and function during insect metamorphosis. Copyright © 2012 Elsevier Inc. All rights reserved.
Typing of canine parvovirus isolates using mini-sequencing based single nucleotide polymorphism analysis.

PubMed

Naidu, Hariprasad; Subramanian, B Mohana; Chinchkar, Shankar Ramchandra; Sriraman, Rajan; Rana, Samir Kumar; Srinivasan, V A

2012-05-01

The antigenic types of canine parvovirus (CPV) are defined based on differences in the amino acids of the major capsid protein VP2. Type specificity is conferred by a limited number of amino acid changes and in particular by few nucleotide substitutions. PCR based methods are not particularly suitable for typing circulating variants which differ in a few specific nucleotide substitutions. Assays for determining SNPs can detect efficiently nucleotide substitutions and can thus be adapted to identify CPV types. In the present study, CPV typing was performed by single nucleotide extension using the mini-sequencing technique. A mini-sequencing signature was established for all the four CPV types (CPV2, 2a, 2b and 2c) and feline panleukopenia virus. The CPV typing using the mini-sequencing reaction was performed for 13 CPV field isolates and the two vaccine strains available in our repository. All the isolates had been typed earlier by full-length sequencing of the VP2 gene. The typing results obtained from mini-sequencing matched completely with that of sequencing. Typing could be achieved with less than 100 copies of standard plasmid DNA constructs or ≤10¹ FAID₅₀ of virus by mini-sequencing technique. The technique was also efficient for detecting multiple types in mixed infections. Copyright © 2012 Elsevier B.V. All rights reserved.
STRUM: structure-based prediction of protein stability changes upon single-point mutation.

PubMed

Quan, Lijun; Lv, Qiang; Zhang, Yang

2016-10-01

Mutations in human genome are mainly through single nucleotide polymorphism, some of which can affect stability and function of proteins, causing human diseases. Several methods have been proposed to predict the effect of mutations on protein stability; but most require features from experimental structure. Given the fast progress in protein structure prediction, this work explores the possibility to improve the mutation-induced stability change prediction using low-resolution structure modeling. We developed a new method (STRUM) for predicting stability change caused by single-point mutations. Starting from wild-type sequences, 3D models are constructed by the iterative threading assembly refinement (I-TASSER) simulations, where physics- and knowledge-based energy functions are derived on the I-TASSER models and used to train STRUM models through gradient boosting regression. STRUM was assessed by 5-fold cross validation on 3421 experimentally determined mutations from 150 proteins. The Pearson correlation coefficient (PCC) between predicted and measured changes of Gibbs free-energy gap, ΔΔG, upon mutation reaches 0.79 with a root-mean-square error 1.2 kcal/mol in the mutation-based cross-validations. The PCC reduces if separating training and test mutations from non-homologous proteins, which reflects inherent correlations in the current mutation sample. Nevertheless, the results significantly outperform other state-of-the-art methods, including those built on experimental protein structures. Detailed analyses show that the most sensitive features in STRUM are the physics-based energy terms on I-TASSER models and the conservation scores from multiple-threading template alignments. However, the ΔΔG prediction accuracy has only a marginal dependence on the accuracy of protein structure models as long as the global fold is correct. These data demonstrate the feasibility to use low-resolution structure modeling for high-accuracy stability change prediction upon point mutations. http://zhanglab.ccmb.med.umich.edu/STRUM/ CONTACT: qiang@suda.edu.cn and zhng@umich.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

STRUM: structure-based prediction of protein stability changes upon single-point mutation

PubMed Central

Quan, Lijun; Lv, Qiang; Zhang, Yang

2016-01-01

Motivation: Mutations in human genome are mainly through single nucleotide polymorphism, some of which can affect stability and function of proteins, causing human diseases. Several methods have been proposed to predict the effect of mutations on protein stability; but most require features from experimental structure. Given the fast progress in protein structure prediction, this work explores the possibility to improve the mutation-induced stability change prediction using low-resolution structure modeling. Results: We developed a new method (STRUM) for predicting stability change caused by single-point mutations. Starting from wild-type sequences, 3D models are constructed by the iterative threading assembly refinement (I-TASSER) simulations, where physics- and knowledge-based energy functions are derived on the I-TASSER models and used to train STRUM models through gradient boosting regression. STRUM was assessed by 5-fold cross validation on 3421 experimentally determined mutations from 150 proteins. The Pearson correlation coefficient (PCC) between predicted and measured changes of Gibbs free-energy gap, ΔΔG, upon mutation reaches 0.79 with a root-mean-square error 1.2 kcal/mol in the mutation-based cross-validations. The PCC reduces if separating training and test mutations from non-homologous proteins, which reflects inherent correlations in the current mutation sample. Nevertheless, the results significantly outperform other state-of-the-art methods, including those built on experimental protein structures. Detailed analyses show that the most sensitive features in STRUM are the physics-based energy terms on I-TASSER models and the conservation scores from multiple-threading template alignments. However, the ΔΔG prediction accuracy has only a marginal dependence on the accuracy of protein structure models as long as the global fold is correct. These data demonstrate the feasibility to use low-resolution structure modeling for high-accuracy stability change prediction upon point mutations. Availability and Implementation: http://zhanglab.ccmb.med.umich.edu/STRUM/ Contact: qiang@suda.edu.cn and zhng@umich.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27318206
The nucleotide sequence and genome organization of Plasmopara halstedii virus.

PubMed

Heller-Dohmen, Marion; Göpfert, Jens C; Pfannstiel, Jens; Spring, Otmar

2011-03-17

Only very few viruses of Oomycetes have been studied in detail. Isometric virions were found in different isolates of the oomycete Plasmopara halstedii, the downy mildew pathogen of sunflower. However, complete nucleotide sequences and data on the genome organization were lacking. Viral RNA of different P. halstedii isolates was subjected to nucleotide sequencing and analysis of the viral genome. The N-terminal sequence of the viral coat protein was determined using Top-Down MALDI-TOF analysis. The complete nucleotide sequences of both single-stranded RNA segments (RNA1 and RNA2) were established. RNA1 consisted of 2793 nucleotides (nt) exclusive its 3' poly(A) tract and a single open-reading frame (ORF1) of 2745 nt. ORF1 was framed by a 5' untranslated region (5' UTR) of 18 nt and a 3' untranslated region (3' UTR) of 30 nt. ORF1 contained motifs of RNA-dependent RNA polymerases (RdRp) and showed similarities to RdRp of Scleropthora macrospora virus A (SmV A) and viruses within the Nodaviridae family. RNA2 consisted of 1526 nt exclusive its 3' poly(A) tract and a second ORF (ORF2) of 1128 nt. ORF2 coded for the single viral coat protein (CP) and was framed by a 5' UTR of 164 nt and a 3' UTR of 234 nt. The deduced amino acid sequence of ORF2 was verified by nano-LC-ESI-MS/MS experiments. Top-Down MALDI-TOF analysis revealed the N-terminal sequence of the CP. The N-terminal sequence represented a region within ORF2 suggesting a proteolytic processing of the CP in vivo. The CP showed similarities to CP of SmV A and viruses within the Tombusviridae family. Fragments of RNA1 (ca. 1.9 kb) and RNA2 (ca. 1.4 kb) were used to analyze the nucleotide sequence variation of virions in different P. halstedii isolates. Viral sequence variation was 0.3% or less regardless of their host's pathotypes, the geographical origin and the sensitivity towards the fungicide metalaxyl. The results showed the presence of a single and new virus type in different P. halstedii isolates. Insignificant viral sequence variation indicated that the virus did not account for differences in pathogenicity of the oomycete P. halstedii.
Sasquatch: predicting the impact of regulatory SNPs on transcription factor binding from cell- and tissue-specific DNase footprints.

PubMed

Schwessinger, Ron; Suciu, Maria C; McGowan, Simon J; Telenius, Jelena; Taylor, Stephen; Higgs, Doug R; Hughes, Jim R

2017-10-01

In the era of genome-wide association studies (GWAS) and personalized medicine, predicting the impact of single nucleotide polymorphisms (SNPs) in regulatory elements is an important goal. Current approaches to determine the potential of regulatory SNPs depend on inadequate knowledge of cell-specific DNA binding motifs. Here, we present Sasquatch, a new computational approach that uses DNase footprint data to estimate and visualize the effects of noncoding variants on transcription factor binding. Sasquatch performs a comprehensive k -mer-based analysis of DNase footprints to determine any k -mer's potential for protein binding in a specific cell type and how this may be changed by sequence variants. Therefore, Sasquatch uses an unbiased approach, independent of known transcription factor binding sites and motifs. Sasquatch only requires a single DNase-seq data set per cell type, from any genotype, and produces consistent predictions from data generated by different experimental procedures and at different sequence depths. Here we demonstrate the effectiveness of Sasquatch using previously validated functional SNPs and benchmark its performance against existing approaches. Sasquatch is available as a versatile webtool incorporating publicly available data, including the human ENCODE collection. Thus, Sasquatch provides a powerful tool and repository for prioritizing likely regulatory SNPs in the noncoding genome. © 2017 Schwessinger et al.; Published by Cold Spring Harbor Laboratory Press.
pKa predictions for proteins, RNAs, and DNAs with the Gaussian dielectric function using DelPhi pKa.

PubMed

Wang, Lin; Li, Lin; Alexov, Emil

2015-12-01

We developed a Poisson-Boltzmann based approach to calculate the pKa values of protein ionizable residues (Glu, Asp, His, Lys and Arg), nucleotides of RNA and single stranded DNA. Two novel features were utilized: the dielectric properties of the macromolecules and water phase were modeled via the smooth Gaussian-based dielectric function in DelPhi and the corresponding electrostatic energies were calculated without defining the molecular surface. We tested the algorithm by calculating pKa values for more than 300 residues from 32 proteins from the PPD dataset and achieved an overall RMSD of 0.77. Particularly, the RMSD of 0.55 was achieved for surface residues, while the RMSD of 1.1 for buried residues. The approach was also found capable of capturing the large pKa shifts of various single point mutations in staphylococcal nuclease (SNase) from pKa-cooperative dataset, resulting in an overall RMSD of 1.6 for this set of pKa's. Investigations showed that predictions for most of buried mutant residues of SNase could be improved by using higher dielectric constant values. Furthermore, an option to generate different hydrogen positions also improves pKa predictions for buried carboxyl residues. Finally, the pKa calculations on two RNAs demonstrated the capability of this approach for other types of biomolecules. © 2015 Wiley Periodicals, Inc.
Uncovering drug-responsive regulatory elements

PubMed Central

Luizon, Marcelo R; Ahituv, Nadav

2015-01-01

Nucleotide changes in gene regulatory elements can have a major effect on interindividual differences in drug response. For example, by reviewing all published pharmacogenomic genome-wide association studies, we show here that 96.4% of the associated single nucleotide polymorphisms reside in noncoding regions. We discuss how sequencing technologies are improving our ability to identify drug response-associated regulatory elements genome-wide and to annotate nucleotide variants within them. We highlight specific examples of how nucleotide changes in these elements can affect drug response and illustrate the techniques used to find them and functionally characterize them. Finally, we also discuss challenges in the field of drug-responsive regulatory elements that need to be considered in order to translate these findings into the clinic. PMID:26555224
Threshold models for genome-enabled prediction of ordinal categorical traits in plant breeding.

PubMed

Montesinos-López, Osval A; Montesinos-López, Abelardo; Pérez-Rodríguez, Paulino; de Los Campos, Gustavo; Eskridge, Kent; Crossa, José

2014-12-23

Categorical scores for disease susceptibility or resistance often are recorded in plant breeding. The aim of this study was to introduce genomic models for analyzing ordinal characters and to assess the predictive ability of genomic predictions for ordered categorical phenotypes using a threshold model counterpart of the Genomic Best Linear Unbiased Predictor (i.e., TGBLUP). The threshold model was used to relate a hypothetical underlying scale to the outward categorical response. We present an empirical application where a total of nine models, five without interaction and four with genomic × environment interaction (G×E) and genomic additive × additive × environment interaction (G×G×E), were used. We assessed the proposed models using data consisting of 278 maize lines genotyped with 46,347 single-nucleotide polymorphisms and evaluated for disease resistance [with ordinal scores from 1 (no disease) to 5 (complete infection)] in three environments (Colombia, Zimbabwe, and Mexico). Models with G×E captured a sizeable proportion of the total variability, which indicates the importance of introducing interaction to improve prediction accuracy. Relative to models based on main effects only, the models that included G×E achieved 9-14% gains in prediction accuracy; adding additive × additive interactions did not increase prediction accuracy consistently across locations. Copyright © 2015 Montesinos-López et al.
Genetic risk prediction using a spatial autoregressive model with adaptive lasso.

PubMed

Wen, Yalu; Shen, Xiaoxi; Lu, Qing

2018-05-31

With rapidly evolving high-throughput technologies, studies are being initiated to accelerate the process toward precision medicine. The collection of the vast amounts of sequencing data provides us with great opportunities to systematically study the role of a deep catalog of sequencing variants in risk prediction. Nevertheless, the massive amount of noise signals and low frequencies of rare variants in sequencing data pose great analytical challenges on risk prediction modeling. Motivated by the development in spatial statistics, we propose a spatial autoregressive model with adaptive lasso (SARAL) for risk prediction modeling using high-dimensional sequencing data. The SARAL is a set-based approach, and thus, it reduces the data dimension and accumulates genetic effects within a single-nucleotide variant (SNV) set. Moreover, it allows different SNV sets having various magnitudes and directions of effect sizes, which reflects the nature of complex diseases. With the adaptive lasso implemented, SARAL can shrink the effects of noise SNV sets to be zero and, thus, further improve prediction accuracy. Through simulation studies, we demonstrate that, overall, SARAL is comparable to, if not better than, the genomic best linear unbiased prediction method. The method is further illustrated by an application to the sequencing data from the Alzheimer's Disease Neuroimaging Initiative. Copyright © 2018 John Wiley & Sons, Ltd.
DNA polymorphisms predict time to progression from uncomplicated to complicated Crohn's disease.

PubMed

Pernat Drobež, Cvetka; Repnik, Katja; Gorenjak, Mario; Ferkolj, Ivan; Weersma, Rinse K; Potočnik, Uroš

2018-04-01

Most patients with Crohn's disease (CD) are diagnosed with the uncomplicated inflammatory form of the disease (Montreal stage B1). However, the majority of them will progress to complicated stricturing (B2) and penetrating (B3) CD during their lifetimes. The aim of our study was to identify the genetic factors associated with time to progression from uncomplicated to complicated CD. Patients with an inflammatory phenotype at diagnosis were followed up for 10 years. Genotyping was carried out using Illumina ImmunoChip. After quality control, association analyses, Bonferroni's adjustments, linear and Cox's regression, and Kaplan-Meier analysis were carried out for 111 patients and Manhattan plots were constructed. Ten years after diagnosis, 39.1% of the patients still had the inflammatory form and 60.9% progressed to complicated disease, with an average time to progression of 5.91 years. Ileal and ileocolonic locations were associated with the complicated CD (P=1.08E-03). We found that patients with the AA genotype at single-nucleotide polymorphism rs16857259 near the gene CACNA1E progressed to the complicated form later (8.80 years) compared with patients with the AC (5.11 years) or CC (2.00 years) genotypes (P=3.82E-07). In addition, nine single-nucleotide polymorphisms (near the genes RASGRP1, SULF2, XPO1, ZBTB44, HLA DOA/BRD2, HLA DRB1/HLA DQA1, PPARA, PUDP, and KIAA1614) showed a suggestive association with disease progression (P<10). Multivariate Cox's regression analysis on the basis of clinical and genetic data confirmed the association of the selected model with disease progression (P=5.73E-16). Our study confirmed the association between the locus on chromosome 1 near the gene CACNA1E with time to progression from inflammatory to stricturing or penetrating CD. Predicting the time to progression is useful to the clinician in terms of individualizing patients' management.
Analysis of the HLA and non-HLA susceptibility loci in Japanese type 1 diabetes.

PubMed

Yamashita, Hisakuni; Awata, Takuya; Kawasaki, Eiji; Ikegami, Hiroshi; Tanaka, Shoichiro; Maruyama, Taro; Shimada, Akira; Nakanishi, Koji; Takahashi, Kazuma; Kobayashi, Tetsuro; Kawabata, Yumiko; Miyashita, Yumi; Kurihara, Susumu; Morita-Ohkubo, Tomoko; Katayama, Shigehiro

2011-11-01

We previously reported the associations of human leukocyte antigen (HLA) (DRB1 and DQB1), INS, CTLA4, IL2RA, ERBB3 and CLEC16A with Japanese type 1 diabetes (T1D). In this study, we jointly analysed these loci in addition to IFIH1 and IL7R. A maximum of 790 T1D patients and 953 control subjects were analysed. HLA was determined by sequencing-based typing. Seven non-HLA single nucleotide polymorphisms were genotyped using TaqMan assay. HLA DRB1*0405, DRB1*0901 and DRB1*0802-DQB1*0302 haplotypes were positively associated with T1D, while the DRB1*15 haplotypes were negatively associated. Non-HLA single nucleotide polymorphisms, INS, IL2RA, ERBB3, CLEC16A and IL7R were associated with T1D. By a prediction model using the HLA loci alone (HLA model) or the non-HLA loci alone (non-HLA model), it was revealed that the cumulative effect of the non-HLA model was much weaker than that of the HLA model (average increase in odds ratio: 1.17 versus 3.14). Furthermore, the area under the receiver operating characteristic curve of the non-HLA model was also much smaller than that of the HLA model (0.65 versus 0.81, p<10(-11)). Finally, a patient-only analysis revealed the susceptible HLA haplotypes and the risk allele of INS to be negatively associated with slower onset of the disease. In addition, the DRB1*0901 haplotype and the risk alleles of ERBB3, CLEC16A and CTLA4 were positively associated with the co-occurrence of thyroid autoimmunity. Although several non-HLA susceptibility genes in Japanese were confirmed trans-racially and appear to contribute to the heterogeneity of the clinical phenotypes, the cumulative effect on the ability to predict the development of T1D was weak. Copyright © 2011 John Wiley & Sons, Ltd.
Structural insight of dopamine β-hydroxylase, a drug target for complex traits, and functional significance of exonic single nucleotide polymorphisms.

PubMed

Kapoor, Abhijeet; Shandilya, Manish; Kundu, Suman

2011-01-01

Human dopamine β-hydroxylase (DBH) is an important therapeutic target for complex traits. Several single nucleotide polymorphisms (SNPs) have also been identified in DBH with potential adverse physiological effect. However, difficulty in obtaining diffractable crystals and lack of a suitable template for modeling the protein has ensured that neither crystallographic three-dimensional structure nor computational model for the enzyme is available to aid rational drug design, prediction of functional significance of SNPs or analytical protein engineering. Adequate biochemical information regarding human DBH, structural coordinates for peptidylglycine alpha-hydroxylating monooxygenase and computational data from a partial model of rat DBH were used along with logical manual intervention in a novel way to build an in silico model of human DBH. The model provides structural insight into the active site, metal coordination, subunit interface, substrate recognition and inhibitor binding. It reveals that DOMON domain potentially promotes tetramerization, while substrate dopamine and a potential therapeutic inhibitor nepicastat are stabilized in the active site through multiple hydrogen bonding. Functional significance of several exonic SNPs could be described from a structural analysis of the model. The model confirms that SNP resulting in Ala318Ser or Leu317Pro mutation may not influence enzyme activity, while Gly482Arg might actually do so being in the proximity of the active site. Arg549Cys may cause abnormal oligomerization through non-native disulfide bond formation. Other SNPs like Glu181, Glu250, Lys239 and Asp290 could potentially inhibit tetramerization thus affecting function. The first three-dimensional model of full-length human DBH protein was obtained in a novel manner with a set of experimental data as guideline for consistency of in silico prediction. Preliminary physicochemical tests validated the model. The model confirms, rationalizes and provides structural basis for several biochemical data and claims testable hypotheses regarding function. It provides a reasonable template for drug design as well.
Gene Polymorphisms in the CCL5/CCR5 Pathway as a Genetic Biomarker for Outcome and Hand-Foot Skin Reaction in Metastatic Colorectal Cancer Patients Treated With Regorafenib.

PubMed

Suenaga, Mitsukuni; Schirripa, Marta; Cao, Shu; Zhang, Wu; Yang, Dongyun; Ning, Yan; Cremolini, Chiara; Antoniotti, Carlotta; Borelli, Beatrice; Mashima, Tetsuo; Okazaki, Satoshi; Berger, Martin D; Miyamoto, Yuji; Gopez, Roel; Barzi, Afsaneh; Lonardi, Sara; Yamaguchi, Toshiharu; Falcone, Alfredo; Loupakis, Fotios; Lenz, Heinz-Josef

2018-06-01

The C-C motif chemokine ligand 5/C-C motif chemokine receptor 5 (CCL5/CCR5) pathway has been shown to induce endothelial progenitor cell migration, resulting in increased vascular endothelial growth factor A expression. We hypothesized that genetic polymorphisms in the CCL5/CCR5 pathway predict efficacy and toxicity in patients with metastatic colorectal cancer (mCRC) treated with regorafenib. We analyzed genomic DNA extracted from 229 tumor samples from 2 different cohorts of patients who received regorafenib: an evaluation cohort of 79 Japanese patients and a validation cohort of 150 Italian patients. Single nucleotide polymorphisms of CCL5/CCR5 pathway-related genes were analyzed by PCR-based direct sequencing. CCL4 rs1634517 and CCL3 rs1130371 were associated with progression-free survival in the evaluation cohort (hazard ratio [HR] 1.54, P = .043; HR 1.48, P = .064), and progression-free survival (HR 1.74, P < .001; HR 1.66, P = .002) and overall survival (HR 1.65, P = .004; HR 1.65, P = .004) in the validation cohort. The allelic frequencies of CCL5 single nucleotide polymorphisms varied between the evaluation and validation cohorts (G/G variant in rs2280789, 21.5% vs. 1.3%, P < .001; T/T variant in rs3817655, 22.8% vs. 2.7%, P < .001). In the evaluation cohort, patients with the G/G variant in rs2280789 had a higher incidence of grade 3+ hand-foot skin reaction compared to any A allele (53% vs. 27%, P = .078), and similarly to the T/T variant in rs3817655 compared to any A allele (56% vs. 26%, P = .026). Genetic variants in the CCL5/CCR5 pathway may serve as prognostic markers and may predict severe hand-foot skin reaction in mCRC patients receiving regorafenib therapy. Copyright © 2018 Elsevier Inc. All rights reserved.
δ-Aminolevulinic Acid Dehydratase Single Nucleotide Polymorphism 2 (ALAD2) and Peptide Transporter 2*2 Haplotype (hPEPT2*2) Differently Influence Neurobehavior in Low-Level Lead Exposed Children

PubMed Central

Sobin, Christina; Gisel Flores-Montoya, Mayra; Gutierrez, Marisela; Parisi, Natali; Schaub, Tanner

2014-01-01

Delta-aminolevulinic acid dehydratase single nucleotide polymorphism 2 (ALAD2) and peptide transporter haplotype 2*2 (hPEPT2*2) through different pathways can increase brain levels of delta-aminolevulinic acid and are associated with higher blood lead burden in young children. Past child and adult findings regarding ALAD2 and neurobehavior have been inconsistent, and the possible association of hPEPT2*2 and neurobehavior has not yet been examined. Mean blood lead level (BLL), genotype, and neurobehavioral function (fine motor dexterity, working memory, visual attention and short-term memory) were assessed in 206 males and 215 females ages 5.1 to 11.8 years. Ninety-six percent of children had BLLs < 5.0 µg/dL. After adjusting for covariates (sex, age and mother’s level of education) and sibling exclusion (N = 252), generalized linear mixed model analyses showed opposite effects for the ALAD2 and hPEPT2*2 genetic variants. Significant effects for ALAD2 were observed only as interactions with BLL and the results suggested that ALAD2 was neuroprotective. As BLL increased, ALAD2 was associated with enhanced visual attention and enhanced working memory (fewer commission errors). Independent of BLL, hPEPT2*2 predicted poorer motor dexterity and poorer working memory (more commission errors). BLL alone predicted poorer working memory from increased omission errors. The findings provided further substantiation that (independent of the genetic variants examined) lowest-level lead exposure disrupted early neurobehavioral function, and suggested that common genetic variants alter the neurotoxic potential of low-level lead. ALAD2 and hPEPT2*2 may be valuable markers of risk, and indicate novel mechanisms of lead-induced neurotoxicity. Longitudinal studies are needed to examine long-term influences of these genetic variants on neurobehavior. PMID:25514583
δ-Aminolevulinic acid dehydratase single nucleotide polymorphism 2 (ALAD2) and peptide transporter 2*2 haplotype (hPEPT2*2) differently influence neurobehavior in low-level lead exposed children.

PubMed

Sobin, Christina; Flores-Montoya, Mayra Gisel; Gutierrez, Marisela; Parisi, Natali; Schaub, Tanner

2015-01-01

Delta-aminolevulinic acid dehydratase single nucleotide polymorphism 2 (ALAD2) and peptide transporter haplotype 2*2 (hPEPT2*2) through different pathways can increase brain levels of delta-aminolevulinic acid and are associated with higher blood lead burden in young children. Past child and adult findings regarding ALAD2 and neurobehavior have been inconsistent, and the possible association of hPEPT2*2 and neurobehavior has not yet been examined. Mean blood lead level (BLL), genotype, and neurobehavioral function (fine motor dexterity, working memory, visual attention and short-term memory) were assessed in 206 males and 215 females ages 5.1-11.8years. Ninety-six percent of children had BLLs<5.0μg/dl. After adjusting for covariates (sex, age and mother's level of education) and sibling exclusion (N=252), generalized linear mixed model analyses showed opposite effects for the ALAD2 and hPEPT2*2 genetic variants. Significant effects for ALAD2 were observed only as interactions with BLL and the results suggested that ALAD2 was neuroprotective. As BLL increased, ALAD2 was associated with enhanced visual attention and enhanced working memory (fewer commission errors). Independent of BLL, hPEPT2*2 predicted poorer motor dexterity and poorer working memory (more commission errors). BLL alone predicted poorer working memory from increased omission errors. The findings provided further substantiation that (independent of the genetic variants examined) lowest-level lead exposure disrupted early neurobehavioral function, and suggested that common genetic variants alter the neurotoxic potential of low-level lead. ALAD2 and hPEPT2*2 may be valuable markers of risk, and indicate novel mechanisms of lead-induced neurotoxicity. Longitudinal studies are needed to examine long-term influences of these genetic variants on neurobehavior. Copyright © 2014 Elsevier Inc. All rights reserved.
Phylogeny and polymorphism in the long control regions E6, E7, and L1 of HPV Type 56 in women from southwest China

PubMed Central

Jing, Yaling; Wang, Tao; Chen, Zuyi; Ding, Xianping; Xu, Jianju; Mu, Xuemei; Cao, Man; Chen, Honghan

2018-01-01

Globally, human papillomavirus (HPV)-56 accounts for a small proportion of all high-risk HPV types; however, HPV-56 is detected at a higher rate in Asia, particularly in southwest China. The present study analyzed polymorphisms, intratypic variants, and genetic variability in the long control regions (LCR), E6, E7, and L1 of HPV-56 (n=75). The LCRs, E6, E7 and L1 were sequenced using a polymerase chain reaction and the sequences were submitted to GenBank. Maximum-likelihood trees were constructed using Kimura's two-parameter model, followed by secondary structure analysis and protein damaging prediction. Additionally, in order to assess the effect of variations in the LCR on putative binding sites for cellular proteins, MATCH server was used. Finally, the selection pressures of the E6-E7 and L1 genes were estimated. A total of 18 point substitutions, a 42-bp deletion and a 19-bp deletion of LCR were identified. Some of those mutations are embedded in the putative binding sites for transcription factors. 18 single nucleotide changes occurred in the E6-E7 sequence, 11/18 were non-synonymous substitutions and 7/18 were synonymous mutations. A total 24 single nucleotide changes were identified in the L1 sequence, 6/24 being non-synonymous mutations and 18/24 synonymous mutations. Selective pressure analysis predicted that the majority of mutations of HPV-56 E6, E7 and L1 were of positive selection. The phylogenetic tree demonstrated that the isolates distributed in two lineages. Data on the prevalence and genetic variation of HPV-56 types in southwest China may aid future studies on viral molecular mechanisms and contribute to future investigations of diagnostic probes and therapeutic vaccines. PMID:29568922
Incorporating Single-nucleotide Polymorphisms Into the Lyman Model to Improve Prediction of Radiation Pneumonitis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tucker, Susan L., E-mail: sltucker@mdanderson.org; Li Minghuan; Xu Ting

2013-01-01

Purpose: To determine whether single-nucleotide polymorphisms (SNPs) in genes associated with DNA repair, cell cycle, transforming growth factor-{beta}, tumor necrosis factor and receptor, folic acid metabolism, and angiogenesis can significantly improve the fit of the Lyman-Kutcher-Burman (LKB) normal-tissue complication probability (NTCP) model of radiation pneumonitis (RP) risk among patients with non-small cell lung cancer (NSCLC). Methods and Materials: Sixteen SNPs from 10 different genes (XRCC1, XRCC3, APEX1, MDM2, TGF{beta}, TNF{alpha}, TNFR, MTHFR, MTRR, and VEGF) were genotyped in 141 NSCLC patients treated with definitive radiation therapy, with or without chemotherapy. The LKB model was used to estimate the risk ofmore » severe (grade {>=}3) RP as a function of mean lung dose (MLD), with SNPs and patient smoking status incorporated into the model as dose-modifying factors. Multivariate analyses were performed by adding significant factors to the MLD model in a forward stepwise procedure, with significance assessed using the likelihood-ratio test. Bootstrap analyses were used to assess the reproducibility of results under variations in the data. Results: Five SNPs were selected for inclusion in the multivariate NTCP model based on MLD alone. SNPs associated with an increased risk of severe RP were in genes for TGF{beta}, VEGF, TNF{alpha}, XRCC1 and APEX1. With smoking status included in the multivariate model, the SNPs significantly associated with increased risk of RP were in genes for TGF{beta}, VEGF, and XRCC3. Bootstrap analyses selected a median of 4 SNPs per model fit, with the 6 genes listed above selected most often. Conclusions: This study provides evidence that SNPs can significantly improve the predictive ability of the Lyman MLD model. With a small number of SNPs, it was possible to distinguish cohorts with >50% risk vs <10% risk of RP when they were exposed to high MLDs.« less
Structured oligonucleotides for target indexing to allow single-vessel PCR amplification and solid support microarray hybridization.

PubMed

Girard, Laurie D; Boissinot, Karel; Peytavi, Régis; Boissinot, Maurice; Bergeron, Michel G

2015-02-07

The combination of molecular diagnostic technologies is increasingly used to overcome limitations on sensitivity, specificity or multiplexing capabilities, and provide efficient lab-on-chip devices. Two such techniques, PCR amplification and microarray hybridization are used serially to take advantage of the high sensitivity and specificity of the former combined with high multiplexing capacities of the latter. These methods are usually performed in different buffers and reaction chambers. However, these elaborate methods have high complexity and cost related to reagent requirements, liquid storage and the number of reaction chambers to integrate into automated devices. Furthermore, microarray hybridizations have a sequence dependent efficiency not always predictable. In this work, we have developed the concept of a structured oligonucleotide probe which is activated by cleavage from polymerase exonuclease activity. This technology is called SCISSOHR for Structured Cleavage Induced Single-Stranded Oligonucleotide Hybridization Reaction. The SCISSOHR probes enable indexing the target sequence to a tag sequence. The SCISSOHR technology also allows the combination of nucleic acid amplification and microarray hybridization in a single vessel in presence of the PCR buffer only. The SCISSOHR technology uses an amplification probe that is irreversibly modified in presence of the target, releasing a single-stranded DNA tag for microarray hybridization. Each tag is composed of a 3-nucleotide sequence-dependent segment and a unique "target sequence-independent" 14-nucleotide segment allowing for optimal hybridization with minimal cross-hybridization. We evaluated the performance of five (5) PCR buffers to support microarray hybridization, compared to a conventional hybridization buffer. Finally, as a proof of concept, we developed a multiplexed assay for the amplification, detection, and identification of three (3) DNA targets. This new technology will facilitate the design of lab-on-chip microfluidic devices, while also reducing consumable costs. At term, it will allow the cost-effective automation of highly multiplexed assays for detection and identification of genetic targets.
Transcriptional fidelities of human mitochondrial POLRMT, yeast mitochondrial Rpo41, and phage T7 single-subunit RNA polymerases.

PubMed

Sultana, Shemaila; Solotchi, Mihai; Ramachandran, Aparna; Patel, Smita S

2017-11-03

Single-subunit RNA polymerases (RNAPs) are present in phage T7 and in mitochondria of all eukaryotes. This RNAP class plays important roles in biotechnology and cellular energy production, but we know little about its fidelity and error rates. Herein, we report the error rates of three single-subunit RNAPs measured from the catalytic efficiencies of correct and all possible incorrect nucleotides. The average error rates of T7 RNAP (2 × 10 -6 ), yeast mitochondrial Rpo41 (6 × 10 -6 ), and human mitochondrial POLRMT (RNA polymerase mitochondrial) (2 × 10 -5 ) indicate high accuracy/fidelity of RNA synthesis resembling those of replicative DNA polymerases. All three RNAPs exhibit a distinctly high propensity for GTP misincorporation opposite dT, predicting frequent A→G errors in RNA with rates of ∼10 -4 The A→C, G→A, A→U, C→U, G→U, U→C, and U→G errors mostly due to pyrimidine-purine mismatches were relatively frequent (10 -5 -10 -6 ), whereas C→G, U→A, G→C, and C→A errors from purine-purine and pyrimidine-pyrimidine mismatches were rare (10 -7 -10 -10 ). POLRMT also shows a high C→A error rate on 8-oxo-dG templates (∼10 -4 ). Strikingly, POLRMT shows a high mutagenic bypass rate, which is exacerbated by TEFM (transcription elongation factor mitochondrial). The lifetime of POLRMT on terminally mismatched elongation substrate is increased in the presence of TEFM, which allows POLRMT to efficiently bypass the error and continue with transcription. This investigation of nucleotide selectivity on normal and oxidatively damaged DNA by three single-subunit RNAPs provides the basic information to understand the error rates in mitochondria and, in the case of T7 RNAP, to assess the quality of in vitro transcribed RNAs. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Demonstration of protein-based human identification using the hair shaft proteome [Protein-based human identification: A proof of concept using the hair shaft proteome

DOE PAGES

Parker, Glendon J.; Leppert, Tami; Anex, Deon S.; ...

2016-09-07

Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 singlemore » nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). Furthermore, this study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts.« less
Association of α-, β-, and γ-Synuclein With Diffuse Lewy Body Disease

PubMed Central

Nishioka, Kenya; Wider, Christian; Vilariño-Güell, Carles; Soto-Ortolaza, Alexandra I.; Lincoln, Sarah J.; Kachergus, Jennifer M.; Jasinska-Myga, Barbara; Ross, Owen A.; Rajput, Alex; Robinson, Christopher A.; Ferman, Tanis J.; Wszolek, Zbigniew K.; Dickson, Dennis W.; Farrer, Matthew J.

2016-01-01

Objective To determine the association of the genes that encode α-, β-, and γ-synuclein (SNCA, SNCB, and SNCG, respectively) with diffuse Lewy body disease (DLBD). Design Case-control study. Subjects A total of 172 patients with DLBD consistent with a clinical diagnosis of Parkinson disease dementia/dementia with Lewy bodies and 350 clinically and 97 pathologically normal controls. Interventions Sequencing of SNCA, SNCB, and SNCG and genotyping of single-nucleotide polymorphisms performed on an Applied Biosystems capillary sequencer and a Sequenom MassArray pLEX platform, respectively. Associations were determined using χ2 or Fisher exact tests. Results Initial sequencing studies of the coding regions of each gene in 89 patients with DLBD did not detect any pathogenic substitutions. Nevertheless, genotyping of known polymorphic variability in sequence-conserved regions detected several single-nucleotide polymorphisms in the SNCA and SNCG genes that were significantly associated with disease (P=.05 to <.001). Significant association was also observed for 3 single-nucleotide polymorphisms located in SNCB when comparing DLBD cases and pathologically confirmed normal controls (P=.03-.01); however, this association was not significant for the clinical controls alone or the combined clinical and pathological controls (P>.05). After correction for multiple testing, only 1 single-nucleotide polymorphism in SNCG (rs3750823) remained significant in all of the analyses (P=.05-.009). Conclusion These findings suggest that variants in all 3 members of the synuclein gene family, particularly SNCA and SNCG, affect the risk of developing DLBD and warrant further investigation in larger, pathologically defined data sets as well as clinically diagnosed Parkinson disease/dementia with Lewy bodies case-control series. PMID:20697047
Refactoring the Genetic Code for Increased Evolvability

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pines, Gur; Winkler, James D.; Pines, Assaf

ABSTRACT The standard genetic code is robust to mutations during transcription and translation. Point mutations are likely to be synonymous or to preserve the chemical properties of the original amino acid. Saturation mutagenesis experiments suggest that in some cases the best-performing mutant requires replacement of more than a single nucleotide within a codon. These replacements are essentially inaccessible to common error-based laboratory engineering techniques that alter a single nucleotide per mutation event, due to the extreme rarity of adjacent mutations. In this theoretical study, we suggest a radical reordering of the genetic code that maximizes the mutagenic potential of singlemore » nucleotide replacements. We explore several possible genetic codes that allow a greater degree of accessibility to the mutational landscape and may result in a hyperevolvable organism that could serve as an ideal platform for directed evolution experiments. We then conclude by evaluating the challenges of constructing such recoded organisms and their potential applications within the field of synthetic biology. IMPORTANCE The conservative nature of the genetic code prevents bioengineers from efficiently accessing the full mutational landscape of a gene via common error-prone methods. Here, we present two computational approaches to generate alternative genetic codes with increased accessibility. These new codes allow mutational transitions to a larger pool of amino acids and with a greater extent of chemical differences, based on a single nucleotide replacement within the codon, thus increasing evolvability both at the single-gene and at the genome levels. Given the widespread use of these techniques for strain and protein improvement, along with more fundamental evolutionary biology questions, the use of recoded organisms that maximize evolvability should significantly improve the efficiency of directed evolution, library generation, and fitness maximization.« less

Demonstration of protein-based human identification using the hair shaft proteome [Protein-based human identification: A proof of concept using the hair shaft proteome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Parker, Glendon J.; Leppert, Tami; Anex, Deon S.

Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 singlemore » nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). Furthermore, this study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts.« less
Refactoring the Genetic Code for Increased Evolvability

DOE PAGES

Pines, Gur; Winkler, James D.; Pines, Assaf; ...

2017-11-14

ABSTRACT The standard genetic code is robust to mutations during transcription and translation. Point mutations are likely to be synonymous or to preserve the chemical properties of the original amino acid. Saturation mutagenesis experiments suggest that in some cases the best-performing mutant requires replacement of more than a single nucleotide within a codon. These replacements are essentially inaccessible to common error-based laboratory engineering techniques that alter a single nucleotide per mutation event, due to the extreme rarity of adjacent mutations. In this theoretical study, we suggest a radical reordering of the genetic code that maximizes the mutagenic potential of singlemore » nucleotide replacements. We explore several possible genetic codes that allow a greater degree of accessibility to the mutational landscape and may result in a hyperevolvable organism that could serve as an ideal platform for directed evolution experiments. We then conclude by evaluating the challenges of constructing such recoded organisms and their potential applications within the field of synthetic biology. IMPORTANCE The conservative nature of the genetic code prevents bioengineers from efficiently accessing the full mutational landscape of a gene via common error-prone methods. Here, we present two computational approaches to generate alternative genetic codes with increased accessibility. These new codes allow mutational transitions to a larger pool of amino acids and with a greater extent of chemical differences, based on a single nucleotide replacement within the codon, thus increasing evolvability both at the single-gene and at the genome levels. Given the widespread use of these techniques for strain and protein improvement, along with more fundamental evolutionary biology questions, the use of recoded organisms that maximize evolvability should significantly improve the efficiency of directed evolution, library generation, and fitness maximization.« less
Transient state kinetics of transcription elongation by T7 RNA polymerase.

PubMed

Anand, Vasanti Subramanian; Patel, Smita S

2006-11-24

The single subunit DNA-dependent RNA polymerase (RNAP) from bacteriophage T7 catalyzes both promoter-dependent transcription initiation and promoter-independent elongation. Using a promoter-free substrate, we have dissected the kinetic pathway of single nucleotide incorporation during elongation. We show that T7 RNAP undergoes a slow conformational change (0.01-0.03 s(-1)) to form an elongation competent complex with the promoter-free substrate (dissociation constant (Kd) of 96 nM). The complex binds to a correct NTP (Kd of 80 microM) and incorporates the nucleoside monophosphate (NMP) into RNA primer very efficiently (220 s(-1) at 25 degrees C). An overall free energy change (-5.5 kcal/mol) and internal free energy change (-3.7 kcal/mol) of single NMP incorporation was calculated from the measured equilibrium constants. In the presence of inorganic pyrophosphate (PPi), the elongation complex catalyzes the reverse pyrophosphorolysis reaction at a maximum rate of 0.8 s(-1) with PPi Kd of 1.2 mM. Several experiments were designed to investigate the rate-limiting step in the pathway of single nucleotide addition. Acid-quench and pulse-chase kinetics indicated that an isomerization step before chemistry is rate-limiting. The very similar rate constants of sequential incorporation of two nucleotides indicated that the steps after chemistry are fast. Based on available data, we propose that the preinsertion to insertion isomerization of NTP observed in the crystallographic studies of T7 RNAP is a likely candidate for the rate-limiting step. The studies here provide a kinetic framework to investigate structure-function and fidelity of RNA synthesis and to further explore the role of the conformational change in nucleotide selection during RNA synthesis.
Competing targets of microRNA-608 affect anxiety and hypertension

PubMed Central

Hanin, Geula; Shenhar-Tsarfaty, Shani; Yayon, Nadav; Hoe, Yau Yin; Bennett, Estelle R.; Sklan, Ella H.; Rao, Dabeeru. C.; Rankinen, Tuomo; Bouchard, Claude; Geifman-Shochat, Susana; Shifman, Sagiv; Greenberg, David S.; Soreq, Hermona

2014-01-01

MicroRNAs (miRNAs) can repress multiple targets, but how a single de-balanced interaction affects others remained unclear. We found that changing a single miRNA–target interaction can simultaneously affect multiple other miRNA–target interactions and modify physiological phenotype. We show that miR-608 targets acetylcholinesterase (AChE) and demonstrate weakened miR-608 interaction with the rs17228616 AChE allele having a single-nucleotide polymorphism (SNP) in the 3′-untranslated region (3′UTR). In cultured cells, this weakened interaction potentiated miR-608-mediated suppression of other targets, including CDC42 and interleukin-6 (IL6). Postmortem human cortices homozygote for the minor rs17228616 allele showed AChE elevation and CDC42/IL6 decreases compared with major allele homozygotes. Additionally, minor allele heterozygote and homozygote subjects showed reduced cortisol and elevated blood pressure, predicting risk of anxiety and hypertension. Parallel suppression of the conserved brain CDC42 activity by intracerebroventricular ML141 injection caused acute anxiety in mice. We demonstrate that SNPs in miRNA-binding regions could cause expanded downstream effects changing important biological pathways. PMID:24722204
Characteristics of allelic gene expression in human brain cells from single-cell RNA-seq data analysis.

PubMed

Zhao, Dejian; Lin, Mingyan; Pedrosa, Erika; Lachman, Herbert M; Zheng, Deyou

2017-11-10

Monoallelic expression of autosomal genes has been implicated in human psychiatric disorders. However, there is a paucity of allelic expression studies in human brain cells at the single cell and genome wide levels. In this report, we reanalyzed a previously published single-cell RNA-seq dataset from several postmortem human brains and observed pervasive monoallelic expression in individual cells, largely in a random manner. Examining single nucleotide variants with a predicted functional disruption, we found that the "damaged" alleles were overall expressed in fewer brain cells than their counterparts, and at a lower level in cells where their expression was detected. We also identified many brain cell type-specific monoallelically expressed genes. Interestingly, many of these cell type-specific monoallelically expressed genes were enriched for functions important for those brain cell types. In addition, function analysis showed that genes displaying monoallelic expression and correlated expression across neuronal cells from different individual brains were implicated in the regulation of synaptic function. Our findings suggest that monoallelic gene expression is prevalent in human brain cells, which may play a role in generating cellular identity and neuronal diversity and thus increasing the complexity and diversity of brain cell functions.
The use of genomic information increases the accuracy of breeding value predictions for sea louse (Caligus rogercresseyi) resistance in Atlantic salmon (Salmo salar).

PubMed

Correa, Katharina; Bangera, Rama; Figueroa, René; Lhorente, Jean P; Yáñez, José M

2017-01-31

Sea lice infestations caused by Caligus rogercresseyi are a main concern to the salmon farming industry due to associated economic losses. Resistance to this parasite was shown to have low to moderate genetic variation and its genetic architecture was suggested to be polygenic. The aim of this study was to compare accuracies of breeding value predictions obtained with pedigree-based best linear unbiased prediction (P-BLUP) methodology against different genomic prediction approaches: genomic BLUP (G-BLUP), Bayesian Lasso, and Bayes C. To achieve this, 2404 individuals from 118 families were measured for C. rogercresseyi count after a challenge and genotyped using 37 K single nucleotide polymorphisms. Accuracies were assessed using fivefold cross-validation and SNP densities of 0.5, 1, 5, 10, 25 and 37 K. Accuracy of genomic predictions increased with increasing SNP density and was higher than pedigree-based BLUP predictions by up to 22%. Both Bayesian and G-BLUP methods can predict breeding values with higher accuracies than pedigree-based BLUP, however, G-BLUP may be the preferred method because of reduced computation time and ease of implementation. A relatively low marker density (i.e. 10 K) is sufficient for maximal increase in accuracy when using G-BLUP or Bayesian methods for genomic prediction of C. rogercresseyi resistance in Atlantic salmon.
Candida kantuleensis sp. nov., a d-xylose-fermenting yeast species isolated from peat in a tropical peat swamp forest.

PubMed

Nitiyon, Sukanya; Khunnamwong, Pannida; Lertwattanasakul, Noppon; Limtong, Savitree

2018-05-24

Three strains (DMKU-XE11 T , DMKU-XE15 and DMKU-XE20) representing a single novel anamorphic and d-xylose-fermenting yeast species were obtained from three peat samples collected from Khan Thulee peat swamp forest in Surat Thani province, Thailand. The strains differed from each other by one to two nucleotide substitutions in the sequences of the D1/D2 region of the large subunit (LSU) rRNA gene and zero to one nucleotide substitution in the internal transcribed spacer (ITS) region. Phylogenetic analysis based on the combined sequences of the ITS and the D1/D2 regions showed that the three strains represented a single Candida species that was distinct from the other related species in the Lodderomyces/Candida albicans clade. The three strains form a subclade with the other Candida species including Candida sanyaensis, Candida tropicalis and Candida sojae. C. sanyaensis was the most closely related species, with 2.1-2.4 % nucleotide substitutions in the D1/D2 region of the LSU rRNA gene, and 3.8-4.0 % nucleotide substitutions in the ITS region. The three strains (DMKU-XE11 T , DMKU-XE15 and DMKU-XE20) were assigned as a single novel species, which was named Candida kantuleensis sp. nov. The type strain is DMKU-XE11 T (=CBS 15219 T =TBRC 7764 T ). The MycoBank number for C. kantuleensis sp. nov. is MB 824179.
IL-TIF/IL-22: genomic organization and mapping of the human and mouse genes.

PubMed

Dumoutier, L; Van Roost, E; Ameye, G; Michaux, L; Renauld, J C

2000-12-01

IL-TIF is a new cytokine originally identified as a gene induced by IL-9 in murine T lymphocytes, and showing 22% amino acid identity with IL-10. Here, we report the sequence and organization of the mouse and human IL-TIF genes, which both consist of 6 exons spreading over approximately 6 Kb. The IL-TIF gene is a single copy gene in humans, and is located on chromosome 12q15, at 90 Kb from the IFN gamma gene, and at 27 Kb from the AK155 gene, which codes for another IL-10-related cytokine. In the mouse, the IL-TIF gene is located on chromosome 10, also in the same region as the IFN gamma gene. Although it is a single copy gene in BALB/c and DBA/2 mice, the IL-TIF gene is duplicated in other strains such as C57Bl/6, FVB and 129. The two copies, which show 98% nucleotide identity in the coding region, were named IL-TIF alpha and IL-TIF beta. Beside single nucleotide variations, they differ by a 658 nucleotide deletion in IL-TIF beta, including the first non-coding exon and 603 nucleotides from the promoter. A DNA fragment corresponding to this deletion was sufficient to confer IL-9-regulated expression of a luciferase reporter plasmid, suggesting that the IL-TIF beta gene is either differentially regulated, or not expressed at all.
Single nucleotide polymorphism-specific regulation of matrix metalloproteinase-9 by multiple miRNAs targeting the coding exon

PubMed Central

Duellman, Tyler; Warren, Christopher; Yang, Jay

2014-01-01

Microribonucleic acids (miRNAs) work with exquisite specificity and are able to distinguish a target from a non-target based on a single nucleotide mismatch in the core nucleotide domain. We questioned whether miRNA regulation of gene expression could occur in a single nucleotide polymorphism (SNP)-specific manner, manifesting as a post-transcriptional control of expression of genetic polymorphisms. In our recent study of the functional consequences of matrix metalloproteinase (MMP)-9 SNPs, we discovered that expression of a coding exon SNP in the pro-domain of the protein resulted in a profound decrease in the secreted protein. This missense SNP results in the N38S amino acid change and a loss of an N-glycosylation site. A systematic study demonstrated that the loss of secreted protein was due not to the loss of an N-glycosylation site, but rather an SNP-specific targeting by miR-671-3p and miR-657. Bioinformatics analysis identified 41 SNP-specific miRNA targeting MMP-9 SNPs, mostly in the coding exon and an extension of the analysis to chromosome 20, where the MMP-9 gene is located, suggesting that SNP-specific miRNAs targeting the coding exon are prevalent. This selective post-transcriptional regulation of a target messenger RNA harboring genetic polymorphisms by miRNAs offers an SNP-dependent post-transcriptional regulatory mechanism, allowing for polymorphic-specific differential gene regulation. PMID:24627221
25 years and still going strong: 2'-O-(pyren-1-yl)methylribonucleotides - versatile building blocks for applications in molecular biology, diagnostics and materials science.

PubMed

Hrdlicka, Patrick J; Karmakar, Saswata

2017-11-29

Oligonucleotides (ONs) modified with 2'-O-(pyren-1-yl)methylribonucleotides have been explored for a range of applications in molecular biology, nucleic acid diagnostics, and materials science for more than 25 years. The first part of this review provides an overview of synthetic strategies toward 2'-O-(pyren-1-yl)methylribonucleotides and is followed by a summary of biophysical properties of nucleic acid duplexes modified with these building blocks. Insights from structural studies are then presented to rationalize the reported properties. In the second part, applications of ONs modified with 2'-O-(pyren-1-yl)methyl-RNA monomers are reviewed, which include detection of RNA targets, discrimination of single nucleotide polymorphisms, formation of self-assembled pyrene arrays on nucleic acid scaffolds, the study of charge transfer phenomena in nucleic acid duplexes, and sequence-unrestricted recognition of double-stranded DNA. The predictable binding mode of the pyrene moiety, coupled with the microenvironment-dependent properties and synthetic feasibility, render 2'-O-(pyren-1-yl)methyl-RNA monomers as a promising class of pyrene-functionalized nucleotide building blocks for new applications in molecular biology, nucleic acid diagnostics, and materials science.
Evaluation of targeted exome sequencing for 28 protein-based blood group systems, including the homologous gene systems, for blood group genotyping.

PubMed

Schoeman, Elizna M; Lopez, Genghis H; McGowan, Eunike C; Millard, Glenda M; O'Brien, Helen; Roulis, Eileen V; Liew, Yew-Wah; Martin, Jacqueline R; McGrath, Kelli A; Powley, Tanya; Flower, Robert L; Hyland, Catherine A

2017-04-01

Blood group single nucleotide polymorphism genotyping probes for a limited range of polymorphisms. This study investigated whether massively parallel sequencing (also known as next-generation sequencing), with a targeted exome strategy, provides an extended blood group genotype and the extent to which massively parallel sequencing correctly genotypes in homologous gene systems, such as RH and MNS. Donor samples (n = 28) that were extensively phenotyped and genotyped using single nucleotide polymorphism typing, were analyzed using the TruSight One Sequencing Panel and MiSeq platform. Genes for 28 protein-based blood group systems, GATA1, and KLF1 were analyzed. Copy number variation analysis was used to characterize complex structural variants in the GYPC and RH systems. The average sequencing depth per target region was 66.2 ± 39.8. Each sample harbored on average 43 ± 9 variants, of which 10 ± 3 were used for genotyping. For the 28 samples, massively parallel sequencing variant sequences correctly matched expected sequences based on single nucleotide polymorphism genotyping data. Copy number variation analysis defined the Rh C/c alleles and complex RHD hybrids. Hybrid RHD*D-CE-D variants were correctly identified, but copy number variation analysis did not confidently distinguish between D and CE exon deletion versus rearrangement. The targeted exome sequencing strategy employed extended the range of blood group genotypes detected compared with single nucleotide polymorphism typing. This single-test format included detection of complex MNS hybrid cases and, with copy number variation analysis, defined RH hybrid genes along with the RHCE*C allele hitherto difficult to resolve by variant detection. The approach is economical compared with whole-genome sequencing and is suitable for a red blood cell reference laboratory setting. © 2017 AABB.
Non-additive Effects in Genomic Selection

PubMed Central

Varona, Luis; Legarra, Andres; Toro, Miguel A.; Vitezica, Zulma G.

2018-01-01

In the last decade, genomic selection has become a standard in the genetic evaluation of livestock populations. However, most procedures for the implementation of genomic selection only consider the additive effects associated with SNP (Single Nucleotide Polymorphism) markers used to calculate the prediction of the breeding values of candidates for selection. Nevertheless, the availability of estimates of non-additive effects is of interest because: (i) they contribute to an increase in the accuracy of the prediction of breeding values and the genetic response; (ii) they allow the definition of mate allocation procedures between candidates for selection; and (iii) they can be used to enhance non-additive genetic variation through the definition of appropriate crossbreeding or purebred breeding schemes. This study presents a review of methods for the incorporation of non-additive genetic effects into genomic selection procedures and their potential applications in the prediction of future performance, mate allocation, crossbreeding, and purebred selection. The work concludes with a brief outline of some ideas for future lines of that may help the standard inclusion of non-additive effects in genomic selection. PMID:29559995
Non-additive Effects in Genomic Selection.

PubMed

Varona, Luis; Legarra, Andres; Toro, Miguel A; Vitezica, Zulma G

2018-01-01

In the last decade, genomic selection has become a standard in the genetic evaluation of livestock populations. However, most procedures for the implementation of genomic selection only consider the additive effects associated with SNP (Single Nucleotide Polymorphism) markers used to calculate the prediction of the breeding values of candidates for selection. Nevertheless, the availability of estimates of non-additive effects is of interest because: (i) they contribute to an increase in the accuracy of the prediction of breeding values and the genetic response; (ii) they allow the definition of mate allocation procedures between candidates for selection; and (iii) they can be used to enhance non-additive genetic variation through the definition of appropriate crossbreeding or purebred breeding schemes. This study presents a review of methods for the incorporation of non-additive genetic effects into genomic selection procedures and their potential applications in the prediction of future performance, mate allocation, crossbreeding, and purebred selection. The work concludes with a brief outline of some ideas for future lines of that may help the standard inclusion of non-additive effects in genomic selection.
Observational study to calculate addictive risk to opioids: a validation study of a predictive algorithm to evaluate opioid use disorder

PubMed Central

Brenton, Ashley; Richeimer, Steven; Sharma, Maneesh; Lee, Chee; Kantorovich, Svetlana; Blanchard, John; Meshkin, Brian

2017-01-01

Background Opioid abuse in chronic pain patients is a major public health issue, with rapidly increasing addiction rates and deaths from unintentional overdose more than quadrupling since 1999. Purpose This study seeks to determine the predictability of aberrant behavior to opioids using a comprehensive scoring algorithm incorporating phenotypic risk factors and neuroscience-associated single-nucleotide polymorphisms (SNPs). Patients and methods The Proove Opioid Risk (POR) algorithm determines the predictability of aberrant behavior to opioids using a comprehensive scoring algorithm incorporating phenotypic risk factors and neuroscience-associated SNPs. In a validation study with 258 subjects with diagnosed opioid use disorder (OUD) and 650 controls who reported using opioids, the POR successfully categorized patients at high and moderate risks of opioid misuse or abuse with 95.7% sensitivity. Regardless of changes in the prevalence of opioid misuse or abuse, the sensitivity of POR remained >95%. Conclusion The POR correctly stratifies patients into low-, moderate-, and high-risk categories to appropriately identify patients at need for additional guidance, monitoring, or treatment changes. PMID:28572737
A comprehensive study of small non-frameshift insertions/deletions in proteins and prediction of their phenotypic effects by a machine learning method (KD4i)

PubMed Central

2014-01-01

Background Small insertion and deletion polymorphisms (Indels) are the second most common mutations in the human genome, after Single Nucleotide Polymorphisms (SNPs). Recent studies have shown that they have significant influence on genetic variation by altering human traits and can cause multiple human diseases. In particular, many Indels that occur in protein coding regions are known to impact the structure or function of the protein. A major challenge is to predict the effects of these Indels and to distinguish between deleterious and neutral variants. When an Indel occurs within a coding region, it can be either frameshifting (FS) or non-frameshifting (NFS). FS-Indels either modify the complete C-terminal region of the protein or result in premature termination of translation. NFS-Indels insert/delete multiples of three nucleotides leading to the insertion/deletion of one or more amino acids. Results In order to study the relationships between NFS-Indels and Mendelian diseases, we characterized NFS-Indels according to numerous structural, functional and evolutionary parameters. We then used these parameters to identify specific characteristics of disease-causing and neutral NFS-Indels. Finally, we developed a new machine learning approach, KD4i, that can be used to predict the phenotypic effects of NFS-Indels. Conclusions We demonstrate in a large-scale evaluation that the accuracy of KD4i is comparable to existing state-of-the-art methods. However, a major advantage of our approach is that we also provide the reasons for the predictions, in the form of a set of rules. The rules are interpretable by non-expert humans and they thus represent new knowledge about the relationships between the genotype and phenotypes of NFS-Indels and the causative molecular perturbations that result in the disease. PMID:24742296
Binding and Translocation of Termination Factor Rho Studied at the Single-Molecule Level

PubMed Central

Koslover, Daniel J.; Fazal, Furqan M.; Mooney, Rachel A.; Landick, Robert; Block, Steven M.

2012-01-01

Rho termination factor is an essential hexameric helicase responsible for terminating 20–50% of all mRNA synthesis in E. coli. We used single- molecule force spectroscopy to investigate Rho-RNA binding interactions at the Rho- utilization (rut) site of the ? tR1 terminator. Our results are consistent with Rho complexes adopting two states, one that binds 57 ±2 nucleotides of RNA across all six of the Rho primary binding sites, and another that binds 85 ±2 nucleotides at the six primary sites plus a single secondary site situated at the center of the hexamer. The single-molecule data serve to establish that Rho translocates 5′-to-3′ towards RNA polymerase (RNAP) by a tethered-tracking mechanism, looping out the intervening RNA between the rut site and RNAP. These findings lead to a general model for Rho binding and translocation, and establish a novel experimental approach that should facilitate additional single- molecule studies of RNA-binding proteins. PMID:22885804
Predictive potential of IL-28B genetic testing for interferon based hepatitis C virus therapy in Pakistan: Current scenario and future perspective.

PubMed

Afzal, Muhammad Sohail

2016-09-18

In Pakistan which ranked second in terms of hepatitis C virus (HCV) infection, it is highly needed to have an established diagnostic test for antiviral therapy response prediction. Interleukin 28B (IL-28B) genetic testing is widely used throughout the world for interferon based therapy prediction for HCV patients and is quite helpful not only for health care workers but also for the patients. There is a strong relationship between single nucleotide polymorphisms at or near the IL-28B gene and the sustained virological response with pegylated interferon plus ribavirin treatment for chronic hepatitis C. Pakistan is a resource limited country, with very low per capita income and there is no proper social security (health insurance) system. The allocated health budget by the government is very low and is used on other health emergencies like polio virus and dengue virus infection. Therefore it is proposed that there should be a well established diagnostic test on the basis of IL-28B which can predict the antiviral therapy response to strengthen health care set-up of Pakistan. This test once established will help in better management of HCV infected patients.
Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots.

PubMed

Hajdin, Christine E; Bellaousov, Stanislav; Huggins, Wayne; Leonard, Christopher W; Mathews, David H; Weeks, Kevin M

2013-04-02

A pseudoknot forms in an RNA when nucleotides in a loop pair with a region outside the helices that close the loop. Pseudoknots occur relatively rarely in RNA but are highly overrepresented in functionally critical motifs in large catalytic RNAs, in riboswitches, and in regulatory elements of viruses. Pseudoknots are usually excluded from RNA structure prediction algorithms. When included, these pairings are difficult to model accurately, especially in large RNAs, because allowing this structure dramatically increases the number of possible incorrect folds and because it is difficult to search the fold space for an optimal structure. We have developed a concise secondary structure modeling approach that combines SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension) experimental chemical probing information and a simple, but robust, energy model for the entropic cost of single pseudoknot formation. Structures are predicted with iterative refinement, using a dynamic programming algorithm. This melded experimental and thermodynamic energy function predicted the secondary structures and the pseudoknots for a set of 21 challenging RNAs of known structure ranging in size from 34 to 530 nt. On average, 93% of known base pairs were predicted, and all pseudoknots in well-folded RNAs were identified.
Evidence for Natural Selection in Nucleotide Content Relationships Based on Complete Mitochondrial Genomes: Strong Effect of Guanine Content on Separation between Terrestrial and Aquatic Vertebrates.

PubMed

Sorimachi, Kenji; Okayasu, Teiji

2015-01-01

The complete vertebrate mitochondrial genome consists of 13 coding genes. We used this genome to investigate the existence of natural selection in vertebrate evolution. From the complete mitochondrial genomes, we predicted nucleotide contents and then separated these values into coding and non-coding regions. When nucleotide contents of a coding or non-coding region were plotted against the nucleotide content of the complete mitochondrial genomes, we obtained linear regression lines only between homonucleotides and their analogs. On every plot using G or A content purine, G content in aquatic vertebrates was higher than that in terrestrial vertebrates, while A content in aquatic vertebrates was lower than that in terrestrial vertebrates. Based on these relationships, vertebrates were separated into two groups, terrestrial and aquatic. However, using C or T content pyrimidine, clear separation between these two groups was not obtained. The hagfish (Eptatretus burgeri) was further separated from both terrestrial and aquatic vertebrates. Based on these results, nucleotide content relationships predicted from the complete vertebrate mitochondrial genomes reveal the existence of natural selection based on evolutionary separation between terrestrial and aquatic vertebrate groups. In addition, we propose that separation of the two groups might be linked to ammonia detoxification based on high G and low A contents, which encode Glu rich and Lys poor proteins.
Molecular population genetics of inversion breakpoint regions in Drosophila pseudoobscura.

PubMed

Wallace, Andre G; Detweiler, Don; Schaeffer, Stephen W

2013-07-08

Paracentric inversions in populations can have a profound effect on the pattern and organization of nucleotide variability along a chromosome. Regions near inversion breakpoints are expected to have greater levels of differentiation because of reduced genetic exchange between different gene arrangements whereas central regions in the inverted segments are predicted to have lower levels of nucleotide differentiation due to greater levels of genetic flux among different karyotypes. We used the inversion polymorphism on the third chromosome of Drosophila pseudoobscura to test these predictions with an analysis of nucleotide diversity of 18 genetic markers near and away from inversion breakpoints. We tested hypotheses about how the presence of different chromosomal arrangements affects the pattern and organization of nucleotide variation. Overall, markers in the distal segment of the chromosome had greater levels of nucleotide heterozygosity than markers within the proximal segment of the chromosome. In addition, our results rejected the hypothesis that the breakpoints of derived inversions will have lower levels of nucleotide variability than breakpoints of ancestral inversions, even when strains with gene conversion events were removed. High levels of linkage disequilibrium were observed within all 11 breakpoint regions as well as between the ends of most proximal and distal breakpoints. The central region of the chromosome had the greatest levels of linkage disequilibrium compared with the proximal and distal regions because this is the region that experiences the highest level of recombination suppression. These data do not fully support the idea that genetic exchange is the sole force that influences genetic variation on inverted chromosomes.

Testing for genetic association taking into account phenotypic information of relatives.

PubMed

Uh, Hae-Won; Wijk, Henk Jan van der; Houwing-Duistermaat, Jeanine J

2009-12-15

We investigated efficient case-control association analysis using family data. The outcome of interest was coronary heart disease. We employed existing and new methods that take into account the correlations among related individuals to obtain the proper type I error rates. The methods considered for autosomal single-nucleotide polymorphisms were: 1) generalized estimating equations-based methods, 2) variance-modified Cochran-Armitage (MCA) trend test incorporating kinship coefficients, and 3) genotypic modified quasi-likelihood score test. Additionally, for X-linked single-nucleotide polymorphisms we proposed a two-degrees-of-freedom test. Performance of these methods was tested using Framingham Heart Study 500 k array data.
Simultaneous genotyping of single-nucleotide polymorphisms in alcoholism-related genes using duplex and triplex allele-specific PCR with two-step thermal cycles.

PubMed

Shirasu, Naoto; Kuroki, Masahide

2014-01-01

We developed a time- and cost-effective multiplex allele-specific polymerase chain reaction (AS-PCR) method based on the two-step PCR thermal cycles for genotyping single-nucleotide polymorphisms in three alcoholism-related genes: alcohol dehydrogenase 1B, aldehyde dehydrogenase 2 and μ-opioid receptor. Applying MightyAmp(®) DNA polymerase with optimized AS-primers and PCR conditions enabled us to achieve effective and selective amplification of the target alleles from alkaline lysates of a human hair root, and simultaneously to determine the genotypes within less than 1.5 h using minimal lab equipment.
Gene-gene, gene-environment, gene-nutrient interactions and single nucleotide polymorphisms of inflammatory cytokines.

PubMed

Nadeem, Amina; Mumtaz, Sadaf; Naveed, Abdul Khaliq; Aslam, Muhammad; Siddiqui, Arif; Lodhi, Ghulam Mustafa; Ahmad, Tausif

2015-05-15

Inflammation plays a significant role in the etiology of type 2 diabetes mellitus (T2DM). The rise in the pro-inflammatory cytokines is the essential step in glucotoxicity and lipotoxicity induced mitochondrial injury, oxidative stress and beta cell apoptosis in T2DM. Among the recognized markers are interleukin (IL)-6, IL-1, IL-10, IL-18, tissue necrosis factor-alpha (TNF-α), C-reactive protein, resistin, adiponectin, tissue plasminogen activator, fibrinogen and heptoglobins. Diabetes mellitus has firm genetic and very strong environmental influence; exhibiting a polygenic mode of inheritance. Many single nucleotide polymorphisms (SNPs) in various genes including those of pro and anti-inflammatory cytokines have been reported as a risk for T2DM. Not all the SNPs have been confirmed by unifying results in different studies and wide variations have been reported in various ethnic groups. The inter-ethnic variations can be explained by the fact that gene expression may be regulated by gene-gene, gene-environment and gene-nutrient interactions. This review highlights the impact of these interactions on determining the role of single nucleotide polymorphism of IL-6, TNF-α, resistin and adiponectin in pathogenesis of T2DM.
Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale.

PubMed

Liu, Siyang; Huang, Shujia; Rao, Junhua; Ye, Weijian; Krogh, Anders; Wang, Jun

2015-01-01

Comprehensive recognition of genomic variation in one individual is important for understanding disease and developing personalized medication and treatment. Many tools based on DNA re-sequencing exist for identification of single nucleotide polymorphisms, small insertions and deletions (indels) as well as large deletions. However, these approaches consistently display a substantial bias against the recovery of complex structural variants and novel sequence in individual genomes and do not provide interpretation information such as the annotation of ancestral state and formation mechanism. We present a novel approach implemented in a single software package, AsmVar, to discover, genotype and characterize different forms of structural variation and novel sequence from population-scale de novo genome assemblies up to nucleotide resolution. Application of AsmVar to several human de novo genome assemblies captures a wide spectrum of structural variants and novel sequences present in the human population in high sensitivity and specificity. Our method provides a direct solution for investigating structural variants and novel sequences from de novo genome assemblies, facilitating the construction of population-scale pan-genomes. Our study also highlights the usefulness of the de novo assembly strategy for definition of genome structure.
Allelic imbalance of multiple sclerosis susceptibility genes IKZF3 and IQGAP1 in human peripheral blood.

PubMed

Keshari, Pankaj K; Harbo, Hanne F; Myhr, Kjell-Morten; Aarseth, Jan H; Bos, Steffan D; Berge, Tone

2016-04-14

Multiple sclerosis is a chronic inflammatory, demyelinating disease of the central nervous system. Recent genome-wide studies have revealed more than 110 single nucleotide polymorphisms as associated with susceptibility to multiple sclerosis, but their functional contribution to disease development is mostly unknown. Consistent allelic imbalance was observed for rs907091 in IKZF3 and rs11609 in IQGAP1, which are in strong linkage disequilibrium with the multiple sclerosis associated single nucleotide polymorphisms rs12946510 and rs8042861, respectively. Using multiple sclerosis patients and healthy controls heterozygous for rs907091 and rs11609, we showed that the multiple sclerosis risk alleles at IKZF3 and IQGAP1 are expressed at higher levels as compared to the protective allele. Furthermore, individuals homozygous for the multiple sclerosis risk allele at IQGAP1 had a significantly higher total expression of IQGAP1 compared to individuals homozygous for the protective allele. Our data indicate a possible regulatory role for the multiple sclerosis-associated IKZF3 and IQGAP1 variants. We suggest that such cis-acting mechanisms may contribute to the multiple sclerosis association of single nucleotide polymorphisms at IKZF3 and IQGAP1.
Using of methods of speckle optics for Chlamydia trachomatis typing

NASA Astrophysics Data System (ADS)

Ulyanov, Sergey S.; Zaytsev, Sergey S.; Ulianova, Onega V.; Saltykov, Yury V.; Feodorova, Valentina A.

2017-03-01

Specific method of transformation of nucleotide of gene into speckle pattern is suggested. Reference speckle pattern of omp1 gene of typical wild strains of Chlamydia trachomatis of genovars D, E, F, G, J and K and Chlamydia psittaci as well is generated. Perspectives of proposed technique in the gene identification and detection of natural genetic mutations as single nucleotide polymorphism (SNP) are demonstrated.
High-resolution genetic map for understanding the effect of genome-wide recombination rate, selection sweep and linkage disequilibrium on nucleotide diversity in watermelon

USDA-ARS?s Scientific Manuscript database

Genotyping by sequencing (GBS) technology was used to identify a set of 9,933 single nucleotide polymorphism (SNP) markers for constructing a high-resolution genetic map of 1,087 cM for watermelon. The genome-wide variation of recombination rate (GWRR) across the map was evaluated and a positive co...
Predicting protein-binding RNA nucleotides with consideration of binding partners.

PubMed

Tuvshinjargal, Narankhuu; Lee, Wook; Park, Byungkyu; Han, Kyungsook

2015-06-01

In recent years several computational methods have been developed to predict RNA-binding sites in protein. Most of these methods do not consider interacting partners of a protein, so they predict the same RNA-binding sites for a given protein sequence even if the protein binds to different RNAs. Unlike the problem of predicting RNA-binding sites in protein, the problem of predicting protein-binding sites in RNA has received little attention mainly because it is much more difficult and shows a lower accuracy on average. In our previous study, we developed a method that predicts protein-binding nucleotides from an RNA sequence. In an effort to improve the prediction accuracy and usefulness of the previous method, we developed a new method that uses both RNA and protein sequence data. In this study, we identified effective features of RNA and protein molecules and developed a new support vector machine (SVM) model to predict protein-binding nucleotides from RNA and protein sequence data. The new model that used both protein and RNA sequence data achieved a sensitivity of 86.5%, a specificity of 86.2%, a positive predictive value (PPV) of 72.6%, a negative predictive value (NPV) of 93.8% and Matthews correlation coefficient (MCC) of 0.69 in a 10-fold cross validation; it achieved a sensitivity of 58.8%, a specificity of 87.4%, a PPV of 65.1%, a NPV of 84.2% and MCC of 0.48 in independent testing. For comparative purpose, we built another prediction model that used RNA sequence data alone and ran it on the same dataset. In a 10 fold-cross validation it achieved a sensitivity of 85.7%, a specificity of 80.5%, a PPV of 67.7%, a NPV of 92.2% and MCC of 0.63; in independent testing it achieved a sensitivity of 67.7%, a specificity of 78.8%, a PPV of 57.6%, a NPV of 85.2% and MCC of 0.45. In both cross-validations and independent testing, the new model that used both RNA and protein sequences showed a better performance than the model that used RNA sequence data alone in most performance measures. To the best of our knowledge, this is the first sequence-based prediction of protein-binding nucleotides in RNA which considers the binding partner of RNA. The new model will provide valuable information for designing biochemical experiments to find putative protein-binding sites in RNA with unknown structure. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Apolipoprotein E genotyping by multiplex tetra-primer amplification refractory mutation system PCR in single reaction tube.

PubMed

Yang, Young Geun; Kim, Jong Yeol; Park, Su Jeong; Kim, Suhng Wook; Jeon, Ok-Hee; Kim, Doo-Sik

2007-08-31

Apolipoprotein E (APOE) plays a critical role in lipoprotein metabolism by binding to both low-density lipoprotein and APOE receptors. The APOE gene has three allelic forms, epsilon2, epsilon3, and epsilon4, which encode different isoforms of the APOE protein. In this study, we have developed a new genotyping method for APOE. Our multiplex tetra-primer amplification refractory mutation system (multiplex T-ARMS) polymerase chain reaction (PCR) was performed in a single reaction tube with six primers consisting of two common primers and two specific primers for each of two single nucleotide polymorphism (SNP) sites. We obtained definitive electropherograms that showed three (epsilon2/epsilon2, epsilon3/epsilon3, and epsilon4/epsilon4), four (epsilon2/epsilon3 and epsilon3/epsilon4), and five (epsilon2/epsilon4) amplicons by multiplex T-ARMS PCR in a single reaction tube. Multiplex T-ARMS PCR for APOE genotyping is a simple and accurate method that requires only a single PCR reaction, without any another treatments or expensive instrumentation, to simultaneously identify two sites of single nucleotide polymorphisms.
cnvScan: a CNV screening and annotation tool to improve the clinical utility of computational CNV prediction from exome sequencing data.

PubMed

Samarakoon, Pubudu Saneth; Sorte, Hanne Sørmo; Stray-Pedersen, Asbjørg; Rødningen, Olaug Kristin; Rognes, Torbjørn; Lyle, Robert

2016-01-14

With advances in next generation sequencing technology and analysis methods, single nucleotide variants (SNVs) and indels can be detected with high sensitivity and specificity in exome sequencing data. Recent studies have demonstrated the ability to detect disease-causing copy number variants (CNVs) in exome sequencing data. However, exonic CNV prediction programs have shown high false positive CNV counts, which is the major limiting factor for the applicability of these programs in clinical studies. We have developed a tool (cnvScan) to improve the clinical utility of computational CNV prediction in exome data. cnvScan can accept input from any CNV prediction program. cnvScan consists of two steps: CNV screening and CNV annotation. CNV screening evaluates CNV prediction using quality scores and refines this using an in-house CNV database, which greatly reduces the false positive rate. The annotation step provides functionally and clinically relevant information using multiple source datasets. We assessed the performance of cnvScan on CNV predictions from five different prediction programs using 64 exomes from Primary Immunodeficiency (PIDD) patients, and identified PIDD-causing CNVs in three individuals from two different families. In summary, cnvScan reduces the time and effort required to detect disease-causing CNVs by reducing the false positive count and providing annotation. This improves the clinical utility of CNV detection in exome data.
Use of support vector machines for disease risk prediction in genome-wide association studies: concerns and opportunities.

PubMed

Mittag, Florian; Büchel, Finja; Saad, Mohamad; Jahn, Andreas; Schulte, Claudia; Bochdanovits, Zoltan; Simón-Sánchez, Javier; Nalls, Mike A; Keller, Margaux; Hernandez, Dena G; Gibbs, J Raphael; Lesage, Suzanne; Brice, Alexis; Heutink, Peter; Martinez, Maria; Wood, Nicholas W; Hardy, John; Singleton, Andrew B; Zell, Andreas; Gasser, Thomas; Sharma, Manu

2012-12-01

The success of genome-wide association studies (GWAS) in deciphering the genetic architecture of complex diseases has fueled the expectations whether the individual risk can also be quantified based on the genetic architecture. So far, disease risk prediction based on top-validated single-nucleotide polymorphisms (SNPs) showed little predictive value. Here, we applied a support vector machine (SVM) to Parkinson disease (PD) and type 1 diabetes (T1D), to show that apart from magnitude of effect size of risk variants, heritability of the disease also plays an important role in disease risk prediction. Furthermore, we performed a simulation study to show the role of uncommon (frequency 1-5%) as well as rare variants (frequency <1%) in disease etiology of complex diseases. Using a cross-validation model, we were able to achieve predictions with an area under the receiver operating characteristic curve (AUC) of ~0.88 for T1D, highlighting the strong heritable component (∼90%). This is in contrast to PD, where we were unable to achieve a satisfactory prediction (AUC ~0.56; heritability ~38%). Our simulations showed that simultaneous inclusion of uncommon and rare variants in GWAS would eventually lead to feasible disease risk prediction for complex diseases such as PD. The used software is available at http://www.ra.cs.uni-tuebingen.de/software/MACLEAPS/. © 2012 Wiley Periodicals, Inc.
2'-Bispyrene-modified 2'-O-methyl RNA probes as useful tools for the detection of RNA: synthesis, fluorescent properties, and duplex stability.

PubMed

Krasheninina, Olga A; Novopashina, Darya S; Lomzov, Alexander A; Venyaminova, Alya G

2014-09-05

The synthesis and properties two series of new 2'-O-methyl RNA probes, each containing a single insertion of a 2'-bispyrenylmethylphosphorodiamidate derivative of a nucleotide (U, C, A, and G), are described. As demonstrated by UV melting studies, the probes form stable complexes with model RNAs and DNAs. Significant increases (up to 21-fold) in pyrene excimer fluorescence intensity were observed upon binding of most of the probes with complementary RNAs, but not with DNAs. The fluorescence spectra are independent of the nature of the modified nucleotides. The nucleotides on the 5'-side of the modified nucleotide have no effect on the fluorescence spectra, whereas the natures of the two nucleotides on the 3'-side are important: CC, CG, and UC dinucleotide units on the 3'-side of the modified nucleotide provide the maximum increases in excimer fluorescence intensity. This study suggests that these 2'-bispyrene-labeled 2'-O-methyl RNA probes might be useful tools for detection of RNAs. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Association between genetic polymorphisms in the XRCC1, XRCC3, XPD, GSTM1, GSTT1, MSH2, MLH1, MSH3, and MGMT genes and radiosensitivity in breast cancer patients.

PubMed

Mangoni, Monica; Bisanzi, Simonetta; Carozzi, Francesca; Sani, Cristina; Biti, Giampaolo; Livi, Lorenzo; Barletta, Emanuela; Costantini, Adele Seniori; Gorini, Giuseppe

2011-09-01

Clinical radiosensitivity varies considerably among patients, and radiation-induced side effects developing in normal tissue can be therapy limiting. Some single nucleotide polymorphisms (SNPs) have been shown to correlate with hypersensitivity to radiotherapy. We conducted a prospective study of 87 female patients with breast cancer who received radiotherapy after breast surgery. We evaluated the association between acute skin reaction following radiotherapy and 11 genetic polymorphisms in DNA repair genes: XRCC1 (Arg399Gln and Arg194Trp), XRCC3 (Thr241Met), XPD (Asp312Asn and Lys751Gln), MSH2 (gIVS12-6T>C), MLH1 (Ile219Val), MSH3 (Ala1045Thr), MGMT (Leu84Phe), and in damage-detoxification GSTM1 and GSTT1 genes (allele deletion). Individual genetic polymorphisms were determined by polymerase chain reaction and single nucleotide primer extension for single nucleotide polymorphisms or by a multiplex polymerase chain reaction assay for deletion polymorphisms. The development of severe acute skin reaction (moist desquamation or interruption of radiotherapy due to toxicity) associated with genetic polymorphisms was modeled using Cox proportional hazards, accounting for cumulative biologically effective radiation dose. Radiosensitivity developed in eight patients and was increased in carriers of variants XRCC3-241Met allele (hazard ratio [HR] unquantifiably high), MSH2 gIVS12-6nt-C allele (HR=53.36; 95% confidence intervals [95% CI], 3.56-798.98), and MSH3-1045Ala allele (HR unquantifiably high). Carriers of XRCC1-Arg194Trp variant allele in combination with XRCC1-Arg399Gln wild-type allele had a significant risk of radiosensitivity (HR=38.26; 95% CI, 1.19-1232.52). To our knowledge, this is the first report to find an association between MSH2 and MSH3 genetic variants and the development of radiosensitivity in breast cancer patients. Our findings suggest the hypothesis that mismatch repair mechanisms may be involved in cellular response to radiotherapy. Genetic polymorphisms may be promising candidates for predicting acute radiosensitivity, but further studies are necessary to confirm our findings. Copyright © 2011 Elsevier Inc. All rights reserved.
Single nucleotide resolution RNA-seq uncovers new regulatory mechanisms in the opportunistic pathogen Streptococcus agalactiae.

PubMed

Rosinski-Chupin, Isabelle; Sauvage, Elisabeth; Sismeiro, Odile; Villain, Adrien; Da Cunha, Violette; Caliot, Marie-Elise; Dillies, Marie-Agnès; Trieu-Cuot, Patrick; Bouloc, Philippe; Lartigue, Marie-Frédérique; Glaser, Philippe

2015-05-30

Streptococcus agalactiae, or Group B Streptococcus, is a leading cause of neonatal infections and an increasing cause of infections in adults with underlying diseases. In an effort to reconstruct the transcriptional networks involved in S. agalactiae physiology and pathogenesis, we performed an extensive and robust characterization of its transcriptome through a combination of differential RNA-sequencing in eight different growth conditions or genetic backgrounds and strand-specific RNA-sequencing. Our study identified 1,210 transcription start sites (TSSs) and 655 transcript ends as well as 39 riboswitches and cis-regulatory regions, 39 cis-antisense non-coding RNAs and 47 small RNAs potentially acting in trans. Among these putative regulatory RNAs, ten were differentially expressed in response to an acid stress and two riboswitches sensed directly or indirectly the pH modification. Strikingly, 15% of the TSSs identified were associated with the incorporation of pseudo-templated nucleotides, showing that reiterative transcription is a pervasive process in S. agalactiae. In particular, 40% of the TSSs upstream genes involved in nucleotide metabolism show reiterative transcription potentially regulating gene expression, as exemplified for pyrG and thyA encoding the CTP synthase and the thymidylate synthase respectively. This comprehensive map of the transcriptome at the single nucleotide resolution led to the discovery of new regulatory mechanisms in S. agalactiae. It also provides the basis for in depth analyses of transcriptional networks in S. agalactiae and of the regulatory role of reiterative transcription following variations of intra-cellular nucleotide pools.
Prevalence of combinatorial CYP2C9 and VKORC1 genotypes in Puerto Ricans: implications for warfarin management in Hispanics.

PubMed

Duconge, Jorge; Cadilla, Carmen L; Windemuth, Andreas; Kocherla, Mohan; Gorowski, Krystyna; Seip, Richard L; Bogaard, Kali; Renta, Jessica Y; Piovanetti, Paola; D'Agostino, Darrin; Santiago-Borrero, Pedro J; Ruaño, Gualberto

2009-01-01

Polymorphisms in the cytochrome P450 2C9 (CYP2C9) and vitamin K epoxide reductase complex subunit 1 (VKORC1) genes significantly alter the effective warfarin dose. We determined the frequencies of alleles, single carriers, and double carriers of single nucleotide polymorphisms (SNPs) in the CYP2C9 and VKORC1 genes in a Puerto Rican cohort and gauged the impact of these polymorphisms on warfarin dosage using a published algorithm. A total of 92 DNA samples were genotyped using Luminex x-MAP technology. The polymorphism frequencies were 6.52%, 5.43% and 28.8% for CYP2C9 *2, *3 and VKORC1-1639 C>A polymorphisms, respectively. The prevalence of combinatorial genotypes was 16% for carriers of both the CYP2C9 and VKORC1 polymorphisms, 9% for carriers of CYP2C9 polymorphisms, 35% for carriers of the VKORC1 polymorphism, and the remaining 40% were non-carriers for either gene. Based on a published warfarin dosing algorithm, single, double and triple carriers of functionally deficient polymorphisms predict reductions of 1.0-1.6, 2.0-2.9, and 2.9-3.7 mg/day, respectively, in warfarin dose. Overall, 60% of the population carried at least a single polymorphism predicting deficient warfarin metabolism or responsiveness and 13% were double carriers with polymorphisms in both genes studied. Combinatorial genotyping of CYP2C9 and VKORC1 can allow for individualized dosing of warfarin among patients with gene polymorphisms, potentially reducing the risk of stroke or bleeding.
Improved nucleic acid descriptors for siRNA efficacy prediction.

PubMed

Sciabola, Simone; Cao, Qing; Orozco, Modesto; Faustino, Ignacio; Stanton, Robert V

2013-02-01

Although considerable progress has been made recently in understanding how gene silencing is mediated by the RNAi pathway, the rational design of effective sequences is still a challenging task. In this article, we demonstrate that including three-dimensional descriptors improved the discrimination between active and inactive small interfering RNAs (siRNAs) in a statistical model. Five descriptor types were used: (i) nucleotide position along the siRNA sequence, (ii) nucleotide composition in terms of presence/absence of specific combinations of di- and trinucleotides, (iii) nucleotide interactions by means of a modified auto- and cross-covariance function, (iv) nucleotide thermodynamic stability derived by the nearest neighbor model representation and (v) nucleic acid structure flexibility. The duplex flexibility descriptors are derived from extended molecular dynamics simulations, which are able to describe the sequence-dependent elastic properties of RNA duplexes, even for non-standard oligonucleotides. The matrix of descriptors was analysed using three statistical packages in R (partial least squares, random forest, and support vector machine), and the most predictive model was implemented in a modeling tool we have made publicly available through SourceForge. Our implementation of new RNA descriptors coupled with appropriate statistical algorithms resulted in improved model performance for the selection of siRNA candidates when compared with publicly available siRNA prediction tools and previously published test sets. Additional validation studies based on in-house RNA interference projects confirmed the robustness of the scoring procedure in prospective studies.
Rose spring dwarf-associated virus has RNA structural and gene-expression features like those of Barley yellow dwarf virus

PubMed Central

Salem, Nida’ M.; Miller, W. Allen; Rowhani, Adib; Golino, Deborah A.; Moyne, Anne-Laure; Falk, Bryce W.

2015-01-01

We determined the complete nucleotide sequence of the Rose spring dwarf-associated virus (RSDaV) genomic RNA (GenBank accession no. EU024678) and compared its predicted RNA structural characteristics affecting gene expression. A cDNA library was derived from RSDaV double-stranded RNAs (dsRNAs) purified from infected tissue. Nucleotide sequence analysis of the cloned cDNAs, plus for clones generated by 5′- and 3′-RACE showed the RSDaV genomic RNA to be 5,808 nucleotides. The genomic RNA contains five major open reading frames (ORFs), and three small ORFs in the 3′-terminal 800 nucleotides, typical for viruses of genus Luteovirus in the family Luteoviridae. Northern blot hybridization analysis revealed the genomic RNA and two prominent subgenomic RNAs of approximately 3 kb and 1 kb. Putative 5′ ends of the sgRNAs were predicted by identification of conserved sequences and secondary structures which resembled the Barley yellow dwarf virus (BYDV) genomic RNA 5′ end and subgenomic RNA promoter sequences. Secondary structures of the BYDV-like ribosomal frameshift elements and cap-independent translation elements, including long-distance base pairing spanning four kb were identified. These contain similarities but also informative differences with the BYDV structures, including a strikingly different structure predicted for the 3′ cap-independent translation element. These analyses of the RSDaV genomic RNA show more complexity for the RNA structural elements for members of the Luteoviridae. PMID:18329064
Rose spring dwarf-associated virus has RNA structural and gene-expression features like those of Barley yellow dwarf virus.

PubMed

Salem, Nida' M; Miller, W Allen; Rowhani, Adib; Golino, Deborah A; Moyne, Anne-Laure; Falk, Bryce W

2008-06-05

We determined the complete nucleotide sequence of the Rose spring dwarf-associated virus (RSDaV) genomic RNA (GenBank accession no. EU024678) and compared its predicted RNA structural characteristics affecting gene expression. A cDNA library was derived from RSDaV double-stranded RNAs (dsRNAs) purified from infected tissue. Nucleotide sequence analysis of the cloned cDNAs, plus for clones generated by 5'- and 3'-RACE showed the RSDaV genomic RNA to be 5808 nucleotides. The genomic RNA contains five major open reading frames (ORFs), and three small ORFs in the 3'-terminal 800 nucleotides, typical for viruses of genus Luteovirus in the family Luteoviridae. Northern blot hybridization analysis revealed the genomic RNA and two prominent subgenomic RNAs of approximately 3 kb and 1 kb. Putative 5' ends of the sgRNAs were predicted by identification of conserved sequences and secondary structures which resembled the Barley yellow dwarf virus (BYDV) genomic RNA 5' end and subgenomic RNA promoter sequences. Secondary structures of the BYDV-like ribosomal frameshift elements and cap-independent translation elements, including long-distance base pairing spanning four kb were identified. These contain similarities but also informative differences with the BYDV structures, including a strikingly different structure predicted for the 3' cap-independent translation element. These analyses of the RSDaV genomic RNA show more complexity for the RNA structural elements for members of the Luteoviridae.
Probing Gαi1 Protein Activation at Single Amino Acid Resolution

PubMed Central

Sun, Dawei; Maeda, Shoji; Matkovic, Milos; Mendieta, Sandro; Mayer, Daniel; Dawson, Roger; Schertler, Gebhard F.X.; Madan Babu, M.; Veprintsev, Dmitry B.

2016-01-01

We present comprehensive single amino acid resolution maps of the residues stabilising the human Gαi1 subunit in nucleotide- and receptor-bound states. We generated these maps by measuring the effects of alanine mutations on the stability of Gαi1 and of the rhodopsin-Gαi1 complex. We identified stabilization clusters in the GTPase and helical domains responsible for structural integrity and the conformational changes associated with activation. In activation cluster I, helices α1 and α5 pack against strands β1-3 to stabilize the nucleotide-bound states. In the receptor-bound state, these interactions are replaced by interactions between α5 and strands β4-6. Key residues in this cluster are Y320, crucial for the stabilization of the receptor-bound state, and F336, which stabilizes nucleotide-bound states. Destabilization of helix α1, caused by rearrangement of this activation cluster, leads to the weakening of the inter-domain interface and release of GDP. PMID:26258638
Genomic Prediction of Testcross Performance in Canola (Brassica napus)

PubMed Central

Jan, Habib U.; Abbadi, Amine; Lücke, Sophie; Nichols, Richard A.; Snowdon, Rod J.

2016-01-01

Genomic selection (GS) is a modern breeding approach where genome-wide single-nucleotide polymorphism (SNP) marker profiles are simultaneously used to estimate performance of untested genotypes. In this study, the potential of genomic selection methods to predict testcross performance for hybrid canola breeding was applied for various agronomic traits based on genome-wide marker profiles. A total of 475 genetically diverse spring-type canola pollinator lines were genotyped at 24,403 single-copy, genome-wide SNP loci. In parallel, the 950 F1 testcross combinations between the pollinators and two representative testers were evaluated for a number of important agronomic traits including seedling emergence, days to flowering, lodging, oil yield and seed yield along with essential seed quality characters including seed oil content and seed glucosinolate content. A ridge-regression best linear unbiased prediction (RR-BLUP) model was applied in combination with 500 cross-validations for each trait to predict testcross performance, both across the whole population as well as within individual subpopulations or clusters, based solely on SNP profiles. Subpopulations were determined using multidimensional scaling and K-means clustering. Genomic prediction accuracy across the whole population was highest for seed oil content (0.81) followed by oil yield (0.75) and lowest for seedling emergence (0.29). For seed yieId, seed glucosinolate, lodging resistance and days to onset of flowering (DTF), prediction accuracies were 0.45, 0.61, 0.39 and 0.56, respectively. Prediction accuracies could be increased for some traits by treating subpopulations separately; a strategy which only led to moderate improvements for some traits with low heritability, like seedling emergence. No useful or consistent increase in accuracy was obtained by inclusion of a population substructure covariate in the model. Testcross performance prediction using genome-wide SNP markers shows considerable potential for pre-selection of promising hybrid combinations prior to resource-intensive field testing over multiple locations and years. PMID:26824924

Combination Testing Using a Single MSH5 Variant alongside HLA Haplotypes Improves the Sensitivity of Predicting Coeliac Disease Risk in the Polish Population.

PubMed

Paziewska, Agnieszka; Cukrowska, Bozena; Dabrowska, Michalina; Goryca, Krzysztof; Piatkowska, Magdalena; Kluska, Anna; Mikula, Michal; Karczmarski, Jakub; Oralewska, Beata; Rybak, Anna; Socha, Jerzy; Balabas, Aneta; Zeber-Lubecka, Natalia; Ambrozkiewicz, Filip; Konopka, Ewa; Trojanowska, Ilona; Zagroba, Malgorzata; Szperl, Malgorzata; Ostrowski, Jerzy

2015-01-01

Assessment of non-HLA variants alongside standard HLA testing was previously shown to improve the identification of potential coeliac disease (CD) patients. We intended to identify new genetic variants associated with CD in the Polish population that would improve CD risk prediction when used alongside HLA haplotype analysis. DNA samples of 336 CD and 264 unrelated healthy controls were used to create DNA pools for a genome wide association study (GWAS). GWAS findings were validated with individual HLA tag single nucleotide polymorphism (SNP) typing of 473 patients and 714 healthy controls. Association analysis using four HLA-tagging SNPs showed that, as was found in other populations, positive predicting genotypes (HLA-DQ2.5/DQ2.5, HLA-DQ2.5/DQ2.2, and HLA-DQ2.5/DQ8) were found at higher frequencies in CD patients than in healthy control individuals in the Polish population. Both CD-associated SNPs discovered by GWAS were found in the CD susceptibility region, confirming the previously-determined association of the major histocompatibility (MHC) region with CD pathogenesis. The two most significant SNPs from the GWAS were rs9272346 (HLA-dependent; localized within 1 Kb of DQA1) and rs3130484 (HLA-independent; mapped to MSH5). Specificity of CD prediction using the four HLA-tagging SNPs achieved 92.9%, but sensitivity was only 45.5%. However, when a testing combination of the HLA-tagging SNPs and the MSH5 SNP was used, specificity decreased to 80%, and sensitivity increased to 74%. This study confirmed that improvement of CD risk prediction sensitivity could be achieved by including non-HLA SNPs alongside HLA SNPs in genetic testing.
Single-Molecule Methods for Nucleotide Excision Repair: Building a System to Watch Repair in Real Time.

PubMed

Kong, Muwen; Beckwitt, Emily C; Springall, Luke; Kad, Neil M; Van Houten, Bennett

2017-01-01

Single-molecule approaches to solving biophysical problems are powerful tools that allow static and dynamic real-time observations of specific molecular interactions of interest in the absence of ensemble-averaging effects. Here, we provide detailed protocols for building an experimental system that employs atomic force microscopy and a single-molecule DNA tightrope assay based on oblique angle illumination fluorescence microscopy. Together with approaches for engineering site-specific lesions into DNA substrates, these complementary biophysical techniques are well suited for investigating protein-DNA interactions that involve target-specific DNA-binding proteins, such as those engaged in a variety of DNA repair pathways. In this chapter, we demonstrate the utility of the platform by applying these techniques in the studies of proteins participating in nucleotide excision repair. © 2017 Elsevier Inc. All rights reserved.
Reciprocal uniparental disomy in yeast.

PubMed

Andersen, Sabrina L; Petes, Thomas D

2012-06-19

In the diploid cells of most organisms, including humans, each chromosome is usually distinguishable from its partner homolog by multiple single-nucleotide polymorphisms. One common type of genetic alteration observed in tumor cells is uniparental disomy (UPD), in which a pair of homologous chromosomes are derived from a single parent, resulting in loss of heterozygosity for all single-nucleotide polymorphisms while maintaining diploidy. Somatic UPD events are usually explained as reflecting two consecutive nondisjunction events. Here we report a previously undescribed mode of chromosome segregation in Saccharomyces cerevisiae in which one cell division produces daughter cells with reciprocal UPD for the same pair of chromosomes without an aneuploid intermediate. One pair of sister chromatids is segregated into one daughter cell and the other pair is segregated into the other daughter cell, mimicking a meiotic chromosome segregation pattern. We term this process "reciprocal uniparental disomy."
Familial recurrence of SOX2 anophthalmia syndrome: phenotypically normal mother with two affected daughters.

PubMed

Schneider, Adele; Bardakjian, Tanya M; Zhou, Jie; Hughes, Nkecha; Keep, Rosanne; Dorsainville, Darnelle; Kherani, Femida; Katowitz, James; Schimmenti, Lisa A; Hummel, Marybeth; Fitzpatrick, David R; Young, Terri L

2008-11-01

The SOX2 anophthalmia syndrome is emerging as a clinically recognizable disorder that has been identified in 10-15% of individuals with bilateral anophthalmia. Extra-ocular anomalies are common. The majority of SOX2 mutations identified appear to arise de novo in probands ascertained through the presence of anophthalmia or microphthalmia. In this report, we describe two sisters with bilateral anophthalmia/microphthalmia, brain anomalies and a novel heterozygous SOX2 gene single-base pair nucleotide deletion, c.551delC, which predicts p.Pro184ArgfsX19. The hypothetical protein product is predicted to lead to haploinsufficient SOX2 function. Mosaicism for this mutation in the SOX2 gene was also identified in their clinically unaffected mother in peripheral blood DNA. Thus it cannot be assumed that all SOX2 mutations in individuals with anophthalmia/microphthalmia are de novo. Testing of parents is indicated when a SOX2 mutation is identified in a proband. Copyright 2008 Wiley-Liss, Inc.
Familial Recurrence of SOX2 Anophthalmia Syndrome: Phenotypically Normal Mother with Two Affected Daughters

PubMed Central

Schneider, Adele; Bardakjian, Tanya M.; Zhou, Jie; Hughes, Nkecha; Keep, Rosanne; Dorsainville, Darnelle; Kherani, Femida; Katowitz, James; Schimmenti, Lisa A.; Hummel, Marybeth; FitzPatrick, David R; Young, Terri L.

2013-01-01

The SOX2 anophthalmia syndrome is emerging as a clinically recognizable disorder that has been identified in 10–15% of individuals with bilateral anophthalmia. Extra-ocular anomalies are common. The majority of SOX2 mutations identified appear to arise de novo in probands ascertained through the presence of anophthalmia or microphthalmia. In this report, we describe two sisters with bilateral anophthalmia/microphthalmia, brain anomalies and a novel heterozygous SOX2 gene single-base pair nucleotide deletion, c.551delC, which predicts p.Pro184ArgfsX19. The hypothetical protein product is predicted to lead to haploinsufficient SOX2 function. Mosaicism for this mutation in the SOX2 gene was also identified in their clinically unaffected mother in peripheral blood DNA. Thus it cannot be assumed that all SOX2 mutations in individuals with anophthalmia /microphthalmia are de novo. Testing of parents is indicated when a SOX2 mutation is identified in a proband. PMID:18831064
Relatedness predicts multiple measures of investment in cooperative nest construction in sociable weavers

PubMed Central

Leighton, Gavin M.; Echeverri, Sebastian; Heinrich, Dirk; Kolberg, Holger

2015-01-01

Although communal goods are often critical to society, they are simultaneously susceptible to exploitation and are evolutionarily stable only if mechanisms exist to curtail exploitation. Mechanisms such as punishment and kin selection have been offered as general explanations for how communal resources can be maintained. Evidence for these mechanisms comes largely from humans and social insects, leaving their generality in question. To assess how communal resources are maintained, we observed cooperative nest construction in sociable weavers (Philetairus socius). The communal nest of sociable weavers provides thermal benefits for all individuals but requires continual maintenance. We observed cooperative nest construction and also recorded basic morphological characteristics. We also collected blood samples, performed next-generation sequencing, and isolated 2358 variable single nucleotide polymorphisms (SNPs) to estimate relatedness. We find that relatedness predicts investment in cooperative nest construction, while no other morphological characters significantly explain cooperative output. We argue that indirect benefits are a critical fitness component for maintaining the cooperative behavior that maintains the communal good. PMID:26726282
COOLAIR Antisense RNAs Form Evolutionarily Conserved Elaborate Secondary Structures

DOE PAGES

Hawkes, Emily J.; Hennelly, Scott P.; Novikova, Irina V.; ...

2016-09-20

There is considerable debate about the functionality of long non-coding RNAs (lncRNAs). Lack of sequence conservation has been used to argue against functional relevance. Here, we investigated antisense lncRNAs, called COOLAIR, at the A. thaliana FLC locus and experimentally determined their secondary structure. The major COOLAIR variants are highly structured, organized by exon. The distally polyadenylated transcript has a complex multi-domain structure, altered by a single non-coding SNP defining a functionally distinct A. thaliana FLC haplotype. The A. thaliana COOLAIR secondary structure was used to predict COOLAIR exons in evolutionarily divergent Brassicaceae species. These predictions were validated through chemical probingmore » and cloning. Despite the relatively low nucleotide sequence identity, the structures, including multi-helix junctions, show remarkable evolutionary conservation. In a number of places, the structure is conserved through covariation of a non-contiguous DNA sequence. This structural conservation supports a functional role for COOLAIR transcripts rather than, or in addition to, antisense transcription.« less
Transactions Between Substance Use Intervention, the Oxytocin Receptor (OXTR) Gene, and Peer Substance Use Predicting Youth Alcohol Use.

PubMed

Cleveland, H Harrington; Griffin, Amanda M; Wolf, Pedro S A; Wiebe, Richard P; Schlomer, Gabriel L; Feinberg, Mark E; Greenberg, Mark T; Spoth, Richard L; Redmond, Cleve; Vandenbergh, David J

2018-01-01

This study investigated the oxytocin receptor (OXTR) gene's moderation of associations between exposure to a substance misuse intervention, average peer substance use, and adolescents' own alcohol use during the 9th-grade. OXTR genetic risk was measured using five single nucleotide polymorphisms (SNPs), and peer substance use was based on youths' nominated closest friends' own reports of alcohol, cigarette, and marijuana use, based on data from the PROSPER project. Regression models revealed several findings. First, low OXTR risk was linked to affiliating with friends who reported less substance use in the intervention condition but not the control condition. Second, affiliating with high substance-using friends predicted youth alcohol risk regardless of OXTR risk or intervention condition. Third, although high OXTR risk youth in the intervention condition who associated with low substance-using friends reported somewhat higher alcohol use than comparable youth in the control group, the absolute level of alcohol use among these youth was still among the lowest in the sample.
Genotype-phenotype association study via new multi-task learning model

PubMed Central

Huo, Zhouyuan; Shen, Dinggang

2018-01-01

Research on the associations between genetic variations and imaging phenotypes is developing with the advance in high-throughput genotype and brain image techniques. Regression analysis of single nucleotide polymorphisms (SNPs) and imaging measures as quantitative traits (QTs) has been proposed to identify the quantitative trait loci (QTL) via multi-task learning models. Recent studies consider the interlinked structures within SNPs and imaging QTs through group lasso, e.g. ℓ2,1-norm, leading to better predictive results and insights of SNPs. However, group sparsity is not enough for representing the correlation between multiple tasks and ℓ2,1-norm regularization is not robust either. In this paper, we propose a new multi-task learning model to analyze the associations between SNPs and QTs. We suppose that low-rank structure is also beneficial to uncover the correlation between genetic variations and imaging phenotypes. Finally, we conduct regression analysis of SNPs and QTs. Experimental results show that our model is more accurate in prediction than compared methods and presents new insights of SNPs. PMID:29218896
COOLAIR Antisense RNAs Form Evolutionarily Conserved Elaborate Secondary Structures

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hawkes, Emily J.; Hennelly, Scott P.; Novikova, Irina V.

There is considerable debate about the functionality of long non-coding RNAs (lncRNAs). Lack of sequence conservation has been used to argue against functional relevance. Here, we investigated antisense lncRNAs, called COOLAIR, at the A. thaliana FLC locus and experimentally determined their secondary structure. The major COOLAIR variants are highly structured, organized by exon. The distally polyadenylated transcript has a complex multi-domain structure, altered by a single non-coding SNP defining a functionally distinct A. thaliana FLC haplotype. The A. thaliana COOLAIR secondary structure was used to predict COOLAIR exons in evolutionarily divergent Brassicaceae species. These predictions were validated through chemical probingmore » and cloning. Despite the relatively low nucleotide sequence identity, the structures, including multi-helix junctions, show remarkable evolutionary conservation. In a number of places, the structure is conserved through covariation of a non-contiguous DNA sequence. This structural conservation supports a functional role for COOLAIR transcripts rather than, or in addition to, antisense transcription.« less
Exploring the deleterious SNPs in XRCC4 gene using computational approach and studying their association with breast cancer in the population of West India.

PubMed

Singh, Preety K; Mistry, Kinnari N; Chiramana, Haritha; Rank, Dharamshi N; Joshi, Chaitanya G

2018-05-20

Non-homologous end joining (NHEJ) pathway has pivotal role in repair of double-strand DNA breaks that may lead to carcinogenesis. XRCC4 is one of the essential proteins of this pathway and single-nucleotide polymorphisms (SNPs) of this gene are reported to be associated with cancer risks. In our study, we first used computational approaches to predict the damaging variants of XRCC4 gene. Tools predicted rs79561451 (S110P) nsSNP as the most deleterious SNP. Along with this SNP, we analysed other two SNPs (rs3734091 and rs6869366) to study their association with breast cancer in population of West India. Variant rs3734091 was found to be significantly associated with breast cancer while rs6869366 variant did not show any association. These SNPs may influence the susceptibility of individuals to breast cancer in this population. Copyright © 2018 Elsevier B.V. All rights reserved.
Psychological distress following marital separation interacts with a polymorphism in the serotonin transporter gene to predict cardiac vagal control in the laboratory.

PubMed

Hasselmo, Karen; Sbarra, David A; O'Connor, Mary-Frances; Moreno, Francisco A

2015-06-01

Marital separation is linked to negative mental and physical health; however, the strength of this link may vary across people. This study examined changes in respiratory sinus arrhythmia (RSA), used to assess cardiac vagal control, in recently separated adults (N = 79; M time since separation = 3.5 months). When reflecting on the separation, self-reported psychological distress following the separation interacted with a polymorphism in the serotonin transporter gene (5-HTTLPR) and a relevant single nucleotide polymorphism (SNP), rs25531, to predict RSA. Among people reporting emotional difficulties after the separation, those who were homozygous for the short allele had lower RSA levels while reflecting on their relationship than other genotypes. The findings, although limited by the relatively small sample size, are discussed in terms of how higher-sensitivity genotypes may interact with psychological responses to stress to alter physiology. © 2015 Society for Psychophysiological Research.
Quantitative DNA Methylation Analysis Identifies a Single CpG Dinucleotide Important for ZAP-70 Expression and Predictive of Prognosis in Chronic Lymphocytic Leukemia

PubMed Central

Claus, Rainer; Lucas, David M.; Stilgenbauer, Stephan; Ruppert, Amy S.; Yu, Lianbo; Zucknick, Manuela; Mertens, Daniel; Bühler, Andreas; Oakes, Christopher C.; Larson, Richard A.; Kay, Neil E.; Jelinek, Diane F.; Kipps, Thomas J.; Rassenti, Laura Z.; Gribben, John G.; Döhner, Hartmut; Heerema, Nyla A.; Marcucci, Guido; Plass, Christoph; Byrd, John C.

2012-01-01

Purpose Increased ZAP-70 expression predicts poor prognosis in chronic lymphocytic leukemia (CLL). Current methods for accurately measuring ZAP-70 expression are problematic, preventing widespread application of these tests in clinical decision making. We therefore used comprehensive DNA methylation profiling of the ZAP-70 regulatory region to identify sites important for transcriptional control. Patients and Methods High-resolution quantitative DNA methylation analysis of the entire ZAP-70 gene regulatory regions was conducted on 247 samples from patients with CLL from four independent clinical studies. Results Through this comprehensive analysis, we identified a small area in the 5′ regulatory region of ZAP-70 that showed large variability in methylation in CLL samples but was universally methylated in normal B cells. High correlation with mRNA and protein expression, as well as activity in promoter reporter assays, revealed that within this differentially methylated region, a single CpG dinucleotide and neighboring nucleotides are particularly important in ZAP-70 transcriptional regulation. Furthermore, by using clustering approaches, we identified a prognostic role for this site in four independent data sets of patients with CLL using time to treatment, progression-free survival, and overall survival as clinical end points. Conclusion Comprehensive quantitative DNA methylation analysis of the ZAP-70 gene in CLL identified important regions responsible for transcriptional regulation. In addition, loss of methylation at a specific single CpG dinucleotide in the ZAP-70 5′ regulatory sequence is a highly predictive and reproducible biomarker of poor prognosis in this disease. This work demonstrates the feasibility of using quantitative specific ZAP-70 methylation analysis as a relevant clinically applicable prognostic test in CLL. PMID:22564988
GDF5 single-nucleotide polymorphism rs143383 is associated with lumbar disc degeneration in Northern European women

PubMed Central

Williams, F M K; Popham, M; Hart, D J; de Schepper, E; Bierma-Zeinstra, S; Hofman, A; Uitterlinden, A G; Arden, N K; Cooper, C; Spector, T D; Valdes, A M; van Meurs, J

2011-01-01

Objective Lumbar disc degeneration (LDD) is a serious social and medical problem which has been shown to be highly heritable. It has similarities with peripheral joint osteoarthritis (OA) in terms of both epidemiology and pathologic processes. A few known genetic variants have been identified using a candidate gene approach, but many more are thought to exist. GDF5 is a gene whose variants have been shown to play a role in skeletal height as well as predisposing to peripheral joint OA. In vitro, the gene product growth differentiation factor 5 has been shown to promote growth and repair of animal disc. This study was undertaken to investigate whether the GDF5 gene plays a role in LDD. Methods We investigated whether the 5′ upstream single-nucleotide polymorphism (SNP) variant rs143383 was associated with LDD, using plain radiography and magnetic resonance imaging to identify disc space narrowing and osteophytes, in 5 population cohorts from Northern Europe. Results An association between LDD and the SNP rs143383 was identified in women, with the same risk allele as in knee and hip OA (odds ratio 1.72 [95% confidence interval 1.15–2.57], P = 0.008). Conclusion Our findings in 5 population cohorts from Northern Europe indicate that a variant in the GDF5 gene is a risk factor for LDD in women. Many more such variants are predicted to exist, but this result highlights the growth and differentiation cellular pathway as a possible route to a better understanding of the process behind lumbar disc degeneration. PMID:21360499
Pharmacogenetic determinants of outcomes on triplet hepatic artery infusion and intravenous cetuximab for liver metastases from colorectal cancer (European trial OPTILIV, NCT00852228).

PubMed

Lévi, Francis; Karaboué, Abdoulaye; Saffroy, Raphaël; Desterke, Christophe; Boige, Valerie; Smith, Denis; Hebbar, Mohamed; Innominato, Pasquale; Taieb, Julien; Carvalho, Carlos; Guimbaud, Rosine; Focan, Christian; Bouchahda, Mohamed; Adam, René; Ducreux, Michel; Milano, Gérard; Lemoine, Antoinette

2017-09-26

The hepatic artery infusion (HAI) of irinotecan, oxaliplatin and 5-fluorouracil with intravenous cetuximab achieved outstanding efficacy in previously treated patients with initially unresectable liver metastases from colorectal cancer. This planned study aimed at the identification of pharmacogenetic predictors of outcomes. Circulating mononuclear cells were analysed for 207 single-nucleotide polymorphisms (SNPs) from 34 pharmacology genes. Single-nucleotide polymorphisms passing stringent Hardy-Weinberg equilibrium test were tested for their association with outcomes in 52 patients (male/female, 36/16; WHO PS, 0-1). VKORC1 SNPs (rs9923231 and rs9934438) were associated with early and objective responses, and survival. For rs9923231, T/T achieved more early responses than C/T (50% vs 5%, P=0.029) and greatest 4-year survival (46% vs 0%, P=0.006). N-acetyltransferase-2 (rs1041983 and rs1801280) were associated with up to seven-fold more macroscopically complete hepatectomies. Progression-free survival was largest in ABCB1 rs1045642 T/T (P=0.026) and rs2032582 T/T (P=0.035). Associations were found between toxicities and gene variants (P<0.05), including neutropenia with ABCB1 (rs1045642) and SLC0B3 (rs4149117 and rs7311358); and diarrhoea with CYP2C9 (rs1057910), CYP2C19 (rs3758581), UGT1A6 (rs4124874) and SLC22A1 (rs72552763). VKORC1, NAT2 and ABCB1 variants predicted for HAI efficacy. Pharmacogenetics could guide the personalisation of liver-targeted medico-surgical therapies.
A single nucleotide polymorphism in MGEA5 encoding O-GlcNAc-selective N-acetyl-beta-D glucosaminidase is associated with type 2 diabetes in Mexican Americans.

PubMed

Lehman, Donna M; Fu, Dong-Jing; Freeman, Angela B; Hunt, Kelly J; Leach, Robin J; Johnson-Pais, Teresa; Hamlington, Jeanette; Dyer, Thomas D; Arya, Rector; Abboud, Hanna; Göring, Harald H H; Duggirala, Ravindranath; Blangero, John; Konrad, Robert J; Stern, Michael P

2005-04-01

Excess O-glycosylation of proteins by O-linked beta-N-acetylglucosamine (O-GlcNAc) may be involved in the pathogenesis of type 2 diabetes. The enzyme O-GlcNAc-selective N-acetyl-beta-d glucosaminidase (O-GlcNAcase) encoded by MGEA5 on 10q24.1-q24.3 reverses this modification by catalyzing the removal of O-GlcNAc. We have previously reported the linkage of type 2 diabetes and age at diabetes onset to an overlapping region on chromosome 10q in the San Antonio Family Diabetes Study (SAFADS). In this study, we investigated menangioma-expressed antigen-5 (MGEA5) as a positional candidate gene. Twenty-four single nucleotide polymorphisms (SNPs), identified by sequencing 44 SAFADS subjects, were genotyped in 436 individuals from 27 families whose data were used in the original linkage report. Association tests indicated significant association of a novel SNP with the traits diabetes (P = 0.0128, relative risk = 2.77) and age at diabetes onset (P = 0.0017). The associated SNP is located in intron 10, which contains an alternate stop codon and may lead to decreased expression of the 130-kDa isoform, the isoform predicted to contain the O-GlcNAcase activity. We investigated whether this variant was responsible for the original linkage signal. The variance attributed to this SNP accounted for approximately 25% of the logarithm of odds. These results suggest that this variant within the MGEA5 gene may increase diabetes risk in Mexican Americans.
Signatures of selection in the Iberian honey bee (Apis mellifera iberiensis) revealed by a genome scan analysis of single nucleotide polymorphisms.

PubMed

Chávez-Galarza, Julio; Henriques, Dora; Johnston, J Spencer; Azevedo, João C; Patton, John C; Muñoz, Irene; De la Rúa, Pilar; Pinto, M Alice

2013-12-01

Understanding the genetic mechanisms of adaptive population divergence is one of the most fundamental endeavours in evolutionary biology and is becoming increasingly important as it will allow predictions about how organisms will respond to global environmental crisis. This is particularly important for the honey bee, a species of unquestionable ecological and economical importance that has been exposed to increasing human-mediated selection pressures. Here, we conducted a single nucleotide polymorphism (SNP)-based genome scan in honey bees collected across an environmental gradient in Iberia and used four FST -based outlier tests to identify genomic regions exhibiting signatures of selection. Additionally, we analysed associations between genetic and environmental data for the identification of factors that might be correlated or act as selective pressures. With these approaches, 4.4% (17 of 383) of outlier loci were cross-validated by four FST -based methods, and 8.9% (34 of 383) were cross-validated by at least three methods. Of the 34 outliers, 15 were found to be strongly associated with one or more environmental variables. Further support for selection, provided by functional genomic information, was particularly compelling for SNP outliers mapped to different genes putatively involved in the same function such as vision, xenobiotic detoxification and innate immune response. This study enabled a more rigorous consideration of selection as the underlying cause of diversity patterns in Iberian honey bees, representing an important first step towards the identification of polymorphisms implicated in local adaptation and possibly in response to recent human-mediated environmental changes. © 2013 John Wiley & Sons Ltd.
Functional and Structural Consequence of Rare Exonic Single Nucleotide Polymorphisms: One Story, Two Tales

PubMed Central

Gu, Wanjun; Gurguis, Christopher I.; Zhou, Jin J.; Zhu, Yihua; Ko, Eun-A.; Ko, Jae-Hong; Wang, Ting; Zhou, Tong

2015-01-01

Genetic variation arising from single nucleotide polymorphisms (SNPs) is ubiquitously found among human populations. While disease-causing variants are known in some cases, identifying functional or causative variants for most human diseases remains a challenging task. Rare SNPs, rather than common ones, are thought to be more important in the pathology of most human diseases. We propose that rare SNPs should be divided into two categories dependent on whether the minor alleles are derived or ancestral. Derived alleles are less likely to have been purified by evolutionary processes and may be more likely to induce deleterious effects. We therefore hypothesized that the rare SNPs with derived minor alleles would be more important for human diseases and predicted that these variants would have larger functional or structural consequences relative to the rare variants for which the minor alleles are ancestral. We systematically investigated the consequences of the exonic SNPs on protein function, mRNA structure, and translation. We found that the functional and structural consequences are more significant for the rare exonic variants for which the minor alleles are derived. However, this pattern is reversed when the minor alleles are ancestral. Thus, the rare exonic SNPs with derived minor alleles are more likely to be deleterious. Age estimation of rare SNPs confirms that these potentially deleterious SNPs are recently evolved in the human population. These results have important implications for understanding the function of genetic variations in human exonic regions and for prioritizing functional SNPs in genome-wide association studies of human diseases. PMID:26454016
Single-nucleotide polymorphisms g.151435C>T and g.173057T>C in PRLR gene regulated by bta-miR-302a are associated with litter size in goats.

PubMed

An, Xiaopeng; Hou, Jinxing; Gao, Teyang; Lei, Yingnan; Li, Guang; Song, Yuxuan; Wang, Jiangang; Cao, Binyun

2015-06-01

Single-nucleotide polymorphisms (SNPs) located at microRNA-binding sites (miR-SNPs) can affect the expression of genes. This study aimed to identify the miR-SNPs associated with litter size. Guanzhong (n = 321) and Boer (n = 191) goat breeds were used to detect SNPs in the caprine prolactin receptor (PRLR) gene by DNA sequencing, primer-introduced restriction analysis-polymerase chain reaction, and polymerase chain reaction-restriction fragment length polymorphism. Three novel SNPs (g.151435C>T, g.151454A>G, and g.173057T>C) were identified in the caprine PRLR gene. Statistical results indicated that the g.151435C>T and g.173057T>C SNPs were significantly associated with litter size in Guanzhong and Boer goat breeds. Further analysis revealed that combinative genotype C6 (TTAACC) was better than the others for litter size in both goat breeds. Furthermore, the PRLR g.173057T>C polymorphism was predicted to regulate the binding activity of bta-miR-302a. Luciferase reporter gene assay confirmed that 173057C to T substitution disrupted the binding site for bta-miR-302a, resulting in the reduced levels of luciferase. Taken together, these findings suggested that bta-miR-302a can influence the expression of PRLR protein by binding with 3'untranslated region, resulting in that the g.173057T>C SNP had significant effects on litter size. Copyright © 2015 Elsevier Inc. All rights reserved.
Association of Sun Exposure, Skin Colour and Body Mass Index with Vitamin D Status in Individuals Who Are Morbidly Obese.

PubMed

Dix, Clare F; Bauer, Judith D; Martin, Ian; Rochester, Sharon; Duarte Romero, Briony; Prins, Johannes B; Wright, Olivia R L

2017-10-04

Vitamin D deficiency is a common issue, particularly in obese populations, and is tested by assessing serum 25(OH)D concentrations. This study aimed to identify factors that contribute to the vitamin D status in fifty morbidly obese individuals recruited prior to bariatric surgery. Data collected included serum 25(OH)D concentrations, dietary and supplement intake of vitamin D, sun exposure measures, skin colour via spectrophotometry, and genotype analysis of several single nucleotide polymorphisms in the vitamin D metabolism pathway. Results showed a significant correlation between serum 25(OH)D concentrations and age, and serum 25(OH)D and ITAC score (natural skin colour). Natural skin colour accounted for 13.5% of variation in serum 25(OH)D, with every 10° increase in ITAC score (i.e., lighter skin) leading to a 9 nmol/L decrease in serum 25(OH)D. Multiple linear regression using age, ITAC score, and average UV index in the three months prior to testing, significantly predicted serum 25(OH)D concentrations ( R ² = 29.7%). Single nucleotide polymorphisms for all vitamin D genes tested, showed lower serum 25(OH)D for those with the rare genotype compared to the common genotype; this was most pronounced for fok1 and rs4588 , where those with the rare genotype were insufficient (<50 nmol/L), and those with the common genotype were sufficient (≥50 nmol/L). Assessing vitamin D status in individuals with morbid obesity requires testing of 25(OH)D, but potential risk factors for this population include natural skin colour and age.

Phylogenetic distribution and expression of a penicillin-binding protein homologue, Ear and its significance in virulence of Staphylococcus aureus.

PubMed

Singh, Vineet K; Ring, Robert P; Aswani, Vijay; Stemper, Mary E; Kislow, Jennifer; Ye, Zhan; Shukla, Sanjay K

2017-12-01

Staphylococcus aureus is an opportunistic human pathogen that can cause serious infections in humans. A plethora of known and putative virulence factors are produced by staphylococci that collectively orchestrate pathogenesis. Ear protein (Escherichia coli ampicillin resistance) in S. aureus is an exoprotein in COL strain, predicted to be a superantigen, and speculated to play roles in antibiotic resistance and virulence. The goal of this study was to determine if expression of ear is modulated by single nucleotide polymorphisms in its promoter and coding sequences and whether this gene plays roles in antibiotic resistance and virulence. Promoter, coding sequences and expression of the ear gene in clinical and carriage S. aureus strains with distinct genetic backgrounds were analysed. The JE2 strain and its isogenic ear mutant were used in a systemic infection mouse model to determine the competiveness of the ear mutant.Results/Key findings. The ear gene showed a variable expression, with USA300FPR3757 showing a high-level expression compared to many of the other strains tested including some showing negligible expression. Higher expression was associated with agr type 1 but not correlated with phylogenetic relatedness of the ear gene based upon single nucleotide polymorphisms in the promoter or coding regions suggesting a complex regulation. An isogenic JE2 (USA300 background) ear mutant showed no significant difference in its growth, antibiotic susceptibility or virulence in a mouse model. Our data suggests that despite being highly expressed in a USA300 genetic background, Ear is not a significant contributor to virulence in that strain.
A method to associate all possible combinations of genetic and environmental factors using GxE landscape plot.

PubMed

Nagaie, Satoshi; Ogishima, Soichi; Nakaya, Jun; Tanaka, Hiroshi

2015-01-01

Genome-wide association studies (GWAS) and linkage analysis has identified many single nucleotide polymorphisms (SNPs) related to disease. There are many unknown SNPs whose minor allele frequencies (MAFs) as low as 0.005 having intermediate effects with odds ratio between 1.5~3.0. Low frequency variants having intermediate effects on disease pathogenesis are believed to have complex interactions with environmental factors called gene-environment interactions (GxE). Hence, we describe a model using 3D Manhattan plot called GxE landscape plot to visualize the association of p-values for gene-environment interactions (GxE). We used the Gene-Environment iNteraction Simulator 2 (GENS2) program to simulate interactions between two genetic loci and one environmental factor in this exercise. The dataset used for training contains disease status, gender, 20 environmental exposures and 100 genotypes for 170 subjects, and p-values were calculated by Cochran-Mantel-Haenszel chi-squared test on known data. Subsequently, we created a 3D GxE landscape plot of negative logarithm of the association of p-values for all the possible combinations of genetic and environmental factors with their hierarchical clustering. Thus, the GxE landscape plot is a valuable model to predict association of p-values for GxE and similarity among genotypes and environments in the context of disease pathogenesis. GxE - Gene-environment interactions, GWAS - Genome-wide association study, MAFs - Minor allele frequencies, SNPs - Single nucleotide polymorphisms, EWAS - Environment-wide association study, FDR - False discovery rate, JPT+CHB - HapMap population of Japanese in Tokyo, Japan - Han Chinese in Beijing.
Single nucleotide polymorphisms of Helicobacter pylori dupA that lead to premature stop codons.

PubMed

Moura, Sílvia B; Costa, Rafaella F A; Anacleto, Charles; Rocha, Gifone A; Rocha, Andreia M C; Queiroz, Dulciene M M

2012-06-01

The detection of the putative disease-specific Helicobacter pylori marker duodenal ulcer promoting gene A (dupA) is currently based on PCR detection of jhp0917 and jhp0918 that form the gene. However, mutations that lead to premature stop codons that split off the dupA leading to truncated products cannot be evaluated by PCR. We directly sequence the complete dupA of 75 dupA-positive strains of H. pylori isolated from patients with gastritis (n = 26), duodenal ulcer (n = 29), and gastric carcinoma (n = 20), to search for frame-shifting mutations that lead to stop codon. Thirty-four strains had single nucleotide mutations in dupA that lead to premature stop codon creating smaller products than the predicted 1839 bp product and, for this reason, were considered as dupA-negative. Intact dupA was more frequently observed in strains isolated from duodenal ulcer patients (65.5%) than in patients with gastritis only (46.2%) or with gastric carcinoma (50%). In logistic analysis, the presence of the intact dupA independently associated with duodenal ulcer (OR = 5.06; 95% CI = 1.22-20.96, p = .02). We propose the primer walking methodology as a simple technique to sequence the gene. When we considered as dupA-positive only those strains that carry dupA gene without premature stop codons, the gene was associated with duodenal ulcer and, therefore, can be used as a marker for this disease in our population. © 2012 Blackwell Publishing Ltd.
On safari to Random Jungle: a fast implementation of Random Forests for high-dimensional data

PubMed Central

Schwarz, Daniel F.; König, Inke R.; Ziegler, Andreas

2010-01-01

Motivation: Genome-wide association (GWA) studies have proven to be a successful approach for helping unravel the genetic basis of complex genetic diseases. However, the identified associations are not well suited for disease prediction, and only a modest portion of the heritability can be explained for most diseases, such as Type 2 diabetes or Crohn's disease. This may partly be due to the low power of standard statistical approaches to detect gene–gene and gene–environment interactions when small marginal effects are present. A promising alternative is Random Forests, which have already been successfully applied in candidate gene analyses. Important single nucleotide polymorphisms are detected by permutation importance measures. To this day, the application to GWA data was highly cumbersome with existing implementations because of the high computational burden. Results: Here, we present the new freely available software package Random Jungle (RJ), which facilitates the rapid analysis of GWA data. The program yields valid results and computes up to 159 times faster than the fastest alternative implementation, while still maintaining all options of other programs. Specifically, it offers the different permutation importance measures available. It includes new options such as the backward elimination method. We illustrate the application of RJ to a GWA of Crohn's disease. The most important single nucleotide polymorphisms (SNPs) validate recent findings in the literature and reveal potential interactions. Availability: The RJ software package is freely available at http://www.randomjungle.org Contact: inke.koenig@imbs.uni-luebeck.de; ziegler@imbs.uni-luebeck.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20505004
Genomic diversity of the human intestinal parasite Entamoeba histolytica

PubMed Central

2012-01-01

Background Entamoeba histolytica is a significant cause of disease worldwide. However, little is known about the genetic diversity of the parasite. We re-sequenced the genomes of ten laboratory cultured lines of the eukaryotic pathogen Entamoeba histolytica in order to develop a picture of genetic diversity across the genome. Results The extreme nucleotide composition bias and repetitiveness of the E. histolytica genome provide a challenge for short-read mapping, yet we were able to define putative single nucleotide polymorphisms in a large portion of the genome. The results suggest a rather low level of single nucleotide diversity, although genes and gene families with putative roles in virulence are among the more polymorphic genes. We did observe large differences in coverage depth among genes, indicating differences in gene copy number between genomes. We found evidence indicating that recombination has occurred in the history of the sequenced genomes, suggesting that E. histolytica may reproduce sexually. Conclusions E. histolytica displays a relatively low level of nucleotide diversity across its genome. However, large differences in gene family content and gene copy number are seen among the sequenced genomes. The pattern of polymorphism indicates that E. histolytica reproduces sexually, or has done so in the past, which has previously been suggested but not proven. PMID:22630046
Assessment of primer/template mismatch effects on real-time PCR amplification of target taxa for GMO quantification.

PubMed

Ghedira, Rim; Papazova, Nina; Vuylsteke, Marnik; Ruttink, Tom; Taverniers, Isabel; De Loose, Marc

2009-10-28

GMO quantification, based on real-time PCR, relies on the amplification of an event-specific transgene assay and a species-specific reference assay. The uniformity of the nucleotide sequences targeted by both assays across various transgenic varieties is an important prerequisite for correct quantification. Single nucleotide polymorphisms (SNPs) frequently occur in the maize genome and might lead to nucleotide variation in regions used to design primers and probes for reference assays. Further, they may affect the annealing of the primer to the template and reduce the efficiency of DNA amplification. We assessed the effect of a minor DNA template modification, such as a single base pair mismatch in the primer attachment site, on real-time PCR quantification. A model system was used based on the introduction of artificial mismatches between the forward primer and the DNA template in the reference assay targeting the maize starch synthase (SSIIb) gene. The results show that the presence of a mismatch between the primer and the DNA template causes partial to complete failure of the amplification of the initial DNA template depending on the type and location of the nucleotide mismatch. With this study, we show that the presence of a primer/template mismatch affects the estimated total DNA quantity to a varying degree.
Molecular characterization of the vitamin D receptor (VDR) gene in Holstein cows.

PubMed

Ali, Mayar O; El-Adl, Mohamed A; Ibrahim, Hussam M M; Elseedy, Youssef Y; Rizk, Mohamed A; El-Khodery, Sabry A

2018-06-01

Vitamin D plays a vital role in calcium homeostasis, growth, and immunoregulation. Because little is known about the vitamin D receptor (VDR) gene in cattle, the aim of the present investigation was to present the molecular characterization of exons 5 and 6 of the VDR gene in Holstein cows. DNA extraction, genomic sequencing, phylogenetic analysis, synteny mapping and single nucleotide gene polymorphism analysis of the VDR gene were performed to assess blood samples collected from 50 clinically healthy Holstein cows. The results revealed the presence of a 450-base pair (bp) nucleotide sequence that resembled exons 5 and 6 with intron 5 enclosed between these exons. Sequence alignment and phylogenetic analysis revealed a close relationship between the sequenced VDR region and that found in Hereford cattle. A close association between this region and the corresponding region in small ruminants was also documented. Moreover, a single nucleotide polymorphism (SNP) that caused the replacement of a glutamate with an arginine in the deduced amino acid sequence was detected at position 7 of exon 5. In conclusion, Holstein and Hereford cattle differ with respect to exon 5 of the VDR gene. Phylogenetic analysis of the VDR gene based on nucleotide sequence produced different results from prior analyses based on amino acid sequence. Copyright © 2018 Elsevier Ltd. All rights reserved.
Secondary structure prediction and structure-specific sequence analysis of single-stranded DNA.

PubMed

Dong, F; Allawi, H T; Anderson, T; Neri, B P; Lyamichev, V I

2001-08-01

DNA sequence analysis by oligonucleotide binding is often affected by interference with the secondary structure of the target DNA. Here we describe an approach that improves DNA secondary structure prediction by combining enzymatic probing of DNA by structure-specific 5'-nucleases with an energy minimization algorithm that utilizes the 5'-nuclease cleavage sites as constraints. The method can identify structural differences between two DNA molecules caused by minor sequence variations such as a single nucleotide mutation. It also demonstrates the existence of long-range interactions between DNA regions separated by >300 nt and the formation of multiple alternative structures by a 244 nt DNA molecule. The differences in the secondary structure of DNA molecules revealed by 5'-nuclease probing were used to design structure-specific probes for mutation discrimination that target the regions of structural, rather than sequence, differences. We also demonstrate the performance of structure-specific 'bridge' probes complementary to non-contiguous regions of the target molecule. The structure-specific probes do not require the high stringency binding conditions necessary for methods based on mismatch formation and permit mutation detection at temperatures from 4 to 37 degrees C. Structure-specific sequence analysis is applied for mutation detection in the Mycobacterium tuberculosis katG gene and for genotyping of the hepatitis C virus.
Sequence diversity within the reovirus S2 gene: reovirus genes reassort in nature, and their termini are predicted to form a panhandle motif.

PubMed Central

Chapell, J D; Goral, M I; Rodgers, S E; dePamphilis, C W; Dermody, T S

1994-01-01

To better understand genetic diversity within mammalian reoviruses, we determined S2 nucleotide and deduced sigma 2 amino acid sequences of nine reovirus strains and compared these sequences with those of prototype strains of the three reovirus serotypes. The S2 gene and sigma 2 protein are highly conserved among the four type 1, one type 2, and seven type 3 strains studied. Phylogenetic analyses based on S2 nucleotide sequences of the 12 reovirus strains indicate that diversity within the S2 gene is independent of viral serotype. Additionally, we found marked topological differences between phylogenetic trees generated from S1 and S2 gene nucleotide sequences of the seven type 3 strains. These results demonstrate that reovirus S1 and S2 genes have distinct evolutionary histories, thus providing phylogenetic evidence for lateral transfer of reovirus genes in nature. When variability among the 12 sigma 2-encoding S2 nucleotide sequences was analyzed at synonymous positions, we found that approximately 60 nucleotides at the 5' terminus and 30 nucleotides at the 3' terminus were markedly conserved in comparison with other sigma 2-encoding regions of S2. Predictions of RNA secondary structures indicate that the more conserved S2 sequences participate in the formation of an extended region of duplex RNA interrupted by a pair of stem-loops. Among the 12 deduced sigma 2 amino acid sequences examined, substitutions were observed at only 11% of amino acid positions. This finding suggests that constraints on the structure or function of sigma 2, perhaps in part because of its location in the virion core, have limited sequence diversity within this protein. PMID:8289378
N-acetyltransferase single nucleotide polymorphisms: Emerging concepts serve as a paradigm for understanding complexities of personalized medicine

PubMed Central

Hein, David W.

2009-01-01

Arylamine N-acetyltransferase 1 (NAT1) and 2 (NAT2) exhibit single nucleotide polymorphisms (SNPs) in human populations that modify drug and carcinogen metabolism. This paper updates the identity, location, and functional effects of these SNPs and then follows with emerging concepts for understanding why pharmacogenetic findings may not be replicated consistently. Using this paradigm as an example, laboratory-based mechanistic analyses can reveal complexities such that genetic polymorphisms become biologically and medically relevant when confounding factors are more fully understood and considered. As medical care moves to a more personalized approach, the implications of these confounding factors will be important in understanding the complexities of personalized medicine. PMID:19379125
Mycobacterium leprae: genes, pseudogenes and genetic diversity

PubMed Central

Singh, Pushpendra; Cole, Stewart T

2011-01-01

Leprosy, which has afflicted human populations for millenia, results from infection with Mycobacterium leprae, an unculturable pathogen with an exceptionally long generation time. Considerable insight into the biology and drug resistance of the leprosy bacillus has been obtained from genomics. M. leprae has undergone reductive evolution and pseudogenes now occupy half of its genome. Comparative genomics of four different strains revealed remarkable conservation of the genome (99.995% identity) yet uncovered 215 polymorphic sites, mainly single nucleotide polymorphisms, and a handful of new pseudogenes. Mapping these polymorphisms in a large panel of strains defined 16 single nucleotide polymorphism-subtypes that showed strong geographical associations and helped retrace the evolution of M. leprae. PMID:21162636
Annotate-it: a Swiss-knife approach to annotation, analysis and interpretation of single nucleotide variation in human disease

PubMed Central

2012-01-01

The increasing size and complexity of exome/genome sequencing data requires new tools for clinical geneticists to discover disease-causing variants. Bottlenecks in identifying the causative variation include poor cross-sample querying, constantly changing functional annotation and not considering existing knowledge concerning the phenotype. We describe a methodology that facilitates exploration of patient sequencing data towards identification of causal variants under different genetic hypotheses. Annotate-it facilitates handling, analysis and interpretation of high-throughput single nucleotide variant data. We demonstrate our strategy using three case studies. Annotate-it is freely available and test data are accessible to all users at http://www.annotate-it.org. PMID:23013645
Fixed-Gap Tunnel Junction for Reading DNA Nucleotides

PubMed Central

2015-01-01

Previous measurements of the electronic conductance of DNA nucleotides or amino acids have used tunnel junctions in which the gap is mechanically adjusted, such as scanning tunneling microscopes or mechanically controllable break junctions. Fixed-junction devices have, at best, detected the passage of whole DNA molecules without yielding chemical information. Here, we report on a layered tunnel junction in which the tunnel gap is defined by a dielectric layer, deposited by atomic layer deposition. Reactive ion etching is used to drill a hole through the layers so that the tunnel junction can be exposed to molecules in solution. When the metal electrodes are functionalized with recognition molecules that capture DNA nucleotides via hydrogen bonds, the identities of the individual nucleotides are revealed by characteristic features of the fluctuating tunnel current associated with single-molecule binding events. PMID:25380505
Prediction of siRNA potency using sparse logistic regression.

PubMed

Hu, Wei; Hu, John

2014-06-01

RNA interference (RNAi) can modulate gene expression at post-transcriptional as well as transcriptional levels. Short interfering RNA (siRNA) serves as a trigger for the RNAi gene inhibition mechanism, and therefore is a crucial intermediate step in RNAi. There have been extensive studies to identify the sequence characteristics of potent siRNAs. One such study built a linear model using LASSO (Least Absolute Shrinkage and Selection Operator) to measure the contribution of each siRNA sequence feature. This model is simple and interpretable, but it requires a large number of nonzero weights. We have introduced a novel technique, sparse logistic regression, to build a linear model using single-position specific nucleotide compositions which has the same prediction accuracy of the linear model based on LASSO. The weights in our new model share the same general trend as those in the previous model, but have only 25 nonzero weights out of a total 84 weights, a 54% reduction compared to the previous model. Contrary to the linear model based on LASSO, our model suggests that only a few positions are influential on the efficacy of the siRNA, which are the 5' and 3' ends and the seed region of siRNA sequences. We also employed sparse logistic regression to build a linear model using dual-position specific nucleotide compositions, a task LASSO is not able to accomplish well due to its high dimensional nature. Our results demonstrate the superiority of sparse logistic regression as a technique for both feature selection and regression over LASSO in the context of siRNA design.
Pharmacogenetics Biomarkers and Their Specific Role in Neoadjuvant Chemoradiotherapy Treatments: An Exploratory Study on Rectal Cancer Patients

PubMed Central

Dreussi, Eva; Cecchin, Erika; Polesel, Jerry; Canzonieri, Vincenzo; Agostini, Marco; Boso, Caterina; Belluco, Claudio; Buonadonna, Angela; Lonardi, Sara; Bergamo, Francesca; Gagno, Sara; De Mattia, Elena; Pucciarelli, Salvatore; De Paoli, Antonino; Toffoli, Giuseppe

2016-01-01

Background: Pathological complete response (pCR) to neoadjuvant chemoradiotherapy (CRT) in locally advanced rectal cancer (LARC) is still ascribed to a minority of patients. A pathway based-approach could highlight the predictive role of germline single nucleotide polymorphisms (SNPs). The primary aim of this study was to define new predictive biomarkers considering treatment specificities. Secondary aim was to determine new potential predictive biomarkers independent from radiotherapy (RT) dosage and cotreatment with oxaliplatin. Methods: Thirty germ-line SNPs in twenty-one genes were selected according to a pathway-based approach. Genetic analyses were performed on 280 LARC patients who underwent fluoropyrimidine-based CRT. The potential predictive role of these SNPs in determining pathological tumor response was tested in Group 1 (94 patients undergoing also oxaliplatin), Group 2 (73 patients treated with high RT dosage), Group 3 (113 patients treated with standard RT dosage), and in the pooled population (280 patients). Results: Nine new predictive biomarkers were identified in the three groups. The most promising one was rs3136228-MSH6 (p = 0.004) arising from Group 3. In the pooled population, rs1801133-MTHFR showed only a trend (p = 0.073). Conclusion: This exploratory study highlighted new potential predictive biomarkers of neoadjuvant CRT and underlined the importance to strictly define treatment peculiarities in pharmacogenetic analyses. PMID:27608007
Classification of pseudo pairs between nucleotide bases and amino acids by analysis of nucleotide–protein complexes

PubMed Central

Kondo, Jiro; Westhof, Eric

2011-01-01

Nucleotide bases are recognized by amino acid residues in a variety of DNA/RNA binding and nucleotide binding proteins. In this study, a total of 446 crystal structures of nucleotide–protein complexes are analyzed manually and pseudo pairs together with single and bifurcated hydrogen bonds observed between bases and amino acids are classified and annotated. Only 5 of the 20 usual amino acid residues, Asn, Gln, Asp, Glu and Arg, are able to orient in a coplanar fashion in order to form pseudo pairs with nucleotide bases through two hydrogen bonds. The peptide backbone can also form pseudo pairs with nucleotide bases and presents a strong bias for binding to the adenine base. The Watson–Crick side of the nucleotide bases is the major interaction edge participating in such pseudo pairs. Pseudo pairs between the Watson–Crick edge of guanine and Asp are frequently observed. The Hoogsteen edge of the purine bases is a good discriminatory element in recognition of nucleotide bases by protein side chains through the pseudo pairing: the Hoogsteen edge of adenine is recognized by various amino acids while the Hoogsteen edge of guanine is only recognized by Arg. The sugar edge is rarely recognized by either the side-chain or peptide backbone of amino acid residues. PMID:21737431
Noncoding somatic and inherited single-nucleotide variants converge to promote ESR1 expression in breast cancer.

PubMed

Bailey, Swneke D; Desai, Kinjal; Kron, Ken J; Mazrooei, Parisa; Sinnott-Armstrong, Nicholas A; Treloar, Aislinn E; Dowar, Mark; Thu, Kelsie L; Cescon, David W; Silvester, Jennifer; Yang, S Y Cindy; Wu, Xue; Pezo, Rossanna C; Haibe-Kains, Benjamin; Mak, Tak W; Bedard, Philippe L; Pugh, Trevor J; Sallari, Richard C; Lupien, Mathieu

2016-10-01

Sustained expression of the estrogen receptor-α (ESR1) drives two-thirds of breast cancer and defines the ESR1-positive subtype. ESR1 engages enhancers upon estrogen stimulation to establish an oncogenic expression program. Somatic copy number alterations involving the ESR1 gene occur in approximately 1% of ESR1-positive breast cancers, suggesting that other mechanisms underlie the persistent expression of ESR1. We report significant enrichment of somatic mutations within the set of regulatory elements (SRE) regulating ESR1 in 7% of ESR1-positive breast cancers. These mutations regulate ESR1 expression by modulating transcription factor binding to the DNA. The SRE includes a recurrently mutated enhancer whose activity is also affected by rs9383590, a functional inherited single-nucleotide variant (SNV) that accounts for several breast cancer risk-associated loci. Our work highlights the importance of considering the combinatorial activity of regulatory elements as a single unit to delineate the impact of noncoding genetic alterations on single genes in cancer.
The origin of multiple clones in the parthenogenetic lizard species Darevskia rostombekowi.

PubMed

Ryskov, Alexey P; Osipov, Fedor A; Omelchenko, Andrey V; Semyenova, Seraphima K; Girnyk, Anastasiya E; Korchagin, Vitaly I; Vergun, Andrey A; Murphy, Robert W

2017-01-01

The all-female Caucasian rock lizard Darevskia rostombekowi and other unisexual species of this genus reproduce normally via true parthenogenesis. Typically, diploid parthenogenetic reptiles exhibit some amount of clonal diversity. However, allozyme data from D. rostombekowi have suggested that this species consists of a single clone. Herein, we test this hypothesis by evaluating variation at three variable microsatellite loci for 42 specimens of D. rostombekowi from four populations in Armenia. Analyses based on single nucleotide polymorphisms of each locus reveal five genotypes or presumptive clones in this species. All individuals are heterozygous at the loci. The major clone occurs in 24 individuals and involves three populations. Four rare clones involve one or several individuals from one or two populations. Most variation owes to parent-specific single nucleotide polymorphisms, which occur as heterozygotes. This result fails to reject the hypothesis of a single hybridization founder event that resulted in the initial formation of one major clone. The other clones appear to have originated via post-formation microsatellite mutations of the major clone.
Redefining the genetics of Murine Gammaherpesvirus 68 via transcriptome-based annotation

PubMed Central

Johnson, L. Steven; Willert, Erin K.; Virgin, Herbert W.

2010-01-01

Summary Viral genetic studies often focus on large open reading frames (ORFs) identified during genome annotation (ORF-based annotation). Here we provide a tool and software set for defining gene expression by murine gammaherpesvirus 68 (γHV68) nucleotide-by-nucleotide across the 119,450 basepair (bp) genome. These tools allowed us to determine that viral RNA expression was significantly more complex than predicted from ORF-based annotation, including over 73,000 nucleotides of unexpected transcription within 30 expressed genomic regions (EGRs). Approximately 90% of this RNA expression was antisense to genomic regions containing known large ORFs. We verified the existence of novel transcripts in three EGRs using standard methods to validate the approach and determined which parts of the transcriptome depend on protein or viral DNA synthesis. This redefines the genetic map of γHV68, indicates that herpesviruses contain significantly more genetic complexity than predicted from ORF-based genome annotations, and provides new tools and approaches for viral genetic studies. PMID:20542255
Blind prediction of noncanonical RNA structure at atomic accuracy.

PubMed

Watkins, Andrew M; Geniesse, Caleb; Kladwang, Wipapat; Zakrevsky, Paul; Jaeger, Luc; Das, Rhiju

2018-05-01

Prediction of RNA structure from nucleotide sequence remains an unsolved grand challenge of biochemistry and requires distinct concepts from protein structure prediction. Despite extensive algorithmic development in recent years, modeling of noncanonical base pairs of new RNA structural motifs has not been achieved in blind challenges. We report a stepwise Monte Carlo (SWM) method with a unique add-and-delete move set that enables predictions of noncanonical base pairs of complex RNA structures. A benchmark of 82 diverse motifs establishes the method's general ability to recover noncanonical pairs ab initio, including multistrand motifs that have been refractory to prior approaches. In a blind challenge, SWM models predicted nucleotide-resolution chemical mapping and compensatory mutagenesis experiments for three in vitro selected tetraloop/receptors with previously unsolved structures (C7.2, C7.10, and R1). As a final test, SWM blindly and correctly predicted all noncanonical pairs of a Zika virus double pseudoknot during a recent community-wide RNA-Puzzle. Stepwise structure formation, as encoded in the SWM method, enables modeling of noncanonical RNA structure in a variety of previously intractable problems.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.